Environ. Res. Lett. 2 (October-December 2007) 045032
doi:10.1088/1748-9326/2/4/045032

Mapping Russian forest biomass with data from satellites and forest inventories

R A Houghton¹, D Butman², A G Bunn³, O N Krankina⁴, P Schlesinger¹ and T A Stone¹

¹ The Woods Hole Research Center, 149 Woods Hole Road, Falmouth, MA 02540, USA
² Yale School of Forestry and Environmental Science, Yale University, New Haven, CT 06511, USA
³ Department of Environmental Sciences, Huxley College of the Environment, Western Washington University, 516 High Street, Bellingham, WA 98225-9181, USA
⁴ Department of Forest Science, Oregon State University, 202 Richardson Hall, Corvallis, OR 97331-5752, USA

E-mail: rhoughton@whrc.org

Received 10 October 2007
Accepted 22 November 2007
Published 21 December 2007

Abstract. The forests of Russia cover a larger area and hold more carbon than the forests of any other nation and thus have the potential for a major role in global warming. Despite a systematic inventory of these forests, however, estimates of total carbon stocks vary, and spatial variations in the stocks within large aggregated units of land are unknown, thus hampering measurement of sources and sinks of carbon. We mapped the distribution of living forest biomass for the year 2000 by developing a relationship between ground measurements of wood volume at 12 sites throughout the Russian Federation and data from the MODIS satellite bidirectional reflectance distribution function (BRDF) product (MOD43B4). Based on the results of regression-tree analyses, we used the MOD43B4 product to assign biomass values to individual 500 m × 500 m cells in areas identified as forest by two satellite-based maps of land cover. According to the analysis, the total living biomass varied between 46 and 67 Pg, largely because of different estimates of forest area. Although optical data are limited in distinguishing differences in biomass in closed canopy forests, the estimates of total living biomass obtained here varied more in response to different definitions of forest than to saturation of the optical sensing of biomass.

Keywords: biomass, carbon, forests, forest inventory, MODIS, Russia

1. Introduction

Although systematic forest inventories have been used to define the amount of carbon held in northern mid-latitude forests (Goodale et al 2002), the spatial distribution of carbon stocks is not well characterized. The spatial distribution is, nevertheless, important for determining the emissions of carbon that result from disturbance or deforestation and the uptake of carbon that results from recovery and growth (Houghton 2005). Estimates of carbon sources and sinks based on forest inventories are variable, especially for Russia and the former Soviet Union, where the net carbon balance ranges between a source of 0.5 Pg C/yr and a sink of 1.02 Pg C/yr (review of 15 studies by Shvidenko et al (1996), Goodale et al (2002)). Not only does Russia represent the largest political unit in the northern hemisphere and contain the largest stocks of terrestrial carbon (Apps et al 1993, Goodale et al 2002), it is also the country where estimates of net carbon flux are most divergent.

The wide range of flux estimates based on Russian forest inventory data is ironic because most of the estimates are based on essentially the same data. The federal forest service of Russia collects detailed stand-level information on stemwood volumes over about 30 million ha (or about 4% of the forest area) annually (Kukuev et al 1997). Changes in the forests (areas harvested, burned, planted) are recorded by forest management enterprises (leskhozes) and aggregated to the provincial and national levels. The variability among published estimates of biomass reflects not only updating and aggregation but also different methods for converting wood volumes to biomass, and biases introduced through economic and political incentives to exaggerate accomplishments (e.g., land areas planted) and to under-report problems (e.g., forest area burned) (Alexeyev et al 2004).

If a correlation could be demonstrated between contemporary satellite data and forest biomass, the relationship might be used to distribute forest biomass across all of Russia in a consistent, transparent manner. In this paper we report an attempt to use satellite data to map the biomass of Russian forests for the year 2000. We used MODIS satellite data (and MODIS products), calibrated against inventory data from individual stands sampled between 1998 and 2000, to estimate forest biomass across all of Russia.

2. Methods

2.1. Selection of sampling sites

The territory of Russia is commonly divided into four geopolitical regions: European Russia (including the Ural Mountains), Western Siberia, Eastern Siberia and the Far East (figure 1). In contrast to these longitudinally arranged geopolitical regions, the major forest biomes of the Russian Federation generally extend along east–west parallels (figure 1): northern taiga (1), middle taiga (2), southern taiga and mixed forest (3), temperate forest (4), and forest steppe (5). The intersection of the four geopolitical regions with the five biomes defines 15 major geographical units.

Figure 1. The locations of 12 field sites among four geographic regions (east–west) and five vegetation zones (north–south). Although the vegetation zones are labeled as forests, they are not necessarily forested, especially in the northern taiga and the forest steppe zones (both of which have large treeless areas). The characteristics and names of sites are given in table 1.

Table 1. Locations and characteristics of the sites used in this study.
Region | Site description | Regression tree r² | Number of polygons | Average size of polygon (ha) | Average biomass (Mg ha⁻¹) | Std dev of biomass (Mg ha⁻¹)
7. Krasnoyarsk Yart | Larch dominant forest, avg. age 175 yrs | 0.67 | 1130 | 47.15 | 111.80 | 53.72
9. Northern Khabarovsk | Larch dominant with birch mixed forest, along floodplain, avg. age 65 yrs | 0.37 | 1561 | 28.58 | 64.38 | 40.22
2. Karelia | Pine/spruce dominant forest, avg. age 67 yrs | 0.35 | 3759 | 16.16 | 102.67 | 49.50
8. Krasnoyarsk Usol | Pine/birch mixed forest, avg. age 100 yrs | 0.31 | 1895 | 27.47 | 99.61 | 48.53
12. Kamchatka | Mixed deciduous forest of birch, poplar and alder, avg. age 122 yrs, mountain slope | 0.22 | 1596 | 28.36 | 128.11 | 66.98
10. Southern Khabarovsk | Mixed forest of pine, larch, elm and basswood, avg. age 135 yrs, >550 m elevation | 0.19 | 3158 | 29.48 | 100.09 | 29.07
11. Magadan | Coastal and mountain larch dominant forest with pine mixed, avg. age 105 yrs, small stature | 0.19 | 1757 | 54.89 | 41.71 | 34.98
1. Murmansk | Birch dominant forest with pine/spruce mix, northernmost location, small stature, avg. age 120 yrs | 0.17 | 2832 | 33.70 | 38.42 | 15.59
5. Udmurtia | Pine/spruce dominant with birch mixed forest, fragments surrounded by large scale agriculture, avg. age 43 yrs | 0.07 | 2104 | 6.31 | 99.12 | 58.90
6. Novosibirsk | Pine dominant forest fragment surrounded by agriculture, avg. age 85 yrs | 0.07 | 5351 | 3.40 | 48.28 | 60.67
3. St. Petersburg | Mixed forest with spruce, pine, birch, and aspen, avg. age 73 yrs | 0.07 | 8194 | 4.06 | 133.18 | 57.09
4. Kursk | Oak dominant forests with poplar/alder mix, isolated forest fragments surrounded by agriculture, avg. age 50 yrs | 0.01 | 2019 | 4.69 | 121.93 | 61.48

2.2. Forest inventory data

As part of NASA's Land Cover/Land Use Change (LCLUC) program, we developed collaborative relationships with a number of Russian forest scientists. These scientists were crucial for obtaining local forest inventory data within 12 of the 15 geographical units (figure 1, table 1). The specific locations of the sites were determined by the availability of forest inventory data. Sites had to be large enough to accommodate at least 1000 forest stands (polygons) of 3 ha or greater.

For each site, we obtained a digital map of forest polygons (stands) and the inventory data characterizing each polygon. Data included, for each polygon: area, forest type, dominant species, species composition, age, height, stocking density, volume of live stem wood (growing stock), and volume of snags and logs. Forest types or species groups included pine, spruce, mixed conifer, deciduous, and mixed (coniferous and deciduous) forest. Altogether, for the 12 sites, we obtained information for 42 182 polygons, covering a total area of 750 907 ha. Polygons ranged in size from 0.7 to 1503 ha, with an average of 18 ha/polygon.

Forest growing stock (m³ of wood per hectare) was converted to biomass (Mg ha⁻¹) for each polygon using the allometric equations of Alexeyev and Birdsey (1998). The coefficients for the equations varied with forest age, species group, and region. They defined total forest biomass, including living above- and below-ground tree biomass and under-story vegetation. Carbon was assumed to be 0.5 × biomass.
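
The conversion chain described here can be sketched in a few lines of Python. This is a minimal illustration, not the Alexeyev and Birdsey (1998) equations themselves: the single factor BIOMASS_PER_M3 is a hypothetical placeholder (the published coefficients vary with forest age, species group, and region), chosen only because it matches the approximate equivalence of 80 m3 per ha and 64 Mg per ha quoted in section 4.2. Only the carbon fraction of 0.5 comes from the text.

```python
# Hypothetical conversion factor (Mg of total biomass per m3 of growing stock).
# The actual coefficients of Alexeyev and Birdsey (1998) depend on forest age,
# species group, and region; one value is used here purely for illustration.
BIOMASS_PER_M3 = 0.8

CARBON_FRACTION = 0.5  # stated in the text: carbon = 0.5 x biomass


def growing_stock_to_biomass(volume_m3_per_ha, factor=BIOMASS_PER_M3):
    """Convert growing stock (m3 per ha) to total living biomass (Mg per ha)."""
    if volume_m3_per_ha < 0:
        raise ValueError("growing stock cannot be negative")
    return volume_m3_per_ha * factor


def biomass_to_carbon(biomass_mg_per_ha):
    """Convert biomass (Mg per ha) to carbon (Mg C per ha)."""
    return biomass_mg_per_ha * CARBON_FRACTION


biomass = growing_stock_to_biomass(100.0)
carbon = biomass_to_carbon(biomass)
print(f"biomass: {biomass:.1f} Mg/ha, carbon: {carbon:.1f} Mg C/ha")
```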

2.3. Evaluating the relationship between forest biomass and MODIS products

We used the MODIS bidirectional reflectance distribution function (BRDF) product (MOD43B4) as the independent variable for predicting biomass. The MOD43B4 product is corrected for the off-nadir characteristics of scanning sensors and for atmospheric haze and aerosols. The product is a 16-day composite of MODIS reflectance at a spatial resolution of 1 km. We used the composite for mid-July 2000, and resampled to 500 m resolution using a nearest-neighbor algorithm.
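
For readers unfamiliar with nearest-neighbor resampling, the sketch below shows what it amounts to when a 1 km grid is refined to 500 m. It assumes the two grids are aligned, so that each 1 km cell simply becomes a 2 × 2 block of 500 m cells carrying the same reflectance value; the function name and the sample values are illustrative and are not taken from the MODIS processing chain.

```python
def resample_nearest(grid, factor=2):
    """Nearest-neighbor upsampling of an aligned grid.

    Each source cell is replicated into a factor x factor block, so no new
    reflectance values are created; only the cell size changes.
    """
    out = []
    for row in grid:
        expanded = [value for value in row for _ in range(factor)]
        for _ in range(factor):
            out.append(list(expanded))
    return out


# Hypothetical 2 x 2 patch of 1 km reflectance values.
coarse = [[0.10, 0.20],
          [0.30, 0.40]]
fine = resample_nearest(coarse)
for row in fine:
    print(row)
```

Because the values are only replicated, the 500 m grid carries no more spectral information than the 1 km product.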

To geo-register the satellite and ground data at each site, we used Landsat ETM+ data, along with spatially explicit GIS layers, including the hydrological network, stand polygon boundaries, and geographic identification. We then superimposed the MODIS product over the geo-referenced map of forest polygons. Because the stand polygons varied in size, some MODIS cells included a single polygon and some included multiple small polygons. When a cell included more than one forest polygon, we calculated an area-weighted mean biomass for each 500 m × 500 m MODIS cell (figure 2). Cells that contained polygons of non-forest were omitted from the training procedure. The calculation of weighted mean biomass was an aggregation procedure that often prevented us from assigning a particular forest type to a MODIS cell. Thus, we did not distinguish among species groups or forest types but, instead, lumped all species groups together as forests. The aggregation procedure also reduced the effective number of forest 'polygons' (now aggregated to 500 m × 500 m cells), at some sites substantially.
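
The area-weighted mean for a single MODIS cell can be sketched as follows. The sketch assumes that the weights are the areas of overlap between each forest polygon and the cell, which the text does not spell out; the function name and the sample overlaps are hypothetical.

```python
def area_weighted_biomass(overlaps):
    """Area-weighted mean biomass (Mg per ha) for one MODIS cell.

    overlaps: list of (overlap_area_ha, biomass_mg_per_ha) pairs, one per
    forest polygon intersecting the cell.
    """
    total_area = sum(area for area, _ in overlaps)
    if total_area <= 0:
        raise ValueError("cell has no forest polygon area")
    return sum(area * biomass for area, biomass in overlaps) / total_area


# Hypothetical 25 ha cell covered by three forest polygons.
cell = [(10.0, 120.0), (5.0, 60.0), (10.0, 90.0)]
print(area_weighted_biomass(cell))  # (1200 + 300 + 900) / 25 = 96.0
```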

Figure 2. MODIS 500 m × 500 m cells superimposed on polygons from field sites in Krasnoyarsk–Yartsevsky (a) and Novosibirsk (b). The different shades indicate variations in biomass.

Because satellite reflectance data (predictive variables) are highly intercorrelated, and the response (biomass) is potentially non-linear, standard multiple regression techniques were unsuitable. Instead, we used bootstrapped regression trees to develop associations between mean reflectance and biomass. Specifically, we used Breiman and Cutler's random forests (RF) ensemble prediction method (Breiman 2001, Liaw and Wiener 2002). Using the RF algorithm, we built 500 regression trees using different random samples of the data. Model error using the RF algorithm was quantified with the one-third of the data randomly excluded from the construction of each of the trees. The analysis was performed with the randomForest package (Liaw and Wiener 2002) in the R programming environment (R Development Core Team 2006).
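
The out-of-bag idea behind this error estimate can be illustrated with a heavily simplified sketch. The code below is not the randomForest algorithm used in the analysis: it uses a single predictor, one-split regression trees (stumps), and no random selection of predictors at each split. It only shows how bootstrap samples leave roughly one-third of the observations out of each tree and how those held-out observations yield an error estimate. All names and the synthetic data are hypothetical.

```python
import random


def fit_stump(xs, ys):
    """Fit a one-split regression tree on one predictor.

    Returns (threshold, left_mean, right_mean).
    """
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    best = None  # (sse, threshold, left_mean, right_mean)
    for k in range(1, len(order)):
        lo, hi = xs[order[k - 1]], xs[order[k]]
        if lo == hi:
            continue  # no split possible between identical values
        left = [ys[i] for i in order[:k]]
        right = [ys[i] for i in order[k:]]
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        sse = sum((y - lm) ** 2 for y in left) + sum((y - rm) ** 2 for y in right)
        if best is None or sse < best[0]:
            best = (sse, (lo + hi) / 2.0, lm, rm)
    if best is None:  # all predictor values identical: predict the mean
        mean = sum(ys) / len(ys)
        return (float("inf"), mean, mean)
    return best[1:]


def predict_stump(stump, x):
    threshold, left_mean, right_mean = stump
    return left_mean if x <= threshold else right_mean


def bagged_oob(xs, ys, n_trees=500, seed=0):
    """Bootstrap-aggregate stumps and return (stumps, out-of-bag r2)."""
    rng = random.Random(seed)
    n = len(xs)
    oob_sum, oob_count, stumps = [0.0] * n, [0] * n, []
    for _ in range(n_trees):
        sample = [rng.randrange(n) for _ in range(n)]  # draw with replacement
        in_bag = set(sample)
        stump = fit_stump([xs[i] for i in sample], [ys[i] for i in sample])
        stumps.append(stump)
        for i in range(n):
            if i not in in_bag:  # observation unseen by this tree
                oob_sum[i] += predict_stump(stump, xs[i])
                oob_count[i] += 1
    scored = [i for i in range(n) if oob_count[i] > 0]
    mean_y = sum(ys[i] for i in scored) / len(scored)
    sse = sum((ys[i] - oob_sum[i] / oob_count[i]) ** 2 for i in scored)
    sst = sum((ys[i] - mean_y) ** 2 for i in scored)
    return stumps, 1.0 - sse / sst


# Synthetic data: low reflectance -> high biomass, plus noise.
data_rng = random.Random(1)
xs = [data_rng.uniform(0.0, 0.5) for _ in range(60)]
ys = [(120.0 if x < 0.25 else 40.0) + data_rng.gauss(0.0, 10.0) for x in xs]
stumps, oob_r2 = bagged_oob(xs, ys, n_trees=200)
print(f"trees: {len(stumps)}, out-of-bag variance explained: {oob_r2:.2f}")
```

Because every tree is scored only on observations it never saw, no separate validation set is needed, which is the property the analysis relies on.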

We used two different land-cover maps, the GLC2000 land-cover product (Bartalev et al 2003) and the MOD12Q1 land-cover product (Schaaf et al 2002), to identify areas of forest and non-forest throughout Russia. The final step was to assign a biomass value to each 500 m × 500 m cell of forest using the results from the ensemble of 500 regression trees.
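
The sensitivity of a biomass total to the forest mask can be illustrated with a small sketch. Each 500 m × 500 m cell covers 25 ha, so a cell's biomass is its predicted density times 25. The land-cover labels and biomass values below are hypothetical and do not reproduce the GLC2000 or MOD12Q1 legends; the two masks only mimic a narrow and a broad definition of forest.

```python
CELL_AREA_HA = 25.0  # 500 m x 500 m = 250 000 m2 = 25 ha

# Hypothetical cells: (land-cover label, predicted biomass in Mg per ha).
cells = [
    ("forest", 110.0),
    ("forest", 95.0),
    ("woodland", 30.0),
    ("shrubland", 12.0),
    ("cropland", 0.0),
]


def total_biomass_mg(cells, forest_classes):
    """Sum biomass (Mg) over the cells whose label counts as forest."""
    return sum(density * CELL_AREA_HA
               for label, density in cells
               if label in forest_classes)


narrow = total_biomass_mg(cells, {"forest"})
broad = total_biomass_mg(cells, {"forest", "woodland", "shrubland"})
print(narrow, broad)  # 5125.0 6175.0
```

The same per-cell predictions give different totals purely because the mask changes, which is the effect discussed later for the two land-cover products.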

3. Results

For individual sites, the fraction of the variance explained by the regression trees varied between 0.01 and 0.67 (table 1 and figure 3). When data from all of the sites were lumped in the regression trees, the predictive capability of the model (i.e., variance explained) was 0.61. In general, the models underestimated polygons with high biomass and overestimated polygons with low biomass.
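
As a reminder of what "variance explained" measures, the sketch below computes one common definition, 1 − SSE/SST, on a few made-up observed and predicted values. The exact statistic reported by the software used in the analysis may be computed slightly differently; the numbers here are hypothetical and were chosen so that low values are overestimated and high values underestimated, as in the pattern described above.

```python
def variance_explained(observed, predicted):
    """Fraction of the variance in observed values explained by predictions."""
    if len(observed) != len(predicted):
        raise ValueError("observed and predicted must have the same length")
    mean_obs = sum(observed) / len(observed)
    sse = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    sst = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - sse / sst


# Hypothetical biomass values (Mg per ha).
observed = [40.0, 80.0, 120.0, 160.0]
predicted = [60.0, 85.0, 115.0, 140.0]
r2 = variance_explained(observed, predicted)
print(f"variance explained: {r2:.3f}, unexplained: {1.0 - r2:.3f}")
```

With the pooled value of 0.61 reported above, the unexplained fraction is 1 − 0.61 = 0.39.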

Figure 3. Observed and predicted biomass for 500 m × 500 m cells of forest. (a) Predictions based on regression-tree models incorporating data from all 12 sites. (b) Three examples of predictions based on regression-tree models with data from individual sites.

The highest values of biomass appeared in the middle and southern taiga. The lowest values were in the northern taiga and the forest steppe (savanna) at the southern limit of forest distribution (figure 4). The frequency distribution of biomass classes in the forests of Russia was skewed toward forests with lower biomass, especially with the GLC2000 land-cover product (figure 4, histograms).

Figure 4. (a) Map of Russian forest biomass as predicted by the MODIS land-cover product (MOD12Q1). (b) Map of Russian forest biomass as predicted by the GLC2000 land-cover product (Bartalev et al 2003).

4. Discussion

The number and distribution of the 12 sites seem to have been adequate for capturing the spectral signatures of forests throughout Russia, as calculated by Euclidean distance between the MODIS spectral bands. Despite the good spectral coverage given by the sites, however, the models of biomass developed for individual sites never explained more than 45% of the variation in biomass at other sites and often explained less than 10% of the variation. When the combined model developed with data from all 12 sites was tested at individual sites, its explanatory power varied between 0.04 and 0.71.
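
The text does not describe how the Euclidean distance was applied, so the following is only one plausible reading: each cell's reflectance vector is compared with a mean signature per training site, and a small minimum distance suggests the cell is spectrally represented by some site. The site names, band values, and function names are all hypothetical.

```python
import math


def spectral_distance(a, b):
    """Euclidean distance between two reflectance vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same number of bands")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest_site(pixel, site_signatures):
    """Return the name of the training site closest to the pixel."""
    return min(site_signatures,
               key=lambda name: spectral_distance(pixel, site_signatures[name]))


# Hypothetical three-band mean signatures for two training sites.
sites = {
    "site_a": [0.05, 0.30, 0.15],
    "site_b": [0.10, 0.20, 0.25],
}
pixel = [0.06, 0.28, 0.16]
print(nearest_site(pixel, sites))  # site_a
```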

4.1. Why were the satellite data so poor at explaining biomass at some sites?

The error of our estimates of biomass was ~40%. In other words, 39% of the variability in predicted biomass was unexplained by the regression model. It is not surprising that satellite optical data were poor at distinguishing variations in biomass, especially in closed-canopy forests. We found that more of the variance in biomass was explained when we considered only young forests (<20 years) (presumably with open canopies), confirming that at least part of the difficulty in predicting biomass resulted from differences in biomass under closed canopies.

A second limitation to the prediction of above-ground biomass with MODIS data seems to have been the extent to which the forest polygon data matched the spatial resolution of MODIS cells. Table 1 ranks the test sites by the ability of the regression-tree approach to predict biomass at that site (column 3, r2). The goodness-of-fit was roughly correlated with the average size of the forest polygons, although Karelia, Magadan, and Murmansk were exceptions.

It was not the number of polygons that limited the method's success. We used subsets of the polygons in the Krasnoyarsk Yart site to test whether the predictive capability of the regression trees was sensitive to the number as well as the size of training polygons. The relationship between observed and predicted biomass was robust with as little as 3% of the data.

The errors of the forest inventory data are uncertain. One crude estimate of error may be obtained from two recent estimates of Russian forest biomass. Alexeyev and Birdsey (1998) and Shvidenko and Nilsson (2003) used the same inventory data and the same forest area, yet reported total biomass of 56 Pg and 68.8 Pg, respectively (table 2). The difference (20% of the mean) is probably a conservative estimate of 'inventory' error because it pertains only to that part of the error related to allometry (calculation of biomass from wood volumes).
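
The figure of 20% follows directly from the two totals: the difference is taken relative to their mean. A short check, with a hypothetical helper name:

```python
def relative_difference(a, b):
    """Absolute difference between two values as a fraction of their mean."""
    return abs(a - b) / ((a + b) / 2.0)


# Totals (Pg) from Alexeyev and Birdsey (1998) and Shvidenko and Nilsson (2003).
rel = relative_difference(56.0, 68.8)
print(f"{rel:.1%}")  # 12.8 / 62.4, i.e. 20.5%
```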

Table 2. Estimates of biomass for the forests of Russia.
Forest area (10⁶ ha) | Total biomass (Pg) | Average biomass (Mg ha⁻¹) | Reference
884 | 148.0 | 167.4 | Dixon et al (1994)
771.1 | 94.2 | 122.2 | Krankina and Dixon (1994)
771.1 | 70.2 | 91.0 | Isaev et al (1995)
763.5 | 84.2 | 110.2 | Krankina et al (1996)
623.2 | 139 | 223.0 | Turner et al (1998)
771.1 | 56 | 72.6 | Alexeyev and Birdsey (1998)
774.2 | 68.8 | 88.8 | Shvidenko and Nilsson (2003)
523.6 | 46.2 | 88.2 | This study: MODIS land-cover product
826.6 | 66.6 | 80.6 | This study: GLC2000
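
The average-biomass column of table 2 is the total divided by the forest area, after converting units (1 Pg = 10^9 Mg; areas are given in millions of hectares). The sketch below reproduces the two values for this study; the function name is hypothetical.

```python
MG_PER_PG = 1e9  # 1 Pg = 10^15 g = 10^9 Mg


def average_biomass_mg_per_ha(total_pg, area_million_ha):
    """Average biomass density (Mg per ha) from a total (Pg) and an area (10^6 ha)."""
    return total_pg * MG_PER_PG / (area_million_ha * 1e6)


modis = average_biomass_mg_per_ha(46.2, 523.6)
glc2000 = average_biomass_mg_per_ha(66.6, 826.6)
print(round(modis, 1), round(glc2000, 1))  # 88.2 80.6
```

The smaller forest area of the MODIS land-cover product yields the lower total but the higher average density.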

Other studies have used optical satellite data, calibrated with data from forest inventories, to determine biomass over large areas in temperate zone and boreal regions (Myneni et al 2001, Baccini et al 2004, Zhang and Kondragunta 2006, Muukkonen and Heiskanen 2007) and in the tropics (Foody et al 2001, Saatchi et al 2007). These studies suggest that our predictive model might have been improved (1) had we used finer spatial resolution data (e.g., ASTER data) for integrating ground measurement with the coarser resolution MODIS data (Muukkonen and Heiskanen 2007); (2) had we used averaged MODIS data (several dates) (Muukkonen and Heiskanen 2007); and (3) had we included climatic, edaphic, and topographic data as additional independent variables (Baccini et al 2004). Finally, although optical data were important in discriminating biomass classes in tropical Amazonian forests, the overall capacity of the predictive model developed by Saatchi et al (2007) was improved when optical data were used in combination with radar data.

4.2. How do these results compare with other estimates?

We are unaware of other maps of forest biomass or growing stock for the forests of Russia. The first phase of the European SIBERIA project recently used synthetic aperture radar (SAR) data to estimate forest biomass over a 900 000 km² area in Central Siberia, but the approach failed to distinguish among biomass classes greater than 80 m³ ha⁻¹ (~64 Mg biomass ha⁻¹) (Gaveau et al 2003, Wagner et al 2003). According to our analysis, ~60% of the forest area in Russia had biomass values greater than 64 Mg ha⁻¹.

Despite the lack of biomass datasets at high spatial resolution, several studies have estimated the total carbon stocks in forests for broad administrative units. Alexeyev and Birdsey (1998), for example, reported the biomass of forests for Oblasts, Kray, or Republics (71 of them across Russia). When we summed the biomass values for all of the forested cells within these same units, our results gave remarkably similar totals (figure 5).

Figure 5. Observed (inventory) and predicted (regression-tree approach) total biomass for 71 administrative districts of Russia. The line is the 1:1 slope.

Our estimates of total forest biomass for all of Russia (46 and 67 Pg biomass for the MODIS and GLC2000 maps, respectively) bracket the estimate by Alexeyev and Birdsey (1998) (56 Pg), and our higher estimate is similar to that of Shvidenko and Nilsson (2003) (69 Pg) (table 2). These comparisons are not very satisfying, however, because total biomass is sensitive to forest area. Average forest biomass, in contrast, allows a better comparison, and our higher estimate (88.2 Mg ha⁻¹) is similar to the estimate by Shvidenko and Nilsson (2003) (88.8 Mg ha⁻¹).

It is important to recognize that the two estimates of total forest biomass we report result from hugely different estimates of forest area. The total area of Russian forests was 523.6 × 10⁶ ha according to the MODIS land-cover product (MOD12Q1) (Schaaf et al 2002) and 826.6 × 10⁶ ha according to the GLC2000 product (Bartalev et al 2003). However, neither estimate included the areas of forest contained in the mixed category of 'forest and agriculture', so both may be underestimates. The major difference between the two satellite-derived estimates was their treatment of woodlands and shrublands. Such lands are included as forests in the GLC2000 product and excluded in the MOD12Q1 product (figure 4). The difference is particularly conspicuous in the 'northern taiga' and includes our site in northern Murmansk. The Russian forest inventories recognize an intermediate area of forest in their classification (771 × 10⁶ ha) (table 2).

5. Conclusion

The attempt to use MODIS data to distribute forest biomass across Russia was only partially successful. Positive aspects included the observations (1) that MODIS data and forest biomass were generally well correlated at sites where the forest polygons were larger than MODIS cells (500 m × 500 m); (2) that the spectral signatures from the 12 training sites selected in this study seemed to represent forests throughout the country; (3) that the map of forest biomass produced from this work appeared reasonable in terms of the distribution of biomass classes; and (4) that the total forest biomass for individual political units compared well with estimates based on data from the forest inventories of Russia.

The less successful aspects of the work included the observations (1) that MODIS data and forest biomass were not well correlated at sites where forest polygons were smaller than MODIS cells and even in some sites with large forest polygons; and (2) that predictive models of forest biomass developed at individual sites did not apply outside the borders of the training site. It is not news that optical data are insensitive to biomass under closed canopies. Nevertheless, MODIS data did capture gross differences in biomass across broad environmental gradients and across obvious differences in forest structure (for example, non-forests, open forests, young forests, and old forests). The maps of forest biomass obtained through this analysis will help constrain estimates of carbon emissions associated with changes in land use, fires, and other disturbances (Houghton 2005).

The difference between the maps of forest cover confirms the importance of determining biomass independently of land cover. Biomass is a continuous variable, with a wide range of values within each ecoregion or ecosystem. It is not well characterized by discrete classes of land cover or forest type. The capability of determining forest biomass from space would eliminate much of the arbitrariness of distinguishing forest from woodland, and of defining forests and deforestation for carbon accounting. Sources and sinks of carbon are the result of changes in biomass (carbon stocks), whatever the cause. They are better estimated by measuring changes in carbon stocks directly than by observing transitions across an arbitrary threshold of forest–non-forest.

Acknowledgments

We thank Rudolf Treyfeld and Evgeny Povarov of the North-Western State Forest Inventory Enterprise (SFIE) (St Petersburg), Vladimir Manovich of the West-Siberian SFIE (Novosibirsk), Victor Skudine of the East-Siberian SFIE (Krasnoyarsk), and Vladimir Trush of the Far Eastern SFIE (Khabarovsk) for their participation in this study. We thank Warren Cohen, Tom Maiersperger, and Doug Oetter for generous advice during the early stages of this work, and Scott Goetz for helpful comments on earlier versions of the manuscript. Research was supported by the Land Cover/Land Use Change Program at NASA (grant number NAG5-11286).

References

Alexeyev V A and Birdsey R A (ed) 1998 Carbon storage in forests and peatlands of Russia General Technical Report NE-244, USDA, Forest Service, Northeast Research Station, Radnor, PA 
Alexeyev V A, Markov M V and Birdsey R A 2004 Statistical Data on Forest Fund of Russia and Changing of Forest Productivity in the Second Half of the XX Century (St Petersburg: St Petersburg Research Institute of Forestry) 
Apps M J, Kurz W A, Luxmoore R J, Nilsson L O, Sedjo R A, Schmidt R, Simpson S G and Vinson T S 1993 Boreal forests and tundra Water Air Soil Pollut. 70 39–53 
Baccini A, Friedl M A, Woodcock C E and Warbington R 2004 Forest biomass estimation over regional scales using multisource data Geophys. Res. Lett. 31 L10501 
Bartalev S, Belward A S, Erchov D and Isaev A S 2003 A new SPOT4-VEGETATION derived land cover map of Northern Eurasia Int. J. Remote Sens. 24 1977–82 
Breiman L 2001 Random forests Machine Learning 45 5–32 
Dixon R K, Brown S, Houghton R A, Solomon A M, Trexler M C and Wisniewski J 1994 Carbon pools and flux of global forest ecosystems Science 263 185–90 
Foody G M, Cutler M E, McMorrow J, Pelz D, Tangki H, Boyd D S and Douglas I 2001 Mapping the biomass of Bornean tropical rain forest from remotely sensed data Glob. Ecol. Biogeogr. 10 379–87 
Gaveau D L A, Balzter H and Plummer S 2003 Forest woody biomass classification with satellite-based radar coherence over 900 000 km² in Central Siberia Forest Ecol. Manage. 174 65–75
Goodale C L et al 2002 Forest carbon sinks in the northern hemisphere Ecol. Appl. 12 891–99 
Houghton R A 2005 Aboveground forest biomass and the global carbon balance Glob. Change Biol. 11 945–58 
Isaev A, Korovin G, Zamolodchikov D, Utkin A and Pryashnikov A 1995 Carbon stock and deposition in phytomass of the Russian forests Water Air Soil Pollut. 82 247–56 
Krankina O N and Dixon R K 1994 Forest management options to conserve and sequester terrestrial carbon in the Russian Federation World Resource Rev. 6 88–101 
Krankina O N, Harmon M E and Winjum J K 1996 Carbon storage and sequestration in the Russian forest sector Ambio 25 284–8 
Kukuev Y A, Krankina O N and Harmon M E 1997 The forest inventory system in Russia J. Forestry 95 15–20 
Liaw A and Wiener M 2002 Classification and regression by randomForest R-News 2 18–22 
Muukkonen P and Heiskanen J 2007 Biomass estimation over a large area based on standwise forest inventory data and ASTER and MODIS satellite data: a possibility to verify carbon inventories Remote Sens. Environ. 107 617–24 
Myneni R, Dong J, Tucker C, Kaufmann R, Kauppi P, Liski J, Zhou L, Alexeyev V and Hughes M 2001 A large carbon sink in the woody biomass of northern forest Proc. Natl Acad. Sci. USA 98 14784–9 
R Development Core Team 2006 R: A Language and Environment for Statistical Computing ISBN 3-900051-07-0 http://www.R-project.org 
Saatchi S S, Houghton R A, dos Santos Alvala R C, Soares J V and Yu Y 2007 Distribution of aboveground live biomass in the Amazon basin Glob. Change Biol. 13 816–37 
Schaaf C B et al 2002 First operational BRDF, albedo and nadir reflectance products from MODIS Remote Sens. Environ. 83 135–48 
Shvidenko A and Nilsson S 2003 A synthesis of the impact of Russian forests on the global carbon budget for 1961–1998 Tellus B 55 391–415 
Shvidenko A Z, Nilsson S, Rozhkov V A and Strakhov V V 1996 Carbon budget of the Russian boreal forests: a systems analysis approach to uncertainty Forest Ecosystems, Forest Management and the Global Carbon Cycle (NATO ASI Series vol I 40) (Berlin: Springer) pp 145–62 
Turner D P, Winjum J K, Kolchugina T P, Vinson T S, Schroeder P E, Phillips D L and Cairns M A 1998 Estimating the terrestrial carbon pools of the Former Soviet Union, conterminous US, and Brazil Clim. Res. 9 183–96 
Wagner W et al 2003 Large-scale mapping of boreal forest in SIBERIA using ERS tandem coherence and JERS backscatter data Remote Sens. Environ. 85 125–44 
Zhang X and Kondragunta S 2006 Estimating forest biomass in the USA using generalized allometric models and MODIS land products Geophys. Res. Lett. 33 L09402 



