This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy. Close this notification
NOTICE: Ukraine: Read IOP Publishing's statement.
Letter The following article is Open access

Agricultural yield geographies in the United States


Published 6 May 2021 © 2021 The Author(s). Published by IOP Publishing Ltd
, , Citation Emily K Burchfield and Katherine S Nelson 2021 Environ. Res. Lett. 16 054051

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.



We examine the geographies of agricultural yields in the United States, home to some of the most productive agricultural systems on the planet. We model and map yield divergence from biophysical expectations and regional norms for five major crops—corn, soy, wheat, alfalfa, and hay—and assess how this divergence interacts with farm-level resources, farm(er) characteristics, and landscape context. Our results highlight the ways in which human activity has reinforced and intensified the yield geographies defined by sun, soil, and water alone. Yield gains brought by human activity are strongly associated with increased expenditure on inputs to production and receipts from federal programs, but not with net revenue gains for farmers. These yield gains vary across operator race, gender, farm size, and major US region. We also find that beyond a threshold, increased input expenditure is associated with marginally decreasing yields, raising important questions about the interactions between yields and farmer livelihoods. We conclude by discussing the importance of broadening the production-centric paradigm that has dominated agricultural innovation over the last century to include the well-being of the farmers and ecological systems on which agricultural production ultimately depends.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 license. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Human innovation has pushed agricultural production well beyond the bounds of what sun, soil, and water alone can support. Over the last 70 years, the US yields of soy and winter wheat have more than tripled, while corn yields have increased over five-fold [1]. These yield gains have supported a more than tripling of global population and brought tremendous changes to agricultural systems worldwide. Today, the US is home to some of the most productive, consolidated, and specialized agricultural systems on the planet. These systems have been strongly shaped by an economic paradigm that prioritizes production and efficiency, technological innovations that target simplified large-scale systems [2, 3], federal policies that incentivize specialized commodity production [4, 5], and by the increased integration of rural economies in globalized markets [6, 7]. Today, over half of US land is devoted to agricultural production, with over 80% of this land cultivated with corn, wheat, soy, alfalfa, or hay [8]. An increasing share of these crops are grown for export, with US farmers exporting, on average, more than 20% of what they produce [1]. Moreover, the operations on which these crops are grown are increasing in size, with large-scale operations earning more than $500 000 annually comprising only 7.5% of US farms but operating over 41% of agricultural land [1, 9].

While there is evidence that large-scale simplified systems do, in fact, generate higher yields [1012], there is growing concern about the implications of these trends for people and planet. US farm debt has increased by over 30% in the last decade to levels not seen since the 1980s Farm Crisis [13]. Over the same period, farm income has sharply declined, with the USDA reporting a median farm income of negative $1735 in 2018 [14]. Many families have left the agricultural sector, as a national trend of farm consolidation has pushed small- and medium-sized farms out of production [2, 15, 16]. Existing farms are increasingly exposed to fluctuations in global markets, often with devastating consequences [1719]. At the same time, changing climate [2024] and declining environmental quality [2527] threaten agricultural collapse in many of the same regions.

With these trends in mind, scientists, practitioners, and policy makers have started to question the production-centric paradigm that has dominated agricultural innovation over the last half-century, questioning its viability for the farmers and ecological systems on which agricultural production ultimately depends [2830]. If one of the most productive agricultural systems on the planet is also associated with farmer debt, farm consolidation, declining incomes, and a slew of negative environmental externalities, is production really the best metric of agricultural success?

We begin to explore this question by identifying the farm-level resources, farm(er) characteristics, and landscape context of the most productive agricultural systems in the US. We identify unexpectedly high yielding agricultural regions by estimating yield divergence from what would be expected given only biophysical conditions and regional norms for the five major crops that comprise nearly 80% of cultivated acreage in the US: corn, soy, wheat, hay, and alfalfa. By removing yield variance explained by biophysical conditions and regional norms, we isolate the human yield signal, or the human contribution to yield geographies. We then assess the farm-level resources, farm(er) characteristics, and landscape context of counties the for which the human yield signal is particularly strong.

We find that human activity has intensified and amplified the yield geographies explained by sun, soil, and water alone. In areas with unexpectedly high yields for these five crops, farmers spend more on major inputs to production (fertilizer, chemicals, machinery, and labor) and report higher rates of federal support, both in terms of government receipts and crop insurance. Surprisingly, we find that, on average, farmers in these unexpectedly high yielding systems do not earn higher net revenues. We also find that, beyond a threshold, increased input expenditure is associated with marginally decreasing yields. Taken together, this work suggests that while US agriculture excels at meeting human demand, it falls short in satisfying other important objectives. This raises serious questions about the future viability of US agriculture and highlights the importance of expanding the production-centric paradigm by reconceptualizing agriculture as a major force for positive, regenerative change for both people and planet.

2. Methods

We constructed panel datasets merging county-level yield estimates from the USDA NASS Survey with seasonal sun, soil, and water characteristics for US counties cultivating alfalfa, corn, hay, soy, and (winter) wheat from 2008 to 2018, excluding counties with fewer than three yield observations over this period. Together, these crops dominate US agriculture, covering 78% of agricultural land in the US (figure S10 (available online at [8] and representing nearly 60% of crop value produced in the US [31]. We constructed two county-season weather indicators—growing degree days and total precipitation—from gridded daily 4 km temperature and precipitation data provided by the PRISM Climate Group (2004) [32]. For each indicator, we defined season duration using the spatially explicit, crop-specific planting and harvesting dates provided by [33]. Growing degree days—an indicator of cumulative temperature exposure—are the sum of mean daily temperatures above a crop-specific threshold over the crop's growing season 3 , while total precipitation is the sum of precipitation (in mm) throughout the growing season. We also collected data describing the percent of a county's agricultural land irrigated from the USDA NASS [1]. When this data was unavailable, we replaced missing values with linearly interpolated estimates from the MiRAD project [34] standardized by agricultural extent estimates derived from the USDA's Cropland Data Layer. We constructed county-level indicators of soil characteristics by averaging at the county scale four variables in the Harmonized World Soil Database [35]: topsoil pH, topsoil organic carbon, topsoil cation exchange capacity, and topsoil exchangeable sodium percentage. These predictors were selected from the full HWSD dataset based on their known importance to agricultural production and their contributions to the prediction of yield in a random forests model (figure S9).

To capture well-established non-linearities in the effects of seasonal precipitation and temperature on yields [36, 37], we modeled the interactions between yields and county-year seasonal weather predictors using a second-order random walk function $f\left( {{X_{tc}}} \right)$. This structure allows the effect of these predictors to vary non-linearly while also accounting for temporal autocorrelation in predictors effects. To avoid overfitting, we modeled soil parameters and percent irrigated land as county-level linear controls. We also included a dummy indicator for year (${{TIM}}{{{E}}_t}$) to capture any dynamics that affect all counties across the country in a particular year such as major market or policy changes. Any spatial variability in the impact of these shifts is captured in the county-level random effects described below.

One of the challenges of modeling county-level agricultural production is strong spatial dependency in the data, i.e. neighboring counties tend to have similar yields. We address this issue by modeling county-level random effects using a Besag-York-Mollié model [38] that includes both county-specific spatially unstructured random effects (modeled as exchangeable) and county-specific spatially structured effects modeled using a spatial weights matrix linking counties that share borders. The unstructured random effects (${v_{0cr}}$) capture time-invariant factors associated with a county that influence yield (individual spice), while the spatially-structured effects (${u_{0cr}}$) account for the fact that observations from neighboring counties exhibit higher correlation than more distant counties. Following Cinner et al (2016), we also include regional random effects for each of the USDA's nine Farm Resource Regions (${v_{00r}}$). These regions capture important variability in farm size, farmer demographics, cropping patterns, sociocultural context, land management norms, and market access across the US (figure S1) [39]. By examining county effects in the context of the region in which they are nested, we can assess yields given the norms of a region. The final model is specified as:

where t indexes time, c indexes counties, and r indexes regions. It is the difference between the county-level random effects (${u_{0cr}} + {v_{0cr}}$) and the expected regional yields (${v_{00r}}$) that serves as the basis of the remaining analyses. This difference can be thought of as the extent to which, given biophysical conditions, regional norms, and national temporal trends in yield, a county's yields have diverged from national averages over the last decade. This approach removes the share of variance explained by biophysical factors alone, allowing us to assess the characteristics of yield geographies unexplained by the biophysical. These residual geographies represent the influence of human activities on yield, which we refer to as the human yield signal.

We estimated models using the R-INLA package [40] which uses Integrated Nested Laplace Approximations to increase the computational efficiency of model estimation. The model uses uninformative (reduced precision) prior distributions for linear effects (${Z_{tc}}$) and penalized complexity priors for non-linear seasonal weather predictors $\left( {{{\boldsymbol{X}}_{tc}}} \right)$. We employed default and recommended settings for penalized complexity priors as provided by Simpson et al [41]. Regional effects (${v_{00r}}$) were modeled using a logGamma prior with weakly informative priors. County-level random effects were modeled with a Besag-Mollie-York model which includes both iid random effects and Besag spatial effects using R-INLA default priors. Model fit was evaluated using the deviance information criterion (DIC), the conditional predictive ordinate, mean squared error and Bayesian R-squared (R2) (table S1). Model scripts and additional information on model diagnostics and robustness checks are available at

We collected available data describing farm resources and farm(er) characteristics from the USDA Agricultural Census. We included only variables with less than 10% missing data for the period of interest (Census years 2007, 2012, and 2017, table S3), with the exception of operator race. 74% and 41% of counties were missing data describing operated acreage by Black and Hispanic farmers respectively; data were most likely withheld by the USDA NASS to preserve confidentiality in counties where very few minority operators cultivate. Despite this limitation, we decided to include these variables in our descriptive assessments due to their growing importance in our national conversation about diversity and sustainability. We standardized predictors (where applicable) to facilitate comparison across counties using 'total operated acres' which includes agricultural land used for crops, pasture, or grazing, as well as woodlands, farm roads, and farm buildings. To model the effect of variability in land use patterns on agricultural production, we also constructed additional indicators of landscape composition and configuration from the USDA Cropland Data Layer—a 30 meter annual land use dataset based on satellite imagery and extensive ground truth data. We include two indicators of landscape composition—an indicator of crop diversity (Shannon Diversity Index) and a measure of the predominance of undeveloped and uncultivated landcover classes (Percent Natural Cover)—and two indicators of landscape configuration—an indicator of largest patch size (Mean Patch Size) and an indicator of the overall patchiness of a landscape (Edge Density). We compare values of each variable across a gradient of low (−1 sd) to high (+1 sd) unexpected yields using box and whisker plots. We also assess the contribution of each variable to the prediction of the human yield signal using a random forests analysis implemented using the randomForests package in R [42]. Random forests is well-suited to high dimensional, multicollinear data, and makes no distributional assumptions about the data on which it is trained, allowing it to handle complex nonlinear interactions among predictors. Though the variable importance estimates generated by the random forest algorithm do not have causal interpretations, by ranking the relative contribution of farm-level resources, farm(er) characteristics, and landscape context to the prediction of yield divergence from expectations, they provide initial insights into the human factors that may be associated with surprising yields.

3. Results

3.1. Biophysical correlates of yield

Variations in the agricultural yields of corn, soy, wheat, alfalfa, and hay are largely explained by differences in access to sun, soil, and water (figure 1, table S1). Seasonal temperature and precipitation have strong effects on the yields of all crops, with stronger effects for temperature than for precipitation (figure 1(A)). Beyond a threshold, increasing temperature exposure is associated with significant yield declines, particularly for corn and soy, corroborating prior research highlighting the threat posed by global heating [37, 4345]. The yields of corn, wheat, and soy also decline above a seasonal precipitation threshold (figure 1(B)), though hay and alfalfa show sustained yield gains across the range of seasonal precipitation. Though effect strength varies across crops, irrigation and topsoil pH have consistent positive effects on crop yields (figure 2(A)). Finally, the significant effects of year on yields (figure 2(B)) highlights the importance of major shifts in management, technology, markets, and policies that affect production nationally. These effects are comparable in size to seasonal weather effects.

Figure 1.

Figure 1. The crop-specific yield response to (a) seasonal growing degree days and (b) total seasonal precipitation estimated using a second-order random walk function. Solid lines show the median effect and shaded bands the 95% credibility limits. Model R2 for each crop above 0.7 with the exception of hay (table S1).

Standard image High-resolution image
Figure 2.

Figure 2. Regression coefficients for (a) the average effect of linear soil and irrigation predictors on yields and (b) the average effect of each year on yields. Year effects capture major shifts in management, technology, markets, and policies that affect production nationally. The points represent the mean estimate, while the error bars represent the 95% confidence interval. All continuous variables were standardized prior to running the models, so these effects represent the standard deviation change in yield associated with a one standard deviation increase in each predictor. For temporal fixed effects, the baseline year was set to 2008.

Standard image High-resolution image

In addition to controlling for county-level variability in biophysical factors, we also include regional random effects which place county-level yield dynamics in the context of the larger region in which they are located. This allows us to compute county-level yield expectations given regional norms, which we define using the USDA Farm Resource Regions [39]. We observe higher regional yield norms in the Midwestern US (Heartland, Mississippi Portal, Southeastern Seaboard, and Eastern Uplands) and lower yield norms in the Northern Great Plains and Prairie Gateway regions (figure S2).

County-level variability in yields unexplained by biophysical conditions, national temporal trends, and regional norms exhibits a strong spatial structure (figure 3), confirmed by the dominance of the spatially-structured component of the county-scale random effects (${u_{0cr}}$) as compared to the unstructured component (${v_{0cr}}$) and the presence of strong spatial autocorrelation in yield divergence from expectations (table S2). Even after controlling for the biophysical factors and regional norms that make the Midwestern US particularly well-suited to corn and soy production, the yields of these crops are significantly higher in the Midwest than expected. Corn, soy, alfalfa, and hay yields are also unexpectedly high in counties with access to the High Plains Aquifer. Alfalfa, hay, and winter wheat are particularly productive in the Pacific Northwest, though with significant intra-regional variability. In Arizona and New Mexico, alfalfa yields are up to five standard deviations above national averages, even after accounting for regional norms. The Southeastern US and the region along the Mississippi River show unexpectedly high yields across all crops, while the northern Great Plains show unexpectedly low yields across crops. We also compute average unexpected yields across crops, weighted by the proportion of a county's agricultural land cultivated with each crop, which deemphasizes high yield estimates in counties where little of a crop is cultivated (figure 3(F)). Once again, the Midwestern and Southeastern US emerge as unexpectedly productive regions, as well as the Southwestern US, though this signal is largely dominated by highly productive and large-scale alfalfa cultivation.

Figure 3.

Figure 3. The human yield signal, or yield geographies unexplained by biophysical conditions (sun, soil, and climate), national temporal trends, and regional norms. For example, a county with a value of two has observed yields that—given county-level biophysical conditions, national yield growth, and regional norms—are two standard deviations above the national average yield. Counties for which insufficient yield data was available (fewer than three observations through time) are shaded gray. Panel F shows the average of unexpected yields across crops, weighted by the proportion of a county's agricultural land cultivated with each crop and divided by the total proportion of agricultural land cultivated with the five crops we assess. Farm resource region boundaries are shown in dark gray.

Standard image High-resolution image

3.2. The human yield signal

What factors, beyond the biophysical controls and regional norms included in our models, might help to explain unexpected yields? To begin to answer this question, we collected and constructed data describing three categories of human activity known to directly or indirectly influence yields: farm-level resources, farm and farmer characteristics, and the land use decisions that shape landscape context (table S3). Farm-level resources include per acre expenditure on major inputs to production (machinery, labor, fertilizer, and chemicals) and per acre receipts from federal programs. These factors bound the seasonal on-farm decisions that have a strong, direct influence on production. Farm(er) characteristics such as farmer race, gender, experience, tenure, and farm size can influence access to farm-level resources as well as the adoption of new technologies and practices that influence production [46, 47]. Finally, the landscape context in which a farm operates can strongly influence the provisioning of ecosystem services essential to agricultural production [4853]. Agricultural activity has dramatically transformed landscapes in many part of the US—today, over half of US land is devoted to agricultural production [8]. Recent research suggests that differences in the composition—the quantity of land cover categories on a landscape—and configuration—how these land cover categories are arranged on a landscape—of the landscape in which a farm operates can have significant impacts on production outcomes [54].

To understand the major components of the human yield signal, we examine how farm-level resources, farm(er) characteristics, and landscape context vary across counties with unexpectedly high yields (counties with yields more than one standard deviation above expectations) and unexpectedly low yields (counties with yields less than one standard deviation below expectations) 4 . Because these high and low yielding categories are defined by the yields of only five crops—albeit five crops that comprise nearly 80% of what's grown in the US and which make up over 80% of cultivated acreage in over two-thirds of the counties included in this analysis (figure S10)—we note that the patterns in these human factors may be influenced by other crops in a county's cropping mix. These other crops, however, reported insufficient yields to include in this national analysis. Though this data availability limits our investigation, several compelling patterns emerge.

First, in areas of unexpectedly high production for major US crops, farmers spend more, on average, on major inputs to production (fertilizer, labor, machinery, chemicals) than in low production areas (figure 4(A)). Median fertilizer expense in unexpectedly productive regions was $27.90 per acre as compared to $7.81 in less productive regions. Median per acre expenditure on chemicals and machinery in high production areas was $14.86 and $ 389.07 respectively, as compared to $4.31 and $165.48 per acre in low production areas. This expenditure pattern is consistent across all crops included in the analysis (figure S3). Farmers in highly productive regions also receive over double the receipts from government programs 5 than farmers in low production areas (median values of $10.61 and $5.12 per acre respectively) and, on a per crop basis, have slightly higher rates of participation in federal crop insurance programs,ystem services essential to agry (figure S3(F)).

Figure 4.

Figure 4. Box and whisker plot showing differences in (a) farm-level resources, (b) farm(er) characteristics, and (c) landscape context in counties where the across-crop weighted average of yield divergence from expectations is greater than one standard deviation above the mean (+1 sd) or one standard deviation below the mean (−1 sd). Bold vertical lines represent the median. Values are standardized for comparison. For crop-specific results, see figure S3.

Standard image High-resolution image

Across the available farm and farmer characteristics we examine, the strongest consistent association we observe is for farm size (figure 4(B)). Surprisingly, larger farms are associated with lower than expected yields across the five crops included in our analysis (figure S4(D)). Median farm size for farmers in unexpectedly productive counties is 85 acres as compared to 172 acres in less productive counties. Farmer knowledge—measured as years of farming experience and whether the principal operator's primary occupation is farming—as well as the nature of land ownership do not have strong associations with unexplained yields, with the exception of tenancy, for which a higher percent of agricultural acreage operated by tenants is associated with noticeably higher yields for soy, alfalfa, and corn (figure S4(C)).

Another finding that merits further discussion is the interaction between farmer race and yields. In most regions of the US, white farmers operate well over 90% of cultivated land (figures S4(G) and S7(C)). Despite the systematic underrepresentation of Black farmers in US agriculture, increased acreage operated by Black farmers is associated with higher than expected yields, particularly for soy and hay (figure S4(E)). Hispanic farmers are slightly more prevalent in areas with higher alfalfa yields (figure S4(F)), while higher rates of female operated acreage are weakly associated with lower yields (figure S4(H)). We note, however, that these findings are tentative given the low level of data availability for farmer race.

We find no clear association between indicators of landscape composition (diversity and percent natural cover), configuration (edge density and mean patch size) and yields (figure 4(C)). Across crops, lower percent natural cover and higher mean patch area (larger contiguous areas of a single land cover type) are weakly associated with higher production levels (figure S5).

We use random forests regression to determine which factors most strongly predict divergence from yield expectations (figure 5). While we were unable to investigate racial associations with yields due to high levels of missing data, we included other available predictors of farm resources, farm(er) characteristics, and landscape context. We find that fertilizer and chemical expenses, which have strong positive associations with yield divergence (figures S3(A), S3(B) and 4(A)), are the strongest predictors of yield divergence from expectations. Government receipts are also strongly predictive of divergence from expectations, also with a strongly positive association (figures S3(E) and 4(A)). Median farm size is also predictive of unexpected yields, with smaller farm sizes associated with increased yields (figure S4(D) and 4(B)). Finally, edge density and crop diversity emerge as landscape characteristics most predictive of unexpected yields, with predictive strengths similar to those for machinery and labor inputs and stronger than those of farm(er) characteristics, though the direction of these effects are unclear (figures S5 and 4(C)).

Figure 5.

Figure 5. Variable importance plot showing the percent increase in mean squared error (MSE) associated with removing each predictor from the model. Model MSE on 25% held-out data was 0.287. Note that we had to exclude race from this analysis due to data availability. Results here predicting the weighted average of unexpected yields across crops.

Standard image High-resolution image

4. Discussion

Our results reveal several interesting and important patterns in the yield geographies of major US crops. First, for these five crops, yields that are not explained by differences in sun, soil, water, and regional norms exhibit a strong spatial structure, highlighting both the importance of explicitly accounting for spatial structure in yield models and the contribution of the broader context in which a county operates to agricultural yields. This finding also implies that major shifts in agricultural yields occur at a scale between US county—the scale at which yields are systematically reported by the USDA—and USDA Farm Resource Regions. This is supported by the relatively small effects of Farm Resource Regions on yields (figure S2), which suggests that yields vary nearly as much within regions as they do across regions.

Despite controlling for county-level differences in access to sun, soil, and water, the geographies presented in figure 3 reveal yield gradients that strongly follow known ecological and biophysical boundaries. For example, despite controlling for irrigation in our models, corn, soy, and hay show abrupt yield shifts that align strongly with the boundaries of the High Plains Aquifer (figure 6). Areas that are characterized by an abundance of surface water resources and highly suitable soil, such as the Mississippi River Valley, show unexpectedly high yields across all crops, while areas with more constrained water resources such as the Western US show high yields for only a limited number of crops. We interpret the persistence of these biophysical signals as the human amplification of natural patterns of biophysical suitability. Agricultural production expands and intensifies in regions particularly well-suited to the cultivation of a specific crop [55]; increased agricultural activity further amplifies yield gains, increasing farm-level resources, and reinforcing and intensifying the suitability geographies defined by sun, soil, and water access.

Figure 6.

Figure 6. Yield divergence from expectations in the High Plains Aquifer.

Standard image High-resolution image

This persistent spatial structure also highlights the limits to human amplification of biophysical suitability. Consider, for example, the regions at the periphery of the Midwestern US that exhibit lower than expected yields for the two commodities that dominate this region: corn and soy. Over the last 50 years, the intersecting forces of consumer demand, trade policy, and market prices have pushed the cultivation of corn and soy beyond the regions that are most biophysically suited to their cultivation [5, 56, 57]. Since the 1960s, harvested corn and soy acreage has increased by 76%, with corn and soy alone comprising over 56% of harvested cropland in the US [9].

The second major finding is that farm-level expenditures on fertilizer, chemicals, machinery, and labor are consistently higher in unexpectedly productive regions of the US. This finding is consistent across all crops and inputs (figure S3). We estimate that the median total per acre expense on these inputs is $458 per acre in unexpectedly productive counties, over two times higher than the $185 per acre spent in low yielding counties. Farmers may benefit from increased input use if increased input expenditures result in higher yields; however, our comparison of livelihood indicators available from the USDA suggests that though crop sales are higher in unexpectedly productive regions, there is no observable difference in net farm income across this yield gradient (figure 7(A)). This finding is consistent across crops—with the exception of soy and, to a lesser extent, alfalfa (figure S6(B)). While this finding may be influenced by other farm activities beyond the cultivation of the five major crops examined here, such as high value livestock production or cultivation of high value specialty crops in areas with low crop yields, it nevertheless points to a disconnect between increased efforts and expenditures to boost production of the most widely grown crops in the US and compensation for those efforts. The partial dependence plots from the random forest regression also suggest that increased input expenditure is associated with diminishing yield returns (figure 7(B)). Only at exceptionally high expenditure rates of around $260 and $217 dollars per acre for fertilizer and chemical expense respectively are increased inputs associated with higher yields. This merits further exploration, especially given that per farm average expenditures—of which labor, machinery, fertilizer, and chemicals comprise over 30%—reached over $176 000 in 2017 [1]. These increased expenditures are strongly associated with unprecedented levels of farmer debt [13] and a nearly 40% drop in net farm income since 2013 [14, 56, 58].

Figure 7.

Figure 7. (A) Box and whisker plot showing differences in standardized county-level livelihood indicators across weighted average yield divergence from expectations. Bold vertical lines represent the median. Values are standardized for comparison. Here, net farm income is derived by subtracting total farm expenses from total sales, while total farm income is the average operation-scale income, before taxes and expenses. (B) Partial dependence plots for inputs and government receipts. Partial dependence is the dependence of the outcome on one predictor after averaging out the effects of the other predictors in the model, graphically characterizing the relationship between an individual predictor and the predicted values of yield divergence.

Standard image High-resolution image

Farm-level decisions are strongly bounded by the larger structures, institutions, and policies that shape US agriculture [5, 9]. Our results highlight the strong influence of these factors on yield outcomes across the nation. First, the temporal effects we estimate are comparable in magnitude to the effects of seasonal weather on yields, highlighting the strong contribution of interannual variability in markets and policies to yield outcomes. Note, for example, the diverging effects for corn and soy over the period from 2010 to 2013—a period of tremendous price volatility for both crops. Note also, the steady decline in year effects from 2016 to 2018. This period is marked by geopolitical unpredictability as well as trade wars between the US and China that significantly strained US commodity exports. We also observe that farmers in unexpectedly productive regions receive over two times more support from government programs than those cultivating in unexpectedly low yielding regions. Government receipts are also highly predictive of unexplained yields (figure 5), though like physical inputs to production, they show diminishing returns in their contribution to the prediction of unexpected yields (figure 7(B)). The implications of the strong association between government receipts and high yields are mixed. Federal programs are an important source of income stabilization for US farmers; however, research suggests that participation in these programs may have negative implications for farm-level adaptive capacity and resource use [2, 5961]. Large farms tend to have greater access to these programs [2, 4], and while the link between farm size and yields is mixed [11], this finding may highlight that higher-yielding systems are those that are more likely to have access to and benefit from government support.

The third major finding is that while there are no clear overall associations between farmer characteristics and unexplained yields, increased acreage operated by Black farmers is associated with higher than expected yields for several crops. This is important given the relative challenges Black, Indigenous, and other farmers of color face in accessing land and capital [62, 63] and the ways in which government programs (strongly associated with higher yields) have been shown to systematically privilege white farmers [64]. Our exploration of available data also highlights the strong spatial concentration of minority farmers, with Black operators cultivating primarily in the southeastern, and Hispanic operators in the southwestern US (figure S7).

Fourth, and rather surprisingly, larger farms are associated with lower than expected yields. Over the last decades, the number of small- and mid-sized farms in the US has declined. In 2018, large-scale farms (sales greater than 500k) covered over 40% of agricultural land but comprised only 7.5% of the total number of farms [1, 9]. The shift towards large-scale industrial farming is strongly driven by the assumption that larger farms are more productive [12, 65]. Though this assumption has been widely debated [12], recent evidence suggests that in developed economies large farms tend to be more productive [3, 10, 11]. This evidence is particularly strong in grain production, where many recent technological advances, such as large combine harvesters and precision agricultural technologies, most strongly benefit large-scale farms [3, 11]. While we note that while yields (outputs) are distinct from productivity (the ratio of inputs to outputs), this finding provides some preliminary evidence that, after controlling for the natural assets of a farm, size may not necessarily bring high yields.

Finally, though no clear associations between landscape context and yields emerge from these analyses, for specific crops, counties with larger contiguous areas of the same crop see higher than expected yields. While this appears to contradict the finding that larger farm operations are associated with lower than expected yields, it may suggest that it is not the size of the operation but the extent of specialization within the operation that is associated with yield gains. We also observe lower than expected yields in areas with greater percent natural cover, though this likely reflects the fact that areas with more natural cover may also be those that are less well-suited to agricultural production. Though no major differences emerge across yield divergence from expectations (figure 4(C)), edge density and agricultural diversity are strongly predictive of yields. This suggests that landscape characteristics, though not significantly different across yield divergence from expectations, still show interesting nonlinear impacts on the prediction of yields after accounting for other factors. The difference here also highlights how the effects of these landscape characteristics on yields are likely nonlinear and not fully captured by high to low yield gradient we explore. As edge density increases, predicted production decreases, suggesting that increased configurational complexity of a landscape may not necessarily be good for production. Interestingly, the effect of crop diversity on production corroborates published research suggesting that diversity, particularly high levels of diversity, is associated with higher yields [54, 66] (figure S8(B)).

5. Conclusions

Taken together, these results highlight the ways in which human activity has amplified biophysical suitability signals in the geography of US agricultural production. While unexpectedly high yields are strongly associated with increased expenditure on major inputs to production and higher federal support, they are not strongly associated with net revenue gains for farmers. We also find that beyond a threshold, increased input expenditure is associated with marginally decreasing returns to production. This provides preliminary evidence that the intensification of agricultural production in highly suitable areas is not necessarily associated with improvements in net farm revenues. This raises important questions about the tradeoffs between yields and farm livelihoods. What are the ultimate goals of agricultural production? Meeting human demand for food, fuel, and fiber is a must, but what of the livelihoods of the individuals who grow our food? And what of the environmental base that supports all food production? Our analyses suggest that some of the most unexpectedly productive regions of the US fail to meet the equally important objectives of economic viability and environmental sustainability.

The geographies revealed by these analyses also highlight regions of concern. First, the strong yield signal in the High Plains Aquifer highlights production intensification brought by heavy groundwater irrigation. This aquifer is expected to lose an estimated 24% of irrigated area in this century due to agricultural withdrawals [67], and whether these high yields can be sustained into the future is currently under debate [20, 68, 69]. Second, federal support of corn and soy production has pushed corn and soy production beyond the periphery of the Corn Belt, into regions we find have lower than expected yield outcomes for these crops [29, 39]. This expansion has displaced other crops that may be better suited to this region and exerted a number of environmental externalities [7072]. While changing climate that will likely increase the biophysical suitability of these peripheral regions to the cultivation of corn and soy [23, 57], the strong market and policy signals driving the expansion of corn and soy may limit farmer capacity to explore more sustainable alternatives [73].

Many have argued that meeting projected food demand while also preserving rural livelihoods and ecological integrity will require a significant reconceptualization of the ultimate goals of agriculture [7476]. One such reconceptualization proposed by the USDA defines sustainable agriculture as an integrated system that satisfies human demand while also enhancing environmental quality, efficiently using natural resources, supporting farmer livelihoods, and enhancing quality of life for farmers and society as a whole [77]. Our research shows that while US agriculture excels at meeting human demand, we fall short in satisfying other important objectives, raising serious concerns about the economic and ecological viability of US agriculture over time. We applaud and encourage those in the agricultural community who are critically expanding the production-centric paradigm to reposition agriculture as a force for positive, regenerative change for both people and planet. We also urge members of the scientific community to continue to reimagine and redefine agriculture as tool to cultivate socioecological well-being.

Data availability statement

The data that support the findings of this study are available upon reasonable request from the authors.


This work is supported in part by United States Department of Agriculture (USDA) National Institute of Food and Agriculture Grant No. 2020-67019-31157. Any opinions, findings, conclusions, or recommendations expressed in this publication are those of the authors and do not necessarily reflect the view of the U.S. Department of Agriculture. Many thanks to Britta Schumacher for invaluable help in wrangling the USDA Census data into submission. Thanks to Kaitlyn Spangler and Britta Schumacher for providing thoughtful and constructive feedback on the manuscript. Thanks to Dr. Jeremy Cowan for his valuable insights on soil dynamics.


  • GDD baseline temperature of 0 °C for winter wheat and hay, 10 °C for corn and soy and 15 °C for alfalfa [78, 79].

  • We use one standard deviation to have a reasonable sample size, but note that when using two standard deviations, this strengthens the differences we describe. We include summaries of average systems (between −1 and +1 SD) in the SI.

  • This category consists of direct payments from the government and includes: payments from Conservation Reserve Program, Wetlands Reserve Program, Farmable Wetlands Program, and Conservation Reserve Enhancement Program; loan deficiency payments; disaster payments; other conservation programs; and all other federal farm programs under which payments were made directly to farm operators. Commodity Credit Corporation proceeds, local and state government agricultural program payments, and federal crop insurance payments are not tabulated in this category (USDA NASS, 2019, p. 759).

Please wait… references are loading.