Hostname: page-component-8448b6f56d-c47g7 Total loading time: 0 Render date: 2024-04-16T05:23:34.510Z Has data issue: false hasContentIssue false

Water availability and crop growth at the crop plot level in South Africa modelled from satellite imagery

Published online by Cambridge University Press:  08 April 2014

E. BLANC*
Affiliation:
Massachusetts Institute of Technology, Joint Program on the Science and Policy of Global Change, 77 Massachusetts Avenue, Cambridge, MA 02139-4307, USA
E. STROBL
Affiliation:
Ecole Polytechnique, Department of Economics, Cedex 91128, Palaiseau, France
*
*To whom all correspondence should be addressed. Email: eblanc@mit.edu
Rights & Permissions [Opens in a new window]

Summary

Although the effect of weather on crop growth has been studied widely, the contribution of other water sources has been less well studied, mainly due to data limitation. To address this gap, the current analysis considers the importance of water availability on crop growth by taking advantage of crop field boundaries and information on South Africa's four major grain producing provinces (Northwest, Mpumalanga, Free State and Gauteng) provided by the Agricultural Geo-referenced Information System dataset. To capture crop growth along the crop growing cycle at the plot level, the MODIS's MOD13Q1 dataset of 16-day normalized difference vegetation index (NDVI) was used. To estimate the determinants of crop growth, weather effects were considered and represented by rainfall and reference evapotranspiration satellite derived data provided by the National Oceanic and Atmospheric Administration's RFE and GDAS dataset, respectively. Hydrologic and irrigation determinants were estimated based on the HYDRO1K river network dataset produced by the US Geological Survey. The results show that although weather is an important explanatory factor, other sources of water, such as irrigation, proximity to perennial and ephemeral rivers, and stream flow are also influential. Taking into account the interaction effects between weather and water availability related factors is also important to determine the effect of water availability on crop growth.

Type
Crops and Soils Research Papers
Copyright
Copyright © Cambridge University Press 2014 

INTRODUCTION

Agriculture is an important sector of South Africa's economy, representing 0·03 of GDP and 0·065 of total national exports (van Niekerk Reference van Niekerk2012). However, while 0·12 of South Africa's land can be used for crop production, only 0·22 of this is considered as high-potential arable land. The greatest limitation in this regard is the availability of water, with uneven and unreliable rainfall over South Africa's seven climatic regions, causing at times severe agricultural losses to commercial and subsistence farmers. Thus, understanding the role that water scarcity plays in crop productivity is a necessary step to ensuring food sufficiency and export earnings in the region.

Crop growth studies considering the role of climatic factors generally can be categorized as either biophysical or statistical models. Biophysical models rely on controlled environments (growth chambers and/or field trial) experiments over small areas (Azam-Ali & Squire Reference Azam-Ali and Squire2002), whereas statistical models are based on time series of single crop plots or temporally pooled cross-sectional data. Traditionally, the statistical models have been based on country or regional level data and thus the unit of analysis is spatially relatively aggregated. However, advances in satellite technologies have provided crop biomass production indicators, such as net primary production (NPP) or the normalized difference vegetation index (NDVI), at a high resolution, which have led to a rising number of crop development statistical models at a much lower unit of analysis than those using information collected by national statistical agencies. Benedetti & Rossini (Reference Benedetti and Rossini1993) show that such an index provides a relatively low-cost and useful crop monitoring tool. For instance, a study by Yang et al. (Reference Yang, Yang and Merchant1997) of Nebraska at a 1-km resolution showed a strong correlation between NDVI temporal change and temperature. Karnieli et al. (Reference Karnieli, Agam, Pinker, Anderson, Imhoff, Gutman, Panov and Goldberg2010) use NDVI data at an 8 km resolution over North America and also find a significant correlation between NDVI and land surface temperature. In Western Africa, Malo & Nicholson (Reference Malo and Nicholson1990) showed a linear relationship between precipitation and NDVI at a 3×5-km resolution, whereas Schultz & Halpert (Reference Schultz and Halpert1993) showed at a global 1° grid scale that the influence of climatic factors on NDVI differ regionally.

Although the recent surge of satellite-data-based statistical models have enabled researchers to study the climatic determinants of crop production at a small unit of analysis while still covering large spatial areas, they have generally been limited to examining cropland without being able to distinguish between crop types over space and time – see, for example, Milesi et al. (Reference Milesi, Samanta, Hashimoto, Krishna Kumar, Ganguly, Thenkabail, Srivastava, Nemani and Myneni2010) and Blanc & Strobl (Reference Blanc and Strobl2013). Additionally, existing analyses have not been able to consider water availability, which is location-specific. In the present paper, these weaknesses are addressed using detailed spatial crop field information in the four major grain producing provinces of South Africa – Northwest, Mpumalanga, Free State and Gauteng – representing >0·90 of the total summer grain growing area in the country (Ferreira et al. Reference Ferreira, Newby, du Preez, Baruth, Royer and Genovese2006). More specifically, in contrast to the usual gridded cropland data, the current dataset not only identifies exact crop field boundaries, but also the type of crop grown and whether the field is irrigated. Moreover, knowing the specific location of the crop plot enables one to estimate their distance from water sources. For each of these fields, a satellite-based measure of crop biomass productivity is calculated every 16 days over the crop's growing cycle and the impact of water availability is estimated in a regression framework. Water availability in this regard is measured along a number of dimensions, including local precipitation, river flow and distance to the nearest river.

The remainder of the paper is organized as follows. The next section describes the data and provides some summary statistics, whereas the Modelling Framework and Results section outlines the regression methodology and provides results. Conclusions, limitations and future work are discussed in the final section.

DATA

Spatial delineation: crop plot boundaries

Crop field boundaries were provided by the Agricultural Geo-referenced Information System (AGIS) developed by the South African National Department of Agriculture (available online from www.agis.agric.za). These data were available for the provinces of Free State, Gauteng, Northwest and Mpumalanga. The field boundaries in this dataset were determined using the Producer Independent Crop Estimate System (PICES) which combines satellite imagery, Geographic Information System (GIS), point frame statistical platforms and aerial observations (Ferreira et al. Reference Ferreira, Newby, du Preez, Baruth, Royer and Genovese2006). Satellite imagery of crop field of cultivated fields was obtained from the SPOT 5 satellite at a 2·5-m resolution. Crop field boundaries were then digitized using GIS. When cloud-free satellite images were not available, field polygons hidden by clouds were removed before processing. This ensured a more accurate dataset. Over the four regions of interest, PICES distinguished c. 280 000 plots covering an area of c. 6·5 million hectares.

To approximately match the resolution of the crop growth indicator data described below, which were only available at the 250-m resolution, the analysis was limited to plots larger than 6·25 ha. This restricted the sample to 213 110 crop fields. A geographical representation of the crop fields’ location is presented in Fig. 1.

Fig. 1. Crop plot location in South Africa.

Crop types

Within each crop plot described above, AGIS provided information on the crop cultivated. Crop types were determined using the digitized satellite images described above. Sample points were selected randomly and surveyed by trained observers from a very light aircraft in order to determine crop type (Ferreira et al. Reference Ferreira, Newby, du Preez, Baruth, Royer and Genovese2006). Crop information collected during the aerial surveys on the sample points was used as a training set for crop type classification for each field and for accuracy assessment. For crops planted sparsely, field verification on certain sample points was not possible and therefore larger areas were considered to include these crops. These estimated crop classifications were checked against a producer based survey for the Gauteng region. The Gauteng census survey showed that less than 1·8% of crop types had been misclassified.

All in all, seven summer crops were distinguished (classification of winter crops was generated for the season 2007/08 only, therefore these crops were not considered in the analysis): cotton, dry beans, groundnuts, maize, sorghum, soya and sunflower. Given the small number of cotton observations (110 plots over only one province), these were excluded from the analysis. Although the data also identified fallow and pasture plots, these were also not considered given the focus of the analysis on cropland.

Crop classification data were available for the summer season 2006/07 for the provinces of Free State, Gauteng, Northwest and Mpumalanga, and for one province only, Free State, for the summer season 2007/08. A geographical representation of crop types for each season is provided in Fig. 2. Figure 3 provides a close-up representation of crop type change across seasons. As can be seen, crops showed patterns of alternation between crop types and fallow/pasture.

Fig. 2. Crop type location for the summer seasons 2006/07 and 2007/08.

Fig. 3. Crop type changes across the summer seasons 2006/07 and 2007/08.

Crop growth measure

Crop biomass production was estimated using the NDVI. Vegetation indices such as the NDVI are particularly attractive as they provide consistent spatial and temporal representations of vegetation conditions. As a matter of fact, numerous studies have demonstrated that NDVI values are significantly correlated with crop yields including wheat (Das et al. Reference Das, Mishra and Kalra1993; Gupta et al. Reference Gupta, Prasad, Rao and Nadham1993; Doraiswamy & Cook Reference Doraiswamy and Cook1995; Hochheim & Barber Reference Hochheim and Barber1998; Labus et al. Reference Labus, Nielsen, Lawrence, Engel and Long2002), sorghum (Potdar Reference Potdar1993), corn (Hayes & Decker Reference Hayes and Decker1996; Prasad et al. Reference Prasad, Chai, Singh and Kafatos2006), rice (Quarmby et al. Reference Quarmby, Milnes, Hindle and Silleos1993; Nuarsa et al. Reference Nuarsa, Nishio and Hongo2011), soybean (Prasad et al. Reference Prasad, Chai, Singh and Kafatos2006), barley (Weissteiner & Kühbauch Reference Weissteiner and Kühbauch2005), millet (Groten Reference Groten1993) and tomato (Koller & Upadhyaya Reference Koller and Upadhyaya2005). Moreover, NDVI has also been shown to provide a very good indicator of crop phenological development (Benedetti & Rossini Reference Benedetti and Rossini1993).

The NDVI index is calculated using ratios of vegetation spectral reflectance over incoming radiation in each spectral band. More specifically, NDVI can be formulated as:

$${\rm NDVI} = ({\rm NIR} - {\rm VIS})/({\rm NIR} + {\rm VIS})$$

where the difference between near-infrared reflectance (NIR) and visible reflectance (VIS) values are normalized by the total reflectance and vary between −1·0 and 1·0 (Eidenshink Reference Eidenshink1992). Negative and very low values corresponding to water and barren areas were excluded from the analysis by design. The NDVI data were extracted from the MOD13Q1 dataset (available online from: https://lpdaac.usgs.gov/lpdaac/content/view/full/6652), which regroups reflectance information collected by the MODerate-resolution Imaging Spectroradiometer (MODIS) instrument operating on NASA's Terra satellite (Huete et al. Reference Huete, Didan, Miura, Rodriguez, Gao and Ferreira2002). The NDVI estimates were available as 16-day composite indices at a resolution of 250 m. Area-weighted averages of the 16-day NDVI values for each crop field were calculated using the ‘zonal statistics’ tool in ArcGIS.

Growing season

Crop growing seasons are characterized by the planting date and the phenology cycle, which determine the length of the season. In South Africa, planting dates are spread from October to December in order to reduce the vulnerability to erratic rainfalls (Ferreira et al. Reference Ferreira, Newby, du Preez, Baruth, Royer and Genovese2006). At the same time, however, phenology cycles also differ among crops. The TIMESAT program (Jönsson & Eklundh Reference Jönsson and Eklundh2002, Reference Jönsson and Eklundh2004) was used to determine crop- and field-specific growing seasons. The algorithm within the software is commonly used to extract seasonality information from satellite time-series data. Within this context, it allows one to approximate the start and end of growing seasons based on distribution properties of NDVI. The repartition of growing season for each crop is presented in Fig. 4. This bar chart shows that within the area considered, groundnuts had the shortest growing season, whereas sunflower had the more widespread planting period.

Fig. 4. Crop growing season estimated using Timesat. Notes: Average growing season bars represent the average start and end of growing season for each crop. Start and End range bars represent respectively start and end dates of growing seasons comprised between the 10th and the 90th percentile.

Using these growing season estimates for each crop plot, it was also possible to determine the stage of the biomass development within the cropping season. In this exercise, the position of each 16-day period was determined relative to the length of the cropping season. For instance, the first 16-day period of a 160-day cropping season was assigned the value 0·1, and the last was assigned the value 1. This measure therefore accounts for the difference in growing season length of each crop plot.

Weather

Daily rainfall data were extracted from the rainfall estimation algorithm RFE (version 2.0) dataset implemented by the National Oceanic and Atmospheric Administration (NOAA) – Climate Prediction Center (CPC). These data, which were generated from a combination of rain gauges and satellite observations, were available at FEWS NET Africa Data Portal (available online from: http://earlywarning.usgs.gov/fews/africa/index.php) at the 0·1° resolution (c. 10 km).

Reference evapotranspiration (ETo), which represents the evaporative demand of the air, was calculated using the Penman–Monteith equation following the FAO methodology (Allen et al. Reference Allen, Pereira, Raes and Smith1998). These data are also available from: http://earlywarning.usgs.gov/fews/global/index.php: this source labels ETo as ‘potential evaporation’. However, as noted in Allen et al. (Reference Allen, Pereira, Raes and Smith1998), ‘the use of other denominations such as potential ET is strongly discouraged due to ambiguities in their definitions.’ Daily ETo data at 1° resolution were calculated using a 6-hourly assimilation of conventional and satellite observational data of air temperature, atmospheric pressure, wind speed, relative humidity and solar radiation extracted from the NOAA's Global Data Assimilation System.

Irrigation

The AGIS crop boundaries dataset also provides information regarding irrigation for each crop plot. As displayed on the left-hand side of Fig. 5, cropland in South Africa was mainly rain-fed. Moreover, the right-hand side of Fig. 5, which represents a detailed view of the crop plots, shows that irrigated crop plots were generally clustered around streams. It should be noted that the lines representing streams were not representative of the width of the actual streams, which may explain the ‘gap’ between the plots and the streams.

Fig. 5. Crop plots irrigation.

Other water sources

To account for water sources other than local precipitation, the proximity of crop plots to streams and the daily flow of these were calculated. The proximity of each crop plot to a river or stream (perennial and non-perennial) was estimated using the river network of the African continent from the HYDRO1K dataset (USGS 2011). Stream flow was estimated using the Geospatial Stream Flow Model (GeoSFM) (Artan et al. Reference Artan, Asante, Smith, Pervez, Entenmann, Verdin and Rowland2008 ), which simulates the dynamics of runoff processes using spatial information on river basin and network coverage, land cover type, soil characteristics, and daily precipitation and evapotranspiration data. River basins in South Africa were delineated using the HYDRO1K dataset, which provides drainage basin boundaries data derived from river network and flow direction data. Soil characteristics (water-holding capacity, hydrologically active soil depth, texture, average saturated hydraulic conductivity) were extracted from the Digital Soil Map of the World (FAO 2011). The GeoSFM model produced daily stream flow in terms of cubic meters per second (m3/s). Each crop plot was then assigned to a river basin and its corresponding river flow. For crop plots spreading over more than one river basin, the basin to which the plot belongs is determined by the largest area share located in the basin.

Summary statistics

Summary statistics of time-invariant plot characteristics for each crop are provided in Table 1. Maize was the most prevalent crop with nearly 50 000 plots in the 2006/07 season. Groundnuts plots were on average the largest (37 ha). Irrigation summary statistics indicate that the irrigation rate for crops ranged from 4 to 7% (i.e. lowest for sorghum and highest for sunflower). Dry beans and groundnuts plots were located the furthest from perennial rivers, with an average distance of 13 km.

Table 1. Crop plots summary statistics

Crop plot time-series statistics are presented by crop type in Table 2 for all provinces over the 2006/07 season. Table 3 provides similar statistics for the Free State province over both the 2006/07 and 2007/08 seasons. The NDVI index, showing daily biomass productivity, was the largest for soya. Productivity was generally slightly larger during the 2007/08 season. One may also note that although the NDVI index is theoretically bounded, it never actually came close to either bound within this dataset.

Table 2. Summary statistics by crop type for all provinces for the 2006/07 season

Table 3. Summary statistics by crop type for the Free State province for the 2006/07 and 2007/08 seasons

The growing season length was the longest for dry beans and the shortest for groundnuts, with an average of 172 and 141 days, respectively. Weather variables show that sorghum and soya were grown in the wettest conditions (i.e. the highest rainfalls and the lowest ETo rates) during the 2006/07 season. Dry beans and soya were cultivated within the basin having the lowest stream flows in both seasons.

The skewness statistics show that the Stream flow and Rain were positively skewed and may need to be transformed.

MODELLING FRAMEWORK AND RESULTS

Regression specification

In the current analysis, a vector of various weather and irrigation factors was considered to estimate the determinants of crop growth. The base regression specification was formulated as follows:

(1) $$NDVI_{\,pt} = \alpha + \delta X_{\,pt} + \varepsilon _{\,pt} $$

where p and t are plot and time indicators, respectively, X represents a vector of explaining variables, δ are the estimated coefficients representing the marginal effects and ε a standard independent and identically distributed (i.i.d.) error term. The vector X includes weather variables, represented by rainfall and ETo, and irrigation-related variables such as the distance to perennial and non-perennial rivers, stream flow and irrigation application. Many factors explaining crop growth, such as soil fertility, ozone concentration, crop variety, nutrient application and other management practices, were not included in the analysis. The omission of these explaining factors was an important issue for this, and many other, studies but was unavoidable due to data limitation. However, in order to control for region-specific fixed effects, a set of province factors was also included in all specifications. This regional distinction was deemed the most appropriate as it captures the geographic and political unobservable specificities. Also, it may reflect data collection discrepancies across regions as the data were collected at the province level. As a sensitivity analysis, regression (6) was re-estimated for the full sample using alternatives to the province factors. Climatic zone factors were considered first. The study area spreads over three major climate zones defined by FAO (1996): warm Sub-Tropics, cool Sub-Tropics and cold Sub-Tropics. A second alternative considered agro-ecological zones (AEZ) developed by Monfreda et al. (Reference Monfreda, Ramankutty, Hertel, Hertel, Rose and Tol2009), who defines zones globally by overlaying six categories of growing length periods with climatic zones. In the study area, five AEZ categories are distinguished. However, neither of these alternative regional indicators changed the estimation results in any noticeable qualitative or quantitative manner. Detailed results are available from the authors upon request. It should be noted that, since the full set of these factors would be perfectly correlated if included together, the Free State factor was excluded so that any marginal effects on the others must be interpreted in terms of additional marginal productivity relative to this reference province. Additionally, as these fixed effects assume that the explanatory variables coefficients are similar across provinces, the interaction of the province factor variables with each determinant were also included to relax this condition.

The statistical analysis was performed using Stata 12.0 (StataCorp 2011). To investigate the possibility that crop growth is spatially correlated, a Moran's I test (Moran Reference Moran1950) was conducted. Since this test indicated that the crop growth measure was spatially correlated, the non-parametric covariance matrix estimator proposed by Driscoll & Kraay (Reference Driscoll and Kraay1998) was implemented to obtain robust s.e. for all estimations.

To account for the skewness of the Stream flow and Rain variables, the log of Stream flow and the square root of Rain, which were deemed the most appropriate at reducing the skewness, were also considered. However, the results indicated that the transformed variables did not improve the fit of the regression and the distribution of the residuals. Therefore, for ease of interpretation, the Stream flow and Rain variables were kept in levels.

Regression results

Regressions results at the crop level are presented in Tables 4–6. These regressions were obtained through a selection process to remove non-significant secondary variables (i.e. non-linear terms and interaction terms) from the most general specification of Eqn (1). In order to make the results more legible, the dependent variable was scaled by a factor of 10 000 in all specifications. The crop samples were pooled across the four provinces. Differences between provinces were accounted for by including a set of province factors (the reference case is Free State) and their interaction terms with explanatory variables.

Table 4. Regression results

Notes: The dependent variable, NDVI, is scaled by a factor of 10 000; s.e. in parentheses; the reference case for the province factors is Free State.

Table 5. Regression results

Notes: The dependent variable, NDVI, is scaled by a factor of 10 000; s.e. in parentheses; the reference case for the province factors is Free State.

Table 6. Regression results

Notes: The dependent variable, NDVI, is scaled by a factor of 10 000; s.e. in parentheses; the reference case for the province factors is Free State.

Rainfall and evapotranspiration were considered in the model to represent the effect of weather on crop biomass productivity. The Rain coefficients suggest that precipitation had a direct beneficial impact on sorghum and soya biomass productivity in all provinces. For dry beans, the beneficial effect was observed in the Mpumalanga province only. Rainfall had no significant effect on sunflower and groundnuts. Increases in evapotranspiration had a negative effect on dry beans, except in the Mpumalanga province, where it was beneficial to biomass productivity growth. For groundnuts, the effect of increased evapotranspiration was detrimental in all provinces, whereas it was beneficial for maize, sorghum and sunflower. The largest evapotranspiration effect was observed for sorghum.

In order to investigate whether there may be non-linearities in the relationship between these weather factors and cropland biomass productivity, their squared terms were included. Accordingly, there is strong evidence that evapotranspiration had a non-linear relationship with biomass productivity of maize, sorghum and sunflower. For these crops, the regressions indicate that evapotranspiration had a biomass productivity enhancing effect up to a certain point. The coefficients suggest that this threshold was about 7·4 mm/day for maize, 8·3 mm/day for sunflower and 10·3 mm/day for sorghum. In contrast, the impact of rainfall was better modelled by simply including it in its linear form.

To account for the effect of other forms of water availability on crop biomass productivity, the effect of the presence of irrigation on the crop plot was investigated. The results show that irrigation enhanced cropland biomass productivity substantially for all crops except sorghum, which was the less irrigated crop of the sample (<0·02 of the plots). The size of the coefficients suggests that irrigation was the most important for maize in the Northwest. More specifically, the estimated coefficient indicates that, in the Northwest, an irrigated maize plot had a 0·2 greater NDVI than a non-irrigated plot. In contrast, irrigation of a sunflower plot in Mpumalanga only increased biomass productivity by 0·003 NDVI.

Stream-related factors such as stream flow and the distance of each plot from perennial and ephemeral rivers were also considered. Stream flow had a positive direct effect on biomass productivity for most crops, although the quantitative effect was very small (e.g. one sd of river flow entailed an increase of less than 0·00005 NDVI). In a few cases (i.e. dry beans in Mpumalanga, maize in Gauteng and soya and sunflower in Mpumalanga), however, the increased stream flow appears to have had a detrimental effect on biomass productivity. For sorghum, the direct effect of stream flow was not significant.

Distance to perennial and non-perennial rivers were also significant determinants of the dependent variable. Generally, being closer to a perennial river increased the biomass productivity of a plot but being closer to an ephemeral river decreased it. For instance, for each kilometre closer to a perennial river, sunflower biomass productivity increased by up to 0·02 NDVI in Gauteng. In Mpumalanga, productivity was slightly larger for dry beans and sunflower plots being farther away from a stream. No effect was estimated for groundnuts. In contrast, being farther away from ephemeral streams increased biomass productivity for maize, sorghum and soya in the Free State and sorghum in Gauteng only.

Thus far, it was assumed that each of the explanatory factors had potentially only isolated effects on the biomass productivity of crops. This may arguably be a rather restrictive and unrealistic assumption. For instance, the importance of rainfall in providing water to crops depends on the degree of evapotranspiration. Similarly, the effect of river flow will be related to the distance of the plot from rivers. Moreover, the existence of irrigation may make the dependence on other time-varying water resources less important. To investigate these factors, a set of interaction terms was included. As can be seen for the regressions, this unearthed some interesting features from the data. When considering the interaction effect between rainfall and evapotranspiration, the results indicate that evapotranspiration not only had a direct effect on sorghum, but also reduced the productivity enhancing impact of precipitation by about 20%. Similar impacts were found for dry beans in Mpumalanga and soya in Northwest.

As gauged by the interaction term between the Irrigation and Rain variables, irrigation also appeared to reduce the reliance on precipitation for most crops except dry beans and groundnuts. Irrigation reduced the importance of rainfall as a water source for sunflower biomass production the most. When considering the combined effect of Stream flow with Irrigation, stream flow increased sorghum biomass productivity only if the plots were irrigated. For maize in Gauteng, in contrast, the negative effect of stream flow on biomass productivity was exacerbated by irrigation. For groundnuts in the Northwest and dry beans, stream flow had a smaller beneficial impact on biomass productivity in irrigated plots. Finally, the distance to perennial and ephemeral rivers can reduce the impact of stream flow for some crops. More specifically, the interaction terms of these factors with the stream flow proxy show that being farther away from a perennial river reduced the impact of stream flow on soya and sunflower in all provinces and for maize in Free State and Northwest, sorghum in Mpumalanga, and groundnuts and dry beans in Northwest.

All regressions included the growing season length, growing season stage and its square term. The growing season length unsurprisingly reduced the biomass productivity of plots within South Africa, as crops having a longer cropping season grow more slowly and produce less biomass daily. The growing season stage coefficients indicate that crop biomass increased as the growing season advanced and then decreased past an optimal point in the growing season. This inverted U shape is representative of the biomass evolution of crops along the phenological cycle. These results are consistent across all specifications in Tables 4–6.

The inspection of the residual for each crop-specific regression confirmed that variances are constant over the fitted range and that the residuals have a normal distribution.

DISCUSSION

The current analysis considers the determinants of crop growth at the crop plot level in four grain producing provinces of South Africa. It is unique in that it is based on a detailed high-frequency (every 16 days) dataset of satellite derived productivity measures for four crop types at the plot level covering the main agricultural areas of the country: Northwest, Mpumalanga, Free State and Gauteng. The regression results show that water availability plays an important role in cropland biomass productivity in South Africa. Weather, as represented by evapotranspiration and precipitation, has a significant and well-known impact on crop biomass productivity. The results also demonstrate the importance of other sources of water for crop biomass productivity growth by considering irrigation, stream flow and proximity to perennial and ephemeral rivers. However, the effect of these factors depends on the crop type and province considered, with the best representation obtained for dry beans.

It is noteworthy that the R 2 in the estimations suggests that the regression analyses explain biomass productivity growth better for some crops than others. For instance, c. 68% of dry beans productivity is explained by the regressions. When considering sunflower and groundnuts, however, the R 2 is less than 0·35. This may not be surprising given that the vector of crop biomass determinants is missing a number of potentially important factors as discussed above. Moreover, NDVI is inevitably only a proxy of actual crop yield and thus entails some measurement error. Finally, the growing season of a crop is determined by a decision rule, similarly inducing some measurement errors.

One should practise some caution when interpreting the determinants of crop growth in a strictly causal way given that they are derived from an observational study. As already indicated by the less than perfect explanatory power, there are likely to be many other determining factors driving crop growth that have not been able to be included, but that could be correlated with the ones that are. For example, crop location is implicitly taken as given, but farmers with greater access to fertilizers or machinery may be those that choose, or can afford to, locate in more irrigated areas. Similarly, the choice of crop for a particular location may be determined by such potentially confounding factors. Thus, it is perhaps best to view the estimated relationships, particularly for the non-climatic factors, as correlations rather than causal determinants.

With these caveats in mind, the present results are arguably most useful in terms of anticipating the impact of weather shocks on the growth of a given crop. More specifically, one could use these quantitative estimates to forecast the growth of existing crops in specific regions in South Africa once one observes the weather in a given growing season. Alternatively, one may use the results to compare short-term crop growth under different climate scenarios. Once more crop-level data – such as the one employed in the present study – become available over a longer time period, one could also then extend this analysis to model crop choice.

Finally, one should note that the availability of crop plot level information played an essential role in this analysis. These plot information combined with biomass productivity data would be also extremely useful in devising an identification scheme of crop type based on NDVI temporal evolution. More generally, and related to this, it is encouraging that satellite imagery is becoming increasingly more sophisticated, offering products at finer spatial scale and higher frequency. This will allow greater precision in terms of crop identification and growth patterns.

References

Allen, R. G., Pereira, L. S., Raes, D. & Smith, M. (1998). Crop Evapotranspiration – Guidelines for Computing Crop Water Requirements. FAO Irrigation and Drainage Paper no. 56. Rome: FAO.Google Scholar
Artan, G. A., Asante, K., Smith, J., Pervez, S., Entenmann, D., Verdin, J. P. & Rowland, J. (2008). Users’ Manual for the Geospatial Stream Flow Model (GeoSFM). U.S. Geological Survey Open-File Report 2007–1440. Reston, Virginia, USA: U.S. Geological Survey.Google Scholar
Azam-Ali, S. N. & Squire, G. R. (2002). Principles of Tropical Agronomy. Wallingford, UK: CABI Publishing.Google Scholar
Benedetti, R. & Rossini, P. (1993). On the use of NDVI profiles as a tool for agricultural statistics: the case study of wheat yield estimate and forecast in Emilia Romagna. Remote Sensing of Environment 45, 311326.Google Scholar
Blanc, E. & Strobl, E. (2013). The impact of climate change on cropland productivity: evidence from satellite based products at the river basin scale in Africa. Climatic Change 117, 873890.Google Scholar
Das, D. K., Mishra, K. K. & Kalra, N. (1993). Assessing growth and yield of wheat using remotely-sensed canopy temperature and spectral indices. International Journal of Remote Sensing 14, 30813092.Google Scholar
Doraiswamy, P. C. & Cook, P. W. (1995). Spring wheat yield assessment using NOAA AVHRR data. Canadian Journal of Remote Sensing 21, 4351.Google Scholar
Driscoll, J. C. & Kraay, A. C. (1998). Consistent covariance matrix estimation with spatially dependent panel data. Review of Economics and Statistics 80, 549560.Google Scholar
Eidenshink, J. C. (1992). The 1990 conterminous US AVHRR data set. Photogrammetric Engineering and Remote Sensing 58, 809813.Google Scholar
FAO (1996). Major Climates of Africa. Rome: FAO GeoNetwork. Available from: http://www.fao.org/geonetwork/srv/en/metadata.show?id=6&currTab=simple (accessed 19 February 2014).Google Scholar
FAO (2011). Digital Soil Map of the World. Rome: FAO. Available from: http://www.fao.org/geonetwork/srv/en/metadata.show?id=14116 (accessed 12 February 2014).Google Scholar
Ferreira, S. L., Newby, T. & du Preez, E. (2006). Use of remote sensing in support of crop area estimates in South Africa. In SPRS Archives XXXVI-8/W48 Workshop Proceedings: Remote Sensing Support to Crop Yield Forecast and Area Estimates (Eds Baruth, B., Royer, A. & Genovese, G.), pp. 5152. Ispra, Italy: Joint Research Centre.Google Scholar
Groten, S. M. E. (1993). NDVI-crop monitoring and early yield assessment of Burkina Faso. International Journal of Remote Sensing 14, 14951515.Google Scholar
Gupta, R. K., Prasad, S., Rao, G. H. & Nadham, T. S. V. (1993). District level wheat yield estimation using NOAA/AVHRR NDVI temporal profile. Advances in Space Research 13, 253256.Google Scholar
Hayes, M. J. & Decker, W. L. (1996). Using NOAA AVHRR data to estimate maize production in the United States Corn Belt. International Journal of Remote Sensing 17, 31893200.Google Scholar
Hochheim, K. P. & Barber, D. G. (1998). Spring wheat yield estimation for Western Canada using NOAA NDVI data. Canadian Journal of Remote Sensing 24, 1727.Google Scholar
Huete, A., Didan, K., Miura, T., Rodriguez, E. P., Gao, X. & Ferreira, L. G. (2002). Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sensing of Environment 83, 195213.Google Scholar
Jönsson, P. & Eklundh, L. (2002). Seasonality extraction by function fitting to time-series of satellite sensor data. IEEE Transactions on Geoscience and Remote Sensing 40, 18241832.Google Scholar
Jönsson, P. & Eklundh, L. (2004). TIMESAT – a program for analysing time-series of satellite sensor data. Computers and Geosciences 30, 833845.CrossRefGoogle Scholar
Karnieli, A., Agam, N., Pinker, R. T., Anderson, M., Imhoff, M. L., Gutman, G. G., Panov, N. & Goldberg, A. (2010). Use of NDVI and land surface temperature for drought assessment: merits and limitations. Journal of Climate 23, 618633.Google Scholar
Koller, M. & Upadhyaya, S. K. (2005). Prediction of processing tomato yield using a crop growth model and remotely sensed aerial images. Transactions of the ASAE 48, 23352341.Google Scholar
Labus, M. P., Nielsen, G. A., Lawrence, R. L., Engel, R. & Long, D. S. (2002). Wheat yield estimates using multi-temporal NDVI satellite imagery. International Journal of Remote Sensing 23, 41694180.Google Scholar
Malo, A. R. & Nicholson, S. E. (1990). A study of rainfall and vegetation dynamics in the African Sahel using normalized difference vegetation index. Journal of Arid Environments 19, 124.Google Scholar
Milesi, C., Samanta, A., Hashimoto, H., Krishna Kumar, K., Ganguly, S., Thenkabail, P. S., Srivastava, A. N., Nemani, R. R. & Myneni, R. B. (2010). Decadal variations in NDVI and food production in India. Remote Sensing 2, 758776.Google Scholar
Monfreda, C., Ramankutty, N. & Hertel, T. W. (2009). Global agricultural land use data for climate change analysis. In Economic Analysis of Land Use in Global Climate Change Policy (Eds Hertel, T. W., Rose, S. & Tol, R. S. J.), pp. 3348. Abingdon, Oxon, UK: Routledge.Google Scholar
Moran, P. A. P. (1950). Notes on continuous stochastic phenomena. Biometrika 37, 1723.Google Scholar
van Niekerk, L. (2012). Pocket Guide to South Africa 2011/2012. Pretoria, South Africa: GCIS.Google Scholar
Nuarsa, I. W., Nishio, F. & Hongo, C. (2011). Relationship between rice spectral and rice yield using MODIS data. Journal of Agricultural Science (Canada) 3, 8088.Google Scholar
Potdar, M. B. (1993). Sorghum yield modelling based on crop growth parameters determined from visible and near-IR channel NOAA AVHRR data. International Journal of Remote Sensing 14, 895905.Google Scholar
Prasad, A. K., Chai, L., Singh, R. P. & Kafatos, M. (2006). Crop yield estimation model for Iowa using remote sensing and surface parameters. International Journal of Applied Earth Observation and Geoinformation 8, 2633.CrossRefGoogle Scholar
Quarmby, N. A., Milnes, M., Hindle, T. L. & Silleos, N. (1993). The use of multi-temporal NDVI measurements from AVHRR data for crop yield estimation and prediction. International Journal of Remote Sensing 14, 199210.Google Scholar
Schultz, P. A. & Halpert, M. S. (1993). Global correlation of temperature, NDVI and precipitation. Advances in Space Research 13, 277280.Google Scholar
StataCorp (2011). Stata Statistical Software: Release 12. College Station, TX, USA: StataCorp LP.Google Scholar
USGS (2011). HYDRO1k Dataset. Washington, DC: U.S. Geological Survey. Available from: https://lta.cr.usgs.gov/HYDRO1K (accessed 12 February 2014).Google Scholar
Weissteiner, C. J. & Kühbauch, W. (2005). Regional yield forecasts of malting barley (Hordeum vulgare L.) by NOAA-AVHRR remote sensing data and ancillary data. Journal of Agronomy and Crop Science 191, 308320.Google Scholar
Yang, W., Yang, L. & Merchant, J. W. (1997). An assessment of AVHRR/NDVI-ecoclimatological relations in Nebraska, U.S.A. International Journal of Remote Sensing 18, 21612180.CrossRefGoogle Scholar
Figure 0

Fig. 1. Crop plot location in South Africa.

Figure 1

Fig. 2. Crop type location for the summer seasons 2006/07 and 2007/08.

Figure 2

Fig. 3. Crop type changes across the summer seasons 2006/07 and 2007/08.

Figure 3

Fig. 4. Crop growing season estimated using Timesat. Notes: Average growing season bars represent the average start and end of growing season for each crop. Start and End range bars represent respectively start and end dates of growing seasons comprised between the 10th and the 90th percentile.

Figure 4

Fig. 5. Crop plots irrigation.

Figure 5

Table 1. Crop plots summary statistics

Figure 6

Table 2. Summary statistics by crop type for all provinces for the 2006/07 season

Figure 7

Table 3. Summary statistics by crop type for the Free State province for the 2006/07 and 2007/08 seasons

Figure 8

Table 4. Regression results

Figure 9

Table 5. Regression results

Figure 10

Table 6. Regression results