Skip to main content Accessibility help


  • Access
  • Open access
  • Cited by 9



      • Send article to Kindle

        To send this article to your Kindle, first ensure is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about sending to your Kindle. Find out more about sending to your Kindle.

        Note you can select to send to either the or variations. ‘’ emails are free but can only be sent to your device when it is connected to wi-fi. ‘’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

        Find out more about the Kindle Personal Document Service.

        Imported cases and minimum temperature drive dengue transmission in Guangzhou, China: evidence from ARIMAX model
        Available formats

        Send article to Dropbox

        To send this article to your Dropbox account, please select one or more formats and confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your <service> account. Find out more about sending content to Dropbox.

        Imported cases and minimum temperature drive dengue transmission in Guangzhou, China: evidence from ARIMAX model
        Available formats

        Send article to Google Drive

        To send this article to your Google Drive account, please select one or more formats and confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your <service> account. Find out more about sending content to Google Drive.

        Imported cases and minimum temperature drive dengue transmission in Guangzhou, China: evidence from ARIMAX model
        Available formats
Export citation


Dengue is the fastest spreading mosquito-transmitted disease in the world. In China, Guangzhou City is believed to be the most important epicenter of dengue outbreaks although the transmission patterns are still poorly understood. We developed an autoregressive integrated moving average model incorporating external regressors to examine the association between the monthly number of locally acquired dengue infections and imported cases, mosquito densities, temperature and precipitation in Guangzhou. In multivariate analysis, imported cases and minimum temperature (both at lag 0) were both associated with the number of locally acquired infections (P < 0.05). This multivariate model performed best, featuring the lowest fitting root mean squared error (RMSE) (0.7520), AIC (393.7854) and test RMSE (0.6445), as well as the best effect in model validation for testing outbreak with a sensitivity of 1.0000, a specificity of 0.7368 and a consistency rate of 0.7917. Our findings suggest that imported cases and minimum temperature are two key determinants of dengue local transmission in Guangzhou. The modelling method can be used to predict dengue transmission in non-endemic countries and to inform dengue prevention and control strategies.


Dengue is the fastest spreading mosquito-borne viral disease worldwide and is primarily transmitted by Aedes albopictus and Aedes aegypti. It affects 3.6 billion people, with 96 million out of approximately 390 million infections annually resulting in symptomatic infection [Reference Bhatt1]. Dengue circulates in more than 100 countries each year and has the great impact in the Americas, Asia, Africa and Australia [Reference Diamond and Pierson2], imposing a great economic burden on affected areas with accounting for approximately 1 billion US dollars in annual economic cost for Southeast Asian countries [Reference Shepard, Undurraga and Halasa3].

The causative agent of dengue is dengue virus (DENV), which is antigenically divided into four serotypes (DENV-1, DENV-2, DENV-3 and DENV-4) [Reference Kularatne4]. The spectrum of dengue infection ranges from inapparent infection, non-specific febrile illness and self-limited dengue fever (DF) to severe dengue, which manifested by dengue hemorrhagic fever (DHF) and dengue shock syndrome (DSS). A second infection caused by a different serotype can increase the risk for severe disease mainly due to antibody-dependent enhancement (ADE) [Reference Dejnirattisai5]. Rapid fluid loss into tissue spaces can lead to death from hemoconcentration and hypotension [Reference Diamond and Pierson2]. At present, the licensed vaccine must be combined with effective vector control to prevent large-scale outbreaks due to its modest efficacy [Reference Wilder-Smith and Gubler6].

Dengue was first identified in mainland China in 1978 from Foshan City after 1949, that is adjacent to Guangzhou [Reference Qiu7] and has been a notifiable disease since 1 September 1989 [Reference Lai8]. Dengue transmission mostly occurred in Guangzhou City with more than 50% of the cases in mainland China reported since 1989 [Reference Jing9]. All four serotypes had circulated in Guangzhou since 1978 with one serotype dominating each transmission year [Reference Jing9]. Reported indigenous cases meaning locally acquired dengue infections in Guangzhou soared dramatically beyond expectations in 2014, with 37 340 indigenous cases that accounted for approximately 80% of the total reported cases in mainland China [Reference Cheng10]. However, the critical risk factors of dengue transmission in Guangzhou are still poorly understood.

Statistical models indicate that dengue incidence on a local scale is associated with imported cases also named travel-related dengue infections, vector densities, climate variables, socioeconomic and environmental factors, population immune status and human movement [Reference Sang11Reference Luz15]. However, the exact relationship between the risk factors and dengue incidence varies by location; for instance, a variable can increase the dengue risk in some districts but decrease it in other areas [Reference Morin, Comrie and Ernst16]. Early importation and larger numbers of imported cases in the calendar year have been identified as the most important determinants underlying dengue transmission in China [Reference Cheng10], Japan [Reference Quam17] and Australia [Reference Ernst18]. Most studies suggest that precipitation is positively correlated with dengue risk in tropical countries; however, a few epidemics have also been documented in areas with low rainfall or larval indices, such as Thailand in 1987 [Reference Eamchan19] and Singapore in 1986 [Reference Goh20]. Warmer temperature is associated with dengue incidence in Southeast Asia [Reference van Panhuis21]. Most dengue transmission occurs when the temperature is higher than 20 °C. However, extremely high temperatures (>36 °C) can also limit the feeding activity of female mosquitoes to restrain transmission [Reference Morin, Comrie and Ernst16]. A series of statistical methods including negative binomial regression model, generalised additive model, times series were used in such studies. And due to the autocorrelation of dengue occurrence, times series model showed easy operation and powerful prediction ability in the actual application.

In this study, we use prewhitened cross-correlation and autoregressive integrated moving average (ARIMA) model to analyse the dengue occurrence in Guangzhou from 2001 to 2016. We seek to improve the understanding of critical risk factors for dengue transmission in Guangzhou, with the aim of developing a predictive dengue model to inform dengue prevention plans.

Materials and methods

Ethics statement

The present study was fully reviewed and approved by the Ethics Committee of the Guangzhou Center for Disease Control and Prevention (GZCDC) and the Ethics Committee of the School of Public Health of Sun Yat-sen University. All of the patient data were de-identified and the data were analysed anonymously.

Research area

Guangzhou is the capital city of Guangdong Province in southern China. With a current population of more than 12.84 million, it is one of the most highly populated cities in the world. Located at 112°57 E to 114°3 E and 22°26 N to 23°56 N and cover a total area of 7434.40 km2, Guangzhou has 10 administrative districts (Liwan, Yuexiu, Haizhu, Baiyun, Tianhe, Huangpu, Luogang, Panyu, Nansha and Huadu) and two satellite cities (Conghua and Zengcheng) (Fig. 1). Guangzhou is one of the most urbanised areas in China and is the most important center of international commerce in south China. It has a humid subtropical climate that is influenced by Asian monsoons [Reference Lu22], resulting in a wet and warm climate that is favourable for vector reproduction [Reference Cheng10]. Over the past three decades, Aedes albopictus was observed to be the sole vector for dengue transmission in Guangzhou with the absence of Aedes aegypti. According to vector surveillance data, Aedes albopictus ranked second at a proportion of approximately 5.89% of all adult mosquitos, following only Culex fatigans (89.90%) in Guangdong province [Reference Cai23].

Fig. 1. The location of Guangzhou in China. The left plot is the whole view of China, with a dark colour indicating Guangdong Province. The right graph zooms in on Guangdong Province, with a dark colour indicating its capital city Guangzhou.

Data sources

Monthly dengue cases aggregated by onset date from 2001 to 2016 were collected by the GZCDC in accordance with the Law on Prevention and Treatment of Infectious Diseases of China. Once diagnosed in medical institutions, the case must be reported to the web-based National Notifiable Infectious Disease Reporting Information System within 24 h according to the National Diagnostic Criteria for Dengue Fever (WS216-2008) published by the Chinese Ministry of Health [Reference Jing9]. Through the face-to-face interviews conducted by GZCDC, the cases are classified as either imported or indigenous based on the patient's travel history and dengue's human incubation period.

Monthly vector surveillance which was conducted in each midmonth days in total 36 representative communities, with the Breteau index (BI) for indoor larval density, standard space index (SSI) for outdoor larval density and adult mosquito density index (ADI) monitored by the GZCDC [Reference Luo24]. The definitions of the indices are as follows: BI = the number of containers positive for larvae or pupae per 100 houses, SSI = the number of containers positive for larvae or pupae per 100 standard spaces in which one stand space of 15 m2 outdoors and the ADI = the number of Aedes per hour by the human snare method. Monthly temperature and precipitation data for Guangzhou from 2001 to 2016 were downloaded from the China Meteorological Data Sharing Service System ( The mean (T mean), maximum (T max) and minimum temperatures (T min) and the total (P total), mean (P mean) and maximum precipitation (P max) were used in this study.

Data processing and time series analysis

The time series data collected above were divided into two parts as follows: the first 168 months (2001–2014) were used to construct the time series model with the Box and Jenkins approach [Reference Helfenstein25, Reference Box and Jenkins26] and the last 24 months (2015–2016) was used to validate the predictive power of the model.

ARIMA model for dengue occurrence

Because of temporal autocorrelations between data points and dengue's seasonal transmission patterns, time series analyses, especially ARIMA model, are effective to model dengue transmission. The model provides a general framework for analysing seasonal disease incidence data such as dengue [Reference Luz15], influenza [Reference Soebiyanto, Adimi and Kiang27], leptospirosis [Reference Chadsuthi28], tuberculosis [Reference Permanasari, Rambli and Dominic29], malaria [Reference Wangdi30] and brucellosis [Reference Lee31]. Accounting for temporal autocorrelations between variables can lead to higher accuracy using ARIMA on the same data [Reference Chadsuthi28]. Furthermore, it can incorporate external factors, such as climatic variables, which can improve the fitting and prediction accuracy; under these circumstances, the model as an extension of the ARIMA model is named the ARIMAX model.

The model expression ARIMA (p, d, q) (P, D, Q)s [Reference Cryer and Chan32], p is the order of the autoregression (AR) process, q is the number of moving average (MA) terms, d represents the differencing process to form a stationary times series, and P, D and Q are the seasonal orders of the AR, differencing and MA processes, respectively. Additionally, s denotes the seasonal period. Differencing was used to stabilise the variance and mean to fit the model's requirements [Reference Gharbi33]. In the ARIMA model, if the time series was nonstationary, the log transformation and differencing were used to remove the nonstationary terms. In our study, we generated a log transformation plus 2 (log (the number of indigenous cases +2)) for the monthly indigenous dengue cases. We added 2 to case incidence data to avoid zero prior to taking the log transformation and modeling processes. The predicted number of indigenous dengue cases at time t (Y t) was determined by the model equation (1) [Reference Box and Jenkins26, Reference Gharbi33]:

(1)$$Y_t = \displaystyle{{\theta (B)\Theta (B^s )a_t} \over {\Phi (B)^s \phi (B)(1 - B)^d (1 - B^s )^D}} $$

here, θ(B) is the operator for MA, Θ (B s) is the seasonal MA operator, ϕ(B) is the AR operator, Φ(B s) is the seasonal AR operator, (1 − B)d and (1 − B s)D denote the ordinary and seasonal difference components, respectively, a t and Y t represent the white noise and the dependent variable, respectively [Reference Gharbi33]. The p and q orders can be inferred using the cutoff time lag of the autocorrelation function (ACF) and the partial autocorrelation function (PACF). If the time series showed a seasonal pattern with an ACF peak at a time lag of s, then the same procedure was applied to the seasonal ARIMA model. The Ljung-Box test was used to test the model residuals did not show residual autocorrelation.

Cross-correlation analysis

Due to strong autocorrelations in the data, correlations of the time series of the monthly number of indigenous cases with imported cases, vector densities and climate were difficult to identify. In this study, a prewhitening process was applied to the data among the multiple external regressors. Prewhitening to avoid common trends between incidence and risk factors [Reference Box and Jenkins26], was conducted in three steps. First, we determined a time series model for the y-variable in step 1 and stored the residuals from this model and then filtered the x-variable series using the y-variable model in step 2 and finally examined the cross-correlation function (CCF) between the residuals from Step 1 and the filtered x-values from Step 2.

The CCF of the prewhitened imported cases, vector density and climate variables with the indigenous cases time series were calculated to identify the significant time lags. The factors that did not show a significant time lag not included in the model analysis [Reference Chadsuthi34]. Based on the results from previous studies as well as biological and epidemiological plausibility, we selected those positively significant lag values for the next step of the analysis with maximum lag steps of four.

Univariate and multivariate ARIMAX analyses

The lag value of the statistically significant factors identified by the cross-correlation analysis was incorporated as external covariates into the ARIMAX model constructed above. The ARIMAX model is described by equation (2) [Reference Box and Jenkins26, Reference Gharbi33]:

(2)$$Y_t = \displaystyle{{\theta (B)\Theta (B^s )a_t} \over {\Phi (B)^s \phi (B)(1 - B)^d (1 - B^s )^D}} + X$$

here, X is the external regressor, which may be univariate or multivariate. The other parameters are as described in the ARIMA model above. In this study, we first took a single lag value into the univariate ARIMAX model. For the multivariate analysis, the incorporated factors were selected from the factors with significant regression coefficients in the univariate analysis (P < 0.10). P < 0.05 in the regression coefficient test was considered statistically significant in the multivariate model. The coefficients of the model were estimated using the maximum likelihood method. The AIC was used to identify the best fitting model while taking into account possible overfitting of the data. In this study, RMSE was also used to determine the best performance. For the model diagnostics, the ARIMA and ARIMAX models were used to test whether model residuals were significantly different from white noise. Finally, we used cross-validation of one-step method using the selected model to measure model performance.

Model validation for testing outbreak

According to the Technical Guidelines for Dengue Control and Prevention enacted by Chinese Center for Diseases Control and Prevention, a dengue outbreak is as the appearance of three confirmed indigenous cases in a period no longer than 15 days in a confined geographic area, such as one community, village, school or other collective units.

To investigate model robustness, the cross-validation for the testing outbreak was applied. However when applied to time series data traditional leave-one-out or k-fold cross validation may produce overly optimistic results due to temporal autocorrelation in the data, violating usual assumptions of independence. In our time series analysis, cross-validated error taking into account temporal dependence in the data was obtained as follows [Reference Arlot and Celisse35]: (1) Fit the model to the observed data Y 1, Y 2,…,Y t and let $\widehat{Y}_{t + 1} $ denotes the predicted value of the next observation. Then compute the error (e t+1 = Y t+1 − $\widehat{Y}_{t + 1} $) for the forecast observation. (2) Repeat step 1 for t = k,…, n − 1 where k is the minimum number of observations needed for fitting the model and n is the total number of observations. (3) Compute the mean RMSE from e k+1,…, e n. In our paper, we took k as 168 for our initial training data from January 2001 to December 2014 and the following 24 observations as testing data with initial forecast observation on January 2015 and the final forecast observation on December 2016.

The receiver operating characteristic curve (ROC) was used to determine the optimal cut-off point for the occurrence of an outbreak. Model accuracy was evaluated by the area under the ROC curve (AUC), larger values indicating higher accuracy. Afterwards the sensitivity, specificity and consistency rate for predicting outbreaks were computed, based on evaluating model error on testing data from January 2015 to December 2016 [Reference Zhang36].

The R software (version 3.2.2, the R foundation for Statistical Computing, Vienna, Austria) with the TSA, forecast and pROC packages were used for the time series analysis and graphical display.


Dengue occurrence patterns and their associated factors

A total of 42 470 dengue cases were reported from 2001 to 2016, including 41 370 (97.41%) indigenous cases and 1100 (2.59%) imported cases. The dengue transmission season was generally restricted to the period between summer (June–August) and autumn (September–November), with a peak in September. There were 4 years of major local outbreaks consisting of 1422 cases in 2002, 765 cases in 2006, 1249 cases in 2013 and 37 340 cases in 2014. To stabilise the variance and subsequent differencing of indigenous dengue case incidence data, we calculated the natural log transformation plus 2 (log (number of indigenous cases +2)) for the monthly indigenous cases in the study, which is depicted in Figure 2.

Fig. 2. Natural logarithm of the monthly indigenous dengue cases +2 from 2001 to 2016 in Guangzhou, China.

The temperature (T mean, T max and T min), precipitation (P total, P max and P mean) and vector density (BI, SSI and ADI) variables showed similar patterns, with high values in the summer and autumn. The average monthly number of imported cases over the study period is plotted in Figure 3 and Supplementary Table S1. A monthly number of imported cases generally proceeded indigenous cases, which arose through the local transmission. The highest total precipitation (P total = 3006.5 mm) occurred in 2016, although there were 204 reported indigenous cases in that year; meanwhile, the lowest total precipitation (P total = 1338.7 mm) occurred in 2003, when there were 76 reported indigenous cases.

Fig. 3. The patterns of the recorded risk factors, including (a) imported cases (n), (b) temperature (°C), (c) precipitation (10* mm) and (d) vector density (percentages for BI and SSI, numbers per hour for ADI) per month.

ARIMA model

The best fitting model selected according to AIC was the ARIMA(0,1,1)(0,0,2)12 model with an AIC of 400.8343 and RMSE of 0.7762. The ACF and PACF plots were used to estimate the temporal autocorrelation, as demonstrated in Figure 4. The Ljung-Box test indicated that model residuals did not significantly depart from a white noise process (P > 0.05).

Fig. 4. Autocorrelation function (ACF) and partial ACF (PACF) plots of the log transformation plus 2 for the original monthly indigenous dengue cases (a and b) and the ACF and PACF plots for the residuals of the ARIMA (0,1,1)(0,0,2)12 model (c and d). The dotted lines indicate the 95% confidence intervals.

Cross-correlation analysis

After prewhitening, we calculated the cross-correlation coefficients between the mosquito densities, climate variables and imported cases. Table 1 shows that the log(indigenous cases + 2) time series correlated positively with the imported cases at the 0 and 1-month lags, the mean temperature at lag 0, the maximum temperature at lag 0 and the minimum temperature at lag 0.

Table 1. Cross-correlations between the prewhitened factors and the indigenous cases (log (number of indigenous cases +2)) time series between 2001 and 2014

ICs, imported cases; BI, Breteau index; SSI, standard space index; ADI, adult density index; T mean, mean temperature; T max, maximum temperature; T min, minimum Temperature; P total, total precipitation; P mean, mean precipitation; P max, maximum precipitation.

‘+’, positive statistical significance; ‘.’ no statistical significance; ‘−’, negative statistical significance.

Univariate and multivariate ARIMAX analyses

Based on the previous cross-correlation analysis, we tested the ARIMAX model by incorporating the imported cases at lag 0 and lag 1, mean temperature at lag 0, maximum temperature at lag 0 and minimum temperature at lag 0 as external regressors based on the ARIMA (0,1,1)(0,0,2)12 model from a log (indigenous cases +2) time series.

In the fitting process, the imported cases at lag 0 and the minimum temperature at lag 0 were statistically significant in the univariate analysis. As shown in Table 2, the ARIMAX (0,1,1)(0,0,2)12 model with the imported cases had the lowest AIC (395.9477) and fitting RMSE (0.7607) in the fitted model and the lower validation RMSE (0.6813) compared with the ARIMAX (0,1,1)(0,0,2)12 model with the minimum temperature. In the multivariate analysis, both the imported cases at lag 0 and minimum temperature at lag 0 were statistically significant, as demonstrated in Table 3. The ACF and PACF for the residuals of the ARIMAX (0,1,1)(0,0,2)12 model with the imported cases and the ARIMAX (0,1,1)(0,0,2)12 model incorporating both the imported cases and minimum temperature at lag 0 were depicted in Supplementary Figure S1.

Table 2. Summary of the fitted parameters of the univariate ARIMAX model analysis in Guangzhou, 2001–2016

ARIMAX, autoregressive integrated moving average with external regressors; Fitted, fitted results; Test, test results; AIC, Akaike Information Criterion; RMSE, root mean square error; Coef., coefficient of risk factors; lag, time lag of risk factors; s.e., standard error; t, t statistic; ma1, MA(1); sma1, seasonal MA(1); sma2, seasonal MA(2).

**P < 0.01, *P < 0.10.

Table 3. Summary of the fitted parameters of the multivariate ARIMAX model analysis in Guangzhou, 2001–2016

ARIMAX, autoregressive integrated moving average with external regressors; Fitted, fitted results; Test, test results; AIC, Akaike Information Criterion; RMSE, root mean square error; Coef., coefficient of risk factors; lag, time lag of risk factors; s.e., standard error; t, t statistic; ma1, MA(1); sma1, seasonal MA(1); sma2, seasonal MA(2).

**P < 0.01, *P < 0.05.

Therefore, the ARIMAX (0,1,1)(0,0,2)12 model incorporating both the imported cases and the minimum temperature at lag 0 was the best fitting model, with the lowest AIC (393.7854) and RMSE of 0.7520. The validation RMSE (0.6445) was lower than for the ARIMAX (0,1,1)(0,0,2)12 model with the imported cases at lag 0(0.6813), an decrease of 5.40% (Fig. 5).

Fig. 5. Log transformation of the monthly dengue indigenous cases in Guangzhou from 2001 to 2016. (a) ARIMAX (0,1,1)(0,0,2)12 model with the imported cases at lag 0 tested by one-step validation method for the test data, (b) ARIMAX (0,1,1)(0,0,2)12 model with both the imported cases and the minimum temperature at lag 0, predicted by one-step validation method by test data.

Model validation for testing outbreak

Forty-five (23.44%) outbreaks were identified among the 192 observations from January 2001 to December 2016. On the training dataset from January 2001 to December 2014, the AUC for the ARIMAX (0,1,1)(0,0,2)12 model including imported cases at lag 0 was 0.9719 with the best threshold as 1.5524 and the AUC for the ARIMAX (0,1,1)(0,0,2)12 model including both imported cases and minimum temperature at lag 0 was 0.9754 with the best threshold as 1.6096 (Supplementary Fig. S2). On the testing dataset (testing outbreak), the ARIMAX(0,1,1)(0,0,2)12 model with imported cases at lag 0 had a sensitivity of 1.0000 (1/1), a specificity of 0.6842(13/19) and a consistency rate of 0.7500(18/24). The ARIMAX(0,1,1)(0,0,2)12 incorporating both the imported cases and minimum temperature at lag 0 had a sensitivity of 1.0000 (1/1), a specificity of 0.7368 (14/19) and a consistency rate of 0.7917 (19/24). More details can be found in Figure 6 and Supplementary Table S2.

Fig. 6. Cross-validation for testing dengue outbreak in Guangzhou. The upper row of points in the graph denotes the predicted outbreaks by our model and the lower row shows the observed outbreaks. The lines show the predicted values and actual values respectively. (a) Testing performance for the ARIMAX (0,1,1)(0,0,2)12 model with the imported cases at lag 0. (b) Testing performance for the ARIMAX (0,1,1)(0,0,2)12 model with both the imported cases and minimum temperature at lag 0.


In this study, we developed and evaluated time series models to characterise the risk factors of local dengue transmission in Guangzhou to inform control measures. We found that the number of imported cases and the minimum temperature were the important variables associated with local dengue transmission in Guangzhou City. And the multivariate ARIMAX model that included both the imported cases and the minimum temperature at lag 0 had a better fit than the inclusion of one variable alone. This modeling technique can be a useful tool for planning local control interventions and could be implemented during routine dengue surveillance in Guangzhou.

A higher number of overseas travellers boost the risk of global dengue transmission [Reference Polwiang37]. Previous studies of phylogenetic analysis and travel history surveys suggest that imported cases could affect dengue transmission [Reference Jing9, Reference Wang38]. However, few studies have quantitatively modeled imported cases with dengue occurrence. In the unprecedented 2014 dengue outbreak in Guangzhou, the number and timing of the imported dengue cases strongly determined the local occurrence and outbreak size [Reference Cheng10]. Virus dissemination due to population movement contributes to an increase in the outbreak potential in non-endemic areas [Reference Gharbi33]. Increased tourism and commerce can introduce the dengue virus into non-endemic areas or new districts. Currently, dengue is endemic in Southeast Asian countries such as Thailand, Malaysia, Philippines, Cambodia, Laos, Vietnam, Indonesia and India. The high frequency of international travel and large shipping industry in Guangzhou increases the risk of importation of dengue cases which can lead to indigenous epidemics.

Temperature also influences the dengue transmission dynamics, mainly through indirect effects. Increasing temperatures can increase the biting frequency of Aedes and the survival rates during emergence [Reference Focks39] and reduce the extrinsic incubation period of dengue in the mosquito [Reference Watts40]. However, there is little evidence that the highest temperature drives patterns of dengue epidemics in Southeast Asia [Reference van Panhuis21]. Interestingly, the minimum temperature played a significant role in dengue transmission in this study. Several studies have highlighted the association between the minimum temperature and dengue transmission. In Taiwan, the minimum temperature correlated with dengue cases at a lag of 1–3 months [Reference Wu41]. In Guadeloupe, minimum temperature with a 5 week time lag was the best climatic variable for the prediction of dengue outbreaks [Reference Gharbi33]. Thus, the evidence suggests that more severe dengue transmission occurs at higher monthly minimum temperature.

Prewhitening was conducted prior to calculating the cross-correlation between the dengue cases and risk factors. This process can measure the association between temporally auto-correlated data, whereas the Pearson or Spearman correlation requires independent and identically distributed observations [Reference Luz15]. Finally, we found that dengue cases significantly correlated with both the number of imported cases and the minimum temperature at lag 0. However, no positive correlation was found with the BI, SSI, ADI and precipitation at any lag. The mean, maximum and minimum temperatures all showed a strong correlation with the number of dengue cases at lag 0. The models incorporating these variables as external regressors indicated improved model performance [Reference Wu41]. At present, it is generally known that the increasing risk of dengue transmission is dependent on vector density, however, its correlation with dengue is generally poor, as shown in our study [Reference Luz15]. Excepting for vector density, this phenomenon may in part be due to the fact, vector capacity is also related with biting rate, the probability of vector survival, extrinsic incubation period and vector competence [Reference Lambrechts42]. This result may also be associated with the method itself due to the subjectivity of the vector density survey and potential bias in data collection. Especially for adult mosquito density, which theoretically shows a higher correlation with dengue risk than larval indices because only the adult life stage directly contributes to dengue transmission dynamics was also in poor association with dengue incidence.

Our study provides deep insights into the dengue risk in Guangzhou through the characterization of the temporal autocorrelation structure. The results of this study are consistent with previous research showing that the number of imported cases is highly correlated with local transmission in Guangzhou [Reference Jing9, Reference Cheng10]. The methodology used in the study is an effective complement for understanding dengue risk factors in the region of Guangzhou and can be applied in high-risk areas of south China. As the imported cases are highly associated with local transmission, port inspection and quarantine may be able to play a critical role in dengue control in the region. The quarantine mainly targets fever cases (⩾38 °C) entering Guangzhou by plane, with the serum collected and polymerase chain reaction laboratory test followed. Approximately 30% of the total imported cases were identified by port quarantine in Guangzhou. When imported cases escaped detection by port monitoring systems, the rapid rate of case detection in local health institutions is also important in control practices, which can be improved by training to increase knowledge, attitudes and practices of both health workers and the public [Reference Jing43]. When the minimum temperature arises, the vector control efforts should be strengthened, especially in the area with a high likelihood of imported cases. Due to Guangzhou as the imported dengue area, the strategy under this study is an economic and effective way to prevent dengue local outbreak.

Admittedly, there were limitations in our study. The ARIMAX model can only identify a correlation between local dengue incidence and the risk variables but cannot identify causal relationships [Reference Soebiyanto, Adimi and Kiang27]. Furthermore, our study did not take into account the effects of human movement [Reference Stoddard14], water supply [Reference Schmidt13], land utilisation [Reference Buczak44] and socio-economic factors [Reference Banu45]. These factors also play certain roles in determining dengue transmission in the study regions. Due to the experiences and lessons from 2014, Guangzhou authorities took strict intervention measures, especially in hospital isolation and early diagnosis using the NS1 antigen test [Reference Jing46], to successfully reduce the number of local cases during the 2015 transmission season. Finally, due to the lag 0 for both factors including in the models, it is reasonable for fitting the model conforming to the actual public health practices. Using time interval with monthly data, when an imported case is introduced, it will cost longest 20 days roundly to disperse, including an incubation period about 3–15 day and longest infectious period 5 days after onset which is less than 1 month. Therefore, the introduced imported case in a current 1 month is more important. Furthermore, it is a bit delayed to use the onset date for monthly aggregation analysis instead of the infection date or biting date. However, it would be more objective if the accurate date could be confirmed.

The frequency and scale of dengue transmission in China have increased in recent years [Reference Lai8]. A better understanding of the relationship between dengue occurrence and various risk factors can be used to prevent further large outbreaks. Our ARIMAX modelling provides a practical approach for monitoring dengue in Guangzhou City. The model and its outcome can give objective guidance for local health authorities in order to better understand future large epidemics.

Supplementary material

The supplementary material for this article can be found at


This work was supported by National Natural Science Foundation of China (No. 81273139), Guangdong Natural Science Foundation (No. S2013010013637), the Project for Key Medicine Discipline Construction of Guangzhou Municipality (No. 2017-2019-04), the Medical Scientific Research Foundation of Guangdong Province (No. A2017481), the Science and Technology Plan Project of Guangzhou (No. 201804010121), the Collaborative innovation project of Bureau of Science and Technology of Guangzhou Municipality (No. 201508020263, No. 201704020226). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Conflict of Interest

The authors have declared that no competing interests exist.


1.Bhatt, S et al. (2013) The global distribution and burden of dengue. Nature 496, 504507.
2.Diamond, MS and Pierson, TC (2015) Molecular insight into dengue virus pathogenesis and its implications for disease control. Cell 162, 488492.
3.Shepard, DS, Undurraga, EA and Halasa, YA (2013) Economic and disease burden of dengue in Southeast Asia. PLoS Neglected Tropical Diseases 7, e2055.
4.Kularatne, SA (2015) Dengue fever. BMJ 351, h4661.
5.Dejnirattisai, W et al. (2010) Cross-reacting antibodies enhance dengue virus infection in humans. Science 328, 745748.
6.Wilder-Smith, A and Gubler, DJ (2015) Dengue vaccines at a crossroad. Science 350, 626627.
7.Qiu, FX et al. (1993) Dengue in China: a clinical review. Bulletin of the World Health Organization 71, 349359.
8.Lai, S et al. (2015) The changing epidemiology of dengue in China, 1990–2014: a descriptive analysis of 25 years of nationwide surveillance data. BMC Medicine 13, 100.
9.Jing, QL et al. (2012) Emergence of dengue virus 4 genotype II in Guangzhou, China, 2010: survey and molecular epidemiology of one community outbreak. BMC Infectious Diseases 12, 87.
10.Cheng, Q et al. (2016) Climate and the timing of imported cases as determinants of the dengue outbreak in Guangzhou, 2014: evidence from a mathematical model. PLoS Neglected Tropical Diseases 10, e0004417.
11.Sang, S et al. (2015) Predicting unprecedented dengue outbreak using imported cases and climatic factors in Guangzhou, 2014. PLoS Neglected Tropical Diseases 9, e0003808.
12.Sun, J et al. (2016) The epidemiological characteristics and genetic diversity of dengue virus during the third largest historical outbreak of dengue in Guangdong, China, in 2014. Journal of Infection 72, 8090.
13.Schmidt, WP et al. (2011) Population density, water supply, and the risk of dengue fever in Vietnam: cohort study and spatial analysis. PLoS Medicine 8, e1001082.
14.Stoddard, ST et al. (2013) House-to-house human movement drives dengue virus transmission. Proceedings of the National Academy of Sciences of the United States of America 110, 994999.
15.Luz, PM et al. (2008) Time series analysis of dengue incidence in Rio de Janeiro, Brazil. American Journal of Tropical Medicine and Hygiene 79, 933939.
16.Morin, CW, Comrie, AC and Ernst, K (2013) Climate and dengue transmission: evidence and implications. Environmental Health Perspectives 121, 12641272.
17.Quam, MB et al. (2016) Dissecting Japan's dengue outbreak in 2014. American Journal of Tropical Medicine and Hygiene 94, 409412.
18.Ernst, T et al. (2015) Emergence of a new lineage of dengue virus type 2 identified in travelers entering Western Australia from Indonesia, 2010–2012. PLoS Neglected Tropical Diseases 9, e0003442.
19.Eamchan, P et al. (1989) Epidemiology and control of dengue virus infections in Thai villages in 1987. American Journal of Tropical Medicine and Hygiene 41, 95101.
20.Goh, KT et al. (1987) Epidemiological aspects of an outbreak of dengue fever/dengue haemorrhagic fever in Singapore. Southeast Asian Journal of Tropical Medicine and Public Health 18, 295302.
21.van Panhuis, WG et al. (2015) Region-wide synchrony and traveling waves of dengue across eight countries in Southeast Asia. Proceedings of the National Academy of Sciences of the United States of America 112, 1306913074.
22.Lu, L et al. (2009) Time series analysis of dengue fever and weather in Guangzhou, China. BMC Public Health 9, 395.
23.Cai, S et al. (2014) Density fluctuation and population composition of mosquitoes in Guangdong Province. Chinese Journal of Hygienic Insecticides and Equipments 20, 357359.
24.Luo, L et al. (2012) Epidemiological, virological, and entomological characteristics of dengue from 1978 to 2009 in Guangzhou, China. Journal of Vector Ecology 37, 230240.
25.Helfenstein, U (1991) The use of transfer function models, intervention analysis and related time series methods in epidemiology. International Journal of Epidemiology 20, 808815.
26.Box, GEP and Jenkins, GM (1976) Time Series Analysis, Forecasting and Control. San Francisco: Holden-Day.
27.Soebiyanto, RP, Adimi, F and Kiang, RK (2010) Modeling and predicting seasonal influenza transmission in warm regions using climatological parameters. PLoS ONE 5, e9450.
28.Chadsuthi, S et al. (2012) Modeling seasonal leptospirosis transmission and its association with rainfall and temperature in Thailand using time-series and ARIMAX analyses. Asian Pacific Journal of Tropical Medicine 5, 539546.
29.Permanasari, AE, Rambli, DR and Dominic, PD (2011) Performance of univariate forecasting on seasonal diseases: the case of tuberculosis. Advances in Experimental Medicine and Biology 696, 171179.
30.Wangdi, K et al. (2010) Development of temporal modelling for forecasting and prediction of malaria infections using time-series and ARIMAX analyses: a case study in endemic districts of Bhutan. Malaria Journal 9, 251.
31.Lee, HS et al. (2013) Time series analysis of human and bovine brucellosis in South Korea from 2005 to 2010. Preventive Veterinary Medicine 110, 190197.
32.Cryer, JD and Chan, KS (2010) Time Series Analysis: With Applications in R. Springer, New York.
33.Gharbi, M et al. (2011) Time series analysis of dengue incidence in Guadeloupe, French West Indies: forecasting models using climate variables as predictors. BMC Infectious Diseases 11, 166.
34.Chadsuthi, S et al. (2015) Modeling seasonal influenza transmission and its association with climate factors in Thailand using time-series and ARIMAX analyses. Computational & Mathematical Methods in Medicine 2015, 18.
35.Arlot, S and Celisse, A (2009) A survey of cross-validation procedures for model selection. Statistics Surveys 4, 4079.
36.Zhang, Y et al. (2016) Developing a time series predictive model for dengue in Zhongshan, China based on weather and Guangzhou dengue surveillance data. PLoS Neglected Tropical Diseases 10, e0004473.
37.Polwiang, S (2015) The estimation of imported dengue virus from Thailand. Journal of Travel Medicine 22, 194199.
38.Wang, B et al. (2015) Phylogenetic analysis of dengue virus reveals the high relatedness between imported and local strains during the 2013 dengue outbreak in Yunnan, China: a retrospective analysis. BMC Infectious Diseases 15, 142.
39.Focks, DA et al. (2000) Transmission thresholds for dengue in terms of Aedes aegypti pupae per person with discussion of their utility in source reduction efforts. American Journal of Tropical Medicine and Hygiene 62, 1118.
40.Watts, DM et al. (1987) Effect of temperature on the vector efficiency of Aedes aegypti for dengue 2 virus. American Journal of Tropical Medicine and Hygiene 36, 143152.
41.Wu, P et al. (2007) Weather as an effective predictor for occurrence of dengue fever in Taiwan. Acta Tropica 103, 5057.
42.Lambrechts, L et al. (2011) Impact of daily temperature fluctuations on dengue virus transmission by Aedes aegypti. Proceedings of the National Academy of Sciences of the United States of America 108, 74607465.
43.Jing, Q et al. (2013) Epidemiology of imported dengue fever in Guangzhou from 2001 to 2010. Journal of Tropical Medicine 13, 15091512.
44.Buczak, AL et al. (2012) A data-driven epidemiological prediction method for dengue outbreaks using local and remote sensing data. BMC Medical Informatics and Decision Making 12, 124.
45.Banu, S et al. (2011) Dengue transmission in the Asia-Pacific region: impact of climate change and socio-environmental factors. Tropical Medicine and International Health 16, 598607.
46.Jing, Q et al. (2016) The effect of hospital isolation in mid-stage dengue epidemic control. Chinese Journal of Disease Control & Prevention 20, 755758.