Continuous calibration of a digital twin: Comparison of particle filter and Bayesian calibration approaches

Rebecca Ward; Ruchi Choudhary; Alastair Gregory; Melanie Jans-Singh; Mark Girolami

doi:10.1017/dce.2021.12

Continuous calibration of a digital twin: Comparison of particle filter and Bayesian calibration approaches

Published online by Cambridge University Press: 30 September 2021

and

Rebecca Ward*: Affiliation:
Data-Centric Engineering, The Alan Turing Institute, The British Library, 96 Euston Road, NW1 2DB London, United Kingdom Engineering Department, University of Cambridge, Trumpington Street, CB2 1PZ Cambridge, United Kingdom
Ruchi Choudhary: Affiliation:
Data-Centric Engineering, The Alan Turing Institute, The British Library, 96 Euston Road, NW1 2DB London, United Kingdom Engineering Department, University of Cambridge, Trumpington Street, CB2 1PZ Cambridge, United Kingdom
Alastair Gregory: Affiliation:
Data-Centric Engineering, The Alan Turing Institute, The British Library, 96 Euston Road, NW1 2DB London, United Kingdom
Melanie Jans-Singh: Affiliation:
Engineering Department, University of Cambridge, Trumpington Street, CB2 1PZ Cambridge, United Kingdom
Mark Girolami: Affiliation:
Data-Centric Engineering, The Alan Turing Institute, The British Library, 96 Euston Road, NW1 2DB London, United Kingdom Engineering Department, University of Cambridge, Trumpington Street, CB2 1PZ Cambridge, United Kingdom
*: *Corresponding author. E-mail: rmw61@cam.ac.uk

Article contents

Abstract
Impact Statement
Introduction
Calibration Approach
Digital Twin of the Underground Farm
Calibration Results
Discussion
Conclusions
Data Availability Statement
Author Contributions
Funding Statement
Competing Interests
References

Abstract

Assimilation of continuously streamed monitored data is an essential component of a digital twin; the assimilated data are used to ensure the digital twin represents the monitored system as accurately as possible. One way this is achieved is by calibration of simulation models, whether data-derived or physics-based, or a combination of both. Traditional manual calibration is not possible in this context; hence, new methods are required for continuous calibration. In this paper, a particle filter methodology for continuous calibration of the physics-based model element of a digital twin is presented and applied to an example of an underground farm. The methodology is applied to a synthetic problem with known calibration parameter values prior to being used in conjunction with monitored data. The proposed methodology is compared against static and sequential Bayesian calibration approaches and compares favourably in terms of determination of the distribution of parameter values and analysis run times, both essential requirements. The methodology is shown to be potentially useful as a means to ensure continuing model fidelity.

Keywords

Bayesian calibration digital twin particle filter

Type: Research Article
Information: Data-Centric Engineering , Volume 2 , 2021 , e15

DOI: https://doi.org/10.1017/dce.2021.12 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited
Copyright: © The Alan Turing Institute and the Author(s), 2021. Published by Cambridge University Press

Impact Statement

This study addresses the problem of ensuring that a simulation model is able to continuously assimilate data and identify variation in the underlying model parameters. As the availability of monitored data rapidly increases across diverse industries, the requirement for tools that are able to make timely use of the data is also on the increase. While the technology for developing the simulation models already exists, in many fields, there is a dearth of tools that can facilitate rapid interpretation of streamed data to ensure continuing model fidelity. The methodology presented here potentially offers one way to fill this gap by proposing a sequential calibration process using a particle filter approach. The proposed approach is illustrated by application to a simulation model of an underground farm.

1. Introduction

The technological advancement and drop in price of monitoring equipment has led to a boom in availability of monitored data across industrial fields as diverse as aviation, manufacturing, and the built environment. This facilitates the development of digital twin technology. The specification of what constitutes a digital twin is still evolving, but the essence comprises a monitored system together with a computational model of the system which demonstrably replicates the system behavior in the context of interest (Worden et al., Reference Worden, Cross, Gardner, Barthorpe, Wagg and Barthorpe2020). The computational model may be either data-derived or physics-based; for engineering systems, it is more common for a physics-based model to be used, as this offers the advantage of representing the salient physical laws that govern the system. The intention is that the computational model can give information that may not be easily accessible from the system directly, and can be used to explore performance when it is impractical to run physical tests. The greatest potential for digital twinning perhaps lies in systems that are continuously operational generating live streamed data that inform the model, in which case the computational model can be simulated in (close to) real time to advise changes to operational parameters for improved efficiency (Madni et al., Reference Madni, Madni and Lucero2019).

Recent advancements in machine learning techniques are leading to a new generation of computational models that combine the physics and the data. On the one hand, physics-enhanced machine learning can improve a data-derived model by constraining it with the governing scientific laws (Choudhary et al., Reference Choudhary, Lindner, Holliday, Miller, Sinha and Ditto2020). A more common practice is to start with a physics-based model and incorporate the data to calibrate the model and ensure it matches reality as closely as possible over the range of the available data. This then means that when exercising the model beyond the realm of the data, for example, when exploring scenarios that cannot practically be tested with the real system, we can have greater confidence in the model predictions. Calibration can be performed manually, but it is time-consuming and is not always practical. Indeed, in the situation where model parameters are not static but dynamic, it may be impossible owing to the dimensions of the parameter space and the conflation between different reasons for the differences between model outputs and the data. In the case of a digital twin, an automated calibration process that forms an integral part of the system model is favourable, and thus a computationally expensive and complex calibration process is not well suited.

Bayesian calibration (BC) offers a formal way to combine measured data with model predictions to improve the model. In the field of building energy simulation considered here, the Bayesian framework proposed by Kennedy and O’Hagan (Reference Kennedy and O’Hagan2001) (KOH) has been explored in some depth (Chong et al., Reference Chong, Lam, Pozzi and Yang2017; Heo et al., Reference Heo, Choudhary and Augenbroe2012; Higdon et al., Reference Higdon, Kennedy, Cavendish, Cafeo and Ryne2004; Li et al., Reference Li, Augenbroe and Brown2016), specifically for the identification of static model parameters and the quantification of uncertainty. BC has been shown to be the optimal approach, as it incorporates prior knowledge into the calibration process which can significantly improve parameter identifiability. In addition, the KOH methodology enables not only the inference of uncertain (and important) model parameters but also of the model deficiency (model bias) and errors in observations. As such, the KOH methodology is considered the “gold standard” for calibration of computer models. However, the KOH methodology can be computationally expensive, increasingly so as the numbers of calibration parameters and data points increase, owing to the typical Markov chain Monte Carlo (MCMC) implementation which generates a random walk through the target distribution and is inherently inefficient (Chong et al., Reference Chong, Lam, Pozzi and Yang2017). This has implications for use in continuous calibration over short timescales—the run time of the calibration must be shorter than the time interval between acquisition of new data points, and yet there must be sufficient data points to characterize the parameter space. Nonetheless, there has been some exploration of its extension to dynamically varying parameters. For example, Chong et al. (Reference Chong, Xu, Chao and Ngo2019) used the KOH formulation in conjunction with data from building energy management systems to continuously calibrate a building energy model, updating the model every month, demonstrating that prediction of future performance is improved with continuous calibration. The study required selection of a reduced sample size and a more efficient MCMC algorithm to overcome the computational challenge presented by the large dataset.

An alternative approach that potentially offers a time-efficient solution to the problem of continuous calibration is particle filtering (PF). This is a sequential Bayesian technique in which a large sample of possible parameter estimates are filtered according to their likelihood given some data. The use of PFs for static parameter estimation is described in some detail by Andrieu et al. (Reference Andrieu, Doucet, Singh and Tadić2004), and the approach has been used for sequential data assimilation in other fields for which the memory effect has a significant impact on parameter values, for example, hydrology (Moradkhani and Hsu, Reference Moradkhani and Hsu2005), or for estimation of dynamic state and parameter values in nonlinear state-space models, for example, Cheng and Tourneret (Reference Cheng and Tourneret2017). The novelty of the study presented here is its potential impact on digital twin technology. The parameters are not memory-dependent but are time-varying, and we require up-to-date estimates of the parameter values to ensure that forecasting using the physics-based model is as accurate as possible. This is particularly important if the physics-based model can only be a relatively simple representation of the real system—which is often the case. To this end, we explore whether the PF approach can offer a suitable calibration mechanism. We use monitored data in conjunction with the PF to calibrate uncertain model parameters—we then use the estimates of the model parameters in conjunction with the model to predict model performance with a quantification of the uncertainty. The PF is also compared against a sequential BC model using the KOH formulation. While the KOH approach is typically used to get the best estimates of static parameter values, in this study, we also explore whether it is feasible to use it with sequentially changing datasets. We demonstrate that while both methods are suitable, the PF method is quicker without losing accuracy.

The digital twin considered here is a farm constructed in a previously disused underground tunnel, in which salad crops are grown hydroponically. A physics-based simulation model of the farm has been developed that calculates temperature and relative humidity (RH) as a function of external weather conditions and farm operational strategies. This is a relatively simple model based on a single thermal zone that simulates heat and mass transfer between the different system components. An extensive program of monitoring has also been carried out, so data are available for calibration of the model. The aim of the digital twin (or the physics-based simulation model) is that at any time, it can be used in conjunction with forecast weather data and operational scenario of the farm to predict environmental conditions within the tunnel with a view to alerting the farm operators to future potentially unsatisfactory conditions. Calibration is a critical component of the digital twin system in this instance, as it is the only mechanism by which the model can be adapted to represent the farm environment to an adequate degree of realism.

The paper is laid out as follows; in the following section, the calibration approaches are compared in more detail. We then describe the physics-based model of the underground farm. The feasibility of the PF and sequential KOH approaches are first tested with synthetic data and known parameters, and thereafter implemented with monitored data from the farm. In the discussion, we explore the extent to which the parameter values identified (a) are the values that give the best fit of the model to the data and (b) are indicative of “real” values, and compare the approaches used in terms of their applicability in the context of a digital twin.

2. Calibration Approach

In this study, we consider a static BC approach as formulated by Kennedy and O’Hagan (Reference Kennedy and O’Hagan2001) to be the basis against which we compare the viability of the proposed PF approach. We thus also use the static KOH approach in a sequential manner to explore the comparison as detailed below. Both approaches make use of Bayes’ formula, that is, for a parameter $ Y $ and an observed data point $ y(x) $ ,

(1)

$$ P\left(Y|y(x)\right)=\frac{P\left(y(x)|Y\right).P(Y)}{P\left(y(x)\right)}, $$

or the posterior probability of the parameter value $ Y $ , given the observed data point $ y(x) $ is equal to the likelihood of the observed data point, $ P\left(y(x)|Y\right) $ , multiplied by the probability of $ Y $ before making the observation—the prior probability, $ P(Y) $ —all normalized by a normalization factor $ P\left(y(x)\right) $ .

2.1. The KOH calibration framework

In the BC formulation proposed by Kennedy and O’Hagan (Reference Kennedy and O’Hagan2001), a numerical simulator $ \eta \left(x,{\theta}^{\ast}\right) $ can be related to field observations $ y(x) $ by the following equation:

(2)

$$ y(x)=\eta \left(x,{\theta}^{\ast}\right)+\delta (x)+\varepsilon +{\varepsilon}_n, $$

where $ x $ are observable inputs into the model or scenarios, for example, location of sensor or time of sensing, and $ {\theta}^{\ast } $ represents the true but unknown values of the parameters $ \theta $ which characterize the model. This formulation inherently expects a discrepancy, or bias, between the model and reality, accounted for by $ \delta (x) $ . $ \varepsilon $ represents the observation error and $ {\varepsilon}_n $ the numerical error. Calibration of the model aims to identify the parameters $ {\theta}^{\ast } $ . Whereas traditional calibration approaches require multiple runs of the computer simulation with systematic variation of the input parameters and subsequent identification of the combination of parameters that gives the closest match to reality, the KOH framework offers a more efficient way to identify the uncertainty associated with the calibration parameters. Rather than using an exhaustive iterative approach, it is common practice to use an emulator to map the model inputs to the outputs. Gaussian process (GP) models are commonly used as the basis for the emulator, with separate models used for the simulator $ \eta \left(x,{\theta}^{\ast}\right) $ and the discrepancy term $ \delta (x) $ in the above equation. These GP models have their own hyperparameters, specifically precision hyperparameters $ {\lambda}_{\eta } $ and $ {\lambda}_b $ that relate to the magnitude of the emulator and model discrepancy, respectively, and $ {\beta}_{\eta } $ and $ {\beta}_b $ that determine the correlation strength in each along the dimensions of $ x $ and $ \theta $ and determine the smoothness of the emulator and discrepancy function. The random error terms for measurement and numerical error are included as unstructured error terms and are incorporated into the GP covariance matrix with precision hyperparameters $ {\lambda}_e $ and $ {\lambda}_{en} $ . All of these parameters are uncertain and require a prior probability distribution to be specified in the KOH approach. We have used prior distributions suggested in previous papers for the hyperparameters as detailed in Table 1 (Menberg et al., Reference Menberg, Heo and Choudhary2019).

Table 1. Kennedy and O’Hagan (Reference Kennedy and O’Hagan2001) approach hyperparameter prior distributions.

The practical application of the KOH approach to calibration of building energy simulation models is described in detail by Chong and Menberg (Reference Chong and Menberg2018), and the implementation of the framework is as described by Guillas et al. (Reference Guillas, Rougier, Maute, Richmond and Linkletter2009). The procedure is to compare measured data (observations) against outputs from computer simulations generated using plausible ranges of uncertain input parameters under known scenarios. By exploring the likelihood of the data given the simulation output, the posterior distribution of each calibration parameter is derived. The process, in sum, requires the following steps:

• sensitivity analysis to determine the model parameters that have the greatest impact on the simulation output,
• perform calibration runs—run simulations over the plausible range of the parameters identified in the sensitivity analysis, varying one parameter at a time,
• acquire field data—monitored observations corresponding to simulation output,
• fit GP emulator to the field data and simulation output by calculating the mean and covariance function as described by Higdon et al. (Reference Higdon, Kennedy, Cavendish, Cafeo and Ryne2004), and
• explore the parameter space using MCMC and extract the posterior distributions of the calibration parameters.

In this process, all the monitored data and simulation model outputs are included in the model at once, in the GP emulator; hence, the run time of calibration is very dependent on the number of data points, increasing in proportion to the square of the size of the dataset. This means that it is impractical to perform BC with a large number of observations.

The KOH framework also has at its heart an assumption of stationarity, that is, the calibration parameters, $ \theta $ , are assumed to be constant over the range of the data. We are interested in parameters that are not necessarily constant over the timescales considered. However, they may be essentially constant over shorter time periods, so it is possible to use the KOH framework to estimate the parameters over these shorter time periods. Given that assumption, it is feasible to implement the KOH calibration over successive time periods: at each timestep, the oldest data point is removed and a new data point added. In that instance, the posterior distribution for the previous timestep may be used as the prior distribution for the new dataset. In essence, this is a similar approach to the PF with the exception that data points are not considered singly but in groups over which the calibration parameter values are assumed to be constant. The potential benefit of this is that it facilitates exploration of the error terms in detail.

2.2. Particle filtering

A PF is a sequential Bayesian inference technique in which a large sample of possible parameter estimates are filtered according to their likelihood given some data. As an example, consider a virus spreading through a community of people. With no additional knowledge, we can only guess which members of the population have the virus. If then we get some information—say measured temperatures for each member, where an increased temperature is one of the symptoms of the virus—we can update the likelihood of each of our guesses according to their measured temperature. We then resample from our population taking the increased likelihood into account, generating a new sample where each member again has an equal probability of having the virus, and predict what the new temperatures will be based on our updated estimates of the location of the virus hosts. When we get new temperature information, we update the likelihood again and repeat the resampling and prediction.

In the context of this study, the particles consist of possible values of the uncertain model inputs/parameters, and the information is the monitored data. Based on the data, we update the likelihood of the particles, that is, we look to see how close the monitored data are to the outputs of the model with the parameter values assigned to each particle. We then resample the particles taking the likelihood into account and generate a new set of particles with equal weight. We repeat the process by updating the likelihoods of the new particle set using the next value of monitored data and the values predicted by the model using the parameter values from the new particles—and we then resample. This repeating process results in a set of values for the model parameters that continuously update in line with the monitored data. The resampling is an essential part of the filter to avoid degeneracy, that is, to avoid the situation where a few particles dominate the posterior distribution, as discussed in detail by Doucet and Johansen (Reference Doucet, Johansen, Crisan and Rovosky2008).

Mathematically, the framework for the PF approach is as follows: consider the model $ \eta \left(\theta \right) $ , where $ \theta $ are parameters of the model. We conjecture that observed data are given by $ y\sim \rho \eta +\delta +\varepsilon $ , where $ \rho $ is a scaling parameter, $ \delta $ is a mean-zero GP representing the mismatch between the model and the data, and $ \varepsilon $ is the measurement error with variance $ {\sigma}^2 $ . We assume that the smoothness of the emulator and model mismatch are determined by a length scale, $ l $ .

1. Start by sampling $ N $ different particles of the hyperparameters $ l $ , $ \rho $ , and model input parameters $ \theta $ from the prior distributions $ p(l) $ , $ p\left(\rho \right) $ , and $ p\left(\theta \right) $ . Denote these particles $ {\left\{{l}_j\right\}}_{j=1:N} $ , $ {\left\{{\rho}_j\right\}}_{j=1:N} $ , and $ {\left\{{\theta}_j\right\}}_{j=1:N} $ , respectively.
2. Obtain the model outputs $ {d}_i $ taken at the coordinates $ {X}_i^d $ and with the model input parameters $ {t}_i $ . Moreover, consider the monitored data $ {Y}_i $ taken at the coordinates $ {X}_i^Y $ .
3. Define the covariance matrices (for each of the particles) $ {K}_j=\left[\begin{array}{cc}{k}_{\eta}\left({X}_i^d,{X}_i^d,{t}_i,{t}_i|{l}_j\right)& {\rho}_j{k}_{\eta}\left({X}_i^d,{X}_i^Y,{t}_i,{\theta}_j|{l}_j\right)\\ {}{\rho}_j{k}_{\eta}\left({X}_i^Y,{X}_i^d,{\theta}_j,{t}_i|{l}_j\right)& {\rho}_j^2{k}_{\eta}\left({X}_i^Y,{X}_i^Y,{\theta}_j,{\theta}_j|{l}_j\right)+{k}_Y\left({X}_i^Y,{X}_i^Y,{\theta}_j,{\theta}_j|{l}_j\right)+{\sigma}^2\mathbf{I}\end{array}\right] $

using the covariance functions $ {k}_{\eta}\left(x,{x}^{\prime },t,{t}^{\prime }|l\right) $ associated with the computer model emulator, $ \eta $ and $ {k}_Y\left(x,{x}^{\prime },t,{t}^{\prime }|l\right) $ associated with the model-data mismatch $ \delta $ . Note that these are conditioned on the hyperparameter $ l $ .

4. For each of the particles, compute the mean-zero GP marginal likelihoods $ {\xi}_j=p\left({\left[{d}_i,{Y}_i\right]}^T|0,{K}_j\right) $ . These will be used to update the posterior distribution of $ l $ , $ \rho $ , and $ \theta $ in the next step.
5. Compute the normalized weights $ {w}_j={\xi}_j/\left({\Sigma}_{j=1}^N{\xi}_j\right) $ . Resample using these weights, such that the new sample of particles all have even weight.
6. Increase $ i $ by one, and repeat Steps 2–5 with this new sample of particles.

Throughout this process, the set of particles associated with the model input parameters provide an empirical approximation to their posterior distribution. The implementation of this approach has been written in Python and has used the GP tools included in the Scikit-learn package (Pedregosa et al., Reference Pedregosa, Varoquaux, Gramfort, Michel, Thirion, Grisel, Blondel, Prettenhofer, Weiss, Dubourg, Vanderplas, Passos, Cournapeau, Brucher, Perrot and Duchesnay2011). In this implementation, we assume a value for the scaling parameter, $ \rho =1 $ , to match the context in Equation (2). For the benefit of brevity in explaining the approach, we also set the variance of the measurement error, denoted by $ {\sigma}^2 $ , to the mean of the prior distribution in Table 1, although this is not essential.

3. Digital Twin of the Underground Farm

Growing Underground is an unheated hydroponic farm developed in disused tunnels in London, United Kingdom. The farm grows salad crops for sale to restaurants and supermarkets across London using artificial lighting while relying on the relatively stable thermal ground conditions. The tunnels were built in the 1940s to act as air raid shelters but with the intention of ultimately being incorporated into the London Underground rail network. This never happened, and the tunnels have been disused since acting as temporary housing for migrants in the 1960s. They are 33 m below ground, a location where the deep soil temperature is approximately constant, and enjoy some heat transferred from the nearby rail tunnels. The heat source, however, is predominantly the LED growing lights which are typically switched on overnight and switched off during the daytime so as to compensate for external daily temperatures tending to be colder at night and warmer in the day. The farm is ventilated to the outside air using the original ventilation system designed in the 1940s. There are two ventilation shafts which operate in tandem, one at Carpenter’s Place (CP) at the main tunnel entrance and one at Clapham Common (CC) at the far end of the tunnel. These are manually operated and both affect the ventilation rate experienced in the farm.

It is important to maintain environmental conditions at optimum levels for plant growth, and to this end, the space is monitored for temperature, RH, and CO₂ levels (Jans-Singh et al., Reference Jans-Singh, Fidler, Ward, Choudhary, MJ, Schooling and GMB2019). The purpose of the monitoring is both to facilitate the maintenance of adequate conditions for optimal crop growth and to help optimize the energy use. The tunnel environment is complex to understand, primarily because of the age and lack of information relating to the design of the tunnels and change in system properties over time. Traditional control systems developed for the horticultural industry cannot be used here, as there is limited scope for automatic control of the tunnel environment. The aim of the digital twin is to combine the monitored data with a physics-based model which simulates the heat and mass exchanges within the tunnel and with the external environment. By combining both monitoring and simulation, we aim to develop a system which can be used to analyze performance, to predict future environmental conditions in conjunction with forecast weather data, and to help optimize the designs for expansion of the farm.

3.1. The simulation model

A simple physics-based numerical model has been developed representing a 1D slice through the central section of the farm. The model calculates temperature and relative humidity of the tunnel air as a function of time by solving heat and mass exchange equations pertinent to the tunnel geometry, subject to the temperature, moisture content, and CO₂ concentration of the incoming ventilation air and the deep soil temperature. The primary source of heat input is the LED lights which are typically operated overnight and switched off during daytime. For example, the time-dependent temperature variation can be calculated through a 1D slice of the tunnel using the heat balance equation:

(3)

$$ \frac{dT_j}{dt}=\frac{A_g}{m_j{c}_j}\sum \limits_i{q}_{i,j}, $$

which gives the change in temperature, $ T $ $ \left(\mathrm{K}/\mathrm{s}\right) $ , in each layer $ j $ — such as the growing medium, vegetation, air, and tunnel lining — as a function of the heat flows $ q $ $ \left(\mathrm{W}/{\mathrm{m}}^2\right) $ between the different layers. Here, $ i $ represents the different heat transfer processes, such as radiation, conduction and convection, $ {A}_g $ is the surface area ( $ {\mathrm{m}}^2 $ ), $ m $ is the mass of each layer ( $ \mathrm{kg} $ ), and $ c $ is the heat capacity ( $ \mathrm{J}/\mathrm{kgK} $ ).

In a similar manner, the moisture content of the air can be calculated from the mass balance equation:

(4)

$$ \frac{dC_a}{dt}=\frac{A_g}{h_{fg}V}\sum \limits_k{q}_k, $$

where $ {C}_a $ is the change in moisture content of the air ( $ \mathrm{kg}\;{\mathrm{m}}^{-3}\;{\mathrm{s}}^{-1} $ ), $ {h}_{fg} $ is the latent heat of condensation of water ( $ \mathrm{J}/\mathrm{kg} $ ), $ V $ is the volume of the unit ( $ {\mathrm{m}}^3 $ ), and $ {q}_k $ are the heat flows ( $ \mathrm{W}/{\mathrm{m}}^2 $ ) from the $ k $ different latent heat transfer processes. The moisture content and temperature are inextricably linked, as changes to the moisture content of the air are associated with latent heat transfer. Hence, the equations require solving in parallel to determine the temperatures and relative humidities within the tunnel.

Optimal conditions for crop growth in the tunnel require maintenance of stable temperatures and relative humidity. The two routes for heat exchange from the tunnel are conduction into the surrounding ground and ventilation to the outside air. Both routes are dependent on the temperature differential. The relative stability of the deep soil temperature, here assumed to be a constant $ {14}^{\mathrm{o}}\mathrm{C} $ , means that heat is typically lost to the surrounding ground, but this happens slowly owing to the ground thermal capacity and over time approaches an approximately steady-state temperature gradient. By comparison, heat exchange with the outside happens rapidly through bulk air transfer at rates of the order of $ 1-10 $ air changes per hour (ACH). Within the model, heat transfer due to ventilation is simulated according to the equation:

(5)

$$ {QV}_{ie}=N/3600\hskip0.22em V{\rho}_i{c}_i\Delta T, $$

where $ {QV}_{ie} $ is the heat flux from inside to outside ( $ \mathrm{W} $ ), $ N $ is the ventilation rate in $ \mathrm{ACH} $ , $ {rho}_i $ is the density of the humid air ( $ \mathrm{kg}/{\mathrm{m}}^3 $ ), $ {c}_i $ is the thermal capacity of the humid air ( $ \mathrm{J}/\mathrm{kgK} $ ), and $ \Delta T $ is the temperature difference between the inside of the tunnel and the outside ( $ \mathrm{K} $ ). As the tunnel environment may have a different relative humidity from outside, ventilation also gives rise to changes in the moisture content of the internal air. Within the farm, the heat transfer between the different farm layers by conduction and radiation is simulated according to the temperatures of the different layers. Heat transfer to the internal air, and hence then exchanged with the outside through the ventilation, is via convection from each of the farm layers. In the model, convection is simulated using the following equation (here given for the vegetation–air interface):

(6)

$$ {QV}_{iv}={A}_v Nu\hskip0.1em \lambda \hskip0.1em \Delta T/{d}_v, $$

where $ {QV}_{iv} $ is the heat transfer from the air to the vegetative surface ( $ \mathrm{W} $ ), $ {A}_v $ is the area of the surface ( $ {\mathrm{m}}^2 $ ), $ \lambda $ is the thermal conductivity of air ( $ \mathrm{W}\;{\mathrm{m}}^{-1}\;{\mathrm{K}}^{-1} $ ), and $ \Delta T $ is the temperature difference across the interface ( $ \mathrm{K} $ ). $ Nu $ is the dimensionless Nusselt number, which is a function of the speed of the air travelling over the convective surface.

The temperatures and relative humidities within the farm are directly dependent on the operational conditions such as ventilation rate and internal air speed. Only partial information is available concerning the design of the tunnel ventilation system, and these parameters are difficult to measure accurately. In addition, the simplicity of the model means that parameters may represent a mean effect of different physical components, for example, the combined effect of infiltration and controlled ventilation are represented in the model as a single bulk ventilation rate. Calibration of the model and estimation of the parameters, therefore, becomes an important step in improving the model accuracy. But the parameters are not constant, and therefore, it becomes necessary to recalibrate the model to infer the change in parameter values. The problem is that it is not always obvious when parameters are changing—some changes, such as manual alterations to the ventilation settings, are known, but the impact of the alteration in terms of the change to the ventilation rate is not a linear function of the dial settings.

This need to recalibrate, together with the availability of continuously monitored data, makes development of a sequential calibration process particularly desirable.

3.2. Sensitivity study

For calibration of the model, the first step is to ascertain which model output is the most important with respect to farm environment. From the viewpoint of the growers, the air temperature and the relative humidity are the main quantities of interest. Plants grow best in optimal temperature conditions, and if the relative humidity gets too high, growth of mould can occur, affecting crop yield. So, the calibration of the model focuses on these model outputs.

A sensitivity study has been carried out using the Morris method (Menberg et al., Reference Menberg, Heo and Choudhary2016) to identify the relative significance of each input parameter with respect to the quantities of interest. The eight input parameters of primary importance for the heat and mass balance and their plausible ranges are given in Table 2. The parameter ranges given in the table either have been derived from measurements, are anecdotal, or have been estimated based on an understanding of the farm processes. For example, the fraction of planted area approaching harvest has been estimated based on the records of the number of newly planted trays and the number of trays being harvested at any time. Equally, the mean mat saturation is estimated based on the watering strategy employed in the farm in which the trays are flooded periodically and then drained over a period of time. The internal air speed has been measured (Jans-Singh et al., Reference Jans-Singh, Fidler, Ward, Choudhary, MJ, Schooling and GMB2019). The ventilation rate range is based on the tunnel ventilation system having been designed for an air change rate of $ 4\;\mathrm{ACH} $ but with the knowledge that additional uncontrolled ventilation can occur through the access shafts. The fraction of the lighting power that is emitted as heat has been estimated according to values found in the literature (e.g., LED; Magazine, 2005), and the surface temperature of the lights is based on estimates provided by farm personnel. The characteristic dimensions of the mat and leaf control whether the heat convection is primarily characterized by bulk heat transfer from the entire tray area—in which case the characteristic dimension is similar to that of the tray, that is, $ 1 m $ , or whether it is more characterized by the dimension of the leaf or the gaps between plants.

Table 2. Uncertain parameters assessed in model sensitivity analysis.

The Morris method uses a factorial sampling strategy to discretize the parameter space in order to ensure that the entire space is covered efficiently. In this way, with 10 trajectories for each of the eight parameters, a total of 80 different combinations of the parameters have been run with each combination only one parameter value different from the previous one. A statistical evaluation of the elementary effects (EEs), that is, the impact of a change in one parameter by a predefined value on the output of the model, has been performed using the absolute median and standard deviation of the EEs (Menberg et al., Reference Menberg, Heo and Choudhary2019).

The results of the sensitivity study are summarized in Figure 1. This shows the standard deviation as a function of the absolute median of the EEs when considering the summed absolute difference between the predicted and monitored (a) temperature and (b) relative humidity, over a 30-day period. A higher value for the standard deviation for a specific parameter implies that changes to that parameter have a more significant impact on the output of the model. Figure 1a indicates that the most influential parameters for temperature are the ventilation rate, $ N $ , and the internal air speed, $ \mathrm{IAS} $ , closely followed by the fraction of energy emitted as heat by the lights, $ {f}_{heat} $ . For relative humidity, Figure 1b indicates that the internal air speed is very significant, followed again by $ {f}_{heat} $ . The ventilation rate, $ N $ , is the next most significant parameter. The heat fraction of the lighting has a significant impact on the simulation output, and we could calibrate the model for this parameter to get a better estimate of the true value. However, as we cannot influence this value in any way, we choose to focus on the two parameters over which we have some control, that is, the ventilation rate and internal air speed. This approach is directed by the digital twin ethos in that we wish to be able to forecast the impact of changes to the operational conditions on the tunnel internal environment.

Figure 1. Sensitivity analysis results showing the impact of each parameter on the simulation output—higher values imply a more significant effect.

RH is itself a function of temperature, so rather than calibrating the model separately for both parameters, we have chosen to calibrate for RH alone.

4. Calibration Results

4.1. Toy problem

As a test of the approach, we start by fixing values for $ N $ and $ IAS $ and run the simulation for 20 days to generate synthetic test data. These data are illustrated in Figure 2. The simulation gives an hourly output, plotted in blue, but for calibration of the model, we select values every 12 hr (shown as circles) so as to retain a manageable amount of data for the KOH approach. As a result, we have two data points per day: one when the LED lights are switched on and the second when the LED lights are switched off. The internal air speed is fixed at a constant value of $ 0.3\;\mathrm{m}/\mathrm{s} $ . The ventilation rate as model input is initially fixed, then changed halfway through the simulation period, so that for the first 20 data points, the ventilation rate is $ 4\;\mathrm{ACH} $ , dropping to $ 2\;\mathrm{ACH} $ for the remaining period. The values chosen are for illustration, but we believe they are typical of the magnitude of the ventilation rate in the tunnel. The purpose of changing the ventilation rate halfway through the simulation is to investigate whether the sequential KOH and PF approaches are capable of inferring the change in parameter value.

Figure 2. Synthetic data: the solid line is the output of the simulation model with fixed parameters, $ N=4\hskip0.5em \mathrm{ACH} $ and $ IAS=0.3\;\mathrm{m}/\mathrm{s} $ . The data points used for the calibration are indicated as red circles (Time Period 1) and green dots (Time Period 2).

As aforementioned, the synthetic test data have been generated from the simulation with fixed parameters. We also require outputs from the simulation over a range of parameter values for input into the KOH and PF calibration frameworks. For this, we run the simulation with a range of values of the two parameters of interest: for ventilation rate, we use a range from $ 1 $ to $ 10\;\mathrm{ACH} $ in steps of $ 1.5\;\mathrm{ACH} $ , and for internal air speed, we use a range from $ 0.1 $ to $ 0.85\;\mathrm{m}/\mathrm{s} $ in steps of $ 0.15\;\mathrm{m}/\mathrm{s} $ : taking all combinations gives a total of 42 calibration runs. Again, this toy problem is an illustration, but the range of values selected are in line with the values we expect to be realistic in the tunnel situation.

We use the light state—on/off—in conjunction with the external air moisture content as scenarios under which the model is executed. Within the KOH formulation, these are defined as $ x $ , and we seek posterior estimates of the parameters $ \theta $ , in this case $ N $ and $ IAS $ . The rationale for selecting light state as one of the $ x $ values is that the lights are the main providers of heat in the system, and as we expect to observe a diurnal variation in temperature and hence relative humidity according to whether the lights are on or off, it is essential to know the light state. The external moisture content of the air makes a less significant contribution, but it does directly impact on the relative humidity depending on the ventilation rate.

The test data are plotted as a function of these two parameters in Figure 3, together with the output of the simulations (shown in blue), showing that the synthetic test data fall within the range of outputs from the simulation runs, as expected.

Figure 3. Toy problem: synthetic data and model relative humidity outputs plotted as a function of (a) light state and (b) external moisture content, that is, scenarios $ x $ for the KOH approach.

The KOH approach uses an MCMC technique for exploration of the parameter space: for all of the simulations performed here, we have used three chains of 5,000 iterations, as this number has typically given good levels of convergence. Considering the first 20 data points which form the first half of the test data (shown as red open circles in Figure 2), using the KOH approach with a uniform prior distribution for each of the two calibration parameters gives the posterior distributions for the calibration parameters shown in Figure 4. These plots show the uniform prior distributions as a solid red line, with the posterior distributions shown as dashed blue lines. Moreover, indicated in the figure are the true values, $ N=4\hskip0.5em \mathrm{ACH} $ and $ IAS=0.3\;\mathrm{m}/\mathrm{s} $ . As expected, the KOH calibration results in estimates for the calibration parameters which are centered about a mean value close to the true value. Extending the KOH approach to all 40 data points gives the posterior distributions illustrated in Figure 5. This shows nicely the issue with the KOH approach for time-varying parameters. The simulation results in a value for $ N $ midway between the two true values— $ N=4\hskip0.5em \mathrm{ACH} $ for the first half of the simulation and $ N=2\hskip0.5em \mathrm{ACH} $ for the second half—whereas the mean value of $ IAS $ is close to the true value which remains static throughout the simulation.

Figure 4. Toy problem: prior and posterior parameter probability distributions for the KOH approach, Period 1 only.

Figure 5. Toy problem: prior and posterior parameter probability distributions for the KOH approach, Periods 1 and 2.

One of the benefits of the KOH approach is that it gives posterior distributions not only for the calibration parameters but also for the model hyperparameters and hence the error terms (see the description following Equation 2)). Figure 6 shows a comparison of the prior and posterior distributions for the four precision hyperparameters for simulation of the first 20 data points. While the posterior distribution of the emulator precision hyperparameter is very similar to the prior, the model discrepancy or bias parameter has a median value lower than our prior assumption, whereas the random error and numerical error are higher. This suggests that the calibration is not identifying a systematic error from the information provided, but instead is absorbing all differences between the model and the data into the random and numerical errors.

Figure 6. Toy problem: precision hyperparameters for the KOH approach, Period 1.

The particle filter methodology has also been run with the synthetic test data. In this approach 1,000 particles are initiated comprising values for the two calibration parameters, $ \theta $ , and the length scale, $ l $ . The $ \theta $ s are sampled from an initial uniform prior distribution over the possible range of values, whereas the values for $ l $ are sampled from a lognormal distribution to ensure they are positive. At the first timestep, the particle filter compares the value of relative humidity that would be predicted by the simulation using the values from each particle against the true data. Here, we do not run the simulation explicitly for each particle at each timestep but instead emulate the simulation output using a GP fitted to the simulation results over the range of calibration parameters run prior to the particle filtering. The likelihood of each particle is then calculated based on the comparison, and weights are assigned to each particle according to the likelihood. Particles are resampled from the prior distribution taking the particle weights into account, and this new distribution forms the prior for the next step. The key difference between this approach and the KOH approach is that, here, each data point is considered sequentially, whereas in the static KOH approach, all data points are considered together.

For this reason, in addition to the static Bayesian calibration, we have run the KOH approach sequentially over the whole time period. As we have already seen in Figure 5, using all the data gives posterior parameter estimates in line with the mean values over the whole time period, but we wish to know whether we can use fewer data points and run the model sequentially and identify a sudden change in parameter value. The sequential approach has been used with two, four, and eight data points, as the methodology requires at least two data points in order to fit the emulator. In this approach, at each timestep, the oldest data point is discarded and a new data point added—this means that for two data points, there are 39 runs of the model; for four data points, there are 37 runs; and for eight data points, there are 33 runs. In addition, the mean of the posterior distribution from the previous timestep is used as the mean of the prior for the new timestep. Note that we have not used the full posterior from the previous timestep as prior for the next, as the standard deviation of the posterior becomes too narrow, making the prior too specific—instead we have maintained the standard deviation of the first prior.

The evolution of the posterior probability distribution for each of the calibration parameters is illustrated in Figure 7 for the PF model and the sequential KOH approach using four data points. Each plot shows the evolution in the parameter value distribution, starting from the prior distribution at the top, progressing in steps of four data points to the end of the simulation at the bottom. Note that all 40 data points were used in the simulation, but the output for the intermediate points is suppressed for brevity (as might be inferred from the figures, the suppressed values are very similar to those shown on either side). The results are shown in blue for the PF model and in red for the sequential KOH approach. Here, the change in ventilation rate at the midpoint of the simulation is clearly visible (Figure 7a), and although there is a small perturbation to the values of the internal air speed (Figure 7b), the distribution recovers quickly and the mean stays close to the true value throughout the simulation. The similarity between the mean posterior values for $ N $ and $ IAS $ and the known true values ensures that the posterior predictions calculated using the mean parameter values are very close to the test data. For this test case, the root-mean-square error values for the posterior predictions from the PF and sequential KOH models are $ 1.1\% $ and $ 0.9\% $ , respectively.

Figure 7. Particle filter approach: evolution of posteriors, showing (a) $ N\;\left(\mathrm{ACH}\right) $ , (b) $ IAS\;\left(\mathrm{m}/\mathrm{s}\right) $ , and (c) length scale.

Figure 7c shows the evolution of the PF length scale parameter for this test case. The distribution of values shows a wider spread at the start of the analysis, but this narrows as the analysis progresses and the mean value remains close to a value of 0.2 throughout. This is a true unknown parameter, as it is a hyperparameter of the emulator, and approximately reflects the distance in the input space that corresponds to an impact on the output; the small value derived here reflects the high degree of variability from one data point to the next.

In Figure 7, it is clear that the results for the PF model exhibit a smaller variance than the sequential KOH approach, yet the mean values are in good agreement. Figure 8 shows in more detail how the results of the sequential KOH approach compare against the PF model in terms of the mean posterior values for N and IAS. Using two data points (n = 2), the results show some fluctuations from the true value, particularly toward the end of the simulation for the internal air speed (Figure 8b), but pick up the timing of the change in ventilation rate well (Figure 8a). As the number of data points increases, this timing is harder to identify, as the approach assumes a constant value over the data, so when, for example, eight data points are used, the timing of the change in the ventilation rate, $ N $ is smoothed and appears not as a sudden change but as a smoother transition. Using four data points appears to match the transition time well with a sharper transition than eight data points, yet the mean of the posterior shows less fluctuation than two data points; hence, this appears to be the best approach. Figure 9 shows the mean and $ 90\% $ confidence limits of the model bias function as a function of the step for the sequential KOH approach with $ n=4 $ , separated according to whether the lights are switched on or off. The model bias is small, less than $ 0.5\% $ , and hence will not make a significant difference to the results.

Figure 8. Sequential KOH approach: evolution of the mean posterior for (a) the ventilation rate, $ N $ , and (b) the internal air speed, $ IAS $ , showing the impact of increasing the number of data points, n, included in each step from 2 to 8.

Figure 9. Sequential KOH approach: mean and 90% confidence limits of the model bias function.

The synthetic test data are noise-free, that is, the data are generated directly from the simulation model. But real data are typically noisy with errors arising from measurement—how do the approaches compare for noisy data? To test this, the calibration exercise has been rerun with noise added to the data in the form of random values generated from a zero-mean normal distribution, with standard deviation $ \sigma $ , where for illustrative purposes, $ \sigma $ has been calculated such that $ 2\sigma $ is equal to relative humidity values of $ 0.01 $ , $ 0.03 $ , $ 0.05 $ , and $ 0.10 $ , that is, $ \pm 1 $ to $ 10\% $ , where $ 10\% $ is the difference in relative humidity observed for a change in ventilation rate from $ 4 $ to $ 2\;\mathrm{ACH} $ . Figure 10 illustrates the comparison for the static KOH approach. In this example, the prior distributions for the two calibration parameters have been chosen as normal distributions, as giving better information in the prior distribution should improve the ability of the methodology to identify the posterior. However, as the noise level increases, the mean of the posterior moves increasingly further from the true value, more significantly for the ventilation rate $ N $ (Figure 10a).

Figure 10. Toy problem: results of the KOH calibration with Period 1 data showing effect of increased noise in the data.

To understand the impact of introducing noise in the data on the PF approach, consider Figure 11, which plots the mean of the PF posterior distribution at each timestep. Again, the initial prior distribution for each $ \theta $ was specified as a normal distribution. Looking first at the ventilation rate (Figure 11a), the zero noise results, shown as a dashed blue line, clearly show the step change from a value of $ N=4\;\mathrm{ACH} $ to $ N=2\;\mathrm{ACH} $ at the midpoint of the analysis (step = 20). The results for 1 and 3% noise are not too dissimilar (green and cyan), and even at a noise level of 5%, the posterior finishes close to the true value after 40 steps, albeit without such a clear step-change. At a noise level of 10%, however, the PF approach cannot identify the change in parameter value. This is because the change in relative humidity due to a change in the ventilation rate from 4 to 2 is close to 10%, and so the model cannot distinguish what is noise and what is a genuine parameter change. It is clear that if there is too much noise in the data, then both sequential approaches fail to correctly infer the parameter values.

Figure 11. Toy problem: mean of particle filtering evolution demonstrating effect of noise in the data.

Figure 11b shows the evolution of the posterior of IAS as a function of the step for the increasing noise levels. The results are reasonably consistent up until timestep 20, after which the mean IAS posterior increases, dependent on the noise level, before dropping back to the true value; for 10% noise, the true value is not achieved by Step 40. Interestingly, although there are clear differences in the evolution of the mean length scale for the different noise levels, the absolute value varies most for zero noise, the mean value increasing from 0.21 to 0.26 (Figure 11c). For higher noise levels, the mean length scale value meanders close to 0.2. This suggests that despite the noise, the model is still detecting similar relationships between the data points.

The hyperparameters of the PF approach have been set to be a similar magnitude to the KOH approach to ensure comparability. One parameter that has no equivalence, however, is the number of particles. This is important, as it directly affects the run time of the model. Figure 12 shows the impact of changing the number of particles on the output, with particle number equal to 100 (green), 1,000 (blue), and 10,000 (magenta). The results demonstrate that the number of particles has little impact on the outputs of this test case.

Figure 12. Mean of particle filtering evolution demonstrating effect of the number of particles.

The more data points are used, however, the greater the time taken to run the analysis. Table 3 shows the run times for analysis of this toy problem using the different models. The models have been processed on a PC with an i7-6700 CPU processor and 32-MB RAM, and of course, a more powerful PC would improve these times, but the relative difference between the run times for the models will still hold. It is clear that the PF approach is substantially quicker than the static KOH approach, as even with 10,000 particles, the step time is only just over 3 min, giving a total time of just over 2 hr, whereas the KOH approach required a run time in excess of 6 hr for the same problem. The sequential KOH approach is substantially quicker, however, but using four data points, the run time is still much slower than the PF approach with 1,000 particles.

Table 3. Comparison of run times: the run times for the KOH approach are for the entire simulation, whereas for the PF and sequential KOH approaches, run times are given both for a single step and for the entire simulation.

4.2. Calibration with monitored data from the farm

The toy problem has facilitated comparison between the three approaches, but the main aim of this study is to explore the applicability of using the PF approach for model calibration in the context of a digital twin. To this end, real monitored data from August to October 2018 have been selected for further study. This time period has been chosen, as the data are complete and there were no known significant changes to operation except known changes to the ventilation settings. This is important, as we would like to understand to what extent the two calibration approaches are capable of identifying the change in parameter. While we know what the changes are to the ventilation settings, we do not know exactly what that means in terms of changes to the parameter value (the ventilation rate). We also do not know to what extent other operational conditions were in place that might have impacted on ventilation rate or air speed.

Implementation of the models with the monitored data has followed the same approach as described for the toy problem above. The monitored data and the corresponding simulation outputs across the input parameter ranges are illustrated in Figures 13 and 14. The dots in the first figure illustrates how the data evolve in time, whereas in the second figure, they show how the data vary as a function of the light state and external air moisture content, the two scenarios used in the KOH approach. As for the toy problem, data points have been extracted at 4 pm and 4 am, corresponding to the lights being off and on. Each black dot in Figure 13 is a data point, and we have selected two separate periods of 20 data points highlighted in red and green for analysis using the standard BC approach. Period 1 (red) is chosen, as it lies before the change in ventilation settings, and Period 2 (green) after the change. It is not feasible to use all 118 data points for the static KOH approach, as the run time would be too long. Moreover, shown in the figure are the outputs from the simulation runs executed for the calibration (blue crosses). Unlike the toy problem, here there are several observations which lie outside the range covered by the simulation outputs, specifically there are points of low and high relative humidity that are not predicted by the model. This is likely due to simplifications in the model that do not adequately represent the sources of humidity, for example, the irrigation of the plant trays is represented by a single mean saturation level that does not account for daily fluctuations in moisture level.

Figure 13. Monitored data used for the calibration and the corresponding outputs from the calibration simulations over 3 months.

Figure 14. Monitored data and the corresponding simulation outputs for the Kennedy and O’Hagan (Reference Kennedy and O’Hagan2001) approach, showing the scenarios (a) light state and (b) external moisture content.

As a first step, the KOH approach has been used to estimate the model input parameters for Periods 1 and 2. Figure 15 shows the posterior distributions for both parameters for both time periods, with the first time period shown in green (dashes and dots) and the second in magenta (dots). Here, a normal prior distribution has been used (shown in black) to give a more informative prior, as we expect the observations to have some degree of measurement error. We have selected prior mean and standard deviation values to position the prior at the center of the possible range of parameter values, but with sufficient variance to generate samples across the range, as we anticipate the parameter values being toward the center of the possible range of values. The posterior parameter distributions suggest that the ventilation rate drops from a mean value of $ 6.5\;\mathrm{ACH} $ in August to a mean value of $ 5.6\;\mathrm{ACH} $ in October, whereas the mean internal air speed increases from 0.23 to 0.44 $ \mathrm{m}/\mathrm{s} $ . These results are plausible given the increase in relative humidity observed toward the end of Period 2 where both a reduction in ventilation rate and an increase in internal air speed tend to result in an increase in relative humidity.

Figure 15. Prior and posterior distributions of ventilation rate, $ N $ , and internal air speed, $ IAS $ , using the Kennedy and O’Hagan (Reference Kennedy and O’Hagan2001) approach with monitored data from the farm.

It is particularly useful when using monitored data to explore the model bias function, as it can give some idea as to where the physics-based model might be improved. Figure 16 shows the mean and 90% confidence limits of the model bias for Periods 1 and 2 plotted separately for the two light states—off and on—as a function of external moisture content on the x axis. As for the toy problem, the magnitude of the bias is small, suggesting that differences between the model and the data are primarily accounted for by measurement error which has a mean posterior value of 0.77. However, there is a clear difference between the bias function for lights on and lights off. When the lights are on, the absolute magnitude of the bias function is greater implying that the agreement between the model and the data is better when the lights are off, particularly in Period 1. The fact that the bias function is negative when the lights are on suggests that the model is overpredicting the relative humidity when compared with the data. Relative humidity is a combined effect of temperature and air moisture content, so a value that is too high can be due to either temperatures that are too low or too much moisture in the air. This information gives us valuable insight into how to improve the physics-based model.

Figure 16. Model bias function.

As for the test case, we run the PF approach for the entire period from August to October, a total of 118 data points.

The outputs of the calibration process in terms of the evolution of the posterior distributions for the ventilation rate and the internal air speed are indicated in Figure 17, and the PF length scale remained fairly constant at a value of just under 0.2. Figure 17 shows first the monitored data used for the calibration in Figure 17a, and the mean of the posterior parameter values are shown in Figure 17b,c as solid blue lines. We have also run a sequential BC updating after four data points similar to the toy problem case, shown in the figures as the dashed red lines. The mean of the two static BCs are indicated in the figures as dash-dotted green and dotted magenta lines over the periods of interest. Moreover, indicated in Figure 17b are the settings of the two ventilation systems over this period. The CP ventilation setting is constant until September 22, at which point it is reduced from $ 2 $ to $ 0.5\;\mathrm{ACH} $ , whereas the CC setting is higher, at $ 4.5\;\mathrm{ACH} $ , during the initial period, reduced in early September to a value of $ 2\;\mathrm{ACH} $ .

Figure 17. Calibration input (a) and outputs (b–e) for monitored RH data.

Considering Figure 17b,c, the first point is that the results for the two sequential calibrations are consistent. The sequential KOH results are more extreme, particularly for the internal air speed (Figure 17c) where the value peaks on several occasions, reaching a high of $ 0.8\;\mathrm{m}/\mathrm{s} $ on August 30th. The PF approach has peaks at similar times, but they do not reach the same magnitude, only approaching $ 0.6\;\mathrm{m}/\mathrm{s} $ on August 30th. To match observed high peaks in monitored relative humidity, the model requires a low ventilation rate and a high internal air speed—the first prevents moisture from leaving the system, and the second increases evaporation and transpiration rates from the growing system. The more extreme peaks arise from the sequential KOH approach due to a combination of factors that can be appreciated with a consideration of Figure 17d,e which shows the evolution of the posterior distributions for each of the two calibration parameters over the highest peak in the monitored data on August 30th. The first factor is that the sequential KOH approach uses four monitored data points at each timestep, whereas the PF approach considers each data point individually. This means that for the sequential KOH approach, the influence of each data point extends over a longer time period and makes the approach more likely to attain higher parameter values. But it also means the sequential KOH approach is slower to respond to a sudden change in value than the PF approach; note, for example, in the PF approach, the internal air speed drops on August 30th–31st responding to the sudden drop in monitored relative humidity, whereas in the KOH approach, this drop does not happen until September 1st when the influence of the high monitored relative humidity on August 29th–30th is no longer felt. The second factor giving rise to the more extreme peaks is the fixing of the posterior variance in the sequential KOH approach which as described previously was necessary to avoid unacceptable narrowing of the posterior distribution. Reducing the fixed variance value for the sequential KOH value yields lower peak mean parameter values, as the smaller variance gives less credibility to the higher values. By comparison, the PF approach has no such constraint, so when a sudden change in the monitored data occurs, the variance of the posterior distribution can increase to encompass a wider range of possible parameter values. This can lead to a lower mean value relative to the sequential KOH approach, as, for example, illustrated for the internal air speed for the second data point on August 29th (Figure 17e).

The second point observable from Figure 17 is that the sequential KOH results are very different from the static KOH results, particularly for Period 2 where the static values of $ 5.6\;\mathrm{ACH} $ and $ 0.43\;\mathrm{m}/\mathrm{s} $ for the ventilation rate and the internal air speed, respectively, are substantially higher than the sequential values which are closer to $ 3\;\mathrm{ACH} $ and $ 0.2\;\mathrm{m}/\mathrm{s} $ for most of Period 2.

Third, neither of the sequential models strongly reflects the known changes to the settings of the ventilation system. The total ventilation rate is the combination of both controlled and uncontrolled ventilation. That neither model identifies changes corresponding to the changes to the controlled ventilation may suggest that the noncontrolled ventilation arising from lift shafts, door infiltration, and so forth is dominant. Equally, it may reflect the fact that the data are quite noisy, so the true values are masked by the noise in the data.

Even if the models are not able to identify “true” values for these parameters, are we able to infer parameter values that enable the model to accurately simulate the farm environment? To assess this, we have run the model using parameter values for $ N $ and $ IAS $ equal to the outputs of the sequential models, and also equal to the static values for Period 1.

The mean and 90% confidence limits of the relative humidity values simulated using the three different approaches are shown in Figure 18. Figure 18a shows the simulation results across the entire time period compared against the monitored data, with the complete data set shown as red circles and the data used for the calibration picked out as black dots. Specifics are difficult to pick out, but the overall trend for the sequential models to give a much closer approximation to the data than the static model is clear, as the PF approach results in blue and the sequential KOH approach results in red show a much greater degree of variation than the static KOH results in green, in line with the data. This is particularly clear over the central section of the simulation around August 30th, and this section of the plot is enlarged for clarity in Figure 18b. Here, we can clearly see that using the parameters derived for Period 1, using the static KOH approach gives a poor approximation to the data in the days before August 30th, whereas the two sequential approaches give an increase in RH over this period in line with the data. That they do not get even closer is due to the fact that these data lie outside the range of the simulation runs illustrated in Figure 13. Another point of interest from Figure 18a is that there is a gradual increase in variance of the PF results as time progresses, that is, the 90% confidence limit shown at the beginning of August is much narrower than that shown in October. This feature of the PF that the variance of the particle weights increases over time means that for longer term simulations, some form of correction will need to be implemented. This is not an issue for the sequential KOH approach, as we have not allowed the variance of the parameter values to change as the simulation progresses.

Figure 18. Relative humidity results for the PF, sequential KOH, and static KOH approaches, showing (a) the entire simulation and (b) a central period.

The KOH approach yields a model bias function for Periods 1 and 2, which suggests that the model may overestimate the relative humidity when the lights are switched on (Figure 16). Figure 19 shows the mean and 90% confidence limit of the bias function for the sequential KOH approach plotted as a function of time. The bias function when the lights are on tends to be opposite in sign to that when the lights are off, and again the sign of the bias suggests the model tends to overestimate the relative humidity when the lights are on and underestimates when the lights are off. The absolute magnitude is small, however, only peaking over 0.01 when the lights are off on August 19th, and at 0.02 when the lights are on in the morning of September 18th. The mean estimated measurement error is greater at 0.25 and peaks where the monitored relative humidity values are high on August 30th, suggesting that the differences between the model and the data are not systematic.

Figure 19. Sequential Kennedy and O’Hagan (Reference Kennedy and O’Hagan2001) approach: mean and 90% confidence limits of the model bias function.

5. Discussion

In the context of a digital twin, calibrating the simulation element of the twin with current data is essential for ensuring that the simulation is representing reality as closely as possible. Manual calibration is time-consuming and not suitable when parameter values are changing. This study explores the suitability of a particle filter approach for calibration and compares it against static Bayesian calibration following the approach of Kennedy and O’Hagan (Reference Kennedy and O’Hagan2001), and also uses the KOH approach sequentially as a further comparison.

The static KOH approach assumes that the calibration parameters do not change over the time span of the data and so is only able to give good estimates if there is confidence that the parameter values are unchanging. We have seen using a toy problem that if the parameter values do change, the KOH approach can only give an average of the true value (Figure 5). If the parameters are unchanging, however, it is still the “gold standard” for calibration against which we compare other approaches. It is particularly useful, as the KOH approach is formulated in such a way as to give estimates not only of the calibration parameters but also of the discrepancy between the model and the data (model bias), and the measurement and numerical errors. We have seen in the toy problem (Figure 6) that the posterior distributions of the error terms have low median values as expected when the data are derived directly from the model, but for monitored data, the approach gives a more useful insight, particularly into the model bias (Figure 16). The figure illustrates that while the bias only shows a small variation with the external air moisture content, there is a more significant difference according to whether the lights are switched on or off. With the lights on—a direct source of heat—the bias is negative, suggesting that the model is predicting relative humidities that are higher than those observed in the data. Relative humidity increases as temperature decreases for the same air moisture content, so if it is too high, that suggests either the predicted temperature is too low, or the predicted air moisture content is too high. This gives direct insight into how to improve the physics-based model going forward.

The sequential approaches offer the potential to track parameter values, which is what we desire for the digital twin. We have applied the different approaches to both a toy problem, with data derived from the simulation with known parameters and a controlled change in the value of ventilation rate, and to real monitored data. The difference is very clear; whereas with the toy problem, the sequential methods were able to track a clearly defined change in ventilation rate accurately (Figure 7), the monitored data give a much more variable output (Figure 17). We have also seen that in the monitored data, there are data points that fall outside the range of the simulation runs. The very high relative humidity values occurring around August 29th and September 5th (Figure 13) seem to be not following the diurnal pattern typically seen on other days so could be due to something happening within the farm of which we are not aware—a change to the operation of the dehumidifiers, for example. Equally, there are values which are lower than the simulation runs, particularly in early August. More work is needed to explore the model inadequacies, particularly for extreme values of relative humidity.

Since we know the settings of the ventilation system, we might expect a clearer link between the ventilation rate parameter values and the system settings, but this is not observed. One possible reason for this is that the data are subject to a high degree of measurement error—we saw in the toy problem how noisy data can prohibit the ability of the sequential approaches to track the parameter change. But we should consider what we are tracking: we are tracking the parameter values that give the best agreement of the model with the data. The relationship between the tracked parameters and reality depends on the extent to which the model represents reality. For example, here the ventilation rate assumed in the model must encompass all controlled and uncontrolled components of the ventilation and is not simply equal to the setting on the dial.

This highlights the balance to be considered in the complexity of the simulation model. Here, we use a simple physics-based model calibrated using monitored data to give the best agreement between the model and the data. But there is limited insight into why the ventilation rate and the internal air speed appear to fluctuate so dramatically—this could be a real effect, or as is more likely it could be that other events are happening that impact on the relative humidity but are not incorporated in the model. In this instance, the calibration approaches are seeking to match the observed data with a simulation based on an incomplete description of events and will push the parameter values to give as close a match as possible. The calibration parameters then become a proxy for all events affecting their values, that is, as outlined above, the ventilation rate has to encompass both controlled and uncontrolled ventilation, and the internal air speed will be affected by factors such as blockages in the tunnel which affect air flow. A more complex model could include a more detailed representation of the system processes explicitly, but a more complex model would take longer to run and longer to calibrate. This balance must be assessed in the design and development of a digital twin.

In this study, we have compared the PF approach against a sequential version of the KOH approach, and it is clear that they give similar results. The main differences between them are time and information—on the one hand, the PF is substantially quicker depending on the approach details—for the PF approach, a higher number of particles causes the run time to increase linearly, whereas in the KOH approach, the run time increases approximately as the square of the number of data points for the same number of MCMC iterations. On the other hand, the sequential KOH approach estimates the model bias and the measurement and numerical error terms in addition to the calibration parameters. The model bias extracted for the sequential KOH study of the monitored data (Figure 19) reinforces the trend observed for the toy problem, that is, the model tends to overpredict relative humidity when the lights are on and underpredict when the lights are off. This study was based purely on two data points each day, one from when the lights are switched on and one from when they are off; a more targeted choice of data or scenario could give more insight into the model discrepancy and the error terms (Menberg et al., Reference Menberg, Heo and Choudhary2019). Further work will assess the sensitivity of the approaches to the choice of data with reference to maximizing the impact on the accuracy of the digital twin.

The PF approach works well for the purposes of providing quick estimates of parameter probability distribution in real time and responds quickly to sudden changes in the monitored data. The sequential KOH approach also works well, but in this study has been shown to be more computationally intensive with longer run times, gives more extreme results, and reacts more slowly to a sudden change in the monitored data than the PF approach. This latter point is a result of the need to include more than one data point per timestep for the sequential KOH approach. As discussed in reference to Figure 17, the influence of each data point extends over multiple timesteps, and response to a sudden change in the monitored data only occurs once the influence of the value prechange is no longer felt.

In the implementation of the sequential KOH approach, we have prescribed the variance of the prior for all timesteps, that is, we have not allowed changes in variance to propagate through time. This was a necessary assumption to ensure the prior remained sufficiently diverse. By comparison, in implementing the PF approach, we have allowed the variance of the prior to propagate. As discussed with reference to Figure 17, the artificially prescribed variance for the sequential KOH approach contributes to the more extreme peaks in mean IAS observed in Figure 17c. At the locations of the peaks, the monitored RH values are greater than the values the model would predict with the parameter value ranges provided, and as a consequence, the models extract the highest values possible within the parameter bounds that have been set. The magnitude of the peak parameter value is dependent on the likelihood of that value given the data and the calibration run outputs. In the sequential KOH approach, we have observed that the magnitude of the peaks is dependent on the assumed prior variance—if a smaller variance is assumed, the peaks are significantly lower and more in line with the PF results. This is because a higher variance gives more credibility to a wider range of values, affording higher values a greater likelihood.

Both approaches make use of an emulator fitted to the simulation output in order to map the inputs of the model to the output of interest, and for both approaches, GP models have been used. This requires the physics-based model to be run a considerable number of times with parameter values covering the ranges of interest prior to running the calibration. Within the digital twin framework, this approach will require modification such that the model predictions are generated as the calibration progresses, rather than in advance. The PF approach may be more suited to this framework than the KOH approach, as the PF considers each data point separately rather than mapping the entire parameter space.

6. Conclusions

This study makes an essential contribution to the development of digital twins of the built environment. Combination of data and a model—in this case, the physics-based model—is at the heart of what is meant by a digital twin. To maximize the benefits, it is essential to ensure that the data are used to continuously update the model. In this study, the continuous calibration of the model has been explored using a PF approach. This has been compared against Bayesian calibration using the framework put forward by Kennedy and O’Hagan (Reference Kennedy and O’Hagan2001), and against a sequential form of the KOH approach.

The development of the digital twin for an underground farm provides an exemplar for the built environment more broadly. It is necessary to maintain favorable environmental conditions in the farm, and to that end, an extensive program of monitoring has been undertaken and is still ongoing. A simple physics-based model of the farm has also been developed which can aid prediction of environmental conditions and provide useful information for future planned farm expansion.

The PF approach has been shown to give a good estimate of the mean and variance of the model input parameters ventilation rate and internal air speed which were selected owing to their impact on the relative humidity in the farm. When using the calibrated time-varying parameters in the model, a much closer agreement with the monitored data is observed than when using parameters inferred from the static BC approach. It is clear, however, that the physics-based model is unable to predict the full range of relative humidities observed in reality; in particular, there are data points that fall both above and below the predicted range of relative humidity. This model inadequacy cannot be explored using the PF approach alone—the KOH approach is required in order to quantify the model bias function and error terms.

In terms of suitability for incorporation in a digital twin, the PF approach is more suitable than the sequential KOH approach used here, as it is quicker and is more suited to combination with execution of the physics-based model at each data point. It also quantifies the variance in the posterior parameter probability distributions, although the inevitable increase in variance over time will need to be addressed. A combined approach, using the PF approach in real time and the KOH approach for periodic assessment of the model bias function, would yield the best result for ensuring continuity of model accuracy. Further work will address the practical implementation of the PF approach within the digital twin framework.

Acknowledgment

We are grateful for the continued support of Growing Underground without whose input this study would not have been possible.

Data Availability Statement

The data used were made available by Growing Underground for the purpose of this study.

Author Contributions

Conceptualization: M.G., R.C., and A.G.; Methodology: R.W. and A.G.; Data visualization: R.W.; Writing-original draft: R.W. and R.C.; Data curation: M.J.-S. All authors approved the final submitted draft.

Funding Statement

This research was supported by AI for Science and Government (ASG), UKRI’s Strategic Priorities Fund awarded to the Alan Turing Institute, UK (EP/T001569/1), and the Lloyd’s Register Foundation program on Data-Centric Engineering.

Competing Interests

The authors declare no competing interests exist.

References

Andrieu, C, Doucet, A, Singh, SS and Tadić, VB (2004) Particle methods for change detection, system identification and control. Proceedings of the IEEE 92(3), 423–438.CrossRef Google Scholar

Cheng, C and Tourneret, J-Y (2017) Kernel density-based particle filter for state and time-varying parameter estimation in nonlinear state-space models. In 2017 25th European Signal Processing Conference (EUSIPCO). IEEE, pp. 1664–1668, https://doi.org/10.23919/EUSIPCO.2017.8081492 CrossRef Google Scholar

Chong, A, Lam, KP, Pozzi, M and Yang, J (2017) Bayesian calibration of building energy models with large datasets. Energy and Buildings 154, 343–355.CrossRef Google Scholar

Chong, A and Menberg, K (2018) Guidelines for the Bayesian calibration of building energy models. Energy and Buildings 174, 527–547.CrossRef Google Scholar

Chong, A, Xu, W, Chao, S and Ngo, N-T (2019) Continuous-time Bayesian calibration of energy models using BIM and energy data. Energy and Buildings 194, 177–190.CrossRef Google Scholar

Choudhary, A, Lindner, JF, Holliday, EG, Miller, ST, Sinha, S and Ditto, WL (2020) Physics-enhanced neural networks learn order and chaos. Physical Review E 101, 062207 (Published online 18 June 2020).CrossRef Google Scholar PubMed

Doucet, A and Johansen, AM (2008). Particle filtering and smoothing: Fifteen years later. In Crisan, D and Rovosky, B (eds), Handbook of Nonlinear Filtering. Cambridge: Cambridge University Press.Google Scholar

Guillas, S, Rougier, J, Maute, A, Richmond, AD and Linkletter, CD (2009) Bayesian calibration of the thermosphere-ionosphere electrodynamics general circulation model (TIE-GCM). Geoscience Model Development 2, 137–144.CrossRef Google Scholar

Heo, Y, Choudhary, R and Augenbroe, GA (2012) Calibration of building energy models for retrofit analysis under uncertainty. Energy and Buildings 47, 550–560.CrossRef Google Scholar

Higdon, D, Kennedy, M, Cavendish, JC, Cafeo, JA and Ryne, RD (2004) Combining field data and computer simulations for calibration and prediction. SIAM Journal on Scientific Computing 26(2), 448–466.CrossRef Google Scholar

Jans-Singh, M, Fidler, P, Ward, RM and Choudhary, R (2019) Monitoring the performance of an underground hydroponic farm. In MJ, DeJong, Schooling, JM, and GMB, Viggiana (eds), International Conference on Smart Infrastructure and Construction 2019 (ICSIC): Driving Data-Informed Decision-Making, 8–10 July 2019. London: ICE Publishing, pp. 133–141.CrossRef Google Scholar

Kennedy, MC and O’Hagan, A (2001) Bayesian calibration of computer models. Royal Statistical Society 63(3), 425–464.CrossRef Google Scholar

LED Magazine (2005) Fact or fiction—LEDs don’t produce heat. Available at https://www.ledsmagazine.com/leds-ssl-design/thermal/article/16696536/fact-or-fiction-leds-dont-produce-heat (accessed 11 May 2005).Google Scholar

Li, Q, Augenbroe, G and Brown, J (2016) Assessment of linear emulators in lightweight Bayesian calibration of dynamic building energy models for parameter estimation and performance prediction. Energy and Buildings 124, 194–202.CrossRef Google Scholar

Madni, AM, Madni, CC and Lucero, SD (2019) Leveraging digital twin technology in model-based systems engineering. Systems 7(7), 328–351.Google Scholar

Menberg, K, Heo, Y and Choudhary, R (2016) Sensitivity analysis methods for building energy models: Comparing computational costs and extractable information. Energy and Buildings 133, 433–445.CrossRef Google Scholar

Menberg, K, Heo, Y and Choudhary, R (2019) Influence of error terms in Bayesian calibration of energy system models. Journal of Building Performance Simulation 12(1), 82–96.CrossRef Google Scholar

Moradkhani, H and Hsu, K-L (2005) Uncertainty assessment of hydrologic model states and parameters: Sequential data assimilation using the particle filter. Water Resources Research 41, W05012.CrossRef Google Scholar

Pedregosa, F, Varoquaux, G, Gramfort, A, Michel, V, Thirion, B, Grisel, O, Blondel, M, Prettenhofer, P, Weiss, R, Dubourg, V, Vanderplas, J, Passos, A, Cournapeau, D, Brucher, M, Perrot, M and Duchesnay, E (2011) Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12, 2825–2830.Google Scholar

Worden, K, Cross, E, Gardner, P, Barthorpe, R and Wagg, DJ (2020) On digital twins, mirrors and virtualisations. In Barthorpe, R (ed), Model Validation and Uncertainty Quantification, Vol. 3. Cham: Springer, pp. 285–295.CrossRef Google Scholar

Table 1. Kennedy and O’Hagan (2001) approach hyperparameter prior distributions.

Table 2. Uncertain parameters assessed in model sensitivity analysis.

Figure 1. Sensitivity analysis results showing the impact of each parameter on the simulation output—higher values imply a more significant effect.

Figure 2. Synthetic data: the solid line is the output of the simulation model with fixed parameters, $ N=4\hskip0.5em \mathrm{ACH} $ and $ IAS=0.3\;\mathrm{m}/\mathrm{s} $. The data points used for the calibration are indicated as red circles (Time Period 1) and green dots (Time Period 2).

Figure 3. Toy problem: synthetic data and model relative humidity outputs plotted as a function of (a) light state and (b) external moisture content, that is, scenarios $ x $ for the KOH approach.

Figure 4. Toy problem: prior and posterior parameter probability distributions for the KOH approach, Period 1 only.

Figure 5. Toy problem: prior and posterior parameter probability distributions for the KOH approach, Periods 1 and 2.

Figure 6. Toy problem: precision hyperparameters for the KOH approach, Period 1.

Figure 7. Particle filter approach: evolution of posteriors, showing (a) $ N\;\left(\mathrm{ACH}\right) $, (b) $ IAS\;\left(\mathrm{m}/\mathrm{s}\right) $, and (c) length scale.

Figure 8. Sequential KOH approach: evolution of the mean posterior for (a) the ventilation rate, $ N $, and (b) the internal air speed, $ IAS $, showing the impact of increasing the number of data points, n, included in each step from 2 to 8.

Figure 9. Sequential KOH approach: mean and 90% confidence limits of the model bias function.

Figure 10. Toy problem: results of the KOH calibration with Period 1 data showing effect of increased noise in the data.

Figure 11. Toy problem: mean of particle filtering evolution demonstrating effect of noise in the data.

Figure 12. Mean of particle filtering evolution demonstrating effect of the number of particles.

Figure 13. Monitored data used for the calibration and the corresponding outputs from the calibration simulations over 3 months.

Figure 14. Monitored data and the corresponding simulation outputs for the Kennedy and O’Hagan (2001) approach, showing the scenarios (a) light state and (b) external moisture content.

Figure 15. Prior and posterior distributions of ventilation rate, $ N $, and internal air speed, $ IAS $, using the Kennedy and O’Hagan (2001) approach with monitored data from the farm.

Figure 16. Model bias function.

Figure 17. Calibration input (a) and outputs (b–e) for monitored RH data.

Figure 18. Relative humidity results for the PF, sequential KOH, and static KOH approaches, showing (a) the entire simulation and (b) a central period.

Figure 19. Sequential Kennedy and O’Hagan (2001) approach: mean and 90% confidence limits of the model bias function.

Submit a response

Comments

No Comments have been published for this article.

Article contents

Continuous calibration of a digital twin: Comparison of particle filter and Bayesian calibration approaches

Abstract

Keywords

Impact Statement

1. Introduction

2. Calibration Approach

2.1. The KOH calibration framework

2.2. Particle filtering

3. Digital Twin of the Underground Farm

3.1. The simulation model

3.2. Sensitivity study

4. Calibration Results

4.1. Toy problem

4.2. Calibration with monitored data from the farm

5. Discussion

6. Conclusions

Acknowledgment

Data Availability Statement

Author Contributions

Funding Statement

Competing Interests

References

Comments

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests