The main aim of the current study was to present the abilities of widely used crop models to simulate four different field crops (winter wheat, spring barley, silage maize and winter oilseed rape). The 13 models were tested under Central European conditions represented by three locations in the Czech Republic, selected using temperature and precipitation gradients for the target crops in this region. Based on observed crop phenology and yield from 1991 to 2010, performances of individual models and their ensemble were analyzed. Modelling of anthesis and maturity was generally best simulated by the ensemble median (EnsMED) compared to the ensemble mean and individual models. The yield was better simulated by the best models than estimated by an ensemble. Higher accuracy was achieved for spring crops, with the best results for silage maize, while the lowest accuracy was for winter oilseed rape according to the index of agreement (IA). Based on EnsMED, the root mean square errors (RMSEs) for yield was 1365 kg/ha for winter wheat, 1105 kg/ha for spring barley, 1861 kg/ha for silage maize and 969 kg/ha for winter oilseed rape. The AQUACROP and EPIC models performed best in terms of spread around the line of best fit (RMSE, IA). In some cases, the individual models failed. For crop rotation simulations, only models with reasonable accuracy (i.e. without failures) across all included crops within the target environment should be selected. Application crop models ensemble is one way to increase the accuracy of predictions, but lower variability of ensemble outputs was confirmed.