The European Union Effort Sharing Regulation (ESR) will require a 30% reduction in greenhouse gas (GHG) emissions by 2030 compared with 2005 from the sectors not included in the European Emissions Trading Scheme, including agriculture. This will require the estimation of current and future emissions from agriculture, including dairy cattle production systems. Using a farm-scale model as part of a Tier 3 method for farm to national scales provides a more holistic and informative approach than IPCC (2006) Tier 2 but requires independent quality control. Comparing model results across scenarios that span an appropriate range of biophysical and management situations can support this process by providing a framework for placing model results in context. To assess the variation between models and the process of understanding differences, estimates of GHG emissions from four farm-scale models (DairyWise, FarmAC, HolosNor and SFARMMOD) were calculated for eight dairy farming scenarios within a factorial design consisting of two climates (cool/dry and warm/wet) × two soil types (sandy and clayey) × two feeding systems (grass only and grass/maize). The milk yield per cow, follower:cow ratio, manure management system, nitrogen (N) fertilisation and land area were standardised for all scenarios so that differences in the results could be attributed to model structure and function. Potential yield and application of available N in fertiliser and manure were specified separately for grass and maize. Significant differences between models were found in GHG emissions at the farm scale and for most contributory sources, although there was no difference in the ranking of source magnitudes. The farm-scale GHG emissions, averaged over the four models, were 10.6 t carbon dioxide equivalents (CO2e)/ha per year, with a range of 1.9 t CO2e/ha per year.
Even though key production characteristics were specified in the scenarios, there were still significant differences between models in the annual milk production per ha and the amounts of N fertiliser and concentrate feed imported. This was because the models differed in their description of biophysical responses and feedback mechanisms, and in the extent to which management functions were internalised. We conclude that comparing the results of different farm-scale models when applied to a range of scenarios would build confidence in their use in achieving ESR targets, justifying further investment in the development of a wider range of scenarios and software tools.
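
The factorial design described above (two climates × two soil types × two feeding systems, each scenario run with all four models) can be enumerated in a few lines. The following is a minimal illustrative sketch in Python, not the procedure used in the study: the factor levels and model names are taken from the text, while the variable names, the dictionary layout and the build_scenarios helper are hypothetical.

```python
from itertools import product

# Factor levels and model names as given in the text; everything else
# (names, data layout) is an assumption made for illustration only.
CLIMATES = ("cool/dry", "warm/wet")
SOILS = ("sandy", "clayey")
FEEDING_SYSTEMS = ("grass only", "grass/maize")
MODELS = ("DairyWise", "FarmAC", "HolosNor", "SFARMMOD")


def build_scenarios():
    """Return every climate x soil x feeding-system combination."""
    return [
        {"climate": climate, "soil": soil, "feeding": feeding}
        for climate, soil, feeding in product(CLIMATES, SOILS, FEEDING_SYSTEMS)
    ]


scenarios = build_scenarios()

# Each model is applied to each scenario, giving one run per pair.
runs = [(model, scenario) for model in MODELS for scenario in scenarios]

print(f"{len(scenarios)} scenarios, {len(runs)} model runs")
```

Enumerating the design this way makes explicit that the comparison rests on 2 × 2 × 2 = 8 scenarios and, with four models, 32 model runs in which only the model differs within a scenario.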