The use of cluster analysis to derive dietary patterns: methodological considerations, reproducibility, validity and the effect of energy mis-reporting

Una M. Devlin; Breige A. McNulty; Anne P. Nugent; Micheal J. Gibney

doi:10.1017/S0029665112000729

The use of cluster analysis to derive dietary patterns: methodological considerations, reproducibility, validity and the effect of energy mis-reporting

Published online by Cambridge University Press: 06 August 2012

Anne P. Nugent and

Una M. Devlin*: Affiliation:
UCD Institute of Food and Health, University College Dublin, Belfield, Dublin 4, Republic of Ireland
Breige A. McNulty: Affiliation:
UCD Institute of Food and Health, University College Dublin, Belfield, Dublin 4, Republic of Ireland
Anne P. Nugent: Affiliation:
UCD Institute of Food and Health, University College Dublin, Belfield, Dublin 4, Republic of Ireland
Micheal J. Gibney: Affiliation:
UCD Institute of Food and Health, University College Dublin, Belfield, Dublin 4, Republic of Ireland
*: *Corresponding author: Una M. Devlin, emailuna.devlin@ucd.ie

Article contents

Abstract
Methodological considerations
Dietary patterns in healthy population groups
Dietary patterns and associations with chronic diseases
Dietary patterns and associations with nutritional biomarkers
Reproducibility and validity
Energy mis-reporting
Conclusion and future work
References

Rights & Permissions

Abstract

Over the last three decades, dietary pattern analysis has come to the forefront of nutritional epidemiology, where the combined effects of total diet on health can be examined. Two analytical approaches are commonly used: a priori and a posteriori. Cluster analysis is a commonly used a posteriori approach, where dietary patterns are derived based on differences in mean dietary intake separating individuals into mutually exclusive, non-overlapping groups. This review examines the literature on dietary patterns derived by cluster analysis in adult population groups, focusing, in particular, on methodological considerations, reproducibility, validity and the effect of energy mis-reporting. There is a wealth of research suggesting that the human diet can be described in terms of a limited number of eating patterns in healthy population groups using cluster analysis, where studies have accounted for differences in sex, age, socio-economic status, geographical area and weight status. Furthermore, patterns have been used to explore relationships with health and chronic diseases and more recently with nutritional biomarkers, suggesting that these patterns are biologically meaningful. Overall, it is apparent that consistent trends emerge when using cluster analysis to derive dietary patterns; however, future studies should focus on the inconsistencies in methodology and the effect of energy mis-reporting.

Keywords

Dietary patterns Cluster analysis Adults Energy mis-reporting

Type: Irish Section Postgraduate Symposium
Information: Proceedings of the Nutrition Society , Volume 71 , Issue 4 , November 2012 , pp. 599 - 609

DOI: https://doi.org/10.1017/S0029665112000729 [Opens in a new window]
Copyright: Copyright © The Authors 2012

hcy: homocysteine
%TE food: percentage total energy contribution from food

With the global prevalence of chronic diseases increasing, it is now widely accepted that diet has an important role to play, as many of these diseases may have a nutritional base or may be promoted by inappropriate dietary habits⁽ ¹ ^, Reference Kafatos and Codrington ² ⁾. Traditionally, nutritional epidemiology focused on a detailed examination of single nutrient intake; however, over the last three decades research has moved towards examining the combined effect of total food intake. This significant shift reflects a need to explore the complexity of individual total dietary intake and it is hoped that this alternative approach will help to increase our understanding of the role of diet in chronic diseases and improve the effectiveness of public health recommendations⁽ Reference Jacques and Tucker ³ ⁾. Furthermore, it has been recognised that individuals consume diverse diets consisting of many foods containing complex combinations of nutrients and it is likely that these nutrients will interact with each other, an effect that may be confounded within the single nutrient approach⁽ Reference Hu ⁴ ⁾.

One way to examine the combined effect of total food intake on health is to derive dietary patterns. Dietary patterns are typically characterised on the basis of habitual food intake and can be described as a measure of usual intake of food combination in individuals and groups where nutritional variables are grouped according to some criterion of nutritional status⁽ Reference Tucker ⁵ ⁾. Two analytical approaches are commonly used: a priori and a posteriori. The a priori approach is a theoretically driven method that focuses on constructing dietary scores using a predefined combination of diet quality based on published dietary guidelines⁽ Reference Kennedy, Ohls and Carlson ⁶ ⁾. The a posteriori approach is an exploratory method that uses multivariate statistical techniques to derive dietary patterns where large datasets representing total food intake are aggregated and reduced to smaller datasets to summarise total dietary exposure⁽ Reference Kant ⁷ ⁾. Factor analysis and cluster analysis are two a posteriori methods commonly used to derive dietary patterns in nutritional epidemiology. In factor analysis, linear combinations (factors) are created based on correlations between dietary intakes where each individual receives a score for the derived factors; however, these scores are difficult to interpret as an individual can belong to more than one factor⁽ Reference Hearty and Gibney ⁸ ⁾. Cluster analysis, on the other hand, offers the advantage of deriving dietary patterns which represent homogenous groups that can be related to other variables⁽ Reference Hu ⁴ ⁾.

In studies where factor and cluster analysis were used simultaneously to derive dietary patterns, results have shown good evidence of comparability. Two studies have indicated that there is a high resemblance between some of the clusters and factors identified due to similarities in food types⁽ Reference Crozier, Robinson and Borland ⁹ ^, Reference Cunha, Almeida and Pereira ¹⁰ ⁾. In addition, one study reported that three patterns dominated irrespective of which method was used⁽ Reference Hearty and Gibney ⁸ ⁾. Dietary patterns derived using both methods have also been compared with plasma lipid markers. Newby et al. reported that a cluster and a factor dominated by healthy foods were both inversely associated with plasma TAG, whereas a cluster and a factor dominated by alcohol were both directly associated with HDL and cholesterol⁽ Reference Newby, Muller and Tucker ¹¹ ⁾. Although both methods are directly comparable, it has been suggested that the choice of the dietary pattern analysis technique should depend on the type of outcome that is needed from the dataset as each method approaches the data from different angles and thus answers different questions⁽ Reference Hearty and Gibney ⁸ ⁾. Other authors have suggested that the ultimate way to approach dietary pattern analysis is to use a combination of factor and cluster analysis as complementary approaches⁽ Reference Hoffman, Schulze and Boeing ¹² ⁾ in order to give a better perspective and understanding of dietary habits⁽ Reference Engeset, Alsaker and Ciampi ¹³ ⁾.

Clustering methods separate individuals into mutually exclusive, non-overlapping clusters, where an individual can belong to one cluster only, therefore representing a unique cluster or dietary pattern⁽ Reference Hearty and Gibney ⁸ ⁾. Differences between clusters are based on mean dietary intake of each individual, where the dietary patterns derived are specific to individuals within each cluster and each cluster has a specific food and nutrient composition⁽ Reference Newby and Tucker ¹⁴ ⁾. Clusters are then labelled based on shared characteristics of dietary intake, where individuals with similar dietary intake will cluster together, away from others in dissimilar clusters. Dietary input variables can include nutrients, foods or food groups or a combination of all three⁽ Reference Togo, Osler and Sorensen ¹⁵ ⁾. However, within the literature, food groups are most commonly used⁽ Reference Hearty and Gibney ⁸ ^, Reference Pryer, Nichols and Elliott ¹⁶ ^– Reference Winkvist, Hornell and Hallmans ¹⁹ ⁾. One reason for using food groups as the preferred dietary input variable is that these groups can represent total dietary intake, accounting for any interaction between nutrients within the groups. Furthermore, various algorithms can also be used in the clustering procedure. The principle of all clustering algorithms is to calculate the Euclidean distance, which measures the distance between each dietary variable consumed together by similar individuals. Individuals are then grouped into clusters where the distance is maximised between the defined centre of each cluster from others, while the distance is minimised between any single individual and the centre of their closest cluster⁽ Reference Tucker ⁵ ⁾. Of these algorithms, the k-means approach is most frequently used⁽ Reference Hearty and Gibney ⁸ ^, Reference Winkvist, Hornell and Hallmans ¹⁹ ^– Reference Villegas, Salim and Collins ²¹ ⁾, although this algorithm has limitations which will be discussed later. This review examines the literature on dietary patterns derived by cluster analysis in adult population groups only, focusing in particular on methodological considerations, reproducibility, validity and the effect of energy mis-reporting.

Methodological considerations

Many dietary assessment tools are available to researchers to estimate dietary intake of an individual or a population group. These methods can be split into two categories: one is the prospective method, i.e. those that record data at the time of eating (dietary records) and the other is the retrospective method, i.e. those that collect data about the diet eaten in the past (diet histories, FFQ and dietary recalls)⁽ ²² ⁾. Within dietary pattern analysis, consideration should be given to the most appropriate method, as some may provide more ‘favourable’ results than others as several may not accurately identify the usual food pattern⁽ Reference Moeller, Reedy and Millen ²³ ⁾. The impact of the dietary assessment methods used in cluster analysis will be discussed later in the review.

In recent years, scrutiny of the statistical methodology concerning cluster analysis has been undertaken by many researchers, due to its highly exploratory nature. One issue of concern is researcher bias, which can ultimately influence the grouping of the dietary variables and the number of clusters in the final solution⁽ Reference Hearty and Gibney ⁸ ⁾. The frequently used k-means approach has a subjective element as the number of clusters needs to be predefined prior to analysis. To overcome this problem varying cluster solutions are usually run and then the clusters are examined for the best fit using cross-validation methods. Two approaches that can be used to examine the final cluster solution are to calculate the within cluster variance ratio⁽ Reference Lo Siou, Yasui and Csizmadi ²⁰ ^, Reference Michels and Schulze ²⁴ ^, Reference Anderson, Harris and Houston ²⁵ ⁾ or to generate scree plots⁽ Reference Wirfalt and Jeffery ²⁶ ^, Reference Bailey, Mitchell and Miller ²⁷ ⁾, where higher ratios indicate a better separation of clusters. It has been suggested, however, that there is no gold standard for determining the number of clusters⁽ Reference Togo, Osler and Sorensen ¹⁵ ⁾. In many cases, the appropriate number of clusters is determined by the author, taking into consideration those which are clearly distinct and nutritionally meaningful, while also maintaining a reasonable sample size⁽ Reference Anderson, Harris and Houston ²⁵ ⁾. In a similar way, there is no gold standard concerning the format of the dietary variable for the clustering procedure. Preferably, the dietary variables should be grouped to suitably represent the dataset to increase the likelihood of identifying sensible dietary patterns. When using food groups as the dietary variable, it has been suggested that food items consumed need to be aggregated into a limited number of groups avoiding the exclusion of subjects due to missing data⁽ Reference Berg, Lappas and Strandhagen ²⁸ ⁾. Previous studies have joined food groups together based on similarities in food group types⁽ Reference Hearty and Gibney ⁸ ^, Reference Pryer, Nichols and Elliott ¹⁶ ^, Reference Haveman-Nies, Tucker and de Groot ¹⁸ ⁾ or on nutrient content and culinary preference⁽ Reference Winkvist, Hornell and Hallmans ¹⁹ ^, Reference Newby, Muller and Hallfrisch ²⁹ ^, Reference O'Sullivan, Gibney and Brennan ³⁰ ⁾. In most cases authors have also differentiated between food groups, e.g. low- or high-energy and low- or high-fat⁽ Reference Hearty and Gibney ⁸ ^, Reference Pryer, Nichols and Elliott ¹⁶ ^, Reference Winkvist, Hornell and Hallmans ¹⁹ ^, Reference Newby, Muller and Hallfrisch ²⁹ ^, Reference O'Sullivan, Gibney and Brennan ³⁰ ⁾. Food groups are usually presented using three different methods (1) the frequency of the food consumed (servings)⁽ Reference Millen, Quatromoni and Gagnon ¹⁷ ^, Reference Winkvist, Hornell and Hallmans ¹⁹ ⁾, (2) the portion size of the food consumed (grams)⁽ Reference Hearty and Gibney ⁸ ^, Reference Villegas, Salim and Collins ²¹ ⁾ or (3) the percentage total energy contribution from food (%TE food)⁽ Reference Hearty and Gibney ⁸ ^, Reference O'Sullivan, Gibney and Brennan ³⁰ ^, Reference Anderson, Harris and Tylavsky ³¹ ⁾. Few studies have examined the impact of the methodological differences between these different methods. One author has proposed that when using the %TE food method, differences in energy needs due to sex, age, body weight and level of physical activity can be accounted for⁽ Reference Anderson, Harris and Houston ²⁵ ⁾. One study that compared two methods (servings and %TE food) reported similar clusters for food groups high in energy. However, clusters arising from %TE food were less likely to differentiate between low-energy foods such as fruit and vegetables. The authors therefore concluded that the servings approach best represented the patterns⁽ Reference Bailey, Gutschall and Mitchell ³² ⁾. In contrast, a second study that clustered using the grams and %TE food methods showed that the %TE food method best characterised the patterns, which were fully interpretable based on their contributing food group⁽ Reference Hearty and Gibney ⁸ ⁾. To the best of our knowledge no studies have examined the results obtained comparing all three methods in one dataset, therefore, it is difficult to make firm conclusions on the best method to use. One way to overcome the issue of high- or low-energy food groups affecting the patterns is to standardise the variables prior to analysis ensuring that variables with large variances which may have greater effects on resulting patterns than those with small variances can be accounted for⁽ Reference Michels and Schulze ²⁴ ⁾. Ideally, by standardising the input variables, all food groups will have equal influence on the clustering procedure. Research carried out by Wirfalt et al. examining the effect of standardising variables found that the distribution of individuals was more evenly spread and differences in nutrient intake across patterns were improved when using the un-standardised approach⁽ Reference Wirfält, Mattisson and Gullberg ³³ ⁾. Furthermore, in a follow-up study, Wirfalt reported that the transformation of variables by standardisation may have an effect on the dietary patterns identified as low-energy foods may be given equal weights to high-energy foods, which may represent poor dietary patterns⁽ Reference Wirfalt, Hedblad and Gullberg ³⁴ ⁾. Overall, there is insufficient evidence regarding the standardisation procedure and more research is needed.

Dietary patterns in healthy population groups

Throughout the last three decades many studies have identified meaningful dietary patterns in healthy population groups using cluster analysis as the patterning method. Initial studies focused on identifying patterns where nutrient intakes were inadequate v. published dietary recommendations, thus acknowledging that cluster analysis is a useful tool for identifying groups of people who may be at nutritional risk⁽ Reference Hulshof, Wedel and Lowik ³⁵ ^, Reference Greenwood, Cade and Draper ³⁶ ⁾. Later studies have accounted for the influence of sex, age, socio-economic status, geographical area and weight status. A range of dietary assessment methods were used including FFQ, dietary recalls and diet records. Only one study used nutrients as the clustering variable⁽ Reference Hulshof, Wedel and Lowik ³⁵ ⁾, whereas another used meal type⁽ Reference Holmback, Ericson and Gullberg ³⁷ ⁾; therefore, food groups were predominantly used and were presented using servings⁽ Reference Crozier, Robinson and Borland ⁹ ^, Reference Winkvist, Hornell and Hallmans ¹⁹ ^, Reference Greenwood, Cade and Draper ³⁶ ^, Reference Margetts, Thompson and Speller ³⁸ ^– Reference James ⁴⁴ ⁾, grams⁽ Reference Engeset, Alsaker and Ciampi ¹³ ^, Reference Pryer, Nichols and Elliott ¹⁶ ^, Reference Villegas, Salim and Collins ²¹ ^, Reference Delisle, Vioque and Gil ⁴⁵ ^– Reference Bamia, Orfanos and Ferrari ⁴⁷ ⁾ and %TE food⁽ Reference Hearty and Gibney ⁸ ^, Reference Haveman-Nies, Tucker and de Groot ¹⁸ ^, Reference Anderson, Harris and Tylavsky ³¹ ⁾. It is noteworthy that no matter which dietary assessment method or clustering variable was used, similar dietary patterns have been found across a collection of studies in healthy population groups.

In all studies, labels or names are normally assigned to characterise each pattern, based on the dietary intake that contributes relatively greater proportions⁽ Reference Newby, Muller and Tucker ¹¹ ^, Reference Anderson, Harris and Tylavsky ³¹ ^, Reference Quatromoni, Copenhafer and Demissie ⁴⁸ ⁾. Two commonly used terms are ‘healthy’ patterns characterised by the consumption of fruits and vegetables and ‘unhealthy’ patterns characterised by the consumption of foods high in fat and salt⁽ Reference Crozier, Robinson and Borland ⁹ ^, Reference Anderson, Harris and Tylavsky ³¹ ^, Reference Margetts, Thompson and Speller ³⁸ ^, Reference Martikainen, Brunner and Marmot ³⁹ ⁾. ‘Healthy’ patterns can also be referred to as ‘prudent’, while ‘unhealthy’ patterns can also be referred to as ‘western’ or ‘traditional’⁽ Reference Hearty and Gibney ⁸ ^, Reference Villegas, Salim and Collins ²¹ ^, Reference Delisle, Vioque and Gil ⁴⁵ ⁾. A strength of these studies is large sample size (n > 1379)⁽ Reference Hearty and Gibney ⁸ ^, Reference Villegas, Salim and Collins ²¹ ^, Reference Hulshof, Wedel and Lowik ³⁵ ^, Reference Margetts, Thompson and Speller ³⁸ ^, Reference Martikainen, Brunner and Marmot ³⁹ ⁾ (only one study of sample size n 213⁽ Reference Delisle, Vioque and Gil ⁴⁵ ⁾) though many were carried out in female⁽ Reference Crozier, Robinson and Borland ⁹ ^, Reference Greenwood, Cade and Draper ³⁶ ⁾ or older adults⁽ Reference Anderson, Harris and Tylavsky ³¹ ⁾ only. In one study of London adults aged 39–63 years, differences were reported in the type of ‘healthy’ patterns identified by using terms such as ‘very healthy’ or ‘moderately healthy’, similarly for ‘unhealthy’ patterns⁽ Reference Martikainen, Brunner and Marmot ³⁹ ⁾. Other descriptive labels used to characterise dietary patterns relate to ‘high- or low-nutrient density’⁽ Reference Millen, Quatromoni and Copenhafer ⁴⁰ ^, Reference Ledikwe, Smiciklas-Wright and Mitchell ⁴³ ⁾ or ‘glycaemic level’⁽ Reference Davis, Miller and Mitchell ⁴² ⁾; however, these findings are limited to three US studies in either females or older adults. Furthermore, many studies have examined differences in socio-economic status according to dietary patterns, reporting that typically ‘healthy’ patterns are associated with increased socio-economic status in males and females⁽ Reference Engeset, Alsaker and Ciampi ¹³ ^, Reference Villegas, Salim and Collins ²¹ ^, Reference Greenwood, Cade and Draper ³⁶ ^, Reference Martikainen, Brunner and Marmot ³⁹ ^, Reference Pryer, Cook and Shetty ⁴⁶ ⁾.

Significant differences among dietary patterns by sex have also been reported, highlighting the need to examine males and females separately in healthy population groups⁽ Reference Wirfalt and Jeffery ²⁶ ^, Reference Tucker, Dallal and Rush ⁴⁹ ⁾. In a study carried out in a representative sample of UK adults aged 16–64 years, it was reported that dietary patterns differ by sex⁽ Reference Pryer, Nichols and Elliott ¹⁶ ⁾, but these differences were lost in an older cohort aged 65+ years of the same study⁽ Reference Pryer, Cook and Shetty ⁴⁶ ⁾. Confirmation that dietary patterns differ by sex was reported in a cohort of older Italian adults aged 65+ years⁽ Reference Correa Leite, Nicolosi and Cristina ⁴¹ ⁾, Swedish adults aged 30–60 years⁽ Reference Winkvist, Hornell and Hallmans ¹⁹ ⁾, African–American adults aged 18+ years⁽ Reference James ⁴⁴ ⁾ and American adults aged 20–70 years⁽ Reference Millen, Quatromoni and Gagnon ¹⁷ ⁾. These studies suggest that dietary patterns differ by sex and this should therefore be accounted for in public health recommendations. Few studies have reported differences among age across dietary patterns⁽ Reference Pryer, Nichols and Elliott ¹⁶ ^, Reference Hulshof, Wedel and Lowik ³⁵ ^, Reference Millen, Quatromoni and Copenhafer ⁴⁰ ^, Reference Delisle, Vioque and Gil ⁴⁵ ⁾ and to the best of our knowledge no studies have examined the effect of age groups on dietary patterns in a large representative sample.

Dietary pattern analysis is also influenced by geography. Within large cohorts of older European adults, specific dietary patterns have been found to represent those living in Northern and Southern regions where one of these patterns is usually considered as more healthy⁽ Reference Haveman-Nies, Tucker and de Groot ¹⁸ ^, Reference Correa Leite, Nicolosi and Cristina ⁴¹ ^, Reference Bamia, Orfanos and Ferrari ⁴⁷ ^, Reference Schroll, Carbajal and Decarli ⁵⁰ ^). Differences have also been found at a national level; in a large study of Norwegian females aged 41–56 years, one dietary pattern was dominated by those living in a certain region of Norway⁽ Reference Engeset, Alsaker and Ciampi ¹³ ⁾. These results could therefore indicate that dietary patterns are influenced by geography and are associated with cultural perceptions, beliefs and attitudes about foods which can ultimately affect food choice. Although these studies are of large sample sizes, a limitation is that they are limited to groups of older adults and female populations only.

Three studies have also examined differences in weight status according to dietary patterns in healthy population groups. These studies have reported that BMI of individuals is significantly different across all patterns after controlling for age, sex, exercise and total energy intake in US adults (mean age 37 years)⁽ Reference Wirfalt and Jeffery ²⁶ ⁾ and UK adults aged 16–64 years⁽ Reference Pryer, Nichols and Elliott ¹⁶ ⁾. In the US study, the dietary pattern with the highest mean BMI was found to be predominantly male and had high intake of soft drinks. In contrast, in a large sample of Swedish adults aged 47–68 years, Holmback reported that the ‘fruit’ pattern had the greatest proportion of overweight individuals⁽ Reference Holmback, Ericson and Gullberg ³⁷ ⁾. These differences may perhaps be explained by the different types of clustering variables used (servings, %TE and meal type); however, further research is required.

The earlier studies in general show consistent findings across dietary patterns in healthy population groups. One issue of concern is that few have accounted for energy mis-reporters, with only two studies excluding such reporters from their analysis. This issue will be discussed later in the review. It is evident that from these studies, literature is accumulating in relation to using cluster analysis to derive dietary patterns taking into account sex, age, socio-economic status, geographical area and weight status; however, the lack of consensus of some studies warrants further research in this area.

Dietary patterns and associations with chronic diseases

The effect of diet on chronic diseases is a key consideration in nutritional epidemiology. By considering the effect of total diet using dietary pattern analysis, it is believed that various patterns may influence the development and possibly increase the risk of many diet related chronic diseases over time. An overview of the literature examining the association of dietary patterns and chronic diseases is outlined in Table 1 and reviewed briefly later.

Table 1. Associations between dietary patterns and chronic diseases

L, longitudinal; WC, waist circumference; P, prospective; CC, case–control; CS, cross-sectional; hsCRP, high-sensitivity C-reactive protein; WHR, waist:hip ratio; Lp-Pla2, lipoprotein.

* Disease v. control (CC studies only).

† Lowest contribution.

As previously discussed, evidence has suggested that weight status can differ according to dietary patterns in cross-sectional cohorts⁽ Reference Pryer, Nichols and Elliott ¹⁶ ^, Reference Wirfalt and Jeffery ²⁶ ^, Reference Holmback, Ericson and Gullberg ³⁷ ⁾. In studies, specifically examining the risk of obesity, it has been reported that in comparison with ‘healthy’ patterns and after adjustments for confounders, patterns that are considered ‘less healthy’ have a significantly larger BMI and waist circumference⁽ Reference Newby, Muller and Hallfrisch ²⁹ ^, Reference Lin, Bermudez and Tucker ⁵¹ ⁾, higher total percentage body fat (males only)⁽ Reference Anderson, Harris and Houston ²⁵ ⁾ and are associated with an increased risk of overweight (14–17%)⁽ Reference Quatromoni, Copenhafer and D'Agostino ⁵² ^, Reference Flores, Macias and Rivera ⁵³ ⁾ and obesity (20%)⁽ Reference Flores, Macias and Rivera ⁵³ ⁾. Interestingly, Carrera et al. found that no one pattern was associated with increased risks of obesity as it was reported that BMI and waist circumference were high among all patterns identified⁽ Reference Carrera, Gao and Tucker ⁵⁴ ⁾. Overall, arising from these large studies involving a wide variety of age groups, the consensus appears that subjects in ‘healthy’ patterns following current dietary recommendations are at lesser risk of becoming overweight or obese. Furthermore, it has been suggested that due to the complexity of total diet, future studies should consider the influence of total food volume on energy balance⁽ Reference Newby, Muller and Hallfrisch ²⁹ ⁾.

Dietary patterns have also been associated with CVD risk mainly in prospective studies. As before, ‘healthy’ patterns have been shown to be protective, lowering the risk of subclinical heart disease⁽ Reference Millen, Quatromoni and Nam ⁵⁵ ⁾ and carotid atherosclerosis⁽ Reference Millen, Quatromoni and Nam ⁵⁶ ⁾ by 4% and are favourably associated with anthropometric, blood pressure and blood lipid values⁽ Reference Berg, Lappas and Strandhagen ²⁸ ⁾ and with markers of inflammation⁽ Reference Hlebowicz, Persson and Gullberg ⁵⁷ ⁾ in comparison with the other patterns identified. However, one study relied on the analysis of non-fasting blood samples⁽ Reference Berg, Lappas and Strandhagen ²⁸ ⁾. In one case–control study, food groups associated with increased risk of acute myocardial infarction after adjustments for confounders were a ‘red meat and alcohol’ pattern in males and females and a ‘low fruit and vegetables’ pattern in females only, where the ‘red meat and alcohol’ pattern had significantly higher risks of CVD risk markers than those in a ‘healthy’ pattern⁽ Reference Oliveira, Rodriguez-Artalejo and Gaio ⁵⁸ ⁾. Interestingly, in one study no one pattern was associated with increased CVD risk although a ‘sweets’ pattern, showed a protective effect against CVD risk factors as significant associations were reported among HDL and elevated systolic blood pressure⁽ Reference Lopez, Rice and Weddle ⁵⁹ ⁾. These results provide support for the protective effects of ‘healthy’ dietary patterns against CVD.

Dietary patterns have also been linked to risk factors for diabetes. In one study, where 67 and 33% of subjects had normal and impaired glucose tolerance, respectively, it was reported that the ‘white bread’ pattern was associated with poorest insulin sensitivity and adiposity levels, whereas a ‘wine’ and ‘dark bread’ pattern was associated with improving these markers⁽ Reference Liese, Schulz and Moore ⁶⁰ ⁾. In non-diabetic cohorts, it has been reported that a pattern that is high in dairy products and low in staple foods is associated with a lower prevalence of type-2 diabetes⁽ Reference Villegas, Yang and Gao ⁶¹ ⁾, and a ‘healthy’ pattern improves insulin concentration and anthropometric profiles⁽ Reference Liu, McKeown and Newby ⁶² ⁾. One study also reported that a pattern with high intake of animal and soyabean products had a higher prevalence of glucose tolerance abnormalities, after adjustment for confounders⁽ Reference He, Ma, Zhai and Li ⁶³ ⁾. The cross-sectional study design of most of these studies is a limitation as information on diet (mainly collected using FFQ) and indicators of diabetes were collected at one specific point in time. This highlights the need for more prospective studies to be carried out in order to determine how the dietary patterns affect diabetes over a certain time frame.

Specific dietary patterns have also been associated with cancer risk, mainly in case–control studies. As before, ‘healthy’ dietary patterns were shown to have protective effects, and to reduce the risk of oesophageal cancer⁽ Reference Chen, Ward and Graubard ⁶⁴ ⁾, gastric cancer⁽ Reference Bastos, Lunet and Peleteiro ⁶⁵ ⁾, ovarian cancer⁽ Reference Edefonti, Randi and Decarli ⁶⁶ ⁾ and lung cancer in subjects who smoke⁽ Reference Tsai, McGlynn and Hu ⁶⁷ ⁾. ‘Unhealthy’ patterns increased the risk of oesophageal and colorectal cancer⁽ Reference Chen, Ward and Graubard ⁶⁴ ^, Reference Rouillier, Senesse and Cottet ⁶⁸ ⁾ and one pattern with high intake of bread and pasta was unfavourable for breast and ovarian cancer risk⁽ Reference Edefonti, Randi and Decarli ⁶⁶ ⁾. Although these results have shown patterns that may increase cancer risk and others that are protective, a difficulty in epidemiological studies of diet and cancer is lack of specific biomarkers for the disease. Further research needs to be carried out to establish environmental factors that may increase cancer risk.

The effect of dietary patterns on a combination of chronic diseases has also been evaluated. In one study, it was reported that after 16 years of follow-up, levels of overweight and obesity increased from 67 to 76% and 81 to 91%, respectively, whereas the rates of diabetes nearly doubled from 10 to 18% in the total population⁽ Reference Millen, Quatromoni and Pencina ⁶⁹ ⁾. No significant difference in risk was found according to dietary patterns, as it was reported that chronic disease risk factors were high in all patterns; however, the sample consisted of only males living in one suburban community of the US. In another study, a pattern characterised by the consumption of wholemeal bread, fruits, vegetables, pasta and rice lowered cancer mortality rate and myocardial infarction rates and a pattern characterised by wholemeal bread, fruits, vegetables and polyunsaturated margarine lowered the incidence of obesity⁽ Reference Brunner, Mosdol and Witte ⁷⁰ ⁾. This provides extra support for the health promoting effects of healthy diets.

Dietary patterns have also been explored in relation to the metabolic syndrome. In one study of Italian non-diabetic adults, the highest prevalence of the metabolic syndrome was found in the ‘starch’ and ‘animal products’ patterns and the lowest prevalence found in a ‘vegetable oil and fat spread’ pattern and a ‘vegetable and fruit’ pattern⁽ Reference Leite and Nicolosi ⁷¹ ⁾. Furthermore, in a Swedish study, it was reported that in males the ‘many foods and drinks’ and the ‘white bread’ pattern and in females the ‘white bread’ pattern only had increased risks of metabolic risk factors⁽ Reference Wirfalt, Hedblad and Gullberg ³⁴ ⁾. Song et al. also found increased risks of metabolic risk factors, although this time with a ‘meat and alcohol’ pattern, where it was also reported that a ‘traditional’ pattern that was characterised by high intake of white rice and vegetables had a 23% lower likelihood of having low HDL-cholesterol⁽ Reference Song and Joung ⁷² ⁾. One limitation of these studies is that divergent definitions were used to define the metabolic syndrome prior to analysis.

Few studies have examined the association of dietary patterns with a risk of osteoporosis. In one study an association with bone mineral density was reported, as it was demonstrated that a diet consisting of high intake of fruits, vegetables and breakfast cereals and limited in less nutrient dense foods may contribute to better bone mineral density in both males and females, though this association was not as strong in females, as levels of bone mineral density were fairly equal among all patterns identified⁽ Reference Tucker, Chen and Hannan ⁷³ ⁾.

Overall strength of these studies includes large sample sizes where a wide variety of clustering variables were also used; nevertheless as with healthy population groups the issue of energy mis-reporting is overlooked, as few authors have excluded these mis-reporters from their analysis. Findings mostly from cross-sectional studies have linked dietary patterns and numerous foods associated with these patterns to chronic diseases; however, further research including targeted nutrition interventions is warranted to fully assess the relationship taking into account all other environmental factors that may influence the disease. As it is well known that the progression of these chronic diseases gradually worsens over time, future studies should also consider the importance of prospective and case–control studies, to help advancements in the area.

Dietary patterns and associations with nutritional biomarkers

More recently, cluster analysis has been used firstly to derive dietary patterns, and thereafter differences in nutritional biomarkers explored in an attempt to examine the relationship between the two. It is hoped that this will enhance the knowledge base as to whether these dietary patterns are biologically meaningful.

In addition to the earlier studies on markers of lipid metabolism and inflammation, dietary patterns have been associated with markers of homocysteine (hcy) and vitamin B status. Hcy is an important and well-recognised biomarker in nutritional epidemiology as high levels have been linked to increasing the risk of CVD⁽ Reference McNulty and Scott ⁷⁴ ⁾. In a sample of 119 Chinese adults aged 35–49 years, it was found that relative to the ‘fruit and milk’ pattern, those subjects consuming a ‘refined cereals’ pattern were 4 and 5·2 times more likely to have high hcy and low vitamin B₁₂ concentration, respectively⁽ Reference Gao, Yao and McCrory ⁷⁵ ⁾. Another study investigated the levels of folate and hcy in a sample of 354 American males aged 21–88 years, following the folic acid fortification programme in the US. Within this study it was reported that plasma folate increased in all three dietary patterns identified, although plasma hcy decreased in the low fruit and vegetable pattern only⁽ Reference Knoops, Spiro and de Groot ⁷⁶ ⁾. Limitations of these studies include small sample sizes where one study was limited to males only.

A study has also linked dietary patterns to metabolic profiles in a small sample of Irish adults aged 18–63 years. Three dietary patterns were identified, and when compared with metabolic profiles (using metabolomics⁽ Reference Brennan ⁷⁷ ⁾), it was reported that food groups within patterns could be associated with concentration of metabolites⁽ Reference O'Sullivan, Gibney and Brennan ³⁰ ⁾. A pattern that had high intake of fruits and vegetables and a pattern that had high intake of red meat were associated with phenylacetylglutamine and O-acetylcarnitine, respectively. Although one major limitation of this study is its small sample size, the findings of this study underline the ability of metabolomics to identify novel biomarkers of dietary intake. Future studies should consider advancing these results in larger studies, in order to strengthen findings.

Reproducibility and validity

Although dietary pattern analysis has become of major interest in the field of nutritional epidemiology, the reproducibility and validity of the patterns derived are not clear, and few studies have fully evaluated this issue. As part of the Framingham Nutrition Studies, dietary patterns were identified for adult males and females aged 18–76 years separately. Five patterns were found to best represent each sex, with some patterns being associated with healthier nutrient profiles, while others were associated with disease risk⁽ Reference Millen, Quatromoni and Gagnon ¹⁷ ⁾. The internal validity of the five dietary patterns identified for women was assessed and it was found that 80% of the sample was correctly classified when using a discriminant analysis technique to measure the stability of the patterns⁽ Reference Quatromoni, Copenhafer and Demissie ⁴⁸ ⁾. Furthermore, the authors used the results of this study to derive a statistical scoring system or algorithm that would classify a subject from a newer Framingham Nutrition Study into one of the previously identified patterns for males and females. Using the scoring system it was reported that 80% of new males and females under study were correctly classified into one of the previous patterns already identified⁽ Reference Pencina, Millen and Hayes ⁷⁸ ⁾. The results from this large population based study show that dietary patterns are reproducible across similar population groups, although it should be noted that reproducibility does not guarantee validity. As mentioned previously, cluster analysis can be carried out using different algorithms; however, to date just one study has investigated the differences between these. Lo Siou et al. reported that when the clustering variable was presented as the %TE food method, the k-means approach (in comparison with Ward's and flexible beta methods) had the highest reproducibility of cluster solutions for Canadian adults aged 35–69 years⁽ Reference Lo Siou, Yasui and Csizmadi ²⁰ ⁾. When the sample was split by sex, a strong relationship was only seen for males; similar results were not found in females, therefore, highlighting the need for further research in the area. One study has also evaluated the influence of the dietary assessment method used (FFQ and 3-d diary), by comparing the classification rate of subjects into the same dietary patterns using either method, where it was found that four out of ten subjects were misclassified⁽ Reference Bountziouka, Tzavelas and Polychronopoulos ⁷⁹ ⁾. Furthermore, the question is raised as to what is the appropriate threshold for acceptable correct classification. As few studies have assessed both reproducibility and validity, it is clear that there is insufficient evidence to make firm conclusions; therefore highlighting the need for further research.

Energy mis-reporting

Energy mis-reporting is a major issue in dietary surveys⁽ ²² ⁾. Research has indicated consistent errors in self-reported dietary intake, using the available dietary assessment methods⁽ Reference Black, Prentice and Goldberg ⁸⁰ ⁾. Dietary intake is commonly over- or under-reported leading to implausible energy intake in population groups, where the latter may be considered the most detrimental to research studies. Under-reporting of dietary intake can happen in three ways, where subjects can (1) deny ever eating the food at all; (2) fail to report the correct portion size consumed or (3) fail to report how many times the food is actually consumed. Approaches to identify under-reporters are to calculate the ratio of energy intake to BMR where cut-off values are applied described by Goldberg et al.⁽ Reference Goldberg, Black and Jebb ⁸¹ ⁾ or by using the gold standard doubly labelled water technique⁽ Reference Livingstone and Black ⁸² ⁾. In studies of under-reporting, it has been found that females, overweight and obese subjects are more likely to under-report their dietary intake⁽ Reference Pryer, Vrijheid and Nichols ⁸³ ^– Reference Mattisson, Wirfalt and Aronsson ⁸⁶ ⁾. This is no exception in dietary pattern analysis studies as significant differences have been reported among males and females⁽ Reference Holmback, Ericson and Gullberg ³⁷ ^, Reference Pryer, Cook and Shetty ⁴⁶ ⁾ and healthy dietary patterns have been found to contain the greatest proportion of females and overweight subjects⁽ Reference Winkvist, Hornell and Hallmans ¹⁹ ^, Reference Holmback, Ericson and Gullberg ³⁷ ⁾. In contrast, Pryer et al. found that there was no difference in the proportion of under-reporters across the patterns⁽ Reference Pryer, Nichols and Elliott ¹⁶ ⁾, although Martikainen et al. demonstrated that differences in the numbers of under-reporters exist across all patterns; however, these differences are not systematically associated with good or bad diets⁽ Reference Martikainen, Brunner and Marmot ³⁹ ⁾. Other studies have found that under-reporting of energy intake is not uniformly distributed among dietary patterns⁽ Reference Hornell, Winkvist and Hallmans ⁸⁷ ^, Reference Scagliusi, Ferriolli and Pfrimer ⁸⁸ ⁾. In one study the highest prevalence of under-reporting fell among those in the healthy pattern, where although this study measured under-reporting using the doubly labelled water method, the sample consisted of females only aged 18–57 years⁽ Reference Scagliusi, Ferriolli and Pfrimer ⁸⁸ ⁾.

To the best of our knowledge, no studies have examined the effects of energy mis-reporting by identifying patterns for adequate and under-reporters separately. Two studies have although demonstrated that patterns generated following the removal of under-reporters are relatively similar in comparison with patterns of the total sample (including adequate and under-reporters)⁽ Reference Winkvist, Hornell and Hallmans ¹⁹ ^, Reference Martikainen, Brunner and Marmot ³⁹ ⁾. In one study 70% of participants fall into the same pattern regardless of their reporting status⁽ Reference Martikainen, Brunner and Marmot ³⁹ ⁾. The limitations of both these studies are that the authors have only briefly acknowledged under-reporting and there is a lack of published statistical analysis. Similarly, in another study patterns were identified in the total population and adequate reporters, where it was shown that the correlation between energy intake and weight status was improved for females only after removal of under-reporters⁽ Reference Bailey, Mitchell and Miller ⁸⁹ ⁾. Although it is not clear the effect energy mis-reporting may have on dietary pattern analysis, only two studies have removed such reporters from their analysis in healthy population groups⁽ Reference Engeset, Alsaker and Ciampi ¹³ ^, Reference Villegas, Salim and Collins ²¹ ⁾ and eight studies in chronic disease groups⁽ Reference Anderson, Harris and Houston ²⁵ ^, Reference Flores, Macias and Rivera ⁵³ ^, Reference Carrera, Gao and Tucker ⁵⁴ ^, Reference Villegas, Yang and Gao ⁶¹ ^, Reference Liu, McKeown and Newby ⁶² ^, Reference Brunner, Mosdol and Witte ⁷⁰ ^, Reference Song and Joung ⁷² ^, Reference Tucker, Chen and Hannan ⁷³ ⁾.

Conclusion and future work

From the numerous studies mentioned in this review, some consistent trends emerge when using cluster analysis to derive dietary patterns. It can be argued that there is homogeneity of dietary patterns across populations, where the consistency of patterns identified suggests that they are reproducible. Despite this, given the data driven nature of this statistical technique, the extent to which the identified patterns are reproducible and the extent to which they can be used to develop the understanding of nutritional epidemiology remains debatable. Several important issues have been highlighted, specifically regarding the methodological aspect of cluster analysis and these should be considered in future studies. However, in the earlier studies, different clustering techniques and procedures have been used, making it difficult to draw firm conclusions. Few studies have examined the effect of energy mis-reporting and it is clear that this effect is not fully understood. This review demonstrates the need for large representative cross-sectional and longitudinal studies to assess the effects of energy mis-reporting by carrying dietary pattern analysis on (1) the total population, (2) adequate reporters and (3) under-reporters.

Acknowledgements

U.M.D. wrote the review, B.A.McN., A.P.N. and M.J.G. provided expert advice in the drafting of the paper and commented on drafts of the paper. The authors declare no conflict of interest. The work was supported by joint funding from the Irish Department of Agriculture, Fisheries and Food and the Health Research Board under the Food for Health Research Initiative (2007–2012).

References

1. World Health Organization (2003) Diet, Nutrition and the Prevention of Chronic Diseases. Joint WHO/FAO Expert Consultation. WHO Technical Report Series no. 916. Geneva: WHO.Google Scholar

2. Kafatos, A & Codrington, CA (1999) Nutrition and diet for healthy lifestyles in Europe: the Eurodiet Project. Public Health Nutr 2, 327–328.10.1017/S1368980099000439Google Scholar

3. Jacques, P & Tucker, K (2001) Are dietary patterns useful for understanding the role of diet in chronic disease? Am J Clin Nutr 73, 1–2.10.1093/ajcn/73.1.1Google Scholar

4. Hu, F (2002) Dietary pattern analysis: a new direction in nutritional epidemiology. Curr Opin Lipidol 13, 3–9.Google Scholar

5. Tucker, KL (2010) Dietary patterns, approaches, and multicultural perspective. Appl Physiol Nutr Metab 35, 211–218.Google Scholar

6. Kennedy, ET, Ohls, J, Carlson, S et al. (1995) The Healthy Eating Index: design and applications. J Am Diet Assoc 95, 1103–1108.Google Scholar

7. Kant, AK (2004) Dietary patterns and health outcomes. J Am Diet Assoc 104, 615–635.10.1016/j.jada.2004.01.010Google Scholar

8. Hearty, AP & Gibney, MJ (2009) Comparison of cluster and principal component analysis techniques to derive dietary patterns in Irish adults. Br J Nutr 101, 598–608.10.1017/S0007114508014128Google Scholar

9. Crozier, SR, Robinson, SM, Borland, SE et al. (2006) Dietary patterns in the Southampton Women's Survey. Eur J Clin Nutr 60, 1391–1399.Google Scholar

10. Cunha, DB, Almeida, RM & Pereira, RA (2010) A comparison of three statistical methods applied in the identification of eating patterns. Cad Saude Publica 26, 2138–2148.10.1590/S0102-311X2010001100015Google Scholar

11. Newby, PK, Muller, D & Tucker, KL (2004) Associations of empirically derived eating patterns with plasma lipid biomarkers: a comparison of factor and cluster analysis methods. Am J Clin Nutr 80, 759–767.Google Scholar

12. Hoffman, K, Schulze, MB, Boeing, H et al. (2002) Dietary patterns: report of an international workshop. Public Health Nutr 5, 89–90.Google Scholar

13. Engeset, D, Alsaker, E, Ciampi, A et al. (2005) Dietary patterns and lifestyle factors in the Norwegian EPIC cohort: the Norwegian Women and Cancer (NOWAC) study. Eur J Clin Nutr 59, 675–684.Google Scholar

14. Newby, PK & Tucker, KL (2004) Empirically derived eating patterns using factor or cluster analysis: a review. Nutr Rev 62, 177–203.10.1111/j.1753-4887.2004.tb00040.xGoogle Scholar

15. Togo, P, Osler, M, Sorensen, TI et al. (2001) Food intake patterns and body mass index in observational studies. Int J Obes Relat Metab Disord 25, 1741–1751.Google Scholar

16. Pryer, JA, Nichols, R, Elliott, P et al. (2001) Dietary patterns among a national random sample of British adults. J Epidemiol Community Health 55, 29–37.10.1136/jech.55.1.29Google Scholar

17. Millen, BE, Quatromoni, PA, Gagnon, DR et al. (1996) Dietary patterns of men and women suggest targets for health promotion: the Framingham Nutrition Studies. Am J Health Promot 11, 42–52.Google Scholar

18. Haveman-Nies, A, Tucker, KL, de Groot, LC et al. (2001) Evaluation of dietary quality in relationship to nutritional and lifestyle factors in elderly people of the US Framingham Heart Study and the European SENECA study. Eur J Clin Nutr 55, 870–880.Google Scholar

19. Winkvist, A, Hornell, A, Hallmans, G et al. . (2009) More distinct food intake patterns among women than men in northern Sweden: a population-based survey. Nutr J 8, 12.10.1186/1475-2891-8-12Google Scholar

20. Lo Siou, G, Yasui, Y, Csizmadi, I et al. (2011) Exploring statistical approaches to diminish subjectivity of cluster analysis to derive dietary patterns: the Tomorrow Project. Am J Epidemiol 173, 956–967.Google Scholar

21. Villegas, R, Salim, A, Collins, M et al. (2004) Dietary patterns in middle-aged Irish men and women defined by cluster analysis. Public Health Nutr 7, 1017–1024.Google Scholar

22. European Food Safety Authority (2009) General principles for the collection of national food consumption data in the view of a pan-European dietary survey. EFSA J 7, 1435.Google Scholar

23. Moeller, SM, Reedy, J, Millen, AE et al. (2007) Dietary patterns: challenges and opportunities in dietary patterns research an Experimental Biology workshop, April 1, 2006. J Am Diet Assoc 107, 1233–1239.Google Scholar

24. Michels, KB & Schulze, MB (2005) Can dietary patterns help us detect diet disease associations? Nutr Res Rev 18, 241–248.Google Scholar

25. Anderson, AL, Harris, TB, Houston, DK et al. . Relationships of dietary patterns with body composition in older adults differ by gender and PPAR-gamma Pro12Ala genotype. Eur J Nutr 49, 385–394.Google Scholar

26. Wirfalt, AK & Jeffery, RW (1997) Using cluster analysis to examine dietary patterns: nutrient intakes, gender, and weight status differ across food pattern clusters. J Am Diet Assoc 97, 272–279.10.1016/S0002-8223(97)00071-0Google Scholar

27. Bailey, RL, Mitchell, DC, Miller, CK et al. (2007) A dietary screening questionnaire identifies dietary patterns in older adults. J Nutr 137, 421–426.Google Scholar

28. Berg, CM, Lappas, G, Strandhagen, E et al. (2008) Food patterns and cardiovascular disease risk factors: the Swedish INTERGENE research program. Am J Clin Nutr 88, 289–297.10.1093/ajcn/88.2.289Google Scholar

29. Newby, PK, Muller, D, Hallfrisch, J et al. (2003) Dietary patterns and changes in body mass index and waist circumference in adults. Am J Clin Nutr 77, 1417–1425.Google Scholar

30. O'Sullivan, A, Gibney, MJ & Brennan, L (2011) Dietary intake patterns are reflected in metabolomic profiles: potential role in dietary assessment studies. Am J Clin Nutr 93, 314–321.Google Scholar

31. Anderson, AL, Harris, TB, Tylavsky, FA et al. (2011) Dietary patterns and survival of older adults. J Am Diet Assoc 111, 84–91.10.1016/j.jada.2010.10.012Google Scholar

32. Bailey, RL, Gutschall, MD, Mitchell, DC et al. (2006) Comparative strategies for using cluster analysis to assess dietary patterns. J Am Diet Assoc 106, 1194–1200.10.1016/j.jada.2006.05.012Google Scholar

33. Wirfält, E, Mattisson, I, Gullberg, B et al. (2000) Food patterns defined by cluster analysis and their utility as dietary exposure variables: a report from the Malmö Diet and Cancer Study. Public Health Nutr 3, 159–173.Google Scholar

34. Wirfalt, E, Hedblad, B, Gullberg, B et al. (2001) Food patterns and components of the metabolic syndrome in men and women: a cross-sectional study within the Malmo Diet and Cancer cohort. Am J Epidemiol 154, 1150–1159.10.1093/aje/154.12.1150Google Scholar

35. Hulshof, KF, Wedel, M, Lowik, MR et al. (1992) Clustering of dietary variables and other lifestyle factors (Dutch Nutritional Surveillance System). J Epidemiol Community Health 46, 417–424.10.1136/jech.46.4.417Google Scholar

36. Greenwood, DC, Cade, JE, Draper, A et al. (2000) Seven unique food consumption patterns identified among women in the UK Women's Cohort Study. Eur J Clin Nutr 54, 314–320.Google Scholar

37. Holmback, I, Ericson, U, Gullberg, B et al. . (2009) Five meal patterns are differently associated with nutrient intakes, lifestyle factors and energy misreporting in a sub-sample of the Malmo Diet and Cancer cohort. Food Nutr Res: DOI:10.3402/fnr.v53i0.1970.Google Scholar

38. Margetts, BM, Thompson, RL, Speller, V et al. (1998) Factors which influence ‘healthy’ eating patterns: results from the 1993 Health Education Authority health and lifestyle survey in England. Public Health Nutr 1, 193–198.Google Scholar

39. Martikainen, P, Brunner, E & Marmot, M (2003) Socioeconomic differences in dietary patterns among middle-aged men and women. Soc Sci Med 56, 1397–1410.Google Scholar

40. Millen, BE, Quatromoni, PA, Copenhafer, DL et al. (2001) Validation of a dietary pattern approach for evaluating nutritional risk: the Framingham Nutrition Studies. J Am Diet Asocc 101, 187–194.Google Scholar

41. Correa Leite, ML, Nicolosi, A, Cristina, S et al. (2003) Dietary and nutritional patterns in an elderly rural population in Northern and Southern Italy: (I). A cluster analysis of food consumption. Eur J Clin Nutr 57, 1514–1521.Google Scholar

42. Davis, MS, Miller, CK & Mitchell, DC (2004) More favorable dietary patterns are associated with lower glycemic load in older adults. J Am Diet Assoc 104, 1828–1835.Google Scholar

43. Ledikwe, JH, Smiciklas-Wright, H, Mitchell, DC et al. (2004) Dietary patterns of rural older adults are associated with weight and nutritional status. J Am Geriatr Soc 52, 589–595.Google Scholar

44. James, DC (2009) Cluster analysis defines distinct dietary patterns for African–American men and women. J Am Diet Assoc 109, 255–262.Google Scholar

45. Delisle, HF, Vioque, J & Gil, A 2009 Dietary patterns and quality in West-African immigrants in Madrid. Nutr J 8, 3.10.1186/1475-2891-8-3Google Scholar

46. Pryer, JA, Cook, A & Shetty, P (2001) Identification of groups who report similar patterns of diet among a representative national sample of British adults aged 65 years of age or more. Public Health Nutr 4, 787–795.10.1079/PHN200098Google Scholar

47. Bamia, C, Orfanos, P, Ferrari, P et al. (2005) Dietary patterns among older Europeans: the EPIC-Elderly study. Br J Nutr 94, 100–113.Google Scholar

48. Quatromoni, PA, Copenhafer, DL, Demissie, S et al. (2002) The internal validity of a dietary pattern analysis. The Framingham Nutrition Studies. J Epidemiol Community Health 56, 381–388.Google Scholar

49. Tucker, KL, Dallal, GE & Rush, D (1992) Dietary patterns of elderly Boston-area residents defined by cluster analysis. J Am Diet Assoc 92, 1487–1491.Google Scholar

50. Schroll, K, Carbajal, A, Decarli, B et al. (1996) Food patterns of elderly Europeans. SENECA Investigators. Eur J Clin Nutr 50, 86–100.Google Scholar

51. Lin, H, Bermudez, OI & Tucker, KL (2003) Dietary patterns of Hispanic elders are associated with acculturation and obesity. J Nutr 133, 3651–3657.Google Scholar

52. Quatromoni, PA, Copenhafer, DL, D'Agostino, RB et al. (2002) Dietary patterns predict the development of overweight in women: the Framingham Nutrition Studies. J Am Diet Assoc 102, 1239–1246.Google Scholar

53. Flores, M, Macias, N, Rivera, M et al. (2010) Dietary patterns in Mexican adults are associated with risk of being overweight or obese. J Nutr 140, 1869–1873.10.3945/jn.110.121533Google Scholar

54. Carrera, PM, Gao, X & Tucker, KL (2007) A study of dietary patterns in the Mexican–American population and their association with obesity. J Am Diet Assoc 107, 1735–1742.Google Scholar

55. Millen, BE, Quatromoni, PA, Nam, BH et al. (2004) Dietary patterns, smoking, and subclinical heart disease in women: opportunities for primary prevention from the Framingham Nutrition Studies. J Am Diet Assoc 104, 208–214.Google Scholar

56. Millen, BE, Quatromoni, PA, Nam, BH et al. (2002) Dietary patterns and the odds of carotid atherosclerosis in women: the Framingham Nutrition Studies. Prev Med 35, 540–547.10.1006/pmed.2002.1116Google Scholar

57. Hlebowicz, J, Persson, M, Gullberg, B et al. (2011) Food patterns, inflammation markers and incidence of cardiovascular disease: the Malmo Diet and Cancer study. J Intern Med 270, 365–376.Google Scholar

58. Oliveira, A, Rodriguez-Artalejo, F, Gaio, R et al. (2011) Major habitual dietary patterns are associated with acute myocardial infarction and cardiovascular risk markers in a southern European population. J Am Diet Assoc 111, 241–250.10.1016/j.jada.2010.10.042Google Scholar

59. Lopez, EP, Rice, C, Weddle, DO et al. (2008) The relationship among cardiovascular risk factors, diet patterns, alcohol consumption, and ethnicity among women aged 50 years and older. J Am Diet Assoc 108, 248–256.Google Scholar

60. Liese, AD, Schulz, M, Moore, CG et al. (2004) Dietary patterns, insulin sensitivity and adiposity in the multi-ethnic Insulin Resistance Atherosclerosis Study population. Br J Nutr 92, 973–984.Google Scholar

61. Villegas, R, Yang, G, Gao, YT et al. (2010) Dietary patterns are associated with lower incidence of type 2 diabetes in middle-aged women: the Shanghai Women's Health Study. Int J Epidemiol 39, 889–899.Google Scholar

62. Liu, E, McKeown, NM, Newby, PK et al. (2009) Cross-sectional association of dietary patterns with insulin-resistant phenotypes among adults without diabetes in the Framingham Offspring Study. Br J Nutr 102, 576–583.Google Scholar

63. He, Y, Ma, G, Zhai, F, Li, Y et al. (2009) Dietary patterns and glucose tolerance abnormalities in Chinese adults. Diabetes Care 32, 1972–1976.Google Scholar

64. Chen, H, Ward, MH, Graubard, BI et al. (2002) Dietary patterns and adenocarcinoma of the esophagus and distal stomach. Am J Clin Nutr 75, 137–144.Google Scholar

65. Bastos, J, Lunet, N, Peleteiro, B et al. (2010) Dietary patterns and gastric cancer in a Portuguese urban population. Int J Cancer 127, 433–441.Google Scholar

66. Edefonti, V, Randi, G, Decarli, A et al. (2009) Clustering dietary habits and the risk of breast and ovarian cancers. Ann Oncol 20, 581–590.Google Scholar

67. Tsai, YY, McGlynn, KA, Hu, Y et al. (2003) Genetic susceptibility and dietary patterns in lung cancer. Lung Cancer 41, 269–281.Google Scholar

68. Rouillier, P, Senesse, P, Cottet, V et al. (2005) Dietary patterns and the adenomacarcinoma sequence of colorectal cancer. Eur J Nutr 44, 311–318.Google Scholar

69. Millen, BE, Quatromoni, PA, Pencina, M et al. (2005) Unique dietary patterns and chronic disease risk profiles of adult men: the Framingham nutrition studies. J Am Diet Assoc 105, 1723–1734.Google Scholar

70. Brunner, EJ, Mosdol, A, Witte, DR et al. (2008) Dietary patterns and 15-y risks of major coronary events, diabetes, and mortality. Am J Clin Nutr 87, 1414–1421.Google Scholar

71. Leite, ML & Nicolosi, A (2009) Dietary patterns and metabolic syndrome factors in a non-diabetic Italian population. Public Health Nutr 12, 1494–1503.Google Scholar

72. Song, Y & Joung, H (2012) A traditional Korean dietary pattern and metabolic syndrome abnormalities. Nutr Metab Cardiovasc Dis 22, 456–462.Google Scholar

73. Tucker, KL, Chen, H, Hannan, MT et al. (2002) Bone mineral density and dietary patterns in older adults: the Framingham Osteoporosis Study. Am J Clin Nutr 76, 245–252.Google Scholar

74. McNulty, H & Scott, JM (2008) Intake and status of folate and related B-vitamins: considerations and challenges in achieving optimal status. Br J Nutr 99, 48–54.10.1017/S0007114508006855Google Scholar

75. Gao, X, Yao, M, McCrory, MA et al. (2003) Dietary pattern is associated with homocysteine and B vitamin status in an urban Chinese population. J Nutr 133, 3636–3642.10.1093/jn/133.11.3636Google Scholar

76. Knoops, KT, Spiro, A, de Groot, LC et al. (2009) Do dietary patterns in older men influence change in homocysteine through folate fortification? The Normative Aging Study. Public Health Nutr 12, 1760–1766.Google Scholar

77. Brennan, L (2008) Session 2: personalised nutrition Metabolomic applications in nutritional research. Proc Nutr Soc 67, 404–408.10.1017/S0029665108008719Google Scholar

78. Pencina, MJ, Millen, BE, Hayes, LJ et al. (2008) Performance of a method for identifying the unique dietary patterns of adult women and men: the Framingham nutrition studies. J Am Diet Assoc 108, 1453–1460.Google Scholar

79. Bountziouka, V, Tzavelas, G, Polychronopoulos, E et al. (2011) Validity of dietary patterns derived in nutrition surveys using a priori and a posteriori multivariate statistical methods. Int J Food Sci Nutr 62, 617–627.Google Scholar

80. Black, AE, Prentice, AM, Goldberg, GR et al. (1993) Measurements of total energy expenditure provide insights into the validity of dietary measurements of energy intake. J Am Diet Assoc 93, 572–579.Google Scholar

81. Goldberg, GR, Black, AE, Jebb, SA et al. (1991) Critical evaluation of energy intake data using fundamental principles of energy physiology: 1. Derivation of cut-off limits to identify under-recording. Eur J Clin Nutr 45, 569–581.Google Scholar

82. Livingstone, MB & Black, AE (2003) Markers of the validity of reported energy intake. J Nutr 133, 895S–920S.Google Scholar

83. Pryer, JA, Vrijheid, M, Nichols, R et al. (1997) Who are the ‘low energy reporters’ in the dietary and nutritional survey of British adults? Int J Epidemiol 26, 146–154.10.1093/ije/26.1.146Google Scholar

84. Johansson, L, Solvoll, K, Bjorneboe, GE et al. (1998) Under- and overreporting of energy intake related to weight status and lifestyle in a nationwide sample. Am J Clin Nutr 68, 266–274.Google Scholar

85. Mendez, MA, Wynter, S, Wilks, R et al. (2004) Under- and overreporting of energy is related to obesity, lifestyle factors and food group intakes in Jamaican adults. Public Health Nutr 7, 9–19.Google Scholar

86. Mattisson, I, Wirfalt, E, Aronsson, CA et al. (2005) Misreporting of energy: prevalence, characteristics of misreporters and influence on observed risk estimates in the Malmo Diet and Cancer cohort. Br J Nutr 94, 832–842.10.1079/BJN20051573Google Scholar

87. Hornell, A, Winkvist, A, Hallmans, G et al. . (2010) Mis-reporting, previous health status and health status of family may seriously bias the association between food patterns and disease. Nutr J 9, 48.10.1186/1475-2891-9-48Google Scholar

88. Scagliusi, FB, Ferriolli, E, Pfrimer, K et al. (2008) Under-reporting of energy intake is more prevalent in a healthy dietary pattern cluster. Br J Nutr 100, 1060–1068.Google Scholar

89. Bailey, RL, Mitchell, DC, Miller, C et al. (2007) Assessing the effect of underreporting energy intake on dietary patterns and weight status. J Am Diet Assoc 107, 64–71.Google Scholar

Table 1. Associations between dietary patterns and chronic diseases

Article contents

The use of cluster analysis to derive dietary patterns: methodological considerations, reproducibility, validity and the effect of energy mis-reporting

Abstract

Keywords

Methodological considerations

Dietary patterns in healthy population groups

Dietary patterns and associations with chronic diseases

Dietary patterns and associations with nutritional biomarkers

Reproducibility and validity

Energy mis-reporting

Conclusion and future work

Acknowledgements

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests