Skip to main content Accessibility help
Hostname: page-component-cf9d5c678-9z9qw Total loading time: 0.503 Render date: 2021-07-28T18:11:17.018Z Has data issue: true Feature Flags: { "shouldUseShareProductTool": true, "shouldUseHypothesis": true, "isUnsiloEnabled": true, "metricsAbstractViews": false, "figures": true, "newCiteModal": false, "newCitedByModal": true, "newEcommerce": true, "newUsageEvents": true }

Effects of education and culture on the validity of the Geriatric Mental State and its AGECAT algorithm

Published online by Cambridge University Press:  02 January 2018

Martin Prince
Institute of Psychiatry, King's College, London, UK
Daisy Acosta
Universidad Nacional Pedro Henriquez Ureña (UNPHU), Santo Domingo, Dominican Republic
Helen Chiu
Chinese University of Hong Kong, Hong Kong, SAR
John Copeland
University of Liverpool, Liverpool, UK
Michael Dewey
Institute of Psychiatry King's College, London, UK
Marcia Scazufca
University of São Paulo, São Paulo, Brazil
Mathew Varghese
National Institute of Mental Health and Neurological Sciences, Bangalore, India
E-mail address:
Rights & Permissions[Opens in a new window]



The Geriatric Mental State (GMS) is the most widely used psychiatric research assessment for older persons. Evidence for validity comes from the developed world.


To assess the validity of GMS/AGECAT organicity and depression diagnoses in 26 centres in India, China, Latin America and Africa.


We studied 2941 persons aged 60 years and over: 742 people with dementia and three groups free of dementia (697 with depression, 719 with high and 783 with low levels of education). Local clinicians diagnosed dementia (DSM–IV) and depression (Montgomery – Åsberg Depression Rating Scale score ⩾18).


For dementia diagnosis GMS/AGECAT performed well in many centres but educational bias was evident. Specificity was poor in India and sensitivity sub-optimal in Latin America. A predictive algorithm excluding certain orientation items but including interviewer judgements improved upon the AGECAT algorithm. For depression, sensitivity was high. The EURO–D depression scale, derived from GMS items using European data, has a similar factor structure in Latin America, India and, to a lesser extent, China.


Valid, comprehensive mental status assessment across cultures seems achievable in principle.

Copyright © 2004 The Royal College of Psychiatrists 

The Geriatric Mental State (GMS) examination (Reference Copeland, Dewey and Griffith-JonesCopeland et al, 1986) is the most widely used comprehensive mental health research assessment for older persons (Reference Copeland, Prince and WilsonCopeland et al, 2002). It is particularly suited for comparative epidemiological research, given its structured format for identifying, rating and recording symptoms, and the use of the AGECAT computerised algorithm (Reference Copeland, Dewey and Griffith-JonesCopeland et al, 1986) to generate diagnoses. The purpose of this paper is to assess the extent to which the validity of the GMS, first established in developed countries (Reference Livingston, Sax and WillisonLivingston et al, 1990; Reference Collinghan, MacDonald and HerzbergCollinghan et al, 1993), extends to poorly educated populations in developing countries. The 10/66 Dementia Research Group recently included the GMS as a key component of an algorithm for the diagnosis of dementia in developing countries, in conjunction with the Community Screening Instrument for Dementia (CSI-D; Reference Hall, Hendrie and BrittainHall et al, 1993) and the modified CERAD ten-word list learning test (Reference Ganguli, Chandra and GilbeyGanguli et al, 1996). Here, using the same data, we assess the performance of the GMS in more detail. Validity is assessed at the level of each of the 26 participating centres. The GMS depression diagnoses are examined in addition to organicity (dementia). Responses to individual items contributing to the depression and organicity diagnostic algorithms are assessed across world regions.


Study design

The design of the 10/66 dementia diagnosis pilot study is described in more detail elsewhere (Reference Prince, Acosta and ChiuPrince et al, 2003). In each centre we aimed to recruit 30 participants into each of four groups: mild to moderate dementia; depression; high level of education; and low level of education. Ethical approval for the studies was obtained in London and in the overseas centres. Recruitment was on the basis of informed consent, or relatives’ agreement where individuals with dementia lacked capacity. All participants were aged 60 years or over. To maintain blindness, independent clinicians established the diagnosis of dementia according to DSM–IV (American Psychiatric Association, 1994) criteria (any dementia subtype) by completing a clinical pro forma, and formally rating dementia severity using the Clinical Dementia Rating (CDR) scale (Reference MorrisMorris, 1993). They confirmed the diagnosis of depression with a clinical assessment guided by the Montgomery-Åsberg Depression Rating (MADRS; Reference Montgomery and ÅsbergMontgomery & Åsberg, 1979) with an inclusion criterion of a score of 18 or above. Dementia and depression case groups were recruited either from previous clinical contacts or by local key informant nomination. The two groups with normal cognitive function (low and high levels of education) were recruited from the community. Interviewers were given the participants’ name and address but were not told of their diagnosis.


All study instruments were translated and back-translated by bilingual local investigators and the resulting local language version was reviewed by local key informants to check its face validity. The GMS is a 25-50 min clinical interview generating, from a computerised algorithm (AGE-CAT), nine diagnostic clusters: organicity (dementia and other organic brain syndromes), schizophrenia (and related psychoses), mania, neurotic depression, psychotic depression, hypochondriasis, phobias, obsessional neurosis and anxiety neurosis. A diagnostic confidence level for each syndrome ranges from 0 (no symptoms) to 5 (very severely affected). Levels 3 and greater represent likely cases, a degree of severity warranting professional intervention; levels 1 and 2 are sub-cases. Stage 1 diagnoses are then organised into final stage 2 diagnoses on the basis of precedence determined by a hierarchically structured algorithm. We used the original A3 version of the GMS. A briefer B3 ‘community’ version of the GMS omits those sections that assess syndromes with a low prevalence in the general community: mania, obsessive-compulsive disorder, hypochondriasis and some ratings of hallu-cinations and delusions. It is possible to generate B3 AGECAT diagnoses from A3 data-sets as if the briefer interview had been administered instead.

A subset of 12 GMS items contribute particularly towards the determination of the stage 1 organicity diagnostic confidence level. These comprise tests of cognitive ability (knowledge of date of birth and age; discrepancy between stated date of birth and age; orientation to day, month, year and address; recall of name of interviewer; name of their country's current and previous political leader) and two judgements made by the interviewer (presence of memory deficit, and problems with memory worse than problems with thinking). Twelve symptoms of depression in the GMS (depression, pessimism, wishing death, guilt, sleep, interest, irritability, appetite, fatigue, concentration, enjoyment, tearfulness) are used to generate the EURO-D 12-item depression symptom scale (Reference Prince, Reischies and BeekmanPrince et al, 1999). The EURO-D was internally consistent and captured the essence of its parent instrument. Across Europe a two-factor solution seemed appropriate: depression, tearfulness and wishing to die loaded on the first factor (affective suffering); and loss of interest, poor concentration and lack of enjoyment loaded on the second factor (motivation).


All centres were trained in the use of the GMS. M.P. and J.C. trained the Chinese and Indian centres, using English. For Latin America the Brazilian (Portuguese-speaking) and Hispanic 10/66 network coordinators were trained by M.P., using English. They subsequently trained investigators from the 14 Latin American centres using their own languages. Over 2-3 days, each trainee viewed and co-rated two training tapes, completed and rated a supervised training interview and co-rated a further four to six training interviews. This represented a necessary compression of the more conventional 5-day training period for the GMS.


Dementia diagnosis

  1. (a) We estimated the sensitivity (%) for dementia of the GMS-A3/AGECAT stage 2 organicity diagnosis (level 3 or greater), and the false-positive rates (%) for each centre, among those with depression and in the ‘high-education’ and ‘low-education’ control groups. We compared these parameters with those that would have been achieved with the briefer GMS-B3.

  2. (b) We assessed the validity of the 12 organicity items by estimating the proportions of participants in each region and for each diagnostic group who failed on the ten GMS cognitive test items and who were considered to be impaired on the two objective interviewer assessments.

  3. (c) We further assessed the 12 organicity items as independent predictors of true dementia status using logistic regression. The collective discriminability of the optimally discriminant items was assessed using predicted probabilities from the logistic regression model. To avoid overprediction the data-set was divided randomly into two halves; the logistic model was developed from the first half (development sample) and applied to the second half (test sample). The sensitivity (%) for dementia of the predictive model (on the test sample) and its false-positive rates (%) for each centre among those with depression and in the high and low education control groups was compared with that of the AGECAT stage 2 organicity diagnosis.

Depression diagnosis

We estimated in each region:

  1. (a) the sensitivity (%) for depression of the GMS/AGECAT stage 2 depression diagnosis (level 3 or greater) in the depression group, and the proportion of GMS/AGECAT stage 2 depression in each of the three other groups (dementia, high-education and low-education), for which depression status was not a selection criterion;

  2. (b) the mean EURO-D score and standard deviation for each diagnostic group;

  3. (c) the internal consistency of the EURO-D scale (excluding those with dementia);

  4. (d) the factor structure of the EURO-D scale items (using principal components analysis with varimax rotation), comparing the results with those published previously for European centres (Reference Prince, Reischies and BeekmanPrince et al, 1999).


Centres and participants

In all, 2941 persons were interviewed: 746 in India, 336 in China and south-east Asia, 119 in Russia, 74 in Nigeria (Africa) and 1666 in Latin America and the Caribbean. Centres were asked to recruit participants aged 65 years and over. In the event, some centres recruited some participants aged 60-64 years, 207 in all or 7% of the total sample. Of the 2941 participants, 742 were people with dementia, 697 people with depression, 719 high-education controls and 783 low-education controls. In the low-education groups the proportions receiving no or minimal education were 91% for India, 89% for China and 80% for Latin America and the Caribbean. In the high-education groups the proportions completing secondary education were 81%, 99% and 80%, respectively.


At regional level, the GMS-A3/AGECAT stage 2 organicity rating has reasonable validity against the gold standard clinical diagnosis of dementia (Table 1). Sensitivity appears to be better in Indian and Chinese centres than in Latin American. The false-positive rate among those with little education is worse in India. However, at centre level the performance is patchy. In Thrissur and Goa in Southern India, although sensitivity is excellent, one-half and two-thirds, respectively, of the least well educated are misdiagnosed. In Latin America in Venezuela, Argentina, Chile and Mexico (Guadalajara) less than half of dementia cases are correctly identified. The GMS-B3 stage 2 organicity rating was identical to the A3 rating in most centres and is therefore not cited here. Where it differed significantly, sensitivity was superior with little or no decline in specificity. Thus, in Guadalajara, sensitivity with version B3 was 50% against 13% for A3, in Chile it was 67% compared with 42% and in Argentina it was 57% compared with 50%.

Table 1 Discriminability of the Geriatric Mental State (GMS) A3 ‘organicity’ AGECAT stage 2 diagnosis and of the new algorithm derived from GMS organicity items (see also Tables 2 and 3)

Dementia sensitivity (%) Depression (FPR%)1 High-education controls (FPR%) Low-education controls (FPR%)
    VHS, Chennai GMS–A3 77 17 0 23
New algorithm 57 17 0 6
    SCARF, Chennai GMS–A3 80 4 0 17
New algorithm 100 0 0 0
    Goa GMS–A3 97 47 7 67
New algorithm 85 9 0 24
    Thrissur GMS–A3 86 19 0 54
New algorithm 85 0 0 0
    Hyderabad GMS–A3 77 33 0 7
China New algorithm 88 26 9 0
    Beijing GMS–A3 90 0 0 0
New algorithm 94 0 0 0
    Taipei GMS–A3 71 9 3 12
New algorithm 67 0 0 0
    Anambra GMS–A3 95 13 0 13
New algorithm 83 14 0 0
    Moscow GMS–A3 61 0 3 3
New algorithm 84 5 14 6
Latin America
    São Paulo GMS–A3 67 7 0 20
New algorithm 73 7 0 7
    Botucatu GMS–A3 77 23 0 13
New algorithm 77 8 0 0
    São Jose GMS–A3 73 30 7 3
New algorithm 91 16 0 0
    La Habana GMS–A3 85 3 0 13
New algorithm 86 6 0 0
    Peru GMS–A3 73 3 7 13
New algorithm 73 15 10 7
    Venezuela GMS–A3 47 13 0 3
New algorithm 63 6 0 0
    Dominican Republic GMS–A3 67 11 0 22
New algorithm 88 14 0 18
    Argentina GMS–A3 50 6 0 0
New algorithm 92 10 0 0
    Chile GMS–A3 42 18 0 6
New algorithm 80 0 0 0
    Panama GMS–A3 63 13 3 10
New algorithm 78 11 0 0
    Guatemala GMS–A3 77 17 0 3
New algorithm 94 12 7 0
    Mexico City GMS–A3 69 14 0 0
New algorithm 91 25 0 11
    Guadalajara GMS–A3 13 37 3 3
New algorithm 100 0 0 0
Uruguay GMS–A3 87 0 6 8
New algorithm 100 38 0 15
India GMS–A3 80 30 1 37
New algorithm 85 12 1 5
LAC2 GMS–A3 65 14 2 8
New algorithm 86 13 2 4
China GMS–A3 80 5 2 6
New algorithm 79 5 0 2

Table 2 Discriminability by region of the Geriatric Mental State (GMS) organicity items

Item Dementia Depression High education Low education
Date of birth incorrect
    India 54 36 7 41
    China 22 4 0 2
    LAC 47 5 1 2
Age incorrect
    India 45 14 2 15
    China 12 0 0 0
    LAC 47 5 1 2
Discrepancy between date of birth and age
    India 18 3 0 3
    China 37 3 0 2
    LAC 49 8 1 4
One or more of above
    India 64 54 8 50
    China 61 6 0 3
    LAC 61 11 1 6
Day of week incorrect
    India 8 2 0 1
    China 41 1 0 0
    LAC 30 5 1 1
Month incorrect
    India 35 6 1 4
    China 43 2 0 0
    LAC 49 6 1 1
Year incorrect
    India 40 11 1 9
    China 52 5 1 9
    LAC 54 8 1 4
Address incorrect
    India 44 11 1 5
    China 28 1 0 1
    LAC 49 5 2 6
Claimed to have seen interviewer before
    India 24 2 1 0
    China 16 5 0 0
    LAC 30 2 1 3
Did not recall interviewer's name
    India 81 31 6 12
    China 79 29 8 39
    LAC 67 28 7 19
Did not recall country's leader
    India 85 39 12 32
    China 44 5 0 0
    LAC 54 18 5 12
Did not recall country's past leader
    India 90 50 15 41
    China 49 13 0 2
    LAC 65 38 18 27
Interviewer's opinion
Participant has difficulty with memory
    India 52 2 0 0
    China 39 2 0 0
    LAC 62 6 1 1
Interviewer's opinion
Problems with memory more prominent than problems with thinking
    India 45 1 1 1
    China 26 0 0 0
    LAC 65 13 1 7

Table 3 Predictive model for the diagnosis of dementia derived from the Geriatric Mental State (GMS) organicity items using logistic regression

β s.e. Wald statistic d.f. P Odds ratio (95% CI)
Age incorrect 0.82 0.35 5.6 1 0.018 2.3 (1.2–4.5)
Day of the week incorrect 1.74 0.48 13.0 1 <0.001 5.7 (2.2–14.6)
Month of the year incorrect 1.63 0.34 22.8 1 <0.001 5.1 (2.6–10.0)
Address incorrect 2.23 0.33 46.6 1 <0.001 9.3 (4.9–17.6)
Claims to have seen interviewer before 1.42 0.44 10.6 1 0.001 4.1 (1.8–9.7)
Does not recall interviewer's name 1.01 0.25 16.5 1 <0.001 2.7 (1.7–4.4)
Interviewer judgement
    Memory impairment 2.26 0.35 40.8 1 <0.001 9.6 (4.8–19.3)
    Problems with memory worse than problems with thinking 2.94 0.27 117.8 1 <0.001 18.8 (11.1–32.0)
Constant – 3.82 0.21

Item-level analysis showed that all of the cognitive test items and each of the two interviewer objective assessments of the presence of memory deficits discriminated effectively between dementia and the other three groups, in each of the regions (Table 2). Defective orientation to year (more so than to month or day of the week), ignorance of the names of the country's current and previous political leaders were the most educationally biased, particularly in India. Response patterns to the latter items were highly dependent upon local political culture. In Cuba, everyone in the control groups knew that Fidel Castro was their country's leader, as did 82% of people with dementia. Conversely, in Goa only 3% of people with dementia and 13% of people with low education and no dementia could name Mr Vajpayee. Three cognitive test items – disorientation to address, error in stating age, and confabulation in response to the question ‘have you seen me before’ – were good discriminators with little educational bias across all three regions. The most effective discriminators were the interviewers’ two global assessments. A parsimonious model developed using logistic regression on one random half of the data-set included these most effective and least biased items, and excluded orientation to year and knowledge of past and present political leaders (Table 3). The resulting coefficients were applied to the other ‘test’ half of the dataset, probabilities of group membership were calculated and a cut-off point of 0.30 (optimal in the development data-set) was applied. The new algorithm was markedly more effective at discriminating between dementia and the other three groups than the AGECAT organicity rating, both in every region and in nearly every centre (Table 1). At region level, the false-positive rate in the low-education group in India fell from 37% to 5%, and the sensitivity in Latin America and the Caribbean increased from 65% to 86%.


The sensitivity of the GMS/AGECAT stage 1 diagnosis of depression for the MADRS-defined depression case criterion was close to 90% in each of the three main regions (Table 4). This figure dropped to around 70-80% in AGECAT stage 2, mainly because the depressive symptoms had been trumped by the organicity (dementia) ratings in the hierarchical diagnosis. Because dementia was an exclusion criterion for selection into the depression group, this suggested misclassification by the stage 2 AGECAT algorithm. The high levels of apparent comorbidity in the dementia case groups are noteworthy, as is the apparent high proportion of those with depression in the high- and low-education control groups in Latin America and the Caribbean compared with Indian and Chinese centres. Case-level depression was neither screened for nor excluded from the high- and low-education or dementia groups, so we could not estimate the specificity of the AGECAT depression diagnosis.

Table 4 Prevalence of Geriatric Mental State (GMS) AGECAT stage 1 and stage 2 depression diagnoses and EURO–D mean scores (standard deviations) by group and by region

Dementia Depression High education Low education
    EURO–D score 3.6 (3.1) 7.5 (2.5) 0.8 (1.4) 1.2 (1.6)
    Stage 1 34% 89% 4% 12%
    Stage 2 8% 65% 4% 5%
China and south-east Asia
    EURO–D score 2.6 (2.4) 8.3 (2.1) 1.2 (1.1) 1.8 (1.5)
    Stage 1 14% 90% 3% 6%
    Stage 2 2% 84% 3% 6%
Latin America and Caribbean
    EURO–D score 4.1 (2.8) 6.9 (2.7) 2.0 (2.0) 2.4 (2.1)
    Stage 1 51% 90% 22% 31%
    Stage 2 18% 74% 21% 27%
Nigeria (Anambra)
    EURO–D score 0.8 (1.2) 7.2 (2.3) 0.2 (0.6) 0.2 (0.4)
    Stage 1 5% 100% 0% 0%
    Stage 2 0% 87% 0% 0%
Russia (Moscow)
    EURO–D score 4.4 (2.7) 7.9 (2.3) 1.8 (1.9) 2.5 (2.5)
    Stage 1 36% 89% 7% 20%
    Stage 2 12% 89% 7% 20%

The distribution of the EURO-D scale, within diagnostic groups, was similar across the three main regions (Table 4). In each region the mean scores were much higher in the depression group than in the dementia or high- and low-education control groups. Internal consistency (Cronbach's α) was universally satisfactory. For India it was 0.91 (range for centres: 0.87-0.95), for Latin America and the Caribbean it was 0.83 (range for centres: 0.64-0.91) and for China and south-east Asia (both centres) it was 0.88. The other two regions were represented by only one centre each – Anambra in Africa (α=0.93) and Moscow in Russia (α=0.86). Principal component analysis was attempted for three regions: India; China and south-east Asia; and Latin America and the Caribbean. Two factor solutions were applied in each region following inspection of scree plots. Similar factors were extracted for India and for Latin America and the Caribbean (see Table 5), conforming to the affective suffering (depression, suicidality, tearfulness) and motivation (enjoyment, interest) factors previously reported for EURO-D (Reference Prince, Reischies and BeekmanPrince et al, 1999). In the Chinese centres all of these items loaded on a single factor, whereas the second factor was characterised by guilt and pessimism.

Table 5 Principal component analysis of EURO–D scale in each of three world regions

India (n=414) Latin America and the Caribbean (n=1242) China and south-east Asia (n=186)
Factor 1 Factor 2 Factor 1 Factor 2 Factor 1 Factor 2
Eigenvalue 6.1 1.0 4.3 1.0 6.0 1.1
Variance (%) 51 9 36 8 50 9
Depression 0.80 0.28 0.76 0.21 0.81 0.28
Suicidality 0.79 0.16 0.28 0.55 0.76 0.32
Guilt 0.04 0.61 -0.01 0.65 0.35 0.68
Interest 0.45 0.75 0.37 0.65 0.80 0.35
Appetite 0.58 0.43 0.56 0.21 0.81 0.05
Fatigue 0.59 0.30 0.49 0.39 0.83 0.15
Concentration 0.65 0.40 0.49 0.39 0.60 0.24
Tearfulness 0.81 0.22 0.79 -0.06 0.38 0.51
Sleep 0.72 0.16 0.48 0.27 0.77 0.11
Pessimism 0.82 0.26 0.48 0.41 0.40 -0.58
Irritability 0.21 0.58 0.22 0.45 0.36 0.55
Enjoyment 0.37 0.74 0.23 0.73 0.79 0.27


Across the developing-country centres included in this study, the GMS was highly effective at discriminating between dementia cases and high-education controls, therefore the data presented here are entirely consistent with earlier reports of the satisfactory validity of GMS/AGECAT when used in well-educated developed-country populations (Reference Livingston, Sax and WillisonLivingston et al, 1990; Reference Collinghan, MacDonald and HerzbergCollinghan et al, 1993). It was in this context that the GMS was first developed and the AGECAT algorithm calibrated. In the Medical Research Council Cognitive Function and Ageing Study (MRC CFAS; 1998) the age-specific prevalence of GMS/AGECAT organicity was very similar to that consistently reported from other major European and North American population-based surveys.

In developing countries the GMS is a useful adjunct to dementia diagnosis. Our earlier analyses have demonstrated that it adds to the discriminating power of an algorithm, including informant report of decline in cognitive and functional ability (from the CSI-D) and cognitive testing (from the CSI-D and the CERAD ten-word list learning test) (Reference Prince, Acosta and ChiuPrince et al, 2003). More detailed findings presented here underline a tendency for the GMS to overdiagnose dementia in low-education groups in some but not all centres, and for a relative insensitivity to the presence of dementia in others. Given that the items contributing to the AGECAT organicity algorithm can be used to generate an algorithm that is much less educationally biased, one can infer that, in Latin America, AGECAT gives more weight to some of those items that we have identified as relatively educationally biased and gives less weight to items that are sensitive to the presence of dementia.

Our data also suggest that the briefer B3 ‘community’ version of the GMS may, paradoxically, be a more valid assessment for dementia than the more comprehensive GMS-A3. In a few centres it would appear that ratings for the sections excluded from B3 (mania, obsessive-compulsive disorder, hypochondriasis and some ratings of hallu-cinations and delusions) were sub-optimal, giving rise to implausible diagnoses. Thus, in Guadalajara, Mexico, 43% of all dementia true cases were rated by stage 2 as cases of mania. Similar but less extreme problems were noted for some other Latin American centres. Extensive training was provided in all Latin American centres by the regional coordinator but it was not possible logistically after training to supervise directly the conduct of the research in each and every centre. Our collective experience as trainers is that those elements of the GMS-A3 version omitted in the B3 are the most problematic with respect to achieving reliable and accurate ratings, particularly with non-clinical interviewers. Given the low prevalence of these symptoms in community samples it would seem advisable to use the B3 version.


There is ample evidence from our data for the core validity of the AGECAT depression algorithm, at least with respect to its sensitivity to the relatively severe form of depression implied by our independent-clinician inclusion criterion of a MADRS score of 18 or over. It is possible that applying the diagnostic hierarchy in stage 2 may lead to misclassification of depression as dementia. Alternatively, given the typically high rates of dementia incidence in cases clinically diagnosed as depressive pseudodementia, ‘false positives’ may reflect an incipient dementia process that was not apparent to the independent clinician recruiting the depression cases. Misclassification will be more marked in low-education samples, and use of the AGECAT ‘patch’ should again remedy this problem. Alternatively, this pitfall may be avoided by using instead the non-hierarchical AGECAT stage 1 diagnosis; this strategy also permits analysis of comorbidity with dementia, which our data demonstrate to be a phenomenon prevalent in all of the countries and cultures under study.

The EURO-D scale, derived from just 12 GMS items and extensively validated across Europe, would seem to have similar internal validity properties in other cultures. The underlying two-factor solutions for the Indian and Latin American centres were both similar to each other and generally concordant with those derived previously across the 14 EURODEP European centres. The factor solution for the Chinese centres was somewhat different but difficult to interpret, given the small numbers studied.


Implications for future use of GMS/AGECAT

None of the comprehensive diagnostic assessments in common use in adult populations, whether structured or semi-structured, lay interviewer or clinician administered, has adequately addressed the problems posed by older people with organic conditions. Thus, for research in older populations the GMS remains deservedly popular. However, the GMS on its own was never intended to provide a formal diagnosis of dementia. For such a diagnosis the History and Aetiology Schedule (HAS) informant interview with HAS/AGECAT or the History and Aetiology Schedule – Dementia Diagnosis and Subtype (HAS-DDS) would have to be used (Reference Copeland, Prince and WilsonCopeland et al, 2002). Without these, the necessary criteria of cognitive and functional decline cannot be established. Neither is it possible to exclude delirium or stable chronic brain injury as an explanation for cognitive impairment; hence the AGECAT label of ‘organicity’ rather than ‘dementia’. Empirically, in developed countries the GMS/AGECAT organicity approximates closely to the clinical construct of dementia. In developing countries and other low-education populations, the focus in the GMS/AGECAT algorithm upon educationally biased cognitive test items risks overdiagnosis. In the 10/66 Dementia Research Group's previously published diagnostic algorithm (Reference Prince, Acosta and ChiuPrince et al, 2003) this tendency is corrected through education-fair cognitive assessment and informant history of cognitive and functional decline provided by the CSI-D. The GMS is a key element of the 10/66 algorithm because of its unique ability to discriminate between depression and dementia (Reference Prince, Acosta and ChiuPrince et al, 2003). If the GMS is to be used alone in developing-country and other low-education populations, then caution is indicated in interpreting the organicity output, which may not map as closely onto clinical dementia as in a developed-country population. Future users of the GMS, particularly in low-education populations, may, where resources permit and when dementia is a principal focus, wish to make use of the 10/66 diagnostic algorithm incorporating the CSI-D and CERAD ten-word list learning test (Reference Prince, Acosta and ChiuPrince et al, 2003). Others might wish to use the ‘patch’ provided in the form of the logistic regression coefficients included in this paper. Note, though, that we administered the GMS with the two other components of the 10/66 algorithm; thus, the remarkable discriminability of the interviewer judgement of the presence of memory impairment might be explained by global impressions, including information from these assessments. Similar discriminability may not be achieved when the GMS is used on its own. A revised AGECAT algorithm will provide a more robust long-term solution and it is towards this goal that we now direct our efforts.

More work is required to clarify the cross-cultural validity of GMS/AGECAT. This certainly should include the predictive validity of the organicity rating for future clinical deterioration. Clinicopathological correlation studies are superficially attractive but problematic. In the UK MRC CFAS study, GMS/AGECAT organicity diagnosis predicted the presence upon autopsy of neuropathological features associated with the most prevalent dementia sub-types – Alzheimer's disease, vascular dementia, Lewy-body dementia and frontotemporal dementia (Medical Research Council Cognitive Function and Ageing Study, 2001). However, these features were also prevalent among those who did not have an AGECAT organicity diagnosis in vivo. This may reflect upon the suitability of these pathological indicators as gold standards for clinical dementia diagnosis rather than on the specificity of the GMS/AGECAT algorithm.


10/66 Dementia Research Group

The 10/66 Dementia Research Group, part of Alzheimer's Disease International, is a collective of researchers from the developing and developed regions of the world. A full list of members with contact details can be found at The following members of the 10/66 Group participated as investigators in this project and can be considered jointly responsible for the development of the protocol, the data gathering, data analysis and the preparation of this report.

Coordinating Centre

Professor Martin Prince, 10/66 Coordinator, Institute of Psychiatry, London; Ms Seema Quraishi, 10/66 Administrator, Institute of Psychiatry, London; Professor John Copeland, University of Liverpool; Dr Michael Dewey, Institute of Psychiatry, London.

10/66 India (Regional Coordinator Additional Professor Mathew Varghese)

Bangalore: Professor Mathew Varghese, Dr Srikala Bharath, NIMHANS, Bangalore; Chennai (SCARF): Ms Latha Srinivasan, Dr R. Thara, Schizophrenia Research Foundation; Chennai (VHS): Mr Ravi Samuel, Dr E. S. Krishnamoorthy, Voluntary Health Services; Goa: Dr Vikram Patel, Sangath, Dr Amit Dias, Goa Medical College; Hyderabad: Dr K. Chandrasekhar, Dr M. Ajay Verma, Heritage Hospitals; Thrissur: Assistant Professor K. S. Shaji, Professor K. Praveen Lal, Medical College, Thrissur; Vellore: Professor K.S. Jacob, Dr Arockia Philip Raj, Christian Medical College.

10/66 China and ES Asia (Regional Coordinator Professor Helen Chiu)

China (Beijing): Professor Li Shuran, Dr Jin Liu, Beijing University; China (Hong Kong SAR): Professor Linda Lam, Dr Teresa Chan, Chinese University of Hong Kong; Taiwan (Taipei): Dr Shen-Ing Liu, Mackay Memorial Hospital, Professor P. K. Yip, National Taiwan University Hospital.

10/66 Latin America and Caribbean (Regional Coordinators Dr Daisy Acosta (Dominican Republic) and Dr Marcia Scazufca (Brazil))

Argentina (Buenos Aires): Dr Raúl Luciano Arizaga, Hospital Santojanni (GCBA), Dr Ricardo F. Allegri, Hospital Zubizarreta (GBCA Y CONICET); Brazil (São Paulo): Dr Marcia Scazufca, Dr Paulo Rossi Menezes, Universidade de São Paulo; Brazil (Botucatu): Dr Ana Teresa de A.R. Cerquerira, Botucatu Medical School, UNESP; Brazil (São Jose do Rio Preto): M. Cristina O. S. Miyazaki, Neide A. Micelli Domingos, FAMERP Medical School; Chile (Santiago/Concepción/Valparaiso): Dr Patricio Fuentes, G. Hospital Del Salvador, Santiago, Dr Pilar Quoroga, L. Universidad de Concepción; Concepción; Cuba (Havana): Dr Juan de J. Llibre Rodriguez, Dr Hector Bayarre Vea, Facultad de Medicina ‘Finlay-Albarran’, Universidad Medica de la Habana; Dominican Republic (Santo Domingo): Dr Daisy Acosta, Universidad Nacional Pedro Henriquez Ureña (UNPHU), Lic. Guillermina Rodriguez, Asociación; Dominicana de Alzheimer (ADA); Guatemala (Guatemala City): Dr Carlos A. Mayorga Ruiz, Dr Mario Luna de Floran; Mexico (Mexico City): Dr Ana Luisa Sosa, Dr Yaneth Rodriguez Agudelo, National Institute of Neurology and Neurosurgery; Mexico (Guadalajara): Dr Genaro G. Ortiz, Lab Desarrollo/Envejecimiento, CIBO/IMSS, Dr Elva D. Arias-Merino, Gerontologia, Universidad de Guadalajara; Panama (Panama City): Dr Gloriela R. de Alba, Paitilla Medical Center Hospital, Dr Gloria Grimaldo, Santa Fe Hospital; Peru (Lima): Dr Mariella Guerra, Instituto Nacional de Salud Mental ‘Honorio Delgado-Hideyo Noguchi’, Universidad Peruana Cauetano Heredia, M. Victor González, Instituto Peruano de Seguridad Social, ESSALUD; Uruguay (Montevideo): Dr Roberto Ventura, Dr Nair Raciope, University of Uruguay; Venezuela (Caracas): Dr Aquiles Salas, Universidad Central de Venezuela, Faculty of Medicine, Dr Ciro Gaona Yánez, Fundación; Alzheimer's Venezuela.

10/66 Africa

Nigeria (Anambra): Dr Richard Uwakwe, Nnamdi Azikiwe University Teaching Hospital.

10/66 Russia

Moscow (Russia): Professor Svetlana Gavrilova, Dr Grigory Jarikov, Alzheimer's Disease Research Center, Mental Health Research Center of Russian Academy of Medical Sciences.

Clinical Implications and Limitations


  1. The Geriatric Mental State (GMS) and its AGECAT computerised algorithm may overdiagnose dementia in developing countries and other low-education populations.

  2. Testing for orientation to year and knowledge of a country's political leaders are particularly educationally biased.

  3. The EURO–D scale, derived from GMS depression items, has a common underlying factor structure across several continents.


  1. Small sample sizes in each centre imply some imprecision in the estimation of sensitivity and false-positive rates.

  2. Although interviewers were blind to diagnosis, their GMS ratings may have been influenced by knowledge of other cognitive assessments administered in the same sitting.

  3. We were able to study only the sensitivity and not the specificity of the GMS depression diagnosis.


Declaration of interest



American Psychiatric Association (1994) Diagnostic and Statistical Manual of Mental Disorders (4th edn) (DSM–IV). Washington, DC: APA.Google Scholar
Collinghan, G., MacDonald, A., Herzberg, J., et al (1993) An evaluation of the multidisciplinary approach to psychiatric diagnosis in elderly people. BMJ, 306, 821824.CrossRefGoogle Scholar
Copeland, J. R., Dewey, M. E. & Griffith-Jones, H. M. (1986) A computerised psychiatric diagnostic system and case nomenclature for elderly subjects: GMS and AGECAT. Psychological Medicine, 16, 8999.CrossRefGoogle Scholar
Copeland, J. R., Prince, M., Wilson, K. C., et al (2002) The Geriatric Mental State Examination in the 21st century. International Journal of Geriatric Psychiatry, 17, 729732.CrossRefGoogle ScholarPubMed
Ganguli, M., Chandra, V. & Gilbey, J. (1996) Cognitive test performance in a community-based non demented elderly sample in rural India: the Indo-US cross national dementia epidemiology study. International Psychogeriatrics, 8, 507524.CrossRefGoogle Scholar
Hall, K. S., Hendrie, H. H., Brittain, H. M., et al (1993) The development of a dementia screening interview in two distinct languages. International Journal of Methods in Psychiatric Research, 3, 128.Google Scholar
Livingston, G., Sax, K., Willison, J., et al (1990) The Gospel Oak Study stage II: the diagnosis of dementia in the community. Psychological Medicine, 20, 137146.CrossRefGoogle Scholar
Medical Research Council Cognitive Function and Ageing Study (MRC CFAS) (1998) Cognitive function and dementia in six areas of England and Wales: the distribution of MMSE and prevalence of GMS organicity level in the MRC CFA Study. Psychological Medicine, 28, 319335.CrossRefGoogle Scholar
Medical Research Council Cognitive Function and Ageing Study Neuropathology Group (MRC CFAS) (2001) Pathological correlates of late-onset dementia in a multicentre, community-based population in England and Wales. Lancet, 357, 169175.CrossRefGoogle Scholar
Montgomery, S. A. & Åsberg, M. (1979) A new depression scale designed to be sensitive to change. British Journal of Psychiatry, 134, 382389.CrossRefGoogle ScholarPubMed
Morris, J. C. (1993) The Clinical Dementia Rating (CDR): current version and scoring rules. Neurology, 43, 24122414.CrossRefGoogle ScholarPubMed
Prince, M. J., Reischies, F., Beekman, A. T. F., et al (1999) The development of the EURO–D scale – a European Union initiative to compare symptoms of depression in 14 European centres. British Journal of Psychiatry, 174, 330338.CrossRefGoogle ScholarPubMed
Prince, M., Acosta, D., Chiu, H., et al (2003) Dementia diagnosis in developing countries: a cross-cultural validation study. Lancet, 361, 909917.CrossRefGoogle ScholarPubMed
Figure 0

Table 1 Discriminability of the Geriatric Mental State (GMS) A3 ‘organicity’ AGECAT stage 2 diagnosis and of the new algorithm derived from GMS organicity items (see also Tables 2 and 3)

Figure 1

Table 2 Discriminability by region of the Geriatric Mental State (GMS) organicity items

Figure 2

Table 3 Predictive model for the diagnosis of dementia derived from the Geriatric Mental State (GMS) organicity items using logistic regression

Figure 3

Table 4 Prevalence of Geriatric Mental State (GMS) AGECAT stage 1 and stage 2 depression diagnoses and EURO–D mean scores (standard deviations) by group and by region

Figure 4

Table 5 Principal component analysis of EURO–D scale in each of three world regions

Submit a response


No eLetters have been published for this article.
You have Access
Cited by

Send article to Kindle

To send this article to your Kindle, first ensure is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about sending to your Kindle. Find out more about sending to your Kindle.

Note you can select to send to either the or variations. ‘’ emails are free but can only be sent to your device when it is connected to wi-fi. ‘’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Effects of education and culture on the validity of the Geriatric Mental State and its AGECAT algorithm
Available formats

Send article to Dropbox

To send this article to your Dropbox account, please select one or more formats and confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your <service> account. Find out more about sending content to Dropbox.

Effects of education and culture on the validity of the Geriatric Mental State and its AGECAT algorithm
Available formats

Send article to Google Drive

To send this article to your Google Drive account, please select one or more formats and confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your <service> account. Find out more about sending content to Google Drive.

Effects of education and culture on the validity of the Geriatric Mental State and its AGECAT algorithm
Available formats

Reply to: Submit a response

Please enter your response.

Your details

Please enter a valid email address.

Conflicting interests

Do you have any conflicting interests? *