We use cookies to distinguish you from other users and to provide you with a better experience on our websites. Close this message to accept cookies or find out how to manage your cookie settings.
To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Confounding refers to a mixing or muddling of effects that can occur when the relationship we are interested in is confused by the effect of something else. It arises when the groups we are comparing are not completely exchangeable and so differ with respect to factors other than their exposure status. If one (or more) of these other factors is a cause of both the exposure and the outcome, then some or all of an observed association between the exposure and outcome may be due to that factor.
In this chapter, we look at the analytic studies that are our main tools for identifying the causes of disease and evaluating health interventions. Unlike descriptive epidemiology, analytic studies involve planned comparisons between people with and without disease, or between people with and without exposures thought to cause (or prevent) disease. They try to answer the questions, ‘Why do some people develop disease?’ and ‘How strong is the association between exposure and outcome?’. This group of studies includes the intervention, cohort and case–control studies that you met briefly in Chapter 1. Together, descriptive and analytic epidemiology provide information for all stages of health planning, from the identification of problems and their causes to the design, funding and implementation of public health solutions and the evaluation of whether these solutions really work and are cost-effective in practice.
People live complicated lives and, unlike laboratory scientists who can control all aspects of their experiments, epidemiologists have to work with that complexity. As a result, no epidemiological study can ever be perfect. Even an apparently straightforward survey of, say, alcohol consumption in a community, can be fraught with problems. Who should be included in the survey? How do you measure alcohol consumption reliably? All we can do when we conduct a study is aim to minimise error as far as possible, and then assess the practical effects of any unavoidable error. A critical aspect of epidemiology is, therefore, the ability to recognise potential sources of error and, more importantly, to assess the likely effects of any error, both in your own work and in the work of others. If we publish or use flawed or biased research we spread misinformation that could hinder decision-making, harm patients and adversely affect health policy. Future research may also be misdirected, delaying discoveries that can enhance public health.
If the results of a study reveal an interesting association between an exposure and a health outcome, there is a natural tendency to assume that it is real. (Note: we are considering whether two things are associated. This does not imply that one causes the other to occur.) However, before we can even contemplate this possibility we have to try to rule out other possible explanations for the results. There are three main ‘alternative explanations’ that we have to consider whenever we analyse epidemiological data or read the reports of others, whatever the study design; namely, could the results be due to chance, bias or error, or confounding? We discuss the first of these, chance, in this chapter and cover bias and confounding in Chapters 7 and 8, respectively.
When we speak of prevention in the context of public health, we usually think of what is sometimes called ‘primary prevention’, which aims to prevent disease from occurring in the first place; that is, to reduce the incidence of disease. Vaccination against childhood infectious diseases is a good example of primary prevention, as is the use of sunscreen to prevent the development of skin cancer. However, somewhat confusingly, the term ‘prevention’ is also used to describe other strategies to control disease. One of these is the use of screening to advance diagnosis to a point at which intervention is more effective, often described as ‘secondary prevention’. What is sometimes called ‘tertiary prevention’ is even more remote from the everyday concept of prevention, usually implying efforts to limit disease progression or the provision of better rehabilitation to enhance quality of life among those who have been diagnosed with a disease.
The importance of simple descriptive data was recognised by William Farr, whom we mentioned briefly in Chapter 1 for his seminal work using the newly established vital statistics register of England in the nineteenth century. As we discussed in Chapter 1, this descriptive epidemiology, concerned as it is with ‘person, place and time’, attempts to answer the questions ‘Who?’, ‘What?’, ‘Where?’ and ‘When?’. This can include anything from a description of disease in a single person (a case report) or a special survey conducted to measure the prevalence of a particular health issue in a specific population, to reports from national surveys and data collection systems showing how rates of disease or other health-related factors vary in different geographical areas or over time (time trends). In this chapter we look in more detail at some of the most common types of descriptive data and where they come from. However, before embarking on a data hunt, we first need to decide exactly what it is we want to know, and this can pose a challenge. To make good use of the most relevant descriptive data, it is critical to formulate our question as precisely as possible.
In this chapter we look at the ways in which we calculate, use and interpret ‘measures of association’, so-called because they describe the association between an exposure and a health outcome. An understanding of these measures will help you to interpret reports on the causes of ill health and the effects of particular exposures or interventions on the burden of illness in a community. Note that, while we discuss the measures in the context of an ‘exposure’ and ‘disease’, they can be used to assess the association between any measure of health status and any potential ‘cause’.
The search for the causes of disease is an obvious central step in the pursuit of better health through disease prevention. In the previous chapters we looked at how we measure health (or disease) and how we look for associations between exposure and disease. Being able to identify a relation between a potential cause of disease and the disease itself is not enough, though. If our goal is to change practice or policy in order to improve health, then we need to go one step further and decide whether the relation is causal because, if it is not, intervening will have no effect. As in previous chapters, we discuss causation mainly in the context of an exposure causing disease but, as you will see when we come to assessing causation in practice, the concepts apply equally to a consideration of whether a potential preventive measure really does improve health.
The goal of public health is to improve the overall health of a population by reducing the burden of disease and premature death. In order to monitor our progress towards eliminating existing problems and to identify the emergence of new problems, we need to be able to quantify the levels of ill health or disease in a population. Researchers and policy makers use many different measures to describe the health of populations. In this chapter we introduce more of the most commonly used measures so that you can use and interpret them correctly. We first discuss the three fundamental measures that underlie both the attack rate and most of the other health statistics that you will come across in health-related reports, the incidence rate, incidence proportion (also called risk or cumulative incidence) and prevalence, and then look at how they are calculated and used in practice. We finish by considering other, more elaborate measures that attempt to get closer to describing the overall health of a population. As you will see, this is not always as straightforward as it might seem.