Skip to main content Accessibility help
Hostname: page-component-99c86f546-qdp55 Total loading time: 0.85 Render date: 2021-12-05T06:43:03.027Z Has data issue: true Feature Flags: { "shouldUseShareProductTool": true, "shouldUseHypothesis": true, "isUnsiloEnabled": true, "metricsAbstractViews": false, "figures": true, "newCiteModal": false, "newCitedByModal": true, "newEcommerce": true, "newUsageEvents": true }

15 - Planning Data Analysis

Published online by Cambridge University Press:  19 September 2019

Joanna M. Setchell
Durham University
Get access


We use statistical analyses to test our predictions using the measures we collect for our sample. Like all aspects of study design, we need to think carefully about our choice of analytical approach. Planning our data analysis in detail, before we collect your data, helps to determine what data we need to collect. It is very common to rush past the analysis plan and dive straight into collecting data. This is partly because statistics are not intuitive and can be intimidating. However, statistical analysis is an integral part of study design. We must understand statistics to understand the strengths, limitations, and potential biases of any research. This may seem daunting, but our understanding of statistics determines the quality of a study. The more we think about this now, the better our study will be. I begin this chapter with how to determine what sort of analyses we need and the need to consult a statistician when we design a study. Next, I cover problems associated with multiple testing and assessing multiple predictor variables. I explain how to prepare an analysis plan and suggest pre-registration.

Studying Primates
How to Design, Conduct and Report Primatological Research
, pp. 185 - 206
Publisher: Cambridge University Press
Print publication year: 2019

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)


Barnard, C, Gilbert, F, McGregor, P. 2017. Asking Questions in Biology: A Guide to Hypothesis Testing, Experimental Design and Presentation in Practical Work and Research Projects. 5th edn. Harlow, Essex: Benjamin Cummings. Chapter 3 covers statistical analysis and hypothesis testing. Also includes quick test-finders at the end of the book, linking statistical tests to study design. Recommends step-wise regression but don’t follow this advice.Google Scholar
Beckerman, A, Petchey, O, Childs, D. 2017. Getting Started with R. Oxford: Oxford University Press. An introductory guide to the standard software for statistical analyses and graphical presentation of data. Includes how to import, explore, plot, and analyse data.CrossRefGoogle Scholar
Burnham, KP, Anderson, DR. 2002. Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach. 2nd edn. New York: Springer-Verlag. An introduction to information-theoretic approaches to model selection and multi-model inference.Google Scholar
Darlington, RB, Smulders, TV. 2001. Problems with residual analysis. Animal Behaviour 62: 599602. Explains sources of bias in residual analysis, and recommends multiple regression.CrossRefGoogle Scholar
Field, A, Hole, G. 2003. How to Design and Report Experiments. London: Sage Publications Ltd. Part 2 covers analysing and interpreting data.Google Scholar
Field, A, Miles, J, Field, Z. 2012. Discovering Statistics Using R. SAGE Publications Ltd. Includes the logic behind tests, how to do them, and how to report the results, with worked examples. Uses theory for what I term a hypothesis and hypothesis for what I term predictions.Google Scholar
Freckleton, RP. 2002. On the misuse of residuals in ecology: regression of residuals vs. multiple regression. Journal of Animal Ecology 71: 542545. Explains why we shouldn’t treat the residuals from regression as data in further analysis.CrossRefGoogle Scholar
Garamszegi, LZ. 2011. Special Issue on ‘Model selection, multimodel inference and information-theoretic approaches in behavioural ecology’. Behavioral Ecology and Sociobiology 65: 1116. A collection of articles from authors with different viewpoints on how to analyse data with multiple predictors. Essential reading if you intend to use an information-theory approach.CrossRefGoogle Scholar
Grueber, CE, Nakagawa, S, Laws, RJ, Jamieson, IG. 2011. Multimodel inference in ecology and evolution: Challenges and solutions. Journal of Evolutionary Biology 24: 699711. An overview of multi-model inference, common challenges faced when using the information theoretic approach and potential solutions where they exist.CrossRefGoogle ScholarPubMed
Harrison, XA, Donaldson, L, Correa-Cano, ME, Evans, J, Fisher, DN, Goodwin, C, Robinson, BS, Hodgson, DJ, Inger, R. 2018. A brief introduction to mixed effects modelling and multi-model inference in ecology. PeerJ 6: e4794. doi:10.7717/peerj.4794. A best practice guide for the application of mixed effects models and model selection in ecological studies.CrossRefGoogle ScholarPubMed
Janson, C. 2012. Reconciling rigor and range: observations, experiments, and quasi-experiments in field primatology. International Journal of Primatology 33: 520541.–011-9550-7. Includes the use of random-effects models to account for repeated and uneven sampling of the same individuals.CrossRefGoogle Scholar
Kronmal, RA. 1993. Spurious correlation and the fallacy of the ratio standard revisited Journal of the Royal Statistical Society. Series A (Statistics in Society). 156: 379392. Shows that using ratios in regression analyses can lead to incorrect or misleading inferences.CrossRefGoogle Scholar
Kroodsma, DE, Byers, BE, Goodale, E, Johnson, S, Liu, W-C. 2001. Pseudoreplication in playback experiments, revisited a decade later. Animal Behaviour 61: 10291033. Examines the issue of pseudo-replication in playback experiments and the need for multiple stimuli to represent a class of stimuli.CrossRefGoogle Scholar
Mangiafico, SS. 2015. An R Companion for the Handbook of Biological Statistics, version 1.3.2. Pdf version: Notes on getting started in R, and R code for many of the examples given in McDonald’s Handbook of Biological Statistics (below).
McDonald, JH. 2014. Handbook of Biological Statistics. 3rd edn. Baltimore, MD: Sparky House Publishing. Available free online at and as a pdf. A very useful introduction to statistics, including links to spreadsheets to calculate simple tests.Google Scholar
McElreath, R. 2015. Statistical Rethinking: A Bayesian Course with Examples in R and Stan. Boca Raton, FL: CRC Press. An excellent, pragmatic introduction to the reasoning underlying Bayesian inference, from basic probability to generalised linear multilevel modelling. Recorded lectures available online.Google Scholar
McGregor, PK. 2000. Playback experiments: Design and analysis. Acta Ethologica 3: 38. Examines issues in the design of playback experiments, including pseudoreplication.CrossRefGoogle Scholar
Mundry, R, Nunn, R. 2009. Stepwise model fitting and statistical inference: Turning noise into signal pollution. American Naturalist 173: 119123. Shows why we should not use significance tests based on stepwise procedures.CrossRefGoogle ScholarPubMed
Mundry, R, Sommer, C. 2007. Discriminant function analysis with non-independent data: Consequences and an alternative. Animal Behaviour 74: 965976. Explains that including non-independent data in a discriminant function analysis yields incorrect results, and provides a permutation-based test that copes with such datasets.CrossRefGoogle Scholar
Nosek, BA, Ebersole, CR, DeHaven, AC, Mellor, DT. 2018. The preregistration revolution. Proceedings of the National Academy of Sciences of the United States of America 115: 26002606. Includes the idea of a decision tree defining a sequence of tests and decision rules at each stage of analysis.CrossRefGoogle ScholarPubMed
Ruxton, GB, Neuhäuser, M. 2010. When should we use one-tailed hypothesis testing? Methods in Ecology and Evolution 1: 114117. Highlights how one-tailed tests are often used without clear justification.CrossRefGoogle Scholar
Stamp, Dawkins M. 2007. Observing Animal Behaviour: Design and Analysis of Quantitative Data. Oxford: Oxford University Press. Chapter 4 addresses independence of data and matching each case to itself.CrossRefGoogle Scholar
Waller, B, Warmelink, L, Liebal, K, Micheletta, , Slocombe, KE. 2013. Pseudoreplication: A widespread problem in primate communication research. Animal Behaviour 86: 483488. Highlights the problem of pseudoreplication in primate communication studies.CrossRefGoogle Scholar

Send book to Kindle

To send this book to your Kindle, first ensure is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about sending to your Kindle.

Note you can select to send to either the or variations. ‘’ emails are free but can only be sent to your device when it is connected to wi-fi. ‘’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats

Send book to Dropbox

To send content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about sending content to Dropbox.

Available formats

Send book to Google Drive

To send content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about sending content to Google Drive.

Available formats