Guidelines for Data Acquisition, Quality and Curation for Observational Research Designs (DAQCORD)

Ari Ercole; Vibeke Brinck; Pradeep George; Ramona Hicks; Jilske Huijben; Michael Jarrett; Mary Vassar; Lindsay Wilson; the DAQCORD collaborators

doi:10.1017/cts.2020.24

Guidelines for Data Acquisition, Quality and Curation for Observational Research Designs (DAQCORD)

Published online by Cambridge University Press: 13 March 2020

and

the DAQCORD collaborators

Show author details

Ari Ercole*: Affiliation:
Department of Medicine, Division of Anaesthesia, University of Cambridge, Cambridge, UK
Vibeke Brinck: Affiliation:
QuesGen Systems, Inc, Burlingame, CA, USA
Pradeep George: Affiliation:
International Neuroinformatics Coordinating Facility, Karolinska Institutet, Stockholm, Sweden
Ramona Hicks: Affiliation:
One Mind, Rutherford, CA, USA
Jilske Huijben: Affiliation:
Department of Public Health, Center for Medical Decision Sciences, Erasmus MC, Rotterdam, The Netherlands
Michael Jarrett: Affiliation:
QuesGen Systems, Inc, Burlingame, CA, USA
Mary Vassar: Affiliation:
Department of Neurological Surgery, University of California, San Francisco, CA, USA
Lindsay Wilson: Affiliation:
Division of Psychology, University of Stirling, Stirling, UK
*: Address for correspondence: A. Ercole, PhD, Department of Medicine, Division of Anaesthesia, University of Cambridge, Addenbrookeʼs Hospital, CambridgeCB2 0QQ, UK. Email: ae105@cam.ac.uk

Article contents

Abstract
Background:
Methods:
Results:
Conclusion:
Introduction
Methods
Results
Discussion
Disclosures
Supplementary material
References

Rights & Permissions

Abstract

Background:

High-quality data are critical to the entire scientific enterprise, yet the complexity and effort involved in data curation are vastly under-appreciated. This is especially true for large observational, clinical studies because of the amount of multimodal data that is captured and the opportunity for addressing numerous research questions through analysis, either alone or in combination with other data sets. However, a lack of details concerning data curation methods can result in unresolved questions about the robustness of the data, its utility for addressing specific research questions or hypotheses and how to interpret the results. We aimed to develop a framework for the design, documentation and reporting of data curation methods in order to advance the scientific rigour, reproducibility and analysis of the data.

Methods:

Forty-six experts participated in a modified Delphi process to reach consensus on indicators of data curation that could be used in the design and reporting of studies.

Results:

We identified 46 indicators that are applicable to the design, training/testing, run time and post-collection phases of studies.

Conclusion:

The Data Acquisition, Quality and Curation for Observational Research Designs (DAQCORD) Guidelines are the first comprehensive set of data quality indicators for large observational studies. They were developed around the needs of neuroscience projects, but we believe they are relevant and generalisable, in whole or in part, to other fields of health research, and also to smaller observational studies and preclinical research. The DAQCORD Guidelines provide a framework for achieving high-quality data; a cornerstone of health research.

Keywords

Data quality curation observational studies Delphi process design reporting

Type: Research Article
Information: Journal of Clinical and Translational Science , Volume 4 , Issue 4 , August 2020 , pp. 354 - 359

DOI: https://doi.org/10.1017/cts.2020.24 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: © The Association for Clinical and Translational Science 2020

Introduction

Observational studies are a crucial part of the biomedical research armamentarium, particularly when studying complex conditions or the related problem of understanding the outcomes of interventions in highly heterogeneous real-world populations [Reference Ligthelm1]. As well as generalisability, the cost-benefit ratio of enrolling a subject in observational studies is relatively low, which makes feasible the recruitment of large samples potentially needed to reliably identify modest, but clinically important differences. This scalability, alongside the availability of electronic case-report form (eCRF) platforms and increasing availability of routinely collected data in electronic form, means that it is possible to devise large, multicentre/multinational observational projects.

With open or shared access to data becoming increasingly common, including with funding agencies, it is likely that large observational data sets will become important resources for future secondary analysis by external investigators. For example, a recent comparative effectiveness study in traumatic brain injury [Reference Maas2] was designed to prospectively acquire demographic, longitudinal clinical intervention, outcome, biomarker, ‘omics, imaging and waveform data in 5400 patients in 3 strata from multiple sites in 22 countries. This data set alone comprises more than 2500 discrete data concepts, but in addition, it is designed to be compatible with data from sister studies in the USA, Australia, India and China. This combination of scale, structure and data types makes such initiatives highly complex, technical challenges.

Even electronically collected clinical data may comprise a diverse mixture of data types and sources, and combinations of single, repeated measures, as well as time series, which may be irregularly sampled. Combining this with ‘omics, waveform recordings or imaging data introduces yet another tier of structural complexity. The involvement of multiple sites, particularly where these are international, may introduce further data variances due to local interpretation of procedures and linguistic and cultural misunderstandings. Notwithstanding incomplete data standards, real-world data from even a well-conducted study will inevitably contain errors or limitations that can only be understood in the context of the precise study structure. An understanding of this is crucial to making robust inferences and therefore also to repeatability. Furthermore, without detailed metadata, this knowledge can reside only with the study team, limiting transparency and making secondary analysis potentially subject to bias or other misinterpretations.

Data curation is clearly important, but the complexity and effort involved are under-appreciated and this may have serious scientific repercussions on the entire data sharing/open science enterprise. Poor attention to detail from design through execution including quality control and curation may severely limit data interpretation and consequently reuse and transparency. For prospective studies, post factum curation may improve data usability but retrospective correction of issues that emerge during the collection period is at best time-consuming and may be impossible. Thus, data quality efforts should start at the study design phase. Even the timely detection of emergent data quality issues is predicated on an understanding of both the data structure and study structure and will be severely hampered if these are not carefully specified.

Since a lack of attention to data quality and curation throughout the study may not only degrade data quality but also limit the validity of primary and subsequent analyses, an appraisal of this is important in evaluating study quality. Initiatives such as the Strengthening the Reporting of Observational studies in Epidemiology (STROBE) guidelines [Reference von Elm3] aim to improve transparency and reproducibility in observational research. However, STROBE primarily addresses crucial conceptual and statistical rigour. A more recent extension to STROBE, the REporting of studies Conducted using Observational Routinely-collected Data (RECORD) checklist [Reference Benchimol4], touches on data quality in the context of routine data. However, neither of these excellent initiatives directly address the equally critical question of the extent or adequacy of the steps taken to ensure the data are high quality, or to more fully inform a reader of any potential limitations to the analysis resulting from the curation process. This also means that study designers lack a prospective framework from which to devise (and budget) the necessary comprehensive data quality strategy at study conception and design.

The Data Acquisition, Quality and Curation for Observational Research Designs (DAQCORD) Guidelines were developed for investigators conducting large observational research studies to aid the design, documentation and reporting of practices for assuring data quality within their studies. This information is intended to provide guidance and a transparent reporting framework for improving data quality and data sharing. Given the absence of a structured framework for the description and appraisal of the collection and curation process, the DAQCORD Collaboration aims to address these issues and has three key aims.

1. To provide a framework/toolkit for robust study design (and eCRF design in particular) and data quality management.
2. To provide a framework by which proposed study plans can be systematically appraised (for example, by funding organisations) in terms of their approach to data quality.
3. To provide a reporting framework with which to describe the steps taken to ensure data quality in the final study publication.

Methods

Development of the DAQCORD Indicators

The DAQCORD project was initiated in 2017, originally arising from discussion of data management issues in the InTBIR [Reference Tosetti5] consortium. This consortium includes observational studies which are representative of the most ambitious staged to date in the field of traumatic brain injury with respect to the number of patients and complexity of the data collected. Funding/technical support was obtained to facilitate a face-to-face consensus meeting as well as the necessary survey infrastructure and website (www.daqcord.org). Our methodology was designed in accordance with best practice published by the Equator network [Reference Simera6] with which the initiative was registered. We formed a Steering Committee consisting of seven individuals with professional backgrounds in informatics and data management and/or experience in data curation/data set design in large-scale observational studies. A summary of the steps involved in developing the DAQCORD indicators is shown in Figure 1.

Fig. 1. Flow diagram for the DAQCORD-modified Delphi process.

The Steering Committee performed a search of literature for relevant publications on data quality methodology for large observational and heterogeneous studies. Sources consulted included PubMed, Ovid-Medline, Web of Science and Google Scholar, and we followed this up by hand searching specific journals. The search identified a range of informing literature, including a body of work concerning data collected during routine care [Reference Weiskopf and Weng7–Reference Kahn10]; however, we were unable to identify any peer-reviewed publications giving systematic practical advice on data quality methodology for observational studies (i.e. studies with a typical cycle of design, implementation and post-collection). The Steering Committee generated an initial set of 106 items potentially relevant to data quality that were derived from published sources, including transferable concepts identified by the Steering Committee from our literature search [Reference Ene-Iordache11–Reference Wang17], unpublished manuals on data curation provided by studies within the InTBIR consortium, previously published Equator guidelines, and from personal experience. We carried out an initial exercise within the Steering Committee to categorise questions on the data quality factors of completeness, correctness, concordance, plausibility and currency (Weiskopf and Weng [Reference Weiskopf and Weng7], see Table 1 for definitions of these terms) and evaluate the importance of individual items. Items were reviewed for duplication and overlap and were removed or re-written as necessary. As a result of this initial exercise, the number of items was reduced to 68 and the remaining items were edited for clarity.

Table 1. Key terms and concepts

The Steering Committee agreed a Delphi approach to reach consensus on the DAQCORD tool was appropriate, with the modification of having a face-to-face meeting of the panel in addition to circulation of material. A meeting was judged vital to allow in-depth discussion of the aims and outcomes of the project as well as the criteria and boundaries applied to item selection. The 68 items were collated into an online structured questionnaire for rating by panel members, and a consensus conference was held in September 2018 at the National Institutes of Health, Bethesda. There was a range of expertise among the 46 panel participants, including 9 bioinformaticists/computer scientists, 8 data managers/data scientists, 7 epidemiologists/statisticians, 15 clinician/researchers and 7 biomedical scientists. The majority were from the USA (29), with 9 from Europe and 8 from Canada. Participants also represented a range of organisations, including 33 from academia, 8 from government, 3 from non-profit organisations and 2 from industry. Respondents were chosen to be representative of a range of career stages from principal investigators to earlier stage researchers.

At the consensus meeting, we discussed the criteria used to assess the suitability of items for assessing data quality; the criteria agreed were validity, feasibility and action ability. The three criteria were elaborated as follows: “validity” means that “the metric is likely to reflect data quality”, “feasibility” means “this is something that can be measured or assessed and is quantifiable”, and “action ability” means that “improving this metric could be used in practice to make changes to a study that improves data quality”. We also discussed whether additional items were required, the potential applications of the instrument, and strategies for disseminating the outputs of the project. The consensus meeting allowed greater convergence on key issues and more detailed feedback on responses than would have been possible using only online questionnaires.

In the separate rounds of the Delphi, panel members rated items on whether they met each criterion using a Likert-type scale from 1 (strongly disagree) to 5 (strongly agree). A formal procedure was agreed for adopting and rejecting items on the basis of ratings which was in keeping with methods which have been previously employed and found to provide consensus [Reference Huijben18,Reference Rietjens19]. A median score ≥ 4 for agreement was considered a good rating for the dimension, while ≤3 was a neutral or poor rating. In addition, an interquartile range of 0 or 1 was regarded as very good consensus on the rating, 2 as good consensus and more than 2 as a lack of consensus. To be accepted, an item needed a good rating on each dimension and a good consensus on each rating (or very good consensus for the “validity” dimension), items were rejected if they had a low rating on one or more dimensions with good consensus and otherwise they were carried forward to the next round. The criteria adopted for “validity” ratings were stricter because this dimension was regarded as critical to the usefulness of the item. No upper or lower boundary was set on the number of items that would be accepted. Respondents could also make free text comments, which were included in the feedback to participants. At each stage, items were also edited for precision or duplication as a result of responses from participants. Respondents were able to see results for each item in each domain from previous rounds.

Results

The Delphi process converged on 46 items after 3 rounds that were judged to be indicators of data quality (see Figure 1). The 46 items (henceforth referred to as indicators) included in the final set all had median ratings for validity, feasibility and action ability of 4 or 5 indicating agreement or strong agreement that the component met the criterion. All the indicators also showed good consensus after three rounds. The final DAQCORD components are categorised and listed by data quality factors (i.e. completeness, correctness, concordance, plausibility and currency) with the relevant study phase for implementation noted in a separate column (see Table 2). Supplementary material, including the DAQCORD indicators with examples derived from the Delphi exercise, is also presented online (https://www.daqcord.org/daqcord-questions/).

Table 2. DACQORD indicators

The DAQCORD indicators are intended as a descriptive system for planning and reporting observational studies. At a minimum, they can be used as a checklist for documenting whether an indicator is being addressed fully, partially or not at all. A more extended and informative record can be made by users through creation of a brief narrative for each indicator describing how this was addressed for their study. The resulting text will provide formal documentation of the data quality steps taken for the study, which will serve as an evidential record that can inform funders and the research community.

Discussion

The DAQCORD Guidelines were developed to help authors in reporting on large observational studies and to assist readers and reviewers in appraising data quality in published studies and of the data set as a whole. Furthermore, the Guidelines aim to provide a prospective framework to encourage comprehensive best practice in the design of a data quality strategy from the outset to ensure that the data ultimately collected is of as high quality as possible, to streamline and limit the need for costly retrospective curation, as well as to improve transparency and facilitate meaningful open access and reuse. It may also provide a structure for funding agency review of proposed data quality strategies.

DAQCORD was developed by a panel selected for its comprehensive expertise in the practical design and issues encountered in large data-heavy observational studies. It is likely that observational data sets will grow in complexity and scope in the future, and it is conceivable that new challenges (or indeed data platforms and standards) will emerge and consequently DAQCORD will need to be revised in the light of such developments.

Observational studies are, by their nature, heterogeneous in their domains; aims and scope and therefore not all elements will be relevant to all study designs. At the same time, we believe that where they are applicable, the indicators that we have developed provide a systematic framework for addressing potential data quality issues. It is not our aim to prescriptively specify the steps necessary for all studies. Indeed, given the heterogeneity of such studies, we do not believe that this is possible. There may be many, equally valid, ways in which a particular study may address (or demonstrate that it has addressed) any particular aspect of data curation. As part of the Delphi process, we also gathered examples of possible best practice for each indicator: these are available online to serve as a guide and further elaboration. We also envision this a “living resource”, which could be expanded on to include more indicators for selected types of data, i.e., electronic health records, preclinical research, qualitative data (e.g. derived from interviews and surveys), neuroimaging, biospecimens, continuous physiological measurements, etc.

The indicators are weighted towards measures that should be implemented at design time. In our experience, the challenges presented by large-scale projects may be under-appreciated at project inception. In particular, the amount of funding that needs to be allocated to data quality processes may be underestimated. Grant giving bodies could play a key role in identifying this shortfall at proposal stage and ensuring that it is adequately addressed.

We recognise that there are likely to be limitations to the retrospective application of the Guidelines to existing data sets. For some studies, the details of the steps taken during data curation may not be available. It may also be appropriate to be tolerant when applying criteria post hoc, since the original study may not have had the resources to adequately address data curation at the time. Issues in such databases may be addressed over time, for example, through documentation of known problems by researchers.

DAQCORD set out to address the issues of large-scale, complex observational studies, explicitly including the design of the data capture infrastructure such as eCRFs since this is an area which is highly complex and potentially problematic. A large proportion of the Delphi collaborators are from neurosciences backgrounds. This domain has seen some of the most complex data sets from large-scale multinational observational studies, and therefore, this community has necessarily developed a substantial expertise in this area. However, we believe that the concepts are generalisable to other clinical disorders, and smaller clinical and preclinical studies, as well. In summary, we believe that the DAQCORD Guidelines will enhance the design and management of biomedical research studies, provide assurance to potential collaborators about data quality and promote collaborative research to improve healthcare on a global scale.

Acknowledgements

The authors gratefully acknowledge the support provided by One Mind, the National Institutes of Health, the International Neuroinformatics Coordinating Facility and QuesGen. LW is supported by the 7th Framework programme (EC grant 602150).

The DAQCORD collaborators: Xinmin An, University of North Carolina, Chapel Hill, NC, USA; Derek Beaton, Baycrest Center, Toronto, Canada; Kim Boase, University of Washington, Seattle, WA, USA; Yelena Bodien, Harvard University, Boston, MA, USA; Guido Bertolini, Marionegri Institute, Milan, Italy; Doxa Chatzopoulou, University of California, Los Angeles, CA, USA; Ramon Diaz-Arrastia, University of Pennsylvania, Philadelphia, PA, USA; Anthony Fabio, University of Pittsburgh, Pittsburgh, PA, USA; Gregory Farber, National Institutes of Health, Bethesda, MD, USA; Adam Ferguson, University of California, San Francisco, CA, USA; Louis French, Walter Reed Medical Center, Bethesda, MD, USA; Isabelle Gagnon, McGill University, Montreal, Quebec, Canada; Joseph Giacino, Harvard University, Boston, MA, USA; Jeffrey Grethe, University of California, San Diego, California, USA; Robert Heinson, National Institutes of Health, Bethesda, MD, USA; Sonia Jain, University of California, San Diego, CA, USA; Ferath Kherif, Lausanne University, Lausanne, Switzerland; Christopher Lindsell, Vanderbilt University Medical Center, Nashville, TN, USA; Christine MacDonald, University of Washington, Seattle, WA, USA; Joan Machamer, University of Washington, Seattle, WA, USA; Donald Marion, Defense and Veterans Brain Injury Center, Silver Spring, MD, USA; Louise Marshall, Wellcome Trust, London, UK; Matthew McAuliffe, National Institutes of Health, Bethesda, MD, USA; Paula McLaughlin, Queens University, Kingston, ON, Canada; Samuel McLean, University of North Carolina, Chapel Hill, NC, USA; Carolina Mendoza-Puccini, National Institutes of Health, Bethesda, MD, USA; David Menon, Cambridge University, Cambridge, UK; David Nelson, Karolinska Institute, Stockholm, Sweden; Tara Niendam, University of California, Davis, CA, USA; Patricia Rinvelt, National Network of Depression Centers, Ann Arbor, MI, USA; Laurie Silfes, University of Pittsburgh, Pittsburgh, PA, USA; Stephen Strother, Baycrest Center, Toronto, ON, Canada; Kelly Sunderland, Baycrest Center, Toronto, ON, Canada; Carol Taylor-Burds, National Institutes of Health, Bethesda, MD, USA; Nancy Temkin, University of Washington, Seattle, WA, USA; Nsini Umoh, Department of Defense, Fort Dietrich, MD, USA; Stephen Wisniewski, University of Pittsburgh, Pittsburgh, PA, USA and Richard Wintle, Hospital for Sick Children, Toronto, ON, Canada.

Disclosures

QuesGen Systems, Inc. provided technology and support services for the DAQCORD project without charge and were not involved in the Delphi assessment/rating process.

Supplementary material

To view supplementary material for this article, please visit https://doi.org/10.1017/cts.2020.24.

References

Ligthelm, RJ, et al. Importance of observational studies in clinical practice. Clinical Therapeutics 2007; 29 Spec No: 1284–1292.CrossRef Google Scholar PubMed

Maas, AI, et al. Collaborative European NeuroTrauma Effectiveness Research in Traumatic Brain Injury (CENTER-TBI): a prospective longitudinal observational study. Neurosurgery 2015; 76(1): 67–80.CrossRef Google Scholar PubMed

von Elm, E, et al. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. Lancet 2007; 370(9596): 1453–1457.CrossRef Google Scholar PubMed

Benchimol, EI, et al. The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) statement. PLoS Medicine 2015; 12(10): e1001885.CrossRef Google Scholar PubMed

Tosetti, P, et al. Toward an international initiative for traumatic brain injury research. Journal of Neurotrauma 2013; 30(14): 1211–1222.CrossRef Google Scholar PubMed

Simera, I.EQUATOR network collates resources for good research. BMJ 2008; 337: a2471.CrossRef Google Scholar PubMed

Weiskopf, NG, Weng, C.Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research. Journal of the American Medical Informatics Association 2013; 20(1): 144–151.CrossRef Google Scholar PubMed

Weiskopf, NG, et al. A data quality assessment guideline for electronic health record data reuse. EGEMS 2017; 5(1): 14.CrossRef Google Scholar PubMed

Kahn, MG, et al. A harmonized data quality assessment terminology and framework for the secondary use of electronic health record data. EGEMS 2016; 4(1): 1244.CrossRef Google Scholar PubMed

Kahn, MG, et al. Transparent reporting of data quality in distributed data networks. EGEMS 2015; 3(1): 1052.CrossRef Google Scholar PubMed

Ene-Iordache, B, et al. Developing regulatory-compliant electronic case report forms for clinical trials: experience with the demand trial. Journal of the American Medical Informatics Association 2009; 16(3): 404–408.CrossRef Google Scholar PubMed

Gamble, C, et al. Guidelines for the content of statistical analysis plans in clinical trials. JAMA 2017; 318(23): 2337–2343.CrossRef Google Scholar PubMed

Kruse, CS, et al. Challenges and opportunities of big data in health care: a systematic review. JMIR Medical Informatics 2016; 4(4): e38.CrossRef Google Scholar PubMed

Lee, DJ, Stvilia, B.Practices of research data curation in institutional repositories: a qualitative view from repository staff. P:LoS ONE 2017; 12(3): e0173987.Google Scholar PubMed

McInnes, MDF, et al. Preferred Reporting Items for a Systematic review and Meta-analysis of Diagnostic Test Accuracy studies: The PRISMA-DTA statement. JAMA 2018; 319(4): 388–396.CrossRef Google Scholar PubMed

Poldrack, RA, et al. Scanning the horizon: towards transparent and reproducible neuroimaging research. Nature Reviews Neuroscience 2017; 18(2): 115–126.CrossRef Google Scholar PubMed

Wang, X, et al. Big data management challenges in health research-a literature review. Briefings in Bioinformatics 2019; 20(1): 156–167.CrossRef Google Scholar PubMed

Huijben, JA, et al. Development of a quality indicator set to measure and improve quality of ICU care for patients with traumatic brain injury. Critical Care 2019; 23(1): 95.CrossRef Google Scholar PubMed

Rietjens, JAC, et al. Definition and recommendations for advance care planning: an international consensus supported by the European Association for Palliative Care. The Lancet Oncology 2017; 18(9): e543–e551.CrossRef Google Scholar PubMed

Fig. 1. Flow diagram for the DAQCORD-modified Delphi process.

Table 1. Key terms and concepts

Table 2. DACQORD indicators

Ercole et al. supplementary material

File 22.3 KB

Crossref Citations

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Wilson, Lindsay Boase, Kim Nelson, Lindsay D. Temkin, Nancy R. Giacino, Joseph T. Markowitz, Amy J. Maas, Andrew Menon, David K. Teasdale, Graham and Manley, Geoffrey T. 2021. A Manual for the Glasgow Outcome Scale-Extended Interview. Journal of Neurotrauma, Vol. 38, Issue. 17, p. 2435.

Gillespie, Brenda W. Laurin, Louis-Philippe Zinsser, Dawn Lafayette, Richard Marasa, Maddalena Wenderfer, Scott E. Vento, Suzanne Poulton, Caroline Barisoni, Laura Zee, Jarcy Helmuth, Margaret Lugani, Francesca Kamel, Margret Hill-Callahan, Peg Hewitt, Stephen M. Mariani, Laura H. Smoyer, William E. Greenbaum, Larry A. Gipson, Debbie S. Robinson, Bruce M. Gharavi, Ali G. Guay-Woodford, Lisa M. and Trachtman, Howard 2021. Improving data quality in observational research studies: Report of the Cure Glomerulonephropathy (CureGN) network. Contemporary Clinical Trials Communications, Vol. 22, Issue. , p. 100749.

Boase, Kim Machamer, Joan Temkin, Nancy R. Dikmen, Sureyya Wilson, Lindsay Nelson, Lindsay D. Barber, Jason Bodien, Yelena G. Giacino, Joseph T. Markowitz, Amy J. McCrea, Michael A. Satris, Gabriela Stein, Murray B. Taylor, Sabrina R. Manley, Geoffrey T. Adeoye, Opeolu Bullock, M. Ross Corrigan, John D. Diaz-Arrastia, Ramon Ellenbogen, Richard Feeser, V. Ramana Ferguson, Adam R. Gardner, Raquel Goldman, Dana Gopinath, Shankar Hemphill, J Claude Keene, C. Dirk Korley, Frederick K. Kramer, Joel Kreitzer, Natalie Levin, Harvey Lindsell, Chris Madden, Christopher Martin, Alastair McAllister, Thomas Merchant, Randall Mukherjee, Pratik Ngwenya, Laura B. Noel, Florence Nolan, Amber Okonkwo, David Palacios, Eva Perl, Daniel Puccio, Ava Rabinowitz, Miri Robertson, Claudia Rosand, Jonathan Sander, Angelle Schnyer, David Seabury, Seth Sherer, Mark Toga, Arthur Valadka, Alex Vassar, Mary MS, RN Vespa, Paul Wang, Kevin Yue, John K. Yuh, Esther and Zafonte, Ross 2021. Central Curation of Glasgow Outcome Scale-Extended Data: Lessons Learned from TRACK-TBI. Journal of Neurotrauma, Vol. 38, Issue. 17, p. 2419.

Lindner, Lisa Weiß, Anja Reich, Andreas Kindler, Siegfried Behrens, Frank Braun, Jürgen Listing, Joachim Schett, Georg Sieper, Joachim Strangfeld, Anja and Regierer, Anne C. 2021. Implementing an automated monitoring process in a digital, longitudinal observational cohort study. Arthritis Research & Therapy, Vol. 23, Issue. 1,

Foreman, Brandon Lissak, India A Kamireddi, Neha Moberg, Dick and Rosenthal, Eric S 2021. Challenges and Opportunities in Multimodal Monitoring and Data Analytics in Traumatic Brain Injury. Current Neurology and Neuroscience Reports, Vol. 21, Issue. 3,

Lavinio, Andrea Ercole, Ari Battaglini, Denise Magnoni, Sandra Badenes, Rafael Taccone, Fabio Silvio Helbok, Raimund Thomas, William Pelosi, Paolo Robba, Chiara Innerhofer, Nicole Miori, Sara Librizzi, Alberto Bertuetti, Rita Faria, Nicolas Figueiredo Peluso, Lorenzo Montrucchio, Giorgia Sales, Gabriele Brazzi, Luca Alampi, Daniela Manca, Maria Beatrice Sepe, Lilia Natalini, Giuseppe Bellino, Antonio Bocci, Maria Grazia Mattana, Chiara Corradi, Francesco Forfori, Francesco Cundari, Francesco Bonvecchio, Emilio Busani, Zara Bianchin, Andrea Federico, Carla Santoro, Anna Bilotta, Federico Rajani, Giorgio Lopez, Berta Moleon Aspide, Raffaele Raffaele, Merola Cabrini, Luca Motta, Alessandro Frattini, Lara Godon, Alexandre Bouzat, Pierre Grappa, Elena Bonvecchio, Alberto Innerhofer, Nicole Fries, Dietmar Hernandez, Christian Preuss Thomé, Claudius Klein, Sebastian Joannidis, Michael Pelosi, Paolo Ball, Lorenzo Patroniti, Nicolo’ Brunetti, Iole Bassetti, Matteo Giacobbe, Daniele Roberto Vena, Antonio Valbusa, Alberto Porto, Italo and Bona, Roberta Della 2021. Safety profile of enhanced thromboprophylaxis strategies for critically ill COVID-19 patients during the first wave of the pandemic: observational report from 28 European intensive care units. Critical Care, Vol. 25, Issue. 1,

Vaccarino, Anthony L. Beaton, Derek Black, Sandra E. Blier, Pierre Farzan, Farnak Finger, Elizabeth Foster, Jane A. Freedman, Morris Frey, Benicio N. Gilbert Evans, Susan Ho, Keith Javadi, Mojib Kennedy, Sidney H. Lam, Raymond W. Lang, Anthony E. Lasalandra, Bianca Latour, Sara Masellis, Mario Milev, Roumen V. Müller, Daniel J. Munoz, Douglas P. Parikh, Sagar V. Placenza, Franca Rotzinger, Susan Soares, Claudio N. Sparks, Alana Strother, Stephen C. Swartz, Richard H. Tan, Brian Tartaglia, Maria Carmela Taylor, Valerie H. Theriault, Elizabeth Turecki, Gustavo Uher, Rudolf Zinman, Lorne and Evans, Kenneth R. 2022. Common Data Elements to Facilitate Sharing and Re-use of Participant-Level Data: Assessment of Psychiatric Comorbidity Across Brain Disorders. Frontiers in Psychiatry, Vol. 13, Issue. ,

Erwin Johnson, C Colquhoun, Daniel Ruppar, Daniel A and Vetter, Sascha 2022. De-identified data quality assessment approaches by data vendors who license data to healthcare and life sciences researchers. JAMIA Open, Vol. 5, Issue. 4,

Zhang, Joe Symons, Joshua Agapow, Paul Teo, James T. Paxton, Claire A. Abdi, Jordan Mattie, Heather Davie, Charlie Torres, Aracelis Z. Folarin, Amos Sood, Harpreet Celi, Leo A. Halamka, John Eapen, Sara Budhdeo, Sanjay and McGinnis, Ryan S. 2022. Best practices in the real-world data life cycle. PLOS Digital Health, Vol. 1, Issue. 1, p. e0000003.

Maas, Andrew I. R. Ercole, Ari De Keyser, Veronique Menon, David K. and Steyerberg, Ewout W. 2022. Opportunities and Challenges in High-Quality Contemporary Data Collection in Traumatic Brain Injury: The CENTER-TBI Experience. Neurocritical Care, Vol. 37, Issue. S2, p. 192.

Greco, Massimiliano De Corte, Thomas Ercole, Ari Antonelli, Massimo Azoulay, Elie Citerio, Giuseppe Morris, Andy Conway De Pascale, Gennaro Duska, Frantisek Elbers, Paul Einav, Sharon Forni, Lui Galarza, Laura Girbes, Armand R. J. Grasselli, Giacomo Gusarov, Vitaly Jubb, Alasdair Kesecioglu, Jozef Lavinio, Andrea Delgado, Maria Cruz Martin Mellinghoff, Johannes Myatra, Sheila Nainan Ostermann, Marlies Pellegrini, Mariangela Povoa, Pedro Schaller, Stefan J. Teboul, Jean-Louis Wong, Adrian De Waele, Jan J. Cecconi, Maurizio Bezzi, Marco Gira, Alicia Eller, Philipp Hamid, Tarikul Haque, Injamam Ull De Buyser, Wim Cudia, Antonella De Backer, Daniel Foulon, Pierre Collin, Vincent De Waele, Jan Van Hecke, Jolien De Waele, Elisabeth Van Malderen, Claire Mesland, Jean-Baptiste Biston, Patrick Piagnerelli, Michael Haentjens, Lionel De Schryver, Nicolas Van Leemput, Jan Vanhove, Philippe Bulpa, Pierre Ilieva, Viktoria Katz, David Binnie, Alexandra Geagea, Anna Tirapegui, Fernando Lago, Gustavo Graf, Jerónimo Perez-Araos, Rodrigo Vargas, Patricio Martinez, Felipe Labarca, Eduardo Franco, Daniel Molano Parra-Tanoux, Daniela Reyes, Luis Felipe Yepes, David Periš, Filip Stipić, Sanda Stojanović Burgos, Cynthia Vanessa Campozano Boada, Paulo Roberto Navas Brun, Jose Luis Barberan Ballesteros, Juan Pablo Paredes Abdelnasser, Gamal Hammouda, Ahmed Elmandouh, Omar Azzam, Ahmed Hussein, Aliae Mohamed Galal, Islam Awad, Ahmed K. Azab, Mohammed A. Abdalla, Maged Assal, Hebatallah Alfishawy, Mostafa Ghozy, Sherief Tharwat, Samar Eldaly, Abdullah Ellervee, Anneli Reinhard, Veronika Chrisment, Anne Poyat, Chrystelle Badie, Julio Ferrari, Fernando Berdaguer Weiss, Björn Schellenberg, Clara Grunow, Julius J. Lorenz, Marco Schaller, Stefan J. Spieth, Peter Bota, Marc Fichtner, Falk Fuest, Kristina Lahmer, Tobias Herrmann, Johannes Meybohm, Patrick Markou, Nikolaos Vasileiadou, Georgia Chrysanthopoulou, Evangelia Papamichalis, Panagiotis Soultati, Ioanna Jog, Sameer Kalvit, Kushal Myatra, Sheila Nainan Krupa, Ivan Tharwat, Aisa Nichol, Alistair McCarthy, Aine Mahmoodpoor, Ata Tonetti, Tommaso Isoni, Paolo Spadaro, Savino Volta, Carlo Alberto Mirabella, Lucia Noto, Alberto Florio, Gaetano Guzzardella, Amedeo Paleari, Chiara Baccanelli, Federica Savi, Marzia Antonelli, Massimo De Pascale, Gennaro Luca, San Vaccarini, Barbara Montrucchio, Giorgia Sales, Gabriele Donadello, Katia Gottin, Leonardo Nizzero, Marta Polati, Enrico De Rosa, Silvia Sulemanji, Demet Abusalama, Abdurraouf Elhadi, Muhammed De FelipeJesus, Montelongo Gonzalez, Daniel Rodriguez Robles, Victor Hugo Madrigal Canedo, Nancy Chavez, Alejandro Esquivel Dendane, Tarek Grady, Bart de Jong, Ben van der Heiden, Eveline Thoral, Patrick van den Bogaard, Bas Spronk, Peter E. Achterberg, Sefanja Groeneveld, Melanie So, Ralph K. L. de Wijs, Calvin Scholten, Harm Beishuizen, Albertus Cornet, Alexander D. Reidinga, Auke C. Kranen, Hetty Mensink, Roos Gasthuis, Spaarne den Boer, Sylvia de Groot, Marcel Beck, Oliver Bethlehem, Carina van Bussel, Bas Frenzel, Tim de Jong, Celestine Wilting, Rob Kesecioglu, Jozef Mehagnoul-Schipper, Jannet Alasia, Datonye Kumar, Ashok Qayyum, Ahad Rana, Muhammad Jayyab, Mustafa Abu Sierra, Rosario Quispe Hernandez, Aaron Mark de Almeida, José Taborda, Lúcia Anselmo, Mónica Ramires, Tiago Silva, Catarina Roriz, Carolina Morais, Rui Póvoa, Pedro Patricio, Patricia Pinto, André Santos, Maria Lurdes Costa, Vasco Cunha, Pedro Gonçalves, Celina Nunes, Sandra Camões, João Adrião, Diana Oliveira, Ana Omrani, Ali Al Maslamani, Muna elbuzidi, Abdurrahmaan Suei Al qudah, Bara Mahmoud Akkari, Abdel Rauof Alkhatteb, Mohamed Baiou, Anas Husain, Ahmed Alwraidat, Mohamed Saif, Ibrahim Abdulsalam Bakdach, Dana Ahmed, Amna Aleef, Mohamed Bintaher, Awadh Petrisor, Cristina Popov, Evgeniy Popova, Ksenia Dementienko, Mariia Teplykh, Boris Pyregov, Alexey Davydova, Liubov Vladislav, Belskii Neporada, Elena Zverev, Ivan Meshchaninova, Svetlana Sokolov, Dmitry Gavrilova, Elena Shlyk, Irina Poliakov, Igor Vlasova, Marina Aljuhani, Ohoud Alkhalaf, Amina Humaid, Felwa Bin Arabi, Yaseen Kuhail, Ahmed Elrabi, Omar Ghannam, Madihah E. Fong, Ng Teng Kansal, Amit Ho, Vui Kian Ng, Jensen García, Raquel Rodrígez Fraga, Xiana Taboada del Pilar García-Bonillo, Mª Padilla-Serrano, Antonio Cuadrado, Marta Martin Ferrando, Carlos Catalan-Monzon, Ignacio Galarza, Laura Frutos-Vivar, Fernando Jimenez, Jorge Rodríguez-Solis, Carmen Franquesa-Gonzalez, Enric Acosta, Guillermo Pérez Cabrera, Luciano Santana Parra, Juan Pablo Aviles Gonzalez, Francisco Muñoyerro del Carmen Lorente Conesa, Maria Varela, Ignacio Yago Martinez Pravia, Orville Victoriano Baez Delgado, Maria Cruz Martin de Cabo, Carlos Munoz Ioan, Ana-Maria Perez-Calvo, Cesar Santos, Arnoldo Abad-Motos, Ane Ripolles-Melchor, Javier Martin, Belén Civantos Teruel, Santiago Yus Lucas, Juan Higuera Ortiz, Aaron Blandino de Pablo Sánchez, Raúl Barrueco-Francioni, Jesús Emilio Espina, Lorena Forcelledo Bonell-Goytisolo, José M. Salaverria, Iñigo Mir, Antonia Socias Rodriguez-Ruiz, Emilio Valverde, Virginia Hidalgo Cubero, Patricia Jimeno Linde, Francisca Arbol Leganes, Nieves Cruza Romeu, Juan Maria Concha, Pablo Berezo-Garcia, José Angel Fraile, Virginia Cuenca-Rubio, Cristina Pérez-Torres, David Serrano, Ainhoa Valero, Clara Martínez Suner, Andrea Ortiz Larrañaga, Leire Legaristi, Noemi Ferrigno, Gerardo Khlafalla, Safa Bihariesingh-Sanchit, Rosita Sjukhus, Hallands Zoerner, Frank Grip, Jonathan Kilsand, Kristina Mårtensson, Johan Österlind, Jonas Sjukhuset, Akademiska von Seth, Magnus Sjukhus, Västerviks Berkius, Johan Ceruti, Samuele Glotta, Andrea Izdes, Seval Turan, Işıl Özkoçak Cosar, Ahmet Halacli, Burcin Dereli, Necla Yilmaz, Mehmet Akbas, Türkay Elay, Gülseren Eyüpoğlu, Selin Bílír, Yelíz Saraçoğlu, Kemal Tolga Kaya, Ebru Sahin, Ayca Sultan Ekren, Pervin Korkmaz Mengi, Tuğçe Suner, Kezban Ozmen Tomak, Yakup Eroglu, Ahmet Alsabbah, Asad Hanlon, Katie Gervin, Kevin McMahon, Sean Hagan, Samantha Higenbottam, Caroline V. Mullhi, Randeep Poulton, Lottie Torlinski, Tomasz Gareth, Allen Truman, Nick Vijayakumar, Gopal Hall, Chris Jubb, Alasdair Cagova, Lenka Jones, Nicola Graham, Sam Robin, Nicole Cowton, Amanda Donnelly, Adrian Singatullina, Natalia Kent, Melanie Boulanger, Carole Campbell, Zoë Potter, Elizabeth Duric, Natalie Szakmany, Tamas Brompton, Royal Kviatkovske, Orinta Marczin, Nandor Ellis, Caroline Saha, Rajnish Sri-Chandana, Chunda Allan, John Mumelj, Lana Venkatesh, Harish Gotz, Vera Nina Cochrane, Anthony Ficial, Barbara Kamble, Shruthi Lumlertgul, Nuttha Oddy, Christopher Jain, Susan Crapelli, Giulia Beatrice Vlachou, Aikaterini Golden, David Garrioch, Sweyn Henning, Jeremy Loveleena, Gupta Davey, Miriam Grauslyte, Lina Salciute-Simene, Erika Cook, Martin Barling, Danny Broadhurst, Phil Purvis, Sarah Spivey, Michael Shuker, Benjamin Grecu, Irina Harding, Daniel Singatullina, Natalia Dean, James T. Nielsen, Nathan D. Al-Bayati, Sama Al-Sadawi, Mohammed Charron, Mariane Stubenrauch, Peter Santanilla, Jairo Wentowski, Catherine Rosenberger, Dorothea Eksarko, Polikseni and Jawa, Randeep 2022. Clinical and organizational factors associated with mortality during the peak of first COVID-19 wave: the global UNITE-COVID study. Intensive Care Medicine, Vol. 48, Issue. 6, p. 690.

Kwok, Chun Shing Muntean, Elena-Andra Mallen, Christian D. and Borovac, Josip Andelo 2022. Data Collection Theory in Healthcare Research: The Minimum Dataset in Quantitative Studies. Clinics and Practice, Vol. 12, Issue. 6, p. 832.

Weissman, Alexandra Cheng, Alex Mainor, Alex Gimbel, Elizabeth Nowak, Kayla Pan, Huaqin (Helen) Stratford, Jeran Merkel, Alyssa Taylor, Caroline Meier, Heather Auman, Jeanette Nolen, Tracy L. Lindsell, Christopher J. and Huang, David T. 2022. Development and implementation of the National Heart, Lung, and Blood Institute COVID-19 common data elements. Journal of Clinical and Translational Science, Vol. 6, Issue. 1,

Maas, Andrew I R Menon, David K Manley, Geoffrey T Abrams, Mathew Åkerlund, Cecilia Andelic, Nada Aries, Marcel Bashford, Tom Bell, Michael J Bodien, Yelena G Brett, Benjamin L Büki, András Chesnut, Randall M Citerio, Giuseppe Clark, David Clasby, Betony Cooper, D Jamie Czeiter, Endre Czosnyka, Marek Dams-O'Connor, Kristen De Keyser, Véronique Diaz-Arrastia, Ramon Ercole, Ari van Essen, Thomas A Falvey, Éanna Ferguson, Adam R Figaji, Anthony Fitzgerald, Melinda Foreman, Brandon Gantner, Dashiell Gao, Guoyi Giacino, Joseph Gravesteijn, Benjamin Guiza, Fabian Gupta, Deepak Gurnell, Mark Haagsma, Juanita A Hammond, Flora M Hawryluk, Gregory Hutchinson, Peter van der Jagt, Mathieu Jain, Sonia Jain, Swati Jiang, Ji-yao Kent, Hope Kolias, Angelos Kompanje, Erwin J O Lecky, Fiona Lingsma, Hester F Maegele, Marc Majdan, Marek Markowitz, Amy McCrea, Michael Meyfroidt, Geert Mikolić, Ana Mondello, Stefania Mukherjee, Pratik Nelson, David Nelson, Lindsay D Newcombe, Virginia Okonkwo, David Orešič, Matej Peul, Wilco Pisică, Dana Polinder, Suzanne Ponsford, Jennie Puybasset, Louis Raj, Rahul Robba, Chiara Røe, Cecilie Rosand, Jonathan Schueler, Peter Sharp, David J Smielewski, Peter Stein, Murray B von Steinbüchel, Nicole Stewart, William Steyerberg, Ewout W Stocchetti, Nino Temkin, Nancy Tenovuo, Olli Theadom, Alice Thomas, Ilias Espin, Abel Torres Turgeon, Alexis F Unterberg, Andreas Van Praag, Dominique van Veen, Ernest Verheyden, Jan Vyvere, Thijs Vande Wang, Kevin K W Wiegers, Eveline J A Williams, W Huw Wilson, Lindsay Wisniewski, Stephen R Younsi, Alexander Yue, John K Yuh, Esther L Zeiler, Frederick A Zeldovich, Marina and Zemek, Roger 2022. Traumatic brain injury: progress and challenges in prevention, clinical care, and research. The Lancet Neurology, Vol. 21, Issue. 11, p. 1004.

Abrams, Mathew Birdsall Bjaalie, Jan G. Das, Samir Egan, Gary F. Ghosh, Satrajit S. Goscinski, Wojtek J. Grethe, Jeffrey S. Kotaleski, Jeanette Hellgren Ho, Eric Tatt Wei Kennedy, David N. Lanyon, Linda J. Leergaard, Trygve B. Mayberg, Helen S. Milanesi, Luciano Mouček, Roman Poline, J. B. Roy, Prasun K. Strother, Stephen C. Tang, Tong Boon Tiesinga, Paul Wachtler, Thomas Wójcik, Daniel K. and Martone, Maryann E. 2022. A Standards Organization for Open and FAIR Neuroscience: the International Neuroinformatics Coordinating Facility. Neuroinformatics, Vol. 20, Issue. 1, p. 25.

Wilmes, Nick Hendriks, Charlotte W E Viets, Caspar T A Cornelissen, Simon J W M van Mook, Walther N K A Cox-Brinkman, Josanne Celi, Leo A Martinez-Martin, Nicole Gichoya, Judy W Watkins, Craig Bakhshi-Raiez, Ferishta Wynants, Laure van der Horst, Iwan C C and van Bussel, Bas C T 2023. Structural under-reporting of informed consent, data handling and sharing, ethical approval, and application of Open Science principles as proxies for study quality conduct in COVID-19 research: a systematic scoping review. BMJ Global Health, Vol. 8, Issue. 5, p. e012007.

Sedlakova, Jana Daniore, Paola Horn Wintsch, Andrea Wolf, Markus Stanikic, Mina Haag, Christina Sieber, Chloé Schneider, Gerold Staub, Kaspar Alois Ettlin, Dominik Grübner, Oliver Rinaldi, Fabio von Wyl, Viktor and Sarmiento, Raymond Francis 2023. Challenges and best practices for digital unstructured data enrichment in health research: A systematic narrative review. PLOS Digital Health, Vol. 2, Issue. 10, p. e0000347.

Greco, Massimiliano Caruso, Pier Francesco Angelotti, Giovanni Aceto, Romina Coppalini, Giacomo Martinetti, Nicolò Albini, Marco Bash, Lori D. Carvello, Michele Piccioni, Federico Monzani, Roberta Montorsi, Marco and Cecconi, Maurizio 2023. REVersal of nEuromusculAr bLocking Agents in Patients Undergoing General Anaesthesia (REVEAL Study). Journal of Clinical Medicine, Vol. 12, Issue. 2, p. 563.

Yaseen, Ashraf Robertson, Claudia Cruz Navarro, Jovany Chen, Jingxiao Heckler, Brian DeSantis, Stacia M. Temkin, Nancy Barber, Jason Foreman, Brandon Diaz-Arrastia, Ramon Chesnut, Randall Manley, Geoffrey T. Wright, David W. Vassar, Mary Ferguson, Adam R. Markowitz, Amy J. and Yamal, Jose-Miguel 2023. Integrating, Harmonizing, and Curating Studies With High-Frequency and Hourly Physiological Data: Proof of Concept from Seven Traumatic Brain Injury Data Sets. Journal of Neurotrauma, Vol. 40, Issue. 21-22, p. 2362.

McPhee, Patrick G. Vaccarino, Anthony L. Naska, Sibel Nylen, Kirk Santisteban, Jose Arturo Chepesiuk, Rachel Andrade, Andrea Georgiades, Stelios Behan, Brendan Iaboni, Alana Wan, Flora Aimola, Sabrina Cheema, Heena and Gorter, Jan Willem 2024. Harmonizing data on correlates of sleep in children within and across neurodevelopmental disorders: lessons learned from an Ontario Brain Institute cross-program collaboration. Frontiers in Neuroinformatics, Vol. 18, Issue. ,

Download full list

Article contents

Guidelines for Data Acquisition, Quality and Curation for Observational Research Designs (DAQCORD)

Abstract

Keywords

Introduction

Methods

Development of the DAQCORD Indicators

Results

Discussion

Acknowledgements

Disclosures

Supplementary material

References

Ercole et al. supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests