Hostname: page-component-68945f75b7-l9cl4 Total loading time: 0 Render date: 2024-08-05T18:43:47.622Z Has data issue: false hasContentIssue false

323 Applying MeSH Tree Structures and Condition-to-MeSH Mapping to Catalog and Characterize Clinical Trials Research Focus Areas

Published online by Cambridge University Press:  03 April 2024

Winfred Wu
Affiliation:
University of Miami
Trevor Yuen
Affiliation:
University of Miami
Sakshi Mittal
Affiliation:
University of Miami
Rosalina Das
Affiliation:
University of Miami
Sheela Dominguez
Affiliation:
University of Miami
Daru Ransford
Affiliation:
University of Miami
Micky Simwenyi
Affiliation:
University of Miami
Rights & Permissions [Opens in a new window]

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

OBJECTIVES/GOALS: Characterizing and analyzing research studies presents several challenges given the various ways studies may be labeled or organized. The Medical Subject Headings (MeSH) thesaurus is a hierarchical vocabulary that can index and organize research foci using common business intelligence tools to enable rapid exploration of research portfolios. METHODS/STUDY POPULATION: Metadata from ClinicalTrials.gov on 455,437 trials were downloaded and all MeSH terms associated with trials in the condition_browse section were loaded into a database. The corresponding MeSH trees for each term were then identified and mapped to their ancestor terms within the tree. Trials were then indexed based on top four hierarchical levels for each associated MeSH term. Trials performed at the University of Miami (UM) were identified based on locations associated with the trial as well as matching National Clinical Trial (NCT) numbers identified from internal research administration systems. Business intelligence software (Microsoft PowerBI) was applied to the corresponding dataset to enable end user exploration and analysis of the trials within ClinicalTrials.gov. RESULTS/ANTICIPATED RESULTS: A total of 3,271 studies associated with UM were identified, of which, 3,054 (93.3%) had at least one condition MeSH term linked. A total of 7,711 MeSH terms were associated with the trials overall, representing 1,112 unique MeSH terms; the most common terms were carcinoma (164), lymphoma (155), HIV Infections (139), neoplasms (136), and leukemia (122). Utilizing MeSH hierarchy, trials were characterized were categorized into 36 different trees. The most common top tree nodes were neoplasms (1,181), followed by pathological conditions/signs and symptoms (913), immune system diseases (574), nervous system diseases (513), and digestive system diseases (483). Within trees, a total of 184, 681, and 1057 different MeSH terms were specified at the second, third, and fourth nodes in the hierarchy respectively. DISCUSSION/SIGNIFICANCE: Utilizing existing metadata from trials posted on ClinicalTrials.gov and MeSH tree structures can enable organizations to readily explore the foci of clinical trials research. High rates of MeSH term association to research study conditions are necessary to ensure adequate representation of research foci.

Type
Informatics and Data Science
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use or in order to create a derivative work.
Copyright
© The Author(s), 2024. The Association for Clinical and Translational Science