Skip to main content Accessibility help
×
Hostname: page-component-76fb5796d-vvkck Total loading time: 0 Render date: 2024-04-26T14:41:01.953Z Has data issue: false hasContentIssue false

21 - Functional Enrichment Testing: A Survey of Statistical Methods

Published online by Cambridge University Press:  05 June 2013

Laila M. Poisson
Affiliation:
Department of Public Health Sciences
Debashis Ghosh
Affiliation:
Penn State University
Kim-Anh Do
Affiliation:
University of Texas, MD Anderson Cancer Center
Zhaohui Steve Qin
Affiliation:
Emory University, Atlanta
Marina Vannucci
Affiliation:
Rice University, Houston
Get access

Summary

Introduction and Motivation

With the proliferation of available high-throughput data being generated by platforms such as microarray, mass spectrometry, and sequencing machines, scientists are now commonly confronted with measurements of thousands of molecules. The term molecule here can refer to anything from an mRNA strand to an SNP (single nucleotide polymorphism) to a peptide to an exon. When analysis is done across these molecules on a genome-wide basis, it becomes difficult for analysts to interpret the results. In many instances, investigators are provided with a list of ranked molecules that are then prioritized for further study. Although this represents a fairly reasonable approach, it has the potential to be unwieldy depending on the number of molecules under consideration. Additionally, meta-analysis of high-throughput gene expression studies have found that sets of molecules such as metabolic pathways and signaling cascades are dysregulated more consistently across studies than are the specific molecules detected within these sets (e.g., Fan et al., 2006).

Predicated on the assumption that individual molecules act in concert for various biological processes, multiple databases have been constructed to classify molecules into sets of biological interest, and many are publicly available. Databases such as as KEGG (Kyoto Encyclopedia of Genes and Genomes)(Kanehisa and Goto, 2000; Kanehisa et al., 2006, 2008), GO (Gene Ontology) (Consortium, 2000, 2004), and Biocarta give functional annotation for genes (GO) and relationships to enzymes and metabolites (KEGG and Biocarta).

Type
Chapter
Information
Advances in Statistical Bioinformatics
Models and Integrative Inference for High-Throughput Data
, pp. 423 - 444
Publisher: Cambridge University Press
Print publication year: 2013

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×