Whole-Genome Multi-SNP-Phenotype Association Analysis

doi:10.1017/CBO9781139226448.012

11 - Whole-Genome Multi-SNP-Phenotype Association Analysis

Published online by Cambridge University Press: 05 June 2013

Yongtao Guan and Kai Wang

Edited by Kim-Anh Do, Zhaohui Steve Qin and Marina Vannucci

Yongtao Guan: Baylor College of Medicine
Kai Wang: University of California
Kim-Anh Do: University of Texas, MD Anderson Cancer Center
Zhaohui Steve Qin: Emory University, Atlanta
Marina Vannucci: Rice University, Houston

Summary

Introduction

Typical current genome-wide association studies (GWAS) (e.g., Wellcome Trust Case Control Consortium, 2007) measure hundreds of thousands, or millions, of genetic variants (typically single-nucleotide polymorphisms, or SNPs) in hundreds, thousands, or tens of thousands of individuals, with the primary goal of identifying which regions of the genome harbor SNPs that affect some phenotype or outcome of interest. Although many GWAS are case-control studies, here we focus primarily on the computationally simpler setting where a continuous phenotype has been measured on population-based samples, before briefly considering the challenges of extending these methods to binary outcomes.

Most existing GWAS analyses are “single-SNP” analyses, which simply test each SNP, one at a time, for association with the phenotype. Strong associations between a SNP and the phenotype are interpreted as indicating that the SNP, or a nearby correlated SNP, likely affects the phenotype. The primary rationale for GWAS is the idea that by examining these SNPs in more detail – for example, examining which genes they are located within or near – we may glean important insights into the biology of the phenotype under study.
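
To make the "one SNP at a time" procedure concrete, the following is a minimal sketch of a single-SNP scan for a continuous phenotype, written in plain Python. It is an illustration under stated assumptions, not the implementation used by PLINK or BIMBAM: genotypes are assumed to be coded as 0/1/2 allele counts, each SNP is tested by simple linear regression with no covariates, and the function names `single_snp_scan` and `simulate`, along with all simulation parameters, are hypothetical.

```python
import math
import random


def single_snp_scan(genotypes, phenotype):
    """Regress the phenotype on each SNP separately.

    genotypes: list of SNPs, each a list of 0/1/2 allele counts (one per individual).
    phenotype: list of continuous phenotype values (one per individual).
    Returns a list of (slope, t_statistic) pairs, one per SNP.
    """
    n = len(phenotype)
    y_mean = sum(phenotype) / n
    syy = sum((y - y_mean) ** 2 for y in phenotype)
    results = []
    for snp in genotypes:
        x_mean = sum(snp) / n
        sxx = sum((x - x_mean) ** 2 for x in snp)
        if sxx == 0:
            # Monomorphic SNP: no genotype variation, so no test is possible.
            results.append((float("nan"), float("nan")))
            continue
        sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(snp, phenotype))
        slope = sxy / sxx                      # least-squares effect estimate
        rss = max(syy - slope * sxy, 0.0)      # residual sum of squares
        se = math.sqrt(rss / (n - 2) / sxx)    # standard error of the slope
        t_stat = slope / se if se > 0 else math.copysign(math.inf, slope)
        results.append((slope, t_stat))
    return results


def simulate(n=500, m=20, causal=3, effect=0.8, maf=0.3, seed=0):
    """Simulate m independent SNPs, one of which (index `causal`) affects the phenotype."""
    rng = random.Random(seed)
    genotypes = [
        [(rng.random() < maf) + (rng.random() < maf) for _ in range(n)]
        for _ in range(m)
    ]
    phenotype = [effect * genotypes[causal][i] + rng.gauss(0.0, 1.0) for i in range(n)]
    return genotypes, phenotype


genotypes, phenotype = simulate()
scan = single_snp_scan(genotypes, phenotype)
top = max(range(len(scan)), key=lambda j: abs(scan[j][1]))
print("SNP with the largest |t|:", top)
```

Each SNP receives its own slope and t-statistic, computed without reference to any other SNP. A real analysis would convert the t-statistics to p-values (or Bayes factors) and account for the number of tests performed.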

Single-SNP Analysis has Difficulties in Assessing Overall Association Signals

Single-SNP analysis appears to be clean, clear, and easy to perform with standard software packages such as PLINK (Purcell et al., 2007) and BIMBAM (Guan and Stephens, 2008). However, single-SNP analysis has limitations in answering questions that try to gauge the collective strength of association signals in the data.

Type: Chapter
Information: Advances in Statistical Bioinformatics: Models and Integrative Inference for High-Throughput Data, pp. 224-243

DOI: https://doi.org/10.1017/CBO9781139226448.012

Publisher: Cambridge University Press

Print publication year: 2013
