Multiple Sequence Alignment

Jin Xiong

doi:10.1017/CBO9780511806087.006

5 - Multiple Sequence Alignment

Published online by Cambridge University Press: 05 June 2012

Jin Xiong

Show author details

Jin Xiong: Affiliation:
Texas A & M University

Book contents

Get access

Summary

A natural extension of pairwise alignment is multiple sequence alignment, which is to align multiple related sequences to achieve optimal matching of the sequences. Related sequences are identified through the database similarity searching described in Chapter 4. As the process generates multiple matching sequence pairs, it is often necessary to convert the numerous pairwise alignments into a single alignment, which arranges sequences in such a way that evolutionarily equivalent positions across all sequences are matched.

There is a unique advantage of multiple sequence alignment because it reveals more biological information than many pairwise alignments can. For example, it allows the identification of conserved sequence patterns and motifs in the whole sequence family, which are not obvious to detect by comparing only two sequences. Many conserved and functionally critical amino acid residues can be identified in a protein multiple alignment. Multiple sequence alignment is also an essential prerequisite to carrying out phylogenetic analysis of sequence families and prediction of protein secondary and tertiary structures. Multiple sequence alignment also has applications in designing degenerate polymerase chain reaction (PCR) primers based on multiple related sequences.

It is theoretically possible to use dynamic programming to align any number of sequences as for pairwise alignment. However, the amount of computing time and memory it requires increases exponentially as the number of sequences increases. As a consequence, full dynamic programming cannot be applied for datasets of more than ten sequences. In practice, heuristic approaches are most often used.

Type: Chapter
Information: Essential Bioinformatics , pp. 63 - 74

DOI: https://doi.org/10.1017/CBO9780511806087.006 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2006

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Apostolico, A., and Giancarlo, R. 1998. Sequence alignment in molecular biology. J. Comput. Biol. 5:173–96CrossRef Google Scholar PubMed

Gaskell, G. J. 2000. Multiple sequence alignment tools on the Web. Biotechniques 29:60–2Google Scholar PubMed

Gotoh, O. 1999. Multiple sequence alignment: Algorithms and applications. Adv. Biophys. 36:159–206CrossRef Google Scholar PubMed

Lecompte, O., Thompson, J. D., Plewniak, F., Thierry, J., and Poch, O. 2001. Multiple alignment of complete sequences (MACS) in the post-genomic era. Gene 270:17–30CrossRef Google Scholar PubMed

Morgenstern, B. 1999. DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment. Bioinformatics 15:211–8CrossRef Google Scholar PubMed

Morgenstern, B., Dress, A., and Werner, T. 1996. Multiple DNA and protein sequence alignment based on segment-to-segment comparison. Proc. Natl. Acad. Sci. U S A 93:12098–103CrossRef Google Scholar PubMed

Mullan, L. J. 2002. Multiple sequence alignment – The gateway to further analysis. Brief. Bioinform. 3:303–5CrossRef Google Scholar PubMed

Nicholas, H. B. Jr., Ropelewski, A. J., and Deerfield, D. W. II. 2002. Strategies for multiple sequence alignment. Biotechniques 32:572–91Google Scholar PubMed

Notredame, C. 2002. Recent progress in multiple sequence alignment: A survey. Pharmacogenomics 3:131–44CrossRef Google Scholar PubMed

Notredame, C., Higgins, D. G., and Heringa, J. 2000. T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 302:205–17CrossRef Google Scholar PubMed

Thompson, J. D., Higgins, D. G., and Gibson, T. J. 1994. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673–80CrossRef Google Scholar PubMed

Thompson, J. D., Plewniak, F., and Poch, O. 1999. A comprehensive comparison of multiple sequence alignment programs. Nucleic Acids Res. 27:2682–90CrossRef Google Scholar PubMed

Book contents

5 - Multiple Sequence Alignment

Summary

Access options

References

Save book to Kindle

Save book to Dropbox

Save book to Google Drive