Pairwise alignment using HMMs

Mark Borodovsky; Svetlana Ekisheva

doi:10.1017/CBO9780511617829.005

4 - Pairwise alignment using HMMs

Published online by Cambridge University Press: 06 January 2010

Mark Borodovsky and

Svetlana Ekisheva

Show author details

Mark Borodovsky: Affiliation:
Georgia Institute of Technology
Svetlana Ekisheva: Affiliation:
Georgia Institute of Technology

Book contents

Get access

Summary

In the BSA Chapter 3 we learned that a DP algorithm for pairwise sequence alignment allows a probabilistic interpretation. Indeed, the equivalent equations appear in the logarithmic form of the Viterbi algorithm for the hidden Markov model of a gapped sequence alignment. The hidden states of such a model, called a pair HMM, correspond to the alignment match, the x-gap, and the y-gap positions. The pair HMM state diagram is topologically similar to the diagram of the finite state machine (Durbin et al. (1998), Fig. 4.1), although the pair HMM parameters have clear probabilistic meanings. The optimal finite state machine alignment found by standard DP is equivalent to the most probable path through the pair HMM determined by the Viterbi algorithm. Both global and local optimal DP alignment algorithms have Viterbi counterparts for suitably defined HMMs. Interestingly, the HMM has an advantage over the finite state machine because the HMM can compute the full probability that sequences X and Y could be generated by a given pair HMM; thus, a probabilistic measure can be introduced to help establish evolutionary relationships. This full probabilistic model also defines (i) the posterior distribution over all possible alignments given sequences X and Y and (ii) the posterior probability that a particular symbol x of sequence X is aligned to a given symbol y of sequence Y. However, real biological sequences cannot be considered to be exact realizations of probabilistic models. This explains the difficulties met by the HMM based alignment methods for the similarity search (Durbin et al. (1998), Sect. 4.5), while more simplistic finite state machine methods perform sufficiently well.

Type: Chapter
Information: Problems and Solutions in Biological Sequence Analysis , pp. 104 - 125

DOI: https://doi.org/10.1017/CBO9780511617829.005 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2006

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

4 - Pairwise alignment using HMMs

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive