1 - Introduction

Published online by Cambridge University Press:  06 January 2010

Mark Borodovsky (Georgia Institute of Technology)
Svetlana Ekisheva (Georgia Institute of Technology)

Summary

The organization of this book parallels that of Biological Sequence Analysis (BSA) by Durbin et al. (1998). The first chapter of BSA introduces the fundamental notions of biological sequence analysis: sequence similarity, homology, sequence alignment, and the basic concepts of probabilistic modeling.

Finding these distinct concepts described back to back may seem surprising at first glance. However, let us recall several important bioinformatics questions. How can we construct a pairwise sequence alignment? How can we build an alignment of multiple sequences? How can we create a phylogenetic tree for several biological sequences? How can we predict an RNA secondary structure? None of these questions can be consistently addressed without the use of probabilistic methods. The mathematical complexity of these methods ranges from basic theorems and formulas to sophisticated architectures of hidden Markov models and stochastic grammars that can capture fine compositional characteristics of empirical biological sequences.

The explosive growth of biological sequence data created an excellent opportunity for the meaningful application of discrete probabilistic models. Perhaps, without much exaggeration, the implications of this development can be compared with those of the revolutionary use of calculus and differential equations for solving problems of classical mechanics in the eighteenth century.

The problems considered in this introductory chapter concern fundamental concepts that play an important role in biological sequence analysis: maximum likelihood and maximum a posteriori (Bayesian) estimation of model parameters. These concepts are crucial for understanding statistical inference from experimental data and cannot be introduced without the notions of conditional, joint, and marginal probabilities.
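
To make the difference between the two estimators concrete, the following minimal sketch estimates nucleotide frequencies from a short DNA sequence. It is an illustration and is not taken from the chapter: the function names, the toy sequence, and the choice of a symmetric Dirichlet prior with parameter `alpha` are all assumptions. Under that prior, the maximum a posteriori estimate is the mode of the posterior distribution, (n_i + alpha - 1) / (N + K(alpha - 1)), where n_i is the count of nucleotide i, N is the sequence length, and K = 4.

```python
from collections import Counter

ALPHABET = "ACGT"  # assumed alphabet: the four DNA nucleotides


def ml_estimate(seq):
    """Maximum likelihood estimate: the observed relative frequencies."""
    counts = Counter(seq)
    total = sum(counts[b] for b in ALPHABET)
    if total == 0:
        raise ValueError("sequence contains no nucleotides from ALPHABET")
    return {b: counts[b] / total for b in ALPHABET}


def map_estimate(seq, alpha=2.0):
    """MAP estimate under a symmetric Dirichlet(alpha) prior (an assumption).

    The posterior is Dirichlet(n_i + alpha); its mode is
    (n_i + alpha - 1) / (N + K * (alpha - 1)), valid here for alpha >= 1.
    """
    if alpha < 1.0:
        raise ValueError("this sketch only handles alpha >= 1")
    counts = Counter(seq)
    total = sum(counts[b] for b in ALPHABET)
    denom = total + len(ALPHABET) * (alpha - 1.0)
    if denom == 0:
        raise ValueError("no data and a flat prior: the mode is not unique")
    return {b: (counts[b] + alpha - 1.0) / denom for b in ALPHABET}


if __name__ == "__main__":
    seq = "AAAC"  # toy data, chosen only for illustration
    print(ml_estimate(seq))        # G and T get probability 0
    print(map_estimate(seq, 2.0))  # the prior pulls G and T away from 0
```

With the toy sequence AAAC, the maximum likelihood estimate assigns zero probability to G and T simply because they were not observed, whereas the MAP estimate with `alpha = 2` gives each of them 1/8. With `alpha = 1` (a flat prior) the two estimates coincide.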

Type: Chapter
Publisher: Cambridge University Press
Print publication year: 2006

  • Introduction
  • Mark Borodovsky, Georgia Institute of Technology, Svetlana Ekisheva, Georgia Institute of Technology
  • Book: Problems and Solutions in Biological Sequence Analysis
  • Online publication: 06 January 2010
  • Chapter DOI: https://doi.org/10.1017/CBO9780511617829.002