Overview of computational gene prediction

William H. Majoros

doi:10.1017/CBO9780511811135.005

3 - Overview of computational gene prediction

Published online by Cambridge University Press: 05 June 2012


Affiliation: Duke University, North Carolina


Summary

In this chapter we develop a conceptual framework that describes the gene prediction problem from a computational perspective. Our goal is to present the overall problem at a high level, yet concretely, so that the necessities and compromises of the computational methods introduced in the chapters ahead can be seen in light of the practical realities of the problem. Comparing the material in this chapter with the description of the underlying biology given in Chapter 1 should highlight the gulf that remains to be crossed between the goals of genome annotation and the current state of the art in computational gene prediction.

Genes, exons, and coding segments

The common substrate for gene finding is the DNA sequence produced by the genome sequencing and assembly processes. As described in Chapter 1, the raw trace files produced by the sequencing machines are passed to a base-caller, a program that infers the most likely nucleotide at each position in a fragment from the fluorescent dye levels measured by the sequencing machine. The nucleotide sequence fragments produced by the base-caller are then fed to an assembler, a program that combines fragments into longer DNA sequences called contigs. Contigs are generally stored in FASTA files. Figure 3.1 shows an example FASTA file.
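To make the FASTA format concrete, the following is a minimal Python sketch of reading contigs from FASTA-formatted text. It relies only on the basic convention that each record starts with a header line beginning with ">" and is followed by one or more lines of sequence. The function name `read_fasta`, the contig identifiers, and the sequences are hypothetical and chosen for illustration; they are not the contents of Figure 3.1.

```python
from typing import Dict, Iterable, List, Optional


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA-formatted lines into a dict of identifier -> sequence.

    Assumption: the identifier is the first whitespace-delimited token
    after '>' on the header line; the rest of the header is ignored.
    """
    contigs: Dict[str, str] = {}
    name: Optional[str] = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                contigs[name] = "".join(chunks)
            tokens = line[1:].split()
            name = tokens[0] if tokens else ""
            chunks = []
        elif name is not None:
            # Sequence lines are concatenated; case is normalized to upper.
            chunks.append(line.upper())
    if name is not None:
        contigs[name] = "".join(chunks)
    return contigs


# Invented two-contig example (not taken from the chapter).
example = """>contig1 hypothetical example
ATGGCGTAA
ccgtta
>contig2
GGGCCC
""".splitlines()

contigs = read_fasta(example)
for contig_name, seq in contigs.items():
    print(contig_name, len(seq))
```

The sketch holds every sequence in memory, which is reasonable for small inputs; for genome-scale files a streaming reader or an established library would likely be a better choice.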

Type: Chapter
Information: Methods for Computational Gene Prediction, pp. 83-103

DOI: https://doi.org/10.1017/CBO9780511811135.005

Publisher: Cambridge University Press

Print publication year: 2007
