No CrossRef data available.
Statistical tools for discovering pseudo-periodicities inbiological sequences
Published online by Cambridge University Press: 15 August 2002
Many protein sequences present non trivial periodicities, such as cysteine signatures and leucine heptads. These known periodicities probably represent a small percentage of the total number of sequences periodic structures, and it is useful to have general tools to detect such sequences and their period in large databases of sequences. We compare three statistics adapted from those used in time series analysis: a generalisation of the simple autocovariance based on a similarity score and two statistics intending to increase the power of the method. Theoretical behaviour of these statistics are derived, and the corresponding tests are then described. In this paper we also present an application of these tests to a protein known to have sequence periodicity.
- Research Article
- ESAIM: Probability and Statistics , Volume 5 , 2001 , pp. 171 - 181
- © EDP Sciences, SMAI, 2001