'Genomemark': detecting word periodicity in biological sequences |
| |
Authors: | Fadiel A Eichenbaum K D Hamza A |
| |
Affiliation: | Yale University School of Medicine, Yale Center for Research On Reproductive Biology, New Haven, CT 06511, USA. afadiel@yale.edu |
| |
Abstract: | Identifying and predicting the structural characteristics of novel repeats throughout the genome can lend insight into biological function. Specific repeats are believed to have biological significance as a function of their distribution patterns. We have developed 'GenomeMark,' a computer program that detects and statistically analyzes candidate repeats. Specifically, 'GenomeMark' identifies the periodic distribution of unique words, calculating their chi2 and Z-score values. Using 'GenomeMark,' we identified novel sequence words present in tandem throughout genomes. We found that these sequences have remarkable spacer sequence distributions and many were genome specific, validating the genome signature theory. Further analysis confirmed that many of these sequences have a specific biological function. The program is available from the authors upon request and is freely available for non-commercial and academic entities. |
| |
Keywords: | |
本文献已被 PubMed 等数据库收录! |
|