20 similar articles found (search time: 937 ms)
1.
2.
Jittima Piriyapongsa Mark T Rutledge Sanil Patel Mark Borodovsky I King Jordan 《Biology direct》2007,2(1):31-24
Background
Transposable element (TE) sequences, once thought to be merely selfish or parasitic members of the genomic community, have been shown to contribute a wide variety of functional sequences to their host genomes. Analysis of complete genome sequences has turned up numerous cases where TE sequences have been incorporated as exons into mRNAs, and it is widely assumed that such 'exonized' TEs encode protein sequences. However, the extent to which TE-derived sequences actually encode proteins is unknown and a matter of some controversy. We have tried to address this outstanding issue from two perspectives: (i) by evaluating ascertainment biases related to the search methods used to uncover TE-derived protein coding sequences (CDS) and (ii) through a probabilistic codon-frequency based analysis of the protein coding potential of TE-derived exons.
3.
4.
5.
Background
Alternative splicing is an important mechanism for generating functional and evolutionary diversity of proteins in eukaryotes. Here, we studied the frequency and functionality of recently gained, rodent-specific alternative exons.
6.
Background
The computation of accurate alignments of cDNA sequences against a genome is at the foundation of modern genome annotation pipelines. Several factors, such as the presence of paralogs, small exons, non-consensus splice signals, sequencing errors and polymorphic sites, pose recognized difficulties to existing spliced alignment algorithms.
7.
8.
Gregory Nuel Leslie Regad Juliette Martin Anne-Claude Camproux 《Algorithms for molecular biology : AMB》2010,5(1):15
Background
In bioinformatics it is common to search for a pattern of interest in a potentially large set of rather short sequences (upstream gene regions, proteins, exons, etc.). Although many methodological approaches allow practitioners to compute the distribution of a pattern count in a random sequence generated by a Markov source, no specific developments have taken into account the counting of occurrences in a set of independent sequences. We aim to address this problem by deriving efficient approaches and algorithms to perform these computations both for low and high complexity patterns in the framework of homogeneous or heterogeneous Markov models.
9.
Daniel N Frank 《BMC bioinformatics》2008,9(1):420
Background
Advances in automated DNA sequencing technology have accelerated the generation of metagenomic DNA sequences, especially environmental ribosomal RNA gene (rDNA) sequences. As the scale of rDNA-based studies of microbial ecology has expanded, a need has arisen for software that is capable of managing, annotating, and analyzing the plethora of diverse data accumulated in these projects.
10.
11.
Background
Transposable elements (TEs) are abundant genomic sequences that have been found to contribute to genome evolution in unexpected ways. Here, we characterize the evolutionary and functional characteristics of TE-derived human genome regulatory sequences uncovered by the high throughput mapping of DNaseI-hypersensitive (HS) sites.
12.
Zongyun Chen Fan Luo Jing Feng Weishan Yang Danyun Zeng Ruiming Zhao Zhijian Cao Maili Liu Wenxin Li Ling Jiang Yingliang Wu 《PloS one》2013,8(4)
Background
Recently, a new subfamily of long-chain toxins with a Kunitz-type fold was found in scorpion venom glands. Functionally, these toxins inhibit protease activity and block potassium channels. However, the genomic organization and three-dimensional (3-D) structure of this kind of scorpion toxin have not been reported.
Principal Findings
Here, we characterized the genomic organization and 3-D nuclear magnetic resonance structure of the scorpion Kunitz-type toxin, LmKTT-1a, which has a unique cysteine pattern. The LmKTT-1a gene contained three exons, which were interrupted by two introns located in the mature peptide region. Despite little similarity to other Kunitz-type toxins and a unique pattern of disulfide bridges, LmKTT-1a possessed a conserved Kunitz-type structural fold with one α-helix and two β-sheets. Comparison of the genomic organization, 3-D structure, and functional data of known toxins from the α-KTx, β-KTx, γ-KTx, and κ-KTx subfamilies suggested that scorpion Kunitz-type potassium channel toxins might have evolved from a new ancestor that is completely different from the common ancestor of scorpion toxins with a CSα/β fold. Thus, these analyses provide evidence of a new scorpion potassium channel toxin subfamily, which we have named δ-KTx.
Conclusions/Significance
Our results highlight the genomic, structural, and evolutionary diversity of scorpion potassium channel toxins. These findings may accelerate the design and development of diagnostic and therapeutic peptide agents for human potassium channelopathies.
13.
Background
Alternative splicing of mutually exclusive exons is an important mechanism for increasing protein diversity in eukaryotes. The insect Mhc (myosin heavy chain) gene produces all the different muscle myosins as a result of alternative splicing, in contrast to most other organisms of the Metazoa lineage, which have a family of muscle genes with each gene coding for a protein specialized for a functional niche.
14.
Frank M You Naxin Huo Yong Q Gu Gerard R Lazo Jan Dvorak Olin D Anderson 《BMC bioinformatics》2009,10(1):331
Background
In some genomic applications it is necessary to design large numbers of PCR primers in exons flanking one or several introns on the basis of orthologous gene sequences in related species. The primer pairs designed by this target gene approach are called "intron-flanking primers" or, because they are located in exonic sequences, which are usually conserved between related species, "conserved primers". They are useful for large-scale single nucleotide polymorphism (SNP) discovery and marker development, especially in species, such as wheat, for which a large number of ESTs are available but for which genome sequences and intron/exon boundaries are not available. To date, no suitable high-throughput tool is available for this purpose.
15.
Background
Clostridium perfringens is a Gram-positive anaerobic bacterium causing severe diseases such as gas gangrene and pseudomembranous colitis, which are generally due to the secretion of powerful extracellular toxins. The expression of toxin genes is mainly regulated by VirR, the response regulator of a two-component system. To date, only a few targets are known for this regulator, mainly in one strain (Strain 13). Due to the high genomic and phenotypic variability in toxin production by different strains, the development of effective strategies to counteract C. perfringens infections requires methodologies to reconstruct the VirR regulon from genome sequences.
16.
Makoto K Shimada Yosuke Hayakawa Jun-ichi Takeda Takashi Gojobori Tadashi Imanishi 《BMC evolutionary biology》2010,10(1):122
Background
Alternative splicing (AS) is a key molecular process that endows biological functions with diversity and complexity. Generally, functional redundancy leads to the generation of new functions through relaxation of selective pressure in evolution, as exemplified by duplicated genes. It is also known that alternatively spliced exons (ASEs) are subject to relaxed selective pressure. Within consensus sequences at the splice junctions, the most conserved sites are dinucleotides at both ends of introns (splice dinucleotides). However, a small number of single nucleotide polymorphisms (SNPs) occur at splice dinucleotides. An intriguing question relating to the evolution of AS diversity is whether mutations at splice dinucleotides are maintained as polymorphisms and produce diversity in splice patterns within the human population. We therefore surveyed validated SNPs in the database dbSNP located at splice dinucleotides of all human genes that are defined by the H-Invitational Database.
17.
Bainbridge MN Wang M Wu Y Newsham I Muzny DM Jefferies JL Albert TJ Burgess DL Gibbs RA 《Genome biology》2011,12(7):R68
Background
Enrichment of loci by DNA hybridization-capture, followed by high-throughput sequencing, is an important tool in modern genetics. Currently, the most common targets for enrichment are the protein coding exons represented by the consensus coding DNA sequence (CCDS). The CCDS, however, excludes many actual or computationally predicted coding exons present in other databases, such as RefSeq and Vega, and non-coding functional elements such as untranslated and regulatory regions. The number of variants per base pair (variant density) and our ability to interrogate regions outside of the CCDS regions are consequently less well understood.
18.
Background
The discovery of functional non-coding RNA sequences has led to an increasing interest in algorithms related to RNA analysis. Traditional sequence alignment algorithms, however, fail at computing reliable alignments of low-homology RNA sequences. The spatial conformation of RNA sequences largely determines their function, and therefore RNA alignment algorithms have to take structural information into account.
19.
Origin and distribution of epipolythiodioxopiperazine (ETP) gene clusters in filamentous ascomycetes
Nicola J Patron Ross F Waller Anton J Cozijnsen David C Straney Donald M Gardiner William C Nierman Barbara J Howlett 《BMC evolutionary biology》2007,7(1):174