共查询到20条相似文献,搜索用时 15 毫秒
1.
Andrew S Warren Jeremy Archuleta Wu-chun Feng João Carlos Setubal 《BMC bioinformatics》2010,11(1):131
Background
Protein-coding gene detection in prokaryotic genomes is considered a much simpler problem than in intron-containing eukaryotic genomes. However there have been reports that prokaryotic gene finder programs have problems with small genes (either over-predicting or under-predicting). Therefore the question arises as to whether current genome annotations have systematically missing, small genes. 相似文献2.
Li L Wang X Xia M Stolc V Su N Peng Z;Waraporn Tongprasit Li S Wang J Wang X Deng XW 《Genome biology》2005,6(6):R52
Background
Sequencing and annotation of the genome of rice (Oryza sativa) have generated gene models in numbers that top all other fully sequenced species, with many lacking recognizable sequence homology to known genes. Experimental evaluation of these gene models and identification of new models will facilitate rice genome annotation and the application of this knowledge to other more complex cereal genomes. 相似文献3.
Background
The decrease in cost for sequencing and improvement in technologies has made it easier and more common for the re-sequencing of large genomes as well as parallel sequencing of small genomes. It is possible to completely sequence a small genome within days and this increases the number of publicly available genomes. Among the types of genomes being rapidly sequenced are those of microbial and viral genomes responsible for infectious diseases. However, accurate gene prediction is a challenge that persists for decoding a newly sequenced genome. Therefore, accurate and efficient gene prediction programs are highly desired for rapid and cost effective surveillance of RNA viruses through full genome sequencing. 相似文献4.
Natalie Garrett Juan Viruel Nathaniel Klimpert Marybel Soto Gomez Vivienne K. Y. Lam Vincent S. F. T. Merckx Sean W. Graham 《American journal of botany》2023,110(4):e16141
Premise
Species in Thismiaceae can no longer photosynthesize and instead obtain carbon from soil fungi. Here we infer Thismiaceae phylogeny using plastid genome data and characterize the molecular evolution of this genome.Methods
We assembled five Thismiaceae plastid genomes from genome skimming data, adding to previously published data for phylogenomic inference. We investigated plastid-genome structural changes, considering locally colinear blocks (LCBs). We also characterized possible shifts in selection pressure in retained genes by considering changes in the ratio of nonsynonymous to synonymous changes (ω).Results
Thismiaceae experienced two major pulses of gene loss around the early diversification of the family, with subsequent scattered gene losses across descendent lineages. In addition to massive size reduction, Thismiaceae plastid genomes experienced occasional inversions, and there were likely two independent losses of the plastid inverted repeat (IR) region. Retained plastid genes remain under generally strong purifying selection (ω << 1), with significant and sporadic weakening or strengthening in several instances. The bifunctional trnE-UUC gene of Thismia huangii may retain a secondary role in heme biosynthesis, despite a probable loss of functionality in protein translation. Several cis-spliced group IIA introns have been retained, despite the loss of the plastid intron maturase, matK.Conclusions
We infer that most gene losses in Thismiaceae occurred early and rapidly, following the initial loss of photosynthesis in its stem lineage. As a species-rich, fully mycoheterotrophic lineage, Thismiaceae provide a model system for uncovering the unique and divergent ways in which plastid genomes evolve in heterotrophic plants. 相似文献5.
Cloning and characterization of microRNAs from wheat (<Emphasis Type="Italic">Triticum aestivum</Emphasis> L.) 总被引:8,自引:0,他引:8
Background
MicroRNAs (miRNAs) are a class of small, non-coding regulatory RNAs that regulate gene expression by guiding target mRNA cleavage or translational inhibition. So far, identification of miRNAs has been limited to a few model plant species, such as Arabidopsis, rice and Populus, whose genomes have been sequenced. Wheat is one of the most important cereal crops worldwide. To date, only a few conserved miRNAs have been predicted in wheat and the computational identification of wheat miRNAs requires the genome sequence, which is unknown. 相似文献6.
Loksum Wong Cameron J Weadick Claire Kuo Belinda SW Chang Vincent Tropepe 《BMC developmental biology》2010,10(1):100
Background
The Dmbx1 gene is important for the development of the midbrain and hindbrain, and mouse gene targeting experiments reveal that this gene is required for mediating postnatal and adult feeding behaviours. A single Dmbx1 gene exists in terrestrial vertebrate genomes, while teleost genomes have at least two paralogs. We compared the loss of function of the zebrafish dmbx1a and dmbx1b genes in order to gain insight into the molecular mechanism by which dmbx1 regulates neurogenesis, and to begin to understand why these duplicate genes have been retained in the zebrafish genome. 相似文献7.
Tomas A Larsson Frida Olsson Gorel Sundstrom Lars-Gustav Lundin Sydney Brenner Byrappa Venkatesh Dan Larhammar 《BMC evolutionary biology》2008,8(1):184
Background
One of the many gene families that expanded in early vertebrate evolution is the neuropeptide (NPY) receptor family of G-protein coupled receptors. Earlier work by our lab suggested that several of the NPY receptor genes found in extant vertebrates resulted from two genome duplications before the origin of jawed vertebrates (gnathostomes) and one additional genome duplication in the actinopterygian lineage, based on their location on chromosomes sharing several gene families. In this study we have investigated, in five vertebrate genomes, 45 gene families with members close to the NPY receptor genes in the compact genomes of the teleost fishes Tetraodon nigroviridis and Takifugu rubripes. These correspond to Homo sapiens chromosomes 4, 5, 8 and 10. 相似文献8.
Background
The correct taxonomic assignment of bacterial genomes is a primary and challenging task. With the availability of whole genome sequences, the gene content based approaches appear promising in inferring the bacterial taxonomy. The complete genome sequencing of a bacterial genome often reveals a substantial number of unique genes present only in that genome which can be used for its taxonomic classification.Results
In this study, we have proposed a comprehensive method which uses the taxon-specific genes for the correct taxonomic assignment of existing and new bacterial genomes. The taxon-specific genes identified at each taxonomic rank have been successfully used for the taxonomic classification of 2,342 genomes present in the NCBI genomes, 36 newly sequenced genomes, and 17 genomes for which the complete taxonomy is not yet known. This approach has been implemented for the development of a tool ‘Microtaxi’ which can be used for the taxonomic assignment of complete bacterial genomes.Conclusion
The taxon-specific gene based approach provides an alternate valuable methodology to carry out the taxonomic classification of newly sequenced or existing bacterial genomes.Electronic supplementary material
The online version of this article (doi:10.1186/s12864-015-1542-0) contains supplementary material, which is available to authorized users. 相似文献9.
Nicolas Corradi Karen L Haag Jean-François Pombert Dieter Ebert Patrick J Keeling 《Genome biology》2009,10(10):R106-12
Background
The highly compacted 2.9-Mb genome of Encephalitozoon cuniculi placed the microsporidia in the spotlight, encoding a mere 2,000 proteins and a highly reduced suite of biochemical pathways. This extreme level of reduction is not universal across the microsporidia, with genomes known to vary up to sixfold in size, suggesting that some genomes may harbor a gene content that is not as reduced as that of Enc. cuniculi. In this study, we present an in-depth survey of the large genome of Octosporea bayeri, a pathogen of Daphnia magna, with an estimated genome size of 24 Mb, in order to shed light on the organization and content of a large microsporidian genome. 相似文献10.
Haas BJ Volfovsky N Town CD Troukhan M Alexandrov N Feldmann KA Flavell RB White O Salzberg SL 《Genome biology》2002,3(6):research0029.1-research002912
11.
12.
Background
Brucellaspecies are Gram-negative, facultative intracellular bacteria that cause brucellosis in humans and animals. Sequences of fourBrucellagenomes have been published, and variousBrucellagene and genome data and analysis resources exist. A web gateway to integrate these resources will greatly facilitateBrucellaresearch.Brucellagenome data in current databases is largely derived from computational analysis without experimental validation typically found in peer-reviewed publications. It is partially due to the lack of a literature mining and curation system able to efficiently incorporate the large amount of literature data into genome annotation. It is further hypothesized that literature-basedBrucellagene annotation would increase understanding of complicatedBrucellapathogenesis mechanisms. 相似文献13.
Background
In eukaryotic genomes, most genes are members of gene families. When comparing genes from two species, therefore, most genes in one species will be homologous to multiple genes in the second. This often makes it difficult to distinguish orthologs (separated through speciation) from paralogs (separated by other types of gene duplication). Combining phylogenetic relationships and genomic position in both genomes helps to distinguish between these scenarios. This kind of comparison can also help to describe how gene families have evolved within a single genome that has undergone polyploidy or other large-scale duplications, as in the case of Arabidopsis thaliana – and probably most plant genomes. 相似文献14.
Tomonori Hirao Atsushi Watanabe Manabu Kurita Teiji Kondo Katsuhiko Takata 《BMC plant biology》2008,8(1):70
Background
The recent determination of complete chloroplast (cp) genomic sequences of various plant species has enabled numerous comparative analyses as well as advances in plant and genome evolutionary studies. In angiosperms, the complete cp genome sequences of about 70 species have been determined, whereas those of only three gymnosperm species, Cycas taitungensis, Pinus thunbergii, and Pinus koraiensis have been established. The lack of information regarding the gene content and genomic structure of gymnosperm cp genomes may severely hamper further progress of plant and cp genome evolutionary studies. To address this need, we report here the complete nucleotide sequence of the cp genome of Cryptomeria japonica, the first in the Cupressaceae sensu lato of gymnosperms, and provide a comparative analysis of their gene content and genomic structure that illustrates the unique genomic features of gymnosperms. 相似文献15.
Kalliopi Georgiades Vicky Merhej Khalid El Karkouri Didier Raoult Pierre Pontarotti 《Biology direct》2011,6(1):6
Background
Genome degradation is an ongoing process in all members of the Rickettsiales order, which makes these bacterial species an excellent model for studying reductive evolution through interspecies variation in genome size and gene content. In this study, we evaluated the degree to which gene loss shaped the content of some Rickettsiales genomes. We shed light on the role played by horizontal gene transfers in the genome evolution of Rickettsiales. 相似文献16.
Background
Genomic islands can be observed in many microbial genomes. These stretches of DNA have a conspicuous composition with regard to sequence or encoded functions. Genomic islands are assumed to be frequently acquired via horizontal gene transfer. For the analysis of genome structure and the study of horizontal gene transfer, it is necessary to reliably identify and characterize these islands. 相似文献17.
Background
To meet the needs of gene annotation for newly sequenced organisms, optimized spaced seeds can be implemented into cross-species sequence alignment programs to accurately align gene sequences to the genome of a related species. So far, seed performance has been tested for comparisons between closely related species, such as human and mouse, or on simulated data. As the number and variety of genomes increases, it becomes desirable to identify a small set of universal seeds that perform optimally or near-optimally on a large range of comparisons. 相似文献18.
Haining Lin Shu Ouyang Amy Egan Kan Nobuta Brian J Haas Wei Zhu Xun Gu Joana C Silva Blake C Meyers C Robin Buell 《BMC plant biology》2008,8(1):18
Background
High gene numbers in plant genomes reflect polyploidy and major gene duplication events. Oryza sativa, cultivated rice, is a diploid monocotyledonous species with a ~390 Mb genome that has undergone segmental duplication of a substantial portion of its genome. This, coupled with other genetic events such as tandem duplications, has resulted in a substantial number of its genes, and resulting proteins, occurring in paralogous families. 相似文献19.