Found 20 similar articles; search took 31 ms
1.
Belen Lorente-Galdos Jonathan Bleyhl Gabriel Santpere Laura Vives Oscar Ramírez Jessica Hernandez Roger Anglada Gregory M Cooper Arcadi Navarro Evan E Eichler Tomas Marques-Bonet 《Genome biology》2013,14(1):R9
Background
The identification of signatures of natural selection has long been used as an approach to understanding the unique features of any given species. Genes within segmental duplications are overlooked in most studies of selection due to the limitations of draft nonhuman genome assemblies and to the methodological reliance on accurate gene trees, which are difficult to obtain for duplicated genes.
Results
In this work, we detected exons with an accumulation of high-quality nucleotide differences between the human assembly and shotgun sequencing reads from single human and macaque individuals. Comparing the observed rates of nucleotide differences between coding exons and their flanking intronic sequences with a likelihood-ratio test, we identified 74 exons with evidence for rapid coding sequence evolution during the evolution of humans and Old World monkeys. Fifty-five percent of rapidly evolving exons were either partially or totally duplicated, which is a significant enrichment of the 6% rate observed across all human coding exons.
Conclusions
Our results provide a more comprehensive view of the action of selection upon segmental duplications, which are the most complex regions of our genomes. In light of these findings, we suggest that segmental duplications could be subjected to rapid evolution more frequently than previously thought.
2.
Ancestral genome duplication in rice. Total citations: 22, self-citations: 0, citations by others: 22
The recent availability of the pseudochromosome sequences of rice allows for the first time the investigation of the extent of intra-genomic duplications on a large scale in this agronomically important species. Using a dot-matrix plotter as a tool to display pairwise comparisons of ordered predicted coding sequences along rice pseudochromosomes, we found that the rice genome contains extensive chromosomal duplications accounting for 53% of the available sequences. The size of duplicated blocks is considerably larger than previously reported. In the rice genome, a duplicated block size of >1 Mb appears to be the rule and not the exception. Comparative mapping has shown high genetic colinearity among chromosomes of cereals, promoting rice as a model for studying grass genomes. Further comparative genome analysis should allow the study of the conservation and evolution of these duplication events in other important cereals such as rye, barley, and wheat.
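The dot-matrix comparison mentioned in this abstract can be illustrated with a minimal sketch. It assumes genes are supplied in chromosomal order and that a hypothetical `homologs` set lists the gene pairs judged similar. The function names, gene identifiers, and the `min_len` threshold are all illustrative assumptions, not taken from the paper. Runs of dots along a diagonal would suggest a candidate duplicated block.

```python
def dot_matrix(chrom_a, chrom_b, homologs):
    """Return (i, j) index pairs where gene i on chrom_a matches gene j on chrom_b."""
    dots = []
    for i, gene_a in enumerate(chrom_a):
        for j, gene_b in enumerate(chrom_b):
            # Treat the (assumed) homology relation as symmetric.
            if (gene_a, gene_b) in homologs or (gene_b, gene_a) in homologs:
                dots.append((i, j))
    return dots


def diagonal_runs(dots, min_len=3):
    """Collect runs of dots along forward diagonals with at least min_len dots."""
    dot_set = set(dots)
    runs = []
    for i, j in sorted(dot_set):
        if (i - 1, j - 1) in dot_set:
            continue  # this dot continues an earlier run; it is not a run start
        run = []
        while (i, j) in dot_set:
            run.append((i, j))
            i, j = i + 1, j + 1
        if len(run) >= min_len:
            runs.append(run)
    return runs


# Hypothetical toy data: three collinear gene pairs plus one isolated match.
chrom_a = ["a1", "a2", "a3", "a4"]
chrom_b = ["b0", "b1", "b2", "b3"]
homologs = {("a1", "b1"), ("a2", "b2"), ("a3", "b3"), ("a4", "b0")}

blocks = diagonal_runs(dot_matrix(chrom_a, chrom_b, homologs))
print(blocks)
```

This sketch only follows forward diagonals, so inverted blocks or blocks interrupted by gene loss would need a more tolerant chaining step.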
3.
4.
Cheung J Wilson MD Zhang J Khaja R MacDonald JR Heng HH Koop BF Scherer SW 《Genome biology》2003,4(8):R47
Background
The high quality of the mouse genome draft sequence and its associated annotations are an invaluable biological resource. Identifying recent duplications in the mouse genome, especially in regions containing genes, may highlight important events in recent murine evolution. In addition, detecting recent sequence duplications can reveal potentially problematic regions of the genome assembly. We use BLAST-based computational heuristics to identify large (≥ 5 kb) and recent (≥ 90% sequence identity) segmental duplications in the mouse genome sequence. Here we present a database of recently duplicated regions of the mouse genome found in the mouse genome sequencing consortium (MGSC) February 2002 and February 2003 assemblies.
Results
We determined that 33.6 Mb of 2,695 Mb (1.2%) of sequence from the February 2003 mouse genome sequence assembly is involved in recent segmental duplications, which is less than that observed in the human genome (around 3.5-5%). From this dataset, 8.9 Mb (26%) of the duplication content consisted of 'unmapped' chromosome sequence. Moreover, we suspect that an additional 18.5 Mb of sequence is involved in duplication artifacts arising from sequence misassignment errors in this genome assembly. By searching for genes that are located within these regions, we identified 675 genes that mapped to duplicated regions of the mouse genome. Sixteen of these genes appear to have been duplicated independently in the human genome. From our dataset we further characterized a 42 kb recent segmental duplication of Mater, a maternal-effect gene essential for embryogenesis in mice.
Conclusion
Our results provide an initial analysis of the recently duplicated sequence and gene content of the mouse genome. Many of these duplicated loci, as well as regions identified to be involved in potential sequence misassignment errors, will require further mapping and sequencing to achieve accuracy. A Genome Browser database was set up to display the identified duplication content presented in this work. This data will also be relevant to the growing number of investigators who use the draft genome sequence for experimental design and analysis.
5.
Background
Intrinsically disordered regions are enriched in short interaction motifs that play a critical role in many protein-protein interactions. Since new short interaction motifs may easily evolve, they have the potential to rapidly change protein interactions and cellular signaling. In this work we examined the dynamics of gain and loss of intrinsically disordered regions in duplicated proteins to inspect if changes after genome duplication can create functional divergence. For this purpose we used Saccharomyces cerevisiae and the outgroup species Lachancea kluyveri.
Principal Findings
We find that genes duplicated as part of a genome duplication (ohnologs) are significantly more intrinsically disordered than singletons (p<2.2e-16, Wilcoxon), reflecting a preference for retaining intrinsically disordered proteins in duplicate. In addition, there have been marked changes in the extent of intrinsic disorder following duplication. A large number of duplicated genes have more intrinsic disorder than their L. kluyveri ortholog (29% for duplicates versus 25% for singletons) and an even greater number have less intrinsic disorder than the L. kluyveri ortholog (37% for duplicates versus 25% for singletons). Finally, we show that the number of physical interactions is significantly greater in the more intrinsically disordered ohnolog of a pair (p = 0.003, Wilcoxon).
Conclusion
This work shows that intrinsic disorder gain and loss in a protein is a mechanism by which a genome can also diverge and innovate. The higher number of interactors for proteins that have gained intrinsic disorder compared with their duplicates may reflect the acquisition of new interaction partners or new functional roles.
6.
Javier Sanzol 《BMC plant biology》2010,10(1):87
Background
Gene duplication is central to genome evolution. In plants, genes can be duplicated through small-scale events and large-scale duplications often involving polyploidy. The apple belongs to the subtribe Pyrinae (Rosaceae), a diverse lineage that originated via allopolyploidization. Both small-scale duplications and polyploidy may have been important mechanisms shaping the genome of this species.
7.
Background
Polyploid species contribute to Oryza diversity. However, the mechanisms underlying gene and genome evolution in Oryza polyploids remain largely unknown. The allotetraploid Oryza minuta, which is estimated to have formed less than one million years ago, along with its putative diploid progenitors (O. punctata and O. officinalis), are quite suitable for the study of polyploid genome evolution using a comparative genomics approach.
Results
Here, we performed a comparative study of a large genomic region surrounding the Shattering4 locus in O. minuta, as well as in O. punctata and O. officinalis. Duplicated genomes in O. minuta have maintained the diploid genome organization, except for several structural variations mediated by transposon movement. Tandem duplicated gene clusters are prevalent in the Sh4 region, and segmental duplication followed by random deletion is illustrated to explain the gene gain-and-loss process. Both copies of most duplicated genes still persist in O. minuta. Molecular evolution analysis suggested that these duplicated genes are equally evolved and mostly manipulated by purifying selection. However, cDNA-SSCP analysis revealed that the expression patterns were dramatically altered between duplicated genes: nine of 29 duplicated genes exhibited expression divergence in O. minuta. We further detected one gene silencing event that was attributed to gene structural variation, but most gene silencing could not be related to sequence changes. We identified one case in which DNA methylation differences within promoter regions that were associated with the insertion of one hAT element were probably responsible for gene silencing, suggesting a potential epigenetic gene silencing pathway triggered by TE movement.
Conclusions
Our study revealed both genetic and epigenetic mechanisms involved in duplicated gene silencing in the allotetraploid O. minuta.
8.
Francesco Santini Luke J Harmon Giorgio Carnevale Michael E Alfaro 《BMC evolutionary biology》2009,9(1):194-15
Background
One of the main explanations for the stunning diversity of teleost fishes (~29,000 species, nearly half of all vertebrates) is that a fish-specific whole-genome duplication event (FSGD) in the ancestor to teleosts triggered their subsequent radiation. However, one critical assumption of this hypothesis, that diversification rates in teleosts increased soon after the acquisition of a duplicated genome, has never been tested.
9.
Thomas Thiel Andreas Graner Robbie Waugh Ivo Grosse Timothy J Close Nils Stein 《BMC evolutionary biology》2009,9(1):209
Background
Well preserved genomic colinearity among agronomically important grass species such as rice, maize, Sorghum, wheat and barley provides access to whole-genome structure information even in species lacking a reference genome sequence. We investigated footprints of whole-genome duplication (WGD) in barley that shaped the cereal ancestor genome by analyzing shared synteny with rice using a ~2000 gene-based barley genetic map and the rice genome reference sequence.
10.
Background
Ortholog assignment is a critical and fundamental problem in comparative genomics, since orthologs are considered to be functional counterparts in different species and can be used to infer molecular functions of one species from those of other species. MSOAR is a recently developed high-throughput system for assigning one-to-one orthologs between closely related species on a genome scale. It attempts to reconstruct the evolutionary history of input genomes in terms of genome rearrangement and gene duplication events. It assumes that a gene duplication event inserts a duplicated gene into the genome of interest at a random location (i.e., the random duplication model). However, in practice, biologists believe that genes are often duplicated by tandem duplications, where a duplicated gene is located next to the original copy (i.e., the tandem duplication model).
11.
12.
13.
Zhandong Liu Min Wang James V Alvarez Megan E Bonney Chien-chung Chen Celina D'Cruz Tien-chi Pan Mahlet G Tadesse Lewis A Chodosh 《Genome biology》2008,9(12):1-11
Background
The physical organization and chromosomal localization of genes within genomes is known to play an important role in their function. Most genes arise by duplication and move along the genome by random shuffling of DNA segments. Higher order structuring of the genome occurs in eukaryotes, where groups of physically linked genes are co-expressed. However, the contribution of gene duplication to gene order has not been analyzed in detail, as it is believed that co-expression due to recent duplicates would obscure other domains of co-expression.
Results
We have catalogued ordered duplicated genes in Drosophila melanogaster, and found that one in five of all genes is organized as tandem arrays. Furthermore, among arrays that have been spatially conserved over longer periods than would be expected on the basis of random shuffling, a disproportionate number contain genes encoding developmental regulators. Using in situ gene expression data for more than half of the Drosophila genome, we find that genes in these conserved clusters are co-expressed to a much higher extent than other duplicated genes.
Conclusions
These results reveal the existence of functional constraints in insects that retain copies of genes encoding developmental and regulatory proteins as neighbors, allowing their co-expression. This co-expression may be the result of shared cis-regulatory elements or a shared need for a specific chromatin structure. Our results highlight the association between genome architecture and the gene regulatory networks involved in the construction of the body plan.
14.
Berline Fopa Fomeju Cyril Falentin Gilles Lassalle Maria J Manzanares-Dauleux Régine Delourme 《BMC genomics》2014,15(1)
Background
Several major crop species are current or ancient polyploids. To better describe the genetic factors controlling traits of agronomic interest (QTL), it is necessary to understand the structural and functional organisation of these QTL regions in relation to genome duplication. We investigated quantitative resistance to the fungal disease stem canker in Brassica napus, a highly duplicated amphidiploid species, to assess the proportion of resistance QTL located at duplicated positions.
Results
Genome-wide association analysis on a panel of 116 oilseed rape varieties genotyped with 3228 SNP indicated that 321 markers, corresponding to 64 genomic regions, are associated with resistance to stem canker. These genomic regions are relatively equally distributed on the A (53%) and C (47%) genomes of B. napus. Overall, 44% of these regions (28/64) are duplicated homoeologous regions. They are located in duplications of six (E, J, R, T, U and W) of the 24 ancestral blocks that constitute the B. napus genome. Overall, these six ancestral blocks have 34 duplicated copies in the B. napus genome. Almost all of the duplicated copies (82% of the 34 regions) harboured resistance-associated markers for stem canker resistance, which suggests structural and functional conservation of genetic factors involved in this trait in B. napus.
Conclusions
Our study provides information on the involvement of duplicated loci in the control of stem canker resistance in B. napus. Further investigation of the similarity/divergence in sequence and gene content of these duplicated regions will provide insight into the conservation and allelic diversity of the underlying genes.
Electronic supplementary material
The online version of this article (doi:10.1186/1471-2164-15-498) contains supplementary material, which is available to authorized users.
15.
Xiyin Wang Udo Gowik Haibao Tang John E Bowers Peter Westhoff Andrew H Paterson 《Genome biology》2009,10(6):R68-18
Background
Sorghum is the first C4 plant and the second grass with a full genome sequence available. This makes it possible to perform a whole-genome-level exploration of C4 pathway evolution by comparing key photosynthetic enzyme genes in sorghum, maize (C4) and rice (C3), and to investigate a long-standing hypothesis that a reservoir of duplicated genes is a prerequisite for the evolution of C4 photosynthesis from a C3 progenitor.
16.
Marion Ouedraogo Charles Bettembourg Anthony Bretaudeau Olivier Sallou Christian Diot Olivier Demeure Frédéric Lecerf 《PloS one》2012,7(11)
Background
There has been a surge in studies linking genome structure and gene expression, with special focus on duplicated genes. Although initially duplicated from the same sequence, duplicated genes can diverge strongly over evolution and take on different functions or regulated expression. However, information on the function and expression of duplicated genes remains sparse. Identifying groups of duplicated genes in different genomes and characterizing their expression and function would therefore be of great interest to the research community. The ‘Duplicated Genes Database’ (DGD) was developed for this purpose.
Methodology
Nine species were included in the DGD. For each species, BLAST analyses were conducted on peptide sequences corresponding to the genes mapped on the same chromosome. Groups of duplicated genes were defined based on these pairwise BLAST comparisons and the genomic location of the genes. For each group, Pearson correlations between gene expression data and semantic similarities between functional GO annotations were also computed when the relevant information was available.
Conclusions
The Duplicated Gene Database provides a list of co-localised and duplicated genes for several species with the available gene co-expression level and semantic similarity value of functional annotation. Adding these data to the groups of duplicated genes provides biological information that can prove useful to gene expression analyses. The Duplicated Gene Database can be freely accessed through the DGD website at http://dgd.genouest.org.
17.
Background
The completion of rice genome sequencing has made rice and its wild relatives an attractive system for biological studies. Despite great efforts, phylogenetic relationships among genome types and species in the rice genus have not been fully resolved. To take full advantage of rice genome resources for biological research and rice breeding, we will benefit from the availability of a robust phylogeny of the rice genus.
Results
Through screening rice genome sequences, we sampled and sequenced 142 single-copy genes to clarify the relationships among all diploid genome types of the rice genus. The analysis identified two short internal branches around which most previous phylogenetic inconsistency emerged. These represent two episodes of rapid speciation that occurred approximately 5 and 10 million years ago (Mya) and gave rise to almost the entire diversity of the genus. The known chromosomal distribution of the sampled genes allowed the documentation of whole-genome sorting of ancestral alleles during the rapid speciation, which was responsible primarily for extensive incongruence between gene phylogenies and persisting phylogenetic ambiguity in the genus. Random sample analysis showed that 120 genes with an average length of 874 bp were needed to resolve both short branches with 95% confidence.
Conclusion
Our phylogenomic analysis successfully resolved the phylogeny of rice genome types, which lays a solid foundation for comparative and functional genomic studies of rice and its relatives. This study also highlights that organismal genomes might be mosaics of conflicting genealogies because of rapid speciation and demonstrates the power of phylogenomics in the reconstruction of rapid diversification.
18.
Carlos Quijano Pavel Tomancak Jesus Lopez-Marti Mikita Suyama Peer Bork Marco Milan David Torrents Miguel Manzanares 《Genome biology》2008,9(12):R176
Background
The physical organization and chromosomal localization of genes within genomes is known to play an important role in their function. Most genes arise by duplication and move along the genome by random shuffling of DNA segments. Higher order structuring of the genome occurs in eukaryotes, where groups of physically linked genes are co-expressed. However, the contribution of gene duplication to gene order has not been analyzed in detail, as it is believed that co-expression due to recent duplicates would obscure other domains of co-expression.
Results
We have catalogued ordered duplicated genes in Drosophila melanogaster, and found that one in five of all genes is organized as tandem arrays. Furthermore, among arrays that have been spatially conserved over longer periods than would be expected on the basis of random shuffling, a disproportionate number contain genes encoding developmental regulators. Using in situ gene expression data for more than half of the Drosophila genome, we find that genes in these conserved clusters are co-expressed to a much higher extent than other duplicated genes.
Conclusions
These results reveal the existence of functional constraints in insects that retain copies of genes encoding developmental and regulatory proteins as neighbors, allowing their co-expression. This co-expression may be the result of shared cis-regulatory elements or a shared need for a specific chromatin structure. Our results highlight the association between genome architecture and the gene regulatory networks involved in the construction of the body plan.
19.
Background
The peptide transporter (PTR) family, whose members can transport di-/tripeptides and nitrate, is important for plant growth and development. Although the rice (Oryza sativa L.) genome was sequenced several years ago, a genomic survey, characterization and expression profile analysis of the PTR family in this species has not been reported.
20.
《BMC genomics》2014,15(1)