首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 156 毫秒
1.
CpG islands (CGIs) are often considered as gene markers, but the number of CGIs varies among mammalian genomes that have similar numbers of genes. In this study, we investigated the distribution of CGIs in the promoter regions of 3,197 human-mouse orthologous gene pairs and found that the mouse genome has notably fewer CGIs in the promoter regions and less pronounced CGI characteristics than does the human genome. We further inferred CGI's ancestral state using the dog genome as a reference and examined the nucleotide substitution pattern and the mutational direction in the conserved regions of human and mouse CGIs. The results reveal many losses of CGIs in both genomes but the loss rate in the mouse lineage is two to four times the rate in the human lineage. We found an intriguing feature of CGI loss, namely that the loss of a CGI usually starts from erosion at the both edges and gradually moves towards the center. We found functional bias in the genes that have lost promoter-associated CGIs in the human or mouse lineage. Finally, our analysis indicates that the association of CGIs with housekeeping genes is not as strong as previously estimated. Our study provides a detailed view of the evolution of promoter-associated CGIs in the human and mouse genomes and our findings are helpful for understanding the evolution of mammalian genomes and the role of CGIs in gene function.  相似文献   

2.
3.
To study reductive evolutionary processes in bacterial genomes, we examine sequences in the Rickettsia genomes which are unconstrained by selection and evolve as pseudogenes, one of which is the metK gene, which codes for AdoMet synthetase. Here, we sequenced the metK gene and three surrounding genes in eight different species of the genus Rickettsia. The metK gene was found to contain a high incidence of deletions in six lineages, while the three genes in its surroundings were functionally conserved in all eight lineages. A more drastic example of gene degradation was identified in the metK downstream region, which contained an open reading frame in Rickettsia felis. Remnants of this open reading frame could be reconstructed in five additional species by eliminating sites of frameshift mutations and termination codons. A detailed examination of the two reconstructed genes revealed that deletions strongly predominate over insertions and that there is a strong transition bias for point mutations which is coupled to an excess of GC-to-AT substitutions. Since the molecular evolution of these inactive genes should reflect the rates and patterns of neutral mutations, our results strongly suggest that there is a high spontaneous rate of deletions as well as a strong mutation bias toward AT pairs in the Rickettsia genomes. This may explain the low genomic G + C content (29%), the small genome size (1.1 Mb), and the high noncoding content (24%), as well as the presence of several pseudogenes in the Rickettsia prowazekii genome.  相似文献   

4.
Payseur BA  Hoekstra HE 《Genetics》2005,171(4):1905-1916
Reproductive isolation is often caused by the disruption of genic interactions that evolve in geographically separate populations. Identifying the genomic regions and genes involved in these interactions, known as "Dobzhansky-Muller incompatibilities," can be challenging but is facilitated by the wealth of genetic markers now available in model systems. In recent years, the complete genome sequence and thousands of single nucleotide polymorphisms (SNPs) from laboratory mice, which are largely genetic hybrids between Mus musculus and M. domesticus, have become available. Here, we use these resources to locate genomic regions that may underlie reproductive isolation between these two species. Using genotypes from 332 SNPs that differ between wild-derived strains of M. musculus and M. domesticus, we identified several physically unlinked SNP pairs that show exceptional gametic disequilibrium across the lab strains. Conspecific alleles were associated in a disproportionate number of these cases, consistent with the action of natural selection against hybrid gene combinations. As predicted by the Dobzhansky-Muller model, this bias was differentially attributable to locus pairs for which one hybrid genotype was missing. We assembled a list of potential Dobzhansky-Muller incompatibilities from locus pairs that showed extreme associations (only three gametic types) among conspecific alleles. Two SNPs in this list map near known hybrid sterility loci on chromosome 17 and the X chromosome, allowing us to nominate partners for disrupted interactions involving these genomic regions for the first time. Together, these results indicate that patterns produced by speciation between M. musculus and M. domesticus are visible in the genomes of lab strains of mice, underscoring the potential of these genetic model organisms for addressing general questions in evolutionary biology.  相似文献   

5.
作为DNA序列的重要组成特征,基因组寡核苷酸使用模式及其偏倚的研究已被广泛应用于原核生物基因组的分析。然而,关于寡核苷酸使用模式的偏倚是否具有种群特异性并反映种群的功能这一问题,尚未阐明。我们基于一阶马尔可夫链模型,提出了一个度量寡核苷酸使用模式偏倚的新指标——基因组三核苷酸(trinucleotide,tri-)转移概率偏倚(transition probability bias,TPB)特征向量,或称之为三核苷酸转移概率最大偏倚分布,并分析比较了727条有代表性的原核生物基因组序列tri-TPB特征向量。结果表明,基因组tri-TPB特征向量具有物种特异性,亲缘关系越近的物种,它们的tri-TPB特征向量越相似;同种内的不同菌株具有几乎完全相同的tri-TPB特征向量,并且不依赖于基因组的GC含量;此外,基因组tri-TPB特征向量的相似性与菌株的致病性特征相关。本研究结果为基于全基因组寡核苷酸组成和分布信息的物种及其致病性进化分析提供了新的思路和方法。  相似文献   

6.
The evolution of a pair of chromosomes that differ in appearance between males and females (heteromorphic sex chromosomes) has occurred repeatedly across plants and animals. Recent work has shown that the male heterogametic (XY) and female heterogametic (ZW) sex chromosomes evolved independently from different pairs of homomorphic autosomes in the common ancestor of birds and mammals but also that X and Z chromosomes share many convergent molecular features. However, little is known about how often heteromorphic sex chromosomes have either evolved convergently from different autosomes or in parallel from the same pair of autosomes and how universal patterns of molecular evolution on sex chromosomes really are. Among winged insects with sequenced genomes, there are male heterogametic species in both the Diptera (e.g., Drosophila melanogaster) and the Coleoptera (Tribolium castaneum), female heterogametic species in the Lepidoptera (Bombyx mori), and haplodiploid species in the Hymenoptera (e.g., Nasonia vitripennis). By determining orthologous relationships among genes on the X and Z chromosomes of insects with sequenced genomes, we are able to show that these chromosomes are not homologous to one another but are homologous to autosomes in each of the other species. These results strongly imply that heteromorphic sex chromosomes have evolved independently from different pairs of ancestral chromosomes in each of the insect orders studied. We also find that the convergently evolved X chromosomes of Diptera and Coleoptera share genomic features with each other and with vertebrate X chromosomes, including excess gene movement from the X to the autosomes. However, other patterns of molecular evolution--such as increased codon bias, decreased gene density, and the paucity of male-biased genes on the X--differ among the insect X and Z chromosomes. Our results provide evidence for both differences and nearly universal similarities in patterns of evolution among independently derived sex chromosomes.  相似文献   

7.
8.
ABSTRACT. Analysis of total DNA isolated from the Chrysophyte alga Ochromonas danica revealed, in addition to nuclear DNA, two genomes present as numerous copies per cell. The larger genome (?120 kilobase pairs or kbp) is the plastid DNA, which is identified by its hybridization to plasmids containing sequences for the photosynthesis genes rbcL, psbA, and psbC. The smaller genome (40 kbp) is the mitochondrial genome as identified by its hybridization with plasmids containing gene sequences of plant cytochrome oxidase subunits I and II. Both the 120- and 40-kbp genomes contain genes for the small and large subunits of rDNA. The mitochondrial genome is linear with terminal inverted repeats of about 1.6 kbp. Two other morphologically similar species were examined, Ochromonas minuta and Poteriochromonas malhamensis. All three species have linear mitochondrial DNA of 40 kbp. Comparisons of endonuclease restriction-fragment patterns of the mitochondrial and chloroplast DNAs as well as those of their nuclear rDNA repeats failed to reveal any fragment shared by any two of the species. Likewise, no common fragment size was detected by hybridization with plasmids containing heterologous DNA or with total mitochondrial DNA of O. danica; these observations support the taxonomic assignment of these three organisms to different species. The Ochromonas mitochondrial genomes are the first identified in the chlorophyll a/c group of algae. Combining these results with electron microscopic observations of putative mitochondrial genomes reported for other chromophytes and published molecular studies of other algal groups suggests that all classes of eukaryote algae may have mitochondrial genomes < 100 kbp in size, more like other protistans than land plants.  相似文献   

9.
10.
11.
Despite substantial work, the phylogeny of malaria parasites remains debated. The matter is complicated by concerns about patterns of evolution in potentially strongly selected genes as well as the extreme AT bias of some Plasmodium genomes. Particularly contentious has been the position of the most virulent human parasite Plasmodium falciparum, whether grouped with avian parasites or within a larger clade of mammalian parasites. Here, we study 3 classes of rare genomic changes, as well as the sequences of mitochondrial ribosomal RNA (rRNA) genes. We report 3 lines of support for a clade of mammalian parasites: 1) we find no instances of spliceosomal intron loss in a hypothetical ancestor of P. falciparum and the avian parasite Plasmodium gallinaceum, suggesting against a close relationship between those species; 2) we find 4 genomic mitochondrial indels supporting a mammalian clade, but none grouping P. falciparum with avian parasites; and 3) slowly evolving mitochondrial rRNA sequences support a mammalian parasite clade with 100% posterior probability. We further report a large deletion in the mitochondrial large subunit rRNA gene, which suggests a subclade including both African and Asian parasites within the clade of closely related primate malarias. This contrasts with previous studies that provided strong support for separate Asian and African clades, and reduces certainty about the historical and geographic origins of Plasmodium vivax. Finally, we find a lack of synapomorphic gene losses, suggesting a low rate of ancestral gene loss in Plasmodium.  相似文献   

12.
Angiosperm mitochondrial genomes and mutations   总被引:1,自引:0,他引:1  
Flowering plants harbor the largest mitochondrial genomes reported so far. At present, the nucleotide sequences of 15 mitochondrial genomes from seven angiosperm species are available, making detailed comparative analysis feasible. The gene content is variable among the species, but the most striking feature is the fluidity of intergenic regions, where species-specific sequences predominate. Additionally, angiosperm mitochondrial genomes, even within a species, show a remarkable amount of rearrangement. We also review mitochondrial mutants in angiosperms from a genomic viewpoint, and discuss how they have arisen. The involvement of nuclear genes in mitochondrial genome stability and organization is currently being revealed through the analysis of mutants.  相似文献   

13.

Background

Nucleomorphs are residual nuclei derived from eukaryotic endosymbionts in chlorarachniophyte and cryptophyte algae. The endosymbionts that gave rise to nucleomorphs and plastids in these two algal groups were green and red algae, respectively. Despite their independent origin, the chlorarachniophyte and cryptophyte nucleomorph genomes share similar genomic features such as extreme size reduction and a three-chromosome architecture. This suggests that similar reductive evolutionary forces have acted to shape the nucleomorph genomes in the two groups. Thus far, however, only a single chlorarachniophyte nucleomorph and plastid genome has been sequenced, making broad evolutionary inferences within the chlorarachniophytes and between chlorarachniophytes and cryptophytes difficult. We have sequenced the nucleomorph and plastid genomes of the chlorarachniophyte Lotharella oceanica in order to gain insight into nucleomorph and plastid genome diversity and evolution.

Results

The L. oceanica nucleomorph genome was found to consist of three linear chromosomes totaling ~610 kilobase pairs (kbp), much larger than the 373 kbp nucleomorph genome of the model chlorarachniophyte Bigelowiella natans. The L. oceanica plastid genome is 71 kbp in size, similar to that of B. natans. Unexpectedly long (~35 kbp) sub-telomeric repeat regions were identified in the L. oceanica nucleomorph genome; internal multi-copy regions were also detected. Gene content analyses revealed that nucleomorph house-keeping genes and spliceosomal intron positions are well conserved between the L. oceanica and B. natans nucleomorph genomes. More broadly, gene retention patterns were found to be similar between nucleomorph genomes in chlorarachniophytes and cryptophytes. Chlorarachniophyte plastid genomes showed near identical protein coding gene complements as well as a high level of synteny.

Conclusions

We have provided insight into the process of nucleomorph genome evolution by elucidating the fine-scale dynamics of sub-telomeric repeat regions. Homologous recombination at the chromosome ends appears to be frequent, serving to expand and contract nucleomorph genome size. The main factor influencing nucleomorph genome size variation between different chlorarachniophyte species appears to be expansion-contraction of these telomere-associated repeats rather than changes in the number of unique protein coding genes. The dynamic nature of chlorarachniophyte nucleomorph genomes lies in stark contrast to their plastid genomes, which appear to be highly stable in terms of gene content and synteny.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-374) contains supplementary material, which is available to authorized users.  相似文献   

14.
15.
SK Behura  DW Severson 《PloS one》2012,7(8):e43111

Background

Codon bias is a phenomenon of non-uniform usage of codons whereas codon context generally refers to sequential pair of codons in a gene. Although genome sequencing of multiple species of dipteran and hymenopteran insects have been completed only a few of these species have been analyzed for codon usage bias.

Methods and Principal Findings

Here, we use bioinformatics approaches to analyze codon usage bias and codon context patterns in a genome-wide manner among 15 dipteran and 7 hymenopteran insect species. Results show that GAA is the most frequent codon in the dipteran species whereas GAG is the most frequent codon in the hymenopteran species. Data reveals that codons ending with C or G are frequently used in the dipteran genomes whereas codons ending with A or T are frequently used in the hymenopteran genomes. Synonymous codon usage orders (SCUO) vary within genomes in a pattern that seems to be distinct for each species. Based on comparison of 30 one-to-one orthologous genes among 17 species, the fruit fly Drosophila willistoni shows the least codon usage bias whereas the honey bee (Apis mellifera) shows the highest bias. Analysis of codon context patterns of these insects shows that specific codons are frequently used as the 3′- and 5′-context of start and stop codons, respectively.

Conclusions

Codon bias pattern is distinct between dipteran and hymenopteran insects. While codon bias is favored by high GC content of dipteran genomes, high AT content of genes favors biased usage of synonymous codons in the hymenopteran insects. Also, codon context patterns vary among these species largely according to their phylogeny.  相似文献   

16.
This paper presents a genomic comparison between 20 sequenced BACs (or fragments of BACs) from Tetraodon nigroviridis and the human genome. A total of 199 fish genes were identified by informatics resources, together with their putative human orthologues. Comparisons of the localizations in both species led to the identification of 32 syntenic regions and a minimum of 131 rearrangements in these regions that occurred during independent evolution of these species. This made it possible to estimate the rate of genomic rearrangements that occurred per million years (and per megabase). This rate is comparable to that obtained by comparison of the Fugu rubripes shotgun sequence data to human data but is significantly higher that those obtained by comparing the human genome to mammalian genomes. Overall, it suggests that genomic evolution by rearrangement is not uniform within the vertebrate group.Sequence data for the genomic BAC clones have been deposited with the DDBJ/EMBL/GenBank Data Libraries under accession numbers BX629360, BX629354, BX629355, BX629356, BX629357, BX629358, BX629359, and BX629360.  相似文献   

17.
Somatic mutations are not distributed randomly throughout Ab V region genes. A sequence-specific target bias is revealed by a defined hierarchy of mutability among di- and trinucleotide sequences located within Ig intronic DNA. Here we report that the di- and trinucleotide mutability preference pattern is shared by mouse intronic JH and Jkappa clusters and by human VH genes, suggesting that a common mutation mechanism exists for all Ig V genes of both species. Using di- and trinucleotide target preferences, we performed a comprehensive analysis of human and murine germline V genes to predict regional mutabilities. Heavy chain genes of both species exhibit indistinguishable patterns in which complementarity-determining region 1 (CDR1), CDR2, and framework region 3 (FR3) are predicted to be more mutable than FR1 and FR2. This prediction is borne out by empirical mutation data from nonproductively rearranged human VH genes. Analysis of light chain genes in both species also revealed a common, but unexpected, pattern in which FR2 is predicted to be highly mutable. While our analyses of nonfunctional Ig genes accurately predicts regional mutation preferences in VH genes, observed relative mutability differences between regions are more extreme than expected. This cannot be readily accounted for by nascent mRNA secondary structure or by a supplemental gene conversion mechanism that might favor nucleotide replacements in CDR. Collectively, our data support the concept of a common mutation mechanism for heavy and light chain genes of mice and humans with regional bias that is qualitatively, but not quantitatively, accounted for by short nucleotide sequence composition.  相似文献   

18.
DAGchainer: a tool for mining segmental genome duplications and synteny   总被引:8,自引:0,他引:8  
SUMMARY: Given the positions of protein-coding genes along genomic sequence and probability values for protein alignments between genes, DAGchainer identifies chains of gene pairs sharing conserved order between genomic regions, by identifying paths through a directed acyclic graph (DAG). These chains of collinear gene pairs can represent segmentally duplicated regions and genes within a single genome or syntenic regions between related genomes. Automated mining of the Arabidopsis genome for segmental duplications illustrates the use of DAGchainer.  相似文献   

19.
Codon usage bias refers to the phenomenon where specific codons are used more often than other synonymous codons during translation of genes, the extent of which varies within and among species. Molecular evolutionary investigations suggest that codon bias is manifested as a result of balance between mutational and translational selection of such genes and that this phenomenon is widespread across species and may contribute to genome evolution in a significant manner. With the advent of whole‐genome sequencing of numerous species, both prokaryotes and eukaryotes, genome‐wide patterns of codon bias are emerging in different organisms. Various factors such as expression level, GC content, recombination rates, RNA stability, codon position, gene length and others (including environmental stress and population size) can influence codon usage bias within and among species. Moreover, there has been a continuous quest towards developing new concepts and tools to measure the extent of codon usage bias of genes. In this review, we outline the fundamental concepts of evolution of the genetic code, discuss various factors that may influence biased usage of synonymous codons and then outline different principles and methods of measurement of codon usage bias. Finally, we discuss selected studies performed using whole‐genome sequences of different insect species to show how codon bias patterns vary within and among genomes. We conclude with generalized remarks on specific emerging aspects of codon bias studies and highlight the recent explosion of genome‐sequencing efforts on arthropods (such as twelve Drosophila species, species of ants, honeybee, Nasonia and Anopheles mosquitoes as well as the recent launch of a genome‐sequencing project involving 5000 insects and other arthropods) that may help us to understand better the evolution of codon bias and its biological significance.  相似文献   

20.
Haplotype block structure is conserved across mammals   总被引:2,自引:0,他引:2  
Genetic variation in genomes is organized in haplotype blocks, and species-specific block structure is defined by differential contribution of population history effects in combination with mutation and recombination events. Haplotype maps characterize the common patterns of linkage disequilibrium in populations and have important applications in the design and interpretation of genetic experiments. Although evolutionary processes are known to drive the selection of individual polymorphisms, their effect on haplotype block structure dynamics has not been shown. Here, we present a high-resolution haplotype map for a 5-megabase genomic region in the rat and compare it with the orthologous human and mouse segments. Although the size and fine structure of haplotype blocks are species dependent, there is a significant interspecies overlap in structure and a tendency for blocks to encompass complete genes. Extending these findings to the complete human genome using haplotype map phase I data reveals that linkage disequilibrium values are significantly higher for equally spaced positions in genic regions, including promoters, as compared to intergenic regions, indicating that a selective mechanism exists to maintain combinations of alleles within potentially interacting coding and regulatory regions. Although this characteristic may complicate the identification of causal polymorphisms underlying phenotypic traits, conservation of haplotype structure may be employed for the identification and characterization of functionally important genomic regions.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号