首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Forward Genomics – a comparative genomics approach to link phenotype to genotype Despite availability of several sequenced genomes, we know very little about the specific changes in the DNA that underlie phenotypic differences between species. The main reason is that species differ by both numerous genomic and phenotypic changes. A new comparative genomics method addresses this question by for phenotypes with independent evolutionary losses by searching for genomic regions that exhibit an elevated number of mutations in exactly these phenotype‐loss species. The near future sequencing of thousands of novel genomes will make it possible to use comparative genomics to systematically search for such DNA changes that are associated with phenotypic differences.  相似文献   

2.
An evolutionary distance is introduced in order to propose an efficient and feasible procedure for phylogeny studies. Our analysis are based on the strand asymmetry property of mitochondrial DNA, but can be applied to other genomes. Comparison of our results with those reported in conventional phylogenetic trees, gives confidence about our approximation. Our findings support the hypotheses about the origin of the skew and its dependence upon evolutionary pressures, and improves previous efforts on using the strand asymmetry property of genomes for phylogeny inference. For the evolutionary distance introduced here, we observe that the more adequate technique for tree reconstructions correspond to an average link method which employs a sequential clustering algorithm.  相似文献   

3.
Analyzing genomes with cumulative skew diagrams.   总被引:28,自引:3,他引:25       下载免费PDF全文
A novel method of cumulative diagrams shows that the nucleotide composition of a microbial chromosome changes at two points separated by about a half of its length. These points coincide with sites of replication origin and terminus for all bacteria where such sites are known. The leading strand is found to contain more guanine than cytosine residues. This fact is used to predict origin and terminus locations in other bacterial and archaeal genomes. Local changes, visible as diagram distortions, may represent recent genome rearrangements, as demonstrated for two strains of Escherichia coli . Analysis of the diagrams of viral and mitochondrial genomes suggests a link between the base composition bias and the time spent by DNA in a single stranded state during replication.  相似文献   

4.
The patterns of synonymous codon usage, both within and among genomes, have been extensively studied over the past two decades. Despite the accumulating evidence that natural selection can shape codon usage, it has not been possible to link a particular pattern of codon usage to a specific external selective force. Here, we have analyzed the patterns of synonymous codon usage in 40 completely sequenced prokaryotic genomes. By combining the genes from several genomes (more than 80 000 genes in all) into a single dataset for this analysis, we were able to investigate variations in codon usage, both within and between genomes. The results show that synonymous codon usage is affected by two major factors: (i) the overall G+C content of the genome and (ii) growth at high temperature. This study focused on the relationship between synonymous codon usage and the ability to grow at high temperature. We have been able to eliminate both phylogenetic history and lateral gene transfer as possible explanations for the characteristic pattern of codon usage among the thermophiles. Thus, these results demonstrate a clear link between a particular pattern of codon usage and an external selective force.  相似文献   

5.
Eukaryotes are traditionally considered to be one of the three natural divisions of the tree of life and the sister group of the Archaebacteria. However, eukaryotic genomes are replete with genes of eubacterial ancestry, and more than 20 mutually incompatible hypotheses have been proposed to account for eukaryote origins. Here we test the predictions of these hypotheses using a novel supertree-based phylogenetic signal-stripping method, and recover supertrees of life based on phylogenies for up to 5,741 single gene families distributed across 185 genomes. Using our signal-stripping method, we show that there are three distinct phylogenetic signals in eukaryotic genomes. In order of strength, these link eukaryotes with the Cyanobacteria, the Proteobacteria, and the Thermoplasmatales, an archaebacterial (euryarchaeotes) group. These signals correspond to distinct symbiotic partners involved in eukaryote evolution: plastids, mitochondria, and the elusive host lineage. According to our whole-genome data, eukaryotes are hardly the sister group of the Archaebacteria, because up to 83% of eukaryotic genes with a prokaryotic homolog have eubacterial, not archaebacterial, origins. The results reject all but two of the current hypotheses for the origin of eukaryotes: those assuming a sulfur-dependent or hydrogen-dependent syntrophy for the origin of mitochondria.  相似文献   

6.
Experiments are described which show that recombinant plasmids bearing cauliflower mosaic virus genomes flanked by homologous viral DNA arms (‘nested’ viral genomes) can infect plants without prior excision of the CaMV genome. Infectious viral genomes are excised from such recombinant plasmids within the plant by a process of intramolecular recombination. The frequency of infection, and presumably recombination increase with the length of the homologous DNA flanking the CaMV genome. This mechanism affords us the opportunity to transfer directly the CaMV genome from bacteria to plants.  相似文献   

7.
《Genomics》2022,114(4):110405
Southern Italy was characterised by a complex prehistory that started with different Palaeolithic cultures, later followed by the Neolithization and the demic dispersal from the Pontic-Caspian Steppe during the Bronze Age. Archaeological and historical evidences point to a link between Southern Italians and the Balkans still present in modern times. To shed light on these dynamics, we analysed around 700 South Mediterranean genomes combined with informative ancient DNAs. Our findings revealed high affinities of South-Eastern Italians with modern Eastern Peloponnesians, and a closer affinity of ancient Greek genomes with those from specific regions of South Italy than modern Greek genomes. The higher similarity could be associated with a Bronze Age component ultimately originating from the Caucasus with high Iranian and Anatolian Neolithic ancestries. Furthermore, extremely differentiated allele frequencies among Northern and Southern Italy revealed putatively adapted SNPs in genes involved in alcohol metabolism, nevi features and immunological traits.  相似文献   

8.
The relationship between vertebrates and the principal model invertebrates - fruitflies and nematodes - is unclear. A fly-nematode grouping was becoming widely accepted, but recent comparisons of their genomes argue against this and link flies with the vertebrates instead.  相似文献   

9.
10.
Natural selection processes tune genomes in the edge of the chaos imposed by mutation and drift, allowing an enduring exploration of fitter genetic networks within the constraints imposed by self-organization and the interactions of genotype and phenotype. Alternatively, evolution can be viewed from thermodynamic, kinetic or cybernetic perspectives. Regardless of insight, there is need to understand structure-function relationships at the molecular and holistic evolutionary levels. Strategies are here described that analyze genetic variation in time and trace the evolution of nucleic acid structure. Nucleic acid scanning techniques were used to measure sequence divergence and provide a direct inference of genome-wide mutation rate. This was tested for the first time in vegetatively propagating plants. The method is general and was also used in a study of mutational patterns in phytopathogenic fungi, showing there was a link between sequence and structural diversification of ribosomal gene spacers. In order to determine if this was a general phenomenon, the origin and diversification of nucleic acid secondary structure was traced using a cladistic method capable of producing rooted phylogenetic trees. Phylogenies reconstructed from primary and secondary RNA structure were congruent at all taxonomical levels, providing evidence of a strong link between phenotype and genotype favoring thermodynamic stability and dissipation of Gibbs free energy. Overall results suggest that thermodynamic principles are important driving forces of the evolutionary processes of the living world.  相似文献   

11.
Discovery of pyknons, the most frequent, variable-length DNA sequence motifs in the human genomes, suggests extensive sequence-based connectivity between non-coding and protein-coding components of human genomes. Here we report identification of ubiquitous template design sequences (templum intentio series, templints) of human genomes common for disease-associated SNPs, microRNAs, and pyknons. We demonstrate that genome-unique SNP-coding sequences associated with multiple common human disorders appear assembled from series of ubiquitous short octamer sequences shared by 5’-UTR pyknons and microRNAs. Our analysis suggests that units of genetic information encoded in the linear sequences of the 3.6 billion bases of human genome are condensed in ~200,000 bases (0.006%) of 5’ UTR pyknons which are represented by hundreds of copies in a genome and utilized to build genome-unique sequences. Allele-specific sequence variations link disease-associated SNPs to distinct sets of pyknons and microRNAs, suggesting that increased susceptibility to multiple common human disorders is associated with global alterations of genome-wide regulatory templates affecting the biogenesis and functions of non-coding RNAs.  相似文献   

12.
It has widely been documented that life form and mating system have significant influences on genetic diversity. In the tribe Triticeae, several genera contain both annual and perennial species, whereas other genera comprise strictly annual or perennial species. It was suggested that Triticeae annuals have originated from Triticeae perennials. The present study aims to analyze nucleotide diversity of Acc-1 gene among different Triticeae genomes, and attempts to link effects of life history (annuals and perennials) and mating systems. The nucleotide diversity of 364 Acc-1 sequences in Triticeae species was characterized. The highest estimates of nucleotide diversity values (π = 0.01919, θ = 0.03515) were found for the Ns genome among the genomes analyzed. Nucleotide diversities in the D genome and Ns genome of polyploids are higher than those in respective genomes of diploids, while in the St genome of polyploids, it is lower than that in the St genome of diploids. The averaged π value (0.013705) in the genomes of perennials is more than twice of the value (0.00508) in the genomes of annuals. The averaged π value (0.01323) in the genomes of outcrossing species is two-fold of the value (0.005664) in the genomes of selfer. Our results suggested that the evolutionary history and mating system may play an important role in determining nucleotide diversity of Acc-1 gene in each genome.  相似文献   

13.
New method to reveal the sites in genomes obtaining the high information capacity is developed. A distribution of those sites of the length 3 among 16 viral genomes and 11 bacteriophages genomes has been studied. It is shown that some triplets with high information capacity occur in a family of relatively close genomes with the increased frequency. The molecular evolution aspects of a persistence of highly scored sites with respect to their information capacity among various genomes are discussed.  相似文献   

14.
Salmonella enterica is an animal and zoonotic pathogen of worldwide importance and may be classified into serovars differing in virulence and host range. We sequenced and annotated the genomes of serovar Typhimurium, Choleraesuis, Dublin, and Gallinarum strains of defined virulence in each of three food-producing animal hosts. This provides valuable measures of intraserovar diversity and opportunities to formally link genotypes to phenotypes in target animals.  相似文献   

15.
Two-component systems that link environmental signals to cellular responses are viewed as the primary mode of signal transduction in prokaryotes. By analyzing information encoded by 145 prokaryotic genomes, we found that the majority of signal transduction systems consist of a single protein that contains input and output domains but lacks phosphotransfer domains typical of two-component systems. One-component systems are evolutionarily older, more widely distributed among bacteria and archaea, and display a greater diversity of domains than two-component systems.  相似文献   

16.
We present here a comprehensive analysis of the complement of enzymes in a large variety of species. As enzymes are a relatively conserved group there are several classification systems available that are common to all species and link a protein sequence to an enzymatic function. Enzymes are therefore an ideal functional group to study the relationship between sequence expansion, functional divergence and phenotypic changes. By using information retrieved from the well annotated SWISS-PROT database together with sequence information from a variety of fully sequenced genomes and information from the EC functional scheme we have aimed here to estimate the fraction of enzymes in genomes, to determine the extent of their functional redundancy in different domains of life and to identify functional innovations and lineage specific expansions in the metazoa lineage. We found that prokaryote and eukaryote species differ both in the fraction of enzymes in their genomes and in the pattern of expansion of their enzymatic sets. We observe an increase in functional redundancy accompanying an increase in species complexity. A quantitative assessment was performed in order to determine the degree of functional redundancy in different species. Finally, we report a massive expansion in the number of mammalian enzymes involved in signalling and degradation.  相似文献   

17.
Holland, P.W.H. 2010. From genomes to morphology: a view from amphioxus. —Acta Zoologica (Stockholm) 91 : 81–86 As complete genome sequences are determined from an ever‐increasing number of animal species, new opportunities are arising for comparative biology. For zoologists interested in the evolution of shape and form, however, there is a problem. The link between genome sequence and morphology is not direct and is obfuscated by complex and evolving genetic pathways, even when conserved regulatory genes are considered. Nonetheless, a large‐scale comparison of genome sequences between extant chordates reveals an intriguing parallel between genotypic and phenotypic evolution. Tunicates have highly altered genomes, with loss of ancestral genes and shuffled genetic arrangements, while vertebrate genomes are also derived through gene loss and genome duplication. The recently sequenced amphioxus genome, in contrast, reveals much greater stasis on the cephalochordate lineage, in parallel to a less derived body plan. The opportunities and challenges for relating genome evolution to morphological evolution are discussed.  相似文献   

18.

Background

In conventional approaches to plastid and mitochondrial genome sequencing, the sequencing steps are performed separately; thus, plastid DNA (ptDNA) and mitochondrial DNA (mtDNA) should be prepared independently. However, it is difficult to extract pure ptDNA and mtDNA from plant tissue. Following the development of high-throughput sequencing technology, many researchers have attempted to obtain plastid genomes or mitochondrial genomes using high-throughput sequencing data from total DNA. Unfortunately, the huge datasets generated consume massive computing and storage resources and cost a great deal, and even more importantly, excessive pollution reads affect the accuracy of the assembly. Therefore, it is necessary to develop an effective method that can generate base sequences from plant tissue and that is suitable for all plant species. Here, we describe a highly effective, low-cost method for obtaining plastid and mitochondrial genomes simultaneously.

Results

First, we obtained high-quality DNA employing Partial Concentration Extraction. Second, we evaluated the purity of the DNA sample and determined the sequencing dataset size employing Vector Control Quantitative Analysis. Third, paired-end reads were obtained using a high-throughput sequencing platform. Fourth, we obtained scaffolds employing Two-step Assembly. Finally, we filled in gaps using specific methods and obtained complete plastid and mitochondrial genomes. To ensure the accuracy of plastid and mitochondrial genomes, we validated the assembly using PCR and Sanger sequencing. Using this method,we obtained complete plastid and mitochondrial genomes with lengths of 153,533 nt and 223,412 nt separately.

Conclusion

A simple method for extracting, evaluating, sequencing and assembling plastid and mitochondrial genomes was developed. This method has many advantages: it is timesaving, inexpensive and reproducible and produces high-quality sequence. Furthermore, this method can produce plastid and mitochondrial genomes simultaneously and be used for other plant species. Due to its simplicity and extensive applicability, this method will support research on plant cytoplasmic genomes.  相似文献   

19.
Alexeev  Nikita  Alekseyev  Max A. 《BMC genomics》2017,18(4):356-9

Background

The ability to estimate the evolutionary distance between extant genomes plays a crucial role in many phylogenomic studies. Often such estimation is based on the parsimony assumption, implying that the distance between two genomes can be estimated as the rearrangement distance equal the minimal number of genome rearrangements required to transform one genome into the other. However, in reality the parsimony assumption may not always hold, emphasizing the need for estimation that does not rely on the rearrangement distance. The distance that accounts for the actual (rather than minimal) number of rearrangements between two genomes is often referred to as the true evolutionary distance. While there exists a method for the true evolutionary distance estimation, it however assumes that genomes can be broken by rearrangements equally likely at any position in the course of evolution. This assumption, known as the random breakage model, has recently been refuted in favor of the more rigorous fragile breakage model postulating that only certain “fragile” genomic regions are prone to rearrangements.

Results

We propose a new method for estimating the true evolutionary distance between two genomes under the fragile breakage model. We evaluate the proposed method on simulated genomes, which show its high accuracy. We further apply the proposed method for estimation of evolutionary distances within a set of five yeast genomes and a set of two fish genomes.

Conclusions

The true evolutionary distances between the five yeast genomes estimated with the proposed method reveals that some pairs of yeast genomes violate the parsimony assumption. The proposed method further demonstrates that the rearrangement distance between the two fish genomes underestimates their evolutionary distance by about 20%. These results demonstrate how drastically the two distances can differ and justify the use of true evolutionary distance in phylogenomic studies.
  相似文献   

20.
Recent years have seen the extensive use of phylogeographic approaches to unveil the dispersal history of virus epidemics. Spatially explicit reconstructions of viral spread represent valuable sources of lineage movement data that can be exploited to investigate the impact of underlying environmental layers on the dispersal of pathogens. Here, we performed phylogeographic inference and applied different post hoc approaches to analyse a new and comprehensive data set of viral genomes to elucidate the dispersal history and dynamics of rabies virus (RABV) in Iran, which have remained largely unknown. We first analysed the association between environmental factors and variations in dispersal velocity among lineages. Second, we present, test and apply a new approach to study the link between environmental conditions and the dispersal direction of lineages. The statistical performance (power of detection, false‐positive rate) of this new method was assessed using simulations. We performed phylogeographic analyses of RABV genomes, allowing us to describe the large diversity of RABV in Iran and to confirm the cocirculation of several clades in the country. Overall, we estimate a relatively high lineage dispersal velocity, similar to previous estimates for dog rabies virus spread in northern Africa. Finally, we highlight a tendency for RABV lineages to spread in accessible areas associated with high human population density. Our analytical workflow illustrates how phylogeographic approaches can be used to investigate the impact of environmental factors on several aspects of viral dispersal dynamics.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号