Similar articles
20 similar articles found (search time: 31 ms)
1.
Dehnert M  Helm WE  Hütt MT 《Gene》2005,345(1):81-90
We study short-range correlations in DNA sequences with methods from information theory and statistics. We find a persisting degree of identity between the correlation patterns of different chromosomes of a species. Except for the case of human and chimpanzee, inter-species differences in this correlation pattern allow robust species distinction: in a clustering tree based upon the correlation curves on the level of individual chromosomes, distinct clusters for the individual species are found. This capacity for distinguishing species persists even when the length of the underlying sequences is drastically reduced. In comparison to the standard tool for studying symbol correlations in DNA sequences, namely the mutual information function, we find that an autoregressive model for higher-order Markov processes significantly improves species distinction due to an implicit subtraction of random background.
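As an illustration of the mutual information function mentioned in this abstract, the following is a minimal sketch, assuming the usual plug-in estimate from observed symbol-pair frequencies. The function name `mutual_information` and its parameters are illustrative choices, not taken from the paper, and the autoregressive model the authors propose is not reproduced here.

```python
from collections import Counter
from math import log2


def mutual_information(seq: str, k: int) -> float:
    """Plug-in estimate (in bits) of the mutual information between
    symbols that are k positions apart in seq."""
    pairs = [(seq[i], seq[i + k]) for i in range(len(seq) - k)]
    n = len(pairs)
    if n == 0:
        return 0.0
    joint = Counter(pairs)                  # counts of (first, second) pairs
    left = Counter(a for a, _ in pairs)     # marginal counts, first symbol
    right = Counter(b for _, b in pairs)    # marginal counts, second symbol
    mi = 0.0
    for (a, b), c in joint.items():
        # p_ab / (p_a * p_b) simplifies to c * n / (left[a] * right[b])
        mi += (c / n) * log2(c * n / (left[a] * right[b]))
    return mi


if __name__ == "__main__":
    # A period-4 sequence is fully predictable at distance 4.
    print(mutual_information("ACGT" * 50, 4))
```

Evaluating the function over a range of `k` gives a correlation curve per sequence; curves of this kind are what a clustering step would compare.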

2.
3.
The subfamily Triatominae, vectors of Chagas disease, comprises 140 species characterized by a highly homogeneous chromosome number. We analyzed the chromosomal distribution and evolution of repeated sequences in Triatominae genomes by genomic in situ hybridization (GISH) using Triatoma delpontei and Triatoma infestans genomic DNAs as probes. Hybridizations were performed on their own chromosomes and on nine species included in six genera from the two main tribes: Triatomini and Rhodniini. Genomic probes clearly generate two different hybridization patterns, dispersed or accumulated in specific regions or chromosomes. The three probes used generate the same hybridization pattern in each species. However, these patterns are species-specific. In closely related species, the probes strongly hybridized in the autosomal heterochromatic regions, resembling C-banding and DAPI patterns. However, in more distant species these co-localizations are not observed. The heterochromatic Y chromosome consists of highly repeated sequences, which are conserved among 10 species of the Triatomini tribe, suggesting that this is an ancestral character for this group. However, the Y chromosome in the Rhodniini tribe is markedly different, supporting the early evolutionary dichotomy between the two tribes. In some species, sex chromosomes and autosomes shared repeated sequences, suggesting meiotic chromatin exchanges among these heterologous chromosomes. Our GISH analyses enabled us to acquire not only reliable information about the distribution of autosomal repeated sequences but also an insight into sex chromosome evolution in Triatominae. Furthermore, the differentiation obtained by GISH might be a valuable marker to establish phylogenetic relationships and to test the controversial origin of the Triatominae subfamily.

4.
We isolated a new family of satellite DNA sequences from Hae III- and Eco RI-digested genomic DNA of Blakiston's fish owl (Ketupa blakistoni). The repetitive sequences were organized in tandem arrays of the 174 bp element, and localized to the centromeric regions of all macrochromosomes, including the Z and W chromosomes, and microchromosomes. This hybridization pattern was consistent with the distribution of C-band-positive centromeric heterochromatin, and the satellite DNA sequences occupied 10% of the total genome as a major component of centromeric heterochromatin. The sequences were homogenized between macro- and microchromosomes in this species, and therefore intraspecific divergence of the nucleotide sequences was low. The 174 bp element cross-hybridized to the genomic DNA of six other Strigidae species, but not to that of the Tytonidae, suggesting that the satellite DNA sequences are conserved within the same family but fairly divergent between the different families in the Strigiformes. Secondly, the centromeric satellite DNAs were cloned from eight Strigidae species, and the nucleotide sequences of 41 monomer fragments were compared within and between species. Molecular phylogenetic relationships of the nucleotide sequences were highly correlated with both the taxonomy based on morphological traits and the phylogenetic tree constructed by DNA-DNA hybridization. These results suggest that the satellite DNA sequence has evolved by concerted evolution in the Strigidae and that it is a good taxonomic and phylogenetic marker to examine genetic diversity between Strigiformes species. An erratum to this article can be found at Communicated by Y. Hiraoka

5.
Comparing DNA or protein sequences plays an important role in the functional analysis of genomes. Despite the many methods available for sequence comparison, few retain the information content of sequences. We propose a new approach, the Yau-Hausdorff method, which considers all translations and rotations when seeking the best match of graphical curves of DNA or protein sequences. The complexity of this method is lower than that of any other two-dimensional minimum Hausdorff algorithm. The Yau-Hausdorff method can be used for measuring the similarity of DNA sequences based on two important tools: the Yau-Hausdorff distance and the graphical representation of DNA sequences. The graphical representations of DNA sequences conserve all sequence information, and the Yau-Hausdorff distance is mathematically proved to be a true metric. Therefore, the proposed distance can precisely measure the similarity of DNA sequences. The phylogenetic analyses of DNA sequences by the Yau-Hausdorff distance show the accuracy and stability of our approach in similarity comparison of DNA or protein sequences. This study demonstrates that the Yau-Hausdorff distance is a natural metric for DNA and protein sequences with a high level of stability. The approach can also be applied to similarity analysis of protein sequences by graphical representations, as well as to general two-dimensional shape matching.
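For orientation only, the sketch below computes the ordinary (discrete) Hausdorff distance between two finite point sets in the plane. It is not the Yau-Hausdorff method: the minimization over translations and rotations described in the abstract is omitted, and the function name `hausdorff` is an illustrative assumption.

```python
from math import dist  # Euclidean distance, available since Python 3.8


def hausdorff(p, q):
    """Discrete Hausdorff distance between two non-empty point sets."""

    def directed(a, b):
        # Largest distance from a point of a to its nearest point in b.
        return max(min(dist(x, y) for y in b) for x in a)

    return max(directed(p, q), directed(q, p))


if __name__ == "__main__":
    curve_a = [(0, 0), (1, 1), (2, 0)]
    curve_b = [(0, 0), (1, 2), (2, 0)]
    print(hausdorff(curve_a, curve_b))
```

In this simplified form the value depends on where the curves sit in the plane, which is the dependence that a minimum over translations and rotations is meant to remove.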

6.
MOTIVATION: Many proposed statistical measures can efficiently compare biological sequences to further infer their structures, functions and evolutionary information. They are related in spirit because all the ideas for sequence comparison try to use the information in the k-word distributions, a Markov model, or both. Motivated by the idea of adding k-word distributions directly to a Markov model, we investigated two novel statistical measures for sequence comparison, called wre.k.r and S2.k.r. RESULTS: The proposed measures were tested by similarity search, evaluation on functionally related regulatory sequences and phylogenetic analysis. This offers a systematic and quantitative experimental assessment of our measures. Moreover, we compared our results with those of alignment-based and alignment-free methods. We grouped our experiments into two sets. The first one, performed via ROC (receiver operating characteristic) analysis, aims at assessing the intrinsic ability of our statistical measures to search for similar sequences in a database and to discriminate functionally related regulatory sequences from unrelated sequences. The second one aims at assessing how well our statistical measures perform in phylogenetic analysis. The experimental assessment demonstrates that our similarity measures, which incorporate k-word distributions into a Markov model, are more efficient.
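The k-word distributions that these measures build on can be sketched as follows. This is a generic alignment-free baseline (Euclidean distance between k-word frequency vectors), assumed here for illustration; it is not the wre.k.r or S2.k.r measure, whose definitions are not given in the abstract, and the function names are hypothetical.

```python
from collections import Counter
from itertools import product
from math import sqrt


def kword_frequencies(seq, k, alphabet="ACGT"):
    """Relative frequency of every possible k-word over the alphabet,
    in a fixed lexicographic order."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    words = ["".join(w) for w in product(alphabet, repeat=k)]
    total = sum(counts[w] for w in words)
    return [counts[w] / total if total else 0.0 for w in words]


def kword_distance(a, b, k):
    """Euclidean distance between the k-word frequency vectors of a and b."""
    fa = kword_frequencies(a, k)
    fb = kword_frequencies(b, k)
    return sqrt(sum((x - y) ** 2 for x, y in zip(fa, fb)))


if __name__ == "__main__":
    print(kword_distance("ACGTACGTAC", "ACGTTTGTAC", 2))
```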

7.
The origin and genomic constitution of the tetraploid perennial species Dasypyrum hordeaceum (2n = 4x = 28) and its phylogenetic relationships with the annual diploid Dasypyrum villosum (2n = 2x = 14) have been investigated by comparing the two genomes using different methods. There is no apparent homology between the conventional or Giemsa C-banded karyotypes of the two Dasypyrum species, nor can the karyotype of D. hordeaceum be split up into two similar sets. Polymorphism within several chromosome pairs was observed in both karyotypes. Cytophotometric determinations of the Feulgen-DNA absorptions showed that the genome size of D. hordeaceum was twice as large as that of D. villosum. Both the cross D. villosum x D. hordeaceum (crossability rate 12.1%) and the reciprocal cross (crossability rate 50.7%) produced plump seeds. Only those from the former cross germinated, producing sterile plants with a phenotype that was intermediate between those of the parents. In these hybrids (2n = 21), an average of 13.77 chromosomes per cell paired at meiotic metaphase I. Trivalents were only rarely observed. Through dot-blot hybridizations, a highly repeated DNA sequence of D. villosum was found not to be represented in the genome of D. hordeaceum. By contrast, very similar restriction patterns were observed when a low-repeated DNA sequence or different single-copy sequences of D. villosum or two sequences in the plastidial DNA of rice were hybridized to Southern blots of the genomic DNAs of the two Dasypyrum species digested with different restriction endonucleases. By analyzing glutamic-oxaloacetic-transaminase, superoxide dismutase, alcohol dehydrogenase, and esterase isozyme systems, it was shown that both Dasypyrum species shared the same phenotypes, which differed from those found in hexaploid wheat. In situ hybridizations using DNA sequences encoding gliadins showed that these genes were located close to the centromere of three pairs of D. villosum chromosomes and that they had the same locations in six pairs of D. hordeaceum chromosomes. We conclude that the autoploid origin of D. hordeaceum from D. villosum, which cannot be defended on the basis of chromosomal traits, is suggested by the other findings obtained by comparing the two genomes. Key words: Dasypyrum hordeaceum, Dasypyrum villosum, phylogenetic relationships.

8.
Sequence divergence in the internal transcribed spacer region 1 (ITS-1) of the ribosomal DNA locus was assessed in subspecies of the coastal North American tiger beetle, Cicindela dorsalis. The spacer region was amplified using the polymerase chain reaction and cloned for sequencing. Of a total of 50 clones obtained from 12 specimens, 42 clones were different in at least one nucleotide position. In a parsimony analysis of these sequences, the main phylogenetic distinction was found to separate sequences from the Gulf of Mexico and the Atlantic Ocean. Within these two assemblages phylogenetic resolution was low, and the variation within individuals was almost as high as the variation within the entire lineage. The pattern of sequence variation suggests the existence of two forms of the ITS-1 that are maintained on different chromosomes. Polymorphisms of limited geographical distribution could be detected, and 41 additional clones were partly sequenced, to assess the geographic distribution of these polymorphisms in more detail. In a population aggregation analysis, the geographic pattern of ITS-1 distribution was basically congruent with that obtained in earlier studies from mitochondrial DNA in the same C. dorsalis populations.   相似文献   

9.
Eukaryote nuclear ribosomal DNA (rDNA) typically exhibits strong concerted evolution: a pattern in which several hundred rDNA sequences within any one species show little or no genetic diversity, whereas the sequences of different species diverge. We report a markedly different pattern in the genome of the grasshopper Podisma pedestris. Single individuals contain several highly divergent ribosomal DNA groups. Analysis of the magnitude of divergence indicates that these groups have coexisted in the Podisma lineage for at least 11 million years. There are two putatively functional groups, each estimated to be at least 4 million years old, and several pseudogene groups, many of which are transcribed. Southern hybridization and real-time PCR experiments show that only one of the putatively functional types occurs at high copy number. However, this group is scarcely amplified under standard PCR conditions, which means that phylogenetic inference on the basis of standard PCR would be severely distorted. The analysis suggests that concerted evolution has been remarkably ineffective in P. pedestris. We propose that this outcome may be related to the species' exceptionally large genome and the associated low rate of deletion per base pair, which may allow pseudogenes to persist.  相似文献   

10.
Jiming Jiang  Bikram S Gill 《Génome》2006,49(9):1057-1068
Fluorescence in situ hybridization (FISH), which allows direct mapping of DNA sequences on chromosomes, has become the most important technique in plant molecular cytogenetics research. Repetitive DNA sequence can generate unique FISH patterns on individual chromosomes for karyotyping and phylogenetic analysis. FISH on meiotic pachytene chromosomes coupled with digital imaging systems has become an efficient method to develop physical maps in plant species. FISH on extended DNA fibers provides a high-resolution mapping approach to analyze large DNA molecules and to characterize large genomic loci. FISH-based physical mapping provides a valuable complementary approach in genome sequencing and map-based cloning research. We expect that FISH will continue to play an important role in relating DNA sequence information to chromosome biology. FISH coupled with immunoassays will be increasingly used to study features of chromatin at the cytological level that control expression and regulation of genes.  相似文献   

11.
One of the fascinating properties of the DNA sequences of prokaryotic and eukaryotic chromosomes is that they possess long-range order. Computational methods like spectral analysis, mutual information and DNA random walks have been used to probe long-range order via long-range correlations. This work attempts to show the advantage of using the information-theoretic measure of mutual information for this purpose. A number Mu is found which indicates the existence of long-range order. Mu is the ratio of the value of the mutual information function between two nucleotides of a DNA sequence separated by a large distance of 100 kilobases to the value expected from a randomized sequence of the same DNA. It is found that in spite of the constant shuffling of nucleotides due to insertion, deletion, inversion and recombination that occur during evolution, the chromosomal structure of prokaryotes is not always mosaic. While all archaeal chromosomes show mosaic structure and lack long-range order, a sizable fraction of the bacterial chromosomes do possess long-range order. A statistical multivariate analysis has been done to find which of the physical variables, such as genome size or GC%, affect the organization of the chromosome or correlate with the long-range order. The existence of long-range order in bacterial chromosomes could be directly correlated with the degree of gene strand bias they show. Firmicutes, which have low GC content, also have pronounced strand bias and show long-range correlations. It is observed that the occurrence of long-range order in bacteria is independent of genome size, but depends on GC content and gene strand bias.

12.
James TY  Moncalvo JM  Li S  Vilgalys R 《Genetics》2001,157(1):149-161
The common split-gilled mushroom Schizophyllum commune is found throughout the world on woody substrates. This study addresses the dispersal and population structure of this fungal species by studying the phylogeny and evolutionary dynamics of ribosomal DNA (rDNA) spacer regions. Extensive sampling (n = 195) of sequences of the intergenic spacer region (IGS1) revealed a large number of unique haplotypes (n = 143). The phylogeny of these IGS1 sequences revealed strong geographic patterns and supported three evolutionarily distinct lineages within the global population. The same three geographic lineages were found in phylogenetic analysis of both other rDNA spacer regions (IGS2 and ITS). However, nested clade analysis of the IGS1 phylogeny suggested the population structure of S. commune has undergone recent changes, such as a long distance colonization of western North America from Europe as well as a recent range expansion in the Caribbean. Among all spacer regions, variation in length and nucleotide sequence was observed between but not within the tandem rDNA repeats (arrays). This pattern is consistent with strong within-array and weak among-array homogenizing forces. We present evidence for the suppression of recombination between rDNA arrays on homologous chromosomes that may account for this pattern of concerted evolution.  相似文献   

13.
It is shown by isopycnic density gradient centrifugation that the DNAs of the sibling species Drosophila hydei, Drosophila neohydei and Drosophila pseudoneohydei differ regarding the numbers and proportions of satellite DNA bands. An overwhelming proportion of all repetitive nucleotide sequences of the DNA is contained in these satellite fractions. The majority of the satellites are species specific despite the close phylogenetic and cytological relationship between the three species studied. — By in situ hybridization experiments it is demonstrated that the various satellite sequences occupy different positions within the chromosomes. All types of localization patterns, from a wide spread occurrence in all chromosomes to an apparent restriction to kinetochore regions of single chromosomes, have been observed. Main band DNA, on the other hand, in its hybridization behavior reflects the DNA distribution according to the banding pattern in giant chromosomes. Generally satellite sequences seem to be included in -heterochromatic chromosome regions but no relation to the heterochromatin of the Y-chromosome was found. — Renaturation studies support various evidence that satellite sequences occur in tandemly repetitious units. At least some of this repetitious material seems to be linked to non-satellite DNA sequences or to DNA of other satellites.  相似文献   

14.
Zuckerkandl and Pauling (1962, "Horizons in Biochemistry," pp. 189-225, Academic Press, New York) first noticed that the degree of sequence similarity between the proteins of different species could be used to estimate their phylogenetic relationship. Since then models have been developed to improve the accuracy of phylogenetic inferences based on amino acid or DNA sequences. Most of these models were designed to yield distance measures that are linear with time, on average. The reliability of phylogenetic reconstruction, however, depends on the variance of the distance measure in addition to its expectation. In this paper we show how the method of generalized least squares can be used to combine data types, each most informative at different points in time, into a single distance measure. This measure reconstructs phylogenies more accurately than existing non-likelihood distance measures. We illustrate the approach for a two-rate mutation model and demonstrate that its application provides more accurate phylogenetic reconstruction than do currently available analytical distance measures.  相似文献   

15.
A huge part of the genomes of most Triticeae species is formed by different families of repetitive DNA sequences. In this paper the phylogenetic distribution of two major classes of the repeats, retrotransposons and tandemly organized DNA sequences, are considered and compared with the evolution of gene-rich regions and generally accepted Triticeae phylogenetic relationships. In Hordeum, LTR-containing retrotransposons are dispersed along the chromosomes and are consistent with the existing picture of the phylogeny of Hordeum. Another retrotransposon class, LINEs, have evolved independently from LTR-retrotransposons. Different retrotransposon classes appear to have competed for genome space during the evolution of Hordeum. Another class of repeats, tandemly organized DNA sequences, tends to cluster at the functionally important regions of chromosomes, centromeres and telomeres. The distribution of a number of tandem DNA families in Triticeae is not congruent with generally accepted phylogenetic relationships. While natural selection is the dominant factor determining the structure of genic regions we suggest that the contribution of random events is important in the evolution of repetitive DNA sequences. The interplay of stochastic processes, molecular drive, and selection determines the structure of chromosomal regions, notably at centromeres and telomeres, stabilizing and differentiating species-specific karyotypes. Thus, the evolution of these regions may occur largely independently of the evolution of gene-rich regions.  相似文献   

16.
M. Feldman  B. Liu  G. Segal  S. Abbo  A. A. Levy    J. M. Vega 《Genetics》1997,147(3):1381-1387
To study genome evolution in allopolyploid plants, we analyzed polyploid wheats and their diploid progenitors for the occurrence of 16 low-copy chromosome- or genome-specific sequences isolated from hexaploid wheat. Based on their occurrence in the diploid species, we classified the sequences into two groups: group I, found in only one of the three diploid progenitors of hexaploid wheat, and group II, found in all three diploid progenitors. The absence of group II sequences from one genome of tetraploid wheat and from two genomes of hexaploid wheat indicates their specific elimination from these genomes at the polyploid level. Analysis of a newly synthesized amphiploid, having a genomic constitution analogous to that of hexaploid wheat, revealed a pattern of sequence elimination similar to the one found in hexaploid wheat. Apparently, speciation through allopolyploidy is accompanied by a rapid, nonrandom elimination of specific, low-copy, probably noncoding DNA sequences at the early stages of allopolyploidization, resulting in further divergence of homoeologous chromosomes (partially homologous chromosomes of different genomes carrying the same order of gene loci). We suggest that such genomic changes may provide the physical basis for the diploid-like meiotic behavior of polyploid wheat.  相似文献   

17.
The crucial role played by the analysis of microbial diversity in biotechnology-based innovations has increased interest in microbial taxonomy research. Phylogenetic sequence analyses have contributed significantly to the advances in this field, also in view of the large amount of sequence data collected in recent years. Phylogenetic analyses can be performed on the basis of protein-encoding nucleotide sequences or of the encoded amino acid sequences: these two approaches have different characteristics, although they start from two alternative representations of the same information. This complementarity could be exploited to achieve a multimodal phylogenetic scheme that integrates gene and protein information to produce a single final tree. This aspect has been poorly addressed in the literature. In this paper, we propose to integrate the two phylogenetic analyses using basic schemes derived from multimodality fusion theory (or multiclassifier systems theory), a well-founded and rigorous field whose power has already been demonstrated in other pattern recognition contexts. The proposed approach can be applied to distance matrix-based phylogenetic techniques (like neighbor joining), resulting in a smart and fast method. The proposed methodology has been tested in a real case involving sequences of some species of lactic acid bacteria. With this dataset, both nucleotide sequence- and amino acid sequence-based phylogenetic analyses present some drawbacks, which are overcome with the multimodal analysis.
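One way to picture the fusion of two distance matrices is the mean rule sketched below. The abstract does not say which fusion schemes the authors used, so the choice of averaging, the max-normalization step, and the function names `normalize` and `fuse_mean` are all assumptions made for illustration.

```python
def normalize(matrix):
    """Scale a distance matrix so that its largest entry is 1."""
    peak = max(max(row) for row in matrix)
    return [[v / peak if peak else 0.0 for v in row] for row in matrix]


def fuse_mean(dna_dist, protein_dist):
    """Entry-wise mean of two normalized distance matrices of equal shape."""
    a = normalize(dna_dist)
    b = normalize(protein_dist)
    return [[(x + y) / 2 for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


if __name__ == "__main__":
    # Hypothetical pairwise distances for three taxa.
    dna = [[0, 2, 4], [2, 0, 3], [4, 3, 0]]
    protein = [[0, 1, 5], [1, 0, 4], [5, 4, 0]]
    for row in fuse_mean(dna, protein):
        print(row)
```

The fused matrix can then be passed to any distance-based tree builder, such as neighbor joining, in place of either single-source matrix.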

18.
The focus of the research is on the analysis of genome sequences. Based on the inter-nucleotide distance sequence, we propose the conditional multinomial distribution profile for the complete genomic sequence. These profiles can be used to define a very simple, computationally efficient, alignment-free distance measure that reflects the evolutionary relationships between genomic sequences. We use this distance measure to classify chromosomes according to species of origin and to build the phylogenetic tree of 24 complete genome sequences of coronaviruses. Our results demonstrate that the new method is powerful and efficient.
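The inter-nucleotide distance sequence underlying this approach can be sketched as below, assuming the common definition in which each occurrence of a nucleotide is assigned the distance back to the previous occurrence of the same nucleotide. The conditional multinomial profile and the resulting distance measure are not specified in the abstract and are not reproduced; the function name is hypothetical.

```python
def inter_nucleotide_distances(seq):
    """Map each symbol to the list of gaps between its consecutive
    occurrences in seq."""
    last_seen = {}   # symbol -> index of its most recent occurrence
    distances = {}   # symbol -> gaps between consecutive occurrences
    for i, symbol in enumerate(seq):
        if symbol in last_seen:
            distances.setdefault(symbol, []).append(i - last_seen[symbol])
        last_seen[symbol] = i
    return distances


if __name__ == "__main__":
    print(inter_nucleotide_distances("ACAACGTA"))
```

Histograms of these gaps, taken per nucleotide, are the kind of raw material from which a distribution profile for a genome could be built.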

19.
Reptiles are a karyologically heterogeneous group, where some orders and suborders exhibit characteristics similar to those of anamniotes and others share similarities with homeotherms. The class also shows different evolutionary trends, for instance in genome and chromosome size and composition. The turtle DNA base composition is similar to that of mammals, whereas that of lizards and snakes is more similar to that of anamniotes. The major karyological differences between turtles and squamates are the size and composition of the genome and the rate at which chromosomes change. Turtles have larger and more variable genome sizes, and a greater amount of middle repetitive DNA that differs even among related species. In lizards and snakes, genome sizes are smaller, single-copy DNA is constant within each suborder, and differences in repetitive DNA involve fractions that become increasingly heterogeneous with widening phylogenetic distance. With regard to variation in karyotype morphology, turtles and crocodiles show low variability in chromosome number, morphology, and G-banding pattern. Greater variability is found among squamates, which have a degree of karyotypic change similar to that of some mammals, such as carnivores and bats, and in which there are also differences among congeneric species. An interesting relationship has been highlighted in the entire class Reptilia between rates of change in chromosomes, number of living species, and rate of extinction. However, different situations obtain in turtles and crocodiles on the one hand, and squamates on the other. In the former, the rate of change in chromosomes is lower and the various evolutionary steps do not seem to have entailed marked chromosomal variation, whereas squamates have a higher rate of change in chromosomes clearly related to the number of living species, and chromosomal variation seems to have played an important role in the evolution of several taxa. The different evolutionary trends in chromosomes observed between turtles and crocodiles on the one hand and squamates on the other might depend on their different patterns of G-banding.

20.
Summary: The pattern of banding induced by five restriction enzymes in the chromosome complement of chimpanzee, gorilla, and orangutan is described and compared with that of humans. The G banding pattern induced by Hae III was the only feature common to the four species. Although hominid species show almost complete chromosomal homology, the restriction enzyme C banding pattern differed among the species studied. Hinf I did not induce banding in chimpanzee chromosomes, and Rsa I did not elicit banding in chimpanzee and orangutan chromosomes. Equivalent amounts of similar satellite DNA fractions located in homologous chromosomes from different species or in nonhomologous chromosomes from the same species showed different banding patterns with identical restriction enzymes. The great variability in frequency of restriction sites observed between homologous chromosome regions may have resulted from the divergence of primordial sequences changing the frequency of restriction sites for each species and for each chromosomal pair. A total of 30 patterns of banding were found informative for analysis of the hominid genealogical tree. Using the principle of maximum parsimony, our data support a branching order in which the chimpanzee is more closely related to the gorilla than to the human.
