首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
In order to study the relationships among mammalian alpha-globin genes, we have determined the sequence of the 3' flanking region of the human alpha 1 globin gene and have made pairwise comparisons between sequenced alpha-globin genes. The flanking regions were examined in detail because sequence matches in these regions could be interpreted with the least complication from the gene duplications and conversions that have occurred frequently in mammalian alpha-like globin gene clusters. We found good matches between the flanking regions of human alpha 1 and rabbit alpha 1, human psi alpha 1 and goat I alpha, human alpha 2 and goat II alpha, and horse alpha 1 and goat II alpha. These matches were used to align the alpha-globin genes in gene clusters from different mammals. This alignment shows that genes at equivalent positions in the gene clusters of different mammals can be functional or nonfunctional, depending on whether they corrected against a functional alpha-globin gene in recent evolutionary history. The number of alpha-globin genes (including pseudogenes) appears to differ among species, although highly divergent pseudogenes may not have been detected in all species examined. Although matching sequences could be found in interspecies comparisons of the flanking regions of alpha- globin genes, these matches are not as extensive as those found in the flanking regions of mammalian beta-like globin genes. This observation suggests that the noncoding sequences in the mammalian alpha-globin gene clusters are evolving at a faster rate than those in the beta-like globin gene clusters. The proposed faster rate of evolution fits with the poor conservation of the genetic linkage map around alpha-globin gene clusters when compared to that of the beta-like globin gene clusters. Analysis of the 3' flanking regions of alpha-globin genes has revealed a conserved sequence approximately 100-150 bp 3' to the polyadenylation site; this sequence may be involved in the expression or regulation of alpha-globin genes.   相似文献   

2.
In some species, histone gene clusters consist of tandem arrays of each type of histone gene, whereas in other species the genes may be clustered but not arranged in tandem. In certain species, however, histone genes are found scattered across several different chromosomes. This study examines the evolution of histone 3 (H3) genes that are not arranged in large clusters of tandem repeats. Although H3 amino acid sequences are highly conserved both within and between species, we found that the nucleotide sequence divergence at synonymous sites is high, indicating that purifying selection is the major force for maintaining H3 amino acid sequence homogeneity over long-term evolution. In cases where synonymous-site divergence was low, recent gene duplication appeared to be a better explanation than gene conversion. These results, and other observations on gene inactivation, organization, and phylogeny, indicated that these H3 genes evolve according to a birth-and-death process under strong purifying selection. Thus, we found little evidence to support previous claims that all H3 proteins, regardless of their genome organization, undergo concerted evolution. Further analyses of the structure of H3 proteins revealed that the histones of higher eukaryotes might have evolved from a replication-independent-like H3 gene.  相似文献   

3.
4.
Olfactory receptors are G protein-coupled, seven-transmembrane-domain proteins that are responsible for binding odorants in the nasal epithelium. They are encoded by a large gene family, members of which are organized in several clusters scattered throughout the genomes of mammalian species. Here we describe the mapping of mouse sequences corresponding to four conserved olfactory receptor genes, each representing separate, recently identified canine gene subfamilies. Three of the four canine genes detected related gene clusters in regions of mouse Chromosomes (Chrs) 2, 9, and 10, near previously mapped mouse olfactory genes, while one detected a formerly unidentified gene cluster located on mouse Chr 6. In addition, we have localized two human gene clusters with homology to the canine gene, CfOLF4, within the established physical map of Chr 19p. Combined with recently published studies, these data link the four conserved olfactory gene subfamilies to homologous regions of the human, dog, and mouse genomes. Received: 10 September 1997 / Accepted: 29 December 1997  相似文献   

5.
In comparative genomics, differences or similarities of gene orders are determined to predict functional relations of genes or phylogenetic relations of genomes. For this purpose, various combinatorial models can be used to specify gene clusters--groups of genes that are co-located in a set of genomes. Several approaches have been proposed to reconstruct putative ancestral gene clusters based on the gene order of contemporary species. One prevalent and natural reconstruction criterion is consistency: For a set of reconstructed gene clusters, there should exist a gene order that comprises all given clusters. For permutation-based gene cluster models, efficient methods exist to verify this condition. In this article, we discuss the consistency problem for different gene cluster models on sequences with restricted gene multiplicities. Our results range from linear-time algorithms for the simple model of adjacencies to NP-completeness proofs for more complex models like common intervals.  相似文献   

6.
7.
Summary The nucleic acid sequences coding for 23 H3 histone genes from a variety of species have been analyzed using a computer assisted alignment and analysis program. Although these histones are highly conserved within and between highly divergent species, they represent various classes of histones whose patterns of expression are distinctively regulated. Surprisingly, in dendrograms derived from these comparisons, H3 sequences cluster according to their modes of regulation rather than phylogenetically. These clusters are generated from highly distinctive patterns of codon usage within the functional gene classes. We suggest that one factor involved in specifying the differing codon usage patterns between functional classes is a difference in requirements for rapid translation of mRNA. In addition, the data presented here, together with structural and sequence information, suggest a heterodox evolutionary model in which genes related to the intron-bearing, basally expressed H3.3 vertebrate genes are the ancestors of the intronless H3. 1 class of genes of higher eukaryotes. The H3. 1 class must have arisen, therefore, following duplication of a primitive H3.3 gene, but prior to the plant-animal divergence. Implications of the data presented are discussed with regard to functional and evolutionary relationships.  相似文献   

8.
9.
10.
Structure and organization of the chicken H2B histone gene family.   总被引:7,自引:5,他引:2  
The results of Southern blotting experiments confirm that the chicken H2B histone gene family contains eight highly homologous members. One or two more sequences which are considerably divergent from the others appear to exist in the chicken genome. Seven of the eight H2B genes have been cloned and sequenced. All seven genes fall in two histone gene clusters, but no common arrangement exists for the clusters themselves. Three different H2B protein variants are encoded by these seven genes. The nucleotide sequence homology among the genes within their coding sequences appears to exceed that required for the corresponding protein sequences, suggesting that histone H2B mRNA sequence and structure are both selected during evolution. An analysis of the 5' flanking sequence data reveals that these genes possess CCAAT and TATA boxes, elements commonly associated with genes transcribed by RNA polymerase II. In addition, these genes all share an H2B-specific element of the form: ATTTGCATA. The 3' sequences of these genes contain the hyphenated symmetrical dyad homology and downstream purine-rich sequence shared by histone genes in general.  相似文献   

11.
A cluster of four trypsin genes has previously been localized to cytological position 47D-F of the Drosophila melanogaster genome. One of these genes had been sequenced, and the presence of the other three genes was identified by cross-hybridization. Here, we present the DNA sequence of the entire genomic region encoding these four trypsin genes. In addition to the four previously inferred genes, we have identified a fifth trypsin-coding sequence located within this gene cluster. This new gene shows a high degree of sequence divergence (more than 30%) from the other four genes, although it retains all of the functional motifs that are characteristic of trypsin-coding sequences. In order to trace the molecular evolution of this gene cluster, we isolated and sequenced the homologous 7-kb region from the closely related species Drosophila erecta. A comparison of the DNA sequences between the two species provides strong evidence for the concerted evolution of some members of this gene family. Two genes within the cluster are evolving in concert, while a third gene appears to be evolving independently. The remaining two genes show an intermediate pattern of evolution. We propose a simple model, involving chromosome looping and gene conversion, to explain the relatively complex patterns of molecular evolution within this gene cluster.  相似文献   

12.
AIMS: To compare the biosynthetic gene cluster sequences of the main aflatoxin (AF)-producing Aspergillus species. METHODS AND RESULTS: Sequencing was on fosmid clones selected by homology to Aspergillus parasiticus sequence. Alignments revealed that gene order is conserved among AF gene clusters of Aspergillus nomius, A. parasiticus, two sclerotial morphotypes of Aspergillus flavus, and an unnamed Aspergillus sp. Phylogenetic relationships were established using the maximum likelihood method implemented in PAUP. Based on the Eurotiomycete/Sordariomycete divergence time, the A. flavus-type cluster has been maintained for at least 25 million years. Such conservation of the genes and gene order reflects strong selective constraints on rearrangement. Phylogenetic comparison of individual genes in the cluster indicated that ver-1, which has homology to a melanin biosynthesis gene, experienced selective forces distinct from the other pathway genes. Sequences upstream of the polyketide synthase-encoding gene vary among the species, but a four-gene sugar utilization cluster at the distal end is conserved, indicating a functional relationship between the two adjacent clusters. CONCLUSIONS: The high conservation of cluster components needed for AF production suggests there is an adaptive value for AFs in character-shaping niches important to those taxa. SIGNIFICANCE AND IMPACT OF THE STUDY: This is the first comparison of the complete nucleotide sequences of gene clusters harbouring the AF biosynthesis genes of the main AF-producing species. Such a comparison will aid in understanding how AF biosynthesis is regulated in experimental and natural environments.  相似文献   

13.
Kiyasu T  Nagahashi Y  Hoshino T 《Gene》2001,265(1-2):103-113
The biotin biosynthesis genes of Kurthia sp., which is an aerobic gram-positive bacterium, were cloned from Kurthia sp. 538-KA26 and characterized. Eleven biotin biosynthetic genes have been identified in Kurthia sp. Kurthia sp. has two genes coding for KAPA synthase, bioF and bioFII, and also has two genes coding for BioH protein, bioH and bioHII. In addition, three genes, orf1, orf2, and orf3, whose functions are unknown, were found in the biotin gene clusters of Kurthia sp. The bioA, bioD, and orf1 genes are arranged in a gene cluster in the order orf1bioDA, and the bioB, bioF, and orf2 genes are arranged in a gene cluster in the order orf2bioFB. These gene clusters proceed to both directions; the face to face promoters and two 40-bp of palindrome sequences exist upstream of the orf1 and orf2 genes. The bioC, bioFII, and bioHII genes are arranged in a gene cluster in the order bioFIIHIIC; a 40-bp of palindrome sequence exists upstream of the bioFII gene. The bioH and orf3 genes are arranged in a gene cluster in the order bioHorf3; a palindrome sequence was not found upstream of the bioH gene. These palindrome sequences are extremely similar to each other, suggesting that the orf1bioDA, orf2bioFB, and bioFIIHIIC gene clusters are regulated by biotin. Kurthia sp. does not have the bioW gene coding pimeloyl-CoA synthase, suggesting that pimeloyl-CoA may be produced by a different pathway than that of gram-positive bacterium B. subtilis or B. sphaericus, further suggesting a modified fatty acid synthesis pathway via acetyl-CoA instead as E. coli has.  相似文献   

14.

Background

The intra- and inter-species genetic diversity of bacteria and the absence of ‘reference’, or the most representative, sequences of individual species present a significant challenge for sequence-based identification. The aims of this study were to determine the utility, and compare the performance of several clustering and classification algorithms to identify the species of 364 sequences of 16S rRNA gene with a defined species in GenBank, and 110 sequences of 16S rRNA gene with no defined species, all within the genus Nocardia.

Methods

A total of 364 16S rRNA gene sequences of Nocardia species were studied. In addition, 110 16S rRNA gene sequences assigned only to the Nocardia genus level at the time of submission to GenBank were used for machine learning classification experiments. Different clustering algorithms were compared with a novel algorithm or the linear mapping (LM) of the distance matrix. Principal Components Analysis was used for the dimensionality reduction and visualization.

Results

The LM algorithm achieved the highest performance and classified the set of 364 16S rRNA sequences into 80 clusters, the majority of which (83.52%) corresponded with the original species. The most representative 16S rRNA sequences for individual Nocardia species have been identified as ‘centroids’ in respective clusters from which the distances to all other sequences were minimized; 110 16S rRNA gene sequences with identifications recorded only at the genus level were classified using machine learning methods. Simple kNN machine learning demonstrated the highest performance and classified Nocardia species sequences with an accuracy of 92.7% and a mean frequency of 0.578.

Conclusion

The identification of centroids of 16S rRNA gene sequence clusters using novel distance matrix clustering enables the identification of the most representative sequences for each individual species of Nocardia and allows the quantitation of inter- and intra-species variability.  相似文献   

15.
16.
The genes for 22 tRNA species from Acholeplasma laidawii, belonging to the class Mollicutes (Mycoplasmas), have been cloned and sequenced. Sixteen genes are organized in 3 clusters consisting of eleven, three and two tRNA genes, respectively, and the other 6 genes exist as a single gene. The arrangement of tRNA genes in the 11-gene, the 3-gene and the 2-gene clusters reveals extensive similarity to several parts of the 21-tRNA or 16-tRNA gene cluster in Bacillus subtilis. The 11-gene cluster is also similar to the tRNA gene clusters found in other mycoplasma species, the 9-tRNA gene cluster in M.capricolum and in M.mycoides, and the 10-tRNA gene cluster in Spiroplasma meliferm. The results suggest that the tRNA genes in mycoplasmas have evolved from large tRNA gene clusters in the ancestral Gram-positive bacterial genome common to mycoplasmas and B.subtilis. The anticodon sequences including base modifications of 15 tRNA species from A.laidlawii were determined. The anticodon composition and codon-recognition patterns of A.laidlawii resemble those of Bacillus subtilis rather than those of other mycoplasma species.  相似文献   

17.
Plastomes of the peridinin-containing dinoflagellates are composed of a limited number of genes, which are carried individually on small circular molecules, termed 'minicircles'. Although the prevalent plastid chromosome of most algae and plants has only a single copy of each gene, our previous study showed that low copy numbers of multiple variants of the gene psbA co-exist with the 'ordinary' gene encoding the D1 protein in minicircles of Alexandrium tamarense. Although none of the psbA variants encoded the entire protein, they persisted in culture. In this study, we compared the distribution and structure of psbA and psbD variants in two species of Alexandrium to characterize DNA rearrangement within these genes. In addition to four previously reported psbA variants, three psbD variants were found in A. tamarense minicircles. The ordinary psbA and psbD genes also co-existed with variants in another species, A. catenella. The sequences of the ordinary genes were virtually identical in the two species. All the variants comprised insertion or deletion mutations, with no base substitutions being identified. Duplicated parts of the coding sequences were contained in most of the insertions. Short direct repeats (4-14?bp) and/or adenine?+?thymine-rich motifs were present in all mutation regions, although the position and/or the sequence of each DNA rearrangement was unique to each variant. The results indicated that replication-based repeat-mediated recombination was responsible for generation of the variants.  相似文献   

18.
The olfactory receptor (OR) subgenome harbors the largest known gene family in mammals, disposed in clusters on numerous chromosomes. One of the best characterized OR clusters, located at human chromosome 17p13.3, has previously been studied by us in human and in other primates, revealing a conserved set of 17 OR genes. Here, we report the identification of a syntenic OR cluster in the mouse and the partial DNA sequence of many of its OR genes. A probe for the mouse M5 gene, orthologous to one of the OR genes in the human cluster (OR17-25), was used to isolate six PAC clones, all mapping by in situ hybridization to mouse chromosome 11B3-11B5, a region of shared synteny with human chromosome 17p13.3. Thirteen mouse OR sequences amplified and sequenced from these PACs allowed us to construct a putative physical map of the OR gene cluster at the mouse Olfr1 locus. Several points of evidence, including a strong similarity in subfamily composition and at least four cases of gene orthology, suggest that the mouse Olfr1 and the human 17p13.3 clusters are orthologous. A detailed comparison of the OR sequences within the two clusters helps trace their independent evolutionary history in the two species. Two types of evolutionary scenarios are discerned: cases of "true orthologous genes" in which high sequence similarity suggests a shared conserved function, as opposed to instances in which orthologous genes may have undergone independent diversification in the realm of "free reign" repertoire expansion.  相似文献   

19.
We screened plant genome sequences, primarily from rice and Arabidopsis thaliana, for CpG islands, and identified DNA segments rich in CpG dinucleotides within these sequences. These CpG-rich clusters appeared in the analysed sequences as discrete peaks and occurred at the frequencies of one per 4.7 kb in rice and one per 4.0 kb in A. thaliana. In rice and A. thaliana, most of the CpG-rich clusters were associated with genes, which suggests that these clusters are useful landmarks in genome sequences for identifying genes in plants with small genomes. In contrast, in plants with larger genomes, only a few of the clusters were associated with genes. These plant CpG-rich clusters satisfied the criteria used for identifying human CpG islands, which suggests that these CpG clusters may be regarded as plant CpG islands. The position of each island relative to the 5'-end of its associated gene varied considerably. Genes in the analysed sequences were grouped into five classes according to the position of the CpG islands within their associated genes. A large proportion of the genes belonged to one of two classes, in which a CpG island occurred near the 5'-end of the gene or covered the whole gene region. The position of a plant CpG island within its associated gene appeared to be related to the extent of tissue-specific expression of the gene; the CpG islands of most of the widely expressed rice genes occurred near the 5'-end of the genes.  相似文献   

20.
Multiple copies of a given ribosomal RNA gene family undergo concerted evolution such that sequences of all gene copies are virtually identical within a species although they diverge normally between species. In eukaryotes, gene conversion and unequal crossing over are the proposed mechanisms for concerted evolution of tandemly repeated sequences, whereas dispersed genes are homogenized by gene conversion. However, the homogenization mechanisms for multiple-copy, normally dispersed, prokaryotic rRNA genes are not well understood. Here we compared the sequences of multiple paralogous rRNA genes within a genome in 12 prokaryotic organisms that have multiple copies of the rRNA genes. Within a genome, putative sequence conversion tracts were found throughout the entire length of each individual rRNA genes and their immediate flanks. Individual conversion events convert only a short sequence tract, and the conversion partners can be any paralogous genes within the genome. Interestingly, the genic sequences undergo much slower divergence than their flanking sequences. Moreover, genomic context and operon organization do not affect rRNA gene homogenization. Thus, gene conversion underlies concerted evolution of bacterial rRNA genes, which normally occurs within genic sequences, and homogenization of flanking regions may result from co-conversion with the genic sequence. Received: 31 March 2000 / Accepted: 15 June 2000  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号