首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 515 毫秒
1.
2.
With ∼1000 genes, the odorant receptor (OR) gene repertoire is the largest gene family in the mouse genome. Here we have established a 129/Sv BAC contig for mouse OR gene cluster 7 (Olfr7) on Chromosome (Chr) 9. The assembled ∼2-Mb contig consists of 75 BACs and may contain as many as 100 OR genes, or ∼10% of the mouse repertoire. Facilitated by the lack of introns in the coding region, we have determined the nucleotide sequence of 37 full-length, 2 partial, and 3 pseudo coding regions. These 42 OR genes and 3 additional OR genes previously mapped to the mouse Olfr7 cluster can be organized into 13 classes based on OR probe cross-hybridizations with 129/Sv mouse genomic DNA. OR genes belonging to the same class tend to be located next to each other within the cluster. Comparison of published full-length mouse and rat OR coding sequences with those identified here shows that the Olfr7 OR genes are highly related to each other, clustering on two major branches of an unrooted phylogenetic tree. Eight ORs contain an unusual NXC sequon at the amino-terminal extracellular domain that may represent a novel N-linked glycosylation site. The BAC contig presented here provides the substrate for sequencing of the cluster. Received: 27 June 2000 / Accepted: 17 August 2000  相似文献   

3.
Zhang X  Zhang X  Firestein S 《Genomics》2007,89(4):441-450
We applied a comprehensive data-mining strategy to examine the repertoires of rat and mouse odorant receptors (ORs) and type 1 pheromone receptors (V1Rs) using the mm5 (mouse) and rn3 (rat) genomes. We identified 1576 rat OR genes, including 292 pseudogenes. The rat V1R repertoire is composed of 115 intact genes and 72 pseudogenes. The mouse OR and V1R databases were updated using the new assembly mm5, from which 1375 mouse ORs and 308 V1Rs were identified, with more than 100 putative pseudogenes from mm2 now identified as intact because of the higher sequence quality. With these new data we have conducted a series of genomic analyses of the OR and V1R genes from mouse and rat. Orthologous OR clusters were identified in mouse and rat and comparison analysis was performed at three incremental levels: families, coding sequences, and motifs. At the family level, we found that V1R genes have more species-specific families than OR genes. About 20% of intact V1R genes have no orthologous counterpart in the same family, whereas less than 1% of intact ORs are similarly isolated. At the coding sequence level, OR genes are more conserved between mouse and rat than V1R genes. OR genes share greater similarity with their orthologous counterparts than with their closest neighbor, whereas V1R genes show the opposite tendency. Motifs were identified to obtain biological insights. Motifs specific for species or families were found in OR and V1R genes, which may result in the differential pheromone-dependent behaviors and perception of odors between mouse and rat.  相似文献   

4.
Hoppe R  Breer H  Strotmann J 《Genomics》2003,82(3):355-364
We report a comprehensive comparative analysis of human and mouse olfactory receptor (OR) genes encoding OR37 subtypes to determine the repertoire, chromosomal organization, and relatedness of these genes. Two OR37 clusters were found in both mouse (chromosome 4) and human (chromosome 9); with five genes in cluster I and three (mouse) and seven genes (human) in cluster II. The pronounced diversity of noncoding sequence regions in both genomic loci indicates a long-term coexistence of the two clusters and the genes within the clusters. In contrast, the coding regions, particularly of genes in cluster I, showed remarkably high sequence identity, a feature quite unique for OR genes. The conservation of only the coding sequences indicates that OR37 may be under negative selection pressure and suggests that the OR37 receptor family may be tuned to recognize distinct sets of signaling molecules. A comparison of mouse and human OR37 gene clusters revealed that genes in cluster I are highly related within each species whereas genes in cluster II are highly related across species. These data reflect a unique and complex evolutionary history of the OR37 family.  相似文献   

5.
The aquatic larvae of the genus Chironomus (Diptera, Insecta) contain at least 12 different hemoglobin (Hb) variants in their hemolymph. In the present study we have analysed the structure and part of the nucleotide sequence of a Hb gene cluster cloned from the genomic DNA of Chironomus thummi piger. The cluster contains probably 6 different genes, separated by intergenic regions of various lengths. The nucleotide sequence of three putative Hb genes including the intergenic regions is presented. The inferred amino-acid sequences show clearly that two of these putative genes code for subvariants of the Hb variant VIIB. The third gene codes for a so far unknown Hb protein. As known already for other chironomid Hb genes, there are no intron sequences present in the coding regions.  相似文献   

6.
7.
Five closely related immunoglobulin VH genes (subgroup II) were compared by sequencing of several kb of DNA. In three of the genes homology greater than 75% was found along an area of 4 kb that includes the coding region. The homology in flanking regions is only slightly lower than that in the coding sequences. Two other genes, which are located on the same EcoRI fragment, show high homology to the first three genes in the coding and immediately flanking regions. In more distant flanking regions no homology is found with the first three genes. This indicates that their evolutionary history differs from that of the other three genes. A region of simple DNA sequence composed of repetitive TCC and TCA elements was found at a distance of approximately 380 bp upstream from the initiator ATG of these VH genes. This region is the site where the two sets of genes abruptly start to diverge. The structure of the simple DNA sequence in the various VH genes suggests that it may be involved in gene interaction. We propose that both simple DNA sequences and homology in flanking regions serve a function in the correction of VH genes, which seem to be rather free to diverge and drift into pseudogenes. A correction mechanism may help this gene family to maintain its two major features, multiplicity and diversity.  相似文献   

8.
The DNA immediately flanking the 164-base-pair U1 RNA coding region is highly conserved among the approximately 30 human U1 genes. The U1 multigene family also contains many U1 pseudogenes (designated class I) with striking although imperfect flanking homology to the true U1 genes. Using cosmid vectors, we now have cloned, characterized, and partially sequenced three 35-kilobase (kb) regions of the human genome spanning U1 homologies. Two clones contain one true U1 gene each, and the third bears two class I pseudogenes 9 kb apart in the opposite orientation. We show by genomic blotting and by direct DNA sequence determination that the conserved sequences surrounding U1 genes are much more extensive than previously estimated: nearly perfect sequence homology between many true U1 genes extends for at least 24 kb upstream and at least 20 kb downstream from the U1 coding region. In addition, the sequences of the two new pseudogenes provide evidence that class I U1 pseudogenes are more closely related to each other than to true genes. Finally, it is demonstrated elsewhere (Lindgren et al., Mol. Cell. Biol. 5:2190-2196, 1985) that both true U1 genes and class I U1 pseudogenes map to chromosome 1, but in separate clusters located far apart on opposite sides of the centromere. Taken together, these results suggest a model for the evolution of the U1 multigene family. We speculate that the contemporary family of true U1 genes was derived from a more ancient family of U1 genes (now class I U1 pseudogenes) by gene amplification and transposition. Gene amplification provides the simplest explanation for the clustering of both U1 genes and class I pseudogenes and for the conservation of at least 44 kb of DNA flanking the U1 coding region in a large fraction of the 30 true U1 genes.  相似文献   

9.
We have observed three calmodulin mRNA species in rat tissues. In order to know from how many expressed genes they are derived, we have investigated the genomic organization of calmodulin genes in the rat genome. From a rat brain cDNA library, we obtained two kinds of cDNAs (pRCM1 and pRCM3) encoding authentic calmodulin. DNA sequence analysis of these cDNA clones revealed substitutions of nucleotides at 73 positions of 450 nucleotides in the coding region, although the amino acid sequences of these calmodulins are exactly the same. DNA sequences in the 5' and 3' noncoding regions are quite different between these two cDNAs. From these results, we conclude that they are derived from two distinct bona fide calmodulin genes, CaMI (pRCM1) and CaMII (pRCM3). Total genomic Southern hybridization suggested four distinct calmodulin-related genes in the rat genome. By cloning and sequencing the calmodulin-related genes from rat genomic libraries, we demonstrated that the other two genes are processed pseudogenes generated from the CaMI (lambda SC9) and CaMII (lambda SC8) genes, respectively, through an mRNA-mediated process of insertions. Northern blotting showed that the CaMI gene is transcribed in liver, muscle, and brain in similar amounts, whereas the CaMII gene is transcribed mainly in brain. S1 nuclease mapping indicated that the CaMI gene produced two mRNA species (1.7 and 4 kilobases), whereas the CaMII gene expressed a single mRNA species (1.4 kilobases).  相似文献   

10.
The vertebrate olfactory receptor (OR) subgenome harbors the largest known gene family, which has been expanded by the need to provide recognition capacity for millions of potential odorants. We implemented an automated procedure to identify all OR coding regions from published sequences. This led us to the identification of 831 OR coding regions (including pseudogenes) from 24 vertebrate species. The resulting dataset was subjected to neighbor-joining phylogenetic analysis and classified into 32 distinct families, 14 of which include only genes from tetrapodan species (Class II ORs). We also report here the first identification of OR sequences from a marsupial (koala) and a monotreme (platypus). Analysis of these OR sequences suggests that the ancestral mammal had a small OR repertoire, which expanded independently in all three mammalian subclasses. Classification of ``fish-like' (Class I) ORs indicates that some of these ancient ORs were maintained and even expanded in mammals. A nomenclature system for the OR gene superfamily is proposed, based on a divergence evolutionary model. The nomenclature consists of the root symbol `OR', followed by a family numeral, subfamily letter(s), and a numeral representing the individual gene within the subfamily. For example, OR3A1 is an OR gene of family 3, subfamily A, and OR7E12P is an OR pseudogene of family 7, subfamily E. The symbol is to be preceded by a species indicator. We have assigned the proposed nomenclature symbols for all 330 human OR genes in the database. A WWW tool for automated name assignment is provided. Received: / Accepted:  相似文献   

11.
12.
R R Robinson  N Davidson 《Cell》1981,23(1):251-259
A recombinant DNA phage containing a cluster of Drosophila melanogaster tRNA genes has been isolated and analyzed. The insert of this phage has been mapped by in situ hybridization to chromosomal region 50AB, a known tRNA site. Nucleotide sequencing of the entire Drosophila tRNA coding region reveals seven tRNA genes spanning 2.5 kb of chromosomal DNA. This cluster is separated from other tRNA regions on the chromosome by at least 2.7 kb on one side, and 9.6 kb on the other. Two tRNA genes are nearly identical and contain intervening sequences of length 38 and 45 bases, respectively, in the anticodon loop. These two genes are assigned to be tRNALeu genes because of significant sequence homology with yeast tRNA3Leu, and secondary structure homology with yeast tRNA3Leu intervening sequence. In addition, an 8 base sequence (AAAAUCUU) is conserved in the same location in the intervening sequences of Drosophila tRNALeu genes and a yeast tRNA3Leu gene. Similar sequenes occur in all other tRNAs containing intervening sequences. The remaining five genes are identical tRNAIle genes, which are also identical to a tRNAIle gene from chromosomal region 42A. The 5' flanking regions are only weakly homologous, but each set of isoacceptors contains short regions of strong homology approximately 20 nucleotides preceding the tRNA coding sequences: GCNTTTTG preceding tRNAIle genes; and GANTTTGG preceding tRNALeu genes. The genes are irregularly distributed on both DNA strands; spacing regions are divergent in sequence and length.  相似文献   

13.
Composite human VK genes and a model of their evolution.   总被引:17,自引:9,他引:8       下载免费PDF全文
A phage library and two cosmid libraries were screened for human VK genes. Two recombinant phage and four cosmid clones were analysed in detail by restriction mapping and sequencing. Each one contained a single VKI sequence. Two of these six sequences are potentially functional VK genes and four are pseudogenes. Two pseudogenes derived from different genomic DNAs are highly homologous and are therefore either allelic variants or the products of a recent duplication event. Comparisons of our sequences with all fully determined human VKI amino acid and DNA sequences reveal identical segments which at first sight appear like minigenes. But these segments do not coincide with the subregions and some of the segments include both, framework and complementarity determining regions (FR, CDR, ref. 2). The findings may be explained by an evolutionary model generating composite genes by gene conversion and selection.  相似文献   

14.
15.
16.
17.
Kamalika Sen 《FEBS letters》2010,584(18):4015-4018
Pseudogenes, regarded as ‘genomic fossils’, are DNA sequences resembling functional genes in perspective of sequence homology but completely non-functional. In this study, we explored the unique characteristic features of human genes, configuring classical duplicated pseudogenes. We found that progenitors of duplicated pseudogenes are characterized by a high expressivity, and ability to encode hub-proteins in association with a high evolutionary rate. Such unusual features are endorsed by longer protein length, elevated CpG content, and a high recombination rate. The non-functionalization of their duplicated copies can be attributed to the overabundance of gene paralog number in concert with functional redundancy.  相似文献   

18.
Olfactory receptors (ORs) constitute the largest multigene family in multicellular organisms. Their evolutionary proliferation has been driven by the need to provide recognition capacity for millions of potential odorants with arbitrary chemical configurations. Human genome sequencing has provided a highly informative picture of the "olfactory subgenome", the repertoire of OR genes. We describe here an analysis of 224 human OR genes, a much larger number than hitherto systematically analyzed. These are derived by literature survey, data mining at 14 genomic clusters, and by an OR-targeted experimental sequencing strategy. The presented set contains at least 53% pseudogenes and is minimally divided into 11 gene families. One of these (no. 7) has undergone a particularly extensive expansion in primates. The analysis of this collection leads to insight into the origin of OR genes, suggesting a graded expansion through mammalian evolution. It also allows us to delineate a structural map of the respective proteins. A sequence database and analysis package is provided (http://bioinformatics.weizmann.ac.il/HORDE), which will be useful for analyzing human OR sequences genome-wide.  相似文献   

19.
Nucleotide sequence analysis of the delta beta-globin gene region in humans   总被引:31,自引:0,他引:31  
The continuous DNA sequence of a 16.5-kilobase pair region encompassing the linked delta beta-globin gene cluster in humans is presented with a detailed restriction endonuclease map. There are 38 differences (0.5%) in comparison with published sequence data, corrected for errors in sequencing, resulting in polymorphic rates of 0.2% in exons and 0.76% in 5'-gene flanking regions. Fifteen changes result in the generation or elimination of restriction sites which may be useful in linkage disequilibrium studies. Two pairs of inverted Alu repeats, a pyrimidine-rich region 5' to delta, and (TG)n, (Pu/Py)n, and (ATTTT)n tracts 5' to beta are described. Dinucleotide frequencies and deviation from expected values approximated those found in total human genomic DNA. Regions of less than 50% A + T content were found associated with Alu sequences, a 150-base pair region immediately 5' to the beta gene, exon regions from both genes, and an area 3' to the beta gene. These regions also contained significantly lower than expected CpG levels compared to other regions, suggesting a possible relationship between DNA organizational patterns and functionally important regions. In addition, strand asymmetries in base composition in this region differ from those associated with the fetal globin genes.  相似文献   

20.
MOTIVATION: Insertion mutagenesis, using transgenes or endogenous transposons, is a popular method for generating null mutations (knockouts) in model organisms. Insertions are mapped to specific genes by amplifying (via TAIL-PCR) and sequencing genomic regions flanking the inserted DNA. The presence of multiple TAIL-PCR templates in one sequencing reaction results in chimeric sequence of intermittently low quality. Standard processing of this sequence by applying Phred quality requirements results in loss of informative sequence, whereas not trimming low-quality sequence causes inclusion of low-complexity homopolymers from the ends of sequence runs. Accurate mapping of the flanking sequences is complicated by the presence of gene families. RESULTS: Methods for extracting informative regions from sequence traces obtained by sequencing multiple TAIL-PCR fragments in a single reaction are described. The completely sequenced Arabidopsis genome was used to identify informative TAIL-PCR sequence regions. Methods were devised to define and select high quality matches and precisely map each insert to the correct genome location. These methods were used to analyze sequence of TAIL-PCR-amplified flanking regions of the inserts from individual plants in a T-DNA-mutagenized population of Arabidopsis thaliana, and are applicable to similar situations where a reference genome can be used to extract information from poor-quality sequence.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号