Similar articles
20 similar articles found.
1.
We have identified a new family of interspersed, moderately repetitive DNA elements, termed the RSg-1 family, in the genome of the rainbow trout. Two of the elements examined here are situated upstream of sequences that code for trout nuclear proteins: a protamine gene (p101) and the clustered histone H4 gene. Sequence comparison of various RSg-1 elements indicated a high degree of nucleotide sequence homology between different members of the family. These repetitive elements exhibit well-defined 3' ends, which contain poly(A) segments preceded by the consensus polyadenylation signal AATAAA. Sequences flanking the 3' end of the poly(A) tract also conform to a consensus sequence. A similar sequence is also found flanking the 5' terminus of the element in the protamine clone p101, and thus may represent a target-site duplication generated upon insertion of the element into the genome. These characteristics, together with the heterogeneous nature of the 5' ends of the elements, are reminiscent of processed pseudogenes and retroposons such as the mammalian L1 family of interspersed repetitive elements.

2.
3.
4.
We have used computer-assisted dot matrix and oligonucleotide frequency analyses to identify highly recurring sequence elements of 7-11 base pairs in eukaryotic genes and viral DNAs. Such elements are found much more frequently than expected, often with an average spacing of a few hundred base pairs. Furthermore, the most abundant repetitive elements observed in the ovalbumin locus, the beta-globin gene cluster, the metallothionein gene, and the viral genomes of SV40, polyoma, Herpes simplex-1, and Mouse Mammary Tumor Virus were sequences shown previously to be protein binding sites or sequences important for regulating gene expression. These sequences were present in both exons and introns as well as promoter regions. These observations suggest that such sequences are often highly overrepresented within the specific gene segments with which they are associated. Computer analysis of other genetic units, including viral genomes and oncogenes, has identified a number of highly recurring sequence elements that could serve similar regulatory or protein-binding functions. A model for the role of such reiterated sequence elements in DNA organization and function is presented.
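
The oligonucleotide frequency analysis mentioned in the abstract above can be illustrated with a minimal sketch. This is not the authors' program: the function names, the independent-base background model, and the `min_ratio` threshold are all assumptions made for illustration. It counts every k-mer in a sequence and compares each observed count with the count expected if bases occurred independently at the sequence's own composition.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count every overlapping k-mer in seq (case-insensitive)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def expected_count(kmer, seq):
    """Expected occurrences of kmer under an assumed independent-base model
    that uses the base composition of seq itself."""
    seq = seq.upper()
    n = len(seq)
    base_freqs = Counter(seq)
    probability = 1.0
    for base in kmer.upper():
        probability *= base_freqs[base] / n
    # Number of windows of length k times the per-window probability.
    return probability * (n - len(kmer) + 1)


def overrepresented(seq, k, min_ratio=2.0):
    """Return (kmer, observed, observed/expected) tuples, highest ratio first.
    min_ratio is a hypothetical cutoff, not a value from the abstract."""
    hits = []
    for kmer, observed in kmer_counts(seq, k).items():
        ratio = observed / expected_count(kmer, seq)
        if ratio >= min_ratio:
            hits.append((kmer, observed, ratio))
    return sorted(hits, key=lambda hit: hit[2], reverse=True)
```

For elements of the size described in the abstract, one would call `overrepresented(sequence, k)` for each `k` from 7 to 11. A more realistic background model (for example, one based on dinucleotide frequencies) would change which elements count as overrepresented.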

5.
Six overlapping BAC clones covering the Hv-eIF4E gene region in barley were sequenced over their entire length, resulting in a 439.7 kb contiguous sequence. The contig contains only two genes, Hv-eIF4E and Hv-MLL, which are located in a small gene island, and more than 88% of the sequence is composed of transposable elements. A detailed analysis of the repetitive component revealed that this chromosomal region was affected by multiple major duplication and deletion events as well as the insertion of numerous transposable elements, resulting in a complete reshuffling of genomic DNA. Resolving this highly complex pattern resulted in a model unraveling evolutionary events that shaped this region over an estimated 7 million years. Duplications and deletions caused by illegitimate recombination and unequal crossing over were major driving forces in the evolution of the Hv-eIF4E region, equaling or exceeding the effects of transposable element activities. In addition to a dramatic reshuffling of the repetitive portion of the sequence, we also found evidence for important contributions of illegitimate recombination and transposable elements to the sequence organization of the gene island containing Hv-eIF4E and Hv-MLL.

6.
The tCUP cryptic constitutive promoter was discovered in the tobacco genome by T-DNA (transfer DNA) tagging with a promoterless GUS-nos gene. Here, we show that the portion of the tCUP sequence containing a variety of cryptic gene regulatory elements is related to a new family of moderately repetitive sequences (10^2 copies), the RENT (repetitive element from Nicotiana tabacum) family. The RENT family is found only in certain Nicotiana species. Five RENT elements were cloned and sequenced. The RENT elements are a minimum of 5 kb in length and share 80-90% sequence similarity throughout their length. The 5' termini are the same in the isolated RENT family members and are characterized by a conserved border sequence (TGTTGA(T or C)ACCCAATTTT(T or C)). The 3' ends of RENT sequence similarity vary in location and sequence. The tCUP cryptic promoter originated from a unique truncated RENT element that interrupts a phytochelatin synthase-like gene that may have undergone rearrangements prior to or resulting from T-DNA insertion. No evidence was found for expressed coding regions within the RENT elements; however, like the cryptic gene regulatory elements within the tCUP sequence, the isolated RENT elements possess promoter activity and translational enhancer activity.

7.
8.
Summary: The availability of the amino acid sequence for nine different mammalian P1 family protamines and the revised amino acid sequence of the chicken protamine galline (Oliva and Dixon 1989) reveals a much closer relationship between mammalian and avian protamines than was previously thought (Nakano et al. 1976). Dot matrix analysis of all protamine genes for which genomic DNA or cDNA sequence is available reveals both marked sequence similarities in the mammalian protamine gene family and internal repeated sequences in the chicken protamine gene. The detailed alignments of the cis-acting regulatory DNA sequences show several consensus sequence patterns, particularly the conservation of a cAMP response element (CRE) in all the protamine genes and of the regions flanking the TATA box, CAP site, N-terminal coding region, and polyadenylation signal. In addition, we have found a high frequency of the CA dinucleotide immediately adjacent to the CRE element of both the protamine genes and the testis transition proteins, a feature not present in other genes, which suggests the existence of an extended CRE motif involved in the coordinate expression of protamine and transition protein genes during spermatogenesis. Overall, these findings suggest the existence of an avian-mammalian P1 protamine gene line and are discussed in the context of different hypotheses for protamine gene evolution and regulation.
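
Dot matrix analysis, as used in the abstract above to find internal repeats, can be sketched as follows. This is a generic windowed dot plot, not the authors' software: the function name, the default window of 7, and the toy sequence are assumptions for illustration only. A dot is recorded at (i, j) whenever the windows starting at position i of one sequence and position j of the other agree in at least `min_matches` positions.

```python
def dot_matrix(seq_a, seq_b, window=7, min_matches=7):
    """Return the set of (i, j) window start positions at which
    seq_a[i:i+window] and seq_b[j:j+window] agree in at least
    min_matches positions."""
    dots = set()
    for i in range(len(seq_a) - window + 1):
        window_a = seq_a[i:i + window]
        for j in range(len(seq_b) - window + 1):
            window_b = seq_b[j:j + window]
            matches = sum(x == y for x, y in zip(window_a, window_b))
            if matches >= min_matches:
                dots.add((i, j))
    return dots


# Made-up toy sequence containing one direct repeat of "GATTACA".
toy = "GATTACAGATTACA"
self_dots = dot_matrix(toy, toy)
```

Comparing a sequence with itself always yields the main diagonal (i equal to j). Dots off that diagonal, such as position (0, 7) in the toy sequence, mark internal repeated sequences of the kind reported for the chicken protamine gene. Lowering `min_matches` below `window` tolerates mismatches at the cost of more background dots.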

9.
Arabidopsis thaliana has a relatively small genome of approximately 130 Mb containing about 10% repetitive DNA. Genome sequencing studies reveal a gene-rich genome, predicted to contain approximately 25000 genes spaced on average every 4.5 kb. Between 10 and 20% of the predicted genes occur as clusters of related genes, indicating that local sequence duplication and subsequent divergence generate a significant proportion of gene families. In addition to gene families, repetitive sequences comprise individual and small clusters of two to three retroelements and other classes of smaller repeats. The clustering of highly repetitive elements is a striking feature of the A. thaliana genome emerging from sequence and other analyses.

10.
Structural data are presented on the protamine gene cluster (PGC) of human, mouse, rat, and bull. By restriction mapping we demonstrate that the organization of the protamine cluster is conserved throughout all four species, i.e., the genes are situated in a head-to-tail arrangement in the order: protamine 1-protamine 2-transition protein 2. Further, we established the nucleotide sequence of the entire human PGC (25 kb in total) and the 3′ portion of the rat protamine cluster (PRM2 and TNP2 genes and intergenic region). In addition, a 1 kb fragment of the bovine and murine protamine cluster, situated between PRM2 and TNP2, was sequenced. This fragment is conserved regarding sequence, position, and orientation in all species examined, and was classified as a likely coding region by the gene recognition program GRAIL. Using the rat fragment as a probe in RNA blots, we detected a testis-specific signal of about 0.5 kb. Finally, we demonstrate a high density of Alu elements, both full and fragmented copies, in the human PGC and discuss their localization with respect to evolutionary and functional aspects. © 1996 Wiley-Liss, Inc.

11.
We have analyzed a sequence of approximately 70 base pairs (bp) that shows a high degree of similarity to sequences present in the non-coding regions of a number of human and other mammalian genes. The sequence was discovered in a fragment of human genomic DNA adjacent to an integrated hepatitis B virus genome in cells derived from human hepatocellular carcinoma tissue. When one of the viral flanking sequences was compared to nucleotide sequences in GenBank, more than thirty human genes were identified that contained a similar sequence in their non-coding regions. The sequence element was usually found once or twice in a gene, either in an intron or in the 5' or 3' flanking regions. It did not share any similarities with known short interspersed nucleotide elements (SINEs) or presently known gene regulatory elements. This element was highly conserved at the same position within the corresponding human and mouse genes for myoglobin and N-myc, indicating evolutionary conservation and possible functional importance. Preliminary DNase I footprinting data suggested that the element or its adjacent sequences may bind nuclear factors to generate specific DNase I hypersensitive sites. The size, structure, and evolutionary conservation of this sequence indicate that it is distinct from other types of short interspersed repetitive elements. It is possible that the element may have a cis-acting functional role in the genome.

12.
Fine organization of Bombyx mori fibroin heavy chain gene (cited 17 times: 0 self-citations, 17 by others)
The complete sequence of the Bombyx mori fibroin gene has been determined by combining a shotgun sequencing strategy with physical map-based sequencing procedures. It consists of two exons (67 and 15 750 bp, respectively) and one intron (971 bp). The fibroin coding sequence presents a spectacular organization, with a highly repetitive and G-rich (~45%) core flanked by non-repetitive 5′ and 3′ ends. This repetitive core is composed of alternate arrays of 12 repetitive and 11 amorphous domains. The sequences of the amorphous domains are evolutionarily conserved, and the repetitive domains differ from each other in length by a variety of tandem repeats of subdomains of ~208 bp, which are reminiscent of the repetitive nucleosome organization. A typical composition of a subdomain is a cluster of repetitive units, Ua, followed by a cluster of units, Ub (with a Ua:Ub ratio of 2:1), flanked by conserved boundary elements at the 3′ end. Moreover, some repeats are also perfectly conserved at the peptide level, indicating that the evolutionary pressure is not identical along the sequence. A tentative model for the constitution and evolution of this unusual gene is discussed.

13.
Using the chicken protamine gene as a probe, we have isolated and sequenced several positive clones from a quail testis cDNA library which reveal the complete sequence for the quail protamine cDNA. The predicted amino acid sequence for the quail protamine contains the N-terminal tetrapeptide ARYR present in the N-terminal region of the mammalian protamines as well as several conserved motifs and arginine clusters. In addition, the size of the quail protamine (56 amino acids) is closer to that of mammals (50 amino acids) than that of the chicken (61 amino acids). Altogether, these data strongly suggest the existence of an avian-mammalian protamine gene line during evolution. Southern blot analysis suggests a small number of copies (2) per haploid genome (similar to that of chicken). The reported quail protamine cDNA sequence is the second avian protamine for which the amino acid sequence is available so far and provides new insights into vertebrate protamine function and evolution.

14.
Summary: The Bombyx fibroin gene has a discrete mosaic structure of various repetitive sequences, which may have evolved through various repeating arrangements. Detailed sequence analysis of the fibroin gene containing coding and noncoding regions revealed that the whole sequence could be arranged as an array of short repetitive sequences. A portion of the intron of the fibroin gene is one of the interspersed repetitive elements. We cloned a 1.5-kb DNA fragment of the Bombyx genome that contains interspersed elements homologous to the intron sequence. Sequence comparison between the intron and the 1.5-kb fragment shows that partial duplication has frequently occurred in the course of evolution, and the resultant repetitive blocks of short motif sequences are abundant in the genome. These facts suggest that tandem duplication of the short motif sequence is an important rearrangement in genomic evolution of the fibroin gene. Offprint requests to: S. Ichimura

15.
16.
17.
K Kitajima, H Sorimachi, S Inoue, Y Inoue. Biochemistry 1988, 27(18): 7141-7145
The complete amino acid sequence of the major polysialoglycoproteins (PSGPs) from two genera of salmonid fish eggs, Salvelinus and Oncorhynchus, has been determined. The occurrence of tandem repeats of a genus-specific dodeca- and tridecapeptide was found for the apoPSGP of Salvelinus leucomaenis pluvius (Slp) and Oncorhynchus masou ishikawai (Omi), respectively, their amino acid sequences being highly homologous with that of rainbow trout [Salmo gairdneri (Sg)] apoPSGP (* denotes the glycosylation site; mean value of N = approximately 25):
H-PSGP(Slp): (Asp-Asp-Ala-Thr*-Ser*-Glu-Ala-Ala-Thr*-Gly-Pro-Ser-)N
H-PSGP(Omi): (Asp-Asp-Ala-Thr*-Ser*-Glu-Ala-Ala-Thr*-Gly-Pro-Ser-Ser-)N
H-PSGP(Sg): (Asp-Asp-Ala-Thr*-Ser*-Glu-Ala-Ala-Thr*-Gly-Pro-Ser-Gly-)N
Within 5-7 min following fertilization, H-PSGP is converted to the low-molecular-mass PSGP (L-PSGP) by a specific protease (PSGPase). We have purified L-PSGP from the fertilized eggs of S. leucomaenis pluvius and Oncorhynchus keta (chum salmon) and compared it with rainbow trout egg L-PSGP(Sg) by analysis of their amino acid sequence:
L-PSGP(Slp): Asp-Ala-Thr*-Ser*-Glu-Ala-Ala-Thr*-Gly-Pro-Ser-Asp
L-PSGP(Ok): Asp-Asp-Ala-Thr*-Ser*-Glu-Ala-Ala-Thr*-Gly-Pro-Ser-Ser
L-PSGP(Sg): Asp-Asp-Ala-Thr*-Ser*-Glu-Ala-Ala-Thr*-Gly-Pro-Ser-Gly
The data support the conclusion that H-PSGP is degraded in vivo 5-7 min after fertilization to L-PSGP by proteolytic cleavage at the position two residues C-terminally to the Pro residue, i.e., -Pro-Ser-Xaa-Asp- (Xaa = either Gly, Ser, or Asp), by the action of PSGPase.
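
The cleavage rule stated in the abstract above can be checked against the listed sequences with a short sketch. The function and variable names are hypothetical, glycosylation marks are omitted, and the repeat number used in the example is far smaller than the reported mean of about 25. The code builds a tandem-repeat H-PSGP chain and cuts it after the residue two positions C-terminal to each Pro, provided that the next residue is Asp (the -Pro-Ser-Xaa-Asp- pattern).

```python
# Repeat units from the abstract, without the glycosylation asterisks.
SLP_UNIT = "Asp-Asp-Ala-Thr-Ser-Glu-Ala-Ala-Thr-Gly-Pro-Ser".split("-")
SG_UNIT = "Asp-Asp-Ala-Thr-Ser-Glu-Ala-Ala-Thr-Gly-Pro-Ser-Gly".split("-")


def cleave_psgp(residues):
    """Split a residue list at every -Pro-Ser-Xaa-|-Asp- site.

    The cut falls between Xaa (two residues C-terminal to Pro) and the
    following Asp, as described for PSGPase in the abstract."""
    fragments = []
    start = 0
    for i, residue in enumerate(residues):
        cut = i + 3  # index of the first residue after the cut
        if residue == "Pro" and cut < len(residues) and residues[cut] == "Asp":
            fragments.append(residues[start:cut])
            start = cut
    fragments.append(residues[start:])
    return fragments


# Small illustrative chains (4 and 3 repeats, chosen only for brevity).
slp_fragments = cleave_psgp(SLP_UNIT * 4)
sg_fragments = cleave_psgp(SG_UNIT * 3)
```

With the 12-residue Salvelinus unit, the residue two positions after Pro is the first Asp of the next repeat, so each internal fragment begins with a single Asp and ends with Asp, matching the L-PSGP(Slp) sequence in the abstract. With the 13-residue rainbow trout unit, the cut falls exactly at the repeat boundary and each fragment equals one repeat unit, matching L-PSGP(Sg).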

18.
Eller CD, Regelson M, Merriman B, Nelson S, Horvath S, Marahrens Y. Gene 2007, 390(1-2): 153-165
Housekeeping genes are expressed across a wide variety of tissues. Since repetitive sequences have been reported to influence the expression of individual genes, we employed a novel approach to determine whether housekeeping genes can be distinguished from tissue-specific genes by their repetitive sequence context. We show that Alu elements are more highly concentrated around housekeeping genes, while various longer (>400 bp) repetitive sequences (“repeats”), including Long Interspersed Nuclear Element-1 (LINE-1) elements, are excluded from these regions. We further show that isochore membership does not distinguish housekeeping genes from tissue-specific genes and that repetitive sequence environment distinguishes housekeeping genes from tissue-specific genes in every isochore. The distinct repetitive sequence environment, in combination with other previously published sequence properties of housekeeping genes, was used to develop a method of predicting housekeeping genes on the basis of DNA sequence alone. Using expression across tissue types as a measure of success, we demonstrate that repetitive sequence environment is by far the most important sequence feature identified to date for distinguishing housekeeping genes.

19.
20.
Gene mapping of isozyme loci in chum salmon (cited 5 times: 0 self-citations, 5 by others)
Recombination values were used to calculate the gene-centromere map distances for four electrophoretically detected loci, Aat3, Idh1, Idh4, and Mpi, in chum salmon (Oncorhynchus keta). We also report the results from 39 pairwise examinations for joint segregation for 10 loci in nine testcross families. Only two loci assorted nonrandomly: either Aat1 or Aat2 with Gpt. Gene-centromere distances for Aat3 and Mpi differed significantly from those reported previously for rainbow trout (Salmo gairdneri), a closely related species. This difference indicates either the presence of chromosome rearrangements or a different rate of recombination between the species. These results contrast with the conservation of linkage distances previously reported within and between other salmonid genera.
