首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
We have identified and sequenced two members of a chicken middle repetitive DNA sequence family. By reassociation kinetics, members of this family (termed CRl) are estimated to be present in 1500-7000 copies per chicken haploid genome. The first family member sequenced (CRlUla) is located approximately 2 kb upstream from the previously cloned chicken Ul RNA gene. The second CRl sequence (CRl)Va) is located approximately 12 kb downstream from the 3' end of the chicken ovalbumin gene. The region of homology between these two sequences extends over a region of approximately 160 base pairs. In each case, the 160 base pair region is flanked by imperfect, but homologous, short direct repeats 10-15 base pairs in length. When the CRl sequences are compared with mammalian ubiquitous interspersed repetitive DNA sequences (human Alu and Mouse Bl families), several regions of extensive homology are evident. In addition, the short nucleotide sequence CAGCCTGG which is completely conserved in ubiquitous repetitive sequence families from several mammalian species is also conserved at a homologous position in the chicken sequences. These data imply that at least certain aspects of the sequence and structure of these interspersed repeats must predate the avian-mammalian divergence. It seems that the CRl family may possibly represent an avian counterpart of the mammalian ubiquitous repeats.  相似文献   

2.
Two members of the human salivary proline-rich protein (PRP) multigene family have been isolated and completely sequenced. These PRP genes, PRH1 and PRH2, are of the HaeIII-type subfamily and code for acidic PRP proteins. Both genes are approximately 3.5 kilobase pairs (kb) in length and contain four exons. Exon 3 encodes the proline-rich part of the protein and includes five 63-base pair (bp) repeats. CAT and ATA boxes and several possible enhancer sequences occur in a 1-kb region 5' to exon 1. Two sets of repeats occur in the sequenced region in addition to the 63-bp repeats: one pair of about 140 bp flanks 500 bp of DNA in the first intervening sequence, and the other pair of 72 bp is tandemly repeated 1.4 kb 5' to the PRH1 gene. The 4-kb region of sequenced DNA from PRH1 differs by an average of 8.7% from the same region in PRH2, but the nucleotide sequences of the exon 3 of the two genes differ by only 0.2%. This result suggests the occurrence of a recent gene conversion event. The regions containing the 5-fold repeated sequences of 63 bp are identical in the two genes, PRH1 and PRH2. A comparison of the human HaeIII and BstNI subfamily repeats and a comparison of the human, mouse, and rat repeats suggest that the individual repeats have evolved in a concerted fashion within each gene and within the PRP gene family as a whole.  相似文献   

3.
We have determined the nucleotide sequence of 4508 base pairs of human genomic DNA which contain the human serine esterase gene from cytotoxic T lymphocytes (SECT) (equivalent to the 1-3E cDNA clone) and include 879 bp of 5' flanking DNA and 393 bp of 3' flanking DNA. The gene consists of five exons of 88, 148, 136, 261, and 257 nucleotides separated by four introns of 1043, 455, 205, and 643 nucleotides. The location of introns with respect to protein coding sequences in the SECT gene is identical to that of the human cathepsin G and murine granzyme B genes. Comparison of SECT gene exonic sequences to murine granzyme B-F cDNA sequences indicates similarities of 75 and 72% for granzymes B and C and 61, 59, and 61% for granzymes D, E, and F, respectively. The 5' flanking sequence of the SECT gene showed similarity only to the 5' flanking sequence of the murine granzyme B gene, indicating that these genes are homologous. Comparison of the SECT gene sequence to the human cathepsin G sequence indicated no similarity in the 5' flanking DNA although the exonic sequences show 64% sequence similarity overall and 45% sequence similarity in the respective 3' untranslated regions. These similarities suggest that the SECT and cathepsin G genes are members of the same family of serine protease genes. Evidence from high and low stringency Southern transfer analysis of human genomic DNA indicates the presence of another gene of at least 85% sequence similarity to the SECT gene.  相似文献   

4.
5.
The nucleotide sequence of an 8658-base-pair human genomic DNA segment containing the entire corticotropin-beta-lipotropin precursor gene has been determined, and some sequence features of the gene and its flanking regions have been analysed. The gene is composed of 7665 base pairs including two introns of 3708 and 2886 base pairs. Comparison of the 5'-flanking sequences of the human, bovine and mouse corticotropin-beta-lipotropin precursor genes reveals the presence of a highly conserved region, which contains sequences of 14-15 base pairs homologous with sequences located upstream of the mRNA start site of other glucocorticoid-regulated genes.  相似文献   

6.
Comparison analysis of the sequences of the mouse and human genomes has proven a powerful approach in identifying functional regulatory elements within the non-coding regions that are conserved through evolution between homologous mammalian loci. Here, we applied computational analysis to identify regions of homology in the 5' upstream sequences of the human tyrosinase gene, similar to the locus control region (LCR) of the mouse tyrosinase gene, located at -15 kb. We detected several stretches of homology within the first 30 kb 5' tyrosinase gene upstream sequences of both species that include the proximal promoter sequences, the genomic region surrounding the mouse LCR, and further upstream segments. We cloned and sequenced a 5' upstream regulatory sequence found between -8 and -10 kb of the human tyrosinase locus (termed h5'URS) homologous to the mouse LCR sequences, and confirmed the presence of putative binding sites at -9 kb, homologous to those described in the mouse tyrosinase LCR core. Finally, we functionally validated the presence of a tissue-specific enhancer in the h5'URS by transient transfection analysis in human and mouse cells, as compared with homologous DNA sequences from the mouse tyrosinase locus. Future experiments in cells and transgenic animals will help us to understand the in vivo relevance of this newly described h5'URS sequence as a potentially important regulatory element for the correct expression of the human tyrosinase gene.  相似文献   

7.
We report the isolation of the complete genes encoding nucleolin from rat and hamster. The DNA clones were obtained from partial genomic libraries by probing with a genomic DNA fragment containing the leader and promoter regions of the mouse nucleolin gene. We have determined the complete nucleotide sequence of the 5'-terminal region for the three rodent species. The sequenced regions extend over 1 kb downstream and upstream from the cap sites and include a conserved CpG island 1500 nucleotides (nt) long. The 5' end of the CpG island in each species has maintained a long alternating purine-pyrimidine sequence which could adopt a Z-DNA conformation. By sequence comparison, 42 blocks of homology are defined in the 5'-terminal region, of which 36 appear in the CpG island and contain numerous conserved CpG dinucleotides. Two blocks, 110 and 49 nt long, encompassing the cap sites and the region immediately upstream, respectively, present features characteristic of regulated genes: a possible TATA box (ATTA), two pyrimidine-rich nucleotide stretches and two inverted juxtaposed CCAAT-like boxes (GGTTGG). Furthermore, the adjacent upstream conserved region presents features characteristic of housekeeping genes: four G/C boxes, embedded in a high G + C-content sequence, among them one presenting a perfect consensus Sp 1-binding site (GCCCCGCCCC). Among unusual features, we report numerous large G + C-rich conserved sequences located in the first intron. One of these sequences contains two G/C boxes which border a sequence presenting a dyad symmetry (GCGCACGTGCTC). Our findings shed some light on the putative role of the CpG island. We show that CpG-rich sequence motifs are under strong selective pressure over the whole 5'-terminal region and are presumably involved in regulatory mechanisms.  相似文献   

8.
9.
10.
11.
12.
Molecular cloning and characterization of the human beta-like globin gene cluster   总被引:104,自引:0,他引:104  
E F Fritsch  R M Lawn  T Maniatis 《Cell》1980,19(4):959-972
The genes encoding human embryonic (epsilon), fetal (G gamma, A gamma) and adult (delta, beta) beta-like globin polypeptides were isolated as a set of overlapping cloned DNA fragments from bacteriophage lambda libraries of high molecular weight (15-20 kb) chromosomal DNA. The 65 kb of DNA represented in these overlapping clones contains the genes for all five beta-like polypeptides, including the embryonic epsilon-globin gene, for which the chromosomal location was previously unknown. All five genes are transcribed from the same DNA strand and are arranged in the order 5'-epsilon-(13.3 kb)-G gamma-(3.5 kb)-A gamma-(13.9 kb)-delta-(5.4 kb)-beta-3'. Thus the genes are positioned on the chromosome in the order of their expression during development. In addition to the five known beta-like globin genes, we have detected two other beta-like globin sequences which do not correspond to known polypeptides. One of these sequences has been mapped to the A gamma-delta intergenic region while the other is located 6-9 kb 5' to the epsilon gene. Cross hybridization experiments between the intergenic sequences of the gene cluster have revealed a nonglobin repeat sequence (*) which is interspersed with the globin genes in the following manner: 5'-**epsilon-*G gamma-A gamma*-**delta-beta*-3'. Fine structure mapping of the region located 5' to the delta-globin gene revealed two repeats with a maximum size of 400 bp, which are separated by approximately 700 bp of DNA not repeated within the cluster. Preliminary experiments indicate that this repeat family is also repeated many times in the human genome.  相似文献   

13.
Sequence homologies in the protamine gene family of rainbow trout   总被引:9,自引:2,他引:7       下载免费PDF全文
We have sequenced five different rainbow trout protamine genes plus their flanking regions. The genes are not clustered and do not contain intervening sequences. There is an extremely high degree of sequence conservation in the coding and 3' untranslated regions of the gene. Downstream sequences exhibit little homology though conserved regions are found 250 base pairs 3' to the gene. There are four regions upstream of the gene that are highly conserved in the six clones, including the canonical Goldberg - Hogness box which is 45 base pairs 5' to the coding region. A second homologous region is found 90 bases upstream. Although in the same approximate location as the CAAT box found upstream of other genes, it does not contain the canonical CAAT sequence. Further upstream of the protamine genes at -115 there is an A-T rich sequence while a 25 base pair conserved sequence is located 150 bases upstream. In addition we report the presence of a potential Z-DNA region of predominantly A-C repeats approximately one kilobase downstream of one of the genes.  相似文献   

14.
15.
16.
A 7.5 kb Hsu I restriction fragment of genomic DNA containing a beta-globin gene has been isolated from a patient doubly heterozygous for beta + thalassaemia and a delta beta (Lepore globin fusion gene. This fragment must be derived from the chromosome carrying the beta +-thalassaemia determinant. The gross structure of the cloned gene plus flanking sequences is indistinguishable from that of a normal beta-globin gene. Within in 1606 base-pair transcribed region of the gene there is only one nucleotide difference from the normal beta-globin gene sequence. This is a G leads to A replacement 21 nucleotides upstream from the 3' terminus of the small intron. This nucleotide lies within a 10 base-pair sequence repeated in an inverted configuration near the 5' terminus of the small intron. The nucleotide replacement may result in a precursor mRNA less amenable to RNA splicing than its normal counterpart.  相似文献   

17.
We have determined the nucleotide sequence of sea urchin (Lytechinus pictus) late stage H3 and H4 histone genes contained on the clone pLpH3H4 -21 and of the early stage H3 gene contained on the plasmid pLpA . Comparison of these differentially regulated histone genes with each other and with other L. pictus late and early stage histone H3 and H4 genes previously sequenced confirms that members of each histone gene family (early and late) are more homologous to each other than they are to members of other histone gene families. The spacer regions between two late H3-H4 gene pairs on the clones pLpH3H4 -19 and pLpH3H4 -21 have diverged to the point where they are no longer homologous. However, comparative analysis of the 5' flanking DNA has identified a sequence 5'C-T-C-A-T-G-T-A-T-T3' upstream of both late H4 genes and another, 5'A-G-A-T-T-C-A3', upstream of both H3 genes. Except for a short conserved sequence near the initiation codon, the transcribed 5' leaders of the late mRNAs differ in length and sequence in the two non-allelic late histone gene pairs. This divergence contrasts with the 95 to 96% conservation found between late histone gene coding sequences. The results suggest that there is intergenic exchange in the germline among members of the late histone gene family and that the unit of exchange is the individual gene rather than the heterotypic dimer which includes the common spacer DNA.  相似文献   

18.
19.
The sequence of 363 nucleotides near the 3' end of the pol gene and 564 nucleotides from the 5' terminus of the env gene in an endogenous murine leukemia viral (MuLV) DNA segment, cloned from AKR/J mouse DNA and designated as A-12, was obtained. For comparison, the nucleotide sequence in an analogous portion of AKR mink cell focus-forming (MCF) 247 MuLV provirus was also determined. Sequence features unique to MCF247 MuLV DNA in the 3' pol and 5' env regions were identified by comparison with nucleotide sequences in analogous regions of NFS -Th-1 xenotropic and AKR ecotropic MuLV proviruses. These included (i) an insertion of 12 base pairs encoding four amino acids located 60 base pairs from the 3' terminus of the pol gene and immediately preceding the env gene, (ii) the deletion of 12 base pairs (encoding four amino acids) and the insertion of 3 base pairs (encoding one amino acid) in the 5' portion of the env gene, and (iii) single base substitutions resulting in 2 MCF247 -specific amino acids in the 3' pol and 23 in the 5' env regions. Nucleotide sequence comparison involving the 3' pol and 5' env regions of AKR MCF247 , NFS xenotropic, and AKR ecotropic MuLV proviruses with the cloned endogenous MuLV DNA indicated that MCF247 proviral DNA sequences were conserved in the cloned endogenous MuLV proviral segment. In fact, total nucleotide sequence identity existed between the endogenous MuLV DNA and the MCF247 MuLV provirus in the 3' portion of the pol gene. In the 5' env region, only 4 of 564 nucleotides were different, resulting in three amino acid changes between AKR MCF247 MuLV DNA and the endogenous MuLV DNA present in clone A-12. In addition, nucleotide sequence comparison indicated that Moloney-and Friend-MCF MuLVs were also highly related in the 3' pol and 5' env regions to the cloned endogenous MuLV DNA. These results establish the role of endogenous MuLV DNA segments in generation of recombinant MCF viruses.  相似文献   

20.
Complete structure of the human gene encoding neuron-specific enolase   总被引:5,自引:0,他引:5  
D Oliva  L Calì  S Feo  A Giallongo 《Genomics》1991,10(1):157-165
At least three genes encode the different isoforms of the glycolytic enzyme enolase. We have isolated the gene for the human gamma- or neuron-specific enolase and determined the nucleotide sequence from upstream to the 5' end to beyond the polyadenylation site. The gene contains 12 exons distributed over 9213 nucleotides. Introns occur at positions identical to those reported for the homologous rat gene, as well as for the human alpha- or nonneuronal enolase gene, supporting the existence of a single ancestor for the members of this gene family. Primer extension analysis indicates that the gene has multiple start sites. The putative promoter region lacks canonical TATA and CAAT boxes, is very G + C-rich, and contains several potential regulatory sequences. Furthermore, an inverted Alu sequence is present approximately 572 nucleotides upstream of the major start site. A comparison of the 5'-flanking region of the human gamma-enolase gene with the same region of the rat gene revealed a high degree of sequence conservation.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号