首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The nucleotide sequence of the integrated avian myeloblastosis virus long terminal repeat has been determined. The sequence is 385 base pairs long and is present at both ends of the viral DNA. The cell-virus junctions at each end consist of a 6-base-pair direct repeat of cell DNA next to the inverted repeat of viral DNA. The long terminal repeat also contains promoter-like sequences, an mRNA capping site, and polyadenylation signals. Several features of this long terminal repeat suggest a structural and functional similarity with sequences of transposable and other genetic elements. Comparison of these sequences with long terminal repeats of other avian retroviruses indicates that there is a great variation in the 3' unique sequence (U3), whereas the 5' specific sequences (U5) and the R region are highly conserved.  相似文献   

2.
R Levis  P Dunsmuir  G M Rubin 《Cell》1980,21(2):581-588
We have determined the nucleotide sequence of the terminal regions of two members of the copia sequence family of D. melanogaster. The first 276 bp at one end of a copia element are repeated in direct orientation at its other end. The direct repeats on a single copia element are identical to each other, but they differ by two nucleotide substitutions between the two elements which were examined; this suggests that during transposition only one direct repeat of the parent element is used as a template for both direct repeats of the transposed element. Each direct repeat itself contain a 17 bp imperfectly matched inveted terminal repetition. The ends of copia show significant sequence homology both to the yeast Ty1 element and to the integrated provirus of avian spleen necrosis virus, two other eucaryotic elements known to insert at many different chromosomal locations. Analysis of the genomic organization of the direct repeat sequence demonstrates that it seldom, if ever, occurs unlinked to an entire copia element.  相似文献   

3.
4.
Two independent isolates of a Bordetella pertussis repeated DNA unit were sequenced and shown to be an insertion sequence element with five nucleotide differences between the two copies. The sequences were 1053 bp in length with near-perfect terminal inverted repeats of 28 bp, had three open reading frames, and were each flanked by short direct repeats. The two insertion sequences showed considerable homology to two other B. pertussis repeated DNA sequences reported recently: IS481 and a 530 bp repeated DNA unit. The B. pertussis insertion sequence would appear to comprise a group of closely related sequences differing mainly in flanking direct repeats and the terminal inverted repeats. The two isolates reported here, which were from the adenylate cyclase and agglutinogen 2 regions of the genome, were numbered IS48lvl and IS48lv2 respectively.  相似文献   

5.
The nucleotide sequence of 1.4 kbp SmaI-fragment of minicircle DNA from kinetoplasts of Crithidia fasciculata has been determined and some sequence elements characterized. The sequence contains several oligo(dT)blocks located on the same strand in phase with a period of DNA helix turn, thus representing a "bent helix". Both sides of the bent helix region are flanked by sequences capable of forming a cloverleaf structure. There are also two direct 150 bp repeats located 180 degrees apart on the circular map of the molecule. Each repeat contains the sites of H-strand and L-strand replication origin. The specific stem-loop secondary structure may be folded by the nucleotide sequence within the origins region. The alignment of the sequence determined with two other C. fasciculata minicircle sequences spanning over the bent helix and the adjacent regions has indicated the presence of several conserved sequence blocks, one of them representing the sequence of the bend. The divergence of three sequences occurred mainly by small insertions-deletions. Several open reading frames were found, the largest of which being capable of coding for the approximately 200 amino acids polypeptide.  相似文献   

6.
Complete chromosome/genome sequences available from humans, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, and Saccharomyces cerevisiae were analyzed for the occurrence of mono-, di-, tri-, and tetranucleotide repeats. In all of the genomes studied, dinucleotide repeat stretches tended to be longer than other repeats. Additionally, tetranucleotide repeats in humans and trinucleotide repeats in Drosophila also seemed to be longer. Although the trends for different repeats are similar between different chromosomes within a genome, the density of repeats may vary between different chromosomes of the same species. The abundance or rarity of various di- and trinucleotide repeats in different genomes cannot be explained by nucleotide composition of a sequence or potential of repeated motifs to form alternative DNA structures. This suggests that in addition to nucleotide composition of repeat motifs, characteristic DNA replication/repair/recombination machinery might play an important role in the genesis of repeats. Moreover, analysis of complete genome coding DNA sequences of Drosophila, C. elegans, and yeast indicated that expansions of codon repeats corresponding to small hydrophilic amino acids are tolerated more, while strong selection pressures probably eliminate codon repeats encoding hydrophobic and basic amino acids. The locations and sequences of all of the repeat loci detected in genome sequences and coding DNA sequences are available at http://www.ncl-india.org/ssr and could be useful for further studies.  相似文献   

7.
The stability of elements of three different dispersed repeated gene families in the genome of Drosophila tissue culture cells has been examined. Different amounts of sequences homologous to elements of 412, copia and 297 dispersed repeated gene families are found in the genomes of D. melanogaster embryonic and tissue culture cells. In general the amount of these sequences is increased in the cell lines. The additional sequences homologous to 412, copia and 297 occur as intact elements and are dispersed to new sites in the cell culture genome. It appears that these elements can insert at many alternative sites. We also describe a DNA sequence arrangement found in the D. melanogaster embryo genome which appears to result from a transposition of an element of the copia dispersed repeated gene family into a new chromosomal site. The mechanism of insertion of this copia element is precise to within 90 bp and may involve a region of weak sequence homology between the site of insertion and the direct terminal repeats of the copia element.  相似文献   

8.
M Carlson  D Brutlag 《Cell》1978,15(3):733-742
A method for purifying sequences adjacent to satellite DNA in the heterochromatin of D. melanogaster is described. A cloned DNA segment containing part of a copia gene adjacent to 1.688 g/cm3 satellite DNA has been isolated. The copia genes compose a repeated gene family which codes for abundant cytoplasmic poly(a)-containing RNA (Young and Hogness, 1977; Finnegan et al., 1978). We have identified two major poly (A)-containing RNA species [5.2 and 2.1 kilobases (kb)] produced by the copia gene family. The cloned segment contains copia sequences homologous to the 5' end of RNA within 0.65 kb of the 1.688 satellite DNA sequences. Seven different cloned copia genes from elsewhere in the genome have also been isolated, and a 5.2 kb region present in five of the clones was identified as copia by heteroduplex analysis. In addition, three ususual copies of copia were found: a "partial" copy of the gene (3.7 kb) which has one endpoint in common with the 5.2 kb unit; a copia gene flanked on one side by a 1.6 kb sequence and on the other by the same 1.6 kb sequence in the inverted orientation; and a copia gene flanked only on one side by the same sequence.  相似文献   

9.
Simplified DNA sequence acquisition has provided many new data sets that are useful for phylogenetic reconstruction, including single- and multiple-copy nuclear and organellar genes. Although transcribed regions receive much attention, nontranscribed regions have recently been added to the repertoire of sequences suitable for phylogenetic studies, especially for closely related taxa. We evaluated the efficacy of a small portion of the histone repeat for phylogenetic reconstruction among Drosophila species. Histone repeats in invertebrates offer distinct advantages similar to those of widely used ribosomal repeats. First, the units are tandemly repeated and undergo concerted evolution. Second, histone repeats include both highly conserved coding and variable intergenic regions. This composition facilitates application of "universal" primers spanning potentially informative sites. We examined a small region of the histone repeat, including the intergenic spacer segments of coding regions from the divergently transcribed H2A and H2B histone genes. The spacer (about 230 bp) exists as a mosaic with highly conserved functional motifs interspersed with rapidly diverging regions; the former aid in alignment of the spacer. There are no ambiguities in alignment of coding regions. Coding and noncoding regions were analyzed together and separately for phylogenetic information. Parsimony, distance, and maximum-likelihood methods successfully retrieve the corroborated phylogeny for the taxa examined. This study demonstrates the resolving power of a small histone region which may now be added to the growing collection of phylogenetically useful DNA sequences.  相似文献   

10.
We present an analysis of a chromosomal walk in the region of the euchromatin-heterochromatin transition at the base of the X chromosome of Drosophila melanogaster. This region is difficult to analyse because of the presence of repeated sequences, and we have used cosmids to walk from the last euchromatic gene, suppressor of forked, towards the pericentric heterochromatin. The proximal 30-kb sequence we have isolated consists of repetitive DNA, including four tandem copies of a 5.9-kb sequence. This tandem repeat is itself a mosaic of other, mostly repeated, sequences, including part of a retrotransposon without long terminal repeats, a simple-sequence region of TAA repeats and part of a retrotransposon with long terminal repeats that has not been previously described. Although sequences homologous to these components are found elsewhere in the genome, this arrangement of repeated sequences is only found at the base of the X chromosome. It is conserved in D. melanogaster strains of different geographic origin, but is not conserved in even closely related species.  相似文献   

11.
The transcriptional control regions of the copia retrotransposon   总被引:4,自引:3,他引:1  
  相似文献   

12.
Nucleotide sequence of the gene for the b subunit of human factor XIII   总被引:9,自引:0,他引:9  
R E Bottenus  A Ichinose  E W Davie 《Biochemistry》1990,29(51):11195-11209
Factor XIII (Mr 320,000) is a blood coagulation factor that stabilizes and strengthens the fibrin clot. It circulates in blood as a tetramer composed of two a subunits (Mr 75,000 each) and two b subunits (Mr 80,000 each). The b subunit consists of 641 amino acids and includes 10 tandem repeats of 60 amino acids known as GP-I structures, short consensus repeats (SCR), or sushi domains. In the present study, the human gene for the b subunit has been isolated from three different genomic libraries prepared in lambda phage. Fifteen independent phage with inserts coding for the entire gene were isolated and characterized by restriction mapping, Southern blotting, and DNA sequencing. The gene was found to be 28 kilobases in length and consisted of 12 exons (I-XII) separated by 11 intervening sequences. The leader sequence was encoded by exon I, while the carbonyl-terminal region of the protein was encoded by exon XII. Exons II-XI each coded for a single sushi domain, suggesting that the gene evolved through exon shuffling and duplication. The 12 exons in the gene ranged in size from 64 to 222 base pairs, while the introns ranged in size from 87 to 9970 nucleotides and made up 92% of the gene. The introns contained four Alu repetitive sequences, one each in introns A, E, I, and J. A fifth Alu repeat was present in the flanking 3' end of the gene. Two partial KpnI repeats were also found in the introns, including one in intron I and one in intron J. The KpnI repeat in intron J was 89% homologous to a sequence of approximately 2200 nucleotides flanking the gene coding for human beta globin and approximately 3800 nucleotides from the L1 insertion present in the gene for human factor VIII. Intron H also contained an "O" family repeat, while two potential regions for Z-DNA were identified within introns G and J. One nucleotide change was found in the coding region of the gene when its sequence was compared to that of the cDNA. This difference, however, did not result in a change in the amino acid sequence of the protein.  相似文献   

13.
We have investigated the organisation, nucleotide sequence, and chromosomal distribution of a tandemly repeated, satellite DNA from Allium cepa (Liliaceae). The satellite, which constitutes about 4% of the A. cepa genome, may be resolved from main-band DNA in antibiotic-CsCl density gradients, and has a repeat length of about 375 base pairs (bp). A cloned member of the repeat family hybridises exclusively to chromosome telomeres and has a non-random distribution in interphase nuclei. We present the nucleotide sequences of three repeats, which differ at a large number of positions. In addition to arrays made up of 375-bp repeats, homologous sequences are found in units with a greater repeat length. This divergence between repeats reflects the heterogeneity of the satellite determined using other criteria. Possible constraints on the interchromosomal exchange of repeated sequences are discussed.  相似文献   

14.
15.
We have determined the nucleotide sequence of the cell binding domain region of the chicken fibronectin gene and analyzed it evolutionaly. We present here the complete nucleotide sequence of 4.3 kb HindIII/EcoRI segment from the clone lambda FC23 of the chicken fibronectin gene. There were five exons in this segment. When we lined up the amino acid of exons 28, 29 and 31, three alignments, known as the Type III repeat, appeared. Tetrapeptide, -RGDS-, called the cell binding domain, existed in the second repeat, coding exon 30. It was presumed that the Type III repeats were composed of two exons in the chicken gene, the same as in the rat and humans. We found repeatedly appearing amino-acid sequences such as -TIT- (three arrays in these Type III repeats) but also found one of the amino acids substituted in the tripeptide in these Type III repeats (seven arrays). We analyzed these repeats from the point of view of evolution. We used three of the nucleotide sequences (12-18 bp) coding such -TIT- repeats as a unit length for comparing the various homologies after dividing the coding region into 56 segments. The mutual homology of the divided segments to each one of three showed 53% on average. On the other hand, the mutual nucleotide homology of the Type III repeat was 44%. This suggested that the Type III repeat may have been developed by frequent duplication of small gene units.  相似文献   

16.
Nucleotide sequence of the glnA control region of Escherichia coli   总被引:10,自引:0,他引:10  
The RNA polymerase binding sites present along a DNA segment encompassing the glnA, glnL, and glnG genes have been identified in a hybrid plasmid carrying this chromosomal region of Escherichia coli. The DNA sequence was determined of an 817 base pair segment that contains the region coding for the first 42 amino acids of the NH2-terminal and of the glnA structural gene, as well as its regulatory region. Analysis of this nucleotide sequence revealed three probable RNA polymerase recognition sites, imperfect palindromes, inverted repeats, and direct repeated sequences.  相似文献   

17.
There are at least three immunoglobulin epsilon genes (C epsilon 1, C epsilon 2, and C epsilon 3) in the human genome. The nucleotide sequences of the expressed epsilon gene (C epsilon 1) and one (C epsilon 3) of the two epsilon pseudogenes were compared. The results show that the C epsilon 3 gene lacks the three intervening sequences entirely and has a 31-base A-rich sequence 16 bases 3' to the putative poly(A) addition signal, indicating that the C epsilon 3 gene is a processed gene. The C epsilon 3 gene sequence is homologous to the five separate DNA segments of the C epsilon 1 gene; namely, a segment in the 5'-flanking region (100 bases) and four exons, which are interrupted by a spacer region or intervening sequences. Long terminal repeat (LTR)-like sequences which contain TATAAA and AATAAA sequences as well as terminal inverted repeats are present in both 5'- and 3'-flanking regions. The 5' and 3' LTR-like sequences do not, however, constitute a direct repeat, unlike transposable elements of eukaryotes and retroviruses. The 3' LTR-like sequence is repetitive in the human genome, but is not homologous to the Alu family DNA. Models for the evolutionary origin of the processed gene flanked by the LTR-like sequences are discussed. The C epsilon 3 gene has a new open frame which codes potentially for an unknown protein of 292 amino acid residues.  相似文献   

18.
We present the complete nucleotide sequence and the deduced amino acid sequence of the H-2Dp class I gene. This gene, which was cloned from a B10.P genomic DNA library, encodes and intact, functional H-2Dp molecule. Comparative analysis of the Dp sequence with other class I sequences reveals both similarities and differences. This analysis also shows that these genes exhibit D region-specific, locus-specific, as well as allele-specific sequences. The H-2Dp nucleotide sequence is greater than 90% homologous to the H-2Ld and H-2Db genes and only approximately 85% homologous to the H-2Dd gene. The K region and Qa region genes are less homologous. The 3' noncoding sequences appear to be region-specific. All of the previously described D region genes, Db, Ld, and Dd, possess the B2-SINE Alu-like repetitive sequence, as does Dp. Thus, this B2 repeat is a region-specific marker present in all D region genes studied so far. The additional polyadenylation site found in the H-2Dp gene starting at nucleotide 4671, which is homologous to non-D region sequences, as well as unique protein Dp coding sequences, make this gene an interesting model for studying the evolution of polymorphism and structure/function relationships in the class I gene family.  相似文献   

19.
20.
An earlier report (Subramanian, Dhar, and Weissman, 1977c) presented the nucleotide sequence of Eco RII-G fragment of SV40 DNA, which contains the origin of DNA replication. The nucleotide sequence of Eco RII-N fragment located next to Eco RII-G on the physical map of SV40 DNA is presented in this report. Eco RII-N is found to be a tandem duplication of the last 55 nucleotides of Eco RII-G. This tandem repeat is immediately preceded by two other reiterated sequences occurring within Eco RII-G, one of them being a tandem repeat of 21 nucleotides and the other a nontandem repeat of 10 nucleotides. These repetitive sequences occur in close proximity to the origin of DNA replication which is known to contain other specialized sequences such as a few palindromes (one of which is 27 long and possesses a perfect 2-fold axis of symmetry), one "true" palindrome, and a long A/T-rich cluster. The repeats (and the replication origin) occur within an untranslated region of SV40 DNA flanked by (the few) structural genes coding for the "late" proteins on the one side and that (those) coding for the "early" protein(s) on the other side. The reiterated sequences are comparable in some respects to repetitive sequences occurring in eucaryotic DNAs. Possible biological functions of the repeats are discussed.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号