Similar articles
 20 similar articles found
1.
Immunoglobulin heavy chain switch regions are segments of DNA considered to be important in mediating class switching in B lymphocytes. Whereas these segments vary in length among the different murine isotypes, their structural organization schemes are all based on the tandem repetition of unit sequences. We previously showed that the S gamma 1 segment unexpectedly contains sequence elements that differ significantly from its prevalent unit repeat (49mer). Here we extend this preliminary characterization by determining the complete nucleotide sequence of the cloned S gamma 1 segment from BALB/c DNA. We find that S gamma 1 consists of more than 120 tandemly repeated 49mers. In addition, we show that the previously identified non-49mer sequences are part of a direct repeat element about 350 bp in length (DR II), which exists in two copies at the 5' end of S gamma 1. We also show that another unrelated direct repeat element about 500 bp long (DR I) exists near the 5' and 3' ends of S gamma 1. Thus, the structure of the S gamma 1 segment may be abbreviated as 5'-DRII-(49mer)15-DRI-DRII-(49mer)n-DRI, where n is between 40 and 160. The results of our Southern hybridization experiments suggest that this basic structural scheme is maintained in eight different Igh haplotypes, although S gamma 1 segments in different Igh haplotypes include different numbers of 49mer elements. Other murine S gamma segments differ in size among various Igh loci, but to a lesser extent than S gamma 1. At the level of tandemly repeated sequences, S gamma 1, S gamma 3, and S gamma 2b represent three distinct, nonoverlapping sets of sequences.

2.
Huan Gao  Jie Kong 《DNA sequence》2005,16(6):426-436
Two rounds of random sequencing in Fenneropenaeus chinensis yielded random genomic sequences with a cumulative length of 2,597,000 bp, about 1.23 per thousand of the entire genome. The first round produced 884,000 bp from genomic DNA cut with the enzyme Sau3AI, and the second produced 1,713,000 bp from genomic DNA sheared physically by ultrasonication. Analysis of the sequences with the Tandem Repeats Finder (TRF) software found 4,588 tandem repeats, of which 3,888 are microsatellites (1-6 bp) and 700 are minisatellites (≥ 7 bp). The cumulative length of the repeats is 305,555 bp, accounting for 11.72% of the total cumulative sequence length; microsatellites alone make up 232,979 bp, or 8.97% of the total sequence length, a greater share than in other organisms such as human and mosquito. Dinucleotide repeats are the dominant repeat type, with AT the dominant repeat class. Trinucleotide repeats are the second most abundant type, with AAT the dominant repeat class. Interestingly, the repeat numbers and repeat classes of repeat types with prime-number unit lengths, such as pentanucleotide, heptanucleotide, and eleven-nucleotide repeats, are lower than those of the neighboring repeat types. This phenomenon may be related to the genesis and evolution of microsatellites and minisatellites.
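
The size classes in this abstract (microsatellites with 1-6 bp units, minisatellites with units of 7 bp or more) can be illustrated with a minimal Python sketch. This is not the TRF algorithm: TRF also tolerates mismatches and indels, whereas the hypothetical helper `find_perfect_tandem_repeats` below reports exact repeats only, and its `min_copies` threshold is an assumption made for illustration.

```python
import re


def find_perfect_tandem_repeats(seq, max_unit=6, min_copies=3):
    """Return (start, unit, copies) for each perfect tandem repeat in seq.

    Hypothetical helper: only exact repeats are reported, whereas TRF
    also tolerates mismatches and indels. min_copies must be at least 2.
    """
    # Lazy unit match so the shortest repeating unit wins (A before AA).
    pattern = re.compile(r"([ACGT]{1,%d}?)\1{%d,}" % (max_unit, min_copies - 1))
    hits = []
    for m in pattern.finditer(seq.upper()):
        unit = m.group(1)
        hits.append((m.start(), unit, len(m.group(0)) // len(unit)))
    return hits


def repeat_class(unit_length):
    """Size classes as used in the abstract: 1-6 bp vs. 7 bp and longer."""
    return "microsatellite" if unit_length <= 6 else "minisatellite"


def repeat_fraction(hits, seq_length):
    """Share of the sequence covered by the reported repeats."""
    covered = sum(len(unit) * copies for _, unit, copies in hits)
    return covered / seq_length


if __name__ == "__main__":
    demo = "GGCATATATATCC"  # made-up sequence, not data from the study
    for start, unit, copies in find_perfect_tandem_repeats(demo):
        print(start, unit, copies, repeat_class(len(unit)))
```

The same coverage ratio, applied to the reported totals, is what the abstract expresses as the percentage of cumulative sequence length occupied by repeats.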

3.
4.
Repeat sequences in various genomes undergo expansion by poorly understood mechanisms. By using an oligonucleotide system containing such repeats, we recapitulated the last steps in Okazaki fragment processing, which have been implicated in sequence expansion. A template containing either triplet or tandem repeats was annealed to a downstream primer containing complementary repeats at its 5'-end. Overlapping upstream primers, designed to strand-displace varying numbers of repeats in the downstream primer, were annealed. Human DNA ligase I joined overlapping segments of repeats generating an expansion product from the primer strands. Joining efficiency decreased with repeat length. Flap endonuclease 1 (FEN1) cleaved the displaced downstream strand and together with DNA ligase I produced non-expanded products. However, both expanded and non-expanded products formed irrespective of relative nuclease and ligase concentrations tested or enzyme addition order, suggesting the pre-existence and persistence of intermediates leading to both outcomes. FEN1 activity decreased with the length of repeat segment displaced presumably because the flap forms structures that inhibit cleavage. Increased MgCl2 disfavored ligation of substrate intermediates that result in expansion products. Examination of expansion in vitro enables dissection of substrate and replication enzyme dynamics on repeat sequences.

5.
We have characterized the heterogeneity occurring at the junction of the long (L) and short (S) segments and at the termini of the strain AD169 human cytomegalovirus (HCMV) genome by restriction endonuclease mapping and nucleotide sequence analyses. The HCMV a sequence was identified by its position at both termini and inverted orientation at the L-S junction. Heterogeneity at both termini and the L-S junction was generated by the presence of fused and tandem a sequences. Some S termini lacked an a sequence. In addition, near the L terminus and at the L-S junction there were a variable number of 217-base-pair (bp) XhoI fragments arranged in tandem. The 217-bp fragments consisted of a portion of the a and adjacent b sequences (in the L-segment repeat) bounded by the same direct repeats (DR1) found at the boundaries of the a sequence. A model for the generation of these heterogeneous fragments is presented. We also determined the sequence of seven cloned terminal fragments, five from the L terminus and two from the S terminus. All L termini contained identical terminal sequences ending with base 32 of a 33-bp DR1. The S termini differed from each other and from the L-segment termini. One S terminus lacked an a sequence and terminated within S-segment repeat (c) sequences. The second S terminus contained an a sequence and terminated with bases 20 to 33 of a 33-bp DR1. A comparison of the cloned L and S terminal sequences with cloned L-S junction sequences suggested that the termini contained 3' single base extensions which were removed during the cloning. We also show that the herpesvirus conserved sequence is in a similar position relative to the termini of HCMV and several other herpesviruses, thus adding further support for the role of the sequence in the maturation of viral DNA.

6.
A computer-aided homology search of databases found that the nucleotide sequences flanking ATLN44, a non-LTR retrotransposon (LINE) from Arabidopsis thaliana, are repeated in the A. thaliana genome. These sequences are homologous to flanking sequences of 664 bp with terminal inverted repeat sequences of about 70 bp. The 664-bp sequence and most of the 14 homologues identified were flanked by direct repeat sequences of 9 bp. These findings indicate that the repeated sequence, named Tnat1, is a transposable element that duplicates a 9-bp sequence at the target site on transposition and that ATLN44 is inserted in one Tnat1 member. Interestingly, all of the Tnat1 members had tandem repeats comprised of several units of a 60-bp sequence, the number of repeats differing among Tnat1 members. Of the Tnat1 members identified, one was inserted into another sequence repeated in the A. thaliana genome: that sequence is about 770 bp long and has terminal inverted repeat sequences of about 110 bp. The sequence is flanked by direct repeats of a 9-bp sequence, indicating that it is another transposable element, named Tnat2, from A. thaliana. Moreover, Tnat2 members had a tandem repeat about 240 bp long. Tnat1 and Tnat2 with tandem repeats in their internal regions show no homology to each other or to any of the elements identified previously; therefore they appear to be novel transposable elements.

7.
Epstein-Barr virus DNA is known to have partially homologous segments, designated DL and DR, near the left and right ends of the long unique region (Raab-Traub et al., Cell 22:257-267, 1980). DL and DR are each partially composed of tandem direct repeat sequences. DL contains 11 to 14 repeats of a 124-base-pair sequence designated IR2. DR contains approximately 30 direct repeats of a 103-base-pair sequence designated IR4. The DL and DR sequences have colinear partial homology for approximately 2.4 and 1.5 kilobase pairs to the right of IR2 and IR4, respectively. IR2 and IR4 are similar sequences and evolved in part from a common ancestor. Both sequences are 84% guanine and cytosine and have limited homology to Epstein-Barr virus IR1 and to the herpes simplex virus type 1 inverted terminal repeat "a" sequence. IR2 encodes part of an abundant 2.5-kilobase persistent early EBV RNA expressed in productively infected cells, but does not encode part of the 3-kilobase Epstein-Barr virus RNA which is transcribed from the adjacent IR1-U2 region of the Epstein-Barr virus genome in latently infected cells.

8.
Several complementary procedures were used to identify and characterize DNA sequences which are repeated within a 44 kilobase (kb) segment of rabbit chromosomal DNA containing four different rabbit β-like globin genes (β1–β4). Cross-hybridization between cloned DNAs from different regions of the gene cluster indicates the presence of a complex array of repeat sequences interspersed with the globin genes. We classified 20 different repeat sequences into five families whose members cross-hybridize. Electron microscopy was used to determine the location, size and relative orientations of many of the repeat sequences. Both direct and inverted repeats were identified, with sizes ranging from 140 to 1400 base pairs (bp). Each of the four closely linked globin genes is flanked by at least one pair of inverted repeats of 140–400 bp, and the entire set of four genes is flanked by an inverted repeat of 1400 bp. Two of the five repeat families contain repeat sequences of different sizes. We found that the smaller sequence elements can occur individually or in association with the larger repeat sequences, suggesting that the larger repeats may be composed of more than one smaller repeat sequence. The restriction fragments containing the intracluster repeats also contain sequences which are repeated many times in total rabbit genomic DNA, but it is not known whether the genomic and intracluster repeats are the same sequences. The results provide the first demonstration of the relationship between single-copy and repetitive DNA sequences in a large segment of chromosomal DNA containing a well characterized set of developmentally regulated genes.

9.
VNTRs in review     
In the last decade the study of human genetic variation has made a quantum leap from the analysis of protein and antigen intermediaries to the investigation of DNA itself.1 The DNA double helix codes genetic information as a sequence of four different nucleotides: adenine (A), guanine (G), cytosine (C), and thymine (T). Nucleotides are nitrogenous bases that bind the complementary strands of the double helix, giving rise to the use of base pairs (bp) as a unit of DNA length. So far so good. Within the human genome there are DNA sequences that do not code for proteins and that consist of short runs of nucleotides, say GTGGACAGG, repeated in tandem hundreds, or even thousands of times. This particular sequence, known as MS1 for minisatellite 1, is found on human chromosome 1 at a locus called D1S7. Although it is old news that there are a lot of repetitive DNAs in the human genome, it is new and very interesting to find that many repetitive DNA loci have arrays of different numbers of repeats in different individuals. These loci are referred to as VNTRs, shorthand for “variable numbers of tandem repeats” or, more flippantly for “very nasty types of repeat.” The finding of hundreds of VNTR loci distributed across all chromosomes exposes a richness of genetic diversity for anthropologists studying human evolutionary history.

10.
The dipteran Chironomus tentans has complex tandemly repeated 350-bp DNA sequences at or near the chromosome ends. As in Drosophila melanogaster, short simple repeats with cytosines and guanines in different strands have never been observed. We were therefore interested in learning whether the Chironomus repeats could have evolved from simple sequence telomeric DNA, which might suggest that they constitute a functional equivalent. We screened for repeat units with evolutionarily ancient features within the tandem arrays and recovered two clones with a less-evolved structure. Sequence analysis reveals that the present-day 350-bp unit probably evolved from a simpler 165-bp unit through the acquisition of transposed sequences. The 165-bp unit contains DNA with a highly biased distribution of cytosine and guanine between the two strands, although with the ratios inverted in two minor parts of the repeat. It is largely built up of short degenerate subrepeats for which most of the sequence can be reconstructed. The consensus for the subrepeat sequence is similar to the simple telomeric repeat sequences of several kinds of eukaryotes. We propose that the present-day unit has evolved from telomeric, simple sequence, asymmetric DNA from which it has retained some original sequence features and possibly functions.

11.
R L Stallings 《Genomics》1992,13(3):890-891
Simple microsatellite repetitive sequences are widely distributed in eukaryotic genomes. Using the GCG Find program, the distribution of each type of mono- and dinucleotide repetitive sequence has been examined in GenBank sequences. Examples of each type of simple satellite sequence could be found, although the frequency of (CpG)n repeats with n ≥ 8 was extremely low. The suppression of CpG dinucleotides in vertebrates does not adequately explain the rarity of this repeat since (CpG)n repeats are also extremely infrequent in species genomes where CpG dinucleotides are not suppressed. Instead, it is proposed that (CpG)n repeats must possess a DNA conformation that has a deleterious structural effect.

12.
13.
Cheng ZJ  Murata M 《Genetics》2003,164(2):665-672
From a wild diploid species that is a relative of wheat, Aegilops speltoides, a 301-bp repeat containing 16 copies of a CAA microsatellite was isolated. Southern blot and fluorescence in situ hybridization revealed that approximately 250 bp of the sequence is tandemly arrayed at the centromere regions of A- and B-genome chromosomes of common wheat and rye chromosomes. Although the DNA sequence of this 250-bp repeat showed no notable homology in the databases, the flanking or intervening sequences between the repeats showed high homologies (>82%) to two separate sequences of the gag gene and its upstream region in cereba, a Ty3/gypsy-like retroelement of Hordeum vulgare. Since the amino acid sequence deduced from the 250 bp with seven CAAs showed some similarity (approximately 53%) to that of the gag gene, we concluded that the 250-bp repeats had also originated from the cereba-like retroelements in diploid wheat such as Ae. speltoides and had formed tandem arrays, whereas the 300-bp repeats were dispersed as a part of cereba-like retroelements. This suggests that some tandem repeats localized at the centromeric regions of cereals and other plant species originated from parts of retrotransposons.

14.
15.
A highly repetitive DNA sequence from tilapia (Oreochromis mossambicus/hornorum) has been cloned and sequenced. It is a tandemly arrayed sequence of 237 bp and constitutes 7% of the fish genome. The copy number of the repeat is approximately 3 × 10^5 per haploid genome. DNA sequence analysis of 7 cloned repeats revealed a high degree of conservation of the monomeric unit. Within the monomeric unit, a 9 bp AT rich motif is regularly spaced approximately 30 bp apart and may represent the progenitor of the amplified sequence. One cloned repeat, Ti-14, contained a 30 bp deletion at a position flanked by a 7 bp direct repeat. The Ti-14 sequence appears to have been amplified independently of the major 237 bp tandem array. A higher-order repeat unit, defined by longer-range periodicities revealed by restriction endonuclease digestion, is further imposed on the tandem array.
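
The three figures in this abstract (237-bp unit, roughly 3 × 10^5 copies per haploid genome, 7% of the genome) can be cross-checked with simple arithmetic. The short Python sketch below does so; the implied haploid genome size is a derived estimate, not a number stated in the abstract, and the variable names are illustrative.

```python
# Values taken from the abstract.
UNIT_BP = 237                    # length of one repeat unit
COPIES_PER_HAPLOID_GENOME = 3e5  # approximate copy number
GENOME_FRACTION = 0.07           # share of the genome occupied by the repeat

# Total DNA in the tandem array: 237 bp x 3e5 copies = 7.11e7 bp.
array_bp = UNIT_BP * COPIES_PER_HAPLOID_GENOME

# Derived estimate (an assumption of this sketch, not a reported value):
# if the array is 7% of the genome, the haploid genome is array / 0.07.
implied_genome_bp = array_bp / GENOME_FRACTION

print(f"array size: {array_bp:.3g} bp")
print(f"implied haploid genome size: {implied_genome_bp:.3g} bp")
```

The implied haploid genome size comes out at roughly 1 × 10^9 bp, so the three reported figures are mutually consistent.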

16.
Informativeness of human (dC-dA)n.(dG-dT)n polymorphisms   Cited by: 133 (self-citations: 0; citations by others: 133)
J L Weber 《Genomics》1990,7(4):524-530
Abundant human interspersed repetitive DNA sequences of the form (dC-dA)n.(dG-dT)n have been shown to exhibit length polymorphisms. Examination of over 100 human (dC-dA)n.(dG-dT)n sequences revealed that the sequences differed from each other both in numbers of repeats and in repeat sequence type. Using a set of precise classification rules, the sequences were divided into three categories: perfect repeat sequences without interruptions in the runs of CA or GT dinucleotides (64% of total), imperfect repeat sequences with one or more interruptions in the run of repeats (25%), and compound repeat sequences with adjacent tandem simple repeats of a different sequence (11%). Informativeness of (dC-dA)n.(dG-dT)n markers in the perfect sequence category was found to increase with increasing average numbers of repeats. PIC values ranged from 0 at about 10 or fewer repeats to above 0.8 for sequences with about 24 or more repeats. (dC-dA)n.(dG-dT)n polymorphisms in the imperfect sequence category showed lower informativeness than expected on the basis of the total numbers of repeats. The longest run of uninterrupted CA or GT repeats was found to be the best predictor of informativeness of (dC-dA)n.(dG-dT)n polymorphisms regardless of the repeat sequence category.
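
Two quantities in this abstract lend themselves to a short Python sketch: the longest uninterrupted run of CA (or GT) repeats, and the PIC value of a marker. The abstract does not give its classification rules or the PIC formula, so the code assumes the widely used definition of polymorphism information content, PIC = 1 - Σ p_i² - Σ_{i<j} 2 p_i² p_j², and both function names are hypothetical.

```python
import re
from itertools import combinations


def longest_dinucleotide_run(seq, unit="CA"):
    """Number of repeats in the longest uninterrupted run of `unit` in seq."""
    runs = re.findall(r"(?:%s)+" % re.escape(unit), seq.upper())
    return max((len(run) // len(unit) for run in runs), default=0)


def pic(allele_freqs):
    """Polymorphism information content from allele frequencies.

    Assumes the standard definition:
    PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2.
    """
    homozygosity = sum(p * p for p in allele_freqs)
    cross_term = sum(2 * (p * q) ** 2 for p, q in combinations(allele_freqs, 2))
    return 1.0 - homozygosity - cross_term


if __name__ == "__main__":
    # Made-up imperfect repeat: two CA runs separated by an interruption.
    print(longest_dinucleotide_run("CACACATTCACA"))
    # Two equally frequent alleles (illustrative frequencies only).
    print(pic([0.5, 0.5]))
```

For an imperfect repeat, the first function counts only the longest stretch, which is the quantity the abstract reports as the best predictor of informativeness.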

17.
G. S. Wilkinson  F. Mayer  G. Kerth  B. Petri 《Genetics》1997,146(3):1035-1048
Analysis of mitochondrial DNA control region sequences from 41 species of bats representing 11 families revealed that repeated sequence arrays near the tRNA-Pro gene are present in all vespertilionine bats. Across 18 species tandem repeats varied in size from 78 to 85 bp and contained two to nine repeats. Heteroplasmy ranged from 15% to 63%. Fewer repeats among heteroplasmic than homoplasmic individuals in a species with up to nine repeats indicates selection may act against long arrays. A lower limit of two repeats and more repeats among heteroplasmic than homoplasmic individuals in two species with few repeats suggests length mutations are biased. Significant regressions of heteroplasmy, θ and π, on repeat number further suggest that repeat duplication rate increases with repeat number. Comparison of vespertilionine bat consensus repeats to mammal control region sequences revealed that tandem repeats of similar size, sequence and number also occur in shrews, cats and bighorn sheep. The presence of two conserved protein-binding sequences in all repeat units indicates that convergent evolution has occurred by duplication of functional units. We speculate that D-loop region tandem repeats may provide signal redundancy and a primitive repair mechanism in the event of somatic mutations to these binding sites.

18.
Peculiar DNA sequences made up by the tandem repetition of a 5 bp unit have been identified within or upstream from three avian protein-coding genes. One sequence is located within an intron of the chicken "ovalbumin-X" gene with 5'-TCTCC-3' as basic repeat unit (36 repeats). Another sequence made of 27 repeats of a 5'-GGAAG-3' basic unit is found 2500 base pairs upstream from the promoter of the chicken ovotransferrin (conalbumin) gene. A related but different sequence is present in the corresponding region of the ovotransferrin gene in the pheasant, with 5'-GGAAA-3' as the basic unit (55 repeats). These three satellite-like elements are thus characterized by a total asymmetry in base distribution, with purines restricted to one strand, and pyrimidines to the other. Two of the basic repeat units can be derived from the third one (GGAAA) by a single base pair change. These related sequences are found repeated in three avian genomes, to degrees that vary both with the sequence type and the genome type. Evolution of tandemly repeated sequences (including satellites) is in general studied by analysing randomly picked elements. The presence of conserved protein-coding regions neighbouring satellite-like sequences allows their evolution to be followed at a single locus, as exemplified by the striking comparison of the pheasant and chicken sequences upstream from the ovotransferrin gene.
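
The relationship among the three 5-bp units can be checked with a small Python sketch. It assumes that TCTCC is compared with GGAAA through its complementary strand (its reverse complement), which is an interpretation made here rather than something the abstract spells out; the helper names are illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
PURINES = {"A", "G"}


def reverse_complement(seq):
    """Sequence of the opposite strand, read 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def hamming(a, b):
    """Number of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def is_all_purine(seq):
    """True if the strand contains only A and G."""
    return set(seq) <= PURINES


if __name__ == "__main__":
    reference = "GGAAA"  # pheasant ovotransferrin unit
    # Assumption: the ovalbumin-X unit is read from its complementary strand.
    units = {"GGAAG": "GGAAG", "TCTCC": reverse_complement("TCTCC")}
    for name, purine_strand in units.items():
        print(name, purine_strand, hamming(purine_strand, reference),
              is_all_purine(purine_strand))
```

Read this way, each of the two chicken units differs from GGAAA at a single position, and all three units have one strand made up of purines only, in line with the strand asymmetry the abstract describes.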

19.
A new rice repetitive DNA shows sequence homology to both 5S RNA and tRNA.   Cited by: 2 (self-citations: 0; citations by others: 2)
T Y Wu  R Wu 《Nucleic acids research》1987,15(15):5913-5923
Moderately repetitive DNA sequences are found in the genomes of all eucaryotes that have been examined. We now report the discovery of a novel, transcribed, moderately repetitive DNA sequence in a higher plant which is different from any of the known repetitive DNA sequences from any organism. We isolated a rice cDNA clone which hybridizes to multiple bands on genomic blot analysis. The sequence of this 352 bp cDNA contains four regions of homology to the wheat phenylalanine tRNA, including the polymerase III-type promoter. Unexpectedly, two regions of the same 352 bp sequence also show homology to the wheat 5S RNA sequence. Using the cDNA as a probe, we have isolated six genomic clones which contain long tandem repeats of 355 bp sequence, and have sequenced nine repeat units. Our findings suggest that the rice repetitive sequence may be an amplified pseudogene with sequence homology to both 5S RNA and tRNA, but organized as long tandem repeats resembling 5S RNA genes. This is the first example showing homology between the sequences of a moderately repetitive DNA with unknown function and 5S RNA.

20.
The long (4.6-kb) A+T region of Drosophila melanogaster mitochondrial DNA has been cloned and sequenced. The A+T region is organized in two large arrays of tandemly repeated DNA sequence elements, with nonrepetitive intervening and flanking sequences comprising only 22% of its length. The first repeat array consists of five repeats of 338-373 bp. The second consists of four intact 464-bp repeats and a fifth partial repeat of 137 bp. Three DNA sequence elements are found to be highly conserved in D. melanogaster and in several Drosophila species with short A+T regions. These include a 300-bp DNA sequence element that overlaps the DNA replication origin and two thymidylate stretches identified on opposite DNA strands. We conclude that the length heterogeneity observed in the A+T regulatory region in mitochondrial DNAs from the genus Drosophila results from the expansion (and contraction) of the number of repeated DNA sequence elements. We also propose that the 300-bp conserved DNA sequence element, in conjunction with another primary sequence determinant, perhaps the adjacent thymidylate stretch, functions in the regulation of mitochondrial DNA replication.
