首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
We present the first comprehensive analysis of the crocodilian control region. We have analyzed sequences from all three families of Crocodylia (Crocodylidae, Gavialidae, Alligatoridae), incorporating all genera except Paleosuchus and Melanosuchus. Within the control region of other vertebrates, several sequence motifs and their order appear to be conserved. Herein, we compare aligned crocodilian D-loop sequences to homologous sequences from other vertebrates ranging from fish to birds. Among other findings, we have discovered that while domain I tends to be shorter than the same region in mammals and birds, it contains sequences similar in structure to both the goose-hairpin and termination associated sequences (TAS). Domain II is highly conservative with regard to size among the taxa examined and contains several of the conserved sequence boxes characterized in other vertebrates. Domain III contains several interesting sequence motifs including tandemly repeated sequences, a long poly-A region in the Crocodylidae, and possible bidirection promoter sequences.  相似文献   

2.
We isolated clones and determined the sequence of portions of mouse and human cellular DNA which cross-hybridize strongly with the IR3 repetitive region of Epstein-Barr virus. The sequences were found to be tandem arrays of a simple sequence based on the triplet GGA, very similar to the IR3 repeat. The cellular repeats have distinct differences from the viral repeat region, however, and their sequences do not appear capable of being translated into a purely glycine-plus-alanine protein domain like the portion of the Epstein-Barr nuclear antigen coded by IR3. Although the relationship between IR3 and the cellular repeats is left unclear, the cellular repeats have many interesting features. The tandem arrays are about 1 to several kilobases long, much shorter than satellite tandem repeats and larger than other interspersed, tandem repeats. Each of the repeats is a distinct variation, perhaps diverged from a common sequence, (GGA)n. This family is present in the genomes of all species tested and appears to be a ubiquitous feature of all higher eucaryotic genomes.  相似文献   

3.
A total of 36 clones were randomly selected from a recombinant DNA library of small polydisperse circular DNA (spcDNA) molecules from HeLa cells and were shown to contain repetitive sequences of different reiteration frequencies that ranged from several hundred to several hundred thousand per genome. Sequencing of representative clones revealed tandem repeats of alphoid (alpha) satellite DNA, clustered repeats of the Alu family, KpnI family sequences, tandem repeats of an alpha satellite DNA specific to the X chromosome (alpha X), and A + T-rich segments carrying short stretches of poly(A) or poly(T). DNA rearrangement was frequently found in the repetitive sequences enriched in these spcDNA clones. Short regions of homology that were patchy and inverted were often found, especially at the novel joint where spcDNA sequences are circularized. The presence of these inverted repeats suggests that HeLa spcDNAs are formed by a mechanism that involves looping out of the spcDNA region and joining of the flanking DNA by illegitimate recombination.  相似文献   

4.
G. S. Wilkinson  F. Mayer  G. Kerth    B. Petri 《Genetics》1997,146(3):1035-1048
Analysis of mitochondrial DNA control region sequences from 41 species of bats representing 11 families revealed that repeated sequence arrays near the tRNA-Pro gene are present in all vespertilionine bats. Across 18 species tandem repeats varied in size from 78 to 85 bp and contained two to nine repeats. Heteroplasmy ranged from 15% to 63%. Fewer repeats among heteroplasmic than homoplasmic individuals in a species with up to nine repeats indicates selection may act against long arrays. A lower limit of two repeats and more repeats among heteroplasmic than homoplasmic individuals in two species with few repeats suggests length mutations are biased. Significant regressions of heteroplasmy, θ and π, on repeat number further suggest that repeat duplication rate increases with repeat number. Comparison of vespertilionine bat consensus repeats to mammal control region sequences revealed that tandem repeats of similar size, sequence and number also occur in shrews, cats and bighorn sheep. The presence of two conserved protein-binding sequences in all repeat units indicates that convergent evolution has occurred by duplication of functional units. We speculate that D-loop region tandem repeats may provide signal redundancy and a primitive repair mechanism in the event of somatic mutations to these binding sites.  相似文献   

5.
Previous reports have interpreted hybridization between snake satellite DNA and DNA clones from a variety of distant taxonomic groups as evidence for evolutionary conservation, which implies common ancestry (homology) and/or convergence (analogy) to produce the cross- hybridizing sequences. We have isolated 11 clones from a genomic library of Drosophila melanogaster, using a cloned 2.5-kb snake satellite probe of known nucleotide sequence. We have also analysed published sequence data from snakes, mice, and Drosophila. These data show that (1) all of the cross-hybridization between the snake, fly, and mouse clones can be accounted for by the presence of either of two tandem repeats, [GATA]n and [GACA]n and (2) these tandem repeats are organized differently among the different species. We find no evidence that these sequences are homologous apart from the existence of the simple repeat itself, although their divergence from a common ancestral sequence cannot be ruled out. The sequences contain a variety of homogeneous clusters of tandem repeats of CATA, GA, TA, and CA, as well as GATA and GACA. We suggest that these motifs may have arisen by a self-accelerating process involving slipped-strand mispairing of DNA. Homogeneity of the clusters might simply be the result of a rate of accumulation of tandem repeats that exceeds that of other mutations.   相似文献   

6.
The complete mitochondrial DNA (mtDNA) control region was amplified and directly sequenced in two species of shrew, Crocidura russula and Sorex araneus (Insectivora, Mammalia). The general organization is similar to that found in other mammals: a central conserved region surrounded by two more variable domains. However, we have found in shrews the simultaneous presence of arrays of tandem repeats in potential locations where repeats tend to occur separately in other mammalian species. These locations correspond to regions which are associated with a possible interruption of the replication processes, either at the end of the three-stranded D-loop structure or toward the end of the heavy-strand replication. In the left domain the repeated sequences (R1 repeats) are 78 bp long, whereas in the right domain the repeats are 12 bp long in C. russula and 14 bp long in S. araneus (R2 repeats). Variation in the copy number of these repeated sequences results in mtDNA control region length differences. Southern blot analysis indicates that level of heteroplasmy (more than one mtDNA form within an individual) differs between species. A comparative study of the R2 repeats in 12 additional species representing three shrew subfamilies provides useful indications for the understanding of the origin and the evolution of these homologous tandemly repeated sequences. An asymmetry in the distribution of variants within the arrays, as well as the constant occurrence of shorter repeated sequences flanking only one side of the R2 arrays, could be related to asymmetry in the replication of each strand of the mtDNA molecule. The pattern of sequence and length variation within and between species, together with the capability of the arrays to form stable secondary structures, suggests that the dominant mechanism involved in the evolution of these arrays in unidirectional replication slippage.   相似文献   

7.
T Pavelitz  D Liao    A M Weiner 《The EMBO journal》1999,18(13):3783-3792
The genes encoding primate U2 snRNA are organized as a nearly perfect tandem array (the RNU2 locus) that has been evolving concertedly for >35 Myr since the divergence of baboons and humans. Thus the repeat units of the tandem array are essentially identical within each species, but differ between species. Homogeneity is maintained because any change in one repeat unit is purged from the array or fixed in all other repeats. Intriguingly, the cytological location of RNU2 has remained unchanged despite concerted evolution of the tandem array. We had found previously that junction sequences between the U2 tandem array and flanking DNA were subject to remodeling over a region of 200-300 bp during the past 5 Myr in the hominid lineage. Here we show that the junctions between the U2 tandem array and flanking DNA have undergone dramatic rearrangements over a region of 1 to >10 kbp in the 35 Myr since divergence of the Old World Monkey and hominid lineages. We argue that these rearrangements reflect the high level of genetic activity required to sustain concerted evolution, and propose a model to explain why maintenance of homogeneity within a tandemly repeated multigene family would lead to junctional diversity.  相似文献   

8.
Mitochondrial DNA (mtDNA) control region (CR) of numerous species is known to include up to five different repetitive sequences (RS1-RS5) that are found at various locations, involving motifs of different length and extensive length heteroplasmy. Two repetitive sequences (RS2 and RS3) on opposite sides of mtDNA central conserved region have been described in domestic cat (Felis catus) and some other felid species. However, the presence of repetitive sequence RS3 has not been detected in Eurasian lynx (Lynx lynx) yet. We analyzed mtDNA CR of 35 Eurasian lynx (L. lynx L.) samples to characterize repetitive sequences and to compare them with those found in other felid species. We confirmed the presence of 80 base pairs (bp) repetitive sequence (RS2) at the 5' end of the Eurasian lynx mtDNA CR L strand and for the first time we described RS3 repetitive sequence at its 3' end, consisting of an array of tandem repeats five to ten bp long. We found that felid species share similar RS3 repetitive pattern and fundamental repeat motif TACAC.  相似文献   

9.
Nuclear ribosomal DNA (nrDNA) constitutes a multicopy gene family that is used widely to test evolutionary hypotheses across a broad range of organisms. It is presumed that, as a result of concerted evolution, tandem nrDNA repeats are homogeneous within species and different between species. We sampled 77 specimens of a disjunct species (Carapichea ipecacuanha) from throughout its three geographic ranges and obtained 266 nrDNA sequences, of which 26 were obtained by direct sequencing and 240 by cloning of PCR products. Complementary sequence analyses, which included analyses of secondary structure stability, the pattern of base substitutions, GC content, and the presence of conserved motifs, were used to characterize the internal transcribed spacer (ITS) region (ITS1-5.8S nrDNA-ITS2). Our results showed that concerted evolution of the ITS region was incomplete in C. ipecacuanha, particularly in the Atlantic range. In the highly polymorphic populations of the Atlantic range, intraindividual variation was observed and involved 56 functional paralogs and 15 pseudogenes from two highly divergent ribogroups. The Amazonian range (with 12 functional paralogs) and the Central-American range (with five functional paralogs) were genetically depauperate and exhibited no pseudogenes. In the two latter ranges, almost complete homogenization of the ITS sequences had occurred. We argue that it is important to consider past evolutionary history when making inferences about the efficiency with which concerted evolution homogenizes tandem nrDNA repeats a single sequence.  相似文献   

10.
We describe a new class of DNA length polymorphism that is due to a variation in the number of tandem repeats associated with Alu sequences (Alu sequence-related polymorphisms). The polymerase chain reaction was used to selectively amplify a (TTA)n repeat identified in the 3-hydroxy-3-methylglutaryl coenzyme A (HMG CoA) reductase gene from genomic DNA of 41 human subjects, and the size of the amplified products was determined by gel electrophoresis. Seven alleles were found that differed in size by integrals of three nucleotides. The allele frequencies ranged from 1.5% to 52%, and the overall heterozygosity index was 62%. The polymorphic TTA repeat was located adjacent to a repetitive sequence of the Alu family. A homology search of human genomic DNA sequences for the trinucleotide TTA (at least five members in length) revealed tandem repeats in six other genes. Three of the six (TTA)n repeats were located adjacent to Alu sequences, and two of the three (in the genes for beta-tubulin and interleukin-1 alpha) were found to be polymorphic in length. Tandemly repetitive sequences found in association with Alu sequences may be frequent sites of length polymorphism that can be used as genetic markers for gene mapping or linkage analysis.  相似文献   

11.
12.
The TaiI family sequences are classified as tandem repetitive DNA sequences present in the genome of tribe Triticeae, and are localized in the centromeric regions of common wheat, but in the subtelomeric heterochromatic regions of Leymus racemosus and related species. In this study, we investigated the chromosomal distribution of TaiI family sequences in other Triticeae species. The results demonstrated a centromeric localization in genera Triticum and Aegilops and subtelomeric localization in other genera, thus showing a genus-dependent localization of TaiI family sequences in one or the other region. The copy numbers of TaiI family sequences in species in the same genus varied greatly, whether in the centromeric or subtelomeric regions (depending on genus). We also examined the evolution of TaiI family sequences during polyploidization of hexaploid common wheat. A comparison of chromosomal locations of the major TaiI family signals in common wheat and in its ancestral species suggested that the centromeric TaiI family sequences in common wheat were inherited from its ancestors with little modification, whereas a mixed origin for the B genome of common wheat was indicated.  相似文献   

13.
14.
The extant crocodylians comprise 23 species divided among three families, Alligatoridae, Crocodylidae, and Gavialidae. Currently, based on morphological data sets, Tomistoma schlegelii (false gharial) is placed within the family Crocodylidae. Molecular data sets consistently support a sister-taxon relationship of T. schlegelii with Gavialis gangeticus (Indian Gharial), which is the sole species in Gavialidae. To elucidate the placement of T. schlegelii within the extant crocodylians, we have sequenced 352bp of the dentin matrix protein 1 (DMP1) nuclear gene in 30 individuals and 424bp of the nuclear gene C-mos in 74 individuals. Molecular analysis of the DMP1 data set indicates that it is highly conserved within the Crocodylia. Of special note is a seven base-pair indel (GTGCTTT) shared by T. schlegelii and G. gangeticus, that is absent in the genus Crocodylus, Osteolaemus, and Mecistops. To date, C-mos is the largest molecular data set analyzed for any crocodylian study including multiple samples from all representatives of the eight extant genera. Analysis of these molecular data sets, both as individual gene sequences and concatenated sequences, support the hypothesis that T. schlegelii should be placed within the family Gavialidae.  相似文献   

15.
We study the length distribution functions for the 16 possible distinct dimeric tandem repeats in DNA sequences of diverse taxonomic partitions of GenBank (known human and mouse genomes, and complete genomes of Caenorhabditis elegans and yeast). For coding DNA, we find that all 16 distribution functions are exponential. For non-coding DNA, the distribution functions for most of the dimeric repeats have surprisingly long tails, that fit a power-law function. We hypothesize that: (i) the exponential distributions of dimeric repeats in protein coding sequences indicate strong evolutionary pressure against tandem repeat expansion in coding DNA sequences; and (ii) long tails in the distributions of dimers in non-coding DNA may be a result of various mutational mechanisms. These long, non-exponential tails in the distribution of dimeric repeats in non-coding DNA are hypothesized to be due to the higher tolerance of non-coding DNA to mutations. By comparing genomes of various phylogenetic types of organisms, we find that the shapes of the distributions are not universal, but rather depend on the specific class of species and the type of a dimer.  相似文献   

16.
Tandem repeats finder: a program to analyze DNA sequences.   总被引:66,自引:3,他引:63       下载免费PDF全文
A tandem repeat in DNA is two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats have been shown to cause human disease, may play a variety of regulatory and evolutionary roles and are important laboratory and analytic tools. Extensive knowledge about pattern size, copy number, mutational history, etc. for tandem repeats has been limited by the inability to easily detect them in genomic sequence data. In this paper, we present a new algorithm for finding tandem repeats which works without the need to specify either the pattern or pattern size. We model tandem repeats by percent identity and frequency of indels between adjacent pattern copies and use statistically based recognition criteria. We demonstrate the algorithm's speed and its ability to detect tandem repeats that have undergone extensive mutational change by analyzing four sequences: the human frataxin gene, the human beta T cellreceptor locus sequence and two yeast chromosomes. These sequences range in size from 3 kb up to 700 kb. A World Wide Web server interface atc3.biomath.mssm.edu/trf.html has been established for automated use of the program.  相似文献   

17.
The recombinant plasmid dpTa1 has an insert of relic wheat DNA that represents a family of tandemly organized DNA sequences with a monomeric length of approximately 340 bp. This insert was used to investigate the structural organization of this element in the genomes of 58 species within the tribe Triticeae and in 7 species representing other tribes of the Poaceae. The main characteristic of the genomic organization of dpTa1 is a classical ladder-type pattern which is typical for tandemly organized sequences. The dpTa1 sequence is present in all of the genomes of the Triticeae species examined and in 1 species from a closely related tribe (Bromus inermis, Bromeae). DNA from Hordelymus europaeus (Triticeae) did not hybridize under the standard conditions used in this study. Prolonged exposure was necessary to obtain a weak signal. Our data suggest that the dpTa1 family is quite old in evolutionary terms, probably more ancient than the tribe Triticeae. The dpTa1 sequence is more abundant in the D-genome of wheat than in other genomes in Triticeae. DNA from several species also have bands in addition to the tandem repeats. The dpTa1 sequence contains short direct and inverted subrepeats and is homologous to a tandemly repeated DNA sequence from Hordeum chilense.  相似文献   

18.
In this study we have identified and characterized dopamine receptor D4 (DRD4) exon III tandem repeats in 33 public available nucleotide sequences from different mammalian species. We found that the tandem repeat in canids could be described in a novel and simple way, namely, as a structure composed of 15- and 12- bp modules. Tandem repeats composed of 18-bp modules were found in sequences from the horse, zebra, onager, and donkey, Asiatic bear, polar bear, common raccoon, dolphin, harbor porpoise, and domestic cat. Several of these sequences have been analyzed previously without a tandem repeat being found. In the domestic cow and gray seal we identified tandem repeats composed of 36-bp modules, each consisting of two closely related 18-bp basic units. A tandem repeat consisting of 9-bp modules was identified in sequences from mink and ferret. In the European otter we detected an 18-bp tandem repeat, while a tandem repeat consisting of 27-bp modules was identified in a sequence from European badger. Both these tandem repeats were composed of 9-bp basic units, which were closely related with the 9-bp repeat modules identified in the mink and ferret. Tandem repeats could not be identified in sequences from rodents. All tandem repeats possessed a high GC content with a strong bias for C. On phylogenetic analysis of the tandem repeats evolutionary related species were clustered into the same groups. The degree of conservation of the tandem repeats varied significantly between species. The deduced amino acid sequences of most of the tandem repeats exhibited a high propensity for disorder. This was also the case with an amino acid sequence of the human DRD4 exon III tandem repeat, which was included in the study for comparative purposes. We identified proline-containing motifs for SH3 and WW domain binding proteins, potential phosphorylation sites, PDZ domain binding motifs, and FHA domain binding motifs in the amino acid sequences of the tandem repeats. The numbers of potential functional sites varied pronouncedly between species. Our observations provide a platform for future studies of the architecture and evolution of the DRD4 exon III tandem repeat, and they suggest that differences in the structure of this tandem repeat contribute to specialization and generation of diversity in receptor function.  相似文献   

19.
We present an analysis of a chromosomal walk in the region of the euchromatin-heterochromatin transition at the base of the X chromosome of Drosophila melanogaster. This region is difficult to analyse because of the presence of repeated sequences, and we have used cosmids to walk from the last euchromatic gene, suppressor of forked, towards the pericentric heterochromatin. The proximal 30-kb sequence we have isolated consists of repetitive DNA, including four tandem copies of a 5.9-kb sequence. This tandem repeat is itself a mosaic of other, mostly repeated, sequences, including part of a retrotransposon without long terminal repeats, a simple-sequence region of TAA repeats and part of a retrotransposon with long terminal repeats that has not been previously described. Although sequences homologous to these components are found elsewhere in the genome, this arrangement of repeated sequences is only found at the base of the X chromosome. It is conserved in D. melanogaster strains of different geographic origin, but is not conserved in even closely related species.  相似文献   

20.
The complete mitochondrial DNA (mtDNA) control region was cloned and sequenced in the musk shrew, Suncus murinus, Insectivora. The general aspect was similar to that found in other mammals. We have found in two locations of this region the presence of arrays of tandem repeats like those in other shrew species. One array was located in the left domain containing the termination-associated sequences (TAS) and the length of a copy was 77 bp. The other repeats were situated upstream from the recognition site for the end of H-strand replication in the right domain and were 20 bp long. The left halves of the control region containing the former repeats were sequenced and compared in several laboratory lines and wild animals from different localities, variations in copy number of repeated sequences were found both among individuals and within an individual. A comparative study of repeated sequences provides useful indication for the origin and evolution of tandem repeated sequences. Strand slippage and mispairing during replication of mtDNA with concerted manner is currently regarded as a dominant theory to account molecular mechanism for tandemly repeated sequences, and the pattern of sequence and length variation in our study supports this theory. Our results, however, suggest that the evolution of the repeated sequences containing the TAS in the musk shrew might go through the process of two steps; at the first step one complete repeated and several incomplete repeated sequences had reproduced in common ancestor of the shrew, and the second stage step-up of complete repeated sequences occurred with concerted evolution after differentiation into continental and insular groups.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号