首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Simple sequence repeats (SSRs) or microsatellites are the repetitive nucleotide sequences of motifs of length 1–6 bp. They are scattered throughout the genomes of all the known organisms ranging from viruses to eukaryotes. Microsatellites undergo mutations in the form of insertions and deletions (INDELS) of their repeat units with some bias towards insertions that lead to microsatellite tract expansion. Although prokaryotic genomes derive some plasticity due to microsatellite mutations they have in-built mechanisms to arrest undue expansions of microsatellites and one such mechanism is constituted by post-replicative DNA repair enzymes MutL, MutH and MutS. The mycobacterial genomes lack these enzymes and as a null hypothesis one could expect these genomes to harbour many long tracts. It is therefore interesting to analyse the mycobacterial genomes for distribution and abundance of microsatellites tracts and to look for potentially polymorphic microsatellites. Available mycobacterial genomes, Mycobacterium avium, M. leprae, M. bovis and the two strains of M. tuberculosis (CDC1551 and H37Rv) were analysed for frequencies and abundance of SSRs. Our analysis revealed that the SSRs are distributed throughout the mycobacterial genomes at an average of 220–230 SSR tracts per kb. All the mycobacterial genomes contain few regions that are conspicuously denser or poorer in microsatellites compared to their expected genome averages. The genomes distinctly show scarcity of long microsatellites despite the absence of a post-replicative DNA repair system. Such severe scarcity of long microsatellites could arise as a result of strong selection pressures operating against long and unstable sequences although influence of GC-content and role of point mutations in arresting microsatellite expansions can not be ruled out. Nonetheless, the long tracts occasionally found in coding as well as non-coding regions may account for limited genome plasticity in these genomes. Supplementary Data pertaining to this article is available on the Journal of Biosciences Website at  相似文献   

2.
Survey of human and rat microsatellites   总被引:44,自引:0,他引:44  
Length variations in simple sequence tandem repeats (microsatellite DNA polymorphisms) are finding increasing usage in mammalian genetics. Although every variety of short tandem repeat that has been tested has been shown to exhibit length polymorphisms, little information on the relative abundance of the different repeat motifs has been collected. In this report, summaries of GenBank searches for all possible human and rat microsatellites ranging from mononucleotide to tetranucleotide repeats are presented. In humans, the five most abundant microsatellites with total lengths for the runs of repeats of greater than or equal to 20 nucleotides contained repeat sequences of A, AC, AAAN, AAN, and AG, in order of decreasing abundance, where N is C, G, or T. These five groups comprised about 76% of all microsatellites. Many other human simple sequence repeats were found at low frequency. In the 745 kb of human genomic DNA surveyed, one microsatellite of greater than or equal to 20 nucleotides in length was found, on average, every 6 kb. Only 12% of the human microsatellites had total lengths greater than or equal to 40 nucleotides. Roughly 80% of the A, AAN, and AAAN microsatellites and 50% of the AT microsatellites, but few of the other human microsatellites, were found to be associated with interspersed, repetitive Alu elements. In rats, the five most abundant microsatellites contained AC, AG, A, AAAN, and AAGG sequences, respectively. Rat microsatellites were generally longer than human microsatellites, with 43% of the rat sequences greater than or equal to 40 nucleotides.  相似文献   

3.
We calculated occurrences of all dinucleotide and trinucleotide microsatellites in the human, mouse, and yeast genomes. The microsatellites were considered separately not only according to the repeated dinucleotide or trinucleotide and the microsatellite length but also according to the starting/terminal nucleotide. The analysis showed that dramatically non-equal amounts occurred in the human genome of microsatellites that differed only by the terminal nucleotides. For example, the 23-mer (TTG)(7)TT occurs 635 times in the human genome whereas (GTT)(7)GT is present only three times in the human genome though the two 23-mers share a 22 nucleotide sequence. The dramatically non-equal occurrences of microsatellites differing only by the terminal nucleotides are observed for most dinucleotide and trinucleotide microsatellites and in all analyzed genomes. We suppose that the strikingly non-equal genomic occurrences of these closely related microsatellites originate from conformational properties of DNA.  相似文献   

4.
Despite their ubiquity and functional importance, microsatellites have been largely ignored in comparative genomics, mostly due to the lack of genomic information. In the current study, microsatellite distribution was characterized and compared in the whole genomes and both the coding and non-coding DNA sequences of the sequenced Brassica, Arabidopsis and other angiosperm species to investigate their evolutionary dynamics in plants. The variation in the microsatellite frequencies of these angiosperm species was much smaller than those for their microsatellite numbers and genome sizes, suggesting that microsatellite frequency may be relatively stable in plants. The microsatellite frequencies of these angiosperm species were significantly negatively correlated with both their genome sizes and transposable elements contents. The pattern of microsatellite distribution may differ according to the different genomic regions (such as coding and non-coding sequences). The observed differences in many important microsatellite characteristics (especially the distribution with respect to motif length, type and repeat number) of these angiosperm species were generally accordant with their phylogenetic distance, which suggested that the evolutionary dynamics of microsatellite distribution may be generally consistent with plant divergence/evolution. Importantly, by comparing these microsatellite characteristics (especially the distribution with respect to motif type) the angiosperm species (aside from a few species) all clustered into two obviously different groups that were largely represented by monocots and dicots, suggesting a complex and generally dichotomous evolutionary pattern of microsatellite distribution in angiosperms. Polyploidy may lead to a slight increase in microsatellite frequency in the coding sequences and a significant decrease in microsatellite frequency in the whole genome/non-coding sequences, but have little effect on the microsatellite distribution with respect to motif length, type and repeat number. Interestingly, several microsatellite characteristics seemed to be constant in plant evolution, which can be well explained by the general biological rules.  相似文献   

5.
Studies on microsatellite distribution and divergence in related genomes contribute towards understanding of genome evolution in eukaryotes. Despite the availability of whole genome sequences of four rice genomes, occurrence and significance of microsatellites in the rice genome has remained a relatively unexplored area of research. We have aligned genomes of two rice subspecies i.e. indica and japonica to understand the trends of microsatellite conservation and divergence in the rice genome. Nearly 62% of the indica microsatellites were also found in the japonica genome. Occurrence of microsatellites showed a negative association with that of retrotransposons. Microsatellites repeat unit length and sequence showed direct influence on the microsatellite locus length. Further, microsatellite allele length was also influenced by the sequence characteristics of the neighbouring regions. CCG repeats were most conserved microsatellite sequences across the different syntenic regions in the two rice genomes and often showed association with CpG islands. Our study suggested that microsatellite distribution is not only governed by a balance between replication slippage and point mutations as proposed earlier, but also by the microsatellite motif sequence and characteristics of microsatellite neighbouring regions in the genome. Thus, this study is likely to prove an important reference for understanding the process of microsatellite evolution and dynamics in the two rice subspecies.  相似文献   

6.
Simple sequence repeats (SSRs), or microsatellites, are special DNA/RNA sequences with repeated unit of 1–6 bp. The genomes of Herpesvirales have many repeating structures, which is an excellent system to study the evolution and roles of microsatellites and compound microsatellites in viruses. Therefore, 56 genomes of Herpesvirales were selected and the occurrence, composition and complexity of different repeats were investigated in the genomes. A total of 63,939 microsatellites and 5825 compound microsatellites were extracted from 56 genomes. It found that GC content has a significant strong correlation with both the counts of microsatellites (CM) and the counts of compound microsatellites (CCM). However, genome size has a moderate correlation only with CM and almost no correlation with CCM. The compound microsatellites occurring in genic regions are obviously more than that in intergenic regions. In general, the number of compound microsatellite decreases with the increase of complexity (C) (the count of individual microsatellites being part of a compound microsatellite) and the complexity hardly exceeds C = 4. The vast majority of compound microsatellites exist in intergenic regions, when C ≥ 10. The distributions of SSRs tend to be organism-specific rather than host-specific in herpesvirus genomes. The diversity of microsatellites and compound microsatellites may be helpful for a better understanding of the viral genetic diversity, genotyping, and evolutionary biology in herpesviruses genomes.  相似文献   

7.
Trace sequences from the 2X alpaca genome sequencing effort were examined to identify simple sequence repeats (microsatellites) for genetic studies. A total of 6,685 repeat-containing sequences were downloaded from GenBank, processed, and assembled into contigs representing an estimated 4,278 distinct sequences. This sequence set contained 2,290 sequences of length > 100 nucleotides that contained microsatellites of length > or = 14 dinucleotide or 10 trinucleotide repeats with purity equal to 100%. An additional 13 sequences contained a GC microsatellite of length > or = 12 repeats (purity = 100%) were also obtained. Primer pairs for amplification of 1,516 putative loci are presented. Amplification of genomic DNA from alpaca and llama by PCR was demonstrated for 14 primer sets including one from each of the microsatellite repeat types. Comparative chromosomal location for the alpaca markers was predicted in the bovine genome by BLAT searches against assembly 4.0 of the bovine whole genome sequence. A total of 634 markers (41.8%) returned BLAT hits with score > 100 and Identity > 85%, with the majority assignable to unique locations. We show that microsatellites are abundant and easily identified within the alpaca genome sequence. These markers will provide a valuable resource for further genetic studies of the alpaca and related species.  相似文献   

8.
通过对桉树属(Eucalyptus)的10000条EST序列进行分析,在其中的1499条序列上共发现1775个微卫星重复序列。含有微卫星的EST序列约占序列总数的15%。此外,还发现桉树EST序列所含微卫星长度的变异速率与重复单元长度呈负相关;微卫星的丰度与重复单元长度也呈负相关(三碱基重复微卫星除外)。在桉树EST序列中,重复单元长度为三碱基的微卫星最为丰富。三碱基重复单元微卫星的过度富集可能是由于遗传密码选择所致。在微卫星的丰度及长度变异方面,桉树EST序列与杨树(Populus trichocarpa)基因组注释的转录序列随重复单元长度的变化呈现出相同的规律,但桉树EST序列中微卫星频率及三碱基重复微卫星的含量显著偏低,推测含微卫星的基因表达丰度极有可能低于不含微卫星的基因。通过对发现的所有微卫星位点进行引物设计,并对设计的引物进行PCR检测,结果表明所设计的引物具有极高的扩增成功率。  相似文献   

9.
桉树EST序列中微卫星含量及相关特征   总被引:6,自引:0,他引:6  
通过对桉树属(Eucalyptus)的10 000条EST序列进行分析, 在其中的1 499条序列上共发现1 775个微卫星重复序列。含有微卫星的EST序列约占序列总数的15%。此外, 还发现桉树EST序列所含微卫星长度的变异速率与重复单元长度呈负相关; 微卫星的丰度与重复单元长度也呈负相关(三碱基重复微卫星除外)。在桉树EST序列中, 重复单元长度为三碱基的微卫星最为丰富。三碱基重复单元微卫星的过度富集可能是由于遗传密码选择所致。在微卫星的丰度及长度变异方面, 桉树EST序列与杨树(Populus trichocarpa)基因组注释的转录序列随重复单元长度的变化呈现出相同的规律, 但桉树EST序列中微卫星频率及三碱基重复微卫星的含量显著偏低, 推测含微卫星的基因表达丰度极有可能低于不含微卫星的基因。通过对发现的所有微卫星位点进行引物设计, 并对设计的引物进行PCR检测, 结果表明所设计的引物具有极高的扩增成功率。  相似文献   

10.
An overview of the character of microsatellites in 14 fungal genomes was obtained by analyzing databases containing complete or nearly complete genome sequences. Low GC content, rather than genome size, was the best predictor of high microsatellite density, although very long iterations of tandem repeats were less common in small genomes. Motif type correlated with %GC in that low-GC genomes were more likely to be dominated by A/T-rich motifs, and vice versa, although some exceptions were noted. The experimentally useful dinucleotide and trinucleotide arrays were analyzed in greater detail. Although these varied in sequence and length among fungal species, some that are likely to be universally useful were identified. This information will be useful for researchers wanting to identify the most useful microsatellites to analyze for the fungi included in this survey and provides a platform for choosing microsatellites to target in fungi that are not yet sequenced.  相似文献   

11.
Archak S  Nagaraju J 《Fly》2007,1(5):279-281
Microsatellites show tremendous variation between genomes in terms of their occurrence and composition. Availability of whole genome sequences allows us to study microsatellite characteristics of fully sequenced insect genomes to understand the evolution and biological significance of microsatellites. InSatDb is an insect microsatellite database that provides an interactive interface to query information on microsatellites annotated with size (in base pairs and repeat units), genomic location (exon, intron, up-stream or transposon), nature (perfect or imperfect), and sequence composition (repeat motif and GC%). Here we present a snapshot of the distribution and composition of microsatellites in introns and exons of insect genomes. The data present interesting observations regarding the microsatellite life-cycle and genome flux.  相似文献   

12.
《Fly》2013,7(5):279-281
Microsatellites show tremendous variation between genomes in terms of their occurrence and composition. Availability of whole genome sequences allows us to study microsatellite characteristics of fully sequenced insect genomes to understand the evolution and biological significance of microsatellites. InSatDb is an insect microsatellite database that provides an interactive interface to query information on microsatellites annotated with size (in base pairs and repeat units); genomic location (exon, intron, up-stream or transposon); nature (perfect or imperfect); and sequence composition (repeat motif and GC%). Here, we present a snap shot of the distribution and composition of microsatellites in introns and exons of insect genomes. The data present interesting observations regarding the microsatellite life-cycle and genome flux.  相似文献   

13.
Microsatellite polymorphisms are invaluable for mapping vertebrate genomes. In order to estimate the occurrence of microsatellites in the rabbit genome and to assess their feasibility as markers in rabbit genetics, a survey on the presence of all types of mononucleotide, dinucleotide, trinucleotide and tetranucleotide repeats, with a length of about 20 bp or more, was conducted by searching the published rabbit DNA sequences in the EMBL nucleotide database (version 32). A total of 181 rabbit microsatellites could be extracted from the present database. The estimated frequency of microsatellites in the rabbit genome was one microsatellite for every 2–3 kb of DNA. Dinucleotide repeats constituted the prevailing class of microsatellites, followed by trinucleotide, mononucleotide and tetranucleotide repeats, respectively. The average length of the microsatellites, as found in the database, was 26, 23, 23 and 22 bp for mono-, di-, tri- and tetranucleotide repeats, respectively. The most common repeat motif was AG, followed by A, AC, AGG and CCG. This group comprised about 70% of all extracted rabbit microsatellites. About 61% of the microsatellites were found in non-coding regions of genes, whereas 15% resided in (protein) coding regions. A significant fraction of rabbit microsatellites (about 22%) was found within interspersed repetitive DNA sequences.  相似文献   

14.
Surveys of variability of homologous microsatellite loci among species reveal an ascertainment bias for microsatellite length where microsatellite loci isolated in one species tend to be longer than homologous loci in related species. Here, we take advantage of the availability of aligned human and chimpanzee genome sequences to compare length difference of homologous microsatellites for loci identified in humans to length difference for loci identified in chimpanzees. We are able to quantify ascertainment bias for a range of motifs and microsatellite lengths. Because ascertainment bias should not exist if a microsatellite selected in one species is as likely to be longer as it is to be shorter than its homologue, we propose that the nature of ascertainment bias can provide evidence for understanding how microsatellites evolve. We show that bias is greater for longer microsatellites but also that many long microsatellites have short homologues. These results are consistent with the notion that growth of long microsatellites is constrained by an upper length boundary that, when reached, sometimes results in large deletions. By evaluating ascertainment bias separately for interrupted and uninterrupted repeats we also show that long microsatellites tend to become interrupted, thereby contributing a second component of ascertainment bias. Having accounted for ascertainment bias, in agreement with results published elsewhere, we find that microsatellites in humans are longer on average than those in chimpanzees. This length difference is similar among repeat motifs but surprisingly comprises two roughly equal components, one associated with the repeats themselves and one with the flanking sequences. The differences we find can only be explained if microsatellites are both evolving directionally under a biased mutation process and are doing so at different rates in different closely related species.  相似文献   

15.
The distribution of microsatellites in exons, and their association with gene ontology (GO) terms is explored to elucidate patterns of microsatellite evolution in the common sunflower, Helianthus annuus. The relative position, motif, size and level of impurity were estimated for each microsatellite in the unigene database available from the Compositae Genome Project (CGP), and statistical analyses were performed to determine if differences in microsatellite distributions and enrichment within certain GO terms were significant. There are more translated than untranslated microsatellites, implying that many bring about structural changes in proteins. However, the greatest density is observed within the UTRs, particularly 5UTRs. Further, UTR microsatellites are purer and longer than coding region microsatellites. This suggests that UTR microsatellites are either younger and under more relaxed constraints, or that purifying selection limits impurities, and directional selection favours their expansion. GOs associated with response to various environmental stimuli including water deprivation and salt stress were significantly enriched with microsatellites. This may suggest that these GOs are more labile in plant genomes, or that selection has favoured the maintenance of microsatellites in these genes over others. This study shows that the distribution of transcribed microsatellites in H. annuus is nonrandom, the coding region microsatellites are under greater constraint compared to the UTR microsatellites, and that these sequences are enriched within genes that regulate plant responses to environmental stress and stimuli.  相似文献   

16.
The MICdb (Microsatellites Database) (http://www.cdfd.org.in/micas) is a comprehensive relational database of non-redundant microsatellites extracted from fully sequenced prokaryotic genomes. The current version (1.0) of the database has been compiled from 83 genomes belonging to different phylogenetic groups. This database has been linked to MICAS, the web-based Microstatellite Analysis Server. MICAS provides a user-friendly front-end to systematically extract data on microsatellite tracts from genomes. The database contains the following information pertaining to the microsatellites: the regions (coding/non-coding, if coding, their GenBank annotations) containing microsatellite tracts; the frequencies of their occurrences, the size and the number of repeating motifs; and the sequences of the tracts. MICAS also provides an interface to Autoprimer, a primer design program to automatically design primers for selected microsatellite loci.  相似文献   

17.
Abstract We analyze published data from 592 AC microsatellite loci from 98 species in five vertebrate classes including fish, reptiles, amphibians, birds, and mammals. We use these data to address nine major questions about microsatellite evolution. First, we find that larger genomes do not have more microsatellite loci and therefore reject the hypothesis that microsatellites function primarily to package DNA into chromosomes. Second, we confirm that microsatellite loci are relatively rare in avian genomes, but reject the hypothesis that this is due to physical constraints imposed by flight. Third, we find that microsatellite variation differs among species within classes, possibly relating to population dynamics. Fourth, we reject the hypothesis that microsatellite structure (length, number of alleles, allele dispersion, range in allele sizes) differs between poikilotherms and homeotherms. The difference is found only in fish, which have longer microsatellites and more alleles than the other classes. Fifth, we find that the range in microsatellite allele size at a locus is largely due to the number of alleles and secondarily to allele dispersion. Sixth, length is a major factor influencing mutation rate. Seventh, there is a directional mutation toward an increase in microsatellite length. Eighth, at the species level, microsatellite and allozyme heterozygosity covary and therefore inferences based on large-scale studies of allozyme variation may also reflect microsatellite genetic diversity. Finally, published microsatellite loci (isolated using conventional hybridization methods) provide a biased estimate of the actual mean repeat length of microsatellites in the genome.  相似文献   

18.
A sequencing strategy is described for the rapid recovery of DNA sequences flanking repeat sequences of microsatellites in plant nuclear genomes. Primers that represent a perfect microsatellite repeat and end in a 3′ degenerate base have been used to sequence directly from microsatellite repeats in one direction. The procedure allows the design of one flanking primer that is then used to sequence back through the repeat to define the microsatellite site precisely and also provides for the design of the second flanking primer. The strategy is versatile with various repeat sizes and different categories of microsatellites; perfect, imperfect, and compound were found to be suitable templates for analysis.  相似文献   

19.
MICAS is a web server for extracting microsatellite information from completely sequenced prokaryote and viral genomes, or user-submitted sequences. This server provides an integrated platform for MICdb (database of prokaryote and viral microsatellites), W-SSRF (simple sequence repeat finding program) and Autoprimer (primer design software). MICAS, through dynamic HTML page generation, helps in the systematic extraction of microsatellite information from selected genomes hosted on MICdb or from user-submitted sequences. Further, it assists in the design of primers with the help of Autoprimer, for sequences containing selected microsatellite tracts.  相似文献   

20.
Microsatellites (SSR--simple sequence repeats, STR--short tandem repeats, SSLP--simple sequence length polymorphism, VNTR--variable number of tandem repeats) are the class of repetitive DNA sequences present in all living organisms. Particular characteristics of microsatellites, such as their presence in the genomes of all living organisms, high level of allelic variation, co-dominant mode of inheritance and potential for automated analysis make them an excellent tool for a number of approaches like genotyping, mapping and positional cloning of genes. The three most popular types of markers containing microsatellite sequences that are presently used are: (1) SSR (simple sequence repeats), generated by amplifying in a PCR reaction with the use of primers complementary to flanking regions; (2) ISSR (inter-simple sequence repeats), based on the amplification of regions between inversely oriented closely spaced microsatellites; and (3) SAMPL (selective amplification of microsatellite polymorphic loci), which utilises AFLP (amplified fragment-length polymorphism) methodology, with one exception--for the second amplification, one of the starters is complementary to the microsatellite sequence. The usefulness of the three above-mentioned markers for numerous purposes has been well documented for plants.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号