Similar literature
20 similar articles found
1.
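Simple sequence repeats, also known as microsatellites, have been applied successfully in genomic and evolutionary studies of many eukaryotes, prokaryotes and viruses, but microsatellites in bacteriophages have rarely been studied. We therefore carried out a comprehensive analysis of microsatellites and compound microsatellites (two or more directly adjacent microsatellites) in 60 Caudovirales genomes. In total, 11,874 microsatellites and 449 compound microsatellites were observed in these 60 genomes. Correlation analysis showed that the number of microsatellites is positively and linearly correlated with genome size (ρ=0.899, P<0.01). The reference sequences contained fewer microsatellites than the corresponding random sequences, an anomaly due mainly to the reference sequences containing fewer mononucleotide and dinucleotide repeats. A/T and AT/TA repeats were the predominant types of mononucleotide and dinucleotide repeats, so the GC content of mononucleotide repeats was markedly lower than that of the corresponding sequences; in contrast, the GC content of dinucleotide and trinucleotide repeats did not differ appreciably from that of the corresponding reference sequences. These results for Caudovirales genomes differ to some extent from those for other organisms, and they help in understanding the distribution, evolution and biological function of microsatellites in Caudovirales.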
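
The definition of a compound microsatellite given above (two or more directly adjacent microsatellites) can be sketched in a few lines of Python. This is only an illustration: the function name `group_compound`, the half-open `(start, end)` interval representation and the `max_gap` parameter are assumptions made here, not details taken from the study.

```python
def group_compound(ssrs, max_gap=0):
    """Group SSRs, given as half-open (start, end) intervals, into compound microsatellites.

    max_gap is the largest number of bases allowed between neighbouring SSRs;
    0 means "directly adjacent" (hypothetical default chosen for this sketch).
    """
    groups, current = [], []
    for start, end in sorted(ssrs):
        # Start a new group when the gap to the previous SSR is too large.
        if current and start - current[-1][1] > max_gap:
            groups.append(current)
            current = []
        current.append((start, end))
    if current:
        groups.append(current)
    # Only runs of two or more neighbouring SSRs count as compound.
    return [g for g in groups if len(g) >= 2]


# Hypothetical coordinates: the first two SSRs touch, the third stands alone.
print(group_compound([(0, 12), (12, 24), (40, 52)]))
```

Raising `max_gap` relaxes the adjacency requirement, which is one way tools could allow a short spacer between the component repeats.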

2.
Microsatellites are the most promising co-dominant markers and are widely distributed throughout the genome. Identifying these repeating genomic subsets is a tedious, iterative process, which makes computational approaches highly useful. Here, 38,083 microsatellites were located in palm sequences. A total of 297,023 sequences retrieved from public domains were used for this study. The sequences were cleaned using the tool Seqclean and subsequently clustered using CAP3. SSRs were located in the sequences using the microsatellite search tool MISA. Repeats were detected in 33,309 sequences, and more than one SSR appeared in 3,943 sequences. Dinucleotide repeats (49%) were the most abundant, followed by mononucleotide (30%) and trinucleotide (19%) repeats. Among the dinucleotides, AG/GA/TC/CT motifs (55.8%) predominated in the palm sequences. This study should support the future development of a specific algorithm for mining SSRs exclusively in palms.
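
To make the SSR search step concrete, here is a minimal regular-expression sketch of perfect-repeat detection in Python. It is not MISA and does not reproduce its options; the function name `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are illustrative assumptions, not the thresholds used in the study.

```python
import re

# Minimum number of repeats per motif length (illustrative assumption).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for size, min_rep in sorted(min_repeats.items()):
        # One motif of `size` bases followed by at least (min_rep - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves repeats of a shorter unit (e.g. "AA", "ATAT").
            if any(motif == motif[:k] * (size // k)
                   for k in range(1, size) if size % k == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // size))
    return sorted(hits)


# Hypothetical test sequence: an (AG)8 tract and an (A)12 tract.
print(find_ssrs("CC" + "AG" * 8 + "TTC" + "A" * 12 + "C"))
```

The sketch reports the motif as it first appears in the sequence; grouping equivalent motifs such as AG/GA/TC/CT, as the abstract does, would need an extra normalisation step.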

3.
Microsatellite polymorphisms are invaluable for mapping vertebrate genomes. In order to estimate the occurrence of microsatellites in the rabbit genome and to assess their feasibility as markers in rabbit genetics, a survey on the presence of all types of mononucleotide, dinucleotide, trinucleotide and tetranucleotide repeats, with a length of about 20 bp or more, was conducted by searching the published rabbit DNA sequences in the EMBL nucleotide database (version 32). A total of 181 rabbit microsatellites could be extracted from the present database. The estimated frequency of microsatellites in the rabbit genome was one microsatellite for every 2–3 kb of DNA. Dinucleotide repeats constituted the prevailing class of microsatellites, followed by trinucleotide, mononucleotide and tetranucleotide repeats, respectively. The average length of the microsatellites, as found in the database, was 26, 23, 23 and 22 bp for mono-, di-, tri- and tetranucleotide repeats, respectively. The most common repeat motif was AG, followed by A, AC, AGG and CCG. This group comprised about 70% of all extracted rabbit microsatellites. About 61% of the microsatellites were found in non-coding regions of genes, whereas 15% resided in (protein) coding regions. A significant fraction of rabbit microsatellites (about 22%) was found within interspersed repetitive DNA sequences.

4.
M Band, M Ron. Animal Genetics, 1994, 25(4): 281-283
A bovine genomic library was screened for the presence of (AGC)n repeats. All isolated AGC repeats were located adjacent to the 3′ end of bovine short interspersed nuclear elements (SINE). Polymerase chain reactions (PCR) using either two unique primers or one unique and one SINE primer produced high-resolution products without the secondary artifact ladders typical of dinucleotide microsatellites. Four AGC microsatellites were found to be polymorphic with 2–4 alleles each and polymorphism information content (PIC) values ranging between 0.26 and 0.49. One microsatellite, AR025, was mapped to chromosome 26 with the CSIRO reference families. Because of the strong association of SINEs with AGC repeats and their high frequency in the genome, SINE-3′ PCR may prove to be a novel source of polymorphic trinucleotide markers in the bovine genome.
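
For readers unfamiliar with PIC, the sketch below computes it from allele frequencies using the commonly cited formula PIC = 1 − Σ p_i² − Σ_{i<j} 2 p_i² p_j². The abstract does not state which formula or frequencies were used, so both the choice of formula and the example frequencies are assumptions made here.

```python
def pic(freqs):
    """Polymorphism information content of one locus from its allele frequencies."""
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    # Expected homozygosity: sum of squared allele frequencies.
    homozygosity = sum(p * p for p in freqs)
    # Correction term summed over all unordered pairs of distinct alleles.
    cross = sum(2 * freqs[i] ** 2 * freqs[j] ** 2
                for i in range(len(freqs))
                for j in range(i + 1, len(freqs)))
    return 1.0 - homozygosity - cross


# Hypothetical locus with two equally frequent alleles.
print(pic([0.5, 0.5]))
```

Under this formula a two-allele locus cannot exceed 0.375, so values near 0.49 would require three or more alleles, which is compatible with the 2–4 alleles reported.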

5.
We report the results of a comprehensive search of Drosophila melanogaster DNA sequences in GenBank for di-, tri-, and tetranucleotide repeats of more than four repeat units, and a DNA library screen for dinucleotide repeats. Dinucleotide repeats are more abundant (66%) than tri- (30%) or tetranucleotide (4%) repeats. We estimate that 1917 dinucleotide repeats with 10 or more repeat units are present in the euchromatic D. melanogaster genome and, on average, they occur once every 60 kb. Relative to many other animals, dinucleotide repeats in D. melanogaster are short. Tri- and tetranucleotide repeats have even fewer repeat units on average than dinucleotide repeats. Our World Wide Web site (http://www.bio.cornell.edu/genetics/aquadro/aquadro.html) posts the complete list of 1298 microsatellites (≥ five repeat units) identified from the GenBank search. We also summarize assay conditions for 70 D. melanogaster microsatellites characterized in previous studies and an additional 56 newly characterized markers.

6.
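Abundance of microsatellites in the whole genome and ESTs of Tribolium castaneum   Cited by: 1 (self-citations: 0; citations by others: 1)
Microsatellites are molecular markers that have been developed intensively in recent years. To advance genetic research on the red flour beetle, Tribolium castaneum (Herbst), we analysed simple sequence repeats with repeat units of 1–6 bases in its whole genome and ESTs, and compared their abundance and distribution. In the ESTs, microsatellites occurred at a frequency of one per 0.87 kb; mononucleotide repeats were the most abundant class (71.25%), followed by hexa-, tri-, tetra-, di- and pentanucleotide repeats at 23.93%, 2.94%, 1.56%, 0.17% and 0.15%, respectively. In the whole genome, microsatellites occurred at a frequency of one per 3.65 kb; hexanucleotide repeats were the most abundant class (61.96%), followed by tri-, tetra-, mono-, penta- and dinucleotide repeats at 14.35%, 13.75%, 4.68%, 3.60% and 1.69%, respectively. Microsatellites rich in A and T predominated, whereas those rich in G and C were less numerous. Further analysis showed that microsatellite abundance was very similar across chromosomes.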

7.
Mining functional microsatellites in legume unigenes   Cited by: 1 (self-citations: 0; citations by others: 1)
Highly polymorphic and transferable microsatellites (SSRs) are important for comparative genomics, genome analysis and phylogenetic studies. Development of novel species-specific microsatellite markers remains a costly and labor-intensive project. Therefore, interest has shifted from genomic to genic markers owing to their high inter-species transferability, as they are developed from conserved coding regions of the genome. This study concentrates on comparative analysis of genic microsatellites in nine important legumes (Arachis hypogaea, Cajanus cajan, Cicer arietinum, Glycine max, Lotus japonicus, Medicago truncatula, Phaseolus vulgaris, Pisum sativum and Vigna unguiculata) and two model plant species (Oryza sativa and Arabidopsis thaliana). Screening of a total of 228090 putative unique sequences spanning 219610522 bp using a microsatellite search tool, MISA, identified 12.18% of the unigenes containing 36248 microsatellite motifs excluding mononucleotide repeats. Frequency of legume unigene-derived SSRs was one SSR in every 6.0 kb of analyzed sequences. The trinucleotide repeats were predominant in all the unigenes with the exception of C. cajan, which showed prevalence of dinucleotide repeats over trinucleotide repeats. Dinucleotide repeats along with trinucleotides accounted for more than 90% of the total microsatellites. Among dinucleotide and trinucleotide repeats, AG and AAG motifs, respectively, were the most frequent. Microsatellite-positive chickpea unigenes were assigned Gene Ontology (GO) terms to identify the possible role of unigenes in various molecular and biological functions. These unigene-based microsatellite markers will prove valuable for recording allelic variance across germplasm collections, gene tagging and searching for putative candidate genes.

8.
Microsatellites or Simple Sequence Repeats (SSRs) are tandem iterations of one to six base pairs, non-randomly distributed throughout prokaryotic and eukaryotic genomes. Limited knowledge is available about distribution of microsatellites in single stranded DNA (ssDNA) viruses, particularly vertebrate infecting viruses. We studied microsatellite distribution in 118 ssDNA virus genomes belonging to three families of vertebrate infecting viruses namely Circoviridae, Parvoviridae, and Anelloviridae, and found that microsatellites constitute an important component of these virus genomes. Mononucleotide repeats were predominant followed by dinucleotide and trinucleotide repeats. A strong positive relationship existed between number of mononucleotide repeats and genome size among all the three virus families. A similar relationship existed for the occurrence of DTTPH (di-, tri-, tetra-, penta- and hexa-nucleotide) repeats in the families Anelloviridae and Parvoviridae only. Relative abundance and relative density of mononucleotide repeats showed a strong positive relationship with genome size in Circoviridae and Parvoviridae. However, in the case of DTTPH repeats, these features showed a strong relationship with genome size in Circoviridae only. On the other hand, relative microsatellite abundance and relative density of mononucleotide repeats were negatively correlated with GC content (%) in Parvoviridae genomes. On the basis of available annotations, our analysis revealed maximum occurrence of mononucleotide as well as DTTPH repeats in the coding regions of these virus genomes. Interestingly, after normalizing the length of the coding and non-coding regions of each virus genome, we found relative density of microsatellites much higher in the non-coding regions. 
We expect that the present study will help to better characterize the stability, genome organization and evolution of these virus classes and may provide useful leads for deciphering the etiopathogenesis of these viruses.
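
The two length-normalised measures used in this abstract can be illustrated as follows. The definitions assumed here (relative abundance as the number of SSRs per kb, relative density as the bp covered by SSRs per kb) are the ones commonly used in SSR surveys; the abstract itself does not spell them out, and all numbers in the example are hypothetical.

```python
def relative_abundance(n_ssrs, seq_len_bp):
    """Number of SSRs per kb of sequence analysed (assumed definition)."""
    return n_ssrs * 1000 / seq_len_bp


def relative_density(ssr_lengths_bp, seq_len_bp):
    """Total bp covered by SSRs per kb of sequence analysed (assumed definition)."""
    return sum(ssr_lengths_bp) * 1000 / seq_len_bp


# Hypothetical genome: 1800 bp coding with 6 SSRs, 200 bp non-coding with 2 SSRs.
coding = relative_abundance(6, 1800)
noncoding = relative_abundance(2, 200)
print(f"coding: {coding:.2f} SSRs/kb, non-coding: {noncoding:.2f} SSRs/kb")
```

The toy numbers show how raw counts can be highest in coding regions while the length-normalised value is higher in non-coding regions, which is the pattern the abstract describes.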

9.
Eucalyptus microsatellites mined in silico: survey and evaluation   Cited by: 1 (self-citations: 0; citations by others: 1)
Eucalyptus is an important short rotation pulpy woody plant, grown widely in the tropics. Many genomic programmes are now underway, leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. These sequences can be utilized for analysis of simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) available in the transcribed genes. In this study, in silico analysis of 15,285 sequences representing partial and full-length mRNA from Eucalyptus species was carried out to assess their use in developing SSRs or microsatellites. A total of 875 EST-SSRs were identified from 772 SSR-containing ESTs. Motif size of 6 for dinucleotide and 5 for trinucleotide, tetranucleotide, and pentanucleotide repeats were considered in locating the microsatellites. The average frequency of identified SSRs was 12.9%. The dinucleotide repeats were the most abundant among the dinucleotide, trinucleotide and tetranucleotide motifs and accounted for 50.9% of the Eucalyptus genome. Primer design analysis showed that 571 sequences with SSRs had sufficient flanking regions for polymerase chain reaction (PCR) primer synthesis. Evaluation of the usefulness of the SSRs showed that EST-derived SSRs can generate polymorphic markers, as all the primers showed allelic diversity among the 16 provenances of E. tereticornis.

10.
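Simple sequence repeats with repeat units of 1–6 bases were identified in the honeybee genome; the frequency of microsatellites in the genome was analysed and compared among chromosomes. Microsatellites occurred at a frequency of one per 0.804 kb in the honeybee genome. Dinucleotide repeats were the most abundant class (26.86%), followed by hexa-, mono-, tri-, tetra- and pentanucleotide repeats at 24.74%, 22.19%, 13.65%, 10.98% and 2.59%, respectively. Microsatellites rich in A and T predominated, whereas those rich in G and C were less numerous. Microsatellite frequency was relatively high on chromosomes 4, 1 and 3 and relatively low on chromosomes 11, 14 and 12.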

11.
An overview of the character of microsatellites in 14 fungal genomes was obtained by analyzing databases containing complete or nearly complete genome sequences. Low GC content, rather than genome size, was the best predictor of high microsatellite density, although very long iterations of tandem repeats were less common in small genomes. Motif type correlated with %GC in that low-GC genomes were more likely to be dominated by A/T-rich motifs, and vice versa, although some exceptions were noted. The experimentally useful dinucleotide and trinucleotide arrays were analyzed in greater detail. Although these varied in sequence and length among fungal species, some that are likely to be universally useful were identified. This information will be useful for researchers wanting to identify the most useful microsatellites to analyze for the fungi included in this survey and provides a platform for choosing microsatellites to target in fungi that are not yet sequenced.

12.
Microsatellites are islands of long repeats of mono-, di- or trinucleotides evenly distributed in the eukaryotic genome with an average distance of 50–100 kb. They display a high degree of length polymorphism and heterozygosity at individual loci, making them highly useful as markers in the development of genomic maps of eukaryotes. In the present work, we examined the dinucleotide repeat motif (dG-dT)n in the genome of the Atlantic salmon, Salmo salar L. The frequency of (dG-dT)n microsatellites in salmon correlates well with earlier published estimates. Cloning and sequencing of 45 salmon microsatellites revealed perfect and imperfect repeats, but no compound microsatellites. The distribution of the number of repeat units in salmon microsatellites differs significantly from that of higher vertebrates. Salmon tends to have more long repeat stretches and fewer intermediate-length repeats.

13.

Background  

Simple sequence repeats (SSRs), microsatellites or polymeric sequences are common in DNA and are important biologically. From mononucleotide to trinucleotide repeats and beyond, they can be found in long (> 6 repeating units) tracts and may be characterized by quantifying the frequencies in which they are found and their tract lengths. However, most of the existing computer programs that find SSR tracts do not include these methods.

14.
Survey of simple sequence repeats in completed fungal genomes   Cited by: 7 (self-citations: 0; citations by others: 7)
The use of simple sequence repeats or microsatellites as genetic markers has become very popular because of their abundance and length variation between different individuals. SSRs are tandem repeat units of 1 to 6 base pairs that are found abundantly in many prokaryotic and eukaryotic genomes. This is the first study examining and comparing SSRs in completely sequenced fungal genomes. We analyzed and compared the occurrences, relative abundance, relative density, most common, and longest SSRs in nine taxonomically different fungal species: Aspergillus nidulans, Cryptococcus neoformans, Encephalitozoon cuniculi, Fusarium graminearum, Magnaporthe grisea, Neurospora crassa, Saccharomyces cerevisiae, Schizosaccharomyces pombe, and Ustilago maydis. Our analysis revealed that, in all of the genomes studied, the occurrence, abundance, and relative density of SSRs varied and was not influenced by the genome sizes. No correlation between relative abundance and the genome sizes was observed, but it was shown that N. crassa, the largest genome analyzed, had the highest relative abundance of SSRs. In most genomes, mononucleotide, dinucleotide, and trinucleotide repeats were more abundant than the longer repeated SSRs. Generally, in each organism, the occurrence, relative abundance, and relative density of SSRs decreased as the repeat unit increased. Furthermore, each organism had its own common and longest SSRs. Our analysis showed that the relative abundance of SSRs in fungi is low compared with the human genome and that longer SSRs in fungi are rare. In addition to providing new information concerning the abundance of SSRs for each of these fungi, the results provide a general source of molecular markers that could be useful for a variety of applications such as population genetics and strain identification of fungal organisms.

15.
16.
Rate and pattern of mutation at microsatellite loci in maize   Cited by: 30 (self-citations: 0; citations by others: 30)
Microsatellites are important tools for plant breeding, genetics, and evolution, but few studies have analyzed their mutation pattern in plants. In this study, we estimated the mutation rate for 142 microsatellite loci in maize (Zea mays subsp. mays) in two different experiments of mutation accumulation. The mutation rate per generation was estimated to be 7.7 × 10⁻⁴ for microsatellites with dinucleotide repeat motifs, with a 95% confidence interval from 5.2 × 10⁻⁴ to 1.1 × 10⁻³. For microsatellites with repeat motifs of more than 2 bp in length, no mutations were detected; so we could only estimate the upper 95% confidence limit of 5.1 × 10⁻⁵ for the mutation rate. For dinucleotide repeat microsatellites, we also determined that the variance of change in the number of repeats (σ_m²) is 3.2. We sequenced 55 of the 73 observed mutations, and all mutations proved to be changes in the number of repeats in the microsatellite or in mononucleotide tracts flanking the microsatellite. There is a higher probability to mutate to an allele of larger size. There is heterogeneity in the mutation rate among dinucleotide microsatellites and a positive correlation between the number of repeats in the progenitor allele and the mutation rate. The microsatellite-based estimate of the effective population size of maize is more than an order of magnitude less than previously reported values based on nucleotide sequence variation.
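
When no mutations are observed, only an upper bound on the rate can be given, as this abstract notes. One standard way to obtain such a bound, sketched below, treats each allele-generation as an independent trial and solves (1 − μ)ⁿ = α for μ. Whether the authors used this exact method is an assumption, and the trial count in the example is hypothetical.

```python
def upper_limit_zero_events(n_trials, confidence=0.95):
    """One-sided upper confidence limit on a per-trial rate when 0 events occur in n_trials.

    Solves (1 - rate) ** n_trials = 1 - confidence for rate.
    """
    alpha = 1.0 - confidence
    return 1.0 - alpha ** (1.0 / n_trials)


# Hypothetical experiment: 100 allele-generations scored, no mutation seen.
print(upper_limit_zero_events(100))
```

For large `n_trials` the result is close to 3 / n (the "rule of three"), which gives a quick way to see how many trials a given upper limit implies.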

17.
We have isolated, characterized and mapped 33 dinucleotide, three trinucleotide and one tetranucleotide repeat loci from the four major chromosomes of Drosophila pseudoobscura. Average inferred repeat unit length of the dinucleotide repeats is 12 repeat units, similar to D. melanogaster. Assays of D. pseudoobscura and populations of its sibling species, D. persimilis, using 10 of these loci show extremely high levels of variation compared with similar studies of dinucleotide repeat variation in D. melanogaster populations. The high levels of variation are consistent with an average mutation rate of approximately 10⁻⁶ per locus per generation and an effective population size of D. pseudoobscura approximately four times larger than that of D. melanogaster. Consistent with allozymes and nucleotide sequence polymorphism, the dinucleotide repeat loci reveal minimal structure across four populations of D. pseudoobscura. Finally, our preliminary recombinational mapping of 24 of these microsatellites suggests that the total recombinational genome size may be larger than previously inferred using morphological mutant markers.

18.
Microsatellite lengths change over evolutionary time through a process of replication slippage. A recently proposed model of this process holds that the expansionary tendencies of slippage mutation are balanced by point mutations breaking longer microsatellites into smaller units and that this process gives rise to the observed frequency distributions of uninterrupted microsatellite lengths. We refer to this as the slippage/point-mutation theory. Here we derive the theory's predictions for interrupted microsatellites comprising regions of perfect repeats, labeled segments, separated by dinucleotide interruptions containing point mutations. These predictions are tested by reference to the frequency distributions of segments of AC microsatellite in the human genome, and several predictions are shown not to be supported by the data, as follows. The estimated slippage rates are relatively low for the first four repeats, and then rise initially linearly with length, in accordance with previous work. However, contrary to expectation and the experimental evidence, the inferred slippage rates decline in segments above 10 repeats. Point mutation rates are also found to be higher within microsatellites than elsewhere. The theory provides an excellent fit to the frequency distribution of peripheral segment lengths but fails to explain why internal segments are shorter. Furthermore, there are fewer microsatellites with many segments than predicted. The frequencies of interrupted microsatellites decline geometrically with microsatellite size measured in number of segments, so that for each additional segment, the number of microsatellites is 33.6% less. Overall we conclude that the detailed structure of interrupted microsatellites cannot be reconciled with the existing slippage/point-mutation theory of microsatellite evolution, and we suggest that microsatellites are stabilized by processes acting on interior rather than on peripheral segments.
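
The geometric decline reported above (33.6% fewer microsatellites for each additional segment) can be written as N_k = N_1 × (1 − 0.336)^(k−1). The short Python sketch below tabulates it; the starting count of 1000 single-segment microsatellites is a hypothetical value chosen for illustration, not a figure from the study.

```python
def expected_counts(n_one_segment, max_segments, decline=0.336):
    """Counts of microsatellites with 1..max_segments segments under a geometric decline."""
    ratio = 1.0 - decline  # fraction retained per additional segment
    return [n_one_segment * ratio ** (k - 1) for k in range(1, max_segments + 1)]


# Hypothetical starting count of 1000 single-segment microsatellites.
for k, n in enumerate(expected_counts(1000, 5), start=1):
    print(f"{k} segment(s): {n:.1f}")
```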

19.
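Distribution of microsatellites in the whole genome of the red jungle fowl (Gallus gallus)   Cited by: 1 (self-citations: 0; citations by others: 1)
This study analysed the number and distribution of microsatellites in the whole genome of the red jungle fowl, Gallus gallus. A total of 282,728 microsatellites with repeat units of 1–6 bases were found, accounting for about 0.49% of the genome sequence (1.1 Gb) and occurring at a frequency of one per 3.89 kb; most were 12–70 bases long. Microsatellite frequency was relatively high on chromosomes 1, 2 and 3, whereas no microsatellites were found on chromosome 32. Among the repeat classes, mononucleotide repeats were the most numerous (184,192; 65.1% of the total), followed by tetra-, di-, tri-, penta- and hexanucleotide repeats at 12.8%, 9.7%, 7.2%, 4.6% and 0.8% of the total, respectively. T, A, AT, GTTT, AAAC, G, C, ATTT, AC, GT, AAAT, ATT, AAC, AAT, GTT, AG, CT, CTTT, AAAG, GTTTT, AAACA, AAGG and CCTT were the predominant repeat motifs in the red jungle fowl genome. This study lays a foundation for isolating and screening microsatellite markers in the red jungle fowl, for studying genetic diversity, and for comparative analysis of microsatellites among species.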
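
The frequency and share figures in this abstract follow directly from the reported counts, as the arithmetic check below suggests. It uses only numbers quoted in the abstract; the variable names are introduced here for illustration, and the genome size is taken as exactly 1.1 Gb, which is a rounding assumption.

```python
total_ssrs = 282_728   # microsatellites reported in the abstract
genome_bp = 1.1e9      # genome size quoted as 1.1 Gb (rounded)
mono_ssrs = 184_192    # mononucleotide repeats reported

# Average spacing between microsatellites, in kb.
kb_per_ssr = genome_bp / total_ssrs / 1000
# Share of mononucleotide repeats among all microsatellites, in percent.
mono_share = 100 * mono_ssrs / total_ssrs

print(f"one SSR per {kb_per_ssr:.2f} kb")
print(f"mononucleotide share: {mono_share:.1f}%")
```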

20.
The sequencing of 831 clones from an enriched microsatellite library of Melaleuca alternifolia (Myrtaceae) yielded 715 inserts containing repeat motifs. The majority of these (98%) were dinucleotide repeats or trinucleotide repeats averaging 22 and 8 repeat motifs respectively. The AG/GA motif was the most common, accounting for 43% of all microsatellites. From a total of 139 primer pairs designed, 102 produced markers within the expected size range. The majority of these (93) were polymorphic. Primer pairs were tested on five selected M. alternifolia genotypes. Loci based on dinucleotide repeats detected on average a greater number of alleles (4.2) than those based on trinucleotide repeats (2.9). The loci described will provide a large pool of polymorphisms useful for population studies, genetic mapping, and possibly application in other Myrtaceae. Received: 28 July 1998 / Accepted: 8 October 1998
