首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
简单重复序列亦称微卫星,被成功应用于许多真核生物、原核生物和病毒的基因组和进化研究,但是噬菌体中的微卫星目前很少被研究。因此对60条尾病毒目基因组中的微卫星和和复合型微卫星(由两个或两个以上直接相邻的微卫星组成)做综合性分析,在这60个基因组中总共观察到11 874个微卫星和449个复合型微卫星。相关性分析表明微卫星个数与基因组大小成正线性相关(ρ=0.899, P<0.01)。参考序列中的微卫星个数少于对应的随机序列中微卫星个数,这种反常现象主要是因为参考序列含有较少的单核苷酸和二核苷酸重复。A/T和AT/TA重复是单核苷酸和二核苷酸重复中最主要的类型,因此单核苷酸重复中的GC含量明显低于相应的序列中的GC含量;相比之下,微卫星中的二核苷酸和三核苷酸重复的GC含量与对应的参考序列的GC含量无明显区别。尾病毒目基因组中的这些结果与其它生物体基因组存在一定的差别。有助于了解尾病毒目中微卫星的分布、进化和生物学功能。  相似文献   

2.
Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447 non-redundant scaffolds. Out of 775.2 Mb sequence examined, a total of 136,345 microsatellites were identified with a density of 5.69 kb per SSR in the G. raimondii genome leading to development of 112,177 primer pairs. The distributions of SSRs in the genome were non-random. Among the different motifs ranging from 1 to 6 bp, penta-nucleotide repeats were most abundant (30.5%), followed by tetra-nucleotide repeats (18.2%) and di-nucleotide repeats (16.9%). Among all identified 457 motif types, the most frequently occurring repeat motifs were poly-AT/TA, which accounted for 79.8% of the total di-nt SSRs, followed by AAAT/TTTA with 51.5% of the total tetra-nucleotede. Further, 18,834 microsatellites were detected from the protein-coding genes, and the frequency of gene containing SSRs was 46.0% in 40,976 genes of G. raimondii. These genome-based SSRs developed in the present study will lay the groundwork for developing large numbers of SSR markers for genetic mapping, gene discovery, genetic diversity analysis, and MAS breeding in cotton.  相似文献   

3.
Eucalyptus microsatellites mined in silico: survey and evaluation   总被引:1,自引:0,他引:1  
Eucalyptus is an important short rotation pulpy woody plant, grown widely in the tropics. Recently, many genomic programmes are underway leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. These sequences can be utilized for analysis of simple sequence repeats (SSRs) and single nucleotide polymorphism (SNPs) available in the transcribed genes. In this study, in silico analysis of 15,285 sequences representing partial and full-length mRNA from Eucalyptus species for their use in developing SSRs or microsatellites were carried out. A total of 875 EST-SSRs were identified from 772 SSR containing ESTs. Motif size of 6 for dinucleotide and 5 for trinucleotide, tetranucleotide, and pentanucleotides were considered in locating the microsatellites. The average frequency of identified SSRs was 12.9%. The dinucleotide repeats were the most abundant among the dinucleotide, trinucleotide and tetranucleotide motifs and accounted for 50.9% of the Eucalyptus genome. Primer designing analysis showed that 571 sequences with SSRs had sufficient flanking regions for polymerase chain reaction (PCR) primer synthesis. Evaluation of the usefulness of the SSRs showed that EST-derived SSRs can generate polymorphic markers as all the primers showed allelic diversity among the 16 provenances of E. tereticornis.  相似文献   

4.
查找出蜜蜂基因组中由1~6个碱基重复单元组成的简单序列重复,分析蜜蜂基因组中微卫星的分布频率,并比较其在各染色体中的分布频率。微卫星在蜜蜂基因组中的分布频率为1/0·804kb,其中二碱基重复序列占26·86%,是最丰富的重复单元,而六、一、三、四、五碱基重复单元序列分别占24·74%,22·19%,13·65%,10·98%,2·59%。同时发现富含A和T碱基的微卫星占主导地位,富含G和C碱基的微卫星数量较少。第4,1,3条染色体微卫星分布频率较高,而第11,14,12条染色体微卫星分布频率较低。  相似文献   

5.
Simple sequence repeats (SSRs), or microsatellites, are special DNA/RNA sequences with repeated unit of 1–6 bp. The genomes of Herpesvirales have many repeating structures, which is an excellent system to study the evolution and roles of microsatellites and compound microsatellites in viruses. Therefore, 56 genomes of Herpesvirales were selected and the occurrence, composition and complexity of different repeats were investigated in the genomes. A total of 63,939 microsatellites and 5825 compound microsatellites were extracted from 56 genomes. It found that GC content has a significant strong correlation with both the counts of microsatellites (CM) and the counts of compound microsatellites (CCM). However, genome size has a moderate correlation only with CM and almost no correlation with CCM. The compound microsatellites occurring in genic regions are obviously more than that in intergenic regions. In general, the number of compound microsatellite decreases with the increase of complexity (C) (the count of individual microsatellites being part of a compound microsatellite) and the complexity hardly exceeds C = 4. The vast majority of compound microsatellites exist in intergenic regions, when C ≥ 10. The distributions of SSRs tend to be organism-specific rather than host-specific in herpesvirus genomes. The diversity of microsatellites and compound microsatellites may be helpful for a better understanding of the viral genetic diversity, genotyping, and evolutionary biology in herpesviruses genomes.  相似文献   

6.
Barley microsatellites: allele variation and mapping   总被引:37,自引:0,他引:37  
Microsatellites have developed into a powerful tool for mapping mammalian genomes and first reports about their use in plants have been published. A database search of 228 barley sequences from GenBank and EMBL was made to determine which simple sequence repeat (SSR) motif prevails in barley. Nearly all types of SSRs were found. The (A)n and (T)n SSRs occurred more often than (C)n and (G)n for n10. Among the dinucleotide repeats, the (CG)n SSRs occurred least often. Trinucleotide repeats did not occur with n>7 and there is no correlation between the GC content in the trinucleotide motifs and the number of observed SSRs. Analysing 15 different microsatellites with 11 barleys yielded 2.1 alleles per microsatellite. Sequencing 25 putative microsatellites showed that the resolution capacity of highquality agarose gels was sufficient to determine differences of only three base paris. Five microsatellites were mapped on three different chromosomes of a barley RFLP map.  相似文献   

7.
In the present study, 3217 UniGene sequences of Neurospora crassa downloaded from the National Center for Biotechnology Information (NCBI) were mined for the identification of microsatellites or simple sequence repeats (SSRs). A total of 287 SSRs detected gives density of 1SSR/14.6 kb of 4187.86 kb sequences mined suggests that only 250 (7.8%) of sequences contained SSRs. Depending on the repeat units, the length of SSRs ranged from 14 to 17 bp for mono-, 14 to 48 bp for di-, 18 to 90 bp for tri-, 24 to 48 bp for tetra-, 30 for penta- and 42 to 48 bp for hexa-nucleotide repeats. Tri-nucleotide repeats were the most frequent repeat type (88.8%) followed by di-nucleotide repeats (5.9%). An attempt was also made with the help of bioinformatics approach to find out primer pairs for identified SSRs and primers were found only for 239 sequences. But, this part needs experimental validation. Annotation of SSRs containing sequences was also carried out.  相似文献   

8.
Environmental Sciences Division, Oak Ridge National Laboratory, TN, USA We mapped and analyzed the microsatellites throughout 284295605 base pairs of the unambiguously assembled sequence scaffolds along 19 chromosomes of the haploid poplar genome. Totally, we found 150985 SSRs with repeat unit lengths between 2 and 5 bp. The established microsatellite physical map demonstrated tr at SSRs were distributed relatively evenly across the genome of Populus. On average, These SSRs occurred every 1883 bp within the poplar genome and the SSR densities in intergenic regions, introns, exons and UTRs were 85.4%, 10.7%, 2.7% and 1.2%, respectively. We took di-, tri-, tetra-and pentamers as the four classes of repeat units and found that the density of each class of SSRs decreased with the repeat unit lengths except for the tetranucleotide repeats. It was noteworthy that the length diversification of microsatellite sequences was negatively correlated with their repeat unit length and the SSRs with shorter repeat units gained repeats faster than the SSRs with longer repeat units. We also found that the GC content of poplar sequence significantly correlated with densities of SSRs with uneven repeat unit lengths (tri-and penta-), but had no significant correlation with densities of SSRs with even repeat unit lengths (di-and tetra-). In poplar genome, there were evidences that the occurrence of different microsatellites was under selection and the GC content in SSR sequences was found to significantly relate to the functional importance of microsatellites.  相似文献   

9.
We mapped and analyzed the microsatellites throughout 284295605 base pairs of the unambiguously assembled sequence scaffolds along 19 chromosomes of the haploid poplar genome. Totally, we found 150985 SSRs with repeat unit lengths between 2 and 5 bp. The established microsatellite physical map demonstrated that SSRs were distributed relatively evenly across the genome of Populus. On average, These SSRs occurred every 1883 bp within the poplar genome and the SSR densities in intergenic regions, introns, exons and UTRs were 85.4%, 10.7%, 2.7% and 1.2%, respectively. We took di-, tri-, tetra-and pentamers as the four classes of repeat units and found that the density of each class of SSRs decreased with the repeat unit lengths except for the tetranucleotide repeats. It was noteworthy that the length diversification of microsatellite sequences was negatively correlated with their repeat unit length and the SSRs with shorter repeat units gained repeats faster than the SSRs with longer repeat units. We also found that the GC content of poplar sequence significantly correlated with densities of SSRs with uneven repeat unit lengths (tri-and penta-), but had no significant correlation with densities of SSRs with even repeat unit lengths (di-and tetra-). In poplar genome, there were evidences that the occurrence of different microsatellites was under selection and the GC content in SSR sequences was found to significantly relate to the functional importance of microsatellites.  相似文献   

10.
We report 12 microsatellites enriched in CT repeats obtained from a genomic library of the lychee (Litchi chinensis Sonn.) cultivar Mauritius. The polymorphisms revealed by these microsatellites were evaluated in a collection of 21 lychee cultivars. A total of 59 fragments were detected with these 12 SSRs, with an average of 4.9 bands/SSR. Three primer pairs seem to amplify more than a single locus. The mean expected and observed heterozygosities over the 9 single-locus SSRs averaged 0.571 (range: 0.137–0.864) and 0.558 (range: 0.169–0.779) respectively. The total value for the probability of identity was 7.53×10-5. In addition, the selected SSRs were used to amplify DNA from four longan cultivars. Eleven of the 12 SSRs produced amplification fragments in longan, and eight of these fragments were polymorphic. All except two of the products amplified from longan were the same size as those amplified from lychee, suggesting a close genetic proximity between the two species. The SSRs studied produced 22 different patterns, allowing the unambiguous identification of 16 lychee and the 4 longan cultivars studied. Discrimination was possible with just four selected microsatellites. Two groups with two and three undistinguishable cultivars were obtained, reflecting probable synonymies. Unweighted pair-group method of artimetic averages (UPGMA) cluster analysis divided the lychee cultivars studied into two main groups, one consisting of ancient cultivars and the other with more diverse recent cultivars. This is the first report of microsatellite development in the Sapindaceae, and the results demonstrate the usefulness of microsatellites for identification, similarity studies and germplasm conservation in lychee and related species.Communicated by H.F. Linskens  相似文献   

11.
微卫星(Microsatellite)是一类由2-6个核苷酸经多次单位串联组成的高度变异重复DNA序列(Schlotterer and Tautz,1992)。它具有按照孟德尔方式分离、突变快、多态信息含量丰富、呈共显性遗传等特点,其核心序列在同一物种中具有保守性,因此,可以根据微卫星的侧翼序列设计合适的引  相似文献   

12.
红原鸡全基因组中微卫星分布规律研究   总被引:1,自引:0,他引:1  
本文对红原鸡Gallus gallus全基因组中微卫星数量及分布规律进行了分析,查找到l~6个碱基重复类型的微卫星序列共282728个,约占全基因组序列(1.1Gb)的0.49%,分布频率为1/3.89kb,微卫星序列的长度主要在12~70个碱基长度范围内。第1、2、3条染色体上微卫星分布频率较高,而32号染色体上无微卫星分布。不同类型微卫星中,单碱基重复类型数目最多,为184192个,占总数的65.1%;其次是四、二、三、五、六碱基重复单元序列,分别占到总数的12.8%、9.7%、7.2%、4.6%、0.8%。T、A、AT、GTTT、AAAC、G、C、ATTT、AC、GT、AAAT、ATT、AAC、AAT、GTT、AG、CT、CTTT、AAAG、GTTTT、AAACA、AAGG、CCTT是红原鸡基因组中最主要的微卫星重复类型。本研究为红原鸡微卫星标记的分离筛选、遗传多样性的研究以及不同物种微卫星的比较分析奠定了基础。  相似文献   

13.
Simple sequence repeats (SSRs) or microsatellites constitute a countable portion of genomes. However, the significance of SSRs in organelle genomes has not been completely understood. The availability of organelle genome sequences allows us to understand the organization of SSRs in their genic and intergenic regions. In the current study we surveyed the patterns of SSRs in mitochondrial genomes of different taxa of plants. A total of 16 mitochondrial genomes, from algae to angiosperms, have been considered to analyze the pattern of simple sequence repeats present in them. Based on study, the mononucleotide repeats of A/T were found to be more prevalent in mitochondrial genomes over other repeat types. The dinucleotides repeats, TA/AT, were the second most numerous, whereas tri-, tetra-, and pentanucleotide repeats were in less number and present in intronic or intergenic portions only. Mononucleotide repeats prevailed in protein-coding exonic portions of all organisms. These results indicates that microsatellite pattern in mitochondrial genomes is different from nuclear genomes and also focuses on organization and diversity at SSR locuses in mitochondrial genomes. This is the novel report of microsatellite polymorphism in plant mitochondrion on whole genome level.  相似文献   

14.
李伟  陈怀谷  李伟  张爱香  陈丽华  姜伟丽 《遗传》2007,29(9):1154-1160
利用公共的真菌基因组数据库资源, 对核盘菌(Sclerotinia sclerotiorum)和灰葡萄孢(Botrytis cinerea)基因组中SSRs的结构类型、分布、丰度及最长序列等进行了系统分析, 并与已经研究过的禾谷镰孢菌(Fusarium graminearum), 稻瘟病菌(Magnaporthe grisea)和黑粉菌(Ustilago maydis)等几种植物病原真菌基因组中的SSRs进行了比较。结果表明: 核盘菌和灰葡萄孢基因组中的SSRs非常丰富, 分别为6 539和8 627个, 并且在结构类型和分布规律上具有一定的相似性; 与其他几种病原真菌相比, 核盘菌和灰葡萄孢基因组中长重复的四、五、六核苷酸基序更为丰富, 从而使得这两种真菌具有更高的变异性。同时, 我们发现真菌基因组中SSRs的丰度与基因组的大小及GC含量没有必然的关系。文章对核盘菌和灰葡萄孢基因组中SSRs的丰度、出现频率及最长基序的分析为快速、便捷地设计多态性丰富的SSRs引物提供了有益的信息。  相似文献   

15.
蚊子全基因组中微卫星的丰度及其分布   总被引:6,自引:0,他引:6  
微卫星是近年大力开发的一种遗传标记,为推进按蚊遗传学相关研究,对按蚊全基因组中由 1~6 个碱基重复单元组成的简单序列重复 ( 微卫星 ) 进行了分析 . 进而对其微卫星的丰度和分布进行了比较分析,也比较了染色体各个区域 ( 外显子、内含子和基因间隔区 ) 之间的分布差异 . 微卫星在按蚊基因组中的比例约占 2.14% ,其中 X 染色体拥有微卫星的密度最大 . 对按蚊基因组中微卫星丰度而言, A 碱基和 C 碱基重复在基因组中丰度相似, AC 单元的丰度是 AG 单元的两倍多,然而 AT 和 CG 单元非常稀少;对于三四碱基而言, AGC, AAAC 和 AAAT 单元最为丰富, ACG, ACT, AGG, CCG, ATGC, CCCG, ACTG, AACT, ACGT, AGAT, CCGG, ACCT 和 AGCT 单元等均很稀少,而一些五碱基重复,在某条甚至某几条染色体中均未分布 . 除两碱基重复单元在 2L 的外显子区域丰度较高外,其他重复单元均在内含子和基因间隔区丰富 . 进一步分析显示,微卫星在每条染色体两臂的丰度和分布存在着很多的相似性 .  相似文献   

16.
Microsatellites or simple sequence repeats (SSRs) are among the genetic markers most widely utilized in research. This includes applications in numerous fields such as genetic conservation, paternity testing, and molecular breeding. Though ordered draft genome assemblies of camels have been announced, including for the Arabian camel, systemic analysis of camel SSRs is still limited. The identification and development of informative and robust molecular SSR markers are essential for marker assisted breeding programs and paternity testing. Here we searched and compared perfect SSRs with 1–6 bp nucleotide motifs to characterize microsatellites for draft genome sequences of the Camelidae. We analyzed and compared the occurrence, relative abundance, relative density, and guanine-cytosine (GC) content in four taxonomically different camelid species: Camelus dromedarius, C. bactrianus, C. ferus, and Vicugna pacos. A total of 546762, 544494, 547974, and 437815 SSRs were mined, respectively. Mononucleotide SSRs were the most frequent in the four genomes, followed in descending order by di-, tetra-, tri-, penta-, and hexanucleotide SSRs. GC content was highest in dinucleotide SSRs and lowest in mononucleotide SSRs. Our results provide further evidence that SSRs are more abundant in noncoding regions than in coding regions. Similar distributions of microsatellites were found in all four species, which indicates that the pattern of microsatellites is conserved in family Camelidae.  相似文献   

17.
An in-silico analysis of simple sequence repeats (SSRs) in 30 species of tobamoviruses was done. SSRs (mono to hexa) were present with variant frequency across species. Compound microsatellites, primarily of variant motifs accounted for up to 11.43% of the SSRs. Motif duplications were observed for A, T, AT, and ACA repeats. (AG)–(TC) was the most prevalent SSR-couple. SSRs were differentially localized in the coding region with ~ 54% on the 128 kDa protein while 20.37% was exclusive to 186 kDa protein. Characterization of such variations is important for elucidating the origin, sequence variations, and structure of these widely used, but incompletely understood sequences.  相似文献   

18.
Plant genomes are complex and contain large amounts of repetitive DNA including microsatellites that are distributed across entire genomes. Whole genome sequences of several monocot and dicot plants that are available in the public domain provide an opportunity to study the origin, distribution and evolution of microsatellites, and also facilitate the development of new molecular markers. In the present investigation, a genome-wide analysis of microsatellite distribution in monocots (Brachypodium, sorghum and rice) and dicots (Arabidopsis, Medicago and Populus) was performed. A total of 797,863 simple sequence repeats (SSRs) were identified in the whole genome sequences of six plant species. Characterization of these SSRs revealed that mono-nucleotide repeats were the most abundant repeats, and that the frequency of repeats decreased with increase in motif length both in monocots and dicots. However, the frequency of SSRs was higher in dicots than in monocots both for nuclear and chloroplast genomes. Interestingly, GC-rich repeats were the dominant repeats only in monocots, with the majority of them being present in the coding region. These coding GC-rich repeats were found to be involved in different biological processes, predominantly binding activities. In addition, a set of 22,879 SSR markers that were validated by e-PCR were developed and mapped on different chromosomes in Brachypodium for the first time, with a frequency of 101 SSR markers per Mb. Experimental validation of 55 markers showed successful amplification of 80% SSR markers in 16 Brachypodium accessions. An online database 'BraMi' (Brachypodium microsatellite markers) of these genome-wide SSR markers was developed and made available in the public domain. The observed differential patterns of SSR marker distribution would be useful for studying microsatellite evolution in a monocot-dicot system. SSR markers developed in this study would be helpful for genomic studies in Brachypodium and related grass species, especially for the map based cloning of the candidate gene(s).  相似文献   

19.
We report the sequence and variability parameters of 16 microsatellite primer pairs obtained from two mango (Mangifera indica L.) genomic libraries after digestion of DNA of the cultivar Tommy Atkins with HaeIII and RsaI and enrichment in CT repeats. Although no significant differences were recorded between the two libraries in the informativeness of the markers obtained, the RsaI library was shown to be more useful than the HaeIII taking into account the efficiency of the library and the feasibility of clone sequencing. The polymorphism revealed by those microsatellites was evaluated in a collection of 28 mango cultivars of different origins. A total of 88 fragments were detected with the 16 simple sequence repeats (SSRs) with an average of 5.5 bands/SSR. Two primer pairs amplified more than a single locus. The mean expected and observed heterozygosities over the 14 single-locus SSRs averaged 0.65 and 0.69 respectively. The total value for the probability of identity was 2.74 × 10−9. The SSRs studied allowed the unambiguous identification of all the mango genotypes studied and this discrimination can be carried out with just three selected microsatellites. UPGMA cluster analysis and Principal coordinates analysis group the genotypes according to their origin and their classification as monoembryonic or polyembryonic types reflecting the pedigree of the cultivars and the movement of mango germplasm. The results demonstrate the usefulness of microsatellites for studies on identification, variability, germplasm conservation, domestication and movement of germplasm in mango.  相似文献   

20.
A Norway spruce (Picea abies K.) cDNA library obtained from vegetative bud tissue was screened for the presence of (AG)n and (AC)n microsatellite repeats. Ten (AG)n and six (AC)n microsatellites were found, with an average length of 25.5 repeat units. Most of the microsatellites are simple perfect repeats. The microsatellite distribution within the clones is clearly non-random, with different classes of repeats lying in different positions relative to the coding region and in a highly conserved orientation. An estimate of the frequency of dinucleotide microsatellites in expressed regions was obtained, showing that SSRs (simple sequence repeats) are found in genes about 20 times less frequently than in random genomic clones, with (AG)n repeats more frequent than (AC)n repeats. Potential applications of these sequences as expressed region-based molecular markers are shown by developing six SSR markers for the detection of natural variation in Norway spruce populations and testing two of them for the identification of illegitimate progenies from a mapping population.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号