首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
为挖掘番薯(Ipomoea)属EST-SSR资源,从NCBI数据库下载23406条甘薯(Ipomoea batatas (L.) Lam.)EST和62282条牵牛(Ipomoea nil (L.) Roth)EST,利用生物信息学软件预处理、去冗余、拼接处理后得到12812条无冗余的甘薯EST(6.70 Mb)和28422条牵牛唯一序列(17.19 Mb)。对这些序列进行SSR搜索,在甘薯上获得328个SSR位点,发生频率为2.56%;牵牛上筛选到962个SSR位点,出现频率为3.38%。甘薯和牵牛EST-SSR具有多个共同特征:在SSR位点中,主要是二核苷酸重复类型,其次是三核苷酸重复;在二核苷酸重复中,出现最多的重复基序为AG/CT,其次是AT/AT;在三核苷酸重复中,主要基序是AAG/CCT;SSR位点的长度主要集中在20~22 bp。结果表明,这些搜索出的EST-SSR重复基序类型丰富、多态性潜能高,具有较高的开发和利用价值。  相似文献   

2.
为全面了解余甘子转录组SSR位点的分布特征和变异规律,本研究利用Illumina Hiseq 4000平台对余甘子叶片转录组进行测序,通过MISA软件对获得的Unigenes进行SSR位点搜索和统计分析。结果发现9 538条包含SSR位点的Unigenes,共检测到9 991个SSR位点,平均每5.49 kB出现1个SSR。单碱基和二碱基为余甘子转录组SSR主要重复类型,分别占SSR总数的42.3%和30.79%。位于基因编码区的SSR位点共有1 731个,出现频率为0.039 SSRs/kB,优势重复类型为三碱基重复。余甘子转录组SSR中共有169种重复基元,其中所占比例最高的是A/T(42.10%),其次是AG/CT(22.91%)和AAG/CTT(5.02%)。SSR各基元的重复次数波动于4~75次,且多数集中于4~20次。重复片段长度≥ 20 bp的SSR占21.20%,且SSR发生频率与片段长度呈显著负相关(P<0.01),相关系数为-0.561。本研究获得的余甘子转录组SSR位点出现频率较高、分布密度较大、低级重复基元较多,重复次数较高、长片段较多,大多数SSR位点的多态性潜能较高,用于余甘子遗传多样性分析的潜力较大,为下一步余甘子转录组SSR标记的大规模开发和群体遗传学研究提供了重要的数据信息,进而为余甘子野生资源的保护和合理开发利用提供了参考依据。  相似文献   

3.
ABSTRACT: BACKGROUND: There are several reports describing thousands of SSR markers in the peanut (Arachis hypogaea L.) genome. There is a need to integrate various research reports of peanut DNA polymorphism into a single platform. Further, because of lack of uniformity in the labeling of these markers across the publications, there is some confusion on the identities of many markers. We describe below an effort to develop a central comprehensive database of polymorphic SSR markers in peanut. FINDINGS: We compiled 1,343 SSR markers as detecting polymorphism (14.5%) within a total of 9,274 markers. Amongst all polymorphic SSRs examined, we found that AG motif (36.5%) was the most abundant followed by AAG (12.1%), AAT (10.9%), and AT (10.3%).The mean length of SSR repeats in dinucleotide SSRs was significantly longer than that in trinucleotide SSRs. Dinucleotide SSRs showed higher polymorphism frequency for genomic SSRs when compared to trinucleotide SSRs, while for EST-SSRs, the frequency of polymorphic SSRs was higher in trinucleotide SSRs than in dinucleotide SSRs. The correlation of the length of SSR and the frequency of polymorphism revealed that the frequency of polymorphism was decreased as motif repeat number increased. CONCLUSIONS: The assembled polymorphic SSRs would enhance the density of the existing genetic maps of peanut, which could also be a useful source of DNA markers suitable for high-throughput QTL mapping and marker-assisted selection in peanut improvement and thus would be of value to breeders. KEYWORDS: SSR, motif, polymorphism, cultivated peanut.  相似文献   

4.
褐飞虱EST资源的微卫星信息分析   总被引:2,自引:0,他引:2  
刘玉娣  侯茂林 《昆虫学报》2010,53(3):239-247
表达序列标签(expressed sequence tags,ESTs)是开发微卫星标记的一个重要的资源。褐飞虱Nilaparvata lugens (Stål) EST序列的公布为开发EST-SSRs提供了宝贵的数据资源,本研究利用生物信息学对NCBI公共数据库中的37 398条褐飞虱ESTs序列进行EST-SSRs特征分析,得到全长为7 619 324 kb的无冗余EST 9 852条。按照3个不同的查找标准在这些序列中搜索SSR。查找结果显示:褐飞虱EST-SSRs主要重复基元以1~3碱基为主,占总EST-SSR的95%以上。在单碱基重复基元中,A/T是占优势的重复基元,在二相重复类型中,AG/CT重复基元出现的频率最多,而AAG/CTT是三相重复中占绝对优势的重复基元。在褐飞虱EST-SSRs中未查找到GC重复基元。以100 bp为参照,在3种查找标准下含有SSR的EST序列中两端侧翼序列均≥100 bp的序列分别为738,89和42个。通过分析褐飞虱EST-SSRs标记可以为褐飞虱和近缘种的SSR标记的开发提供信息,同时通过分析褐飞虱EST-SSRs的分布频率和分布特征可以为昆虫EST-SSRs的研究提供借鉴和参考。  相似文献   

5.
6.
7.
Yu H  Li Q 《The Journal of heredity》2008,99(2):208-214
A total of 147 microsatellite-containing expressed sequence tags (ESTs) (3.63%) were detected from 4053 ESTs of the Pacific oyster (Crassostrea gigas) in GenBank. The average density of simple sequence repeats (SSRs) was 1 per 8.25 kb of EST after redundancy elimination. Dinucleotide repeat motifs appeared to be the most abundant type. Sixteen new polymorphic EST-SSRs were developed. The number of alleles per locus varied from 3 to 12, with an average of 5.9 alleles per locus. Marker transferability was tested on 2 other Crassostrea species, and 14 loci gave successful amplifications in both species. Twenty EST-SSRs were tested on 3 families of C. gigas for examination of inheritance mode of EST-SSRs. Thirty-five tests of segregation ratios revealed 5 significant departures from expected Mendelian ratios, 4 of which confirmed Mendelian expectations when accounting for the presence of null alleles. Null alleles were detected at 3 loci (15.0%) of the 20 loci, and the frequency of null alleles at EST-SSRs was lower than that in genomic SSRs in C. gigas. The results obtained in this study suggest that C. gigas EST-SSRs will complement the currently available genomic SSR markers and may be useful for comparative mapping, marker-assisted selection, and evolutionary studies.  相似文献   

8.
Pineapple (Ananas comosus (L.) Merrill) is the second most important tropical fruit in term of international trade. The availability of whole genomic sequences and expressed sequence tags (ESTs) offers an opportunity to identify and characterize microsatellite or simple sequence repeat (SSR) markers in pineapple. A total of 278,245 SSRs and 41,962 SSRs with an overall density of 728.57 SSRs/Mb and 619.37 SSRs/Mb were mined from genomic and ESTs sequences, respectively. 5′-untranslated regions (5′-UTRs) had the greatest amount of SSRs, 3.6–5.2 fold higher SSR density than other regions. For repeat length, 12 bp was the predominant repeat length in both assembled genome and ESTs. Class I SSRs were underrepresented compared with class II SSRs. For motif length, dinucleotide repeats were the most abundant in genomic sequences, whereas trinucleotides were the most common motif in ESTs. Tri- and hexanucleotides of total SSRs were more prevalent in ESTs than in the whole genome. The SSR frequency decreased dramatically as repeat times increased. AT was the most frequent single motif across the entire genome while AG was the most abundant motif in ESTs. Across six examined plant species, the pineapple genome displayed the highest density, substantially more than the second-place cucumber. Annotation and expression analyses were also conducted for genes containing SSRs. This thorough analysis of SSR markers in pineapple provided valuable information on the frequency and distribution of SSRs in the pineapple genome. This genomic resource will expedite genomic research and pineapple improvement.  相似文献   

9.
柔嫩艾美尔球虫EST序列中SSR的获取及分析   总被引:1,自引:0,他引:1  
对柔嫩艾美尔球虫EST—SSR进行生物信息学分析,共获取Eimeria tenella EST序列34074条,总长度为16.45Mb,小于12bpSSR的ESTs达7651条,从中获得SSR序列19576条、总长度为0.35Mb,EST—SSRs的频率是48.00%,平均相隔S40bp出现一个长度不小于12bp的SSR。在E.tenella的核苷酸重复基元中,2、3、4、5、6和7bp重复序列在基因组中出现的种类分别有11种472条、49种14710条、31种525条、13种25条、21种43条和15种400条,3碱基重复序列是最丰富的重复单元,占总数的75.14%。各种SSRs中富含G、C碱基的重复单元以GCA出现频率最多(28.63%),次为AGC(17.59%),GCT(8.76%),TGC(7.62%),CTG(7.15%)。  相似文献   

10.
Teleost fish genome projects involving model species are resulting in a rapid accumulation of genomic and expressed DNA sequences in public databases. The expressed sequence tags (ESTs) collected in the databases can be mined for the analysis of both structural and functional genomics. In this study, we in silico analyzed 49,430 unigenes representing a total of 692,654 ESTs from four model fish for their potential use in developing simple sequence repeats (SSRs), or microsatellites. After bioinformatical mining, a total of 3,018 EST derived SSRs (EST-SSRs) were identified for 2,335 SSR containing ESTs (SSR-ESTs). The frequency of identified SSR-ESTs ranged from 1.5% for Xiphophorus to 7.3% for zebrafish. The dinucleotide repeat motif is the most abundant SSR, accounting for 47%, 52%, 64%, and 78% for medaka, Fundulus, zebrafish, and Xiphophorus, respectively. Simulation analysis suggests that a majority of these EST-SSRs have sufficient flanking sequences for polymerase chain reaction (PCR) primer design. Comparative DNA sequence analyses of SSR-ESTs identified several cross-species SSRs and sequences that may be used as cross-reference genes in comparative studies. For example, the flanking sequences of one SSR (CTG)n within the pituitary tumor-transforming gene (PTTG) 1 interacting protein (PTTGIP), showed conservation spanning the medaka, Fundulus, human, and mouse genomes. This study provides a large body of information on EST-SSRs that can be useful for the development of polymorphic markers, gene mapping, and comparative genome analysis. Functional analysis of these SSR-ESTs may reveal their role in metabolism and gene evolution of these model species.  相似文献   

11.
Opium poppy (Papaver somniferum L.) is an important pharmaceutical crop with very few genetic marker resources. To expand these resources, we sequenced genomic DNA using pyrosequencing technology and examined the DNA sequences for simple sequence repeats (SSRs). A total of 1,244,412 sequence reads were obtained covering 474 Mb. Approximately half of the reads (52 %) were assembled into 166,724 contigs representing 105 Mb of the opium poppy genome. A total of 23,283 non-redundant SSRs were identified in 18,944 contigs (11.3 % of total contigs). Trinucleotide and tetranucleotide repeats were the most abundant SSR repeats, accounting for 49.0 and 27.9 % of all SSRs, respectively. The AAG/TTC repeat was the most abundant trinucleotide repeat, representing 19.7 % of trinucleotide repeats. Other SSR repeat types were AT-rich. A total of 23,126 primer pairs (98.7 % of total SSRs) were designed to amplify SSRs. Fifty-three genomic SSR markers were tested in 37 opium poppy accessions and seven Papaver species for determination of polymorphism and transferability. Intraspecific polymorphism information content (PIC) values of the genomic SSR markers were intermediate, with an average 0.17, while the interspecific average PIC value was slightly higher, 0.19. All markers showed at least 88 % transferability among related species. This study increases sequence coverage of the opium poppy genome by sevenfold and the number of opium poppy-specific SSR markers by sixfold. This is the first report of the development of genomic SSR markers in opium poppy, and the genomic SSR markers developed in this study will be useful in diversity, identification, mapping and breeding studies in opium poppy.  相似文献   

12.
Microsatellites, or simple sequence repeats (SSRs), are highly polymorphic and universally distributed in eukaryotes. SSRs have been used extensively as sequence tagged markers in genetic studies. Recently, the functional and evolutionary importance of SSRs has received considerable attention. Here we report the mining and characterization of the SSRs in papaya genome. We analyzed SSRs from 277.4 Mb of whole genome shotgun (WGS) sequences, 51.2 Mb bacterial artificial chromosome (BAC) end sequences (BES), and 13.4 Mb expressed sequence tag (EST) sequences. The papaya SSR density was one SSR per 0.7 kb of DNA sequence in the WGS, which was higher than that in BES and EST sequences. SSR abundance was dramatically reduced as the repeat length increased. According to SSR motif length, dinucleotide repeats were the most common motif in class I, whereas hexanucleotides were the most copious in class II SSRs. The tri- and hexanucleotide repeats of both classes were greater in EST sequences compared to genomic sequences. In class I SSR, AT and AAT were the most frequent motifs in BES and WGS sequences. By contrast, AG and AAG were the most abundant in EST sequences. For SSR marker development, 9,860 primer pairs were surveyed for amplification and polymorphism. Successful amplification and polymorphic rates were 66.6% and 17.6%, respectively. The highest polymorphic rates were achieved by AT, AG, and ATG motifs. The genome wide analysis of microsatellites revealed their frequency and distribution in papaya genome, which varies among plant genomes. This complete set of SSRs markers throughout the genome will assist diverse genetic studies in papaya and related species.  相似文献   

13.
Characterization of EST-SSRs in loblolly pine and spruce   总被引:3,自引:0,他引:3  
In the first large study of conifer expressed sequence tag-simple sequence repeats (EST-SSRs), two large conifer EST databases were characterized for EST-SSRs. One database was from “interior spruce” (white and Engelmann spruce in Southern British Columbia) and Sitka spruce, while the other was from loblolly pine. We found 475 and 629 unique EST-SSRs in loblolly pine and spruce, respectively. 3′ ESTs contained 14% more SSRs than 5′ EST reads in loblolly pine and 41% more in spruce. Conifer EST-SSRs differed conspicuously from angiosperm EST-SSRs in several aspects. EST-SSRs were considerably less frequent in conifers (one EST-SSR every ∼50 kb) than in angiosperms (one EST-SSR every ∼20 kb). Dinucleotide repeats were the most abundant repeat class in conifers, while in angiosperms, trinucleotides were most common. Finally, the AT motif was the dominant motif recovered in both conifer species, whereas AG was the most common dinucleotide repeat in angiosperms. Also, as these EST-SSRs in conifers could be developed into useful genetic markers, our work demonstrates the value of large-scale EST sequencing projects for in-silico approaches for marker development.  相似文献   

14.
Nearly 900 SSRs (simple sequence repeats) were identified among 15,000 ESTs (expressed sequence tags) belonging to bread wheat ( Triticum aestivum L.). The SSRs were defined by their minimum length, which ranged from 14 to 21 bp. The maximum length ranged from 24 to 87 bp depending upon the length of the repeat unit itself (1–7 bp). The average density of SSRs was one SSR per 9.2 kb of EST sequence screened. The trinucleotide repeats were the most abundant SSRs detected. As a representative sample, 78 primer pairs were designed, which were also used to screen the dbEST entries for Hordeum vulgare and Triticum tauschii (donor of the D-genome of cultivated wheat) using a cut-off E (expectation) value of 0.01. On the basis of in silico analysis, up to 55.12% of the primer pairs exhibited transferability from Triticum to Hordeum, indicating that the sequences flanking the SSRs are not only conserved within a single genus but also between related genera in Poaceae. Primer pairs for the 78 SSRs were synthesized and used successfully for the study of (1) their transferability to 18 related wild species and five cereal species (barley, oat, rye, rice and maize); and (2) polymorphism between the parents of four mapping populations available with us. A subset of 20 EST-SSR primers was also used to assess genetic diversity in a collection of 52 elite exotic wheat genotypes. This was done with a view to compare their utility relative to other molecular markers (gSSRs, AFLPs, and SAMPL) previously used by us for the same purpose with the same set of 52 bread wheat genotypes. Although only a low level of polymorphism was detected, relative to that observed with genomic SSRs, the study suggested that EST-SSRs can be successfully used for a variety of purposes, and may actually prove superior to SSR markers extracted from genomic libraries for diversity estimation and transferability.Communicated by R. Hagemann  相似文献   

15.
A total of 26,685 unutilized public domain expressed sequence tags (ESTs) of Arachis hypogaea L. were analyzed to give a total of 4442 EST-SSRs, in which 517 ESTs contained more than one simple sequence repeat (SSR). Of these EST-SSRs, 2542 were mononucleotide repeats (MNRs), 803 were dinucleotide repeats (DNRs), 1043 were trinucleotide repeats (TNRs), 40 were tetranucleotide repeats (TtNRs), six were pentanucleotide repeats (PNRs) and eight were hexanucleotide repeats (HNRs). Out of these 4442 EST-SSRs, only 1160 were found to be successful in non-redundant primer design; 1060 were simple SSRs, while the remaining 100 were compound forms. Among all the motifs, MNRs were abundant, followed by TNRs and DNRs. The AAG/CTT motif was the most abundant (~33 %) TNR, while AG/CT was the most abundant DNR. For redundancy and novelty, a stringent criterion deploying three different strategies was used and a total of 782 novel EST-SSRs were added to the public domain of peanut. These novel EST-SSR markers will be useful for qualitative and quantitative trait mapping, marker-assisted selection and genetic diversity studies in cultivated peanut as well as related Arachis species. A subset of 30 novel EST-SSRs was further randomly selected for validation and genotyping studies with eight well-known cultivars and 32 advanced breeding lines (ADBX lines, ADBY lines and ADBZ lines) from Odisha state, India. The number of polymorphic markers among accessions of A. hypogaea was low; however, a set of informative EST-SSR markers detected considerable levels of genetic variability in peanut cultivars and uncharacterized breeding lines collected from Odisha. The 30 newly developed EST-SSRs from Arachis spp. showed ~97 % amplification in Cicer arientinum and 93 % in pigeon pea. Thus, the EST-SSRs developed in this study will be a very useful asset for genetic analysis, comparative genome mapping, population genetic structure and phylogenetic inferences among wild and allied species of Arachis.  相似文献   

16.
亚麻EST序列中SSR标记的筛选   总被引:4,自引:0,他引:4  
利用亚麻NCBI数据库中的7 941条亚麻EST序列进行SSR的筛选,共发现222个SSR,占整个EST数据库的2.73%,其中三核苷酸重复单元的EST-SSR占总SSR的72.1%,二核苷酸和四核苷酸二者出现的频率基本相近,分别占总SSR的14.4%和13.5%.AGAA是四核苷酸中的优势重复类型,占四核苷酸重复类型的67.67%.设计的21对EST-SSR引物中有18对在10个亚麻材料中有扩增产物,占设计引物的85%,有14对产物条带比较清晰并具有多态性.基于SSR标记进行聚类分析,可将10个亚麻材料划分为3个组.本研究建立的亚麻SSR标记,为亚麻遗传多样性鉴定、分子作图等研究提供了一种有效的分子标记系统.  相似文献   

17.
18.
The abundance and inherent potential for variations in simple sequence repeats (SSRs) or microsatellites resulted in valuable source for genetic markers in eukaryotes. We describe the organization and abundance of SSRs in fungus Fusarium graminearum (causative agent for Fusarium head blight or head scab of wheat). We identified 1705 SSRs of various nucleotide repeat motifs in the sequence database of F. graminearum. It is observed that mononucleotide repeats (62%) were most abundant followed by di- (20%) and trinucleotide repeats (14%). It is noted that tetra-, penta- and hexanucleotide repeats accounted for only 4% of SSRs. The estimated frequency of Class I SSRs (perfect repeats ≥20 nucleotides) was one SSR per 124.5 kb, whereas the frequency of Class II (perfect repeats >10 nucleotides and ≫20 nucleotides) was one SSR per 25.6 kb. The dynamics of SSRs will be a powerful tool for taxonomic, phylogenetic, genome mapping and population genetic studies as SSR based markers show high levels of allelic variation, codominant inheritance and ease of analysis.  相似文献   

19.
红原鸡全基因组中微卫星分布规律研究   总被引:1,自引:0,他引:1  
本文对红原鸡Gallus gallus全基因组中微卫星数量及分布规律进行了分析,查找到l~6个碱基重复类型的微卫星序列共282728个,约占全基因组序列(1.1Gb)的0.49%,分布频率为1/3.89kb,微卫星序列的长度主要在12~70个碱基长度范围内。第1、2、3条染色体上微卫星分布频率较高,而32号染色体上无微卫星分布。不同类型微卫星中,单碱基重复类型数目最多,为184192个,占总数的65.1%;其次是四、二、三、五、六碱基重复单元序列,分别占到总数的12.8%、9.7%、7.2%、4.6%、0.8%。T、A、AT、GTTT、AAAC、G、C、ATTT、AC、GT、AAAT、ATT、AAC、AAT、GTT、AG、CT、CTTT、AAAG、GTTTT、AAACA、AAGG、CCTT是红原鸡基因组中最主要的微卫星重复类型。本研究为红原鸡微卫星标记的分离筛选、遗传多样性的研究以及不同物种微卫星的比较分析奠定了基础。  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号