Similar literature
1.
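A total of 87,856 lacquer tree EST sequences were downloaded from the NCBI database for EST-SSR primer development. After sequence processing, assembly, and clustering with MISA software, the 87,856 ESTs were assembled into 3,979 non-redundant sequences. EST sequences containing SSR loci accounted for 4.5% of all ESTs, and 487 SSR (microsatellite) loci were detected in the 3,979 non-redundant sequences, a frequency of 12.2%. Trinucleotide and dinucleotide repeat motifs made up the largest shares of these SSR loci. Using Primer 5.0, 50 EST-SSR primer pairs were successfully designed. All 50 pairs amplified clear electrophoresis bands in 25 lacquer tree individuals, and 18 of them detected polymorphic bands, a rate of 36%.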

2.
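A total of 34,752 Casuarina EST sequences were obtained from the NCBI EST database. Assembly yielded 12,062 non-redundant sequences (unigenes) with a total length of 7,278.578 kb, in which 367 SSR loci distributed across 353 unigenes were identified. The SSR detection rate was 2.93%, the average distance between SSRs was 19.83 kb, and 39 repeat motif types were found. Dinucleotide and trinucleotide repeats were the main types, accounting for 57.77% and 34.60% of all SSRs, respectively. Among dinucleotide motifs, AG/CT was the most frequent (93.87%); among trinucleotide motifs, AAG/CTT was the most frequent (44.09%). Ninety-seven primer pairs were designed for the identified EST-SSR loci, of which 32 amplified effectively. BLASTX analysis showed that 77.3% of the SSR-containing ESTs were homologous to functional sequences in the non-redundant protein database, and among sequences of known function, those from grape made up the largest share (10.4%). GO functional classification showed that 47.3% of the SSR-containing ESTs had at least one GO annotation; most were assigned to the cellular component category, within which the cytoplasm and nucleus terms accounted for the largest shares.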

3.
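SSR information analysis of tobacco EST resources   Total citations: 2 (self-citations: 0; citations by others: 2)
The rapid growth in the number of tobacco ESTs provides a valuable resource for developing new SSR markers. Software analysis removed redundant sequences from 242,683 tobacco ESTs, and 9,339 SSRs were identified in the resulting 211,728 non-redundant ESTs. The average distance between SSRs was about 14.21 kb, the detection rate was 4.41%, and 216 repeat motifs were found. Trinucleotide repeats dominated, at 50.34% of all SSRs, followed by dinucleotide (23.00%) and mononucleotide (16.48%) repeats; every other repeat type accounted for less than 5%. Among all motifs, A/T was the main type, at 14.68% of all repeats, followed by AT/TA (10.49%), AG/TC (9.48%), and AAG/TTC (6.85%). Ten randomly designed EST-SSR primer pairs were used for amplification in six tobacco cultivars; all ten yielded products, and one pair was polymorphic among the six cultivars. This study lays a foundation for establishing and further applying EST-SSR markers in tobacco.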

4.
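With the development of next-generation sequencing, large amounts of transcriptome data and expressed sequence tags (ESTs) have become available resources for developing simple sequence repeat (SSR) markers. In this study, MISA software was used to screen sequences from a longan (Dimocarpus longan) apical bud transcriptome database. A total of 11,546 SSR loci were found in 114,445 longan transcriptome unigenes, an SSR frequency of 10.09%. Of these, 1,975 unigenes contained two or more EST-SSR loci, reported as 17.10% of all SSR loci, and the average distance between SSRs was 7.52 kb. By motif type, dinucleotide (52.11%) and trinucleotide (46.15%) repeats were the most frequent, reported as together accounting for 99.26% of all motif occurrences. The most frequent dinucleotide motif was AG/CT (4,250 loci, 36.81%), and the most frequent trinucleotide motif was AAG/CTT (1,109 loci, 9.61%). Primer design on the 9,571 SSR-containing unigenes yielded 8,347 pairs of SSR locus-specific primers. Fifty EST-SSR primer pairs were randomly selected and synthesized, then screened by PCR using genomic DNA from four longan accessions ('Shixia', 'Chuliang', 'Gushan No. 2', and 'Lidongben') as templates. Twenty-one pairs produced satisfactory PCR products, an effective amplification rate of 42%; 16 pairs gave polymorphic bands, 76.2% of the effective primers. The 16 polymorphic pairs amplified 50 bands in total, 21 of them polymorphic, an average of 1.31 polymorphic fragments per primer pair.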

5.
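Correlation between EST length polymorphism and SSR distribution characteristics in upland cotton   Total citations: 2 (self-citations: 1; citations by others: 1)
Objective: To analyze the correlation between EST length polymorphism and SSR distribution characteristics in upland cotton. Methods: Upland cotton EST sequences were downloaded from the NCBI public database, SSRs were searched with SSRIT, and 20,000 non-redundant EST sequences were analyzed. Results: After removing low-quality and redundant sequences, 7,322 non-redundant ESTs with a total length of 7,363.878 kb were obtained, of which 520 contained SSR loci, 2.60% of the ESTs analyzed. Of the ESTs shorter than 400 bp, 1.46% contained SSRs; of those longer than 400 bp, 8.94% did. Among repeat motifs of 1-6 bp, dinucleotide repeats were the most frequent (63.46% of the total), followed by trinucleotide repeats (34.04%). The dinucleotide types (AG)n and (AT)n and the trinucleotide types (AAG)n, (ACC)n, (ACT)n, and (AAT)n were the main repeat motifs. Conclusion: Cotton EST-SSRs can be used as molecular markers in cotton, and these results provide a basis for the targeted design of EST-SSR primers in upland cotton.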

6.
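EST-SSR information analysis and marker development in flax   Total citations: 3 (self-citations: 0; citations by others: 3)
Compared with genomic SSRs, EST-based EST-SSR molecular markers have their own advantages. In this study, 877 SSR-containing sequences were identified among 11,240 flax (Linum usitatissimum L.) EST sequences, a frequency of 7.8%. Trinucleotide repeats were the most frequent, at 60.1% of all SSR sequences, followed by dinucleotide repeats (21.9%); tetra-, penta-, and hexanucleotide repeats together accounted for 18%. Seventy-three SSR primer pairs were designed from these SSR-containing ESTs and tested by PCR in eight flax accessions. Sixty-three pairs amplified clear bands, a usable primer rate of 86.3%; 17 pairs were polymorphic among the eight accessions, reported as 26.3% of the amplifiable primers.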

7.
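Establishment of EST-SSR markers in rapeseed   Total citations: 12 (self-citations: 0; citations by others: 12)
Among 17,987 non-redundant rapeseed ESTs, 2,083 EST-SSRs were identified, reported as distributed across 2,443 ESTs, with a frequency of 13.58% and an average distance of 4.34 kb. Di- and trinucleotide repeats were the main repeat types; they occurred at similar frequencies and together accounted for 89.05% of all SSRs. AG/CT and AAG/CTT were the dominant di- and trinucleotide motifs, at 84.31% and 37.71% of their respective repeat types. Twenty-three SSR primer pairs were then designed, suitable annealing temperatures were determined for each by gradient PCR, and amplification and polymorphism in 10 rapeseed cultivars were examined by non-denaturing polyacrylamide gel electrophoresis with silver staining. Twenty-one pairs amplified, a usable primer rate of 91.30%; 12 pairs were polymorphic, 57.14% of the amplifiable primers. These results show that establishing SSR markers from rapeseed ESTs is effective and feasible.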

8.
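Development and application of EST-SSR primers in sand pear   Total citations: 4 (self-citations: 0; citations by others: 4)
SSR primers for sand pear were developed from 995 pear EST sequences in the GenBank and GDR databases, and the resulting EST-SSR primers were used to analyze genetic variation in the F1 population of the sand pear cross 'Xizilü' × 'Xishui'. The results were as follows. (1) Sixty SSR loci were distributed across 54 EST sequences, 5.4% of the EST dataset. Dinucleotide motifs were the most frequent (51.7%), followed by trinucleotide motifs (25%). Among the 23 motif classes, AT was the most frequent (32.3%). (2) Of the 25 EST-SSR primer pairs applied to the 'Xizilü' × 'Xishui' F1 population, nine were polymorphic, 36% of the designed primers. For the polymorphic primers, the mean number of alleles (No) in the F1 population was 2, the mean effective number of alleles (Ne) was 1.9324, and the mean observed heterozygosity (Ho) and expected heterozygosity (He) were 1 and 0.4809, respectively.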

9.
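To investigate the polymorphism of EST-SSR markers in the silkworm Bombyx mori, 4,465 EST sequences retrieved for silkworm linkage group 12 were analyzed. Sorting and assembly yielded 581 non-redundant ESTs with a total length of about 480 kb. A total of 154 EST-SSRs were detected in 122 of these sequences, reported as 2.73% of the ESTs studied, an average of one EST-SSR per 3.12 kb. Tri- and tetranucleotide repeats were the dominant types, at 36.36% and 28.57% of the total, respectively, and most were of the perfect form; the mean repeat length was about 16.2 bp and the maximum was 30 bp. Homology analysis found homologous sequences in NCBI for 26 sequences, which together contained 40 SSRs: 14 (35.0%) in the 5′-UTR, 11 (27.5%) in the 3′-UTR, and 15 (37.5%) in the CDS. Eleven primer pairs were designed from the selected microsatellite sequences; eight yielded clear amplification products, and PCR with primer ES1204 was polymorphic across eight silkworm strains. The results indicate that mining SSR markers from the silkworm EST database is a feasible approach.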

10.
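To develop functional EST-SSR markers in tea, bioinformatic methods were used to characterize EST-SSRs in 3,288 tea plant (Camellia sinensis) EST sequences publicly available from NCBI. Removing redundant sequences left 2,083 non-redundant sequences, in which 385 ESTs containing SSRs with various repeat motifs were found, for a total of 486 EST-SSRs, or one SSR every 2.10 kb on average. Among motifs of 2-6 bp, dinucleotide repeats were the most frequent (51.97%), followed by trinucleotide repeats (19.55%). Across all motif types, AG/CT had the highest share (47.74%), followed by AT/TA (4.73%) and AAG/CTT (4.73%). Using Primer 5 software, 206 EST-SSR primer pairs were designed, and 72 randomly chosen pairs were used for SSR amplification: 31 pairs amplified bands, and 29 of those were polymorphic, a polymorphism rate of 93.5%. These EST-SSRs will support genomics research in tea.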

11.
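Identification and analysis of SSRs in Eimeria tenella EST sequences   Total citations: 1 (self-citations: 0; citations by others: 1)
A bioinformatic analysis of EST-SSRs in Eimeria tenella was carried out. A total of 34,074 E. tenella EST sequences with a combined length of 16.45 Mb were retrieved; 7,651 ESTs contained SSRs of at least 12 bp, and 19,576 SSR sequences with a combined length of 0.35 Mb were obtained from them. The reported EST-SSR frequency was 48.00%, with an SSR of at least 12 bp occurring every 840 bp on average. Among the repeat motifs of E. tenella, repeats with units of 2, 3, 4, 5, 6, and 7 bp comprised 11 types (472 sequences), 49 types (14,710), 31 types (525), 13 types (25), 21 types (43), and 15 types (400), respectively. Trinucleotide repeats were the most abundant repeat unit, at 75.14% of the total. Among the SSRs, repeat units rich in G and C were prominent: GCA was the most frequent (28.63%), followed by AGC (17.59%), GCT (8.76%), TGC (7.62%), and CTG (7.15%).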

12.
Xin D, Sun J, Wang J, Jiang H, Hu G, Liu C, Chen Q. Molecular Biology Reports, 2012, 39(9): 9047-9057
Microsatellites, or simple sequence repeats (SSRs), are very useful molecular markers for a number of plant species. We used a new publicly available module (TROLL) to extract microsatellites from the public database of soybean expressed sequence tag (EST) sequences. A total of 12,833 sequences containing di- to penta-type SSRs were identified from 200,516 non-redundant soybean ESTs. On average, one SSR was found per 7.25 kb of EST sequences, with the tri-nucleotide motifs being the most abundant. Primer sequences flanking the SSR motifs were successfully designed for 9,638 soybean ESTs using the software primer3.0 and only 59 pairs of them were found in earlier studies. We synthesized 124 pairs of the primers to determine the polymorphism and heterozygosity among eight genotypes of soybean cultivars, which represented a wide range of the cultivated soybean cultivars. PCR amplification products with anticipated SSRs were obtained with 81 pairs of primers; 36 PCR products appeared to be homozygous and the remaining 45 PCR products appeared to be heterozygous and displayed polymorphism among the eight cultivars. We further analysed the EST sequences containing 45 polymorphic EST-SSR markers using the programs BLASTN and BLASTX. Sequence alignment showed that 29 ESTs have homologous sequences and 15 ESTs could be classified into a Uni-gene cluster with comparatively convincing protein products. Among these 15 ESTs belonging to a Uni-gene cluster, 9 SSRs were located in 3'-UTR, 4 SSRs were located in the intron region and 2 SSRs were located in the CDS region. None of these SSRs was located in the 5'-UTR. These novel SSRs identified in the ESTs of soybean provide useful information for gene mapping and cloning in future studies.

13.
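Distribution characteristics of microsatellites in melon EST sequences   Total citations: 2 (self-citations: 0; citations by others: 2)
After redundancy removal, the 35,547 melon ESTs in GenBank yielded 34,438 non-redundant ESTs with a reported total length of 250.3 Mb. These sequences contained 2,813 microsatellites (simple sequence repeats, SSRs) distributed across 2,107 ESTs, a frequency of 8.16% and an average distance of 8.90 kb. Trinucleotide repeats were the dominant type, at 47.14% of all SSRs, followed by dinucleotide and mononucleotide repeats, at 20.72% and 16.99%, respectively. AAG/TTC was the dominant motif, at 29.26% of all microsatellites, while AG/CT and A/T accounted for 14.61% and 16.25%, respectively. Of all SSRs, 70.32% had 4-10 repeats and 51.12% were 12-20 bp long. The polymorphism potential of these SSRs was also evaluated.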

14.
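Analysis of EST-SSR loci in tobacco   Total citations: 10 (self-citations: 0; citations by others: 10)
Simple sequence repeats (SSRs) in the public tobacco EST database were analyzed with MISA software. A total of 81,757 SSR sequences were obtained from 133,523 EST sequences, with a distance of about 0.92 kb between SSRs. Hexanucleotide repeats were the most abundant (60.3%), while mono-, tri-, tetra-, di-, and pentanucleotide repeats accounted for 20.0%, 11.0%, 4.2%, 2.8%, and 1.7%, respectively. Among mono-, di-, tri-, and tetranucleotide motifs, the most abundant were A/T, AG, AAG, and AAAT, respectively, whereas CG was very rare in coding regions. Redundancy analysis with CAP3 showed no significant difference between redundant and non-redundant tobacco ESTs for these six classes of repeat motif. Ten of the SSR sequences were chosen at random for primer design and PCR amplification in seven tobacco cultivars. All 10 primer pairs yielded PCR products, and eight amplified the expected fragment. Denaturing PAGE of the products from these eight pairs showed that four primer pairs (EB4, EB5, EB6, and EB8) amplified polymorphic bands.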

15.
Simple sequence repeat (SSR) markers are widely used in many plant and animal genomes due to their abundance, hypervariability, and suitability for high-throughput analysis. Development of SSR markers using molecular methods is time consuming, laborious, and expensive. Use of computational approaches to mine ever-increasing sequences such as expressed sequence tags (ESTs) in public databases permits rapid and economical discovery of SSRs. Most such efforts to date have focused on mining SSRs from monocotyledonous ESTs. In this study, we have computationally mined and examined the abundance of SSRs in more than 1.54 million ESTs belonging to 55 dicotyledonous species. The frequency of ESTs containing SSRs among species ranged from 2.65% to 16.82%. Dinucleotide repeats were found to be the most abundant followed by tri- or mono-nucleotide repeats. The motifs A/T, AG/GA/CT/TC, and AAG/AGA/GAA/CTT/TTC/TCT were the predominant mono-, di-, and tri-nucleotide SSRs, respectively. Most of the mononucleotide SSRs contained 15-25 repeats, whereas the majority of the di- and tri-nucleotide SSRs contained 5-10 repeats. The comprehensive SSR survey data presented here demonstrate the potential of in silico mining of ESTs for rapid development of SSR markers for genetic analysis and applications in dicotyledonous crops.

16.
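Distribution characteristics of EST-SSRs in tea and primer development   Total citations: 10 (self-citations: 1; citations by others: 10)
To develop functional EST-SSR markers in tea, bioinformatic methods were used to characterize EST-SSRs in 3,288 tea plant (Camellia sinensis) EST sequences publicly available from NCBI. Removing redundant sequences left 2,083 non-redundant sequences, in which 385 ESTs containing SSRs with various repeat motifs were found, for a total of 486 EST-SSRs, or one SSR every 2.10 kb on average. Among motifs of 2-6 bp, dinucleotide repeats were the most frequent (51.97%), followed by trinucleotide repeats (19.55%). Across all motif types, AG/CT had the highest share (47.74%), followed by AT/TA (4.73%) and AAG/CTT (4.73%). Using Primer 5 software, 206 EST-SSR primer pairs were designed, and 72 randomly chosen pairs were used for SSR amplification: 31 pairs amplified bands, and 29 of those were polymorphic, a polymorphism rate of 93.5%. These EST-SSRs will support genomics research in tea.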

17.
Expressed sequence tags (ESTs) from Coffea canephora leaves and fruits were used to search for types and frequencies of simple sequence repeats (EST–SSRs) with a motif length of 1–6 bp. From a non-redundant (NR) EST set of 5,534 potential unigenes, 6.8% SSR-containing sequences were identified, with an average density of one SSR every 7.73 kb of EST sequences. Trinucleotide repeats were found to be the most abundant (34.34%), followed by di- (25.75%) and hexa-nucleotide (22.04%) motifs. The development of unique genic SSR markers was optimized by a computational approach which allowed us to eliminate redundancy in the original EST set and also to test the specificity of each pair of designed primers. Twenty-five EST–SSRs were developed and used to evaluate cross-species transferability in the Coffea genus. The orthology was supported by the amplicon sequence similarity and the amplification patterns. The >94% identity of flanking sequences revealed high sequence conservation across the Coffea genus. A high level of polymorphic loci was obtained regardless of the species considered (from 75% for C. liberica to 86% for C. canephora). Moreover, the polymorphism revealed by EST–SSR was similar to that exposed by genomic SSR. It is concluded that Coffea ESTs are a valuable resource for microsatellite mining. EST-SSR markers developed from C. canephora sequences can be easily transferred to other Coffea species for which very little molecular information is available. They constitute a set of conserved orthologous markers, which would be ideal for assessing genetic diversity in coffee trees as well as for cross-referencing transcribed sequences in comparative genomics studies.

18.
The growing availability of EST sequences from a range of crop plants provides a potentially valuable source of new DNA markers. We have examined the International Triticeae EST Cooperative database for the presence of dinucleotide and trinucleotide simple sequence repeats. Analysis of 24,344 ESTs identified 388 dinucleotide repeats and 978 trinucleotide repeats in ESTs, representing 1.6% and 4.0% of the total number of ESTs, respectively. To test the utility and cross-species transferability of EST-derived SSR markers, primers were designed to the flanking regions of 41 barley SSRs and used to screen 11 barley and 15 wheat varieties. Sixteen of the barley SSR markers were polymorphic in barley and five were polymorphic in wheat. This represents a relatively high level of transferability of SSR markers between barley and wheat, which has important implications for the development of new markers and comparative mapping of barley, wheat and other cereals. An additional 56 SSRs from wheat ESTs were tested in the same barley and wheat varieties. Four wheat EST SSR markers were polymorphic in wheat and one in barley. Chromosomal locations in barley and wheat were determined for the majority of polymorphic markers.

19.
Turmeric (Curcuma longa L.) (Family: Zingiberaceae) is a perennial rhizomatous herbaceous plant that has been used as a spice since time immemorial. Turmeric plants are also widely known for their medicinal applications. EST-derived SSRs (simple sequence repeats) are a free by-product of the currently expanding EST (expressed sequence tag) databases. SSRs have been widely applied as molecular markers in genetic studies. The development of high-throughput methods for detecting SSRs has given a new dimension to their use as molecular markers. The software tool SciRoKo was used to mine class I SSRs in a Curcuma EST database comprising 12,953 sequences. A total of 568 non-redundant SSR loci were detected, with an average of one SSR per 14.73 kb of EST. Trinucleotide repeats were the most abundant among the 1-6-nucleotide repeat types, accounting for 41.19% of the total, followed by mononucleotide (20.07%) and hexanucleotide repeats (15.14%). Among all the repeat motifs, (A/T)n accounted for the highest proportion, followed by (AGG)n. The detected SSRs can be used to design primers for markers suitable for constructing saturated genetic maps and conducting comparative genomic studies in different Curcuma species.

20.
The detection of simple sequence repeats (SSRs) within expressed sequence tags (ESTs) connects potential microsatellite markers with specific genes, generating Type I markers. Using an in silico approach, we identified 1975 SSRs from the Genome Research on Atlantic Salmon Project EST database. We designed primers to amplify 158 SSRs, of which 65 amplified 76 loci (including 11 duplicated loci). Sixty-one of the 76 loci were variable in 24 Atlantic salmon from seven populations, and 96% of these markers also amplify DNA from other salmonids. Functions for 16 of the SSR associated ESTs have been determined, confirming them as Type I markers.
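
Several of the abstracts above report the same three quantities for an EST set: the number of SSRs found, the share of sequences that contain an SSR, and the average distance between SSRs in kb. The following is a minimal Python sketch of how such figures could be computed for perfect repeats. It is not the algorithm of MISA, SSRIT, TROLL, or SciRoKo; the function names (`find_ssrs`, `summarize`) and the minimum repeat counts in `MIN_REPEATS` are illustrative assumptions, and the studies listed here use differing thresholds and differing definitions of "frequency".

```python
import re

# Minimum number of repeats required per motif length (1-6 bp).
# These thresholds are assumptions for illustration only; they are not
# the settings of any study or tool mentioned in the abstracts.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def is_primitive(motif):
    """Return True if the motif is not itself a repeat of a shorter unit."""
    n = len(motif)
    return all(motif != motif[:d] * (n // d)
               for d in range(1, n) if n % d == 0)


def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for k, min_rep in MIN_REPEATS.items():
        # A k-bp unit followed by at least (min_rep - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_rep - 1))
        pos = 0
        while (m := pattern.search(seq, pos)):
            motif = m.group(1)
            if is_primitive(motif):
                hits.append((m.start(), motif, len(m.group(0)) // k))
                pos = m.end()
            else:
                # e.g. "AA" matched by the 2-bp pattern: skip one base so
                # a real repeat that overlaps this run is not missed.
                pos = m.start() + 1
    return sorted(hits)


def summarize(seqs):
    """Compute SSR count, % of sequences with an SSR, and kb per SSR."""
    per_seq = [find_ssrs(s) for s in seqs]
    n_ssr = sum(len(h) for h in per_seq)
    n_with = sum(1 for h in per_seq if h)
    total_kb = sum(len(s) for s in seqs) / 1000
    return {
        "ssr_count": n_ssr,
        "pct_seqs_with_ssr": 100 * n_with / len(seqs),
        "kb_per_ssr": total_kb / n_ssr if n_ssr else None,
    }


if __name__ == "__main__":
    # Toy sequences, made up for this example.
    demo = [
        "TTGAC" + "AG" * 8 + "CCTGA",
        "ACGTTGCATCGATCGGATTACA",
        "TC" + "AAG" * 5 + "TC",
    ]
    print(summarize(demo))
```

Note that this sketch reports a motif in the rotation where the repeat first starts (for example `GAA` rather than `AAG`) and does not merge reverse-complement classes such as AG/CT, which the abstracts above usually report together.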
