首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Analysis of Microsatellites in Citrus Unigenes   总被引:5,自引:0,他引:5  
Simple sequence repeats (SSRs) were investigated in the unigene sequences from expressed sequence tags (EST) of sweet orange (Citrus sinensis osbeck), trifoliate orange (Poncirus trifoliata Raf.) and other citrus species and cultivars. A total of 37 802 citrus unigene sequences corresponding to 23.29 Mb were searched, resulting in the identification of 8 218 SSRs. Among them there were 4 913 (59.8%) mono-, 1 419 (17.3%) di-, 1 709 (20.8%) tri-, 114 (1.39%) tetra-, 23 (0.28%) penta- and 40 (0.49%) hexa-nucleotide SSRs. The estimated frequency of SSRs was approximately 1/2.8 kb, which could be extrapolated to 1 SSR-containing unigene in 4.6 unigenes. The maximum length of the SSR ranged from 40 to 105 bp depending on the repeating numbers of the motif in the SSR. The overall average length of SSRs was 20.9 bp. The frequencies of different SSR types (di-, tri-, tetra-, and penta-nucleotide repeats) were very similar between sweet orange and trifoliate orange. The mononucelotide repeats appeared to be the most abundant SSRs within sweet orange and trifoliate orange, followed by trimeric repeats. The adenine rich repeats such as A/T, AG, AT, AAG, AAAT, AAAG, AAAT, AAAAG, AAAAT etc. were predominant in each type of SSRs (mono-, di-, tri-, tetra-, and penta-), whereas the C/G, CG, CCG repeats were less abundant. Twenty-five primer pairs flanking EST-SSR loci were designed to detect the possible polymorphism of six citrus cultivars including sweet orange and trifoliate orange. The PCR result with all these 25 primer pairs revealed the existence of polymorphism within six citrus cultivars confirming that citrus EST database could be efficiently exploited for the development of gene-derived SSR markers.  相似文献   

2.
Expressed sequence tag (EST) databases offer opportunity for the rapid development of simple sequence repeat (SSR) markers in crops. Sequence assembly and clustering of 57?895 ESTs of castor bean resulted in the identification of 10?960 unigenes (6459 singletons and 4501 contigs) having 7429 SSRs. On an average, the unigenes contained 1 SSR for every 1.23?kb of unigene sequence. The identified SSRs mostly consisted of dinucleotide (62.4%) and trinucleotide (33.5%) repeats. The AG class was the most common among the dinucleotide motifs (68.9%), whereas the AAG class (25.9%) was predominant among the trinucleotide motifs. A total of 611 primer pairs were designed for the SSRs, having repeat length more than or equal to 20 nucleotides, of which a set of 130 markers were tested and 92 of these yielding robust amplicons were analyzed for their utility in genetic purity assessment of castor bean hybrids. Nine markers were able to detect polymorphism between the parental lines of nine commercial castor bean hybrids (DCH-32, DCH-177, DCH-519, GCH-2, GCH-4, GCH-5, GCH-6, GCH-7, and RHC-1), and their utility in genetic purity testing was demonstrated. These novel EST-SSR markers would be a valuable addition to the growing molecular marker resources that could be used in genetic improvement programmes of castor bean.  相似文献   

3.
Mung bean (Vigna radiate (L.) Wilczek) is an important traditional food legume crop, with high economic and nutritional value. It is widely grown in China and other Asian countries. Despite its importance, genomic information is currently unavailable for this crop plant species or some of its close relatives in the Vigna genus. In this study, more than 103 million high quality cDNA sequence reads were obtained from mung bean using Illumina paired-end sequencing technology. The processed reads were assembled into 48,693 unigenes with an average length of 874 bp. Of these unigenes, 25,820 (53.0%) and 23,235 (47.7%) showed significant similarity to proteins in the NCBI non-redundant protein and nucleotide sequence databases, respectively. Furthermore, 19,242 (39.5%) could be classified into gene ontology categories, 18,316 (37.6%) into Swiss-Prot categories and 10,918 (22.4%) into KOG database categories (E-value < 1.0E-5). A total of 6,585 (8.3%) were mapped onto 244 pathways using the Kyoto Encyclopedia of Genes and Genome (KEGG) pathway database. Among the unigenes, 10,053 sequences contained a unique simple sequence repeat (SSR), and 2,303 sequences contained more than one SSR together in the same expressed sequence tag (EST). A total of 13,134 EST-SSRs were identified as potential molecular markers, with mono-nucleotide A/T repeats being the most abundant motif class and G/C repeats being rare. In this SSR analysis, we found five main repeat motifs: AG/CT (30.8%), GAA/TTC (12.6%), AAAT/ATTT (6.8%), AAAAT/ATTTT (6.2%) and AAAAAT/ATTTTT (1.9%). A total of 200 SSR loci were randomly selected for validation by PCR amplification as EST-SSR markers. Of these, 66 marker primer pairs produced reproducible amplicons that were polymorphic among 31 mung bean accessions selected from diverse geographical locations. The large number of SSR-containing sequences found in this study will be valuable for the construction of a high-resolution genetic linkage maps, association or comparative mapping and genetic analyses of various Vigna species.  相似文献   

4.

Background

Despite great advances in genomic technology observed in several crop species, the availability of molecular tools such as microsatellite markers has been limited in tea (Camellia sinensis L.). The development of microsatellite markers will have a major impact on genetic analysis, gene mapping and marker assisted breeding. Unigene derived microsatellite (UGMS) markers identified from publicly available sequence database have the advantage of assaying variation in the expressed component of the genome with unique identity and position. Therefore, they can serve as efficient and cost effective alternative markers in such species.

Results

Considering the multiple advantages of UGMS markers, 1,223 unigenes were predicted from 2,181 expressed sequence tags (ESTs) of tea (Camellia sinensis L.). A total of 109 (8.9%) unigenes containing 120 SSRs were identified. SSR abundance was one in every 3.55 kb of EST sequences. The microsatellites mainly comprised of di (50.8%), tri (30.8%), tetra (6.6%), penta (7.5%) and few hexa (4.1%) nucleotide repeats. Among the dinucleotide repeats, (GA)n.(TC)n were most abundant (83.6%). Ninety six primer pairs could be designed form 83.5% of SSR containing unigenes. Of these, 61 (63.5%) primer pairs were experimentally validated and used to investigate the genetic diversity among the 34 accessions of different Camellia spp. Fifty one primer pairs (83.6%) were successfully cross transferred to the related species at various levels. Functional annotation of the unigenes containing SSRs was done through gene ontology (GO) characterization. Thirty six (60%) of them revealed significant sequence similarity with the known/putative proteins of Arabidopsis thaliana. Polymorphism information content (PIC) ranged from 0.018 to 0.972 with a mean value of 0.497. The average heterozygosity expected (H E ) and observed (H o ) obtained was 0.654 and 0.413 respectively, thereby suggesting highly heterogeneous nature of tea. Further, test for IAM and SMM models for the UGMS loci showed excess heterozygosity and did not show any bottleneck operating in the tea population.

Conclusion

UGMS markers identified and characterized in this study provided insight about the abundance and distribution of SSR in the expressed genome of C. sinensis. The identification and validation of 61 new UGMS markers will not only help in intra and inter specific genetic diversity assessment but also be enriching limited microsatellite markers resource in tea. Further, the use of these markers would reduce the cost and facilitate the gene mapping and marker-aided selection in tea. Since, 36 of these UGMS markers correspond to the Arabidopsis protein sequence data with known functions will offer the opportunity to investigate the consequences of SSR polymorphism on gene functions.  相似文献   

5.
柑橘EST-SSR分子标记分析   总被引:25,自引:0,他引:25  
江东  钟广炎  洪棋斌 《遗传学报》2006,33(4):345-353
对来源于甜橙(Citrus sinensis Osbeck)、枳壳(Poncirus trifoliata Raf.)和其他柑橘非冗余EST数据库的38124条单-基因(Unigene)序列进行了简单重复序列SSRs(Simple Sequence Repeat)搜索,所分析的柑橘非冗余核酸序列总长23.29Mb,从中获得了8218条SSR,其中包括单碱基重复4913条(59.8%),2碱基重复1419条(17.3%),3碱基重复1709条(20.8%),4碱基重复114条(1.39%),5碱基重复23条(0.28%),6碱基重复40条(0.49%)。大约每2.8kb长度的单-基因序列中即存在1个SSR,即平均4.6个单-基因中存在1个SSR。随碱基重复单元(motif)的不同,SSR的最大长度在40-105之间,全部重复序列的平均长度为20.9bp。各种SSR(1-,2-,3-,4-,5-,6-核苷酸重复)的发生频率在甜橙和枳壳间非常接近。其中单碱基重复序列是最丰富的重复单元,其次为3碱基重复。在所得的SSR的重复单元中,富含A碱基的重复单元的分布占据优势地位,出现的频率与密度均较高,而富含CG碱基的重复单元出现频率和密度较低。用25对EST-SSR引物对6个柑橘品种的多样性进行了PCR检测,结果表明,所有25对引物在6个柑橘品种间均扩增到多样性条带,证实通过柑橘EST数据库的发掘能够高效地筛选到基因水平的SSR标记。  相似文献   

6.
7.
A collection of 5,659 expressed sequence tags (ESTs) from pineapple [Ananas comosus (L.) Merr.] was screened for simple sequence repeats (EST-SSRs) with motif lengths between 1 and 6 bp. Lower thresholds of 15, 7 and 5 repeat units were used to define microsatellites of the mono-, di-, and tri- to hexanucleotide repeat type, respectively. Based on these criteria, 696 SSRs were identified among 3,389 EST unigenes, together representing 2,840 kb. This corresponds to an average density of one SSR every 4.1 kb of non-redundant EST sequences. Dinucleotide repeats were most abundant (38.4% of all SSRs) followed by trinucleotide repeats (38.1%). Flanking primer pairs were designed for 537 EST-SSR loci, and 49 of these were screened for their functionality in 12 accessions of A. comosus, 14 accessions of 5 additional Ananas species and 1 species of Pseudananas. Distinct PCR products of the expected size range were obtained with 36 primer pairs. Eighteen loci analyzed in more detail were all polymorphic in pineapple, and primer pairs flanking these loci also generated PCR products from a wide range of genera and species from six subfamilies of the Bromeliaceae. The potential to reveal polymorphism in a heterologous target species was demonstrated in Deuterocohnia brevifolia (subfamily Pitcairnioideae).  相似文献   

8.
9.
为了在芦笋中开发EST-SSR功能性标记,对来源于NCBI公共数据库的8590条芦笋(AsparagusofficinalisL.)EST序列进行简单重复序列SSR搜索。剔除冗余序列,得到非冗余序列8377条。在非冗余序列中共挖掘出469个EST-SSR,平均相隔14.80kb出现1个SSR。在所有的重复基序中,二核苷酸重复基序的SSR所占比例最高40.51%(190/469),其次是三核苷酸34.97%(164/469),六核苷酸21.11%(99/469)。在所有基序里,CT/AG出现的频率最高有62次,占全部重复基序的13.22%(62/469)。选取含SSR的EST序列30条,并利用primer5软件设计引物,进行SSR位点的扩增,其中27对引物扩增产物,24对有较清晰可靠的目标扩增条带,占引物数的80%,且所检测出的芦笋等位基因数量较丰富,平均4.93个/对。这些EST-SSR标记的开发将有助于芦笋群体遗传多样性、遗传图谱构建、基因定位、分子标记和系谱分析等方面的研究。  相似文献   

10.
11.
Efficient and robust molecular markers are essential for molecular breeding in plant. Compared to dominant and bi-allelic markers, multiple alleles of simple sequence repeat (SSR) markers are particularly informative and superior in genetic linkage map and QTL mapping in autotetraploid species like alfalfa. The objective of this study was to enrich SSR markers directly from alfalfa expressed sequence tags (ESTs). A total of 12,371 alfalfa ESTs were retrieved from the National Center for Biotechnology Information. Total 774 SSR-containing ESTs were identified from 716 ESTs. On average, one SSR was found per 7.7 kb of EST sequences. Tri-nucleotide repeats (48.8 %) was the most abundant motif type, followed by di—(26.1 %), tetra—(11.5 %), penta—(9.7 %), and hexanucleotide (3.9 %). One hundred EST–SSR primer pairs were successfully designed and 29 exhibited polymorphism among 28 alfalfa accessions. The allele number per marker ranged from two to 21 with an average of 6.8. The PIC values ranged from 0.195 to 0.896 with an average of 0.608, indicating a high level of polymorphism of the EST–SSR markers. Based on the 29 EST–SSR markers, assessment of genetic diversity was conducted and found that Medicago sativa ssp. sativa was clearly different from the other subspecies. The high transferability of those EST–SSR markers was also found for relative species.  相似文献   

12.
茶树EST-SSRs分布特征及引物开发   总被引:10,自引:1,他引:10  
为了在茶树中开发EST-SSRs功能性标记,利用生物信息学方法对NCBI网上公开的3288奈茶树(Camellia subebsus)ESTs序列进行EST-SSRs特征分析。剔除冗余序列,得到非冗余序列2083条。在非冗余序列中发现含不同重复基元SSRs的EST序列有385条,共486个EST-SSRs,平均相隔2.10kb出现1个SSR。在2~6bp的重复基元中,二核苷酸重复基元的SSRs出现频率最高(51.97%),其次是三核苷酸(19.55%)。对所有的重复基元类型进行统计分析发现,所占比例最高的是AG/CT(47.74%),其次分别是AT/TA(4.73%)和AAG/CTT(4.73%)。利用Prime5软件,设计了206对EST-SSRs引物,随机选用72对引物进行SSR扩增,发现31对引物可以扩增出条带,其中29对引物具有多态性,多态性比率为93.5%。这些EST-SSRs将有助于茶树基因组学方面的研究。  相似文献   

13.
14.
Faba bean (Vicia faba L.) is an important food legume crop with a huge genome. Development of genetic markers for faba bean is important to study diversity and for molecular breeding. In this study, we used Next Generation Sequencing (NGS) technology for the development of genomic simple sequence repeat (SSR) markers. A total of 14,027,500 sequence reads were obtained comprising 4,208 Mb. From these reads, 56,063 contigs were assembled (16,367 Mb) and 2138 SSRs were identified. Mono and dinucleotides were the most abundant, accounting for 57.5 % and 20.9 % of all SSR repeats, respectively. A total of 430 primer pairs were designed from contigs larger than 350 nucleotides and 50 primers pairs were tested for validation of SSR locus amplification. Nearly all (96 %) of the markers were found to produce clear amplicons and to be reproducible. Thirty-nine SSR markers were then applied to 46 faba bean accessions from worldwide origins, resulting in 161 alleles with 87.5 % polymorphism, and an average of 4.1 alleles per marker. Gene diversity (GD) of the markers ranged from 0 to 0.48 with an average of 0.27. Testing of the markers showed that they were useful in determining genetic relationships and population structure in faba bean accessions.  相似文献   

15.
Simple sequence repeats (SSRs) derived from expressed sequence tags (ESTs) are valuable markers because they represent transcribed regions and often have putative functions. We mined and characterized microsatellites in melon ESTs. Three hundred and eighty‐three SSR loci were identified in 309 of 3188 unigenes assembled by 5747 EST and mRNA sequences in GenBank with occurring frequency of 1/4.7 kb. Twenty‐two polymorphic EST‐SSR markers were developed with the mean allele number of 2.9 per locus and mean expected heterozygosity of 0.442. Amplification products were also detected by 15 pairs of primer in Cucumis sativus. Those informative EST‐SSR markers can be used in melon genetic improvement projects.  相似文献   

16.
17.
Opium poppy (Papaver somniferum L.) is an important pharmaceutical crop with very few genetic marker resources. To expand these resources, we sequenced genomic DNA using pyrosequencing technology and examined the DNA sequences for simple sequence repeats (SSRs). A total of 1,244,412 sequence reads were obtained covering 474 Mb. Approximately half of the reads (52 %) were assembled into 166,724 contigs representing 105 Mb of the opium poppy genome. A total of 23,283 non-redundant SSRs were identified in 18,944 contigs (11.3 % of total contigs). Trinucleotide and tetranucleotide repeats were the most abundant SSR repeats, accounting for 49.0 and 27.9 % of all SSRs, respectively. The AAG/TTC repeat was the most abundant trinucleotide repeat, representing 19.7 % of trinucleotide repeats. Other SSR repeat types were AT-rich. A total of 23,126 primer pairs (98.7 % of total SSRs) were designed to amplify SSRs. Fifty-three genomic SSR markers were tested in 37 opium poppy accessions and seven Papaver species for determination of polymorphism and transferability. Intraspecific polymorphism information content (PIC) values of the genomic SSR markers were intermediate, with an average 0.17, while the interspecific average PIC value was slightly higher, 0.19. All markers showed at least 88 % transferability among related species. This study increases sequence coverage of the opium poppy genome by sevenfold and the number of opium poppy-specific SSR markers by sixfold. This is the first report of the development of genomic SSR markers in opium poppy, and the genomic SSR markers developed in this study will be useful in diversity, identification, mapping and breeding studies in opium poppy.  相似文献   

18.
19.
Highly informative molecular markers, such as simple sequence repeats (SSRs), can greatly accelerate breeding programs. The aim of this study was to develop and characterise a comprehensive set of SSR markers for white clover (Trifolium repens L.), which can be used to tag genes and quantitative trait loci controlling traits of agronomic interest. Sequence analysis of 1123 clones from genomic libraries enriched for (CA) n repeats yielded 793 clones containing SSR loci. The majority of SSRs consisted of perfect dinucleotide repeats, only 7% being trinucleotide repeats. After exclusion of redundant sequences and SSR loci with less than 25 bp of flanking sequence, 397 potentially useful SSRs remained. Primer pairs were designed for 117 SSR loci and PCR products in the expected size range were amplified from 101 loci. These markers were highly polymorphic, 88% detecting polymorphism across seven white clover genotypes with an average allele number of 4.8. Four primer pairs were tested in an F2 population revealing Mendelian segregation. Successful cross-species amplification was achieved in at least one out of eight legume species for 46 of 54 primer pairs. The rate of successful amplification was significantly higher for Trifolium species when compared to species of other genera. The markers developed in this study not only provide valuable tools for molecular breeding of white clover but may also have applications in related taxa. Received: 3 April 2000 / Accepted: 12 May 2000  相似文献   

20.
Kenaf (Hibiscus cannabinus L.) is an economically important natural fiber crop grown worldwide. However, only 20 expressed tag sequences (ESTs) for kenaf are available in public databases. The aim of this study was to develop large-scale simple sequence repeat (SSR) markers to lay a solid foundation for the construction of genetic linkage maps and marker-assisted breeding in kenaf. We used Illumina paired-end sequencing technology to generate new EST-simple sequences and MISA software to mine SSR markers. We identified 71,318 unigenes with an average length of 1143 nt and annotated these unigenes using four different protein databases. Overall, 9324 complementary pairs were designated as EST-SSR markers, and their quality was validated using 100 randomly selected SSR markers. In total, 72 primer pairs reproducibly amplified target amplicons, and 61 of these primer pairs detected significant polymorphism among 28 kenaf accessions. Thus, in this study, we have developed large-scale SSR markers for kenaf, and this new resource will facilitate construction of genetic linkage maps, investigation of fiber growth and development in kenaf, and also be of value to novel gene discovery and functional genomic studies.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号