Similar literature
1.
With the aim of developing additional genomic resources in safflower, a set of 41,011 safflower ESTs was mined for the presence of SSRs. A total of 18,773 SSR-containing ESTs (SSR-ESTs) were identified and analyzed to remove redundant sequences, leading to the identification of 8,810 non-redundant SSR-ESTs (categorized into 6,104 singletons and 2,706 contigs) having 13,085 non-redundant SSRs. The average number of non-redundant SSRs per EST was 0.32, and they predominantly consisted of dinucleotide (57.7%) and trinucleotide (37.7%) repeat motifs. A total of 500 primer pairs were designed for the non-redundant EST-SSRs, of which 151 were tested. Sixty markers that gave robust amplicons were validated in a set of 19 Carthamus lines. A subset of EST-SSR markers with an average polymorphism information content (PIC) ≥0.4 could precisely elucidate the pedigree relatedness among these lines. Further, these markers exhibited high cross-species transferability to five other wild species of Carthamus. The markers reported here would be a valuable addition to existing safflower marker resources and should help hasten its improvement.

2.
A collection of 5,659 expressed sequence tags (ESTs) from pineapple [Ananas comosus (L.) Merr.] was screened for simple sequence repeats (EST-SSRs) with motif lengths between 1 and 6 bp. Minimum thresholds of 15, 7 and 5 repeat units were used to define microsatellites of the mono-, di-, and tri- to hexanucleotide repeat type, respectively. Based on these criteria, 696 SSRs were identified among 3,389 EST unigenes, together representing 2,840 kb. This corresponds to an average density of one SSR every 4.1 kb of non-redundant EST sequence. Dinucleotide repeats were the most abundant (38.4% of all SSRs), followed closely by trinucleotide repeats (38.1%). Flanking primer pairs were designed for 537 EST-SSR loci, and 49 of these were screened for their functionality in 12 accessions of A. comosus, 14 accessions of 5 additional Ananas species and 1 species of Pseudananas. Distinct PCR products of the expected size range were obtained with 36 primer pairs. Eighteen loci analyzed in more detail were all polymorphic in pineapple, and primer pairs flanking these loci also generated PCR products from a wide range of genera and species from six subfamilies of the Bromeliaceae. The potential to reveal polymorphism in a heterologous target species was demonstrated in Deuterocohnia brevifolia (subfamily Pitcairnioideae).
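
The repeat-unit thresholds quoted in this abstract (at least 15 units for mononucleotide, 7 for dinucleotide, and 5 for tri- to hexanucleotide motifs) can be illustrated with a small perfect-repeat scanner. This is only a sketch under stated assumptions: the abstract does not name the software that was used, and the names `MIN_REPEATS`, `is_primitive` and `find_ssrs` are hypothetical. Only perfect repeats over A/C/G/T are reported.

```python
import re

# Minimum number of repeat units per motif length, taken from the abstract:
# mono >= 15, di >= 7, tri- to hexanucleotide >= 5.
MIN_REPEATS = {1: 15, 2: 7, 3: 5, 4: 5, 5: 5, 6: 5}


def is_primitive(motif):
    """True if the motif is not itself a repetition of a shorter unit (e.g. 'ATAT')."""
    n = len(motif)
    return not any(n % d == 0 and motif == motif[:d] * (n // d) for d in range(1, n))


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # One k-mer followed by at least n-1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        pos = 0
        while True:
            m = pattern.search(seq, pos)
            if m is None:
                break
            motif = m.group(1)
            if is_primitive(motif):
                hits.append((m.start(), motif, len(m.group(0)) // k))
                pos = m.end()
            else:
                # Skip motifs such as 'AA' or 'AGAG'; the shorter unit is scored instead.
                pos = m.start() + 1
    return sorted(hits)


example = "TTGC" + "AG" * 8 + "CCGT" + "AAG" * 5 + "TTGA" + "A" * 10
print(find_ssrs(example))
```

In the example sequence the run of ten A residues is not reported, because it falls below the 15-unit mononucleotide threshold.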

3.
Grape expressed sequence tags (ESTs) are a new resource for developing simple sequence repeat (SSR) functional markers for genotyping and genetic mapping. An integrated pipeline including several computational tools for SSR identification and functional annotation was developed to identify 6,447 EST-SSR sequences from a total collection of 215,609 grape ESTs retrieved from NCBI. The 6,447 EST-SSRs were further reduced to 1,701 non-redundant sequences via clustering analysis, and primer pairs flanking the SSR motifs were successfully designed for 1,037 of them. Of these, 150 primer pairs were randomly selected for PCR amplification, polymorphism and heterozygosity analysis in V. vinifera cvs. Riesling and Cabernet Sauvignon, and V. rotundifolia (muscadine grape) cvs. Summit and Noble, and 145 of these primer pairs yielded PCR products. Pairwise comparisons of loci between the parents Riesling and Cabernet Sauvignon showed that 72 were homozygous in both cultivars, while 70 loci were heterozygous in at least one of the two cultivars. The muscadine parents Noble and Summit had 90 SSR loci that were homozygous in both parents and 50 loci that were heterozygous in at least one of the two. These EST-SSR functional markers are a useful addition for grape genotyping and genome mapping.

4.
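To investigate the polymorphism of EST-SSR markers in the silkworm Bombyx mori, 4,465 EST sequences retrieved for silkworm linkage group 12 were analyzed. After sorting and assembly, 581 non-redundant EST sequences with a total length of about 480 kb were obtained. Among them, 154 EST-SSRs were detected in 122 sequences, accounting for 2.73% of the EST sequences studied, with one EST-SSR every 3.12 kb on average. Among the detected EST-SSRs, trinucleotide and tetranucleotide repeats were the dominant types, accounting for 36.36% and 28.57% of the total, respectively, and most were of the perfect form; the average repeat length was about 16.2 bp and the longest was 30 bp. Homology analysis further showed that 26 sequences had homologous sequences retrievable from NCBI; these sequences contained 40 SSRs in total, of which 14 (35.0%) were located in the 5′-UTR, 11 (27.5%) in the 3′-UTR, and 15 (37.5%) in the CDS. Eleven primer pairs were designed from the screened microsatellite sequences, of which 8 yielded amplification products with clear bands; PCR amplification of 8 silkworm varieties with primer ES1204 showed polymorphism in all of them. The results indicate that mining SSR markers from the silkworm EST database is a feasible approach.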
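
The frequency and density figures in this abstract appear to follow from two simple ratios: SSR-containing sequences over the 4,465 input ESTs, and total non-redundant length over the number of SSRs. The sketch below reproduces them; the function name `ssr_summary` is hypothetical, and the choice of denominators is an assumption inferred from the reported numbers rather than something the abstract states.

```python
def ssr_summary(n_ssr_seqs, n_input_ests, n_ssrs, total_kb):
    """Return (percent of ESTs carrying an SSR, kb of sequence per SSR), rounded to 2 decimals."""
    frequency_pct = 100.0 * n_ssr_seqs / n_input_ests
    kb_per_ssr = total_kb / n_ssrs
    return round(frequency_pct, 2), round(kb_per_ssr, 2)


# Silkworm linkage group 12: 122 SSR-containing sequences out of 4,465 ESTs,
# 154 EST-SSRs in about 480 kb of non-redundant sequence.
print(ssr_summary(122, 4465, 154, 480))  # (2.73, 3.12)
```

The same ratio applied to other entries in this list (for example 2,840 kb and 696 SSRs in pineapple) gives their reported densities, provided the density is defined against non-redundant sequence length.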

5.
Nearly 900 SSRs (simple sequence repeats) were identified among 15,000 ESTs (expressed sequence tags) belonging to bread wheat (Triticum aestivum L.). The SSRs were defined by their minimum length, which ranged from 14 to 21 bp. The maximum length ranged from 24 to 87 bp depending upon the length of the repeat unit itself (1–7 bp). The average density of SSRs was one SSR per 9.2 kb of EST sequence screened. The trinucleotide repeats were the most abundant SSRs detected. As a representative sample, 78 primer pairs were designed, which were also used to screen the dbEST entries for Hordeum vulgare and Triticum tauschii (donor of the D-genome of cultivated wheat) using a cut-off E (expectation) value of 0.01. On the basis of in silico analysis, up to 55.12% of the primer pairs exhibited transferability from Triticum to Hordeum, indicating that the sequences flanking the SSRs are conserved not only within a single genus but also between related genera in Poaceae. Primer pairs for the 78 SSRs were synthesized and used successfully to study (1) their transferability to 18 related wild species and five cereal species (barley, oat, rye, rice and maize); and (2) polymorphism between the parents of four mapping populations available to us. A subset of 20 EST-SSR primers was also used to assess genetic diversity in a collection of 52 elite exotic wheat genotypes. This was done to compare their utility with that of other molecular markers (gSSRs, AFLPs, and SAMPL) previously used by us for the same purpose with the same set of 52 bread wheat genotypes. Although only a low level of polymorphism was detected relative to that observed with genomic SSRs, the study suggested that EST-SSRs can be successfully used for a variety of purposes, and may actually prove superior to SSR markers extracted from genomic libraries for diversity estimation and transferability. Communicated by R. Hagemann

6.
To identify EST-SSR molecular markers, 41,986 cattle UniGene sequences from NCBI were mined for SSRs. A total of 1,831 SSRs were identified from 1,666 ESTs, which represented an average density of 19.88 kb per SSR. The frequency of EST-SSRs was 4.0%. The dinucleotide repeat motif was the most abundant SSR, accounting for 54%, followed by 22%, 13%, 7% and 4%, respectively, for tri-, hexa-, penta- and tetra-nucleotide repeats. Depending upon the length of the repeat unit, the length of microsatellites varied from 14 to 86 bp. Among the di- and tri-nucleotide repeats, AC/TG (57%) and AGC (12%) were the most abundant types. Annotation of EST-SSRs was also carried out. Three hundred primer pairs were randomly designed using the Primer Premier 5.0 program and Oligo 5.0 for further experimental validation.

7.
Teleost fish genome projects involving model species are resulting in a rapid accumulation of genomic and expressed DNA sequences in public databases. The expressed sequence tags (ESTs) collected in the databases can be mined for the analysis of both structural and functional genomics. In this study, we analyzed in silico 49,430 unigenes representing a total of 692,654 ESTs from four model fish for their potential use in developing simple sequence repeats (SSRs), or microsatellites. After bioinformatic mining, a total of 3,018 EST-derived SSRs (EST-SSRs) were identified in 2,335 SSR-containing ESTs (SSR-ESTs). The frequency of identified SSR-ESTs ranged from 1.5% for Xiphophorus to 7.3% for zebrafish. The dinucleotide repeat motif was the most abundant SSR, accounting for 47%, 52%, 64%, and 78% for medaka, Fundulus, zebrafish, and Xiphophorus, respectively. Simulation analysis suggests that a majority of these EST-SSRs have sufficient flanking sequences for polymerase chain reaction (PCR) primer design. Comparative DNA sequence analyses of SSR-ESTs identified several cross-species SSRs and sequences that may be used as cross-reference genes in comparative studies. For example, the flanking sequences of one SSR, (CTG)n, within the pituitary tumor-transforming gene (PTTG) 1 interacting protein (PTTGIP) showed conservation spanning the medaka, Fundulus, human, and mouse genomes. This study provides a large body of information on EST-SSRs that can be useful for the development of polymorphic markers, gene mapping, and comparative genome analysis. Functional analysis of these SSR-ESTs may reveal their role in metabolism and gene evolution of these model species.

8.
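Distribution characteristics of EST-SSRs in tea plant and primer development   Total citations: 11, self-citations: 1, citations by others: 10
To develop functional EST-SSR markers in tea plant, bioinformatic methods were used to characterize EST-SSRs in 3,288 tea plant (Camellia sinensis) EST sequences publicly available from NCBI. After redundant sequences were removed, 2,083 non-redundant sequences remained. Among these, 385 EST sequences contained SSRs with different repeat motifs, for a total of 486 EST-SSRs, or one SSR every 2.10 kb on average. Among repeat motifs of 2–6 bp, dinucleotide SSRs were the most frequent (51.97%), followed by trinucleotide SSRs (19.55%). Across all motif types, AG/CT accounted for the highest proportion (47.74%), followed by AT/TA (4.73%) and AAG/CTT (4.73%). Using Primer 5 software, 206 EST-SSR primer pairs were designed; 72 pairs were randomly selected for SSR amplification, of which 31 produced bands and 29 of those were polymorphic, a polymorphism rate of 93.5%. These EST-SSRs will aid genomics research in tea plant.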
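
Motif classes such as AG/CT, AT/TA and AAG/CTT pool motifs that describe the same repeat read from either strand or from a different starting position. A minimal sketch of one common way to do this grouping follows; the helper name `canonical_motif` is hypothetical, and the abstract does not say how the original analysis grouped motifs, so the convention used here (lexicographically smallest rotation of the motif or its reverse complement) is an assumption.

```python
from collections import Counter

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical_motif(motif):
    """Map a motif to the smallest rotation of itself or of its reverse complement."""
    motif = motif.upper()
    revcomp = motif.translate(_COMPLEMENT)[::-1]
    candidates = []
    for s in (motif, revcomp):
        # All cyclic rotations, e.g. AAG -> AAG, AGA, GAA.
        candidates.extend(s[i:] + s[:i] for i in range(len(s)))
    return min(candidates)


observed = ["AG", "CT", "GA", "TC", "AT", "TA", "AAG", "CTT", "TTC"]
print(Counter(canonical_motif(m) for m in observed))
```

With this convention the nine motifs in the example collapse into three classes, AG, AT and AAG, which correspond to the AG/CT, AT/TA and AAG/CTT labels used in the abstract.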

9.
Cassava (Manihot esculenta) is an economically important crop that is grown in tropical and sub-tropical regions. Use of molecular technology for genetic improvement of cassava has been limited by the lack of a large set of DNA markers and a genetic map. Therefore, the aims here were to develop additional simple sequence repeat (SSR) markers from the public expressed sequence tags (ESTs), and to construct a genetic linkage map. In this study, we designed 425 EST-SSR markers from sequences obtained from the cassava EST database in GenBank, and integrated them with 667 SSR markers from a microsatellite-enriched genomic sequence received from the International Center for Tropical Agriculture (CIAT). Of these, 107 EST-SSR and 500 genomic SSR primer pairs showed polymorphic patterns when screened in two cassava varieties, Hauy Bong 60 and Hanatee, which were used as female and male parental lines, respectively. Of the 107 EST-SSR and 500 genomic SSR primer pairs, 81 and 226, respectively, were successfully genotyped in 100 F1 progeny. The results showed 20 linkage groups consisting of 211 markers (56 EST-SSR and 155 SSR markers) spanning 1,178 cM, with an average distance between markers of 5.6 cM and about 11 markers per linkage group. These novel EST-SSR markers provided genic PCR-based co-dominant markers that were useful, reliable and economical. The EST-SSRs were used together with SSR markers to construct the cassava genetic linkage map, which will be useful for the identification of quantitative trait loci controlling the traits of interest in cassava breeding programs.

10.
11.
With the aim of increasing the number of functional markers in a resource-poor crop such as cultivated peanut (Arachis hypogaea), the large number of expressed sequence tags (ESTs) available in public databases was used to develop novel EST-derived simple sequence repeat (SSR) markers. From 16,424 unigenes, 2,784 (16.95%) SSR-containing unigenes with 3,373 SSR motifs were identified. Of these, 2,027 (72.81%) sequences were annotated and 4,124 gene ontology terms were assigned. Among the SSR motif classes, trinucleotide repeats (33.86%) were the most abundant, followed by dinucleotide repeats (27.51%), while AG/CT (20.7%) and AAG/CTT (13.25%) were the most abundant repeat motifs. A total of 2,456 novel EST-SSR primer pairs were designed, of which 366, from unigenes relevant to various stresses and other functions, were validated by PCR using a set of 11 diverse peanut genotypes. Of these, 340 (92.62%) primer pairs yielded clear and scorable PCR products and 39 (10.66%) primer pairs exhibited polymorphisms. Overall, the number of alleles per marker ranged from 1 to 12 with an average of 3.77, and the PIC ranged from 0.028 to 0.375 with an average of 0.325. The identified EST-SSRs not only enrich the existing pool of molecular markers, but would also facilitate targeted research on marker-trait association for various stresses, inter-specific studies and genetic diversity analysis in peanut.
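
Polymorphism information content (PIC) is computed from the allele frequencies at a marker. The sketch below uses the classical definition, PIC = 1 − Σ p_i² − Σ_{i<j} 2 p_i² p_j²; whether the study used this form or the simpler 1 − Σ p_i² is not stated in the abstract, so treat the choice as an assumption. The function name `pic` is hypothetical.

```python
from itertools import combinations


def pic(freqs):
    """Polymorphism information content from a list of allele frequencies summing to 1."""
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in freqs)
    # Correction term over all unordered pairs of distinct alleles.
    cross = sum(2 * p * p * q * q for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross


print(pic([0.5, 0.5]))       # two equally frequent alleles
print(pic([0.7, 0.2, 0.1]))  # three alleles, one dominant
```

Under this definition a marker with two equally frequent alleles gives 0.375, the maximum possible for a biallelic marker, which happens to equal the upper bound quoted in the abstract.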

12.
A set of 146,611 expressed sequence tags (ESTs) was generated from 10 flax cDNA libraries. After assembly, a total of 11,166 contigs and 11,896 singletons were mined for the presence of putative simple sequence repeats (SSRs) and yielded 806 (3.5%) non-redundant sequences which contained 851 putative SSRs. This is equivalent to one EST-SSR per 16.5 kb of sequence. Trinucleotide motifs were the most abundant (76.9%), followed by dinucleotides (13.9%). Tetra-, penta- and hexanucleotide motifs represented <10% of the SSRs identified. A total of 83 SSR motifs were identified. Motif (TTC/GAA)n was the most abundant (10.2%), followed by (CTT/AAG)n (8.7%), (TCT/AGA)n (8.6%), (CT/AG)n (6.7%) and (TC/GA)n (5.3%). A total of 662 primer pairs were designed, of which 610 yielded amplicons in a set of 23 flax accessions. Polymorphism between the accessions was found for 248 primer pairs, which detected a total of 275 EST-SSR loci. Two to seven alleles were detected per marker. The polymorphism information content value for these markers ranged from 0.08 to 0.82 and averaged 0.35. The 635 alleles detected by the 275 polymorphic EST-SSRs were used to study the genetic relationship of the 23 flax accessions. Four major clusters and two singletons were observed. Sub-clusters within the main clusters correlated with the pedigree relationships among accessions. The EST-SSRs developed herein represent the first large-scale development of SSR markers in flax. They have the potential to be used for the development of genetic and physical maps, quantitative trait loci mapping, genetic diversity studies, association mapping and cultivar fingerprinting.

13.
A total of 26,685 unutilized public domain expressed sequence tags (ESTs) of Arachis hypogaea L. were analyzed to give a total of 4,442 EST-SSRs, in which 517 ESTs contained more than one simple sequence repeat (SSR). Of these EST-SSRs, 2,542 were mononucleotide repeats (MNRs), 803 were dinucleotide repeats (DNRs), 1,043 were trinucleotide repeats (TNRs), 40 were tetranucleotide repeats (TtNRs), six were pentanucleotide repeats (PNRs) and eight were hexanucleotide repeats (HNRs). Out of these 4,442 EST-SSRs, only 1,160 were found to be successful in non-redundant primer design; 1,060 were simple SSRs, while the remaining 100 were compound forms. Among all the motifs, MNRs were the most abundant, followed by TNRs and DNRs. The AAG/CTT motif was the most abundant (~33%) TNR, while AG/CT was the most abundant DNR. For redundancy and novelty, a stringent criterion deploying three different strategies was used and a total of 782 novel EST-SSRs were added to the public domain of peanut. These novel EST-SSR markers will be useful for qualitative and quantitative trait mapping, marker-assisted selection and genetic diversity studies in cultivated peanut as well as related Arachis species. A subset of 30 novel EST-SSRs was further randomly selected for validation and genotyping studies with eight well-known cultivars and 32 advanced breeding lines (ADBX lines, ADBY lines and ADBZ lines) from Odisha state, India. The number of polymorphic markers among accessions of A. hypogaea was low; however, a set of informative EST-SSR markers detected considerable levels of genetic variability in peanut cultivars and uncharacterized breeding lines collected from Odisha. The 30 newly developed EST-SSRs from Arachis spp. showed ~97% amplification in Cicer arietinum and 93% in pigeon pea. Thus, the EST-SSRs developed in this study will be a very useful asset for genetic analysis, comparative genome mapping, population genetic structure and phylogenetic inferences among wild and allied species of Arachis.

14.
15.
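Analysis of SSR information in tobacco EST resources   Total citations: 2, self-citations: 0, citations by others: 2
The rapid growth in the number of tobacco ESTs provides a valuable resource for developing new SSR markers. After software analysis and removal of redundant sequences from 242,683 tobacco EST sequences, 9,339 SSRs were detected in 211,728 non-redundant tobacco ESTs, with SSRs spaced about 14.21 kb apart, a detection rate of 4.41%, and 216 kinds of repeat motif. Trinucleotide repeats were the dominant SSR type, accounting for 50.34% of all SSRs, followed by dinucleotide and mononucleotide repeats at 23.00% and 16.48%, respectively; each of the remaining repeat types accounted for less than 5%. Among all repeat motifs, A/T was the main type, accounting for 14.68% of all repeats, followed by AT/TA, AG/TC and AAG/TTC at 10.49%, 9.48% and 6.85%, respectively. Ten EST-SSR primer pairs were randomly designed and used to amplify six tobacco varieties; all ten produced amplification products, and one was polymorphic among the six varieties. This study lays a foundation for the establishment and further application of tobacco EST-SSR markers.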

16.
Prickly lettuce (Lactuca serriola L.) is a problematic weed of the Pacific Northwest that has recently developed resistance to the auxinic herbicide 2,4-D. There are no publicly available simple sequence repeat (SSR) markers to tag 2,4-D resistance genes in L. serriola. Therefore, a study was conducted to develop SSR markers from expressed sequence tags (ESTs) of 5 Lactuca species. A total of 15,970 SSRs were identified among 57,126 EST assemblies belonging to 5 Lactuca species. SSR-containing ESTs (SSR-ESTs) ranged from 6.23% to 7.87%, and SSR densities ranged from 1.28 to 2.51 kb⁻¹ among the ESTs of the 5 Lactuca species. Trinucleotide repeats were the most abundant SSRs detected during the study. As a representative sample, 45 ESTs carrying class I SSRs (≥ 20 nucleotides) were selected for designing primers and were also searched against the dbEST entries for L. sativa and Helianthus annuus (E-value ≤ 10⁻⁵⁰; score ≥ 100). In silico analysis of the 45 SSR-ESTs showed 82% conservation across species and 68% conservation across genera. Primer pairs synthesized for these 45 EST-SSRs were used to study genetic diversity among a collection of 22 L. serriola biotypes. Comparison of the resultant dendrogram to that developed using phenotypic evaluation of the same subset of lines showed limited correspondence. Taken together, this study reported a collection of useful SSR markers for L. serriola, confirmed transferability of these markers within and across genera, and demonstrated their usefulness in studying genetic diversity.

17.
Sets of polymorphic expressed sequence tag–simple sequence repeat (EST-SSR) markers from the rubber tree (Hevea brasiliensis) have been published by many researchers, but none has been specifically developed to study latex and wood yield traits. In this study, a total of 10,321 rubber tree EST sequences, generated from suppression subtractive hybridization cDNA libraries of bark and latex of high- and low-yielding clones, were used as sources for SSR searching. A total of 432 EST-SSR loci were identified and it was possible to design primer pairs for a subset of 298 EST-SSRs. The highest proportion of EST-SSRs was represented by dinucleotide repeats (46.6%), followed by trinucleotide repeats (44.3%). Based on BLASTX analysis, 234 ESTs (80%) showed similarity to genes in NCBI databases and could be divided into 120 putative proteins with known function and 114 unknown proteins. To enhance the resolution of an existing linkage map from previous work on a rubber tree RRIM600 × PB217 population, 69 EST-SSR markers from the above set were tested for integration into the reference genetic map. The enriched map of 18 linkage groups spanned 2054.2 cM in length, showed an average genetic distance of 4.3 cM between adjacent markers, and included 63 new EST-SSR markers. The enhanced map from this study provides a basis for comparative mapping using PCR-based markers and identification of expressed genes possibly affecting important traits of interest.

18.
Characterization of EST-SSRs in loblolly pine and spruce   Total citations: 3, self-citations: 0, citations by others: 3
In the first large study of conifer expressed sequence tag-simple sequence repeats (EST-SSRs), two large conifer EST databases were characterized for EST-SSRs. One database was from “interior spruce” (white and Engelmann spruce in Southern British Columbia) and Sitka spruce, while the other was from loblolly pine. We found 475 and 629 unique EST-SSRs in loblolly pine and spruce, respectively. 3′ ESTs contained 14% more SSRs than 5′ EST reads in loblolly pine and 41% more in spruce. Conifer EST-SSRs differed conspicuously from angiosperm EST-SSRs in several aspects. EST-SSRs were considerably less frequent in conifers (one EST-SSR every ∼50 kb) than in angiosperms (one EST-SSR every ∼20 kb). Dinucleotide repeats were the most abundant repeat class in conifers, while in angiosperms, trinucleotides were most common. Finally, the AT motif was the dominant motif recovered in both conifer species, whereas AG was the most common dinucleotide repeat in angiosperms. Because these conifer EST-SSRs could be developed into useful genetic markers, our work also demonstrates the value of large-scale EST sequencing projects for in silico approaches to marker development.

19.
We report on the data mining of publicly available Litopenaeus vannamei expressed sequence tags (ESTs) to generate simple sequence repeat (SSR) markers and on their transferability between related Penaeid shrimp species. Repeat motifs were found in 3.8% of the evaluated ESTs at a frequency of one repeat every 7.8 kb of sequence data. A total of 206 primer pairs were designed, and 112 loci were amplified with the highest success in L. vannamei. A high percentage (69%) of EST-SSRs were transferable within the genus Litopenaeus. More than half of the amplified products were polymorphic in a small testing panel of L. vannamei. Evaluation of those primers in a larger testing panel showed that 72% of the markers fit Hardy-Weinberg equilibrium, which shows their utility for population genetic analysis. Additionally, a set of 26 of the EST-SSRs were evaluated for Mendelian segregation. A high percentage of monomorphic markers (46%) proved to be polymorphic by single-stranded conformational polymorphism analysis. Because of the high number of ESTs available in public databases, a data mining approach similar to the one outlined here might yield high numbers of SSR markers in many animal taxa.
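
Whether a marker "fits Hardy-Weinberg equilibrium" is typically judged by comparing observed genotype counts with those expected from the allele frequencies. The sketch below shows the simplest case, a chi-square statistic for a locus with two alleles; the abstract does not describe the test actually used, real SSR loci often have more than two alleles, and the function name `hwe_chi_square` is hypothetical.

```python
def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square statistic (1 degree of freedom) for Hardy-Weinberg fit at a two-allele locus."""
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    # Classes with zero expected count (a fixed allele) contribute nothing.
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


print(hwe_chi_square(25, 50, 25))  # matches expectation exactly
print(hwe_chi_square(50, 0, 50))   # no heterozygotes at all
```

A statistic above about 3.84 would reject equilibrium at the 5% level for one degree of freedom; with small panels an exact test is usually preferred over the chi-square approximation.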

20.
In order to construct a saturated genetic map and facilitate marker-assisted selection (MAS) breeding, it is necessary to enhance the current reservoir of known molecular markers in Gossypium. Microsatellites or simple sequence repeats (SSRs) occur in expressed sequence tags (ESTs) in plants (Kantety et al., Plant Mol Biol 48:501–510, 2002). Many ESTs are now publicly available and represent a good tool for developing EST-SSRs. From 13,505 ESTs developed from our two cotton fiber/ovule cDNA libraries constructed for Upland cotton, 966 (7.15%) contained one or more SSRs, and from them, 489 EST-SSR primer pairs were developed. Among the EST-SSRs, 59.1% are trinucleotides, followed by dinucleotides (30%), tetranucleotides (6.4%), hexanucleotides (2.7%), and pentanucleotides (1.8%). AT/TA (18.4%) is the most frequent repeat, followed by CTT/GAA (5.3%), AG/TC (5.1%), AGA/TCT (4.9%), AGT/TCA (4.5%), and AAG/TTC (4.5%). One hundred and thirty EST-SSR loci were produced from 114 informative EST-SSR primer pairs, which generated polymorphism between our two mapping parents. Of these, 123 were integrated on our allotetraploid cotton genetic map, based on the cross [(TM-1×Hai7124)TM-1]. EST-SSR markers were distributed over 20 chromosomes and 6 linkage groups in the map. These EST-SSR markers can be used in genetic mapping, identification of quantitative trait loci (QTLs), and comparative genomics studies of cotton. Zhiguo Han and Changbiao Wang contributed equally to this work.
