Similar articles
Found 20 similar articles (search time: 15 ms)
1.
Expressed sequence tag (EST) databases represent a potentially valuable resource for the development of molecular markers for use in evolutionary studies. Because EST-derived markers come from transcribed regions of the genome, they are likely to be conserved across a broader taxonomic range than are other sorts of markers. This paper describes a case study in which the publicly available cultivated sunflower (Helianthus annuus) EST database was used to develop simple sequence repeat (SSR) markers for use in the genetic analysis of a rare sunflower species, Helianthus verticillatus, as well as the more widespread Helianthus angustifolius. EST-derived SSRs were found to be more than 3 times as transferable across species as anonymous SSRs (73% vs. 21%, respectively). Moreover, EST-SSRs whose primers were located within protein-coding sequence were more readily transferable than those derived from untranslated regions, and the former loci were no less variable than the latter. The utility of existing EST databases as a means for facilitating population genetic analyses in plants was further explored by cross-referencing publicly available EST resources against available lists of rare or invasive flowering plant taxa. This survey revealed that more than one-third of all plant-derived EST collections of sufficient size could conceivably serve as a source of EST-SSRs for the analysis of rare, endangered, or invasive plant species worldwide.

2.
Microsatellite, or simple sequence repeat (SSR), loci can be identified by mining expressed sequence tag (EST) databases, and where these are available, marker development time and expense can be decreased considerably compared with conventional strategies of probing the entire genome. However, it is unclear whether they provide information on population structure similar to that generated by anonymous genomic SSRs. We performed comparative population genetic analyses between EST-derived SSRs (EST-SSRs) and anonymous SSRs developed from genomic DNA for the same set of populations of the insect Diabrotica virgifera, a beetle in the family Chrysomelidae. Compared with SSRs from noncoding, nontranscribed regions, EST-SSRs were generally less polymorphic but had reduced occurrence of null alleles and greater cross-species amplification. Neutrality tests suggested the loci were not under positive selection. Across all populations and all loci, the genomic and EST-SSRs performed similarly in estimating genetic diversity, F(IS), F(ST), population assignment and exclusion tests, and detection of distinct populations. These findings, therefore, indicate that the EST-SSRs examined can be used with confidence in future genetic studies of Diabrotica populations and suggest that EST libraries can be added as a valuable source of markers for population genetics studies in insects and other animals.

3.
Teleost fish genome projects involving model species are resulting in a rapid accumulation of genomic and expressed DNA sequences in public databases. The expressed sequence tags (ESTs) collected in the databases can be mined for the analysis of both structural and functional genomics. In this study, we analyzed in silico 49,430 unigenes representing a total of 692,654 ESTs from four model fish for their potential use in developing simple sequence repeats (SSRs), or microsatellites. After bioinformatic mining, a total of 3,018 EST-derived SSRs (EST-SSRs) were identified in 2,335 SSR-containing ESTs (SSR-ESTs). The frequency of identified SSR-ESTs ranged from 1.5% for Xiphophorus to 7.3% for zebrafish. The dinucleotide repeat motif is the most abundant SSR, accounting for 47%, 52%, 64%, and 78% in medaka, Fundulus, zebrafish, and Xiphophorus, respectively. Simulation analysis suggests that a majority of these EST-SSRs have sufficient flanking sequences for polymerase chain reaction (PCR) primer design. Comparative DNA sequence analyses of SSR-ESTs identified several cross-species SSRs and sequences that may be used as cross-reference genes in comparative studies. For example, the flanking sequences of one SSR, (CTG)n, within the pituitary tumor-transforming gene (PTTG) 1 interacting protein (PTTGIP) showed conservation spanning the medaka, Fundulus, human, and mouse genomes. This study provides a large body of information on EST-SSRs that can be useful for the development of polymorphic markers, gene mapping, and comparative genome analysis. Functional analysis of these SSR-ESTs may reveal their role in metabolism and gene evolution of these model species.

4.
Simple sequence repeats (SSRs) developed from expressed sequence tags (ESTs), known as EST-SSRs, are a widely used and potentially valuable source of gene-based markers because of their high cross-taxon portability and their rapid, less expensive development. EST sequence information in publicly available databases is increasing at a rapid rate. Emerging computational approaches offer a better way to develop SSR markers from ESTs than conventional methods. In the present study, 12,851 EST sequences of Camellia sinensis, downloaded from the National Center for Biotechnology Information (NCBI), were mined for the development of microsatellites. After preprocessing and assembly of these sequences using various computational tools, 6,148 non-redundant EST sequences (4,779 singletons and 1,369 contigs) were obtained. Out of a total of 3,822.68 kb of sequence examined, 1,636 EST sequences (26.61%) containing 2,371 SSRs were detected, a density of 1 SSR per 1.61 kb, leading to the development of 245 primer pairs. These mined EST-SSR markers will aid further study of variability, mapping, and evolutionary relationships in Camellia sinensis. In addition, the developed SSRs can be applied in studies across species.

5.
The increasing availability of expressed sequence tags (ESTs) in wheat (Triticum aestivum) and related cereals provides a valuable resource of non-anonymous DNA molecular markers. We examined 170,746 wheat ESTs from the public (International Triticeae EST Cooperative) and Génoplante databases, previously clustered in contigs, for the presence of di- to hexanucleotide simple sequence repeats (SSRs). Analysis of 46,510 contigs identified 3,530 SSRs, which represented 7.5% of the total number of contigs. Only 74% of the sequences allowed primer pairs to be designed, 70% led to an amplification product, mainly of a high quality (68%), and 53% exhibited polymorphism for at least one cultivar among the eight tested. Even though dinucleotide SSRs were less represented than trinucleotide SSRs (15.5% versus 66.5%, respectively), the former showed a much higher polymorphism level (83% versus 46%). The effect of the number and type of repeats is also discussed. The development of new EST-SSR markers will have important implications for the genetic analysis and exploitation of the genetic resources of wheat and related species and will provide a more direct estimate of functional diversity.

6.
Microsatellites are the markers of choice because of their high abundance, reproducibility, degree of polymorphism and co-dominant nature. They are mainly used for studying genetic variability in different species and for marker-assisted selection. Expressed sequence tags (ESTs) serve as the main resource for simple sequence repeats (SSRs). The computational approach to detecting SSRs and developing SSR markers from ESTs is preferred over conventional methods because it greatly reduces time and cost. The available EST sequence databases, various web interfaces and standalone tools provide a platform for easy analysis of EST sequences, leading to the development of potential EST-SSR markers. This paper is an overview of the in silico approach to developing SSR markers from EST sequences using some of the most efficient tools that are freely available for academic purposes.
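
As a rough illustration of the in silico SSR detection step that entry 6 surveys, the Python sketch below scans a sequence for perfect di- to hexanucleotide repeats using regular expressions. The function name `find_ssrs` and the minimum repeat counts are assumptions chosen for illustration; they are not taken from any of the tools or studies listed here, and dedicated SSR-mining tools also handle compound and imperfect repeats, which this sketch ignores.

```python
import re

# Minimum number of repeat units per motif length. These thresholds are
# illustrative assumptions, not values taken from the abstracts above.
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # A k-mer followed by at least n-1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AAA",
            # "ATAT"); those belong to a shorter motif class.
            if any(motif == motif[:d] * (k // d)
                   for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


if __name__ == "__main__":
    example = "GGC" + "AT" * 7 + "CCGT" + "AAG" * 5 + "TT"
    for start, motif, count in find_ssrs(example):
        print(start, motif, count)
```

Start positions are 0-based. In a real pipeline this step would typically run on assembled, non-redundant ESTs, followed by primer design in the flanking sequence.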

7.
Characterization of EST-SSRs in loblolly pine and spruce   Cited by: 3 (self-citations: 0; citations by others: 3)
In the first large study of conifer expressed sequence tag-simple sequence repeats (EST-SSRs), two large conifer EST databases were characterized for EST-SSRs. One database was from “interior spruce” (white and Engelmann spruce in Southern British Columbia) and Sitka spruce, while the other was from loblolly pine. We found 475 and 629 unique EST-SSRs in loblolly pine and spruce, respectively. 3′ ESTs contained 14% more SSRs than 5′ EST reads in loblolly pine and 41% more in spruce. Conifer EST-SSRs differed conspicuously from angiosperm EST-SSRs in several aspects. EST-SSRs were considerably less frequent in conifers (one EST-SSR every ∼50 kb) than in angiosperms (one EST-SSR every ∼20 kb). Dinucleotide repeats were the most abundant repeat class in conifers, while in angiosperms, trinucleotides were most common. Finally, the AT motif was the dominant motif recovered in both conifer species, whereas AG was the most common dinucleotide repeat in angiosperms. Also, as these EST-SSRs in conifers could be developed into useful genetic markers, our work demonstrates the value of large-scale EST sequencing projects for in silico approaches to marker development.

8.
SSR (simple sequence repeat) markers derived from ESTs (expressed sequence tags), commonly called EST‐SSRs or genic SSRs, provide useful genetic markers for crop improvement. They are easy and economical to develop as by‐products of the large‐scale EST resources that have become available as part of functional genomic studies in many plant species. Here, we describe for the first time nine genic‐SSRs of coffee that were developed from microsatellite‐containing ESTs from a cDNA library of moisture‐stressed leaves of the coffee variety ‘CxR’ (a commercial interspecific hybrid between Coffea congensis and Coffea canephora). The markers show considerable allelic diversity, with PIC values up to 0.70 and 0.75 for Coffea arabica and Coffea canephora, respectively, and robust cross‐species amplification in 16 other related taxa of coffee. The validation studies thus demonstrate the potential utility of the EST‐SSRs for genetic analysis of coffee germplasm.

9.
With the advent of high-throughput sequencing technology, sequences from many genomes are being deposited in public databases at a brisk rate. Open access to large amounts of expressed sequence tag (EST) data in the public databases has provided a powerful platform for simple sequence repeat (SSR) development in species where sequence information is not available. SSRs are markers of choice because of their high reproducibility, abundant polymorphism and high inter-specific transferability. The mining of SSRs from ESTs requires several high-throughput computational tools that must be executed individually, which is computationally intensive and time-consuming. To reduce the time lag and to streamline the cumbersome process of SSR mining from ESTs, we have developed a user-friendly, web-based EST-SSR pipeline, "EST-SSR-MARKER PIPELINE (ESMP)". This pipeline integrates EST pre-processing, clustering, assembly and the subsequent mining of SSRs from assembled EST sequences. The mining of SSRs from ESTs provides valuable information on the abundance of SSRs in ESTs and will facilitate the development of markers for genetic analysis and related applications such as marker-assisted breeding. AVAILABILITY: The database is available for free at http://bioinfo.aau.ac.in/ESMP.

10.
Simple sequence repeats (SSRs) derived from expressed sequence tags (ESTs) are valuable markers because they represent transcribed regions and are often transferable to related taxa. Here, we report the development and characterization of EST-SSRs from Shorea leprosula. Fifty-four sequences containing SSRs were identified in 2003 unigenes assembled from 3159 ESTs. Twenty-four EST-SSRs were developed, of which four gave multiple amplifications, five were found to be monomorphic and 15 showed polymorphism, with allele numbers ranging from two to 17 in a single Pasoh Forest Reserve population of 24 individuals. The observed and expected heterozygosities ranged from 0.05 to 0.91 and from 0.16 to 0.93, respectively. Tests of cross-species transferability of the 15 loci to 36 species within Dipterocarpaceae showed that between four and 14 loci gave positive amplification per species, and 10 loci were found to be transferable to more than 15 species.

11.
A total of 5,521 expressed sequence tags (ESTs) from oil palm were used to search for type and frequency of simple sequence repeat (SSR) markers. Dimeric repeat motifs appeared to be the most abundant, followed by tri-nucleotide repeats. Redundancy was eliminated in the original EST set, resulting in 145 SSRs in 136 unique ESTs (114 singletons and 22 clusters). Primers were designed for 94 (69.1%) of the unique ESTs (consisting of 14 consensus and 80 singletons). Primers for 10 EST-SSRs were developed and used to evaluate the genetic diversity of 76 accessions of oil palm originating from seven countries in Africa, and the standard Deli dura population. The average number of observed and effective alleles was 2.56 and 1.84, respectively. The EST-SSR markers were found to be polymorphic, with a mean polymorphic information content value of 0.53. Genetic differentiation (F(ST)) among the populations studied was 0.2492, indicating a high level of genetic divergence. Moreover, the UPGMA (unweighted pair-group method with arithmetic mean) analysis revealed a strong association between genetic distance and geographic location of the populations studied. The germplasm materials exhibited higher diversity than Deli dura, indicating their potential usefulness in oil palm improvement programmes. The study also revealed that the populations from Nigeria, Congo and Cameroon showed the highest diversity among the germplasm evaluated in this study. The EST-SSRs further demonstrated their worth as a new source of polymorphic markers for phylogenetic analysis, since a high percentage of the markers showed transferability across species and palm taxa.
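
Several entries in this list report per-locus diversity statistics such as expected heterozygosity, effective number of alleles and polymorphic information content (PIC). As a hedged sketch of how these are commonly computed from allele frequencies at a single locus, the Python function below uses the standard textbook formulas (PIC following Botstein et al., 1980). The function name `allele_stats` and the example frequencies are hypothetical; the abstracts do not state which software or exact estimators the authors used.

```python
from itertools import combinations


def allele_stats(freqs):
    """Diversity statistics for one locus from its allele frequencies.

    freqs: iterable of allele frequencies that sum to 1.
    Returns expected heterozygosity (He), effective number of alleles (Ne)
    and polymorphic information content (PIC).
    """
    freqs = list(freqs)
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in freqs)
    he = 1.0 - homozygosity              # He = 1 - sum(p_i^2)
    ne = 1.0 / homozygosity              # Ne = 1 / sum(p_i^2)
    # PIC = He - sum over allele pairs i<j of 2 * p_i^2 * p_j^2
    pic = he - sum(2.0 * p * p * q * q for p, q in combinations(freqs, 2))
    return {"He": he, "Ne": ne, "PIC": pic}


if __name__ == "__main__":
    # Hypothetical locus with three alleles.
    print(allele_stats([0.6, 0.3, 0.1]))
```

PIC is always at most He, and the two converge as the number of alleles grows, which is why highly multi-allelic SSR loci tend to report values of both close to 1.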

12.
We screened for simple sequence repeats (SSRs) found in ESTs derived from an EST-database development project ('Marine Genomics Europe' Network of Excellence). Different motifs of di-, tri-, tetra-, penta- and hexanucleotide SSRs were evaluated for variation in length and position in the expressed sequences, relative abundance and distribution in gilthead sea bream (Sparus aurata). We found 899 ESTs that harbor 997 SSRs (4.94%). On average, one SSR was found per 2.95 kb of EST sequence, and dinucleotide SSRs were the most abundant, accounting for 47.6% of the total number. EST-SSRs were used as templates for primer design. A total of 664 primer pairs were successfully identified, and a subset of 206 primer pairs was synthesized, PCR-tested and visualized on ethidium bromide-stained agarose gels. The main objective was to further assess the potential of EST-SSRs as informative markers and investigate their cross-species amplification in sixteen teleost fish species: seven sparid species and nine other species from different families. Approximately 78% of the primer pairs gave PCR products of expected size in gilthead sea bream, and as expected, the rate of successful amplification of sea bream EST-SSRs was higher in sparids, lower in other perciforms and even lower in species of the Clupeiform and Gadiform orders. We finally determined the polymorphism and the heterozygosity of 63 markers in a wild gilthead sea bream population; fifty-eight loci were found to be polymorphic, with the expected heterozygosity and the number of alleles ranging from 0.089 to 0.946 and from 2 to 27, respectively. These tools and markers are expected to enhance the available genetic linkage map in gilthead sea bream, to assist comparative mapping and genome analyses for this species and, further, with other model fish species, and finally to help advance genetic analysis for cultivated and wild populations and accelerate breeding programs.

13.
Although lily is the second largest flower crop in the cut-flower market, only six simple sequence repeats (SSRs) have been reported. Thus, we developed expressed sequence tag-derived SSRs (EST-SSRs) for the Lilium genus. Among 2,235 unique ESTs, 754 ESTs contained SSR motifs, among which 165 ESTs were amenable to primer design. Among these 165 EST-SSRs, 131 EST-SSRs showed amplification in at least one Lilium species, and 76 EST-SSRs showed amplification in at least nine species. Of the 76 EST-SSRs, 47 showed amplification in all Lilium species analyzed. Using 10 breeding lines, we selected 21 EST-SSRs that had the highest number of alleles and polymorphism information content. The polymorphism information content values of these selected EST-SSRs ranged from 0.49 to 0.94 with an average of 0.76, which is higher than values reported for other plant species. The phylogenetic dendrogram derived from the amplification profiles of the 21 highly polymorphic EST-SSRs was congruent with the genetic background of the 84 selected lily accessions and hybrids, which are commercially available. Thus, the developed EST-SSRs will be very useful in germplasm management, genetic diversity analysis, cultivar fingerprinting, and molecular breeding in the lily.

14.
Microsatellites, or simple sequence repeats (SSRs), are usually regarded as the markers of choice in population genetics research because they exhibit high variability. The development cost of these markers is usually high. In addition, microsatellite primers developed for one species often do not cross-amplify in related species, requiring separate development for each species. However, microsatellites found in expressed sequence tags (ESTs) might better cross-amplify as they reside in or near conserved coding DNA. In this study, we identified 14 Pinus taeda (loblolly pine) EST-SSRs from public EST databases and tested for their cross-species transferability to P. contorta ssp. latifolia, P. ponderosa, and P. sylvestris. As part of our development of a P. contorta microsatellite set, we also compared their transferability to that of 99 traditional microsatellite markers developed in P. taeda and tested on P. contorta ssp. latifolia. Compared to traditional microsatellites, EST-SSRs had higher transfer rates across pine species; however, the level of polymorphism of microsatellites derived from ESTs was lower. Sequence analyses revealed that the frequencies of insertions/deletions and base substitutions were lower in EST-SSRs than in other types of microsatellites, confirming that EST-SSRs are more conserved than traditional SSRs. Our results also provide a battery of 23 polymorphic, robust microsatellite primer pairs for lodgepole pine. Communicated by O. Savolainen

15.
Microsatellites, also called simple sequence repeats (SSRs), are markers of choice to estimate relevant parameters for conservation genetics, such as migration rates, effective population size and kinship. Cross‐amplification of SSRs is the simplest way to obtain sets of markers, and highly conserved SSRs have recently been developed from expressed sequence tags (EST) to improve SSR cross‐species utility. As EST‐SSRs are located in coding regions, the higher stability of their flanking regions reduces the frequency of null alleles and improves cross‐species amplification. However, EST‐SSRs generally have less allelic variability than genomic SSRs, potentially leading to differences in estimates of population genetic parameters such as genetic differentiation. To assess the potential of EST‐SSRs in studies of within‐species genetic diversity, we compared the relative performance of EST‐ and genomic SSRs following a multispecies approach on passerine birds. We tested whether patterns and levels of genetic diversity within and between populations assessed from EST‐ and from genomic SSRs are congruent, and we investigated how the relative efficiency of EST‐ and genomic SSRs is influenced by levels of differentiation. EST‐ and genomic SSRs ensured comparable inferences of population genetic structure in cases of strong genetic differentiation, and genomic SSRs performed slightly better than EST‐SSRs when differentiation was moderate. Interestingly, however, EST‐SSRs had a higher power to detect weak genetic structure compared to genomic SSRs. Our study attests that EST‐SSRs may be valuable molecular markers for conservation genetic studies in taxa such as birds, where the development of genomic SSRs is impeded by their low frequency.

16.
Expressed sequence tag (EST)-derived simple sequence repeats (SSRs, microsatellites) were screened and identified from 3,863 almond and 10,185 peach EST sequences, and the spectra of SSRs in the non-redundant EST sequences were investigated after sequence assembly. One hundred seventy-eight (12.07%) almond SSRs and 497 (9.97%) peach SSRs were detected. An EST-SSR occurs every 4.97 kb in almond ESTs and every 6.57 kb in peach, and SSRs with di- and trinucleotide repeat motifs are the most abundant in both almond and peach ESTs. Twenty-one EST-SSRs were then developed and used, together with 7 genomic SSRs, to study the genetic relationships among 36 almond (P. communis Fritsch.) cultivars from China and the Mediterranean area, as well as 8 accessions of other related species from the genus Prunus. Both EST-derived and genomic SSR markers showed high cross-species transferability in the genus. Out of the 112 polymorphic alleles detected in the 36 cultivated almonds, 28 are specific to Chinese cultivars and 25 to the others. The 44 accessions were clustered into 4 groups in the phylogenetic tree, and the 36 almond cultivars formed two distinct subgroups, one containing only Chinese cultivars plus one cultivar of unknown origin, and the other containing only those originating from the Mediterranean area, indicating that Chinese almond cultivars have a distinct evolutionary history from the Mediterranean almond. Our preliminary results indicated that common almond was more closely related to peach (P. persica (L.) Batsch.) than to the four wild species of almond (P. mongolica Maxim., P. ledebouriana Schleche, P. tangutica Batal., and P. triloba Lindl.). The implications of these SSR markers for evolutionary analysis and molecular mapping of Prunus species are discussed.

17.
We report on the data mining of publicly available Litopenaeus vannamei expressed sequence tags (ESTs) to generate simple sequence repeat (SSR) markers and on their transferability between related Penaeid shrimp species. Repeat motifs were found in 3.8% of the evaluated ESTs at a frequency of one repeat every 7.8 kb of sequence data. A total of 206 primer pairs were designed, and 112 loci were amplified, with the highest success in L. vannamei. A high percentage (69%) of EST-SSRs were transferable within the genus Litopenaeus. More than half of the amplified products were polymorphic in a small testing panel of L. vannamei. Evaluation of those primers in a larger testing panel showed that 72% of the markers fit Hardy-Weinberg equilibrium, which shows their utility for population genetic analysis. Additionally, a set of 26 of the EST-SSRs was evaluated for Mendelian segregation. A high percentage of monomorphic markers (46%) proved to be polymorphic by single-strand conformational polymorphism analysis. Because of the high number of ESTs available in public databases, a data mining approach similar to the one outlined here might yield high numbers of SSR markers in many animal taxa.

18.
Microsatellites physically linked to expressed sequence tags (EST-SSRs) are an important resource for linkage mapping and comparative genomics, and data mining in publicly available EST databases is a common strategy for EST-SSR discovery. At present, many species lack species-specific EST sequence data needed for the efficient characterization of EST-SSRs. This paper describes the discovery and development of EST-SSRs for red drum (Sciaenops ocellatus), an estuarine-dependent sciaenid species of economic importance in the USA and elsewhere, using a phylogenetically informed, comparative genomics approach to primer design. The approach entailed comparing existing genomic resources from species closely allied phylogenetically to red drum, with resources from more distantly related outgroup species. By taking into account the degree to which flanking regions are conserved across taxa, the efficiency of PCR primer design was increased greatly. The amplification success rate for primers designed for red drum was 100% when using EST libraries from confamilial species and 92% when using an EST library from a species in the same suborder. The primers developed also amplified EST-SSRs in a wide range of perciform fishes, suggesting potential use in comparative genomics. This study demonstrates that EST-SSRs can be efficiently developed for an organism when limited species-specific data are available by exploiting genomic resources from well-studied species, even those at extended phylogenetic distances.

19.
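Distribution characteristics of microsatellites in ginkgo EST sequences   Cited by: 5 (self-citations: 0; citations by others: 5)
Using 21,590 ginkgo EST sequences downloaded from NCBI, this paper analyzed the distribution of EST-SSRs (expressed sequence tag microsatellites) in ginkgo EST sequences and compared SSR characteristics among EST sequences of different lengths. After removal of redundant and low-quality sequences, 7,961 non-redundant EST sequences with a total length of 5,708.385 kb were obtained, of which 405 EST sequences (5.09%) contained 475 SSRs. EST sequences 400-1000 bp in length contained 445 SSR loci, accounting for 93.68% of all SSRs. Dinucleotide and trinucleotide motifs were the main types of ginkgo EST-SSRs, accounting for 73.89% and 24.00% of all SSRs, respectively. The most common SSR motifs were (AT)_n, (AG)_n, (AC)_n, (AAG)_n and (AAT)_n. This mining and analysis of SSR locus information in ginkgo EST sequences lays a foundation for the targeted design of EST-SSR primers and the development of ginkgo EST-SSR molecular markers.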

20.
Microsatellites or SSRs (simple sequence repeats) have been used to construct and integrate genetic maps in crop species, including Phaseolus vulgaris. In the present study, 3 cDNA libraries generated by the Bean EST project (http://lgm.esalq.usp.br/BEST/), comprising a unigene collection of 3126 sequences, and a genomic microsatellite-enriched library were analyzed for the presence of SSRs. A total of 219 expressed sequence tags (ESTs) were found to carry 240 SSRs (named EST-SSR), whereas 714 genomic sequences contained 471 SSRs (named genomic-SSR). A subset of 80 SSRs (40 EST-SSRs and 40 genomic-SSRs) was evaluated for molecular polymorphism in 23 genotypes of cultivated beans from the Mesoamerican and Andean genetic pools, including Brazilian cultivars and 2 related species. Among the common bean genotypes, 31 EST-SSR loci were polymorphic, yielding 2-12 alleles, as compared with 26 polymorphic genomic-SSRs, accounting for 2-7 alleles. Cluster analysis of data from both genic and genomic-SSRs revealed a clear separation between Andean and Mesoamerican beans. The usefulness of these loci for distinguishing bean genotypes and genetic mapping is discussed.
