首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Mung bean (Vigna radiate (L.) Wilczek) is an important traditional food legume crop, with high economic and nutritional value. It is widely grown in China and other Asian countries. Despite its importance, genomic information is currently unavailable for this crop plant species or some of its close relatives in the Vigna genus. In this study, more than 103 million high quality cDNA sequence reads were obtained from mung bean using Illumina paired-end sequencing technology. The processed reads were assembled into 48,693 unigenes with an average length of 874 bp. Of these unigenes, 25,820 (53.0%) and 23,235 (47.7%) showed significant similarity to proteins in the NCBI non-redundant protein and nucleotide sequence databases, respectively. Furthermore, 19,242 (39.5%) could be classified into gene ontology categories, 18,316 (37.6%) into Swiss-Prot categories and 10,918 (22.4%) into KOG database categories (E-value < 1.0E-5). A total of 6,585 (8.3%) were mapped onto 244 pathways using the Kyoto Encyclopedia of Genes and Genome (KEGG) pathway database. Among the unigenes, 10,053 sequences contained a unique simple sequence repeat (SSR), and 2,303 sequences contained more than one SSR together in the same expressed sequence tag (EST). A total of 13,134 EST-SSRs were identified as potential molecular markers, with mono-nucleotide A/T repeats being the most abundant motif class and G/C repeats being rare. In this SSR analysis, we found five main repeat motifs: AG/CT (30.8%), GAA/TTC (12.6%), AAAT/ATTT (6.8%), AAAAT/ATTTT (6.2%) and AAAAAT/ATTTTT (1.9%). A total of 200 SSR loci were randomly selected for validation by PCR amplification as EST-SSR markers. Of these, 66 marker primer pairs produced reproducible amplicons that were polymorphic among 31 mung bean accessions selected from diverse geographical locations. The large number of SSR-containing sequences found in this study will be valuable for the construction of a high-resolution genetic linkage maps, association or comparative mapping and genetic analyses of various Vigna species.  相似文献   

2.
3.
4.
Faba bean (Vicia faba L.) is an important food legume crop with a huge genome. Development of genetic markers for faba bean is important to study diversity and for molecular breeding. In this study, we used Next Generation Sequencing (NGS) technology for the development of genomic simple sequence repeat (SSR) markers. A total of 14,027,500 sequence reads were obtained comprising 4,208 Mb. From these reads, 56,063 contigs were assembled (16,367 Mb) and 2138 SSRs were identified. Mono and dinucleotides were the most abundant, accounting for 57.5 % and 20.9 % of all SSR repeats, respectively. A total of 430 primer pairs were designed from contigs larger than 350 nucleotides and 50 primers pairs were tested for validation of SSR locus amplification. Nearly all (96 %) of the markers were found to produce clear amplicons and to be reproducible. Thirty-nine SSR markers were then applied to 46 faba bean accessions from worldwide origins, resulting in 161 alleles with 87.5 % polymorphism, and an average of 4.1 alleles per marker. Gene diversity (GD) of the markers ranged from 0 to 0.48 with an average of 0.27. Testing of the markers showed that they were useful in determining genetic relationships and population structure in faba bean accessions.  相似文献   

5.
Walnut (Juglans regia), an economically important woody plant, is widely cultivated in temperate regions for its timber and nutritional fruits. Despite abundant studies in germplasm, systemic molecular evaluations of walnut are sparsely reported mainly due to the limited molecular markers available. Expressed sequence tags (EST) provide a valuable resource for developing simple sequence repeat (SSR) markers. In this study, a total of 5,025 walnut ESTs (covering 16.41 Mb) were retrieved from the National Center for Biotechnology Information database. The SSR motifs were then analyzed by the SSRHunter software. In total, 398 SSRs were obtained with an average frequency of 1/4.08 kb. Dinucleotide (di-) repeat motifs accounted for 69.85% of all SSRs, followed by trinucleotide (tri-) with a frequency of 27.64%, while low frequency (2.51%) of tetranucleotide (tetra-) to hexanucleotide (hexa-) was observed. Meanwhile, GCA and TC motifs were prevalent among di- and tri- loci, respectively. Subsequently, a total of 123 primer pairs were designed from the non-redundant SSR-containing unigenes with the selection threshold of SSR length set to 10 bp or more. To examine the efficiency of candidate markers, seven DNA pools were collected from geographically different accessions. Results demonstrated that 41 SSR primer sets could generate high polymorphic amplification products (33.3%), and these polymorphic loci were mainly located in the 3′-untranslated region. Annotation analysis revealed that only two of these 41 loci were located inside open reading frames of characterized proteins (E ≤ 1E−30).  相似文献   

6.
Rapeseed (Brassica napus) is the second most important oil crop in the world after soybean. The repertoire of simple sequence repeat (SSR) markers for rapeseed is limited and warrants a search for a larger number of polymorphic SSRs for germplasm characterization and breeding applications. In this study, a total of 5,310 SSR-containing unigenes were identified from a set of 46,038 B. napus unigenes with an average density of one SSR every 5.75?kb. A set of 1,000 expressed sequence tag (EST)-SSR markers with repeat length ??18?bp were developed and tested for their ability to detect polymorphism among a panel of six rapeseed varieties. Of these SSR markers, 776 markers detected clear amplification products, and 511 displayed polymorphisms among the six varieties. Of these polymorphic markers, 195 EST-SSR markers, corresponding to 233 loci, were integrated into an existing B. napus linkage map. These EST-SSRs were randomly distributed on the 19 linkage groups of B. napus. Of the mapped loci, 166 showed significant homology to Arabidopsis genes. Based on the homology, 44 conserved syntenic blocks were identified between B. napus and Arabidopsis genomes. Most of the syntenic blocks were consistent with the duplication and rearrangement events identified previously. In addition, we also identified three previously unreported blocks in B. napus. A subset of 40 SSRs was used to assess genetic diversity in a collection of 192 rapeseed accessions. The polymorphism information content of these markers ranged from 0.0357 to 0.6753 with an average value of 0.3373. These results indicated that the EST-SSR markers developed in this study are useful for genetic mapping, molecular marker-assisted selection and comparative genomics.  相似文献   

7.
Gene-derived simple sequence repeats (genic SSRs), also known as functional markers, are often preferred over random genomic markers because they represent variation in gene coding and/or regulatory regions. We characterized 544 genic SSR loci derived from 138 candidate genes involved in wood formation, distributed throughout the genome of Populus tomentosa, a key ecological and cultivated wood production species. Of these SSRs, three-quarters were located in the promoter or intron regions, and dinucleotide (59.7%) and trinucleotide repeat motifs (26.5%) predominated. By screening 15 wild P. tomentosa ecotypes, we identified 188 polymorphic genic SSRs with 861 alleles, 2–7 alleles for each marker. Transferability analysis of 30 random genic SSRs, testing whether these SSRs work in 26 genotypes of five genus Populus sections (outgroup, Salix matsudana), showed that 72% of the SSRs could be amplified in Turanga and 100% could be amplified in Leuce. Based on genotyping of these 26 genotypes, a neighbour-joining analysis showed the expected six phylogenetic groupings. In silico analysis of SSR variation in 220 sequences that are homologous between P. tomentosa and Populus trichocarpa suggested that genic SSR variations between relatives were predominantly affected by repeat motif variations or flanking sequence mutations. Inheritance tests and single-marker associations demonstrated the power of genic SSRs in family-based linkage mapping and candidate gene-based association studies, as well as marker-assisted selection and comparative genomic studies of P. tomentosa and related species.  相似文献   

8.
Brassica juncea is an economically important oilseed crop worldwide. It has limited genomic resources at present. We generated 47,962,057 expressed sequence reads which were assembled into 45,280 unigenes. A total of 4108 SSR loci (≥10 bp) were identified in these unigenes. Trinucleotide was the most frequent repeat unit (59.91 %) followed by di- (38.66 %), tetra - (0.71 %), hexa - (0.49 %) and pentanucleotide repeats (0.24 %). Primers were designed for 2863 SSR loci among which 460 were selected for primer synthesis. A total of 339 loci amplified successfully of which 134 (39.5 %) exhibited polymorphism among six B. juncea genotypes with PIC values ranging from 0.18 to 0.81. Further, 25 polymorphic SSRs were used for analysis of genetic variability in 25 genotypes of Brassicas and their wild relatives. Two to five alleles with PIC values 0.22–0.66 were detected at these loci. The dendrogram grouped the genotypes according to their known pedigree/systematic position.  相似文献   

9.
Expressed sequence tag (EST) databases offer opportunity for the rapid development of simple sequence repeat (SSR) markers in crops. Sequence assembly and clustering of 57?895 ESTs of castor bean resulted in the identification of 10?960 unigenes (6459 singletons and 4501 contigs) having 7429 SSRs. On an average, the unigenes contained 1 SSR for every 1.23?kb of unigene sequence. The identified SSRs mostly consisted of dinucleotide (62.4%) and trinucleotide (33.5%) repeats. The AG class was the most common among the dinucleotide motifs (68.9%), whereas the AAG class (25.9%) was predominant among the trinucleotide motifs. A total of 611 primer pairs were designed for the SSRs, having repeat length more than or equal to 20 nucleotides, of which a set of 130 markers were tested and 92 of these yielding robust amplicons were analyzed for their utility in genetic purity assessment of castor bean hybrids. Nine markers were able to detect polymorphism between the parental lines of nine commercial castor bean hybrids (DCH-32, DCH-177, DCH-519, GCH-2, GCH-4, GCH-5, GCH-6, GCH-7, and RHC-1), and their utility in genetic purity testing was demonstrated. These novel EST-SSR markers would be a valuable addition to the growing molecular marker resources that could be used in genetic improvement programmes of castor bean.  相似文献   

10.
11.
12.
13.
为全面了解余甘子转录组SSR位点的分布特征和变异规律,本研究利用Illumina Hiseq 4000平台对余甘子叶片转录组进行测序,通过MISA软件对获得的Unigenes进行SSR位点搜索和统计分析。结果发现9 538条包含SSR位点的Unigenes,共检测到9 991个SSR位点,平均每5.49 kB出现1个SSR。单碱基和二碱基为余甘子转录组SSR主要重复类型,分别占SSR总数的42.3%和30.79%。位于基因编码区的SSR位点共有1 731个,出现频率为0.039 SSRs/kB,优势重复类型为三碱基重复。余甘子转录组SSR中共有169种重复基元,其中所占比例最高的是A/T(42.10%),其次是AG/CT(22.91%)和AAG/CTT(5.02%)。SSR各基元的重复次数波动于4~75次,且多数集中于4~20次。重复片段长度≥ 20 bp的SSR占21.20%,且SSR发生频率与片段长度呈显著负相关(P<0.01),相关系数为-0.561。本研究获得的余甘子转录组SSR位点出现频率较高、分布密度较大、低级重复基元较多,重复次数较高、长片段较多,大多数SSR位点的多态性潜能较高,用于余甘子遗传多样性分析的潜力较大,为下一步余甘子转录组SSR标记的大规模开发和群体遗传学研究提供了重要的数据信息,进而为余甘子野生资源的保护和合理开发利用提供了参考依据。  相似文献   

14.
Opium poppy (Papaver somniferum L.) is an important pharmaceutical crop with very few genetic marker resources. To expand these resources, we sequenced genomic DNA using pyrosequencing technology and examined the DNA sequences for simple sequence repeats (SSRs). A total of 1,244,412 sequence reads were obtained covering 474 Mb. Approximately half of the reads (52 %) were assembled into 166,724 contigs representing 105 Mb of the opium poppy genome. A total of 23,283 non-redundant SSRs were identified in 18,944 contigs (11.3 % of total contigs). Trinucleotide and tetranucleotide repeats were the most abundant SSR repeats, accounting for 49.0 and 27.9 % of all SSRs, respectively. The AAG/TTC repeat was the most abundant trinucleotide repeat, representing 19.7 % of trinucleotide repeats. Other SSR repeat types were AT-rich. A total of 23,126 primer pairs (98.7 % of total SSRs) were designed to amplify SSRs. Fifty-three genomic SSR markers were tested in 37 opium poppy accessions and seven Papaver species for determination of polymorphism and transferability. Intraspecific polymorphism information content (PIC) values of the genomic SSR markers were intermediate, with an average 0.17, while the interspecific average PIC value was slightly higher, 0.19. All markers showed at least 88 % transferability among related species. This study increases sequence coverage of the opium poppy genome by sevenfold and the number of opium poppy-specific SSR markers by sixfold. This is the first report of the development of genomic SSR markers in opium poppy, and the genomic SSR markers developed in this study will be useful in diversity, identification, mapping and breeding studies in opium poppy.  相似文献   

15.
The family Solanaceae is the source of several economically important plants. The aim of this study was to trace and characterize simple sequence repeat (SSR) markers from unigene sequences of Solanum lycopersicum, an important member of family Solanaceae. 18,228 unigene sequences of Solanum lycopersicum was taken in order to develop SSR markers and analyzed for the in-silico design of PCR primers. A total of 12,090 (66.32 %) unigenes containing 17,524 SSRs (microsatellites) were identified. The average frequency of microsatellites in unigenes was one in every 1.3 kb of sequence. The analysis revealed that trinucleotide motifs, coding for Glutamic acid (GAA) and AT/TA were the most frequent repeat of dinucleotide SSRs. Flanking sequences of the SSRs generated 877 primers with forward and reverse strands. Functional categorization of SSRs containing unigenes was done through gene ontology terms like Biological process, Cellular component and Molecular function.  相似文献   

16.
Ricinus communis is a versatile industrial oil crop that is cultivated worldwide. Genetic improvement and marker-assisted breeding of castor bean have been slowed owing to the lack of abundant and efficient molecular markers. As co-dominant markers, simple sequence repeats (SSRs) are useful for genetic evaluation and molecular breeding. The recently released whole-genome sequence of castor bean provides useful genomic resources for developing markers on a genome-wide scale. In the present study, the distribution and frequency of microsatellites in the castor bean genome were characterised and numerous SSR markers were developed using genomic data mining. In total, 18,647 SSR loci at a density of one SSR per 18.89 Kb in the castor bean genome sequence (representing approximately 352.27 Mb) were identified. Dinucleotide repeats were the most frequently observed microsatellites, although the AAT repeat motif was also prevalent. Using six cultivars as screening samples, 670 polymorphic SSR markers from 1,435 primer pairs (46.7 %) were developed. Trinucleotide motif loci contained a higher proportion of polymorphisms (48.5 %) than dinucleotide motif loci (39.2 %). The polymorphism level in the SSR loci was positively correlated with the increasing number of repeat units in the microsatellites. The phylogenetic relationship among 32 varieties was evaluated using the developed SSR markers. Cultivars developed at the same institute clustered together, suggesting that these cultivars have a narrow genetic background. The large number of SSR markers developed in this study will be useful for genetic mapping and for breeding improved castor-oil plants. These markers will also facilitate genetic and genomic studies of Euphorbiaceae.  相似文献   

17.
Most studies on the genetic diversity of common bean (Phaseolus vulgaris L.) have focussed on accessions from the Mesoamerican gene pool compared to the Andean gene pool. A deeper knowledge of the genetic structure of Argentinian germplasm would enable researchers to determine how the Andean domestication event affected patterns of genetic diversity in domesticated beans and to identify candidates for genes targeted by selection during the evolution of the cultivated common bean. A collection of 116 wild and domesticated accessions representing the diversity of the Andean bean in Argentina was genotyped by means of 114 simple sequence repeat (SSR) markers. Forty-seven Mesoamerican bean accessions and 16 Andean bean accessions representing the diversity of Andean landraces and wild accessions were also included. Using the Bayesian algorithm implemented in the software STRUCTURE we identified five major groups that correspond to Mesoamerican and Argentinian wild accessions and landraces and a group that corresponds to accessions from different Andean and Mesoamerican countries. The neighbour-joining algorithm and principal coordinate clustering analysis confirmed the genetic relationships among accessions observed with the STRUCTURE analysis. Argentinian accessions showed a substantial genetic variation with a considerable number of unique haplotypes and private alleles, suggesting that they may have played an important role in the evolution of the species. The results of statistical analyses aimed at identifying genomic regions with consistent patterns of variation were significant for 35 loci (~20 % of the SSRs used in the Argentinian accessions). One of these loci mapped in or near the genomic region of the glutamate decarboxylase gene. Our data characterize the population structure of the Argentinian germplasm. This information on its diversity will be very valuable for use in introgressing Argentinian genes into commercial varieties because the majority of present-day common bean varieties are of Andean origin.  相似文献   

18.
19.
李白盾蚧Pseudaulacaspis prunicola (Maskell)寄主范围广泛,是一种重要的入侵害虫。本研究利用高通量测序平台(Illumina NovaSeq 6000)对李白盾蚧进行转录组测序、de novo从头组装及功能注释,在此基础上筛选其微卫星(SSR)位点,并挖掘微卫星引物。研究共获得李白盾蚧转录组60 296条转录本,24 967条单基因(unigenes)序列。通过GO数据库注释,将所有unigenes的功能分为生物学进程、细胞组分和分子功能三大类41个亚类功能区。KOG数据库注释结果显示,5 085条unigenes归到25个基因家族,注释到一般功能预测的数目最多。KEGG代谢通路富集分析显示6 668条unigenes注释到280个代谢通路,其中注释到内质网中蛋白质加工的数目最多。利用MISA软件共搜索到微卫星位点18 193个,分布在9 043条unigenes中,占总unigenes数量的36.22%,平均每2.29 kb出现一个SSR位点。其中主要重复类型为单核苷酸重复,占SSR位点总数的72.03%,其次为三核苷酸重复(15.90%)和二核苷酸重复(8.48%)。单核苷酸重复主要为A/T(71.16%),二核苷酸重复主要为AG/CT(5.20%)。基于Primer Primer 3软件设计出12 538对李白盾蚧SSR引物,从中随机挑选50对引物进行PCR验证,共29对引物可以稳定扩增出目的片段。本研究成功组装了李白盾蚧转录组数据,并基于转录组数据成功筛选出其微卫星位点,为未来该虫的种群遗传学以及入侵生物学研究提供了数据支撑。  相似文献   

20.
【目的】中华大仰蝽Notonecta chinensis为中国和日本冲绳分布的重要水生天敌昆虫,可用于蚊虫的生物防治。本研究旨在建立中华大仰蝽转录组数据库,挖掘其基因信息。【方法】采用高通量测序平台Illumina NextSeq500对中华大仰蝽进行转录组测序、de novo组装及生物信息学分析;利用MISA软件基于转录组unigenes数据进行SSR新分子标记筛选。毛细管电泳检测SSR多态性。【结果】总计获得34782282条clean reads(NCBI SRA数据库登录号:SRR13259254),组装成37801条unigenes,N50为913 bp。将unigenes与已知数据库比对进行基因功能注释,分别有36474,32470,27781,35079和5638条序列注释到nr,Swiss-Prot,GO,eggNOG和KEGG数据库。通过GO数据库注释,unigenes的功能可分为生物学过程、细胞组分和分子功能三大类,其中参与细胞、细胞部分及结合功能的unigenes比例较大。eggNOG数据库注释结果显示,37801条unigenes归到25个基因家族,注释到未知功能的最多。KEGG代谢通路富集分析显示,5638条unigenes注释到245个代谢通路,注释到核糖体的数目最多。此外,用MISA软件在转录组测序数据中的37801条unigenes中搜索到3124个SSR位点(占总unigenes的8.26%),发生频率为7.07%。通过PCR筛选出16个SSR位点。7个中华大仰蝽地理种群3个位点NcCF/NcCR,NcKF/NcKR和NcLF/NcLR的多态信息含量(PIC)分别为0.870,0.902和0.857,具高度多态性。【结论】本研究成功获得了中华大仰蝽转录组数据,为其基因功能分析提供了分子理论基础;SSR新标记的开发为中华大仰蝽遗传多样性分析、隐存种鉴定及基因图谱构建提供了更丰富的候选分子标记。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号