Similar articles
 20 similar articles found (search time: 15 ms)
1.
Opium poppy (Papaver somniferum L.) is an important pharmaceutical crop with very few genetic marker resources. To expand these resources, we sequenced genomic DNA using pyrosequencing technology and examined the DNA sequences for simple sequence repeats (SSRs). A total of 1,244,412 sequence reads were obtained covering 474 Mb. Approximately half of the reads (52 %) were assembled into 166,724 contigs representing 105 Mb of the opium poppy genome. A total of 23,283 non-redundant SSRs were identified in 18,944 contigs (11.3 % of total contigs). Trinucleotide and tetranucleotide repeats were the most abundant SSR repeats, accounting for 49.0 and 27.9 % of all SSRs, respectively. The AAG/TTC repeat was the most abundant trinucleotide repeat, representing 19.7 % of trinucleotide repeats. Other SSR repeat types were AT-rich. A total of 23,126 primer pairs (98.7 % of total SSRs) were designed to amplify SSRs. Fifty-three genomic SSR markers were tested in 37 opium poppy accessions and seven Papaver species to determine polymorphism and transferability. Intraspecific polymorphism information content (PIC) values of the genomic SSR markers were intermediate, with an average of 0.17, while the interspecific average PIC value was slightly higher at 0.19. All markers showed at least 88 % transferability among related species. This study increases sequence coverage of the opium poppy genome sevenfold and the number of opium poppy-specific SSR markers sixfold. This is the first report of the development of genomic SSR markers in opium poppy, and the genomic SSR markers developed in this study will be useful in diversity, identification, mapping and breeding studies in opium poppy.

2.
3.
Oil camellia trees are important woody plants for the production of high-quality cooking oil. Despite their economic importance, their genetic and genomic resources are very limited, which greatly hampers genetic studies on oil camellia trees. Microsatellites, or simple sequence repeats (SSRs), have great value in many aspects of genetic analysis due to their high polymorphism and codominant inheritance. In this study, we report the large-scale development and characterization of SSR markers derived from genomic sequences of Camellia chekiangoleosa by high-throughput pyrosequencing technology. A total of 1,091,393 genomic shotgun reads were generated using a Roche 454 FLX sequencer; the average read length was 319 bp, and the total sequence throughput was 347.9 Mb. These sequences were assembled into 35,315 contigs with a total length of 14.8 Mb and an N50 contig size of 770 bp. Using the MISA (MIcroSAtellite identification) tool, a total of 5,844 perfect microsatellites were detected in the assembled sequences. Among them, tetranucleotide repeats were the most frequent microsatellites in the genome of C. chekiangoleosa, and all the dominant repeat motifs for the different types of SSRs were rich in A/T. Experimental analysis with 900 SSR primer pairs revealed that 66 % of them succeeded in PCR amplification. Further investigation with 345 SSR primer pairs showed that a relatively high percentage of primers amplified polymorphic loci (31.9 %). Experimental data also revealed that, overall, long microsatellite repeats (>20 bp) were more variable than short ones (<20 bp) in the genome of the oil camellia tree.
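The abstract above refers to detecting "perfect" microsatellites, that is, uninterrupted tandem repeats of a short motif. The following is a minimal Python sketch of that idea using regular-expression backreferences. It is not the MISA tool itself; the function name `find_perfect_ssrs` and the minimum repeat counts in `MIN_REPEATS` are illustrative assumptions, not the settings used in the study.

```python
import re

# Minimum number of repeats per motif length (1 = mono, 2 = di, ...).
# These thresholds are illustrative assumptions, not the study's parameters.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def is_primitive(motif):
    """True if the motif is not itself a repeat of a shorter unit (e.g. 'ATAT')."""
    return motif not in (motif + motif)[1:-1]


def find_perfect_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return a sorted list of (start, motif, repeat_count) for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for motif_len, min_count in min_repeats.items():
        # A motif of motif_len bases followed by at least (min_count - 1) exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_count - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if not is_primitive(motif):
                continue  # already reported under the shorter motif length
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return sorted(hits)


if __name__ == "__main__":
    example = "GGC" + "AAG" * 6 + "CTGA" + "AT" * 7 + "CC"
    for start, motif, count in find_perfect_ssrs(example):
        print(start, motif, count)
```

Skipping non-primitive motifs keeps a run such as AAAAAAAAAAAA from being reported twice, once as a mononucleotide and once as a dinucleotide repeat.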

4.
Japanese chestnut (Castanea crenata Sieb. et Zucc.), unlike other Castanea species, is resistant to most diseases and wasps. However, genomic data for Japanese chestnut that could be used to determine its biotic stress resistance mechanisms have not been reported to date. In this study, we employed long-read sequencing and genetic mapping to generate genome sequences of Japanese chestnut at the chromosome level. Long reads (47.7 Gb; 71.6× genome coverage) were assembled into 781 contigs, with a total length of 721.2 Mb and a contig N50 length of 1.6 Mb. Genome sequences were anchored to the chestnut genetic map, comprising 14,973 single nucleotide polymorphisms (SNPs) and covering 1,807.8 cM map distance, to establish a chromosome-level genome assembly (683.8 Mb), with 69,980 potential protein-encoding genes and 425.5 Mb of repetitive sequences. Furthermore, comparative genome structure analysis revealed that Japanese chestnut shares conserved chromosomal segments with woody plants, but not with herbaceous plants, of the rosids. Overall, the genome sequence data of Japanese chestnut generated in this study are expected to enhance not only its genetics and genomics but also the evolutionary genomics of woody rosids.
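Several of the abstracts in this list report a contig N50. As a reminder of what that statistic measures, here is a minimal Python sketch using the common definition: the length of the contig at which the cumulative length, taken from longest to shortest, first reaches half of the total assembly length. The function name `n50` and the toy lengths are illustrative assumptions, not data from any of the studies.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are visited from longest to shortest; the N50 is the length of
    the contig at which the running total first reaches half of the sum.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("n50() requires at least one contig length")


if __name__ == "__main__":
    toy_contigs = [2, 3, 4, 5, 6, 7, 8, 9, 10]  # hypothetical lengths
    print(n50(toy_contigs))
```

Because N50 is weighted by length, a few long contigs can raise it considerably even when most contigs are short, which is why it is usually reported together with contig count and total length.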

5.
6.
7.
The completion of the human genome reference (HGR) marked the beginning of the genomics era, yet despite its utility, its universal application is limited by the small number of individuals used in its development. This is highlighted by high-quality sequence reads that fail to map to the HGR. Such reads generally represent 2–5 % of total reads and may harbor regions that would enhance our understanding of population variation, evolution, and disease. Alternatively, complete de novo assemblies can be created, but these effectively ignore the groundwork of the HGR. In an effort to find a middle ground, we developed a bioinformatic pipeline that maps paired-end reads to the HGR as separate single reads, exports unmappable reads, de novo assembles these reads per individual and then combines the assemblies into a secondary reference assembly used for comparative analysis. Using 45 diverse 1000 Genomes Project individuals, we identified 351,361 contigs covering 195.5 Mb of sequence not incorporated in GRCh38. Of these, 30,879 contigs are represented in multiple individuals, with ~40 % showing high sequence complexity. Genomic coordinates were generated for 99.9 %, with 52.5 % exhibiting high-quality mapping scores. Comparative genomic analyses with archaic humans and primates revealed significant sequence alignments, and comparisons with model organism RefSeq gene datasets identified novel human genes. If incorporated, these sequences will expand the HGR, but more importantly our data highlight that with this method low-coverage (~10–20×) next-generation sequencing can still be used to identify novel unmapped sequences to explore biological functions contributing to human phenotypic variation, disease and functionality for personal genomic medicine.

8.
9.
10.
11.
Faba bean (Vicia faba L.) is an important food legume crop with a huge genome. Development of genetic markers for faba bean is important for studying diversity and for molecular breeding. In this study, we used Next Generation Sequencing (NGS) technology for the development of genomic simple sequence repeat (SSR) markers. A total of 14,027,500 sequence reads were obtained comprising 4,208 Mb. From these reads, 56,063 contigs were assembled (16,367 Mb) and 2,138 SSRs were identified. Mono- and dinucleotide repeats were the most abundant, accounting for 57.5 % and 20.9 % of all SSR repeats, respectively. A total of 430 primer pairs were designed from contigs larger than 350 nucleotides and 50 primer pairs were tested for validation of SSR locus amplification. Nearly all (96 %) of the markers were found to produce clear amplicons and to be reproducible. Thirty-nine SSR markers were then applied to 46 faba bean accessions from worldwide origins, resulting in 161 alleles with 87.5 % polymorphism, and an average of 4.1 alleles per marker. Gene diversity (GD) of the markers ranged from 0 to 0.48 with an average of 0.27. Testing of the markers showed that they were useful in determining genetic relationships and population structure in faba bean accessions.

12.
13.
Single nucleotide polymorphism (SNP) markers were identified and validated for two stingray species, Potamotrygon motoro and Potamotrygon falkneri, from double digest restriction‐site associated DNA (ddRAD) reads generated with 454‐Roche technology. A total of 226 774 reads (65.5 Mb) were obtained (mean read length 289 ± 183 bp), yielding a total of 5399 contigs (mean contig length: 396 ± 91 bp). Mining this data set, a panel of 143 in silico SNPs was selected. Eighty‐two of these SNPs were successfully validated and 61 were polymorphic: 14 in P. falkneri, 21 in P. motoro, 3 in both species and 26 fixed for alternative variants in both species, thus being useful for population analyses and hybrid detection.

14.
Rosa roxburghii Tratt is an important commercial horticultural crop in China that is recognized for its nutritional and medicinal value. In spite of its economic significance, genomic information on this rose species is currently unavailable. In the present research, a genome survey of R. roxburghii was carried out using next-generation sequencing (NGS) technologies. A total of 30.29 Gb of sequence data was obtained by HiSeq 2500 sequencing; the genome size of R. roxburghii was estimated at 480.97 Mb, with a guanine plus cytosine (GC) content of 38.63%. The reads were assembled into a total of 627,554 contigs with an N50 length of 1.484 kb and, further, into 335,902 scaffolds with a total length of 409.36 Mb. Transposable element (TE) sequences totaling 90.84 Mb, which comprised 29.20% of the genome, and 167,859 simple sequence repeats (SSRs) were identified from the scaffolds. Among these, the mono- (66.30%), di- (25.67%), and tri- (6.64%) nucleotide repeats accounted for nearly 99% of the SSRs, and the sequence motifs AG/CT (28.81%) and GAA/TTC (14.76%) were the most abundant among the dinucleotide and trinucleotide repeat motifs, respectively. Genome analysis predicted a total of 22,721 genes with an average length of 2311.52 bp, an average exon length of 228.15 bp, and an average intron length of 401.18 bp. Eleven genes putatively involved in ascorbate metabolism were identified and their expression in R. roxburghii leaves was validated by quantitative real-time PCR (qRT-PCR). This is the first report of genome-wide characterization of this rose species.

15.
16.
Adzuki bean, also known as red bean (Vigna angularis), with 2n = 22 chromosomes, is an important legume crop in East Asian countries, including China, Japan, and Korea. For single nucleotide polymorphism (SNP) discovery, we used two Vigna accessions, V. angularis IT213134 and its wild relative V. nakashimae IT178530, because of the lack of DNA sequence polymorphism in the cultivated species. Short read sequences of IT213134 and IT178530 of approximately 37 billion and 35 billion bp were produced using the Illumina HiSeq 2000 system to a sequencing depth of 61.5× and 57.7×, respectively. After de novo assembly was carried out with trimmed HiSeq reads from IT213134, 98,441 contigs of various sizes were produced with an N50 of 13,755 bp. Using Burrows–Wheeler Aligner software, trimmed short reads of V. nakashimae IT178530 were successfully mapped to IT213134 contigs. All sequence variations at the whole-genome level were examined between the two Vigna species. Of the 1,565,699 SNPs, 59.4 % were transitions and 40.6 % were transversions. A total of 213,758 SNPs, consisting of 122,327 non-synonymous and 91,431 synonymous SNPs, were identified in coding sequences. For SNP validation, 96 SNPs in the genic region were chosen from among IT213134 contigs longer than 10 kb. Of these 96 SNPs, 88 were confirmed by Sanger sequencing of 10 adzuki bean genotypes from various geographic origins as well as IT213134 and its wild relative IT178530. These genome-wide SNP markers will enrich the existing Vigna resources and, specifically, could be of value for constructing a genetic map and evaluating the genetic diversity of adzuki bean.
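The abstract above splits SNPs into transitions (59.4 %) and transversions (40.6 %), which corresponds to a transition/transversion ratio of about 1.46. The following is a minimal Python sketch of how a single-base substitution is assigned to one class or the other. The function names `classify_snp` and `ts_tv_ratio` and the example allele pairs are illustrative assumptions, not the authors' pipeline.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_snp(ref, alt):
    """Classify a single-base substitution as 'transition' or 'transversion'.

    A transition stays within one chemical class (purine <-> purine or
    pyrimidine <-> pyrimidine); a transversion crosses between the classes.
    """
    ref, alt = ref.upper(), alt.upper()
    if ref == alt or {ref, alt} - (PURINES | PYRIMIDINES):
        raise ValueError("expected two different bases from A, C, G, T")
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


def ts_tv_ratio(snps):
    """Return transitions divided by transversions for (ref, alt) pairs."""
    kinds = [classify_snp(ref, alt) for ref, alt in snps]
    return kinds.count("transition") / kinds.count("transversion")


if __name__ == "__main__":
    toy_snps = [("A", "G"), ("C", "T"), ("A", "T")]  # hypothetical allele pairs
    print(ts_tv_ratio(toy_snps))
```

Note that `ts_tv_ratio` raises `ZeroDivisionError` when the input contains no transversions; a production version would need to decide how to report that case.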

17.
A physical map of chickpea was developed for the reference chickpea genotype (ICC 4958) using bacterial artificial chromosome (BAC) libraries targeting 71,094 clones (~12× coverage). High information content fingerprinting (HICF) of these clones gave high-quality fingerprinting data for 67,483 clones, from which 1,174 contigs comprising 46,112 clones, and 3,256 singletons, were defined. In brief, 574 Mb of the genome was assembled into 1,174 contigs with an average of 0.49 Mb per contig, and the 3,256 singletons represent 407 Mb of the genome. The physical map was linked with two genetic maps with the help of 245 BAC-end sequence (BES)-derived simple sequence repeat (SSR) markers. This allowed some of the BACs to be located in the vicinity of important quantitative trait loci (QTLs) for drought tolerance and resistance to Fusarium wilt and Ascochyta blight. In addition, the fingerprinted contig (FPC) assembly was integrated with the draft genome sequence of chickpea. As a result, ~965 BACs, including 163 minimum tiling path (MTP) clones, could be mapped onto eight pseudo-molecules of chickpea, forming 491 hypothetical contigs representing 54,013,992 bp (~54 Mb) of the draft genome. Comprehensive analysis of markers in abiotic and biotic stress tolerance QTL regions led to the identification of 654, 306 and 23 genes in the drought tolerance “QTL-hotspot” region, the Ascochyta blight resistance QTL region and the Fusarium wilt resistance QTL region, respectively. The integrated physical, genetic and genome map should provide a foundation for cloning and isolation of QTLs/genes for molecular dissection of traits as well as markers for molecular breeding for chickpea improvement.

18.
19.
20.