首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 343 毫秒
1.
Cassava (Manihot esculenta Crantz) is one of the most important food security crops in the tropics and increasingly being adopted for agro-industrial processing. Genetic improvement of cassava can be enhanced through marker-assisted breeding. For this, appropriate genomic tools are required to dissect the genetic architecture of economically important traits. Here, a genome-wide SNP-based genetic map of cassava anchored in SSRs is presented. An outbreeder full-sib (F1) family was genotyped on two independent SNP assay platforms: an array of 1,536 SNPs on Illumina's GoldenGate platform was used to genotype a first batch of 60 F1. Of the 1,358 successfully converted SNPs, 600 which were polymorphic in at least one of the parents and was subsequently converted to KBiosciences' KASPar assay platform for genotyping 70 additional F1. High-precision genotyping of 163 informative SSRs using capillary electrophoresis was also carried out. Linkage analysis resulted in a final linkage map of 1,837 centi-Morgans (cM) containing 568 markers (434 SNPs and 134 SSRs) distributed across 19 linkage groups. The average distance between adjacent markers was 3.4?cM. About 94.2% of the mapped SNPs and SSRs have also been localized on scaffolds of version 4.1 assembly of the cassava draft genome sequence. This more saturated genetic linkage map of cassava that combines SSR and SNP markers should find several applications in the improvement of cassava including aligning scaffolds of the cassava genome sequence, genetic analyses of important agro-morphological traits, studying the linkage disequilibrium landscape and comparative genomics.  相似文献   

2.
Genomic prediction utilizing causal variants could increase selection accuracy above that achieved with SNPs genotyped by currently available arrays used for genomic selection. A number of variants detected from sequencing influential sires are likely to be causal, but noticeable improvements in prediction accuracy using imputed sequence variant genotypes have not been reported. Improvement in accuracy of predicted breeding values may be limited by the accuracy of imputed sequence variants. Using genotypes of SNPs on a high‐density array and non‐synonymous SNPs detected in sequence from influential sires of a multibreed population, results of this examination suggest that linkage disequilibrium between non‐synonymous and array SNPs may be insufficient for accurate imputation from the array to sequence. In contrast to 75% of array SNPs being strongly correlated to another SNP on the array, less than 25% of the non‐synonymous SNPs were strongly correlated to an array SNP. When correlations between non‐synonymous and array SNPs were strong, distances between the SNPs were greater than separation that might be expected based on linkage disequilibrium decay. Consistently near‐perfect whole‐genome linkage disequilibrium between the full array and each non‐synonymous SNP within the sequenced bulls suggests that whole‐genome approaches to infer sequence variants might be more accurate than imputation based on local haplotypes. Opportunity for strong linkage disequilibrium between sequence and array SNPs may be limited by discrepancies in allele frequency distributions, so investigating alternate genotyping approaches and panels providing greater chances of frequency‐matched SNPs strongly correlated to sequence variants is also warranted. Genotypes used for this study are available from https://www.animalgenome.org/repository/pub/ ;USDA2017.0519/.  相似文献   

3.
High-throughput genome scans are important tools for genetic studies and breeding applications. Here, a 6K SNP array for use with the Illumina Infinium® system was developed for diploid sweet cherry (Prunus avium) and allotetraploid sour cherry (P. cerasus). This effort was led by RosBREED, a community initiative to enable marker-assisted breeding for rosaceous crops. Next-generation sequencing in diverse breeding germplasm provided 25 billion basepairs (Gb) of cherry DNA sequence from which were identified genome-wide SNPs for sweet cherry and for the two sour cherry subgenomes derived from sweet cherry (avium subgenome) and P. fruticosa (fruticosa subgenome). Anchoring to the peach genome sequence, recently released by the International Peach Genome Initiative, predicted relative physical locations of the 1.9 million putative SNPs detected, preliminarily filtered to 368,943 SNPs. Further filtering was guided by results of a 144-SNP subset examined with the Illumina GoldenGate® assay on 160 accessions. A 6K Infinium® II array was designed with SNPs evenly spaced genetically across the sweet and sour cherry genomes. SNPs were developed for each sour cherry subgenome by using minor allele frequency in the sour cherry detection panel to enrich for subgenome-specific SNPs followed by targeting to either subgenome according to alleles observed in sweet cherry. The array was evaluated using panels of sweet (n = 269) and sour (n = 330) cherry breeding germplasm. Approximately one third of array SNPs were informative for each crop. A total of 1825 polymorphic SNPs were verified in sweet cherry, 13% of these originally developed for sour cherry. Allele dosage was resolved for 2058 polymorphic SNPs in sour cherry, one third of these being originally developed for sweet cherry. This publicly available genomics resource represents a significant advance in cherry genome-scanning capability that will accelerate marker-locus-trait association discovery, genome structure investigation, and genetic diversity assessment in this diploid-tetraploid crop group.  相似文献   

4.

Background

Cassava, Manihot esculenta Crantz, is one of the most important crops world-wide representing the staple security for more than one billion of people. The development of dense genetic and physical maps, as the basis for implementing genetic and molecular approaches to accelerate the rate of genetic gains in breeding program represents a significant challenge. A reference genome sequence for cassava has been made recently available and community efforts are underway for improving its quality. Cassava is threatened by several pathogens, but the mechanisms of defense are far from being understood. Besides, there has been a lack of information about the number of genes related to immunity as well as their distribution and genomic organization in the cassava genome.

Results

A high dense genetic map of cassava containing 2,141 SNPs has been constructed. Eighteen linkage groups were resolved with an overall size of 2,571 cM and an average distance of 1.26 cM between markers. More than half of mapped SNPs (57.4%) are located in coding sequences. Physical mapping of scaffolds of cassava whole genome sequence draft using the mapped markers as anchors resulted in the orientation of 687 scaffolds covering 45.6% of the genome. One hundred eighty nine new scaffolds are anchored to the genetic cassava map leading to an extension of the present cassava physical map with 30.7 Mb. Comparative analysis using anchor markers showed strong co-linearity to previously reported cassava genetic and physical maps. In silico based searching for conserved domains allowed the annotation of a repertory of 1,061 cassava genes coding for immunity-related proteins (IRPs). Based on physical map of the corresponding sequencing scaffolds, unambiguous genetic localization was possible for 569 IRPs.

Conclusions

This is the first study reported so far of an integrated high density genetic map using SNPs with integrated genetic and physical localization of newly annotated immunity related genes in cassava. These data build a solid basis for future studies to map and associate markers with single loci or quantitative trait loci for agronomical important traits. The enrichment of the physical map with novel scaffolds is in line with the efforts of the cassava genome sequencing consortium.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1397-4) contains supplementary material, which is available to authorized users.  相似文献   

5.
SNP(single nucleotide polymorphism,单核苷酸多态)在猪基因组中的分布极其广泛,平均分布间隔为300~400 bp,相关数据库收录已达55万条。猪基因组测序已取得实质性进展,大规模搜索发现基因组及EST(expressed sequence tag)序列中的SNP已展开,应用于猪全基因组水平的SNP芯片已建立。在此基础上,基于猪SNP标记的遗传图谱绘制、QTL(quantitative trait loci)定位、遗传多样性检测及全基因组关联分析等也都相继出现。  相似文献   

6.
High-resolution genetic maps are essential for fine mapping of complex traits, genome assembly, and comparative genomic analysis. Single-nucleotide polymorphisms (SNPs) are the primary molecular markers used for genetic map construction. In this study, we identified 13,362 SNPs evenly distributed across the Japanese flounder (Paralichthys olivaceus) genome. Of these SNPs, 12,712 high-confidence SNPs were subjected to high-throughput genotyping and assigned to 24 consensus linkage groups (LGs). The total length of the genetic linkage map was 3,497.29 cM with an average distance of 0.47 cM between loci, thereby representing the densest genetic map currently reported for Japanese flounder. Nine positive quantitative trait loci (QTLs) forming two main clusters for Vibrio anguillarum disease resistance were detected. All QTLs could explain 5.1–8.38% of the total phenotypic variation. Synteny analysis of the QTL regions on the genome assembly revealed 12 immune-related genes, among them 4 genes strongly associated with V. anguillarum disease resistance. In addition, 246 genome assembly scaffolds with an average size of 21.79 Mb were anchored onto the LGs; these scaffolds, comprising 522.99 Mb, represented 95.78% of assembled genomic sequences. The mapped assembly scaffolds in Japanese flounder were used for genome synteny analyses against zebrafish (Danio rerio) and medaka (Oryzias latipes). Flounder and medaka were found to possess almost one-to-one synteny, whereas flounder and zebrafish exhibited a multi-syntenic correspondence. The newly developed high-resolution genetic map, which will facilitate QTL mapping, scaffold assembly, and genome synteny analysis of Japanese flounder, marks a milestone in the ongoing genome project for this species.  相似文献   

7.
The availability of genomic resources can facilitate progress in plant breeding through the application of advanced molecular technologies for crop improvement. This is particularly important in the case of less researched crops such as cassava, a staple and food security crop for more than 800 million people. Here, expressed sequence tags (ESTs) were generated from five drought stressed and well-watered cassava varieties. Two cDNA libraries were developed: one from root tissue (CASR), the other from leaf, stem and stem meristem tissue (CASL). Sequencing generated 706 contigs and 3,430 singletons. These sequences were combined with those from two other EST sequencing initiatives and filtered based on the sequence quality. Quality sequences were aligned using CAP3 and embedded in a Windows browser called HarvEST:Cassava which is made available. HarvEST:Cassava consists of a Unigene set of 22,903 quality sequences. A total of 2,954 putative SNPs were identified. Of these 1,536 SNPs from 1,170 contigs and 53 cassava genotypes were selected for SNP validation using Illumina’s GoldenGate assay. As a result 1,190 SNPs were validated technically and biologically. The location of validated SNPs on scaffolds of the cassava genome sequence (v.4.1) is provided. A diversity assessment of 53 cassava varieties reveals some sub-structure based on the geographical origin, greater diversity in the Americas as opposed to Africa, and similar levels of diversity in West Africa and southern, eastern and central Africa. The resources presented allow for improved genetic dissection of economically important traits and the application of modern genomics-based approaches to cassava breeding and conservation.  相似文献   

8.
SUMMARY: Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variations in closely related microbial species, strains or isolates. Some SNPs confer selective advantages for microbial pathogens during infection and many others are powerful genetic markers for distinguishing closely related strains or isolates that could not be distinguished otherwise. To facilitate SNP discovery in microbial genomes, we have developed a web-based application, SNPsFinder, for genome-wide identification of SNPs. SNPsFinder takes multiple genome sequences as input to identify SNPs within homologous regions. It can also take contig sequences and sequence quality scores from ongoing sequencing projects for SNP prediction. SNPsFinder will use genome sequence annotation if available and map the predicted SNP regions to known genes or regions to assist further evaluation of the predicted SNPs for their functional significance. SNPsFinder can generate PCR primers for all predicted SNP regions according to user's input parameters to facilitate experimental validation. The results from SNPsFinder analysis are accessible through the World Wide Web. AVAILABILITY: The SNPsFinder program is available at http://snpsfinder.lanl.gov/. SUPPLEMENTARY INFORMATION: The user's manual is available at http://snpsfinder.lanl.gov/UsersManual/  相似文献   

9.

Background

Plant resistance genes (R genes) exist in large families and usually contain both a nucleotide-binding site domain and a leucine-rich repeat domain, denoted NBS-LRR. The genome sequence of cassava (Manihot esculenta) is a valuable resource for analysing the genomic organization of resistance genes in this crop.

Results

With searches for Pfam domains and manual curation of the cassava gene annotations, we identified 228 NBS-LRR type genes and 99 partial NBS genes. These represent almost 1% of the total predicted genes and show high sequence similarity to proteins from other plant species. Furthermore, 34 contained an N-terminal toll/interleukin (TIR)-like domain, and 128 contained an N-terminal coiled-coil (CC) domain. 63% of the 327 R genes occurred in 39 clusters on the chromosomes. These clusters are mostly homogeneous, containing NBS-LRRs derived from a recent common ancestor.

Conclusions

This study provides insight into the evolution of NBS-LRR genes in the cassava genome; the phylogenetic and mapping information may aid efforts to further characterize the function of these predicted R genes.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1554-9) contains supplementary material, which is available to authorized users.  相似文献   

10.
Pest and disease problems are important constraints of cassava production and host plant resistance is the most efficient method of combating them. Breeding for host plant resistance is considerably slowed down by the crop’s biological constraints of a long growth cycle, high levels of heterozygosity and a large genetic load. More efficient methods such as gene cloning and transgenesis are required to deploy resistance genes. To facilitate the cloning of resistance genes, bacterial artificial chromosome (BAC) library resources have been developed for cassava. Two libraries were constructed from the cassava clones, TMS 30001, resistant to the cassava mosaic disease (CMD) and the cassava bacterial blight (CBB), and MECU72, resistant to cassava white fly. The TMS30001 library has 55 296 clones with an insert size range of 40–150 kb with an average of 80 kb, while the MECU72 library consists of 92 160 clones and an insert size range of 25–250 kb average of 93 kb. Based on a genome size of 772 Mb, the TMS30001 and MECU72 libraries have a 5 and 11.3 haploid genome equivalents and a 95 and 99 chance of finding any sequence, respectively. To demonstrate the potential of the libraries, the TMS30001 library was screened by southern hybridization using a cassava analog (CBB1) of the Xa21 gene from rice that maps to a region containing a QTL for resistance to CBB as probe. Five BAC clones that hybridized to CBB1 were isolated and a Hind III fingerprint revealed 2–3 copies of the gene in individual BAC clones. A larger scale analysis of resistance gene analogs (RGAs) in cassava has also been conducted in order to understand the number and organization of RGAs. To scan for gene and repeat DNA content in the libraries, end-sequencing was performed on 2301 clones from the MECU72 library. A total of 1705 unique sequences were obtained with an average size of 715 bp. Database homology searches using BLAST revealed that 458 sequences had significant homology with known proteins and 321 with transposable elements. The use of the library in positional cloning of pest and disease resistance genes is discussed.  相似文献   

11.
The development and application of genomic tools to loblolly pine (Pinus taeda L.) offer promising insights into the organization and structure of conifer genomes. The application of a high-throughput genotyping assay across diverse forest tree species, however, is currently limited taxonomically. This is despite the ongoing development of genome-scale projects aiming at the construction of expressed sequence tag (EST) libraries and the resequencing of EST-derived unigenes for a diverse array of forest tree species. In this paper, we report on the application of Illumina’s high-throughput GoldenGate™ SNP genotyping assay to a loblolly pine mapping population. Single nucleotide polymorphisms (SNPs) were identified through resequencing of previously identified wood quality, drought tolerance, and disease resistance candidate genes prior to genotyping. From that effort, a 384 multiplexed SNP assay was developed for high-throughput genotyping. Approximately 67% of the 384 SNPs queried converted into high-quality genotypes for the 48 progeny samples. Of those 257 successfully genotyped SNPs, 70 were segregating within the mapping population. A total of 27 candidate genes were subsequently mapped onto the existing loblolly pine consensus map, which consists of 12 linkage groups spanning a total map distance of 1,227.6 cM. The ability of SNPs to be mapped to the same position as fragment-based markers previously developed within the same candidate genes, as well as the pivotal role that SNPs currently play in the dissection of complex phenotypic traits, illustrate the usefulness of high-throughput SNP genotyping technologies to the continued development of pine genomics. Electronic supplementary material  The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

12.
Cassava brown streak disease (CBSD) and cassava mosaic disease (CMD) are currently two major viral diseases that severely reduce cassava production in large areas of Sub-Saharan Africa. Natural resistance has so far only been reported for CMD in cassava. CBSD is caused by two virus species, Cassava brown streak virus (CBSV) and Ugandan cassava brown streak virus (UCBSV). A sequence of the CBSV coat protein (CP) highly conserved between the two virus species was used to demonstrate that a CBSV-CP hairpin construct sufficed to generate immunity against both viral species in the cassava model cultivar (cv. 60444). Most of the transgenic lines showed high levels of resistance under increasing viral loads using a stringent top-grafting method of inoculation. No viral replication was observed in the resistant transgenic lines and they remained free of typical CBSD root symptoms 7 month post-infection. To generate transgenic cassava lines combining resistance to both CBSD and CMD the hairpin construct was transferred to a CMD-resistant farmer-preferred Nigerian landrace TME 7 (Oko-Iyawo). An adapted protocol allowed the efficient Agrobacterium-based transformation of TME 7 and the regeneration of transgenic lines with high levels of CBSV-CP hairpin-derived small RNAs. All transgenic TME 7 lines were immune to both CBSV and UCBSV infections. Further evaluation of the transgenic TME 7 lines revealed that CBSD resistance was maintained when plants were co-inoculated with East African cassava mosaic virus (EACMV), a geminivirus causing CMD. The innovative combination of natural and engineered virus resistance in farmer-preferred landraces will be particularly important to reducing the increasing impact of cassava viral diseases in Africa.  相似文献   

13.
Advancements in next-generation sequencing technology have enabled whole genome re-sequencing in many species providing unprecedented discovery and characterization of molecular polymorphisms. There are limitations, however, to next-generation sequencing approaches for species with large complex genomes such as barley and wheat. Genotyping-by-sequencing (GBS) has been developed as a tool for association studies and genomics-assisted breeding in a range of species including those with complex genomes. GBS uses restriction enzymes for targeted complexity reduction followed by multiplex sequencing to produce high-quality polymorphism data at a relatively low per sample cost. Here we present a GBS approach for species that currently lack a reference genome sequence. We developed a novel two-enzyme GBS protocol and genotyped bi-parental barley and wheat populations to develop a genetically anchored reference map of identified SNPs and tags. We were able to map over 34,000 SNPs and 240,000 tags onto the Oregon Wolfe Barley reference map, and 20,000 SNPs and 367,000 tags on the Synthetic W9784 × Opata85 (SynOpDH) wheat reference map. To further evaluate GBS in wheat, we also constructed a de novo genetic map using only SNP markers from the GBS data. The GBS approach presented here provides a powerful method of developing high-density markers in species without a sequenced genome while providing valuable tools for anchoring and ordering physical maps and whole-genome shotgun sequence. Development of the sequenced reference genome(s) will in turn increase the utility of GBS data enabling physical mapping of genes and haplotype imputation of missing data. Finally, as a result of low per-sample costs, GBS will have broad application in genomics-assisted plant breeding programs.  相似文献   

14.
Field pennycress (Thlaspi arvense L.) is being domesticated as a new winter cover crop and biofuel species for the Midwestern United States that can be double-cropped between corn and soybeans. A genome sequence will enable the use of new technologies to make improvements in pennycress. To generate a draft genome, a hybrid sequencing approach was used to generate 47 Gb of DNA sequencing reads from both the Illumina and PacBio platforms. These reads were used to assemble 6,768 genomic scaffolds. The draft genome was annotated using the MAKER pipeline, which identified 27,390 predicted protein-coding genes, with almost all of these predicted peptides having significant sequence similarity to Arabidopsis proteins. A comprehensive analysis of pennycress gene homologues involved in glucosinolate biosynthesis, metabolism, and transport pathways revealed high sequence conservation compared with other Brassicaceae species, and helps validate the assembly of the pennycress gene space in this draft genome. Additional comparative genomic analyses indicate that the knowledge gained from years of basic Brassicaceae research will serve as a powerful tool for identifying gene targets whose manipulation can be predicted to result in improvements for pennycress.  相似文献   

15.
Several begomovirus species and strains causing Cassava mosaic disease (CMD) have been reported from cassava in Africa. In Nigeria, African cassava mosaic virus (ACMV) was the predominant virus in this important crop, and East African cassava mosaic virus (EACMV), first reported from eastern Nigeria in 1999, was also found occasionally. A survey was conducted in 2002 to resolve the diversity of the virus types present in cassava in Nigeria and to further understand the increasing complexity of the viruses contributing to CMD. A total of 234 leaf samples from cassava with conspicuous CMD symptoms were collected in farmers’ fields across different agroecological zones of Nigeria and subjected to polymerase chain reaction (PCR) with type‐specific primers. In addition and, to provide a full characterization of the viruses present, DNA‐A genome components of several viruses and informative genome fragments were sequenced. In Nigeria, ACMV proved to be the dominant virus with 80% of all samples being positive for ACMV. The East African cassava mosaic Cameroon virus (EACMCV) prevalent in Cameroon and Ivory Coast was detected in single infections (2%) and in mixed infections (18%) with ACMV. There was no indication for other virus strains of EACMV present in the country. The EACMCV samples collected showed a high nucleotide sequence identity >98% and resembled the described sequence of a Cameroon isolate (EACMCV‐CM) more than an Ivory Coast isolate, EACMCV‐CM[CI]. Evidence is provided that the EACMCV has reached epidemiological significance in Nigeria.  相似文献   

16.
17.
Association mapping currently relies on the identification of genetic markers. Several technologies have been adopted for genetic marker analysis, with single nucleotide polymorphisms (SNPs) being the most popular where a reasonable quantity of genome sequence data are available. We describe several tools we have developed for the discovery, annotation, and visualization of molecular markers for association mapping. These include autoSNPdb for SNP discovery from assembled sequence data; TAGdb for the identification of gene specific paired read Illumina GAII data; CMap3D for the comparison of mapped genetic and physical markers; and BAC and Gene Annotator for the online annotation of genes and genomic sequences.  相似文献   

18.
Simple sequence repeat (SSR) markers provide a powerful tool for genetic linkage map construction that can be applied for identification of quantitative trait loci (QTL). In this study, a total of 640 new SSR markers were developed from an enriched genomic DNA library of the cassava variety 'Huay Bong 60' and 1,500 novel expressed sequence tag-simple sequence repeat (EST-SSR) loci were developed from the Genbank database. To construct a genetic linkage map of cassava, a 100 F(1) line mapping population was developed from the cross Huay Bong 60 by 'Hanatee'. Polymorphism screening between the parental lines revealed that 199 SSRs and 168 EST-SSRs were identified as novel polymorphic markers. Combining with previously developed SSRs, we report a linkage map consisted of 510 markers encompassing 1,420.3?cM, distributed on 23 linkage groups with a mean distance between markers of 4.54?cM. Comparison analysis of the SSR order on the cassava linkage map and the cassava genome sequences allowed us to locate 284 scaffolds on the genetic map. Although the number of linkage groups reported here revealed that this F(1) genetic linkage map is not yet a saturated map, it encompassed around 88% of the cassava genome indicating that the map was almost complete. Therefore, sufficient markers now exist to encompass most of the genomes and efficiently map traits in cassava.  相似文献   

19.
Single nucleotide polymorphisms (SNP) are the most abundant type of DNA polymorphism found in animal and plant genomes. They provide an important new source of molecular markers that are useful in genetic mapping, map-based positional cloning, quantitative trait locus mapping and the assessment of genetic distances between individuals. Very little is known on the frequency of SNPs in cassava. We have exploited the recently-developed collection of cassava expressed sequence tags (ESTs) to detect SNPs in the five cultivars of cassava used to generate the sequences. The frequency of intra-cultivar and inter-cultivar SNPs after analysis of 111 contigs was one polymorphism per 905 and one per 1,032 bp, respectively; totaling 1 each 509 bp. We have obtained further information on the frequency of SNPs in six cassava cultivars by analysis of 33 amplicons obtained from 3 EST and BAC end sequences. Overall, about 11 kb of DNA sequence was obtained for each cultivar. A total of 186 SNPs (136 and 50 from ESTs and BAC ends, respectively) were identified. Among these, 146 were intra-cultivar polymorphisms, while 80 were inter-cultivar polymorphisms. Thus the total frequency of SNPs was one per 62 bp. This information will help to develop new strategies for EST mapping as well as their association with phenotypic characteristics.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号