Similar literature
 20 similar articles found; search took 15 ms
1.
2.
With its small, diploid and completely sequenced genome, sorghum (Sorghum bicolor L. Moench) is highly amenable to genomics‐based breeding approaches. Here, we describe the development and testing of a robust single‐nucleotide polymorphism (SNP) array platform that enables polymorphism screening for genome‐wide and trait‐linked polymorphisms in genetically diverse S. bicolor populations. Whole‐genome sequences with 6× to 12× coverage from five genetically diverse S. bicolor genotypes, including three sweet sorghums and two grain sorghums, were aligned to the sorghum reference genome. From over 1 million high‐quality SNPs, we selected 2124 Infinium Type II SNPs that were informative in all six source genomes, gave an optimal Assay Design Tool (ADT) score, had allele frequencies of 50% in the six genotypes and were evenly spaced throughout the S. bicolor genome. Furthermore, by phenotype‐based pool sequencing, we selected an additional 876 SNPs with a phenotypic association to early‐stage chilling tolerance, a key trait for European sorghum breeding. The 3000 attempted bead types were used to populate half of a dual‐species Illumina iSelect SNP array. The array was tested using 564 Sorghum spp. genotypes, including offspring from four unrelated recombinant inbred line (RIL) and F2 populations and a genetic diversity collection. A high call rate of over 80% enabled validation of 2620 robust and polymorphic sorghum SNPs, underlining the efficiency of the array development scheme for whole‐genome SNP selection and screening, with diverse applications including genetic mapping, genome‐wide association studies and genomic selection.

3.
4.
5.
6.
7.
Heterodera glycines, the soybean cyst nematode (SCN), is a damaging agricultural pest that could be effectively managed if critical phenotypes, such as virulence and host range, could be understood. While SCN is amenable to genetic analysis, lack of DNA sequence data prevents the use of such methods to study this pathogen. Fortunately, new methods of DNA sequencing that produce large amounts of data and permit whole genome comparative analyses have become available. In this study, 400 million bases of genomic DNA sequence were collected from two inbred biotypes of SCN using 454 micro-bead DNA sequencing. Comparisons to a BAC, sequenced by Sanger sequencing, showed that the micro-bead sequences could identify low and high copy number regions within the BAC. Potential single nucleotide polymorphisms (SNPs) between the two SCN biotypes were identified by comparing the two sets of sequences. Selected resequencing revealed that up to 84% of the SNPs were correct. We conclude that the quality of the micro-bead sequence data was sufficient for de novo SNP identification and should be applicable to organisms with similar genome sizes and complexities. The SNPs identified will be an important starting point in associating phenotypes with specific regions of the SCN genome.

8.
Common bean (Phaseolus vulgaris L.) is a legume that is an important source of dietary protein in developing countries throughout the world. Utilizing the G19833 BAC library for P. vulgaris from Clemson University, 89,017 BAC-end sequences were generated, giving 62,588,675 base pairs of genomic sequence covering approximately 9.54% of the genome. Analysis of these sequences in combination with 1,404 shotgun sequences from the cultivar Bat7 revealed that approximately 49.2% of the genome contains repetitive sequence and 29.3% is genic. Compared to other legume BAC-end sequencing projects, it appears that P. vulgaris has higher predicted levels of repetitive sequence, but this may be due to a more intense identification strategy combining similarity-based matches with de novo identification of repeats. In addition, fingerprints for 41,717 BACs were obtained and assembled into a draft physical map consisting of 1,183 clone contigs and 6,385 singletons with ~9x coverage of the genome.
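
As a rough check on the figures quoted in the abstract above, the short sketch below derives the mean BAC-end sequence length and the genome size implied by the stated 9.54% coverage. The variable names are illustrative, and the derived values are back-of-the-envelope estimates, not numbers reported by the authors.

```python
# Figures taken from the abstract above.
bes_count = 89_017          # number of BAC-end sequences
bes_total_bp = 62_588_675   # total base pairs of BAC-end sequence
genome_fraction = 0.0954    # stated share of the genome covered (9.54%)

# Mean read length: total sequence divided by number of reads.
mean_bes_length_bp = bes_total_bp / bes_count

# Genome size implied by "X bp covers Y% of the genome" (an inference,
# assuming the coverage figure refers to the haploid genome).
implied_genome_mbp = bes_total_bp / genome_fraction / 1e6

print(f"Mean BES length: {mean_bes_length_bp:.0f} bp")
print(f"Implied genome size: {implied_genome_mbp:.0f} Mbp")
```

Under these assumptions the reads average roughly 700 bp and the implied genome size is on the order of 650 Mbp.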

9.
An exhaustive screening of the Pleurotus ostreatus genome was performed to search for nucleotide sequences of heme peroxidases in this white-rot fungus, which could be useful for different biotechnological applications. After sequence identification and manual curation of the corresponding genes and cDNAs, the deduced amino acid sequences were converted into structural homology models. A comparative study of these sequences and their structural models with those of known fungal peroxidases revealed the complete inventory of heme peroxidases of this fungus. This consists of cytochrome c peroxidase and ligninolytic peroxidases, including manganese peroxidase and versatile peroxidase but not lignin peroxidase, as representative of the "classical" superfamily of plant, fungal, and bacterial peroxidases; and members of two relatively "new" peroxidase superfamilies, namely heme-thiolate peroxidases, here described for the first time in a fungus from the genus Pleurotus, and dye-decolorizing peroxidases, already known in P. ostreatus but still to be thoroughly explored and characterized.

10.
Switchgrass (Panicum virgatum) has been evaluated as one potential source for cellulosic biofuel feedstocks. Planting switchgrass in marginal croplands and waterway buffers can reduce soil erosion, improve water quality, and improve regional ecosystem services (i.e. it serves as a potential carbon sink). In previous studies, we mapped high risk marginal croplands and highly erodible cropland buffers that are potentially suitable for switchgrass development, which would improve ecosystem services and minimally impact food production. In this study, we advance our previous study results and integrate future crop expansion information to develop a switchgrass biofuel potential ensemble map for current and future croplands in eastern Nebraska. The switchgrass biomass productivity and carbon benefits (i.e. NEP: net ecosystem production) for the identified biofuel potential ensemble areas were quantified. The future scenario‐based (‘A1B’) land use and land cover map for 2050, the US Geological Survey crop type and Compound Topographic Index (CTI) maps, and long‐term (1981–2010) averaged annual precipitation data were used to identify future crop expansion regions that are suitable for switchgrass development. Results show that 2528 km² of future crop expansion regions (~3.6% of the study area) are potentially suitable for switchgrass development. The total estimated biofuel potential ensemble area (including cropland buffers, marginal croplands, and future crop expansion regions) is 4232 km² (~6% of the study area), potentially producing 3.52 million metric tons of switchgrass biomass per year. Converting biofuel ensemble regions to switchgrass leads to potential carbon sinks (the total NEP for biofuel potential areas is 0.45 million metric tons C) and is environmentally sustainable. Results from this study improve our understanding of environmental conditions and ecosystem services of current and future cropland systems in eastern Nebraska and provide useful information to land managers to make land use decisions regarding switchgrass development.

11.
The perennial grass switchgrass (Panicum virgatum L.) is a promising bioenergy crop and the target of whole genome sequencing. We constructed two bacterial artificial chromosome (BAC) libraries from the AP13 clone of switchgrass to gain insight into the genome structure and organization, initiate functional and comparative genomic studies, and assist with genome assembly. Together representing 16 haploid genome equivalents of switchgrass, each library comprises 101,376 clones with average insert sizes of 144 kb (HindIII-generated) and 110 kb (BstYI-generated). A total of 330,297 high quality BAC-end sequences (BES) were generated, accounting for 263.2 Mbp (16.4%) of the switchgrass genome. Analysis of the BES identified 279,099 known repetitive elements, >50,000 SSRs, and 2,528 novel repeat elements, named switchgrass repetitive elements (SREs). Comparative mapping of 47 full-length BAC sequences and 330K BES revealed high levels of synteny with the grass genomes of sorghum, rice, maize, and Brachypodium. Our data indicate that the sorghum genome has retained larger microsyntenous regions with switchgrass besides high gene order conservation with rice. The resources generated in this effort will be useful for a broad range of applications.
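
The library statistics in the abstract above can be cross-checked with a little arithmetic. The sketch below, using only the quoted figures, estimates the haploid genome size two ways: from total cloned DNA divided by the stated 16 genome equivalents, and from the BES total divided by its stated 16.4% share. Function and variable names are illustrative, and the estimates are a consistency check, not values reported by the authors.

```python
# Figures taken from the abstract above.
clones_per_library = 101_376
avg_insert_kb = {"HindIII": 144, "BstYI": 110}  # average insert size per library
genome_equivalents = 16                         # combined coverage of both libraries
bes_total_mbp = 263.2                           # total BAC-end sequence
bes_genome_fraction = 0.164                     # stated share of the genome (16.4%)


def cloned_dna_mbp(clones: int, insert_kb: float) -> float:
    """Total cloned DNA in Mbp for one library (clones x average insert)."""
    return clones * insert_kb / 1000


# Estimate 1: total cloned DNA across both libraries / genome equivalents.
total_cloned_mbp = sum(
    cloned_dna_mbp(clones_per_library, kb) for kb in avg_insert_kb.values()
)
genome_from_libraries_mbp = total_cloned_mbp / genome_equivalents

# Estimate 2: BES total / fraction of the genome it is said to represent.
genome_from_bes_mbp = bes_total_mbp / bes_genome_fraction

print(f"From library coverage: {genome_from_libraries_mbp:.0f} Mbp")
print(f"From BES fraction:     {genome_from_bes_mbp:.0f} Mbp")
```

Both routes give a haploid genome size of roughly 1.6 Gbp, so the coverage and BES figures in the abstract agree with each other.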

12.
13.
Separation of proteins by two-dimensional gel electrophoresis (2-DE) coupled with identification of proteins through peptide mass fingerprinting (PMF) by matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) is a widely used technique for proteomic analysis. This approach relies, however, on the presence of the proteins studied in publicly accessible protein databases or the availability of annotated genome sequences of an organism. In this work, we investigated the reliability of using raw genome sequences for identifying proteins by PMF without the need for additional information such as amino acid sequences. The method is demonstrated for proteomic analysis of Klebsiella pneumoniae grown anaerobically on glycerol. Of 197 spots excised from 2-DE gels and submitted for mass spectrometric analysis, 164 were clearly identified, representing 122 individual proteins. Of these 164 spots, 95% could be identified using only peptide mass fingerprints and a strain-specific protein database (ProtKpn) constructed from the raw genome sequences of K. pneumoniae. Cross-species searching of the public databases identified only 57% of the 66 highly expressed protein spots, compared with 97% using the ProtKpn database. Ten dha regulon-related proteins that are essential for the initial enzymatic steps of anaerobic glycerol metabolism were successfully identified using the ProtKpn database, whereas none of them could be identified by cross-species searching. In conclusion, a strain-specific protein database constructed from raw genome sequences makes it possible to reliably identify most of the proteins from 2-DE analysis through peptide mass fingerprinting alone.

14.
15.
16.
17.
Brachypodium distachyon (Brachypodium) has been recently recognized as an emerging model system for both comparative and functional genomics in grass species. In this study, 55,221 repeat masked Brachypodium BAC end sequences (BES) were used for comparative analysis against the 12 rice pseudomolecules. The analysis revealed that ~26.4% of BES have significant matches with the rice genome and 82.4% of the matches were homologous to known genes. Further analysis of paired-end BES and ~1.0 Mb sequences from nine selected BACs proved to be useful in revealing conserved regions and regions that have undergone considerable genomic changes. Differential gene amplification, insertions/deletions and inversions appeared to be the common evolutionary events that caused variations of microcolinearity at different orthologous genomic regions. It was found that ~17% of genes in the two genomes are not colinear in the orthologous regions. Analysis of BAC sequences also revealed higher gene density (~9 kb/gene) and lower repeat DNA content (~13.1%) in Brachypodium when compared to the orthologous rice regions, consistent with the smaller size of the Brachypodium genome. The 119 annotated Brachypodium genes were BLASTN compared against the wheat EST database and deletion bin mapped wheat ESTs. About 77% of the genes retrieved significant matches in the EST database, while 9.2% matched to the bin mapped ESTs. In some cases, genes in single Brachypodium BACs matched to multiple ESTs that were mapped to the same deletion bins, suggesting that the Brachypodium genome will be useful for ordering wheat ESTs within the deletion bins and developing specific markers at targeted regions in the wheat genome.

18.
Prediction of Saccharomyces cerevisiae replication origins

Background  

Autonomously replicating sequences (ARSs) function as replication origins in Saccharomyces cerevisiae. ARSs contain the 17 bp ARS consensus sequence (ACS), which binds the origin recognition complex. The yeast genome contains more than 10,000 ACS matches, but there are only a few hundred origins, and little flanking sequence similarity has been found. Thus, identification of origins by sequence alone has not been possible.

19.
Leishmaniasis is a neglected disease with an estimated 12 million infected people. The recent completion of the sequencing of the Leishmania major genome has opened opportunities for the identification of targets for vaccine development. We present here the first attempt at identifying novel vaccine candidates by whole genome analysis. We predicted CD8+ T cell epitopes from the L. major proteome and validated the immunogenicity of some of the best predicted epitopes in vivo in mice. Consensus epitope predictions from 8272 annotated protein sequences with 5–8 different algorithms allowed the identification of 78 class I CD8+ epitopes. BALB/c mice were immunized with 26 synthetic peptides corresponding to the most likely epitopes. Fourteen (54%) proved immunogenic, with eight being strong inducers of T cell IFNγ production. None of the proteins from which the epitopes are derived is differentially expressed, only two may be surface proteins, and eight have putative enzymatic and metabolic activities. These epitopes and proteins represent new antigen candidates for further studies. While pathogen genomes have not yet delivered their full promise in terms of human health applications, our study opens the way for extensive genome mining for antigen identification and vaccine development against Leishmania and other pathogens.

20.
Amplification and dispersion of repeated DNA sequences in the Triticeae
Four representatives of a family of dispersed repetitive sequences which were prominent and dispersed in the E genome of Thinopyrum elongatum but poorly represented in wheat, were studied in detail. The 1.4 kb sequences were present both as part of tandem and more complex arrays and appeared to have resulted from repeated amplification of the sequence and their dispersion throughout the genome. Subcloning of sections of the 1.4 kb sequences resulted in probes which improved the resolution of the E genome from the genomes in wheat and enabled identification of single E genome chromosomes introduced into wheat. The generality of these types of sequences in the tribe Triticeae was confirmed by isolating analogous sequences from the R (rye, Secale cereale), V (Dasypyrum villosum), and N (Psathyrostachys juncea) genomes. The cloned repetitive sequences from the R, V, and N genomes each showed characteristic fluctuations in amount within the grasses examined in addition to being virtually absent from wheat. It is thus possible that these sequences may provide useful taxonomic indicators for establishing relationships within the Triticeae, as well as valuable probes for tracing alien chromatin introduced into wheat.
