Similar literature
20 similar articles found (search time: 31 ms)
1.
A new approach to sequencing and assembling a highly heterozygous genome, that of the grape Vitis vinifera cv. Pinot Noir, is described. Combining genome shotgun sequencing of paired reads produced by Sanger sequencing with sequencing by synthesis of unpaired reads was shown to be an efficient procedure for decoding a complex genome. About 2 million SNPs and more than a million heterozygous gaps have been identified in the 500 Mb grape genome. More than 91% of the sequence assembled into 58,611 contigs is now anchored to the 19 linkage groups of V. vinifera.

2.

Background

Until recently, only a small number of low- and mid-throughput methods have been used for single nucleotide polymorphism (SNP) discovery and genotyping in grapevine (Vitis vinifera L.). However, following completion of the sequence of the highly heterozygous genome of Pinot Noir, it has been possible to identify millions of electronic SNPs (eSNPs) thus providing a valuable source for high-throughput genotyping methods.

Results

Herein we report the first application of the SNPlex™ genotyping system in grapevine, aimed at anchoring a eukaryotic genome. This approach combines robust SNP detection with automated assay readout and data analysis. 813 candidate eSNPs were developed from non-repetitive contigs of the assembled genome of Pinot Noir and tested in 90 progeny of a Syrah × Pinot Noir cross. 563 new SNP-based markers were obtained and mapped. The efficiency rate of 69% was enhanced to 80% when multiple displacement amplification (MDA) methods were used for preparation of genomic DNA for the SNPlex assay.

Conclusion

Unlike other SNP genotyping methods used to investigate thousands of SNPs in a few genotypes, or a few SNPs in around a thousand genotypes, the SNPlex genotyping system represents a good compromise for investigating several hundred SNPs in a hundred or more samples simultaneously. Therefore, the use of the SNPlex assay, coupled with whole genome amplification (WGA), is a good solution for future applications in well-equipped laboratories.

3.

Background

Vitis vinifera (grape) is one of the most economically significant fruit crops in the world. The availability of the recently released grape genome sequence offers an opportunity to identify and analyze some important gene families in this species. Subtilases are a group of subtilisin-like serine proteases that are involved in many biological processes in plants. However, no comprehensive study incorporating phylogeny, chromosomal location and gene duplication, gene organization, functional divergence, selective pressure and expression profiling has been reported so far for the grape.

Results

In the present study, a comprehensive analysis of the subtilase gene family in V. vinifera was performed. Eighty subtilase genes were identified. Phylogenetic analyses indicated that these subtilase genes comprised eight groups. The gene organization is considerably conserved among the groups. Distribution of the subtilase genes is non-random across the chromosomes. A high proportion of these genes are preferentially clustered, indicating that tandem duplications may have contributed significantly to the expansion of the subtilase gene family. Analyses of divergence and adaptive evolution show that while purifying selection may have been the main force driving the evolution of grape subtilases, some of the critical sites responsible for the divergence may have been under positive selection. Further analyses of real-time PCR data suggested that many subtilase genes might be important in the stress response and functional development of plants.

Conclusions

Tandem duplications as well as purifying and positive selections have contributed to the functional divergence of subtilase genes in V. vinifera. The data may contribute to a better understanding of the grape subtilase gene family.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1116) contains supplementary material, which is available to authorized users.

4.
5.
6.

Background  

Efforts to sequence the genomes of different organisms continue to increase. The DNA sequence is usually decoded for one individual and then applied to the whole species. The recent sequencing of the highly heterozygous Vitis vinifera L. cultivar Pinot Noir (clone ENTAV 115) genome gave rise to several thousand polymorphisms and offers a good model to study the transferability of its degree of polymorphism to other individuals of the same species and within the genus.

7.

Background

Silene latifolia is a dioecious plant with well-distinguished X and Y chromosomes that is used as a model to study sex determination and sex chromosome evolution in plants. However, efficient utilization of this species has been hampered by the lack of large-scale sequencing resources and detailed analysis of its genome composition, especially with respect to repetitive DNA, which makes up the majority of the genome.

Methodology/Principal Findings

We performed low-pass 454 sequencing followed by similarity-based clustering of 454 reads in order to identify and characterize sequences of all major groups of S. latifolia repeats. Illumina sequencing data from male and female genomes were also generated and employed to quantify the genomic proportions of individual repeat families. The majority of identified repeats belonged to LTR-retrotransposons, constituting about 50% of genomic DNA, with Ty3/gypsy elements being more frequent than Ty1/copia. While there were differences between the male and female genome in the abundance of several repeat families, their overall repeat composition was highly similar. Specific localization patterns on sex chromosomes were found for several satellite repeats using in situ hybridization with probes based on k-mer frequency analysis of Illumina sequencing data.
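As a rough illustration of the k-mer frequency analysis mentioned above, the following minimal sketch counts overlapping k-mers across a set of reads. It is not the authors' pipeline: the function name `count_kmers`, the choice of k, and the toy reads are all assumptions made for illustration.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every overlapping k-mer across a collection of reads."""
    counts = Counter()
    for read in reads:
        seq = read.upper()
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            if "N" not in kmer:  # skip windows containing ambiguous bases
                counts[kmer] += 1
    return counts


# Toy reads, invented for illustration; real input would be sequencing reads.
reads = ["ACGTACGTACGT", "CGTACGTACG", "TTGACCA"]
for kmer, n in count_kmers(reads, k=4).most_common(3):
    print(kmer, n)
```

K-mers that occur far more often than the bulk of the distribution are candidates for highly repeated sequences such as satellites, which is why counts of this kind can guide probe design.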

Conclusions/Significance

This study provides comprehensive information about the sequence composition and abundance of repeats representing over 60% of the S. latifolia genome. The results revealed generally low divergence in repeat composition between the sex chromosomes, which is consistent with their relatively recent origin. In addition, the study generated various data resources that are available for future exploration of the S. latifolia genome.

8.
Peng J, Yang J, Jin Q. PLoS One 2011, 6(4): e18509

Background

The completion of numerous genome sequences introduced an era of whole-genome study. However, many genes are missed during genome annotation, including small RNAs (sRNAs) and small open reading frames (sORFs). In order to improve genome annotation, we aimed to identify novel sRNAs and sORFs in Shigella, the principal etiologic agents of bacillary dysentery.

Methodology/Principal Findings

We identified 64 sRNAs in Shigella, which were experimentally validated in other bacteria based on sequence conservation. We employed computer-based and tiling array-based methods to search for sRNAs, followed by RT-PCR and northern blots, to identify nine sRNAs in Shigella flexneri strain 301 (Sf301) and 256 regions containing possible sRNA genes. We found 29 candidate sORFs using bioinformatic prediction, array hybridization and RT-PCR verification. We experimentally validated 557 (57.9%) DOOR operon predictions in the chromosome of Sf301 and 46 (76.7%) in the virulence plasmid. We found 40 additional co-expressed gene pairs that were not predicted by DOOR.

Conclusions/Significance

We provide an updated and comprehensive annotation of the Shigella genome. Our study increased the expected numbers of sORFs and sRNAs, which will affect future functional genomics and proteomics studies. Our method can be used for large-scale reannotation of sRNAs and sORFs in any microbe with a known genome sequence.

9.
Li L, Xiao X, Li S, Jia X, Wang P, Guo X, Jiao X, Zhang Q, Hejtmancik JF. PLoS One 2011, 6(5): e19458

Background

Leber congenital amaurosis (LCA) is the earliest-onset and most severe form of hereditary retinal dystrophy. So far, the full spectrum of variations in the 15 genes known to cause LCA has not been systematically evaluated in East Asians. Therefore, we performed comprehensive detection of variants in these 15 genes in 87 unrelated Han Chinese patients with LCA.

Methodology/Principal Findings

The 51 most frequently mutated exons and introns in the 15 genes were selected for an initial scan using cycle sequencing. All the remaining exons in 11 of the 15 genes were subsequently sequenced. Fifty-three different variants were identified in 44 of the 87 patients (50.6%), involving 78 of the 88 alleles (11 homozygous and 56 heterozygous variants). Of the 53 variants, 35 (66%) were novel pathogenic mutations. In these Chinese patients, variants in GUCY2D are the most common cause of LCA (16.1% cases), followed by CRB1 (11.5%), RPGRIP1 (8%), RPE65 (5.7%), SPATA7 (4.6%), CEP290 (4.6%), CRX (3.4%), LCA5 (2.3%), MERTK (2.3%), AIPL1 (1.1%), and RDH12 (1.1%). This differs from the variation spectrum described in other populations. An initial scan of 55 of 215 PCR amplicons, including 214 exons and 1 intron, detected 83.3% (65/78) of the mutant alleles ultimately found in these 87 patients. In addition, sequencing only 9 exons would detect over 50% of the identified variants and require less than 5% of the labor and cost of comprehensive sequencing for all exons.

Conclusions/Significance

Our results suggest that specific differences in the variation spectrum found in LCA patients from the Han Chinese and other populations are related to ethnicity. Sequencing exons in order of decreasing risk is a cost-effective way to identify causative mutations responsible for LCA, especially in the context of genetic counseling for individual patients in a clinical setting.

10.

Background

The mosquito Aedes aegypti is the primary global vector for dengue and yellow fever viruses. Sequencing of the Ae. aegypti genome has stimulated research in vector biology and insect genomics. However, the current genome assembly is highly fragmented with only ∼31% of the genome being assigned to chromosomes. A lack of a reliable source of chromosomes for physical mapping has been a major impediment to improving the genome assembly of Ae. aegypti.

Methodology/Principal Findings

In this study we demonstrate the utility of mitotic chromosomes from imaginal discs of 4th instar larvae for cytogenetic studies of Ae. aegypti. High numbers of mitotic divisions on each slide preparation, large sizes, and reproducible banding patterns of the individual chromosomes simplify cytogenetic procedures. Based on the banding structure of the chromosomes, we have developed idiograms for each of the three Ae. aegypti chromosomes and placed 10 BAC clones and an 18S rDNA probe to precise chromosomal positions.

Conclusion

The study identified imaginal discs of 4th instar larvae as a superior source of mitotic chromosomes for Ae. aegypti. The proposed approach allows precise mapping of DNA probes to the chromosomal positions and can be utilized for obtaining a high-quality genome assembly of the yellow fever mosquito.

11.

Background

The purpose of the study is to elucidate the sequence composition of the short arm of rye chromosome 1 (Secale cereale) with special focus on its gene content, because this portion of the rye genome is an integrated part of several hundred bread wheat varieties worldwide.

Methodology/Principal Findings

Multiple Displacement Amplification of 1RS DNA, obtained from flow-sorted 1RS chromosomes using a 1RS ditelosomic wheat-rye addition line, and subsequent Roche 454FLX sequencing of this DNA yielded 195,313,589 bp of sequence information. This quantity of sequence information resulted in 0.43× sequence coverage of the 1RS chromosome arm, permitting the identification of genes with an estimated probability of 95%. A detailed analysis revealed that more than 5% of the 1RS sequence consisted of gene space, identifying at least 3,121 gene loci representing 1,882 different gene functions. Repetitive elements comprised about 72% of the 1RS sequence, Gypsy/Sabrina (13.3%) being the most abundant. More than four thousand simple sequence repeat (SSR) sites, mostly located in gene-related sequence reads, were identified for possible marker development. The existence of chloroplast insertions in 1RS has been verified by identifying chimeric chloroplast-genomic sequence reads. Synteny analysis of 1RS to the full genomes of Oryza sativa and Brachypodium distachyon revealed that about half of the genes of 1RS correspond to the distal end of the short arm of rice chromosome 5 and the proximal region of the long arm of Brachypodium distachyon chromosome 2. Comparison of the gene content of 1RS to the 1HS barley chromosome arm revealed high conservation of genes related to chromosome 5 of rice.
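The relationship between the two reported figures (total sequence and 0.43× coverage) can be checked with a short back-of-the-envelope calculation. The sketch below assumes the usual definition, coverage = sequenced bases / target length, and adds a Poisson (Lander-Waterman style) estimate of the fraction of bases touched by at least one read. That second step is an assumption introduced here for illustration; the abstract's 95% gene-identification probability comes from the authors' own estimate and is not reproduced by this code.

```python
import math

sequenced_bp = 195_313_589  # total 454 sequence reported in the abstract
coverage = 0.43             # reported sequence coverage of the 1RS arm

# coverage = sequenced bases / target length, so the implied arm length is:
implied_arm_bp = sequenced_bp / coverage

# Assumption: reads land uniformly at random (Poisson model), so the expected
# fraction of bases covered by at least one read is 1 - exp(-coverage).
fraction_covered = 1 - math.exp(-coverage)

print(f"implied 1RS length: {implied_arm_bp / 1e6:.0f} Mb")
print(f"expected fraction of bases covered: {fraction_covered:.2f}")
```

Under these assumptions the arm length implied by the abstract is roughly 454 Mb, and only about a third of individual bases would be expected to be covered at this depth.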

Conclusions

The present study revealed the gene content and potential gene functions on this chromosome arm and identified numerous sequence elements, such as SSRs and gene-related sequences, which can be utilised for future research as well as in breeding of wheat and rye.

12.
Chuang LY, Huang HC, Lin MC, Yang CH. PLoS One 2011, 6(6): e21036

Background

Regions with abundant GC nucleotides, a high CpG number, and a length greater than 200 bp in a genome are often referred to as CpG islands. These islands are usually located in the 5′ end of genes. Recently, several algorithms for the prediction of CpG islands have been proposed.

Methodology/Principal Findings

We propose here a new method called CPSORL to predict CpG islands, which consists of a complement particle swarm optimization algorithm combined with reinforcement learning to predict CpG islands more reliably. Several CpG island prediction tools equipped with the sliding window technique have been developed previously. However, the quality of the results seems to rely too much on the choices that are made for the window sizes, and thus these methods leave room for improvement.
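The sliding-window technique that the abstract contrasts CPSORL with can be sketched as follows. This is not CPSORL and not any of the named tools. Only the 200 bp length comes from the abstract; the GC-fraction and observed/expected CpG thresholds are the commonly cited Gardiner-Garden and Frommer values, assumed here, and the function names are hypothetical.

```python
def cpg_stats(seq):
    """Return (GC fraction, observed/expected CpG ratio) for one window."""
    seq = seq.upper()
    n = len(seq)
    c, g = seq.count("C"), seq.count("G")
    cpg = seq.count("CG")
    gc_fraction = (c + g) / n if n else 0.0
    # Expected CpG count if C and G occurred independently: (#C * #G) / length
    obs_exp = (cpg * n) / (c * g) if c and g else 0.0
    return gc_fraction, obs_exp


def island_windows(seq, window=200, step=1, min_gc=0.5, min_oe=0.6):
    """Yield start positions of windows that meet both thresholds."""
    for start in range(0, len(seq) - window + 1, step):
        gc, oe = cpg_stats(seq[start:start + window])
        if gc >= min_gc and oe >= min_oe:
            yield start


# Toy sequence, invented for illustration: a CG-rich stretch followed by AT.
toy = "CG" * 100 + "AT" * 100
print(list(island_windows(toy, window=200, step=100)))
```

The sketch makes the dependence on `window` and `step` explicit: changing either value changes which positions are reported, which is the sensitivity to window size that the abstract identifies as a weakness of this family of methods.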

Conclusions/Significance

Experimental results indicate that CPSORL provides higher sensitivity and a higher correlation coefficient in all selected experimental contigs than the other methods it was compared to (CpGIS, CpGcluster, CpGProd and CpGPlot). A higher number of CpG islands were identified in chromosomes 21 and 22 of the human genome than with the other methods from the literature. CPSORL also achieved the highest coverage rate (3.4%). CPSORL is an application for identifying promoter and TSS regions associated with CpG islands in the entire human genome. When compared to CpGcluster, the islands predicted by CPSORL covered a larger region in the TSS (12.2%) and promoter (26.1%) regions. If Alu sequences are considered, the islands predicted by CPSORL (Alu) covered a larger TSS (40.5%) and promoter (67.8%) region than CpGIS. Furthermore, CPSORL was used to verify that the average methylation density was 5.33% for CpG islands in the entire human genome.

13.
14.

Background

The bacterial taxon Polynucleobacter necessarius subspecies asymbioticus represents a group of planktonic freshwater bacteria with cosmopolitan and ubiquitous distribution in standing freshwater habitats. These bacteria comprise <1% to 70% (on average about 20%) of total bacterioplankton cells in various freshwater habitats. The ubiquity of this taxon was recently explained by intra-taxon ecological diversification, i.e. specialization of lineages to specific environmental conditions; however, details on specific adaptations are not known. Here we investigated by means of genomic and experimental analyses the ecological adaptation of a persistent population dwelling in a small acidic pond.

Findings

The investigated population (F10 lineage) contributed on average 11% to total bacterioplankton in the pond during the vegetation periods (ice-free period, usually May to November). Only a low degree of genetic diversification of the population could be revealed. These bacteria are characterized by a small genome size (2.1 Mb), a relatively small number of genes involved in transduction of environmental signals, and the lack of motility and quorum sensing. Experiments indicated that these bacteria live as chemoorganotrophs by mainly utilizing low-molecular-weight substrates derived from photooxidation of humic substances.

Conclusions

Evolutionary genome streamlining resulted in a highly passive lifestyle so far only known among free-living bacteria from pelagic marine taxa dwelling in environmentally stable nutrient-poor off-shore systems. Surprisingly, such a lifestyle is also successful in a highly dynamic and nutrient-richer environment such as the water column of the investigated pond, which was undergoing complete mixis and pronounced stratification in diurnal cycles. Obviously, metabolic and ecological versatility is not a prerequisite for long-lasting establishment of abundant bacterial populations under highly dynamic environmental conditions. Caution should be exercised when generalizing the obtained insights into the ecology and adaptation of the investigated lineage to other Polynucleobacter lineages.

15.
Zhang Y, Mao L, Wang H, Brocker C, Yin X, Vasiliou V, Fei Z, Wang X. PLoS One 2012, 7(2): e32153

Background

The completion of the grape genome sequencing project has paved the way for novel gene discovery and functional analysis. Aldehyde dehydrogenases (ALDHs) comprise a gene superfamily encoding NAD(P)+-dependent enzymes that catalyze the irreversible oxidation of a wide range of endogenous and exogenous aromatic and aliphatic aldehydes. Although ALDHs have been systematically investigated in several plant species including Arabidopsis and rice, our knowledge concerning the ALDH genes, their evolutionary relationship and expression patterns in grape has been limited.

Methodology/Principal Findings

A total of 23 ALDH genes were identified in the grape genome and grouped into ten families according to the unified nomenclature system developed by the ALDH Gene Nomenclature Committee (AGNC). Members within the same grape ALDH families possess nearly identical exon-intron structures. Evolutionary analysis indicates that both segmental and tandem duplication events have contributed significantly to the expansion of grape ALDH genes. Phylogenetic analysis of ALDH protein sequences from seven plant species indicates that grape ALDHs are more closely related to those of Arabidopsis. In addition, synteny analysis between grape and Arabidopsis shows that homologs of a number of grape ALDHs are found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the speciation of grape and Arabidopsis. Microarray gene expression analysis revealed a large number of grape ALDH genes responsive to drought or salt stress. Furthermore, we found that a number of ALDH genes showed significantly changed expression in response to infection with different pathogens and during grape berry development, suggesting novel roles of ALDH genes in plant-pathogen interactions and berry development.

Conclusion

The genome-wide identification, evolutionary and expression analysis of grape ALDH genes should facilitate research in this gene family and provide new insights regarding their evolutionary history and functional roles in plant stress tolerance.

16.

Background

Ampicillin-resistant Enterococcus faecium (ARE) has emerged as a nosocomial pathogen. Here, we quantified ARE carriage in different community sources and determined genetic relatedness with hospital ARE.

Methods and Results

ARE was recovered from rectal swabs of 24 of 79 (30%) dogs, 11 of 85 (13%) cats and 0 of 42 horses and from 3 of 40 (8%) faecal samples of non-hospitalized humans receiving amoxicillin. Multi-locus Sequence Typing revealed 21 sequence types (STs), including 5 STs frequently associated with hospital-acquired infections. Genes previously found to be enriched in hospital ARE, such as IS16, orf903, orf905, orf907, were highly prevalent in community ARE (≥79%), while genes with a proposed role in pathogenesis, such as esp, hyl and ecbA, were found rarely (≤5%) in community isolates. Comparative genome analysis of 2 representative dog isolates revealed that the dog strain of ST192 was evolutionarily closely linked to two previously sequenced hospital ARE, but had, based on gene content, more genes in common with the other, evolutionarily more distantly related, dog strain (ST266).

Conclusion

ARE were detected in dogs, cats and sporadically in healthy humans, with evolutionary linkage to hospital ARE. Yet, their accessory genome has diversified, probably as a result of niche adaptation.

17.

Background  

Terpenoids are among the most important constituents of grape flavour and wine bouquet, and serve as useful metabolite markers in viticulture and enology. Based on the initial 8-fold sequencing of a nearly homozygous Pinot noir inbred line, 89 putative terpenoid synthase genes (VvTPS) were predicted by in silico analysis of the grapevine (Vitis vinifera) genome assembly [1]. The finding of this very large VvTPS family, combined with the importance of terpenoid metabolism for the organoleptic properties of grapevine berries and finished wines, prompted a detailed examination of this gene family at the genomic level as well as an investigation into VvTPS biochemical functions.

18.

Background

Trypanosoma brucei brucei infects livestock, with severe effects in horses and dogs. Mouse strains differ greatly in susceptibility to this parasite. However, no genes controlling these differences have been mapped.

Methods

We studied the genetic control of survival after T. b. brucei infection using recombinant congenic (RC) strains, which have a high mapping power. Each RC strain of BALB/c-c-STS/A (CcS/Dem) series contains a different random subset of 12.5% genes from the parental “donor” strain STS/A and 87.5% genes from the “background” strain BALB/c. Although BALB/c and STS/A mice are similarly susceptible to T. b. brucei, the RC strain CcS-11 is more susceptible than either of them. We analyzed genetics of survival in T. b. brucei-infected F2 hybrids between BALB/c and CcS-11. CcS-11 strain carries STS-derived segments on eight chromosomes. They were genotyped in the F2 hybrid mice and their linkage with survival was tested by analysis of variance.
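The linkage test described above, analysis of variance of survival against marker genotype, can be illustrated with a hand-written one-way ANOVA. This is a minimal sketch only: the genotype labels (CC, CS, SS), the survival values and the function name are invented for illustration, and the study's actual model may have included transformations, covariates or interaction terms that are not shown here.

```python
def one_way_anova_f(groups):
    """Return the F statistic and its degrees of freedom for a one-way ANOVA."""
    values = [v for g in groups for v in g]
    grand_mean = sum(values) / len(values)
    # Variation of group means around the grand mean, weighted by group size.
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Variation of individual values around their own group mean.
    ss_within = sum((v - sum(g) / len(g)) ** 2 for g in groups for v in g)
    df_between = len(groups) - 1
    df_within = len(values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Invented survival times (days) for F2 mice grouped by genotype at one marker:
# CC = BALB/c homozygote, CS = heterozygote, SS = STS homozygote (hypothetical).
survival_by_genotype = {
    "CC": [10, 12, 11, 13],
    "CS": [9, 10, 11, 10],
    "SS": [7, 8, 6, 7],
}
f_stat, df1, df2 = one_way_anova_f(list(survival_by_genotype.values()))
print(f"F({df1}, {df2}) = {f_stat:.2f}")
```

A large F value at a marker indicates that mean survival differs between genotype classes, which is the kind of evidence used to infer linkage between a chromosomal segment and the trait. Converting F to a p-value requires the F distribution, which is omitted here to keep the sketch dependency-free.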

Results

We mapped four Tbbr (Trypanosoma brucei brucei response) loci that influence survival after T. b. brucei infection. Tbbr1 (chromosome 3) and Tbbr2 (chromosome 12) have effects on survival independent of inter-genic interactions (main effects). Tbbr3 (chromosome 7) influences survival in interaction with Tbbr4 (chromosome 19). Tbbr2 is located on a short 2.15 Mb segment that contains only 26 genes.

Conclusion

This study presents the first identification of chromosomal loci controlling susceptibility to T. b. brucei infection. While mapping in F2 hybrids of inbred strains usually has a precision of 40–80 Mb, in RC strains we mapped Tbbr2 to a 2.15 Mb segment containing only 26 genes, which will enable an effective search for the candidate gene. Definition of susceptibility genes will improve the understanding of pathways and genetic diversity underlying the disease and may result in new strategies to overcome the active subversion of the immune system by T. b. brucei.

19.

Background

The different regions of a genome do not evolve at the same rate. For example, comparative genomic studies have suggested that the sex chromosomes and the regions harbouring the immune defence genes in the Major Histocompatibility Complex (MHC) may evolve faster than other genomic regions. The advent of next-generation sequencing technologies has made it possible to study which genomic regions are evolutionarily liable to change and which are static, as well as enabling an increasing number of genome studies of non-model species. However, de novo sequencing of the whole genome of an organism remains non-trivial. In this study, we present the draft genome of the black grouse, which was developed using a reference-guided assembly strategy.

Results

We generated 133 Gbp of sequence data from one black grouse individual on the SOLiD platform and used a combination of de novo assembly and chicken reference genome mapping to assemble the reads into 4572 scaffolds with a total length of 1022 Mb. The draft genome covers well the main chicken chromosomes 1–28 and Z, which have a total length of 1001 Mb. The draft genome is fragmented, but has good coverage of the homologous chicken genes. In particular, 33.0% of the coding regions of the homologous genes have more than 90% of their sequence covered. In addition, we identified ~1 M SNPs from the genome and identified 106 genomic regions which had a high nucleotide divergence between black grouse and chicken or between black grouse and turkey.

Conclusions

Our results support the hypothesis that the chromosome X (Z) evolves faster than the autosomes and our data are consistent with the MHC regions being more liable to change than the genome average. Our study demonstrates how a moderate sequencing effort can be combined with existing genome references to generate a draft genome for a non-model species.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-180) contains supplementary material, which is available to authorized users.

20.

Background

In order to keep genome information accurate and relevant, original genome annotations need to be updated and evaluated regularly. Manual reannotation of genomes is important as it can significantly reduce the propagation of errors and consequently diminish the time spent on mistaken research. For this reason, five years after the initial submission of the Entamoeba histolytica draft genome publication, we have re-examined the original 23 Mb assembly and the annotation of the predicted genes.

Principal Findings

The evaluation of the genomic sequence led to the identification of more than one hundred artifactual tandem duplications that were eliminated by re-assembling the genome. The reannotation was done using a combination of manual and automated genome analysis. The new 20 Mb assembly contains 1,496 scaffolds and 8,201 predicted genes, of which 60% are identical to the initial annotation and the remaining 40% underwent structural changes. Functional classification of 60% of the genes was modified based on recent sequence comparisons and new experimental data. We have assigned putative function to 3,788 proteins (46% of the predicted proteome) based on the annotation of predicted gene families, and have identified 58 protein families of five or more members that share no homology with known proteins and thus could be entamoeba specific. Genome analysis also revealed new features such as the presence of segmental duplications of up to 16 kb flanked by inverted repeats, and the tight association of some gene families with transposable elements.

Significance

This new genome annotation and analysis represents a more refined and accurate blueprint of the pathogen genome, and provides an upgraded tool as reference for the study of many important aspects of E. histolytica biology, such as genome evolution and pathogenesis.
