首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
MOTIVATION: Single nucleotide polymorphisms (SNPs) analysis is an important means to study genetic variation. A fast and cost-efficient approach to identify large numbers of novel candidates is the SNP mining of large scale sequencing projects. The increasing availability of sequence trace data in public repositories makes it feasible to evaluate SNP predictions on the DNA chromatogram level. MAVIANT, a platform-independent Multipurpose Alignment VIewing and Annotation Tool, provides DNA chromatogram and alignment views and facilitates evaluation of predictions. In addition, it supports direct manual annotation, which is immediately accessible and can be easily shared with external collaborators. RESULTS: Large-scale SNP mining of polymorphisms bases on porcine EST sequences yielded more than 7900 candidate SNPs in coding regions (cSNPs), which were annotated relative to the human genome. Non-synonymous SNPs were analyzed for their potential effect on the protein structure/function using the PolyPhen and SIFT prediction programs. Predicted SNPs and annotations are stored in a web-based database. Using MAVIANT SNPs can visually be verified based on the DNA sequencing traces. A subset of candidate SNPs was selected for experimental validation by resequencing and genotyping. This study provides a web-based DNA chromatogram and contig browser that facilitates the evaluation and selection of candidate SNPs, which can be applied as genetic markers for genome wide genetic studies. AVAILABILITY: The stand-alone version of MAVIANT program for local use is freely available under GPL license terms at http://snp.agrsci.dk/maviant. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.  相似文献   

2.
3.
4.
The increase in availability of resequencing data is greatly accelerating SNP discovery and has facilitated the development of SNP genotyping assays. This, in turn, is increasing interest in annotation of individual SNPs. Currently, these data are only available through curation, or comparison to a reference genome. Many species lack a reference genome, but are still important genetic models or are significant species in agricultural production or natural ecosystems. For these species, it is possible to annotate SNPs through comparison with cDNA, or data from well‐annotated genes in public repositories. We present SNPMeta, a tool which gathers information about SNPs by comparison with sequences present in GenBank databases. SNPMeta is able to annotate SNPs from contextual sequence in SNP assay designs, and SNPs discovered through genotyping by sequencing (GBS) approaches. However, SNPs discovered through GBS occur throughout the genome, rather than only in gene space, and therefore do not annotate at high rates. SNPMeta can therefore be used to annotate SNPs in nonmodel species or species that lack a reference genome. Annotations generated by SNPMeta are highly concordant with annotations that would be obtained from a reference genome.  相似文献   

5.
6.
SUMMARY: Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variations in closely related microbial species, strains or isolates. Some SNPs confer selective advantages for microbial pathogens during infection and many others are powerful genetic markers for distinguishing closely related strains or isolates that could not be distinguished otherwise. To facilitate SNP discovery in microbial genomes, we have developed a web-based application, SNPsFinder, for genome-wide identification of SNPs. SNPsFinder takes multiple genome sequences as input to identify SNPs within homologous regions. It can also take contig sequences and sequence quality scores from ongoing sequencing projects for SNP prediction. SNPsFinder will use genome sequence annotation if available and map the predicted SNP regions to known genes or regions to assist further evaluation of the predicted SNPs for their functional significance. SNPsFinder can generate PCR primers for all predicted SNP regions according to user's input parameters to facilitate experimental validation. The results from SNPsFinder analysis are accessible through the World Wide Web. AVAILABILITY: The SNPsFinder program is available at http://snpsfinder.lanl.gov/. SUPPLEMENTARY INFORMATION: The user's manual is available at http://snpsfinder.lanl.gov/UsersManual/  相似文献   

7.
8.
Forensically relevant SNP classes   总被引:2,自引:0,他引:2  
Budowle B  van Daal A 《BioTechniques》2008,44(5):603-8, 610
Forensic samples that contain too little template DNA or are too degraded require alternate genetic marker analyses or approaches to what is currently used for routine casework. Single nucleotide polymorphisms (SNPs) offer promise to support forensic DNA analyses because of an abundance of potential markers, amenability to automation, and potential reduction in required fragment length to only 60-80 bp. The SNP markers will serve an important role in analyzing challenging forensic samples, such as those that are very degraded, for augmenting the power of kinship analyses and family reconstructions for missing persons and unidentified human remains, as well as for providing investigative lead value in some cases without a suspect (and no genetic profile match in CODIS). The SNPs for forensic analyses can be divided into four categories: identity-testing SNPs; lineage informative SNPs; ancestry informative SNPs; and phenotype informative SNPs. In addition to discussing the applications of these different types of SNPs, this article provides some discussion on privacy issues so that society and policymakers can be more informed.  相似文献   

9.
The power of genome-wide SNP association studies is limited, among others, by the large number of false positive test results. To provide a remedy, we combined SNP association analysis with the pathway-driven gene set enrichment analysis (GSEA), recently developed to facilitate handling of genome-wide gene expression data. The resulting GSEA-SNP method rests on the assumption that SNPs underlying a disease phenotype are enriched in genes constituting a signaling pathway or those with a common regulation. Besides improving power for association mapping, GSEA-SNP may facilitate the identification of disease-associated SNPs and pathways, as well as the understanding of the underlying biological mechanisms. GSEA-SNP may also help to identify markers with weak effects, undetectable in association studies without pathway consideration. The program is freely available and can be downloaded from our website.  相似文献   

10.

Background

High-throughput genotype (HTG) data has been used primarily in genome-wide association (GWA) studies; however, GWA results explain only a limited part of the complete genetic variation of traits. In systems genetics, network approaches have been shown to be able to identify pathways and their underlying causal genes to unravel the biological and genetic background of complex diseases and traits, e.g., the Weighted Gene Co-expression Network Analysis (WGCNA) method based on microarray gene expression data. The main objective of this study was to develop a scale-free weighted genetic interaction network method using whole genome HTG data in order to detect biologically relevant pathways and potential genetic biomarkers for complex diseases and traits.

Results

We developed the Weighted Interaction SNP Hub (WISH) network method that uses HTG data to detect genome-wide interactions between single nucleotide polymorphism (SNPs) and its relationship with complex traits. Data dimensionality reduction was achieved by selecting SNPs based on its: 1) degree of genome-wide significance and 2) degree of genetic variation in a population. Network construction was based on pairwise Pearson's correlation between SNP genotypes or the epistatic interaction effect between SNP pairs. To identify modules the Topological Overlap Measure (TOM) was calculated, reflecting the degree of overlap in shared neighbours between SNP pairs. Modules, clusters of highly interconnected SNPs, were defined using a tree-cutting algorithm on the SNP dendrogram created from the dissimilarity TOM (1-TOM). Modules were selected for functional annotation based on their association with the trait of interest, defined by the Genome-wide Module Association Test (GMAT). We successfully tested the established WISH network method using simulated and real SNP interaction data and GWA study results for carcass weight in a pig resource population; this resulted in detecting modules and key functional and biological pathways related to carcass weight.

Conclusions

We developed the WISH network method which is a novel 'systems genetics' approach to study genetic networks underlying complex trait variation. The WISH network method reduces data dimensionality and statistical complexity in associating genotypes with phenotypes in GWA studies and enables researchers to identify biologically relevant pathways and potential genetic biomarkers for any complex trait of interest.
  相似文献   

11.
水稻单核苷酸多态性及其应用现状   总被引:6,自引:0,他引:6  
刘传光  张桂权 《遗传》2006,28(6):737-744
单核苷酸多态性(single nucleotide polymorphisms, SNPs)在水稻中数量多,分布密度高,遗传稳定性高。水稻SNPs的发现方法主要有对样本DNA的PCR产物直接测序、从SSR区段检测SNPs和从基因组序列直接搜索等。目前已有多种基因分型技术运用到了水稻SNPs检测,SNPs检测的高度自动化使水稻SNPs基因分型非常方便。单核苷酸多态性在水稻遗传图谱的构建、基因克隆和功能基因组学研究、标记辅助选择育种、遗传资源分类及物种进化等方面的应用具有巨大潜力。  相似文献   

12.
One of the applications of genomics is to identify genetic markers linked to loci responsible for variation in phenotypic traits, which could be used in breeding programs to select individuals with favorable alleles, particularly at the seedling stage. With this aim, in the framework of the European project FruitBreedomics, we selected five main peach fruit characters and a resistance trait, controlled by major genes with Mendelian inheritance: fruit flesh color Y, fruit skin pubescence G, fruit shape S, sub-acid fruit D, stone adhesion-flesh texture F-M, and resistance to green peach aphid Rm2. They were all previously mapped in Prunus. We then selected three F1 and three F2 progenies segregating for these characters and developed genetic maps of the linkage groups including the major genes, using the single nucleotide polymorphism (SNP) genome-wide scans obtained with the International Peach SNP Consortium (IPSC) 9K SNP array v1. We identified SNPs co-segregating with the characters in all cases. Their positions were in agreement with the known positions of the major genes. The number of SNPs linked to each of these, as well as the size of the physical regions encompassing them, varied depending on the maps. As a result, the number of useful SNPs for marker-assisted selection varied accordingly. As a whole, this study establishes a sound basis for further development of MAS on these characters. Additionally, we also discussed some limitations that were observed regarding the SNP array efficiency.  相似文献   

13.
Despite single nucleotide polymorphism (SNP) availability and frequent cost reduction has allowed genome-wide association studies even in complex traits as tick resistance, the use of this information source in SNP by environment interaction context is unknown for many economically important traits in cattle. We aimed at identifying putative genomic regions explaining differences in tick resistance in Hereford and Braford cattle under SNP by environment point of view as well as to identify candidate genes derived from outliers/significant markers. The environment was defined as contemporary group means of tick counts, since they seemed to be the most appropriate entities to describe the environmental gradient in beef cattle. A total of 4363 animals having tick counts (n=10 673) originated from 197 sires and 3966 dams were used. Genotypes were acquired on 3591 of these cattle. From top 1% SNPs (410) having the greatest effects in each environment, 75 were consistently relevant in all environments, which indicated SNP by environment interaction. The outliers/significant SNPs were mapped on chromosomes 1, 2, 5, 6, 7, 9, 11, 13, 14, 15, 16, 18, 21, 23, 24, 26 and 28, and potential candidate genes were detected across environments. The presence of SNP by environment interaction for tick resistance indicates that genetic expression of resistance depends upon tick burden. Markers with major portion of genetic variance explained across environments appeared to be close to genes with different direct or indirect functions related to immune system, inflammatory process and mechanisms of tissue destruction/repair, such as energy metabolism and cell differentiation.  相似文献   

14.
Li C  Li Y  Xu J  Lv J  Ma Y  Shao T  Gong B  Tan R  Xiao Y  Li X 《Gene》2011,489(2):119-129
Detection of the synergetic effects between variants, such as single-nucleotide polymorphisms (SNPs), is crucial for understanding the genetic characters of complex diseases. Here, we proposed a two-step approach to detect differentially inherited SNP modules (synergetic SNP units) from a SNP network. First, SNP-SNP interactions are identified based on prior biological knowledge, such as their adjacency on the chromosome or degree of relatedness between the functional relationships of their genes. These interactions form SNP networks. Second, disease-risk SNP modules (or sub-networks) are prioritised by their differentially inherited properties in IBD (Identity by Descent) profiles of affected and unaffected sibpairs. The search process is driven by the disease information and follows the structure of a SNP network. Simulation studies have indicated that this approach achieves high accuracy and a low false-positive rate in the identification of known disease-susceptible SNPs. Applying this method to an alcoholism dataset, we found that flexible patterns of susceptible SNP combinations do play a role in complex diseases, and some known genes were detected through these risk SNP modules. One example is GRM7, a known alcoholism gene successfully detected by a SNP module comprised of two SNPs, but neither of the two SNPs was significantly associated with the disease in single-locus analysis. These identified genes are also enriched in some pathways associated with alcoholism, including the calcium signalling pathway, axon guidance and neuroactive ligand-receptor interaction. The integration of network biology and genetic analysis provides putative functional bridges between genetic variants and candidate genes or pathways, thereby providing new insight into the aetiology of complex diseases.  相似文献   

15.
16.
The advances in genotyping technology provide an opportunity to use genomic tools in crop breeding. As compared to field selections performed in conventional breeding programmes, genomics‐based genotype screen can potentially reduce number of breeding cycles and more precisely integrate target genes for particular traits into an ideal genetic background. We developed a whole‐genome single nucleotide polymorphism (SNP) array, RICE6K, based on Infinium technology, using representative SNPs selected from more than four million SNPs identified from resequencing data of more than 500 rice landraces. RICE6K contains 5102 SNP and insertion–deletion (InDel) markers, about 4500 of which were of high quality in the tested rice lines producing highly repeatable results. Forty‐five functional markers that are located inside 28 characterized genes of important traits can be detected using RICE6K. The SNP markers are evenly distributed on the 12 chromosomes of rice with the average density of 12 SNPs per 1 Mb and can provide information for polymorphisms between indica and japonica subspecies as well as varieties within indica and japonica groups. Application tests of RICE6K showed that the array is suitable for rice germplasm fingerprinting, genotyping bulked segregating pools, seed authenticity check and genetic background selection. These results suggest that RICE6K provides an efficient and reliable genotyping tool for rice genomic breeding.  相似文献   

17.
Although a large number of single nucleotide polymorphism (SNP) markers covering the entire genome are needed to enable molecular breeding efforts such as genome wide association studies, fine mapping, genomic selection and marker-assisted selection in peach [Prunus persica (L.) Batsch] and related Prunus species, only a limited number of genetic markers, including simple sequence repeats (SSRs), have been available to date. To address this need, an international consortium (The International Peach SNP Consortium; IPSC) has pursued a coordinated effort to perform genome-scale SNP discovery in peach using next generation sequencing platforms to develop and characterize a high-throughput Illumina Infinium® SNP genotyping array platform. We performed whole genome re-sequencing of 56 peach breeding accessions using the Illumina and Roche/454 sequencing technologies. Polymorphism detection algorithms identified a total of 1,022,354 SNPs. Validation with the Illumina GoldenGate® assay was performed on a subset of the predicted SNPs, verifying ∼75% of genic (exonic and intronic) SNPs, whereas only about a third of intergenic SNPs were verified. Conservative filtering was applied to arrive at a set of 8,144 SNPs that were included on the IPSC peach SNP array v1, distributed over all eight peach chromosomes with an average spacing of 26.7 kb between SNPs. Use of this platform to screen a total of 709 accessions of peach in two separate evaluation panels identified a total of 6,869 (84.3%) polymorphic SNPs.The almost 7,000 SNPs verified as polymorphic through extensive empirical evaluation represent an excellent source of markers for future studies in genetic relatedness, genetic mapping, and dissecting the genetic architecture of complex agricultural traits. The IPSC peach SNP array v1 is commercially available and we expect that it will be used worldwide for genetic studies in peach and related stone fruit and nut species.  相似文献   

18.
19.
《Genomics》2020,112(5):3238-3246
Knowledge on population structure and genetic diversity is a focal point for association mapping studies and genomic selection. Genotyping by sequencing (GBS) represents an innovative method for large scale SNP detection and genotyping of genetic resources. Here we used the GBS approach for the genome-wide identification of SNPs in a collection of Cynoglossus semilaevis and for the assessment of the level of genetic diversity in C. semilaevis genotypes. GBS analysis generated a total of 55.12 Gb high-quality sequence data, with an average of 0.63 Gb per sample. The total number of SNP markers was 563, 109. In order to explore the genetic diversity of C. semilaevis and to select a minimal core set representing most of the total genetic variation with minimum redundancy, C. semilaevis sequences were analyzed using high quality SNPs. Based on hierarchical clustering, it was possible to divide the collection into 2 clusters. The marine fishing populations were clustered and clearly separated from the cultured populations, and the cultured populations from Hebei was also distinct from the other two local populations. These analyses showed that genotypes were clustered based on species-related features. Differential significant SNPs were also captured and validated by GBS and SNaPshot, with linkage disequilibrium and haplotype analysis, seven SNPs have been confirmed to have obvious differentiation in two populations, which may be used as the characteristic evaluation sites of sea-captured and cultured Cynoglossus semilaevis populations. And SNP markers and information on population structure developed in this study will undoubtedly support genome-wide association mapping studies and marker-assisted selection programs. These differential SNPs could be also employed as the characteristic evaluation sites of sea-captured and cultured Cynoglossus semilaevis populations in future.  相似文献   

20.
Hao Z  Li X  Xie C  Weng J  Li M  Zhang D  Liang X  Liu L  Liu S  Zhang S 《植物学报(英文版)》2011,53(8):641-652
Single nucleotide polymorphism (SNP) is a common form of genetic variation and popularly exists in maize genome. An Illumina GoldenGate assay with 1 536 SNP markers was used to genotype maize inbred lines and identified the functional genetic variations underlying drought tolerance by association analysis. Across 80 lines, 1 006 polymorphic SNPs (65.5% of the total) in the assay with good call quality were used to estimate the pattern of genetic diversity, population structure, and familial relatedness. The analysis showed the best number of fixed subgroups was six, which was consistent with their original sources and results using only simple sequence repeat markers. Pairwise linkage disequilibrium (LD) and association mapping with phenotypic traits investigated under water-stressed and well-watered regimes showed rapid LD decline within 100-500 kb along the physical distance of each chromosome, and that 29 SNPs were associated with at least two phenotypic traits in one or more environments, which were related to drought-tolerant or drought-responsive genes. These drought-tolerant SNPs could be converted into functional markers and then used for maize improvement by marker-assisted selection.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号