首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 437 毫秒
1.
Selective signatures in whole genome can help us understand the mechanisms of selection and target causal variants for breeding program. In present study, we performed Extended Haplotype Homozygosity (EHH) tests to identify significant core regions harboring such signals in Chinese Holstein, and then verified the biological significance of these identified regions based on commonly-used bioinformatics analyses. Results showed a total of 125 significant regions in entire genome containing some of important functional genes such as LEP, ABCG2, CSN1S1, CSN3 and TNF based on the Gene Ontology database. Some of these annotated genes involved in the core regions overlapped with those identified in our previous GWAS as well as those involved in a recently constructed candidate gene database for cattle, further indicating these genes under positive selection maybe underlie milk production traits and other important traits in Chinese Holstein. Furthermore, in the enrichment analyses for the second level GO terms and pathways, we observed some significant terms over represented in these identified regions as compared to the entire bovine genome. This indicates that some functional genes associated with milk production traits, as reflected by GO terms, could be clustered in core regions, which provided promising evidence for the exploitability of the core regions identified by EHH tests. Findings in our study could help detect functional candidate genes under positive selection for further genetic and breeding research in Chinese Holstein.  相似文献   

2.
In dairy sheep flocks from Mediterranean countries, replacement and adult ewes are the animals most affected by gastrointestinal nematode (GIN) infections. In this study, we have exploited the information derived from an RNA-Seq experiment with the aim of identifying potential causal mutations related to GIN resistance in sheep. Considering the RNA-Seq samples from 12 ewes previously classified as six resistant and six susceptible animals to experimental infection by Teladorsagia circumcincta, we performed a variant calling analysis pipeline using two different types of software, gatk version 3.7 and Samtools version 1.4. The variants commonly identified by the two packages (high-quality variants) within two types of target regions – (i) QTL regions previously reported in sheep for parasite resistance based on SNP-chip or sequencing technology studies and (ii) functional candidate genes selected from gene expression studies related to GIN resistance in sheep – were further characterised to identify mutations with a potential functional impact. Among the genes harbouring these potential functional variants (930 and 553 respectively for the two types of regions), we identified 111 immune-related genes in the QTL regions and 132 immune-related genes from the initially selected candidate genes. For these immune-related genes harbouring potential functional variants, the enrichment analyses performed highlighted significant GO terms related to apoptosis, adhesion and inflammatory response, in relation to the QTL related variants, and significant disease-related terms such as inflammation, adhesion and necrosis, in relation to the initial candidate gene list. Overall, the study provides a valuable list of potential causal mutations that could be considered as candidate causal mutations in relation to GIN resistance in sheep. Future studies should assess the role of these suggested mutations with the aim of identifying genetic markers that could be directly implemented in sheep breeding programmes considering not only production traits, but also functional traits such as resistance to GIN infections.  相似文献   

3.
Both growth and immune capacity are important traits in animal breeding. The animal quantitative trait loci (QTL) database is a valuable resource and can be used for interpreting the genetic mechanisms that underlie growth and immune traits. However, QTL intervals often involve too many candidate genes to find the true causal genes. Therefore, the aim of this study was to provide an effective annotation pipeline that can make full use of the information of Gene Ontology terms annotation, linkage gene blocks and pathways to further identify pleiotropic genes and gene sets in the overlapping intervals of growth-related and immunity-related QTLs. In total, 55 non-redundant QTL overlapping intervals were identified, 1893 growth-related genes and 713 immunity-related genes were further classified into overlapping intervals and 405 pleiotropic genes shared by the two gene sets were determined. In addition, 19 pleiotropic gene linkage blocks and 67 pathways related to immunity and growth traits were discovered. A total of 343 growth-related genes and 144 immunity-related genes involved in pleiotropic pathways were also identified, respectively. We also sequenced and genotyped 284 individuals from Chinese Meishan pigs and European pigs and mapped the single nucleotide polymorphisms (SNPs) to the pleiotropic genes and gene sets that we identified. A total of 971 high-confidence SNPs were mapped to the pleiotropic genes and gene sets that we identified, and among them 743 SNPs were statistically significant in allele frequency between Meishan and European pigs. This study explores the relationship between growth and immunity traits from the view of QTL overlapping intervals and can be generalized to explore the relationships between other traits.  相似文献   

4.
An integral part of functional genomics studies is to assess the enrichment of specific biological terms in lists of genes found to be playing an important role in biological phenomena. Contrasting the observed frequency of annotated terms with those of the background is at the core of overrepresentation analysis (ORA). Gene Ontology (GO) is a means to consistently classify and annotate gene products and has become a mainstay in ORA. Alternatively, Medical Subject Headings (MeSH) offers a comprehensive life science vocabulary including additional categories that are not covered by GO. Although MeSH is applied predominantly in human and model organism research, its full potential in livestock genetics is yet to be explored. In this study, MeSH ORA was evaluated to discern biological properties of identified genes and contrast them with the results obtained from GO enrichment analysis. Three published datasets were employed for this purpose, representing a gene expression study in dairy cattle, the use of SNPs for genome‐wide prediction in swine and the identification of genomic regions targeted by selection in horses. We found that several overrepresented MeSH annotations linked to these gene sets share similar concepts with those of GO terms. Moreover, MeSH yielded unique annotations, which are not directly provided by GO terms, suggesting that MeSH has the potential to refine and enrich the representation of biological knowledge. We demonstrated that MeSH can be regarded as another choice of annotation to draw biological inferences from genes identified via experimental analyses. When used in combination with GO terms, our results indicate that MeSH can enhance our functional interpretations for specific biological conditions or the genetic basis of complex traits in livestock species.  相似文献   

5.
6.
MOTIVATION: Numerous annotations are available that functionally characterize genes and proteins with regard to molecular process, cellular localization, tissue expression, protein domain composition, protein interaction, disease association and other properties. Searching this steadily growing amount of information can lead to the discovery of new biological relationships between genes and proteins. To facilitate the searches, methods are required that measure the annotation similarity of genes and proteins. However, most current similarity methods are focused only on annotations from the Gene Ontology (GO) and do not take other annotation sources into account. RESULTS: We introduce the new method BioSim that incorporates multiple sources of annotations to quantify the functional similarity of genes and proteins. We compared the performance of our method with four other well-known methods adapted to use multiple annotation sources. We evaluated the methods by searching for known functional relationships using annotations based only on GO or on our large data warehouse BioMyn. This warehouse integrates many diverse annotation sources of human genes and proteins. We observed that the search performance improved substantially for almost all methods when multiple annotation sources were included. In particular, our method outperformed the other methods in terms of recall and average precision.  相似文献   

7.
Unraveling the genetic background of economic traits is a major goal in modern animal genetics and breeding. Both candidate gene analysis and QTL mapping have previously been used for identifying genes and chromosome regions related to studied traits. However, most of these studies may be limited in their ability to fully consider how multiple genetic factors may influence a particular phenotype of interest. If possible, taking advantage of the combined effect of multiple genetic factors is expected to be more powerful than analyzing single sites, as the joint action of multiple loci within a gene or across multiple genes acting in the same gene set will likely have a greater influence on phenotypic variation. Thus, we proposed a pipeline of gene set analysis that utilized information from multiple loci to improve statistical power. We assessed the performance of this approach by both simulated and a real IGF1-FoxO pathway data set. The results showed that our new method can identify the association between genetic variation and phenotypic variation with higher statistical power and unravel the mechanisms of complex traits in a point of gene set. Additionally, the proposed pipeline is flexible to be extended to model complex genetic structures that include the interactions between different gene sets and between gene sets and environments.  相似文献   

8.
Salmonids are an important cultural and ecological resource exhibiting near worldwide distribution between their native and introduced range. Previous research has generated linkage maps and genomic resources for several species as well as genome assemblies for two species. We first leveraged improvements in mapping and genotyping methods to create a dense linkage map for Chinook salmon Oncorhynchus tshawytscha by assembling family data from different sources. We successfully mapped 14 620 SNP loci including 2336 paralogs in subtelomeric regions. This improved map was then used as a foundation to integrate genomic resources for gene annotation and population genomic analyses. We anchored a total of 286 scaffolds from the Atlantic salmon genome to the linkage map to provide a framework for the placement 11 728 Chinook salmon ESTs. Previously identified thermotolerance QTL were found to colocalize with several candidate genes including HSP70, a gene known to be involved in thermal response, as well as its inhibitor. Multiple regions of the genome with elevated divergence between populations were also identified, and annotation of ESTs in these regions identified candidate genes for fitness related traits such as stress response, growth and behaviour. Collectively, these results demonstrate the utility of combining genomic resources with linkage maps to enhance evolutionary inferences.  相似文献   

9.
Genome-wide association studies (GWAS) are designed to identify the portion of single-nucleotide polymorphisms (SNPs) in genome sequences associated with a complex trait. Strategies based on the gene list enrichment concept are currently applied for the functional analysis of GWAS, according to which a significant overrepresentation of candidate genes associated with a biological pathway is used as a proxy to infer overrepresentation of candidate SNPs in the pathway. Here we show that such inference is not always valid and introduce the program SNP2GO, which implements a new method to properly test for the overrepresentation of candidate SNPs in biological pathways.  相似文献   

10.
With numerous whole genomes now in hand, and experimental data about genes and biological pathways on the increase, a systems approach to biological research is becoming essential. Ontologies provide a formal representation of knowledge that is amenable to computational as well as human analysis, an obvious underpinning of systems biology. Mapping function to gene products in the genome consists of two, somewhat intertwined enterprises: ontology building and ontology annotation. Ontology building is the formal representation of a domain of knowledge; ontology annotation is association of specific genomic regions (which we refer to simply as 'genes', including genes and their regulatory elements and products such as proteins and functional RNAs) to parts of the ontology. We consider two complementary representations of gene function: the Gene Ontology (GO) and pathway ontologies. GO represents function from the gene's eye view, in relation to a large and growing context of biological knowledge at all levels. Pathway ontologies represent function from the point of view of biochemical reactions and interactions, which are ordered into networks and causal cascades. The more mature GO provides an example of ontology annotation: how conclusions from the scientific literature and from evolutionary relationships are converted into formal statements about gene function. Annotations are made using a variety of different types of evidence, which can be used to estimate the relative reliability of different annotations.  相似文献   

11.
Quantitative traits often underlie risk for complex diseases. Many studies collect multiple correlated quantitative phenotypes and perform univariate analyses on each of them respectively. However, this strategy may not be powerful and has limitations to detect plei- otropic genes that may underlie correlated quantitative traits. In addition, testing multiple traits individually will exacerbate perplexing problem of multiple testing. In this study, generalized estimating equation 2 (GEE2) is applied to association mapping of two correlated quantitative traits. We suppose that a quantitative trait locus is located in a chromosome region that exerts pleiotropic effects on multiple quantitative traits. In that region, multiple SNPs are genotyped. Genotypes of these SNPs and the two quantitative traits affected by a causal SNP were simulated under various parameter values: residual correlation coefficient between two traits, causal SNP heritability, minor allele frequency of the causal SNP, extent of linkage disequilibrium with the causal SNP, and the test sample size. By power ana- lytical analyses, it is showed that the bivariate method is generally more powerful than the univariate method. This method is robust and yields false-positive rates close to the pre-set nominal significance level. Our real data analyses attested to the usefulness of the method.  相似文献   

12.
Understanding the genetic influences of traits of nonmodel organisms is crucial to understanding how novel traits arise. Do new traits require new genes or are old genes repurposed? How predictable is this process? Here, we examine this question for gene expression influencing parenting behavior in a beetle, Nicrophorus vespilloides. Parental care, produced from many individual behaviors, should be influenced by changes of expression of multiple genes, and one suggestion is that the genes can be predicted based on knowledge of behavior expected to be precursors to parental care, such as aggression, resource defense, and mating on a resource. Thus, testing gene expression during parental care allows us to test expectations of this “precursor hypothesis” for multiple genes and traits. We tested for changes of the expression of serotonin, octopamine/tyramine, and dopamine receptors, as well as one glutamate receptor, predicting that these gene families would be differentially expressed during social interactions with offspring and associated resource defense. We found that serotonin receptors were strongly associated with social and aggression behavioral transitions. Octopamine receptors produced a complex picture of gene expression over a reproductive cycle. Dopamine was not associated with the behavioral transitions sampled here, while the glutamate receptor was most consistent with a behavioral change of resource defense/aggression. Our results generate new hypotheses, refine candidate lists for further studies, and inform the genetic mechanisms that are co‐opted during the evolution of parent–offspring interactions, a likely evolutionary path for many lineages that become fully social. The precursor hypothesis, while not perfect, does provide a starting point for identifying candidate genes.  相似文献   

13.
14.
Previous association analyses showed that variation at major regulatory genes contributes to standing variation for complex traits in Balsas teosinte, the progenitor of maize. This study expands our previous association mapping effort in teosinte by testing 123 markers in 52 candidate genes for association with 31 traits in a population of 817 individuals. Thirty-three significant associations for markers from 15 candidate genes and 10 traits survive correction for multiple testing. Our analyses suggest several new putative causative relationships between specific genes and trait variation in teosinte. For example, two ramosa genes (ra1 and ra2) associate with ear structure, and the MADS-box gene, zagl1, associates with ear shattering. Since zagl1 was previously shown to be a target of selection during maize domestication, we suggest that this gene was under selection for its effect on the loss of ear shattering, a key domestication trait. All observed effects were relatively small in terms of the percentage of phenotypic variation explained (<10%). We also detected several epistatic interactions between markers in the same gene that associate with the same trait. Candidate-gene-based association mapping appears to be a promising method for investigating the inheritance of complex traits in teosinte.  相似文献   

15.
Identification of candidate causal variants in regions associated with risk of common diseases is complicated by linkage disequilibrium (LD) and multiple association signals. Nonetheless, accurate maps of these variants are needed, both to fully exploit detailed cell specific chromatin annotation data to highlight disease causal mechanisms and cells, and for design of the functional studies that will ultimately be required to confirm causal mechanisms. We adapted a Bayesian evolutionary stochastic search algorithm to the fine mapping problem, and demonstrated its improved performance over conventional stepwise and regularised regression through simulation studies. We then applied it to fine map the established multiple sclerosis (MS) and type 1 diabetes (T1D) associations in the IL-2RA (CD25) gene region. For T1D, both stepwise and stochastic search approaches identified four T1D association signals, with the major effect tagged by the single nucleotide polymorphism, rs12722496. In contrast, for MS, the stochastic search found two distinct competing models: a single candidate causal variant, tagged by rs2104286 and reported previously using stepwise analysis; and a more complex model with two association signals, one of which was tagged by the major T1D associated rs12722496 and the other by rs56382813. There is low to moderate LD between rs2104286 and both rs12722496 and rs56382813 (r2 ≃ 0:3) and our two SNP model could not be recovered through a forward stepwise search after conditioning on rs2104286. Both signals in the two variant model for MS affect CD25 expression on distinct subpopulations of CD4+ T cells, which are key cells in the autoimmune process. The results support a shared causal variant for T1D and MS. Our study illustrates the benefit of using a purposely designed model search strategy for fine mapping and the advantage of combining disease and protein expression data.  相似文献   

16.
Family-based candidate gene and genome-wide association studies are a logical progression from linkage studies for the identification of gene and polymorphisms underlying complex traits. An efficient way to analyse phenotypic and genotypic data is to model linkage and association simultaneously. An important result from such an analysis is whether any evidence for linkage remains after fitting polymorphisms at candidate genes (residual linkage), because this may indicate locus and allelic heterogeneity in the population and will influence subsequent molecular strategies. Here we report that substantial residual linkage is to be expected, even under genetic homogeneity and when the underlying causal polymorphisms are genotyped and fitted in the model. We simulated a powerful design to detect linkage to quantitative trait loci, with 5, 10 or 20 causal SNPs spread throughout the genome. These SNPs were responsible for all genetic variation, and hence for both linkage and association. Residual linkage at the largest linkage peak from a genome-wide scan was substantial, with mean LOD scores of 0.4, 0.7, and 1.4 for the case of 5, 10 and 20 underlying causal SNPs, respectively. For less powerful designs, the proportion of the original LOD scores that remains after association will be even larger. All cases of ‘significant’ residual linkage are false positives. The reason for the apparent paradox of detecting residual linkage after fitting causal polymorphisms is that the linkage signals at the largest peaks in a genome-scan are severely inflated, even if all peaks correspond to true linkage. Our findings are general and apply to linkage mapping of any phenotype and to any pedigree structure.  相似文献   

17.
The speed of sound (SOS) value is an indicator of bone mineral density (BMD). Previous genome-wide association (GWA) studies have identified a number of genes, whose variations may affect BMD levels. However, their biological implications have been elusive. We re-analyzed the GWA study dataset for the SOS values in skeletal sites of 4,659 Korean women, using a gene-set analysis software, GSA-SNP. We identified 10 common representative GO terms, and 17 candidate genes between these two traits (PGS < 0.05). Implication of these GO terms and genes in the bone mechanism is well supported by the literature survey. Interestingly, the significance levels of some member genes were inversely related, in several gene-sets that were shared between two skeletal sites. This implies that biological process, rather than SNP or gene, is the substantial unit of genetic association for SOS in bone. In conclusion, our findings may provide new insights into the biological mechanisms for BMD. [BMB Reports 2014; 47(6): 348-353]  相似文献   

18.
The Gene Ontology (GO) has become the internationally accepted standard for representing function, process, and location aspects of gene products. The wealth of GO annotation data provides a valuable source of implicit knowledge of relationships among these aspects. We describe a new method for association rule mining to discover implicit co-occurrence relationships across the GO sub-ontologies at multiple levels of abstraction. Prior work on association rule mining in the GO has concentrated on mining knowledge at a single level of abstraction and/or between terms from the same sub-ontology. We have developed a bottom-up generalization procedure called Cross-Ontology Data Mining-Level by Level (COLL) that takes into account the structure and semantics of the GO, generates generalized transactions from annotation data and mines interesting multi-level cross-ontology association rules. We applied our method on publicly available chicken and mouse GO annotation datasets and mined 5368 and 3959 multi-level cross ontology rules from the two datasets respectively. We show that our approach discovers more and higher quality association rules from the GO as evaluated by biologists in comparison to previously published methods. Biologically interesting rules discovered by our method reveal unknown and surprising knowledge about co-occurring GO terms.  相似文献   

19.
20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号