首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The mixed model for complex segregation analysis of quantitative data from three-generational nuclear families is extended to the multivariate case. Likelihood functions for hypothesis testing are derived for two types of conditional analysis of multiple traits: first when entry to the study depends on the index case's values of all the quantitative traits that are of interest, and second when entry depends on only one trait, but other correlated traits are to be studied simultaneously. Using direct products of covariance matrices, these functions are seen to be direct multivariate equivalence of the univariate functions.  相似文献   

2.
Multiple-trait association mapping, in which multiple traits are used simultaneously in the identification of genetic variants affecting those traits, has recently attracted interest. One class of approaches for this problem builds on classical variance component methodology, utilizing a multitrait version of a linear mixed model. These approaches both increase power and provide insights into the genetic architecture of multiple traits. In particular, it is possible to estimate the genetic correlation, which is a measure of the portion of the total correlation between traits that is due to additive genetic effects. Unfortunately, the practical utility of these methods is limited since they are computationally intractable for large sample sizes. In this article, we introduce a reformulation of the multiple-trait association mapping approach by defining the matrix-variate linear mixed model. Our approach reduces the computational time necessary to perform maximum-likelihood inference in a multiple-trait model by utilizing a data transformation. By utilizing a well-studied human cohort, we show that our approach provides more than a 10-fold speedup, making multiple-trait association feasible in a large population cohort on the genome-wide scale. We take advantage of the efficiency of our approach to analyze gene expression data. By decomposing gene coexpression into a genetic and environmental component, we show that our method provides fundamental insights into the nature of coexpressed genes. An implementation of this method is available at http://genetics.cs.ucla.edu/mvLMM.  相似文献   

3.
This article provides a fully Bayesian approach for modeling of single-dose and complete pharmacokinetic data in a population pharmacokinetic (PK) model. To overcome the impact of outliers and the difficulty of computation, a generalized linear model is chosen with the hypothesis that the errors follow a multivariate Student t distribution which is a heavy-tailed distribution. The aim of this study is to investigate and implement the performance of the multivariate t distribution to analyze population pharmacokinetic data. Bayesian predictive inferences and the Metropolis-Hastings algorithm schemes are used to process the intractable posterior integration. The precision and accuracy of the proposed model are illustrated by the simulating data and a real example of theophylline data.  相似文献   

4.
Although genome-wide association studies (GWASs) have discovered numerous novel genetic variants associated with many complex traits and diseases, those genetic variants typically explain only a small fraction of phenotypic variance. Factors that account for phenotypic variance include environmental factors and gene-by-environment interactions (GEIs). Recently, several studies have conducted genome-wide gene-by-environment association analyses and demonstrated important roles of GEIs in complex traits. One of the main challenges in these association studies is to control effects of population structure that may cause spurious associations. Many studies have analyzed how population structure influences statistics of genetic variants and developed several statistical approaches to correct for population structure. However, the impact of population structure on GEI statistics in GWASs has not been extensively studied and nor have there been methods designed to correct for population structure on GEI statistics. In this paper, we show both analytically and empirically that population structure may cause spurious GEIs and use both simulation and two GWAS datasets to support our finding. We propose a statistical approach based on mixed models to account for population structure on GEI statistics. We find that our approach effectively controls population structure on statistics for GEIs as well as for genetic variants.  相似文献   

5.
A Model for Analysis of Population Structure   总被引:2,自引:3,他引:2       下载免费PDF全文
Arguments have been presented for the appropriateness of a multinomial Dirichlet distribution for describing single-locus genotypic frequencies in a subdivided population. This distribution is defined as a function of allele frequency, the average (over the entire population) inbreeding coefficient and the correlation between genotypes within a subdivision. Alternative parameterizations and their genetic interpretations are given.-We then show how information from a sample drawn from this subdivided population, in the absence of pedigrees, can be combined with the multinomial Dirichlet model to form a likelihood function. This likelihood function is then used as the basis for estimation and testing hypotheses concerning the genetic parameters of the model. Comparisons of this approach to the alternative procedure of Cockerham (1969) and (1973) are made using human data obtained from Tecumseh, Michigan and Monte Carlo simulations.-Finally, implications of these results to statistical inference and to mutation rates are presented.  相似文献   

6.
A class of generalized linear mixed models can be obtained by introducing random effects in the linear predictor of a generalized linear model, e.g. a split plot model for binary data or count data. Maximum likelihood estimation, for normally distributed random effects, involves high-dimensional numerical integration, with severe limitations on the number and structure of the additional random effects. An alternative estimation procedure based on an extension of the iterative re-weighted least squares procedure for generalized linear models will be illustrated on a practical data set involving carcass classification of cattle. The data is analysed as overdispersed binomial proportions with fixed and random effects and associated components of variance on the logit scale. Estimates are obtained with standard software for normal data mixed models. Numerical restrictions pertain to the size of matrices to be inverted. This can be dealt with by absorption techniques familiar from e.g. mixed models in animal breeding. The final model fitted to the classification data includes four components of variance and a multiplicative overdispersion factor. Basically the estimation procedure is a combination of iterated least squares procedures and no full distributional assumptions are needed. A simulation study based on the classification data is presented. This includes a study of procedures for constructing confidence intervals and significance tests for fixed effects and components of variance. The simulation results increase confidence in the usefulness of the estimation procedure.  相似文献   

7.
利用DH或RIL群体检测QTL体系并估计其遗传效应   总被引:38,自引:1,他引:38  
章元明  盖钧镒 《遗传学报》2000,27(7):634-640
利用DH和RIKL群体并结合重复内分组随机区组设计对和物产量等遗传率较低的数量性状进行分离分析可提高遗传分析的精度。根据混合分布理论菜了利用DH或RIL群体重复实验数据鉴定数量性状混合遗传模型的分离分析法,特别是2对链锁主基因+多基因模型。该方法可鉴定数量性状的遗传模型和主基因的作用方式,估计主基因、多基因的遗传疚和遗传方差,在两主基因存在连锁可可估计其重组率。下面通过应用举例说明该方法。  相似文献   

8.
9.
10.
RECENTLY Edelstein1 has concluded on the basis of a numerical analysis that the sequential model as formulated by Koshland, Nemethy and Filmer (KNF)2 describes the oxygen binding curves of a number of species of human haemoglobin less well than does the two-state allosteric model of Monod, Wyman and Changeux (MWC)3. This communication demonstrates that Edelstein's analysis is incomplete and that extension of his analysis reveals that no such conclusion can be drawn from the data considered.  相似文献   

11.
Population structure is a confounding factor in genome-wide association studies, increasing the rate of false positive associations. To correct for it, several model-based algorithms such as ADMIXTURE and STRUCTURE have been proposed. These tend to suffer from the fact that they have a considerable computational burden, limiting their applicability when used with large datasets, such as those produced by next generation sequencing techniques. To address this, nonmodel based approaches such as sparse nonnegative matrix factorization (sNMF) and EIGENSTRAT have been proposed, which scale better with larger data. Here we present a novel nonmodel-based approach, population structure inference using kernel-PCA and optimization (PSIKO), which is based on a unique combination of linear kernel-PCA and least-squares optimization and allows for the inference of admixture coefficients, principal components, and number of founder populations of a dataset. PSIKO has been compared against existing leading methods on a variety of simulation scenarios, as well as on real biological data. We found that in addition to producing results of the same quality as other tested methods, PSIKO scales extremely well with dataset size, being considerably (up to 30 times) faster for longer sequences than even state-of-the-art methods such as sNMF. PSIKO and accompanying manual are freely available at https://www.uea.ac.uk/computing/psiko.  相似文献   

12.
13.
The traditional method for estimating the linear function of fixed parameters in mixed linear model is a two-stage procedure. In the first stage of this procedure the variance components estimators are calculated and next in the second stage these estimators are taken as true values of variance components to estimating the linear function of fixed parameters according to generalized least squares method. In this paper the general mixed linear model is considered in which a matrix related to fixed parameters and or/a dispersion matrix of observation vector may be deficient in rank. It is shown that the estimators of a set of functions of fixed parameters obtained in second stage are unbiased if only the observation vector is symmetrically distributed about its expected value and the estimators of variance components from first stage are translation-invariant and are even functions of the observation vector.  相似文献   

14.
C. Stricker  R. L. Fernando    R. C. Elston 《Genetics》1995,141(4):1651-1656
This paper presents an extension of the finite polygenic mixed model of FERNANDO et al. (1994) to linkage analysis. The finite polygenic mixed model, extended for linkage analysis, leads to a likelihood that can be calculated using efficient algorithms developed for oligogenic models. For comparison, linkage analysis of 5 simulated 4021-member pedigrees was performed using the usual mixed model of inheritance, approximated by HASSTEDT (1982), and the finite polygenic mixed model extended for linkage analysis presented here. Maximum likelihood estimates of the finite polygenic mixed model could be inferred to be closer to the simulated values in these pedigrees.  相似文献   

15.
16.
17.

Background

Molecular genotyping methods have shown infection with more than one Mycobacterium tuberculosis strain genotype in a single sputum culture, indicating mixed infection.

Aim

This study aimed to develop a PCR-based genotyping tool to determine the population structure of M. tuberculosis strain genotypes in primary Mycobacterial Growth Indicator Tubes (MGIT) and Löwenstein–Jensen (LJ) cultures to identify mixed infections and to establish whether the growth media influenced the recovery of certain strain genotypes.

Method

A convenience sample of 206 paired MGIT and LJ M. tuberculosis cultures from pulmonary tuberculosis patients resident in Khayelitsha, South Africa were genotyped using an in-house PCR-based method to detect defined M. tuberculosis strain genotypes.

Results

The sensitivity and specificity of the PCR-based method for detecting Beijing, Haarlem, S-family, and LAM genotypes was 100%, and 75% and 50% for detecting the Low Copy Clade, respectively. Thirty-one (15%) of the 206 cases showed the presence of more than one M. tuberculosis strain genotype. Strains of the Beijing and Haarlem genotypes were significantly more associated with a mixed infection (on both media) when compared to infections with a single strain (Beijing MGIT p = 0.02; LJ, p<0.01) and (Haarlem: MGIT p<0.01; LJ, p = 0.01). Strains with the Beijing genotype were less likely to be with “other genotype” strains (p<0.01) while LAM, Haarlem, S-family and LCC occurred independently with the Beijing genotype.

Conclusion

The PCR-based method was able to identify mixed infection in at least 15% of the cases. LJ media was more sensitive in detecting mixed infections than MGIT media, implying that the growth characteristics of M. tuberculosis on different media may influence our ability to detect mixed infections. The Beijing and Haarlem genotypes were more likely to occur in a mixed infection than any of the other genotypes tested suggesting pathogen-pathogen compatibility.  相似文献   

18.
19.
Many important phenotypic traits in plants are ordinal. However, relatively little is known about the methodologies for ordinal trait association studies. In this study, we proposed a hierarchical generalized linear mixed model for mapping quantitative trait locus (QTL) of ordinal traits in crop cultivars. In this model, all the main-effect QTL and QTL-by-environment interaction were treated as random, while population mean, environmental effect and population structure were fixed. In the estimation of parameters, the pseudo data normal approximation of likelihood function and empirical Bayes approach were adopted. A series of Monte Carlo simulation experiments were performed to confirm the reliability of new method. The result showed that new method works well with satisfactory statistical power and precision. The new method was also adopted to dissect the genetic basis of soybean alkaline-salt tolerance in 257 soybean cultivars obtained, by stratified random sampling, from 6 geographic ecotypes in China. As a result, 6 main-effect QTL and 3 QTL-by-environment interactions were identified.  相似文献   

20.
随着基因芯片的技术的推广,越来越多的表达数据需要被处理和分析.利用这些表达数据提取基因调控矩阵从而构建基因网络是一个重要的问题.通过线性微分方程模型可以初步构建基因网络,了解网络结构,提取最显著的信息.然而由于分子生物学的条件限制或者数据来源的限制,导致实验数据不充分,使方程组无解.本文使用三次样条方法,对26例临床、病理资料完备的具有淋巴结转移的乳腺癌基因表达数据进行插值处理,使表达数据满秩,从而使用最小二乘法解出加权矩阵,构建初步的表达基因调控网络.通过对构建的基因网络的初步分析表明:乳腺癌转移的形成是由多基因异常引起多条传导通路异常,致使细胞恶性转化的结果,这与生物学上公认的看法是相一致的.因此,利用此线性模型方法对基因表达谱进行分析兵有一定可行性,在认识乳腺癌转移机制,乳腺癌诊断和治疗方面具有一定的理论和应用价值.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号