首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
Simultaneous analysis of correlated traits that change with time is an important issue in genetic analyses. Several methodologies have already been proposed for the genetic analysis of longitudinal data on single traits, in particular random regression and character process models. Although the latter proved, in most cases, to compare favourably to alternative approaches for analysis of single function-valued traits, they do not allow a straightforward extension to the multivariate case. In this paper, another methodology (structured antedependence models) is proposed, and methods are derived for the genetic analysis of two or more correlated function-valued traits. Multivariate analyses are presented of fertility and mortality in Drosophila and of milk, fat and protein yields in dairy cattle. These models offer a substantial flexibility for the correlation structure, even in the case of complex non-stationary patterns, and perform better than multivariate random regression models, with fewer parameters.  相似文献   

2.
Modeling nonstationary longitudinal data   总被引:7,自引:0,他引:7  
An important theme of longitudinal data analysis in the past two decades has been the development and use of explicit parametric models for the data's variance-covariance structure. A variety of these models have been proposed, of which most are second-order stationary. A few are flexible enough to accommodate nonstationarity, i.e., nonconstant variances and/or correlations that are not a function solely of elapsed time between measurements. We review five nonstationary models that we regard as most useful: (1) the unstructured covariance model, (2) unstructured antedependence models, (3) structured antedependence models, (4) autoregressive integrated moving average and similar models, and (5) random coefficients models. We evaluate the relative strengths and limitations of each model, emphasizing when it is inappropriate or unlikely to be useful. We present three examples to illustrate the fitting and comparison of the models and to demonstrate that nonstationary longitudinal data can be modeled effectively and, in some cases, quite parsimoniously. In these examples, the antedependence models generally prove to be superior and the random coefficients models prove to be inferior. We conclude that antedependence models should be given much greater consideration than they have historically received.  相似文献   

3.
Pletcher SD  Geyer CJ 《Genetics》1999,153(2):825-835
The extension of classical quantitative genetics to deal with function-valued characters (also called infinite-dimensional characters) such as growth curves, mortality curves, and reaction norms, was begun by Kirkpatrick and co-workers. In this theory, the analogs of variance components for single traits are covariance functions for function-valued traits. In the approach presented here, we employ a variety of parametric models for covariance functions that have a number of desirable properties: the functions (1) are positive definite, (2) can be estimated using procedures like those currently used for single traits, (3) have a small number of parameters, and (4) allow simple hypotheses to be easily tested. The methods are illustrated using data from a large experiment that examined the effects of spontaneous mutations on age-specific mortality rates in Drosophila melanogaster. Our methods are shown to work better than a standard multivariate analysis, which assumes the character value at each age is a distinct character. Advantages over existing methods that model covariance functions as a series of orthogonal polynomials are discussed.  相似文献   

4.
In genetic studies, many interesting traits, including growth curves and skeletal shape, have temporal or spatial structure. They are better treated as curves or function-valued traits. Identification of genetic loci contributing to such traits is facilitated by specialized methods that explicitly address the function-valued nature of the data. Current methods for mapping function-valued traits are mostly likelihood-based, requiring specification of the distribution and error structure. However, such specification is difficult or impractical in many scenarios. We propose a general functional regression approach based on estimating equations that is robust to misspecification of the covariance structure. Estimation is based on a two-step least-squares algorithm, which is fast and applicable even when the number of time points exceeds the number of samples. It is also flexible due to a general linear functional model; changing the number of covariates does not necessitate a new set of formulas and programs. In addition, many meaningful extensions are straightforward. For example, we can accommodate incomplete genotype data, and the algorithm can be trivially parallelized. The framework is an attractive alternative to likelihood-based methods when the covariance structure of the data is not known. It provides a good compromise between model simplicity, statistical efficiency, and computational speed. We illustrate our method and its advantages using circadian mouse behavioral data.  相似文献   

5.
Variation,selection and evolution of function-valued traits   总被引:9,自引:0,他引:9  
We describe an emerging framework for understanding variation, selection and evolution of phenotypic traits that are mathematical functions. We use one specific empirical example – thermal performance curves (TPCs) for growth rates of caterpillars – to demonstrate how models for function-valued traits are natural extensions of more familiar, multivariate models for correlated, quantitative traits. We emphasize three main points. First, because function-valued traits are continuous functions, there are important constraints on their patterns of variation that are not captured by multivariate models. Phenotypic and genetic variation in function-valued traits can be quantified in terms of variance-covariance functions and their associated eigenfunctions: we illustrate how these are estimated as well as their biological interpretations for TPCs. Second, selection on a function-valued trait is itself a function, defined in terms of selection gradient functions. For TPCs, the selection gradient describes how the relationship between an organism's performance and its fitness varies as a function of its temperature. We show how the form of the selection gradient function for TPCs relates to the frequency distribution of environmental states (caterpillar temperatures) during selection. Third, we can predict evolutionary responses of function-valued traits in terms of the genetic variance-covariance and the selection gradient functions. We illustrate how non-linear evolutionary responses of TPCs may occur even when the mean phenotype and the selection gradient are themselves linear functions of temperature. Finally, we discuss some of the methodological and empirical challenges for future studies of the evolution of function-valued traits.  相似文献   

6.
The genetic analysis of female preferences has been seen as a particularly challenging empirical endeavor because of difficulties in generating suitable preference metrics in experiments large enough to adequately characterize variation. In this article, we take an alternative approach, treating female preference as a function-valued trait and exploiting random-coefficient models to characterize the genetic basis of female preference without measuring preference functions in each individual. Applying this approach to Drosophila bunnanda, in which females assess males through a multivariate contact pheromone system, we gain three valuable insights into the genetic basis of female preference functions. First, most genetic variation was attributable to one eigenfunction, suggesting shared genetic control of preferences for nine male pheromones. Second, genetic variance in female preference functions was not associated with genetic variance in the pheromones, implying that genetic variation in female preference did not maintain genetic variation in male traits. Finally, breeding values for female preference functions were skewed away from the direction of selection on the male traits, suggesting directional selection on female preferences. The genetic analysis of female preference functions as function-valued traits offers a robust statistical framework for investigations of female preference, in addition to alleviating some experimental difficulties associated with estimating variation in preference functions.  相似文献   

7.
The genetic analysis of characters that change as a function of some independent and continuous variable has received increasing attention in the biological and statistical literature. Previous work in this area has focused on the analysis of normally distributed characters that are directly observed. We propose a framework for the development and specification of models for a quantitative genetic analysis of function-valued characters that are not directly observed, such as genetic variation in age-specific mortality rates or complex threshold characters. We employ a hybrid Markov chain Monte Carlo algorithm involving a Monte Carlo EM algorithm coupled with a Markov chain approximation to the likelihood, which is quite robust and provides accurate estimates of the parameters in our models. The methods are investigated using simulated data and are applied to a large data set measuring mortality rates in the fruit fly, Drosophila melanogaster.  相似文献   

8.
A non-stationary model for functional mapping of complex traits   总被引:3,自引:0,他引:3  
SUMMARY: Understanding the genetic control of growth is fundamental to agricultural, evolutionary and biomedical genetic research. In this article, we present a statistical model for mapping quantitative trait loci (QTL) that are responsible for genetic differences in growth trajectories during ontogenetic development. This model is derived within the maximum likelihood context, implemented with the expectation-maximization algorithm. We incorporate mathematical aspects of growth processes to model the mean vector and structured antedependence models to approximate time-dependent covariance matrices for longitudinal traits. Our model has been employed to map QTL that affect body mass growth trajectories in both male and female mice of an F2 population derived from the Large and Small mouse strains. The results from this model are compared with those from the autoregressive-based functional mapping approach. Based on results from computer simulation studies, we suggest that these two models are alternative to one another and should be used simultaneously for the same dataset.  相似文献   

9.
10.
This study presents a multivariate, variance component-based QTL mapping model implemented via restricted maximum likelihood (REML). The method was applied to investigate bivariate and univariate QTL mapping analyses, using simulated data. Specifically, we report results on the statistical power to detect a QTL and on the precision of parameter estimates using univariate and bivariate approaches. The model and methodology were also applied to study the effectiveness of partitioning the overall genetic correlation between two traits into a component due to many genes of small effect, and one due to the QTL. It is shown that when the QTL has a pleiotropic effect on two traits, a bivariate analysis leads to a higher statistical power of detecting the QTL and to a more precise estimate of the QTL''s map position, in particular in the case when the QTL has a small effect on the trait. The increase in power is most marked in cases where the contributions of the QTL and of the polygenic components to the genetic correlation have opposite signs. The bivariate REML analysis can successfully partition the two components contributing to the genetic correlation between traits.  相似文献   

11.
Objective: To identify the genetic determinants of obesity using univariate and bivariate models in a genome scan. Research Methods and Procedures: We evaluated the genetic and environmental effects and performed a genome‐wide linkage analysis of obesity‐related traits in 478 subjects from 105 Mexican‐American nuclear families ascertained through a proband with documented coronary artery disease. The available obesity traits include BMI, body surface area (BSA), waist‐to‐hip ratio (WHR), and trunk fat mass as percentage of body weight. Heritability estimates and multipoint linkage analysis were performed using a variance components procedure implemented in SOLAR software. Results: The heritability estimates were 0.62 for BMI, 0.73 for BSA, 0.40 for WHR, and 0.38 for trunk fat mass as percentage of body weight. Using a bivariate genetic model, we observed significant genetic correlations between BMI and other obesity‐related traits (all p < 0.01). Evidence for univariate linkage was observed at 252 to approximately 267 cM on chromosome 2 for three obesity‐related traits (except for WHR) and at 163 to approximately 167 cM on chromosome 5 for BMI and BSA, with the maximum logarithm of the odds ratio score of 3.12 (empirical p value, 0.002) for BSA on chromosome 2. Use of the bivariate linkage model yielded an additional peak (logarithm of the odds ratio = 3.25, empirical p value, 0.002) at 25 cM on chromosome 7 for the pair of BMI and BSA. Discussion: The evidence for linkage on chromosomes 2q36‐37 and 5q36 is supported both by univariate and bivariate analysis, and an additional linkage peak at 7p15 was identified by the bivariate model. This suggests that use of the bivariate model provides additional information to identify linkage of genes responsible for obesity‐related traits.  相似文献   

12.
Quantitative genetic studies that model complex, multivariate phenotypes are important for both evolutionary prediction and artificial selection. For example, changes in gene expression can provide insight into developmental and physiological mechanisms that link genotype and phenotype. However, classical analytical techniques are poorly suited to quantitative genetic studies of gene expression where the number of traits assayed per individual can reach many thousand. Here, we derive a Bayesian genetic sparse factor model for estimating the genetic covariance matrix (G-matrix) of high-dimensional traits, such as gene expression, in a mixed-effects model. The key idea of our model is that we need consider only G-matrices that are biologically plausible. An organism’s entire phenotype is the result of processes that are modular and have limited complexity. This implies that the G-matrix will be highly structured. In particular, we assume that a limited number of intermediate traits (or factors, e.g., variations in development or physiology) control the variation in the high-dimensional phenotype, and that each of these intermediate traits is sparse – affecting only a few observed traits. The advantages of this approach are twofold. First, sparse factors are interpretable and provide biological insight into mechanisms underlying the genetic architecture. Second, enforcing sparsity helps prevent sampling errors from swamping out the true signal in high-dimensional data. We demonstrate the advantages of our model on simulated data and in an analysis of a published Drosophila melanogaster gene expression data set.  相似文献   

13.
Many traits of evolutionary interest, when placed in their developmental, physiological, or environmental contexts, are function-valued. For instance, gene expression during development is typically a function of the age of an organism and physiological processes are often a function of environment. In comparative and experimental studies, a fundamental question is whether the function-valued trait of one group is different from another. To address this question, evolutionary biologists have several statistical methods available. These methods can be classified into one of two types: multivariate and functional. Multivariate methods, including univariate repeated-measures analysis of variance (ANOVA), treat each trait as a finite list of data. Functional methods, such as repeated-measures regression, view the data as a sample of points drawn from an underlying function. A key difference between multivariate and functional methods is that functional methods retain information about the ordering and spacing of a set of data values, information that is discarded by multivariate methods. In this study, we evaluated the importance of that discarded information in statistical analyses of function-valued traits. Our results indicate that functional methods tend to have substantially greater statistical power than multivariate approaches to detect differences in a function-valued trait between groups.  相似文献   

14.
The analysis of nonlinear function-valued characters is very important in genetic studies, especially for growth traits of agricultural and laboratory species. Inference in nonlinear mixed effects models is, however, quite complex and is usually based on likelihood approximations or Bayesian methods. The aim of this paper was to present an efficient stochastic EM procedure, namely the SAEM algorithm, which is much faster to converge than the classical Monte Carlo EM algorithm and Bayesian estimation procedures, does not require specification of prior distributions and is quite robust to the choice of starting values. The key idea is to recycle the simulated values from one iteration to the next in the EM algorithm, which considerably accelerates the convergence. A simulation study is presented which confirms the advantages of this estimation procedure in the case of a genetic analysis. The SAEM algorithm was applied to real data sets on growth measurements in beef cattle and in chickens. The proposed estimation procedure, as the classical Monte Carlo EM algorithm, provides significance tests on the parameters and likelihood based model comparison criteria to compare the nonlinear models with other longitudinal methods.  相似文献   

15.
功能作图框架能够表征复杂动态性状背后的数量性状位点(Quantitative trait locus, QTL)或核苷酸。此前,功能作图广泛应用于单个性状或两个性状的QTL定位当中,多个相关性状功能作图的研究相对匮乏。本研究通过对SAD(Structured antedependence)模型进一步推导,得到在多个性状时用于拟合时间相关协方差矩阵的结构化模型,并使用胡杨(Populus euphratica Oliv.)340个F1代群体的4个生长相关性状以及模拟实验对多维功能作图模型进行测试。结果显示,共定位出173个显著位点,分布于除4、9、10、15、16以外的染色体。功能注释涉及参与叶绿体早期发育、生长素横向运输、提高植物非生物胁迫耐受性以及作为跨膜运输系统开关等生物学过程的32个基因。  相似文献   

16.
Summary The importance of constraints, defined as factors that retard or prevent a population from reaching its immediate adaptive peak on an ecological time scale is analysed. This is done by means of simple quantitative genetic models, which if anything underestimate the importance of constraints. The results show that even in the simplest case the response to selection will not generally be in the same direction as the selection vector, i.e. the direction to the nearest optimum. Adding complexity identifies cases where selection may lead the population in suboptimal directions. It is concluded that information about univariate genetic variances is not sufficient to predict evolutionary responses and may even be misleading. However, genetic covariances are not always acting as constraints, but can under certain circumstances promote evolution towards the nearest optimum. This can be understood by a spectral decomposition of the genetic variance—covariance matrix, where it is shown that the eigenvector associated with the largest amount of variance will to various degrees determine the outcome of selection. A literature survey of the pattern of character covariation in morphological characters in natural populations shows a wide variety of correlation patterns, but quite often shows a high level of covariance between traits. This suggests that constraints to short-term evolution may be more common than generally appreciated.  相似文献   

17.

Background

The focus in dairy cattle breeding is gradually shifting from production to functional traits and genetic parameters of calving traits are estimated more frequently. However, across countries, various statistical models are used to estimate these parameters. This study evaluates different models for calving ease and stillbirth in United Kingdom Holstein-Friesian cattle.

Methods

Data from first and later parity records were used. Genetic parameters for calving ease, stillbirth and gestation length were estimated using the restricted maximum likelihood method, considering different models i.e. sire (−maternal grandsire), animal, univariate and bivariate models. Gestation length was fitted as a correlated indicator trait and, for all three traits, genetic correlations between first and later parities were estimated. Potential bias in estimates was avoided by acknowledging a possible environmental direct-maternal covariance. The total heritable variance was estimated for each trait to discuss its theoretical importance and practical value. Prediction error variances and accuracies were calculated to compare the models.

Results and discussion

On average, direct and maternal heritabilities for calving traits were low, except for direct gestation length. Calving ease in first parity had a significant and negative direct-maternal genetic correlation. Gestation length was maternally correlated to stillbirth in first parity and directly correlated to calving ease in later parities. Multi-trait models had a slightly greater predictive ability than univariate models, especially for the lowly heritable traits. The computation time needed for sire (−maternal grandsire) models was much smaller than for animal models with only small differences in accuracy. The sire (−maternal grandsire) model was robust when additional genetic components were estimated, while the equivalent animal model had difficulties reaching convergence.

Conclusions

For the evaluation of calving traits, multi-trait models show a slight advantage over univariate models. Extended sire models (−maternal grandsire) are more practical and robust than animal models. Estimated genetic parameters for calving traits of UK Holstein cattle are consistent with literature. Calculating an aggregate estimated breeding value including direct and maternal values should encourage breeders to consider both direct and maternal effects in selection decisions.  相似文献   

18.
Evolutionary theory predicts that genetic constraints should be widespread, but empirical support for their existence is surprisingly rare. Commonly applied univariate and bivariate approaches to detecting genetic constraints can underestimate their prevalence, with important aspects potentially tractable only within a multivariate framework. However, multivariate genetic analyses of data from natural populations are challenging because of modest sample sizes, incomplete pedigrees, and missing data. Here we present results from a study of a comprehensive set of life history traits (juvenile survival, age at first breeding, annual fecundity, and longevity) for both males and females in a wild, pedigreed, population of red deer (Cervus elaphus). We use factor analytic modeling of the genetic variance–covariance matrix (G) to reduce the dimensionality of the problem and take a multivariate approach to estimating genetic constraints. We consider a range of metrics designed to assess the effect of G on the deflection of a predicted response to selection away from the direction of fastest adaptation and on the evolvability of the traits. We found limited support for genetic constraint through genetic covariances between traits, both within sex and between sexes. We discuss these results with respect to other recent findings and to the problems of estimating these parameters for natural populations.  相似文献   

19.
20.
Yang W  Tempelman RJ 《Genetics》2012,190(4):1491-1501
Hierarchical mixed effects models have been demonstrated to be powerful for predicting genomic merit of livestock and plants, on the basis of high-density single-nucleotide polymorphism (SNP) marker panels, and their use is being increasingly advocated for genomic predictions in human health. Two particularly popular approaches, labeled BayesA and BayesB, are based on specifying all SNP-associated effects to be independent of each other. BayesB extends BayesA by allowing a large proportion of SNP markers to be associated with null effects. We further extend these two models to specify SNP effects as being spatially correlated due to the chromosomally proximal effects of causal variants. These two models, that we respectively dub as ante-BayesA and ante-BayesB, are based on a first-order nonstationary antedependence specification between SNP effects. In a simulation study involving 20 replicate data sets, each analyzed at six different SNP marker densities with average LD levels ranging from r(2) = 0.15 to 0.31, the antedependence methods had significantly (P < 0.01) higher accuracies than their corresponding classical counterparts at higher LD levels (r(2) > 0. 24) with differences exceeding 3%. A cross-validation study was also conducted on the heterogeneous stock mice data resource (http://mus.well.ox.ac.uk/mouse/HS/) using 6-week body weights as the phenotype. The antedependence methods increased cross-validation prediction accuracies by up to 3.6% compared to their classical counterparts (P < 0.001). Finally, we applied our method to other benchmark data sets and demonstrated that the antedependence methods were more accurate than their classical counterparts for genomic predictions, even for individuals several generations beyond the training data.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号