期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Zero-inflated generalized Poisson regression mixture model for mapping quantitative trait loci underlying count trait with many zeros

Cui Y Yang W 《Journal of theoretical biology》2009,256(2):276-285

Phenotypes measured in counts are commonly observed in nature. Statistical methods for mapping quantitative trait loci (QTL) underlying count traits are documented in the literature. The majority of them assume that the count phenotype follows a Poisson distribution with appropriate techniques being applied to handle data dispersion. When a count trait has a genetic basis, “naturally occurring” zero status also reflects the underlying gene effects. Simply ignoring or miss-handling the zero data may lead to wrong QTL inference. In this article, we propose an interval mapping approach for mapping QTL underlying count phenotypes containing many zeros. The effects of QTLs on the zero-inflated count trait are modelled through the zero-inflated generalized Poisson regression mixture model, which can handle the zero inflation and Poisson dispersion in the same distribution. We implement the approach using the EM algorithm with the Newton-Raphson algorithm embedded in the M-step, and provide a genome-wide scan for testing and estimating the QTL effects. The performance of the proposed method is evaluated through extensive simulation studies. Extensions to composite and multiple interval mapping are discussed. The utility of the developed approach is illustrated through a mouse F₂ intercross data set. Significant QTLs are detected to control mouse cholesterol gallstone formation. 相似文献

2.

Bayesian interval mapping of count trait loci based on zero-inflated generalized Poisson regression model

Jinling Chi Ying Zhou Lili Chen Yajing Zhou 《Biometrical journal. Biometrische Zeitschrift》2020,62(6):1428-1442

Count phenotypes with excessive zeros are often observed in the biological world. Researchers have studied many statistical methods for mapping the quantitative trait loci (QTLs) of zero-inflated count phenotypes. However, most of the existing methods consist of finding the approximate positions of the QTLs on the chromosome by genome-wide scanning. Additionally, most of the existing methods use the EM algorithm for parameter estimation. In this paper, we propose a Bayesian interval mapping scheme of QTLs for zero-inflated count data. The method takes advantage of a zero-inflated generalized Poisson (ZIGP) regression model to study the influence of QTLs on the zero-inflated count phenotype. The MCMC algorithm is used to estimate the effects and position parameters of QTLs. We use the Haldane map function to realize the conversion between recombination rate and map distance. Monte Carlo simulations are conducted to test the applicability and advantage of the proposed method. The effects of QTLs on the formation of mouse cholesterol gallstones were demonstrated by analyzing an mouse data set. 相似文献

3.

Interval mapping of quantitative trait loci for time-to-event data with the proportional hazards mixture cure model

Liu M Lu W Shao Y 《Biometrics》2006,62(4):1053-1061

Interval mapping using normal mixture models has been an important tool for analyzing quantitative traits in experimental organisms. When the primary phenotype is time-to-event, it is natural to use survival models such as Cox's proportional hazards model instead of normal mixtures to model the phenotype distribution. An extra challenge for modeling time-to-event data is that the underlying population may consist of susceptible and nonsusceptible subjects. In this article, we propose a semiparametric proportional hazards mixture cure model which allows missing covariates. We discuss applications to quantitative trait loci (QTL) mapping when the primary trait is time-to-event from a population of mixed susceptibility. This model can be used to characterize QTL effects on both susceptibility and time-to-event distribution, and to estimate QTL location. The model can naturally incorporate covariate effects of other risk factors. Maximum likelihood estimates for the parameters in the model as well as their corresponding variance estimates can be obtained numerically using an EM-type algorithm. The proposed methods are assessed by simulations under practical settings and illustrated using a real data set containing survival times of mice after infection with Listeria monocytogenes. An extension to multiple intervals is also discussed. 相似文献

4.

A general mixture model for mapping quantitative trait loci by using molecular markers 总被引：3，自引：0，他引：3

R. C. Jansen 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》1992,85(2-3):252-260

Summary In a segregating population a quantitative trait may be considered to follow a mixture of (normal) distributions, the mixing proportions being based on Mendelian segregation rules. A general and flexible mixture model is proposed for mapping quantitative trait loci (QTLs) by using molecular markers. A method is discribed to fit the model to data. The model makes it possible to (1) analyse non-normally distributed traits such as lifetimes, counts or percentages in addition to normally distributed traits, (2) reduce environmental variation by taking into account the effects of experimental design factors and interaction between genotype and environment, (3) reduce genotypic variation by taking into account the effects of two or more QTLs simultaneously, (4) carry out a (combined) analysis of different population types, (5) estimate recombination frequencies between markers or use known marker distances, (6) cope with missing marker observations, (7) use markers as covariables in detection and mapping of QTLs, and finally to (8) implement the mapping in standard statistical packages. 相似文献

5.

Semiparametric methods for mapping quantitative trait loci with censored data 总被引：1，自引：0，他引：1

Diao G Lin DY 《Biometrics》2005,61(3):789-798

Statistical methods for the detection of genes influencing quantitative traits with the aid of genetic markers are well developed for normally distributed, fully observed phenotypes. Many experiments are concerned with failure-time phenotypes, which have skewed distributions and which are usually subject to censoring because of random loss to follow-up, failures from competing causes, or limited duration of the experiment. In this article, we develop semiparametric statistical methods for mapping quantitative trait loci (QTLs) based on censored failure-time phenotypes. We formulate the effects of the QTL genotype on the failure time through the Cox (1972, Journal of the Royal Statistical Society, Series B 34, 187-220) proportional hazards model and derive efficient likelihood-based inference procedures. In addition, we show how to assess statistical significance when searching several regions or the entire genome for QTLs. Extensive simulation studies demonstrate that the proposed methods perform well in practical situations. Applications to two animal studies are provided. 相似文献

6.

A logistic regression mixture model for interval mapping of genetic trait loci affecting binary phenotypes

下载免费PDF全文

Deng W Chen H Li Z 《Genetics》2006,172(2):1349-1358

Often in genetic research, presence or absence of a disease is affected by not only the trait locus genotypes but also some covariates. The finite logistic regression mixture models and the methods under the models are developed for detection of a binary trait locus (BTL) through an interval-mapping procedure. The maximum-likelihood estimates (MLEs) of the logistic regression parameters are asymptotically unbiased. The null asymptotic distributions of the likelihood-ratio test (LRT) statistics for detection of a BTL are found to be given by the supremum of a chi2-process. The limiting null distributions are free of the null model parameters and are determined explicitly through only four (backcross case) or nine (intercross case) independent standard normal random variables. Therefore a threshold for detecting a BTL in a flanking marker interval can be approximated easily by using a Monte Carlo method. It is pointed out that use of a threshold incorrectly determined by reading off a chi2-probability table can result in an excessive false BTL detection rate much more severely than many researchers might anticipate. Simulation results show that the BTL detection procedures based on the thresholds determined by the limiting distributions perform quite well when the sample sizes are moderately large. 相似文献

7.

Generalized linear model for interval mapping of quantitative trait loci

Shizhong Xu Zhiqiu Hu 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2010,121(1):47-63

We developed a generalized linear model of QTL mapping for discrete traits in line crossing experiments. Parameter estimation was achieved using two different algorithms, a mixture model-based EM (expectation–maximization) algorithm and a GEE (generalized estimating equation) algorithm under a heterogeneous residual variance model. The methods were developed using ordinal data, binary data, binomial data and Poisson data as examples. Applications of the methods to simulated as well as real data are presented. The two different algorithms were compared in the data analyses. In most situations, the two algorithms were indistinguishable, but when large QTL are located in large marker intervals, the mixture model-based EM algorithm can fail to converge to the correct solutions. Both algorithms were coded in C++ and interfaced with SAS as a user-defined SAS procedure called PROC QTL. 相似文献

8.

Interval mapping of quantitative trait loci with selective DNA pooling data

Jing Wang Kenneth J Koehler Jack CM Dekkers 《遗传、选种与进化》2007,39(6):685-709

Selective DNA pooling is an efficient method to identify chromosomal regions that harbor quantitative trait loci (QTL) by comparing marker allele frequencies in pooled DNA from phenotypically extreme individuals. Currently used single marker analysis methods can detect linkage of markers to a QTL but do not provide separate estimates of QTL position and effect, nor do they utilize the joint information from multiple markers. In this study, two interval mapping methods for analysis of selective DNA pooling data were developed and evaluated. One was based on least squares regression (LS-pool) and the other on approximate maximum likelihood (ML-pool). Both methods simultaneously utilize information from multiple markers and multiple families and can be applied to different family structures (half-sib, F2 cross and backcross). The results from these two interval mapping methods were compared with results from single marker analysis by simulation. The results indicate that both LS-pool and ML-pool provided greater power to detect the QTL than single marker analysis. They also provide separate estimates of QTL location and effect. With large family sizes, both LS-pool and ML-pool provided similar power and estimates of QTL location and effect as selective genotyping. With small family sizes, however, the LS-pool method resulted in severely biased estimates of QTL location for distal QTL but this bias was reduced with the ML-pool. 相似文献

9.

Bayesian LASSO for quantitative trait loci mapping 总被引：7，自引：1，他引：6

下载免费PDF全文

Yi N Xu S 《Genetics》2008,179(2):1045-1055

The mapping of quantitative trait loci (QTL) is to identify molecular markers or genomic loci that influence the variation of complex traits. The problem is complicated by the facts that QTL data usually contain a large number of markers across the entire genome and most of them have little or no effect on the phenotype. In this article, we propose several Bayesian hierarchical models for mapping multiple QTL that simultaneously fit and estimate all possible genetic effects associated with all markers. The proposed models use prior distributions for the genetic effects that are scale mixtures of normal distributions with mean zero and variances distributed to give each effect a high probability of being near zero. We consider two types of priors for the variances, exponential and scaled inverse-chi(2) distributions, which result in a Bayesian version of the popular least absolute shrinkage and selection operator (LASSO) model and the well-known Student's t model, respectively. Unlike most applications where fixed values are preset for hyperparameters in the priors, we treat all hyperparameters as unknowns and estimate them along with other parameters. Markov chain Monte Carlo (MCMC) algorithms are developed to simulate the parameters from the posteriors. The methods are illustrated using well-known barley data. 相似文献

10.

Testing approaches for overdispersion in poisson regression versus the generalized poisson model 总被引：1，自引：0，他引：1

Yang Z Hardin JW Addy CL Vuong QH 《Biometrical journal. Biometrische Zeitschrift》2007,49(4):565-584

Overdispersion is a common phenomenon in Poisson modeling, and the negative binomial (NB) model is frequently used to account for overdispersion. Testing approaches (Wald test, likelihood ratio test (LRT), and score test) for overdispersion in the Poisson regression versus the NB model are available. Because the generalized Poisson (GP) model is similar to the NB model, we consider the former as an alternate model for overdispersed count data. The score test has an advantage over the LRT and the Wald test in that the score test only requires that the parameter of interest be estimated under the null hypothesis. This paper proposes a score test for overdispersion based on the GP model and compares the power of the test with the LRT and Wald tests. A simulation study indicates the score test based on asymptotic standard Normal distribution is more appropriate in practical application for higher empirical power, however, it underestimates the nominal significance level, especially in small sample situations, and examples illustrate the results of comparing the candidate tests between the Poisson and GP models. A bootstrap test is also proposed to adjust the underestimation of nominal level in the score statistic when the sample size is small. The simulation study indicates the bootstrap test has significance level closer to nominal size and has uniformly greater power than the score test based on asymptotic standard Normal distribution. From a practical perspective, we suggest that, if the score test gives even a weak indication that the Poisson model is inappropriate, say at the 0.10 significance level, we advise the more accurate bootstrap procedure as a better test for comparing whether the GP model is more appropriate than Poisson model. Finally, the Vuong test is illustrated to choose between GP and NB2 models for the same dataset. 相似文献

11.

Lineage-specific mapping of quantitative trait loci

C Chen K Ritland 《Heredity》2013,111(2):106-113

We present an approach for quantitative trait locus (QTL) mapping, termed as ‘lineage-specific QTL mapping'', for inferring allelic changes of QTL evolution along with branches in a phylogeny. We describe and analyze the simplest case: by adding a third taxon into the normal procedure of QTL mapping between pairs of taxa, such inferences can be made along lineages to a presumed common ancestor. Although comparisons of QTL maps among species can identify homology of QTLs by apparent co-location, lineage-specific mapping of QTL can classify homology into (1) orthology (shared origin of QTL) versus (2) paralogy (independent origin of QTL within resolution of map distance). In this light, we present a graphical method that identifies six modes of QTL evolution in a three taxon comparison. We then apply our model to map lineage-specific QTLs for inbreeding among three taxa of yellow monkey-flower: Mimulus guttatus and two inbreeders M. platycalyx and M. micranthus, but critically assuming outcrossing was the ancestral state. The two most common modes of homology across traits were orthologous (shared ancestry of mutation for QTL alleles). The outbreeder M. guttatus had the fewest lineage-specific QTL, in accordance with the presumed ancestry of outbreeding. Extensions of lineage-specific QTL mapping to other types of data and crosses, and to inference of ancestral QTL state, are discussed. 相似文献

12.

On the differences between maximum likelihood and regression interval mapping in the analysis of quantitative trait loci 总被引：10，自引：0，他引：10

Kao CH 《Genetics》2000,156(2):855-865

The differences between maximum-likelihood (ML) and regression (REG) interval mapping in the analysis of quantitative trait loci (QTL) are investigated analytically and numerically by simulation. The analytical investigation is based on the comparison of the solution sets of the ML and REG methods in the estimation of QTL parameters. Their differences are found to relate to the similarity between the conditional posterior and conditional probabilities of QTL genotypes and depend on several factors, such as the proportion of variance explained by QTL, relative QTL position in an interval, interval size, difference between the sizes of QTL, epistasis, and linkage between QTL. The differences in mean squared error (MSE) of the estimates, likelihood-ratio test (LRT) statistics in testing parameters, and power of QTL detection between the two methods become larger as (1) the proportion of variance explained by QTL becomes higher, (2) the QTL locations are positioned toward the middle of intervals, (3) the QTL are located in wider marker intervals, (4) epistasis between QTL is stronger, (5) the difference between QTL effects becomes larger, and (6) the positions of QTL get closer in QTL mapping. The REG method is biased in the estimation of the proportion of variance explained by QTL, and it may have a serious problem in detecting closely linked QTL when compared to the ML method. In general, the differences between the two methods may be minor, but can be significant when QTL interact or are closely linked. The ML method tends to be more powerful and to give estimates with smaller MSEs and larger LRT statistics. This implies that ML interval mapping can be more accurate, precise, and powerful than REG interval mapping. The REG method is faster in computation, especially when the number of QTL considered in the model is large. Recognizing the factors affecting the differences between REG and ML interval mapping can help an efficient strategy, using both methods in QTL mapping to be outlined. 相似文献

13.

Bayesian quantitative trait loci mapping for multiple traits 总被引：1，自引：0，他引：1

下载免费PDF全文

Banerjee S Yandell BS Yi N 《Genetics》2008,179(4):2275-2289

Most quantitative trait loci (QTL) mapping experiments typically collect phenotypic data on multiple correlated complex traits. However, there is a lack of a comprehensive genomewide mapping strategy for correlated traits in the literature. We develop Bayesian multiple-QTL mapping methods for correlated continuous traits using two multivariate models: one that assumes the same genetic model for all traits, the traditional multivariate model, and the other known as the seemingly unrelated regression (SUR) model that allows different genetic models for different traits. We develop computationally efficient Markov chain Monte Carlo (MCMC) algorithms for performing joint analysis. We conduct extensive simulation studies to assess the performance of the proposed methods and to compare with the conventional single-trait model. Our methods have been implemented in the freely available package R/qtlbim (http://www.qtlbim.org), which greatly facilitates the general usage of the Bayesian methodology for unraveling the genetic architecture of complex traits. 相似文献

14.

Sib mating designs for mapping quantitative trait loci

Chongqing Xie Damian D.G. Gessler Shizhong Xu 《Genetica》1998,104(1):9-19

The power to separate the variance of a quantitative trait locus (QTL) from the polygenic variance is determined by the variability of genes identical by descent (IBD) at the QTL. This variability may increase with inbreeding. Selfing, the most extreme form of inbreeding, increases the variability of the IBD value shared by siblings, and thus has a higher efficiency for QTL mapping than random mating. In self-incompatible organisms, sib mating is the closest form of inbreeding. Similar to selfing, sib mating may also increase the power of QTL detection relative to random mating. In this study, we develop an IBD-based method under sib mating designs for QTL mapping. The efficiency of sib mating is then compared with random mating. Monte Carlo simulations show that sib mating designs notably increase the power for QTL detection. When power is intermediate, the power to detect a QTL using full-sib mating is, on average, 7% higher than under random mating. In addition, the IBD-based method proposed in this paper can be used to combine data from multiple families. As a result, the estimated QTL parameters can be applied to a wide statistical inference space relating to the entire reference population. This revised version was published online in July 2006 with corrections to the Cover Date. 相似文献

15.

Multiple interval mapping for quantitative trait loci. 总被引：72，自引：0，他引：72

C H Kao Z B Zeng R D Teasdale 《Genetics》1999,152(3):1203-1216

A new statistical method for mapping quantitative trait loci (QTL), called multiple interval mapping (MIM), is presented. It uses multiple marker intervals simultaneously to fit multiple putative QTL directly in the model for mapping QTL. The MIM model is based on Cockerham's model for interpreting genetic parameters and the method of maximum likelihood for estimating genetic parameters. With the MIM approach, the precision and power of QTL mapping could be improved. Also, epistasis between QTL, genotypic values of individuals, and heritabilities of quantitative traits can be readily estimated and analyzed. Using the MIM model, a stepwise selection procedure with likelihood ratio test statistic as a criterion is proposed to identify QTL. This MIM method was applied to a mapping data set of radiata pine on three traits: brown cone number, tree diameter, and branch quality scores. Based on the MIM result, seven, six, and five QTL were detected for the three traits, respectively. The detected QTL individually contributed from approximately 1 to 27% of the total genetic variation. Significant epistasis between four pairs of QTL in two traits was detected, and the four pairs of QTL contributed approximately 10.38 and 14.14% of the total genetic variation. The asymptotic variances of QTL positions and effects were also provided to construct the confidence intervals. The estimated heritabilities were 0.5606, 0.5226, and 0. 3630 for the three traits, respectively. With the estimated QTL effects and positions, the best strategy of marker-assisted selection for trait improvement for a specific purpose and requirement can be explored. The MIM FORTRAN program is available on the worldwide web (http://www.stat.sinica.edu.tw/chkao/). 相似文献

16.

Marker pair selection for mapping quantitative trait loci 总被引：10，自引：0，他引：10

Piepho HP Gauch HG 《Genetics》2001,157(1):433-444

Mapping of quantitative trait loci (QTL) for backcross and F(2) populations may be set up as a multiple linear regression problem, where marker types are the regressor variables. It has been shown previously that flanking markers absorb all information on isolated QTL. Therefore, selection of pairs of markers flanking QTL is useful as a direct approach to QTL detection. Alternatively, selected pairs of flanking markers can be used as cofactors in composite interval mapping (CIM). Overfitting is a serious problem, especially if the number of regressor variables is large. We suggest a procedure denoted as marker pair selection (MPS) that uses model selection criteria for multiple linear regression. Markers enter the model in pairs, which reduces the number of models to be considered, thus alleviating the problem of overfitting and increasing the chances of detecting QTL. MPS entails an exhaustive search per chromosome to maximize the chance of finding the best-fitting models. A simulation study is conducted to study the merits of different model selection criteria for MPS. On the basis of our results, we recommend the Schwarz Bayesian criterion (SBC) for use in practice. 相似文献

17.

A general mixture model approach for mapping quantitative trait loci from diverse cross designs involving multiple inbred lines

Liu Y Zeng ZB 《Genetical research》2000,75(3):345-355

Most current statistical methods developed for mapping quantitative trait loci (QTL) based on inbred line designs apply to crosses from two inbred lines. Analysis of QTL in these crosses is restricted by the parental genetic differences between lines. Crosses from multiple inbred lines or multiple families are common in plant and animal breeding programmes, and can be used to increase the efficiency of a QTL mapping study. A general statistical method using mixture model procedures and the EM algorithm is developed for mapping QTL from various cross designs of multiple inbred lines. The general procedure features three cross design matrices, W, that define the contribution of parental lines to a particular cross and a genetic design matrix, D, that specifies the genetic model used in multiple line crosses. By appropriately specifying W matrices, the statistical method can be applied to various cross designs, such as diallel, factorial, cyclic, parallel or arbitrary-pattern cross designs with two or multiple parental lines. Also, with appropriate specification for the D matrix, the method can be used to analyse different kinds of cross populations, such as F2 backcross, four-way cross and mixed crosses (e.g. combining backcross and F2). Simulation studies were conducted to explore the properties of the method, and confirmed its applicability to diverse experimental designs. 相似文献

18.

A bivalent polyploid model for mapping quantitative trait loci in outcrossing tetraploids

Wu R Ma CX Casella G 《Genetics》2004,166(1):581-595

Two major aspects have made the genetic and genomic study of polyploids extremely difficult. First, increased allelic or nonallelic combinations due to multiple alleles result in complex gene actions and interactions for quantitative trait loci (QTL) in polyploids. Second, meiotic configurations in polyploids undergo a complex biological process including either bivalent or multivalent formation, or both. For bivalent polyploids, different degrees of preferential chromosome pairings may occur during meiosis. In this article, we develop a maximum-likelihood-based model for mapping QTL in tetraploids by considering the quantitative inheritance and meiotic mechanism of bivalent polyploids. This bivalent polyploid model is implemented with the EM algorithm to simultaneously estimate QTL position, QTL effects, and QTL-marker linkage phases by incorporating the impact of a cytological parameter determining bivalent chromosome pairings (the preferential pairing factor). Simulation studies are performed to investigate the performance and robustness of our statistical method for parameter estimation. The implication and extension of the bivalent polyploid model are discussed. 相似文献

19.

Selection bias in quantitative trait loci mapping

Lee C 《The Journal of heredity》2005,96(4):363-367

A simulation study was performed to see whether selection affected quantitative trait loci (QTL) mapping. Populations under random selection, under selection among full-sib families, and under selection within a full-sib family were simulated each with heritability of 0.3, 0.5, and 0.7. They were analyzed with the marker spacing of 10 cM and 20 cM. The accuracy for QTL detection decreased for the populations under selection within full-sib family. Estimates of QTL effects and positions differed (P < .05) from their input values. The problems could be ignored when mapping a QTL for the populations under selection among full-sib families. A large heritability helped reduction of such problems. When the animals were selected within a full-sib family, the QTL was detected for the populations with heritability of 0.5 or larger using the marker spacing of 10 cM, and with heritability of 0.7 using the marker spacing of 20 cM. This study implied that when selection was introduced, the accuracy for QTL detection decreased and the estimates of QTL effects were biased. A caution was warranted on the decision of data (including selected animals to be genotyped) for QTL mapping. 相似文献

20.

A simple regression method for mapping quantitative trait loci in line crosses using flanking markers 总被引：94，自引：0，他引：94

Haley CS Knott SA 《Heredity》1992,69(4):315-324

The use of flanking marker methods has proved to be a powerful tool for the mapping of quantitative trait loci (QTL) in the segregating generations derived from crosses between inbred lines. Methods to analyse these data, based on maximum-likelihood, have been developed and provide good estimates of QTL effects in some situations. Maximum-likelihood methods are, however, relatively complex and can be computationally slow. In this paper we develop methods for mapping QTL based on multiple regression which can be applied using any general statistical package. We use the example of mapping in an F(2) population and show that these regression methods produce very similar results to those obtained using maximum likelihood. The relative simplicity of the regression methods means that models with more than a single QTL can be explored and we give examples of two lined loci and of two interacting loci. Other models, for example with more than two QTL, with environmental fixed effects, with between family variance or for threshold traits, could be fitted in a similar way. The ease, speed of application and generality of regression methods for flanking marker analysis, and the good estimates they obtain, suggest that they should provide the method of choice for the analysis of QTL mapping data from inbred line crosses. 相似文献