首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
    
There has been a tremendous advancement of Bayesian methodology in quantitative genetics and evolutionary biology. Still, there are relatively few publications that apply this methodology, probably because the availability of multipurpose and user-friendly software is somewhat limited. It is here described how only a few rows of code of the well-developed and very flexible Bayesian software WinBUGS ( Lunn et al. 2000 ) can be used for inference of the additive polygenic variance and heritabilty in pedigrees of general design. The presented code is illustrated by application to an earlier published dataset of Scots pine.  相似文献   

2.
We present a Bayesian method for characterizing the mating system of populations reproducing through a mixture of self-fertilization and random outcrossing. Our method uses patterns of genetic variation across the genome as a basis for inference about reproduction under pure hermaphroditism, gynodioecy, and a model developed to describe the self-fertilizing killifish Kryptolebias marmoratus. We extend the standard coalescence model to accommodate these mating systems, accounting explicitly for multilocus identity disequilibrium, inbreeding depression, and variation in fertility among mating types. We incorporate the Ewens sampling formula (ESF) under the infinite-alleles model of mutation to obtain a novel expression for the likelihood of mating system parameters. Our Markov chain Monte Carlo (MCMC) algorithm assigns locus-specific mutation rates, drawn from a common mutation rate distribution that is itself estimated from the data using a Dirichlet process prior model. Our sampler is designed to accommodate additional information, including observations pertaining to the sex ratio, the intensity of inbreeding depression, and other aspects of reproduction. It can provide joint posterior distributions for the population-wide proportion of uniparental individuals, locus-specific mutation rates, and the number of generations since the most recent outcrossing event for each sampled individual. Further, estimation of all basic parameters of a given model permits estimation of functions of those parameters, including the proportion of the gene pool contributed by each sex and relative effective numbers.  相似文献   

3.
  总被引:2,自引:0,他引:2  
Accurate and fast estimation of genetic parameters that underlie quantitative traits using mixed linear models with additive and dominance effects is of great importance in both natural and breeding populations. Here, we propose a new fast adaptive Markov chain Monte Carlo (MCMC) sampling algorithm for the estimation of genetic parameters in the linear mixed model with several random effects. In the learning phase of our algorithm, we use the hybrid Gibbs sampler to learn the covariance structure of the variance components. In the second phase of the algorithm, we use this covariance structure to formulate an effective proposal distribution for a Metropolis-Hastings algorithm, which uses a likelihood function in which the random effects have been integrated out. Compared with the hybrid Gibbs sampler, the new algorithm had better mixing properties and was approximately twice as fast to run. Our new algorithm was able to detect different modes in the posterior distribution. In addition, the posterior mode estimates from the adaptive MCMC method were close to the REML (residual maximum likelihood) estimates. Moreover, our exponential prior for inverse variance components was vague and enabled the estimated mode of the posterior variance to be practically zero, which was in agreement with the support from the likelihood (in the case of no dominance). The method performance is illustrated using simulated data sets with replicates and field data in barley.  相似文献   

4.
    
Methods to examine whether genetic and/or environmental sources can account for the residual variation in ordinal family data usually assume proportional odds. However, standard software to fit the non‐proportional odds model to ordinal family data is limited because the correlation structure of family data is more complex than for other types of clustered data. To perform these analyses we propose the non‐proportional odds multivariate logistic regression model and take a simulation‐based approach to model fitting using Markov chain Monte Carlo methods, such as partially collapsed Gibbs sampling and the Metropolis algorithm. We applied the proposed methodology to male pattern baldness data from the Victorian Family Heart Study.  相似文献   

5.
Multiple shrinkage and subset selection in wavelets   总被引:6,自引:0,他引:6  
  相似文献   

6.
    
Knowledge about the reproductive system of species inhabiting rocky habitats is scarce. The reproductive biology (floral biology, experimental pollination, insect visits, inbreeding depression, and seed predation) of the rupicolous Linaria cavanillesii was analyzed under field and experimental conditions. Self-compatibility was revealed by the high fruit set, seed set, and seed mass in the pollination experiment. Furthermore, results disclose that this species does not need insect visitors for seed production since fruit set after autonomous self-pollination was similar to that by hand cross-pollination. Self-offsprings were not affected by a strong degree of inbreeding depression in early life-cycle stages. Seed predation by Cucurlionidae is the main limiting factor of fruit and seed production in this species. Contrary to other perennial species of Linaria previously studied, L. cavanillesii represents the first clear report of self-fertility.  相似文献   

7.
Motif识别是计算生物学中的重要问题.处理缺失数据的方法被大家广泛应用于生物序列中的Motif识别,例如EM算法,Gibbs抽样等等.现在识别Motif的方法都是首先假定Motif的长度是给的,但是,事实上Motif的长度是未知的,在这篇文章中,我们用Gibbs抽样算法在寻找Motif的位置的同时确定Motif的长度.  相似文献   

8.
    
The relationship between the number of open flowers and pollen transfer to stigmas was estimated in three insect-pollinated species (Cynoglossum officinale, Echium vulgare and Oenothera erythrocepala) in 1995 and 1996, using fluorescent dye as a pollen analogue. Dye was applied to an equal number of flowers on a small, a medium and a large individual and its pattern of dispersal to stigmas of conspecifics in local populations was observed. In Cynoglossum officinale and in Echium vulgare large individuals tended to receive more visits per flower. In Oenothera erythrocepala no difference in number of visits per flower was found between small and large individuals. Geitonogamous pollination clearly occurred in all species. In Cynoglossum officinale large individuals transferred more pollen per flower within the same plant (geitonogamy) and an equal or a slightly smaller amount between plants, compared to small individuals. As a result total pollen transfer per flower increased in both years with increasing flower number. In Echium vulgare and in Oenothera erythrocepala large individuals transferred more pollen per flower within the plant (pollen discounting), but a significantly lower amount per flower to other individuals in the populations compared to small individuals. For both these species, there is no difference in total pollen transfer between small and large individuals. In general, the male fitness curve, as estimated by pollen transfer, is a decelerating function of flower number. We discuss this in relation to optimal sex-allocation and the pollen donation hypothesis.  相似文献   

9.
    
It has been assumed, based on theoretical studies, that lethals with the level of dominance estimated from experimental studies would have an allele frequency that is virtually independent of effective population size. However, here it is shown numerically that the expected frequency of lethals with low levels of dominance is also dependent on finite population size, although not as much as completely recessive lethals. This finding is significant in determining the standing level of inbreeding depression and the consequent potential for the evolution of self-fertilization. In addition, the architecture of genetic variation influencing inbreeding depression in populations with a history of small size may be of important consequence in endangered species. Finally, it is shown that the loss of lethal genetic variation often occurs much more quickly than the regeneration of lethal variation by mutation. This asymmetry may result in a lower standing genetic variation for inbreeding depression than expected from mutation rates and contemporary population size data.  相似文献   

10.
    
Burgette LF  Reiter JP 《Biometrics》2012,68(1):92-100
We describe a Bayesian quantile regression model that uses a confirmatory factor structure for part of the design matrix. This model is appropriate when the covariates are indicators of scientifically determined latent factors, and it is these latent factors that analysts seek to include as predictors in the quantile regression. We apply the model to a study of birth weights in which the effects of latent variables representing psychosocial health and actual tobacco usage on the lower quantiles of the response distribution are of interest. The models can be fit using an R package called factorQR.  相似文献   

11.
    
Chen HY  Xie H  Qian Y 《Biometrics》2011,67(3):799-809
Multiple imputation is a practically useful approach to handling incompletely observed data in statistical analysis. Parameter estimation and inference based on imputed full data have been made easy by Rubin's rule for result combination. However, creating proper imputation that accommodates flexible models for statistical analysis in practice can be very challenging. We propose an imputation framework that uses conditional semiparametric odds ratio models to impute the missing values. The proposed imputation framework is more flexible and robust than the imputation approach based on the normal model. It is a compatible framework in comparison to the approach based on fully conditionally specified models. The proposed algorithms for multiple imputation through the Markov chain Monte Carlo sampling approach can be straightforwardly carried out. Simulation studies demonstrate that the proposed approach performs better than existing, commonly used imputation approaches. The proposed approach is applied to imputing missing values in bone fracture data.  相似文献   

12.
Abstract Analyses of whole-genome data often reveal that some genes have evolutionary histories that diverge from the majority phylogeny estimated for the entire genome. We present a probabilistic model that deals with heterogeneity among gene trees, implement it via the Gibbs sampler, and apply it to the plastid genome. Plastids and their genomes are transmitted as a single block without recombination, hence homogeneity among gene trees within this genome is expected. Nevertheless, previous work has revealed clear heterogeneity among plastid genes (e.g., Delwiche and Palmer 1996). Other studies, using whole plastid genomes of various algae and land plants, found little additional heterogeneity (Martin et al. 1998; Adachi et al. 2000). We augment the earlier studies by using a data set of 14 taxa: 6 land plants, 2 green algae, a diatom, 2 red algae and a cryptophyte, the cyanelle of the glaucocystophyte Cyanophora, and the blue–green alga Synechocystis as an outgroup. Contrary to the earlier analyses, we cannot find even a single, dominant consensus tree. Therefore, we formulate a probabilistic model that divides the genes into two sets: those that follow the consensus tree and those that have independent gene trees. No particular tree is supported by more than three-fourths of the genes. But the set of genes that follows a certain tree is fairly independent of data processing and the method of analysis. With one possible exception, we find no evidence for collinear or functionally related genes to follow similar trees. The phylogenetic pattern also seems independent of bias in amino acid composition. Among possible explanations for the observed phenomenon, the hypothesis that different genes have different covarion structures is difficult to assess. But gene duplication may be possible through the inverted or direct repeat regions, while horizontal gene transfer seems less likely. In contrast to green algae and land plants, inverted repeat regions in red algae and in Cyanophora show abundant differences among the copies. Thus, genes may get duplicated when they are recruited into the inverted repeat region and one of the two copies may be lost after leaving the inverted repeat region.  相似文献   

13.
    
Kim S  Wang Z  Dalkilic M 《Proteins》2007,66(3):671-681
The motif prediction problem is to predict short, conserved subsequences that are part of a family of sequences, and it is a very important biological problem. Gibbs is one of the first successful motif algorithms and it runs very fast compared with other algorithms, and its search behavior is based on the well-studied Gibbs random sampling. However, motif prediction is a very difficult problem and Gibbs may not predict true motifs in some cases. Thus, the authors explored a possibility of improving the prediction accuracy of Gibbs while retaining its fast runtime performance. In this paper, the authors considered Gibbs only for proteins, not for DNA binding sites. The authors have developed iGibbs, an integrated motif search framework for proteins that employs two previous techniques of their own: one for guiding motif search by clustering sequences and another by pattern refinement. These two techniques are combined to a new double clustering approach to guiding motif search. The unique feature of their framework is that users do not have to specify the number of motifs to be predicted when motifs occur in different subsets of the input sequences since it automatically clusters input sequences into clusters and predict motifs from the clusters. Tests on the PROSITE database show that their framework improved the prediction accuracy of Gibbs significantly. Compared with more exhaustive search methods like MEME, iGibbs predicted motifs more accurately and runs one order of magnitude faster.  相似文献   

14.
We propose a discrete-time Bayesian hierarchical model for the population dynamics of the great gerbil-flea ecological system. The model accounts for the sampling variability arising from data originally collected for other purposes. The prior for the unknown population densities incorporates specific biological hypotheses regarding the interacting dynamics of the two species, as well as their life cycles, where density-dependent effects are included. Posterior estimates are obtained via Markov chain Monte Carlo. The variance of the observed density estimates is a quadratic function of the unknown density. Our study indicates the presence of a density-dependent growth rate for the gerbil population. For the flea population there is clear evidence of density-dependent over-summer net growth, which is dependent on the flea-to-gerbil ratio at the beginning of the reproductive summer. Over-winter net growth is favored by high density. We estimate that on average 35% of the gerbil population survives the winter. Our study shows that hierarchical Bayesian models can be useful in extracting ecobiological information from observational data.  相似文献   

15.
非损伤性取样法在保护遗传学研究中的应用   总被引:30,自引:4,他引:30  
通过对非损伤怀取样法的种类以及毛发的非损伤性取样分析方法、口腔脱落细胞的非损伤性取样分析方法、馆藏标本的非损伤性取样分析方法、粪便的非损伤性取样分析方法和其它样品的非损伤性取样分析方法的介绍,就非损伤取样法在保护遗传学中的应用及其优点、局限性和应用前景作了介绍、分析和讨论。  相似文献   

16.
    
The minority cytotype exclusion principle describes how random mating between diploid and autotetraploid cytotypes hinders establishment of the rare cytotype. We present deterministic and stochastic models to ascertain how selfing, inbreeding depression, unreduced gamete production, and finite population size affect minority cytotype exclusion and the establishment of autotetraploids. Results demonstrate that higher selfing rates and lower inbreeding depression in autotetraploids facilitate establishment of autotetraploid populations. Stochastic effects due to finite population size increase the probability of polyploid establishment and decrease the mean time to tetraploid fixation. Our results extend the minority cytotype exclusion principle to include important features of plant reproduction and demonstrate that variation in mating system parameters significantly influences the conditions necessary for polyploid establishment.  相似文献   

17.
    

Background

Recently mixed linear models are used to address the issue of “missing\" heritability in traditional Genome-wide association studies (GWAS). The models assume that all single-nucleotide polymorphisms (SNPs) are associated with the phenotypes of interest. However, it is more common that only a small proportion of SNPs have significant effects on the phenotypes, while most SNPs have no or very small effects. To incorporate this feature, we propose an efficient Hierarchical Bayesian Model (HBM) that extends the existing mixed models to enforce automatic selection of significant SNPs. The HBM models the SNP effects using a mixture distribution of a point mass at zero and a normal distribution, where the point mass corresponds to those non-associative SNPs.

Results

We estimate the HBM using Gibbs sampling. The estimation performance of our method is first demonstrated through two simulation studies. We make the simulation setups realistic by using parameters fitted on the Framingham Heart Study (FHS) data. The simulation studies show that our method can accurately estimate the proportion of SNPs associated with the simulated phenotype and identify these SNPs, as well as adapt to certain model mis-specification than the standard mixed models. In addition, we analyze data from the FHS and the Health and Retirement Study (HRS) to study the association between Body Mass Index (BMI) and SNPs on Chromosome 16, and replicate the identified genetic associations. The analysis of the FHS data identifies 0.3% SNPs on Chromosome 16 that affect BMI, including rs9939609 and rs9939973 on the FTO gene. These two SNPs are in strong linkage disequilibrium with rs1558902 (Rsq =0.901 for rs9939609 and Rsq =0.905 for rs9939973), which has been reported to be linked with obesity in previous GWAS. We then replicate the findings using the HRS data: the analysis finds 0.4% of SNPs associated with BMI on Chromosome 16. Furthermore, around 25% of the genes that are identified to be associated with BMI are common between the two studies.

Conclusions

The results demonstrate that the HBM and the associated estimation algorithm offer a powerful tool for identifying significant genetic associations with phenotypes of interest, among a large number of SNPs that are common in modern genetics studies.  相似文献   

18.
Sinha D  Maiti T 《Biometrics》2004,60(1):34-40
We consider modeling and Bayesian analysis for panel-count data when the termination time for each subject may depend on its history of the recurrent events. We propose a fully specified semiparametric model for the joint distribution of the recurrent events and the termination time. For this model, we provide a natural motivation, derive several novel properties, and develop a Bayesian analysis based on a Markov chain Monte Carlo algorithm. Comparisons are made to other existing models and methods for panel-count data. We demonstrate the usefulness of our new models and methodologies through the reanalysis of a data set from a clinical trial.  相似文献   

19.
20.
    
If a dependent variable in a regression analysis is exceptionally expensive or hard to obtain the overall sample size used to fit the model may be limited. To avoid this one may use a cheaper or more easily collected “surrogate” variable to supplement the expensive variable. The regression analysis will be enhanced to the degree the surrogate is associated with the costly dependent variable. We develop a Bayesian approach incorporating surrogate variables in regression based on a two‐stage experiment. Illustrative examples are given, along with comparisons to an existing frequentist method. (© 2004 WILEY‐VCH Verlag GmbH & Co. KGaA, Weinheim)  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号