首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
There are two common designs for association mapping of complex diseases: case-control and family-based designs. A case-control sample is more powerful to detect genetic effects than a family-based sample that contains the same numbers of affected and unaffected persons, although additional markers may be required to control for spurious association. When family and unrelated samples are available, statistical analyses are often performed in the family and unrelated samples separately, conditioning on parental information for the former, thus resulting in reduced power. In this report, we propose a unified approach that can incorporate both family and case-control samples and, provided the additional markers are available, at the same time corrects for population stratification. We apply the principal components of a marker matrix to adjust for the effect of population stratification. This unified approach makes it unnecessary to perform a conditional analysis of the family data and is more powerful than the separate analyses of unrelated and family samples, or a meta-analysis performed by combining the results of the usual separate analyses. This property is demonstrated in both a variety of simulation models and empirical data. The proposed approach can be equally applied to the analysis of both qualitative and quantitative traits.  相似文献   

2.
3.
DNA sequence copy number has been shown to be associated with cancer development and progression. Array-based comparative genomic hybridization (aCGH) is a recent development that seeks to identify the copy number ratio at large numbers of markers across the genome. Due to experimental and biological variations across chromosomes and hybridizations, current methods are limited to analyses of single chromosomes. We propose a more powerful approach that borrows strength across chromosomes and hybridizations. We assume a Gaussian mixture model, with a hidden Markov dependence structure and with random effects to allow for intertumoral variation, as well as intratumoral clonal variation. For ease of computation, we base estimation on a pseudolikelihood function. The method produces quantitative assessments of the likelihood of genetic alterations at each clone, along with a graphical display for simple visual interpretation. We assess the characteristics of the method through simulation studies and analysis of a brain tumor aCGH data set. We show that the pseudolikelihood approach is superior to existing methods both in detecting small regions of copy number alteration and in accurately classifying regions of change when intratumoral clonal variation is present. Software for this approach is available at http://www.biostat.harvard.edu/ approximately betensky/papers.html.  相似文献   

4.

Background

Nerve growth factor (NGF) helps in the healing and survival of ganglion cells, photoreceptors, and optic nerve after injury and has been implicated to have a role in pathophysiology of glaucoma. So far, in animal studies, injury to iris in vitro has revealed an increase in NGF levels in aqueous. There is a great interest in investigating the levels of NGF in human aqueous in glaucomatous eyes, as suggested by animal studies, to gain a better understanding of the pathophysiology of glaucoma.

Findings

In this study, we examined the presence of NGF levels in aqueous humor collected from human eyes and the limitations in determining the NGF levels in human samples. NGF was assessed by ELISA immunoassay in undiluted aqueous samples collected from 32 consecutive patients undergoing surgery for cataract (control) or primary open angle glaucoma (POAG). Recombinant NGF was used as positive control. NGF levels were below undetectable levels in aqueous humor from eyes with POAG and controls by immunoassay. Less than 10% of samples had detectable NGF levels and these were considered outliers.

Conclusion

Our result highlights the undetectable levels of NGF in human aqueous samples.  相似文献   

5.
The HapMap project has given case-control association studies a unique opportunity to uncover the genetic basis of complex diseases. However, persistent issues in such studies remain the proper quantification of, testing for, and correction for population stratification (PS). In this paper, we present the first unified paradigm that addresses all three fundamental issues within one statistical framework. Our unified approach makes use of an omnibus quantity (delta), which can be estimated in a case-control study from suitable null loci. We show how this estimated value can be used to quantify PS, to statistically test for PS, and to correct for PS, all in the context of case-control studies. Moreover, we provide guidelines for interpreting values of delta in association studies (e.g., at alpha = 0.05, a delta of size 0.416 is small, a delta of size 0.653 is medium, and a delta of size 1.115 is large). A novel feature of our testing procedure is its ability to test for either strictly any PS or only 'practically important' PS. We also performed simulations to compare our correction procedure with Genomic Control (GC). Our results show that, unlike GC, it maintains good Type I error rates and power across all levels of PS.  相似文献   

6.
7.
In recent years multilocus data sets have been used to study the demographic history of human populations. In this paper (1) analyses previously done on 60 short tandem repeat (STR) loci are repeated on 30 restriction site polymorphism (RSP) markers; (2) relative population weights are estimated from the RSP data set and compared to previously published estimates from STR and craniometric data sets; and (3) computer simulations are performed to show the effects of ascertainment bias on relative population weight estimates. Not surprisingly, given that the RSP markers were originally identified in a small panel of Caucasians, estimates of relative population weights are biased and the European population weight is artificially inflated. However, the effects of ascertainment bias are not apparent in a principal components plot or estimates of FST. Ascertainment bias can have a large effect in other genetic systems with inherently low heterozygosity such as Alus or single nucleotide polymorphisms (SNPs), and care must be taken to have prior knowledge of how polymorphic markers in a given data set were originally identified. Otherwise, results can be skewed and interpretations faulty.  相似文献   

8.
9.
Widespread multifactor interactions present a significant challenge in determining risk factors of complex diseases. Several combinatorial approaches, such as the multifactor dimensionality reduction (MDR) method, have emerged as a promising tool for better detecting gene-gene (G x G) and gene-environment (G x E) interactions. We recently developed a general combinatorial approach, namely the generalized multifactor dimensionality reduction (GMDR) method, which can entertain both qualitative and quantitative phenotypes and allows for both discrete and continuous covariates to detect G x G and G x E interactions in a sample of unrelated individuals. In this article, we report the development of an algorithm that can be used to study G x G and G x E interactions for family-based designs, called pedigree-based GMDR (PGMDR). Compared to the available method, our proposed method has several major improvements, including allowing for covariate adjustments and being applicable to arbitrary phenotypes, arbitrary pedigree structures, and arbitrary patterns of missing marker genotypes. Our Monte Carlo simulations provide evidence that the PGMDR method is superior in performance to identify epistatic loci compared to the MDR-pedigree disequilibrium test (PDT). Finally, we applied our proposed approach to a genetic data set on tobacco dependence and found a significant interaction between two taste receptor genes (i.e., TAS2R16 and TAS2R38) in affecting nicotine dependence.  相似文献   

10.
A pseudolikelihood method for analyzing interval censored data   总被引:1,自引:0,他引:1  
We introduce a method based on a pseudolikelihood ratio forestimating the distribution function of the survival time ina mixed-case interval censoring model. In a mixed-case model,an individual is observed a random number of times, and at eachtime it is recorded whether an event has happened or not. Oneseeks to estimate the distribution of time to event. We usea Poisson process as the basis of a likelihood function to constructa pseudolikelihood ratio statistic for testing the value ofthe distribution function at a fixed point, and show that thisconverges under the null hypothesis to a known limit distribution,that can be expressed as a functional of different convex minorantsof a two-sided Brownian motion process with parabolic drift.Construction of confidence sets then proceeds by standard inversion.The computation of the confidence sets is simple, requiringthe use of the pool-adjacent-violators algorithm or a standardisotonic regression algorithm. We also illustrate the superiorityof the proposed method over competitors based on resamplingtechniques or on the limit distribution of the maximum pseudolikelihoodestimator, through simulation studies, and illustrate the differentmethods on a dataset involving time to HIV seroconversion ina group of haemophiliacs.  相似文献   

11.
12.
One widely used measure of familial aggregation is the sibling recurrence-risk ratio, which is defined as the ratio of risk of disease manifestation, given that one's sibling is affected, as compared with the disease prevalence in the general population. Known as lambdaS, it has been used extensively in the mapping of complex diseases. In this paper, I show that, for a fictitious disease that is strictly nongenetic and nonenvironmental, lambdaS can be dramatically inflated because of misunderstanding of the original definition of lambdaS, ascertainment bias, and overreporting. Therefore, for a disease of entirely environmental origin, the lambdaS inflation due to ascertainment bias and/or overreporting is expected to be more prominent if the risk factor also is familially aggregated. This suggests that, like segregation analysis, the estimation of lambdaS also is prone to ascertainment bias and should be performed with great care. This is particularly important if one uses lambdaS for exclusion mapping, for discrimination between different genetic models, and for association studies, since these practices hinge tightly on an accurate estimation of lambdaS.  相似文献   

13.
Sensitivity and specificity are common measures of the accuracy of a diagnostic test. The usual estimators of these quantities are unbiased if data on the diagnostic test result and the true disease status are obtained from all subjects in an appropriately selected sample. In some studies, verification of the true disease status is performed only for a subset of subjects, possibly depending on the result of the diagnostic test and other characteristics of the subjects. Estimators of sensitivity and specificity based on this subset of subjects are typically biased; this is known as verification bias. Methods have been proposed to correct verification bias under the assumption that the missing data on disease status are missing at random (MAR), that is, the probability of missingness depends on the true (missing) disease status only through the test result and observed covariate information. When some of the covariates are continuous, or the number of covariates is relatively large, the existing methods require parametric models for the probability of disease or the probability of verification (given the test result and covariates), and hence are subject to model misspecification. We propose a new method for correcting verification bias based on the propensity score, defined as the predicted probability of verification given the test result and observed covariates. This is estimated separately for those with positive and negative test results. The new method classifies the verified sample into several subsamples that have homogeneous propensity scores and allows correction for verification bias. Simulation studies demonstrate that the new estimators are more robust to model misspecification than existing methods, but still perform well when the models for the probability of disease and probability of verification are correctly specified.  相似文献   

14.
Uncertainty about the ascertainment of human family data leads to a need for robust methods for estimating genetic and environmental effects. This in turn leads to a need for efficient techniques for estimating model parameters for data generated under one parametric model but analyzed under a second model. If the two models correspond to different ascertainment schemes for the same exponential family, simple formulas for the asymptotic means and standard errors of both conditional and unconditional MLEs can be derived. In an example for continuous sibship data, these formulas show that estimates derived from conditioning on proband value have greater asymptotic bias than two other estimators. Similarly, either conditioning on proband value or conditioning on the number of affected family members resulted in biases of up to 30% when ascertainment depended on the values of more than one affected family member.  相似文献   

15.
Species-specific differences in microsatellite locus length and ascertainment bias have both been proposed to explain differences in microsatellite variability and length usually observed when loci isolated in one species are used to survey variation in a related species. Here we provide a simple algebraic approach to independently estimate the contributions of true species-specific length differences and ascertainment bias. We apply this approach to a reciprocal-isolation microsatellite study and show contributions of both ascertainment bias and a true longer average microsatellite length in Drosophila melanogaster compared with D. simulans.  相似文献   

16.
It has been shown that the classical binomial form of ascertainment, assuming a constant probability pi that any affected individual may become a proband for his pedigree, cannot describe a rather wide range of ascertainment procedures that might arise in practice. Some more general heuristic ascertainment formulas might then be preferred, and in this paper we consider the probabilistic basis for these formulas. We retain the binomial assumption of the classical scheme but allow the ascertainment probability to depend on the number of potential probands per pedigree. This probability can be expressed by an increasing or a decreasing function of that number. Various illustrations are given and situations where the "cooperative" binomial scheme should be valuable are discussed.  相似文献   

17.
A semiparametric pseudolikelihood estimation method for panel count data   总被引:1,自引:0,他引:1  
Zhang  Ying 《Biometrika》2002,89(1):39-48
  相似文献   

18.
19.
The study of codon usage bias is an important research area that contributes to our understanding of molecular evolution, phylogenetic relationships, respiratory lifestyle, and other characteristics. Translational efficiency bias is perhaps the most well-studied codon usage bias, as it is frequently utilized to predict relative protein expression levels. We present a novel approach to isolating translational efficiency bias in microbial genomes. There are several existent methods for isolating translational efficiency bias. Previous approaches are susceptible to the confounding influences of other potentially dominant biases. Additionally, existing approaches to identifying translational efficiency bias generally require both genomic sequence information and prior knowledge of a set of highly expressed genes. This novel approach provides more accurate results from sequence information alone by resisting the confounding effects of other biases. We validate this increase in accuracy in isolating translational efficiency bias on 10 microbial genomes, five of which have proven particularly difficult for existing approaches due to the presence of strong confounding biases.  相似文献   

20.
In a previous paper we have shown that, when DNA samples for cases and controls are prepared in different laboratories prior to high-throughput genotyping, scoring inaccuracies can lead to differential misclassification and, consequently, to increased false-positive rates. Different DNA sourcing is often unavoidable in large-scale disease association studies of multiple case and control sets. Here, we describe methodological improvements to minimise such biases. These fall into two categories: improvements to the basic clustering methods for identifying genotypes from fluorescence intensities, and use of "fuzzy" calls in association tests in order to make appropriate allowance for call uncertainty. We find that the main improvement is a modification of the calling algorithm that links the clustering of cases and controls while allowing for different DNA sourcing. We also find that, in the presence of different DNA sourcing, biases associated with missing data can increase the false-positive rate. Therefore, we propose the use of "fuzzy" calls to deal with uncertain genotypes that would otherwise be labeled as missing.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号