共查询到20条相似文献,搜索用时 0 毫秒
1.
2.
3.
4.
5.
A W Kimball 《Biometrics》1987,43(3):707-712
A test procedure using chi-square statistics is proposed for determining a threshold in an ordered sequence of correlated proportions. The procedure is based on the multivariate Bernoulli model. It is applied to the problem of ascertaining when visual acuity has stabilized in a group of patients with regular follow-up after a vision-reducing acute abnormality. 相似文献
6.
7.
8.
Nonparametric regression using local kernel estimating equations for correlated failure time data 总被引:1,自引:0,他引:1
We study nonparametric regression for correlated failure timedata. Kernel estimating equations are used to estimate nonparametriccovariate effects. Independent and weighted-kernel estimatingequations are studied. The derivative of the nonparametric functionis first estimated and the nonparametric function is then estimatedby integrating the derivative estimator. We show that the nonparametrickernel estimator is consistent for any arbitrary working correlationmatrix and that its asymptotic variance is minimized by assumingworking independence. We evaluate the performance of the proposedkernel estimator using simulation studies, and apply the proposedmethod to the western Kenya parasitaemia data. 相似文献
9.
A simple approximation for calculating sample sizes for detecting linear trend in proportions 总被引:1,自引:0,他引:1
J M Nam 《Biometrics》1987,43(3):701-705
A simple approximate formula for sample sizes for detecting a linear trend in proportions is derived. The formulas for both the uncorrected and corrected Cochran-Armitage test are given. For two binomial proportions these reduce to those given by Casagrande, Pike, and Smith (1978, Biometrics 34, 483-486). Some numerical results of a power study for small sample sizes show that the nominal power corresponding to the approximate sample size is a reasonably good approximation to the actual power. 相似文献
10.
Longitudinal data analysis using generalized linear models 总被引:186,自引:0,他引:186
11.
12.
Significance testing for correlated binary outcome data 总被引:1,自引:0,他引:1
Multiple logistic regression is a commonly used multivariate technique for analyzing data with a binary outcome. One assumption needed for this method of analysis is the independence of outcome for all sample points in a data set. In ophthalmologic data and other types of correlated binary data, this assumption is often grossly violated and the validity of the technique becomes an issue. A technique has been developed (Rosner, 1984) that utilizes a polychotomous logistic regression model to allow one to look at multiple exposure variables in the context of a correlated binary data structure. This model is an extension of the beta-binomial model, which has been widely used to model correlated binary data when no covariates are present. In this paper, a relationship is developed between the two techniques, whereby it is shown that use of ordinary logistic regression in the presence of correlated binary data can result in true significance levels that are considerably larger than nominal levels in frequently encountered situations. This relationship is explored in detail in the case of a single dichotomous exposure variable. In this case, the appropriate test statistic can be expressed as an adjusted chi-square statistic based on the 2 X 2 contingency table relating exposure to outcome. The test statistic is easily computed as a function of the ordinary chi-square statistic and the correlation between eyes (or more generally between cluster members) for outcome and exposure, respectively. This generalizes some previous results obtained by Koval and Donner (1987, in Festschrift for V. M. Joshi, I. B. MacNeill (ed.), Vol. V, 199-224.(ABSTRACT TRUNCATED AT 250 WORDS) 相似文献
13.
A testable linear model for diversity trends in estuaries 总被引:9,自引:0,他引:9
Martin J. Attrill 《The Journal of animal ecology》2002,71(2):262-269
14.
Nixon J 《Heredity》2006,96(4):290-297
It is important that breeders have the means to assess genetic scoring data for segregation distortion because of its probable effect on the design of efficient breeding strategies. Scoring data is usually assessed for segregation distortion by separate nonindependent chi2 tests at each locus in a set of marker loci. This analysis gives the loci most affected by selection if it exists, but it cannot give a statistically correct test for the presence or absence of selection in a linkage group as a whole. I have used a combined test based on the statistic, which is the most significant P-value from the above tests, called the single locus test. I have also derived mathematically a new combined statistical test, the overall test, for segregation distortion that requires genetic scoring data for a single linkage group. This test also takes genetic linkage into account. Using a range of marker densities and population sizes, simulations were carried out, to compare the power of these two statistical tests to detect the effect of selection at one or two loci. The single locus test was always found to be more powerful than the overall test, but the single locus test required a more complicated P-value correction. For the single locus test, approximate correction factors for the P-values are given for a range of marker densities and genetic lengths. 相似文献
15.
Testing in normal mixture models when the proportions are known 总被引:3,自引:0,他引:3
16.
In case-control genetic association studies, cases are subjects with the disease and controls are subjects without the disease. At the time of case-control data collection, information about secondary phenotypes is also collected. In addition to studies of primary diseases, there has been some interest in studying genetic variants associated with secondary phenotypes. In genetic association studies, the deviation from Hardy-Weinberg proportion (HWP) of each genetic marker is assessed as an initial quality check to identify questionable genotypes. Generally, HWP tests are performed based on the controls for the primary disease or secondary phenotype. However, when the disease or phenotype of interest is common, the controls do not represent the general population. Therefore, using only controls for testing HWP can result in a highly inflated type I error rate for the disease- and/or phenotype-associated variants. Recently, two approaches, the likelihood ratio test (LRT) approach and the mixture HWP (mHWP) exact test were proposed for testing HWP in samples from case-control studies. Here, we show that these two approaches result in inflated type I error rates and could lead to the removal from further analysis of potential causal genetic variants associated with the primary disease and/or secondary phenotype when the study of primary disease is frequency-matched on the secondary phenotype. Therefore, we proposed alternative approaches, which extend the LRT and mHWP approaches, for assessing HWP that account for frequency matching. The goal was to maintain more (possible causative) single-nucleotide polymorphisms in the sample for further analysis. Our simulation results showed that both extended approaches could control type I error probabilities. We also applied the proposed approaches to test HWP for SNPs from a genome-wide association study of lung cancer that was frequency-matched on smoking status and found that the proposed approaches can keep more genetic variants for association studies. 相似文献
17.
We use the quasilikelihood concept to propose an estimatingequation for spatial data with correlation across the studyregion in a multi-dimensional space. With appropriate mixingconditions, we develop a central limit theorem for a randomfield under various Lp metrics. The consistency and asymptoticnormality of quasilikelihood estimators can then be derived.We also conduct simulations to evaluate the performance of theproposed estimating equation, and a dataset from East LansingWoods is used to illustrate the method. 相似文献
18.
MOTIVATION: The nearest shrunken centroids classifier has become a popular algorithm in tumor classification problems using gene expression microarray data. Feature selection is an embedded part of the method to select top-ranking genes based on a univariate distance statistic calculated for each gene individually. The univariate statistics summarize gene expression profiles outside of the gene co-regulation network context, leading to redundant information being included in the selection procedure. RESULTS: We propose an Eigengene-based Linear Discriminant Analysis (ELDA) to address gene selection in a multivariate framework. The algorithm uses a modified rotated Spectral Decomposition (SpD) technique to select 'hub' genes that associate with the most important eigenvectors. Using three benchmark cancer microarray datasets, we show that ELDA selects the most characteristic genes, leading to substantially smaller classifiers than the univariate feature selection based analogues. The resulting de-correlated expression profiles make the gene-wise independence assumption more realistic and applicable for the shrunken centroids classifier and other diagonal linear discriminant type of models. Our algorithm further incorporates a misclassification cost matrix, allowing differential penalization of one type of error over another. In the breast cancer data, we show false negative prognosis can be controlled via a cost-adjusted discriminant function. AVAILABILITY: R code for the ELDA algorithm is available from author upon request. 相似文献
19.
Zhu and Elston developed a transmission disequilibrium test for quantitative traits by defining a linear transformation to condition out founder information. The method tests the null hypothesis of no linkage or association and can be applied to general pedigree structures. However, this method requires both genotype and phenotype parental information, which may be difficult to obtain. In this paper, we describe parametric and non-parametric methods to relax this requirement when only nuclear families are sampled. We show that neither method is affected by population stratification in the absence of linkage. The statistical power and validity of the tests are investigated by simulation. A simple simulation method to calculate the power of the nonparametric method is also discussed. In practice, the data may have some families with parental phenotype and genotype information available and some without. We briefly discuss how all the data may be analyzed jointly. 相似文献