Similar Literature
20 similar documents found.
1.
Kang SH, Shin D. Human Heredity 2004;58(1):10-17
Many scientific problems can be formulated in terms of a statistical model indexed by parameters, only some of which are of scientific interest; the remaining parameters, called nuisance parameters, are not of interest in themselves. When testing the Hardy-Weinberg law, the relation among genotype and allele probabilities is of interest, while the allele probabilities themselves are nuisance parameters. In this paper we investigate how the size of the chi-square test for the Hardy-Weinberg law, defined as the maximum of the type I error rate over the nuisance parameter space (the worst-case type I error rate), is affected by the nuisance parameters. Whether the size is controlled at the nominal level is a basic property of any statistical test. We prove that the size exceeds the nominal level as the sample size increases. Extensive computations show that the size deviates further above the nominal level as the sample size grows, and that the value of the nuisance parameter at which the maximum type I error rate occurs moves closer to the edges of the nuisance parameter space. An exact test is recommended as an alternative when the type I error is inflated.
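The size computation described in this abstract can be reproduced numerically. The following sketch is my own illustration (not the authors' code), assuming a biallelic marker, the standard 1-df Pearson chi-square test at nominal level 0.05, and a grid search over the allele frequency as the nuisance parameter:

```python
# Exact size of the chi-square test for Hardy-Weinberg equilibrium (HWE).
# Illustrative sketch; the allele frequency p is the nuisance parameter.
import numpy as np
from scipy.stats import chi2, multinomial

def hwe_chi2_rejected_tables(n, alpha=0.05):
    """Enumerate genotype tables (n_AA, n_Aa, n_aa) rejected by the test."""
    crit = chi2.ppf(1 - alpha, df=1)  # 3 categories - 1 - 1 estimated parameter
    rejected = []
    for n_AA in range(n + 1):
        for n_Aa in range(n + 1 - n_AA):
            n_aa = n - n_AA - n_Aa
            p_hat = (2 * n_AA + n_Aa) / (2 * n)  # MLE of allele frequency
            exp = n * np.array([p_hat**2, 2 * p_hat * (1 - p_hat), (1 - p_hat)**2])
            obs = np.array([n_AA, n_Aa, n_aa])
            with np.errstate(divide="ignore", invalid="ignore"):
                stat = np.nansum((obs - exp) ** 2 / exp)  # nan only when obs=exp=0
            if stat > crit:
                rejected.append((n_AA, n_Aa, n_aa))
    return rejected

def type1_error(n, p, rejected):
    """Exact rejection probability under HWE with allele frequency p."""
    probs = [p**2, 2 * p * (1 - p), (1 - p)**2]
    return sum(multinomial.pmf(t, n, probs) for t in rejected)

n = 50
rejected = hwe_chi2_rejected_tables(n)
grid = np.linspace(0.01, 0.99, 197)
sizes = [type1_error(n, p, rejected) for p in grid]
print(f"size = {max(sizes):.4f} at p = {grid[int(np.argmax(sizes))]:.3f}")
```

Re-running with larger n shows the behavior the paper reports: the maximum drifts above 0.05 and the maximizing allele frequency moves toward the boundary of (0, 1).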

2.
Study planning often involves selecting an appropriate sample size. Power calculations require specifying an effect size and estimating "nuisance" parameters, e.g., the overall incidence of the outcome. For observational studies, an additional source of randomness must be estimated: the rate of the exposure. A poor estimate of any of these parameters will produce an erroneous sample size. Internal pilot (IP) designs reduce the risk of this error, leading to better resource utilization, by using revised estimates of the nuisance parameters at an interim stage to adjust the final sample size. In the clinical trials setting, where allocation to treatment groups is predetermined, IP designs have been shown to achieve the targeted power without introducing substantial inflation of the type I error rate. It has not been demonstrated whether the same general conclusions hold in observational studies, where exposure-group membership cannot be controlled by the investigator. We extend the IP design to observational settings. We demonstrate through simulations that implementing an IP design, in which the prevalence of the exposure can be re-estimated at an interim stage, helps ensure optimal power for observational research with little inflation of the type I error associated with the final data analysis.
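As a hypothetical illustration of this idea (not the authors' implementation), the sketch below sizes a two-sample comparison of means where group sizes are driven by an exposure prevalence q, then re-estimates q from an internal pilot and adjusts the final sample size; all numbers are made up:

```python
# Internal pilot (IP) sample size adjustment for an observational study.
# Hedged sketch: two-sided z-test comparing exposed vs. unexposed means,
# with the exposure prevalence q as the nuisance parameter.
import numpy as np
from scipy.stats import norm

def total_n(delta, sigma, q, alpha=0.05, power=0.8):
    """Total N so the test detects mean difference delta when a
    fraction q of subjects is exposed: Var(diff) = sigma^2/N * (1/q + 1/(1-q))."""
    z = norm.ppf(1 - alpha / 2) + norm.ppf(power)
    return int(np.ceil((z * sigma / delta) ** 2 * (1 / q + 1 / (1 - q))))

rng = np.random.default_rng(1)
delta, sigma, q_true = 0.5, 1.0, 0.15

n_planned = total_n(delta, sigma, q=0.30)             # optimistic planning guess
pilot = rng.binomial(1, q_true, size=n_planned // 2)  # exposure status at interim
q_hat = pilot.mean()
n_final = max(n_planned, total_n(delta, sigma, q_hat))  # never shrink below plan

print(f"planned N = {n_planned}, interim q_hat = {q_hat:.3f}, final N = {n_final}")
```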

3.
Higher-order inference about a scalar parameter in the presence of nuisance parameters can be achieved by bootstrapping, in circumstances where the parameter of interest is a component of the canonical parameter in a full exponential family. The optimal test, which is approximated, is a conditional one based on conditioning on the sufficient statistic for the nuisance parameter. A bootstrap procedure that ignores the conditioning is shown to have desirable conditional properties, providing third-order relative accuracy in approximation of p-values associated with the optimal test, in both continuous and discrete models. The bootstrap approach is equivalent to third-order analytical approaches and is demonstrated in a number of examples to give very accurate approximations even for very small sample sizes.
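To make the idea concrete, here is a small sketch of my own (not from the paper): testing equality of two Poisson means, where the optimal conditional test is binomial given the total count, while an unconditional parametric bootstrap that ignores the conditioning gives a closely agreeing p-value:

```python
# Unconditional parametric bootstrap vs. the exact conditional test
# for H0: lam1 = lam2 with X ~ Poisson(lam1), Y ~ Poisson(lam2).
# Illustrative sketch of a "bootstrap that ignores the conditioning".
import numpy as np
from scipy.stats import binom

rng = np.random.default_rng(7)
x_obs, y_obs = 14, 5

# Exact conditional test: given X + Y = t, X ~ Binomial(t, 1/2) under H0.
t = x_obs + y_obs
p_conditional = binom.sf(x_obs - 1, t, 0.5)  # one-sided P(X >= x_obs | t)

# Unconditional bootstrap: plug in the constrained MLE of the nuisance
# parameter (the common mean) and simulate the statistic X - Y.
lam_hat = t / 2
B = 200_000
x_b = rng.poisson(lam_hat, B)
y_b = rng.poisson(lam_hat, B)
p_bootstrap = np.mean(x_b - y_b >= x_obs - y_obs)

print(f"conditional p = {p_conditional:.4f}, bootstrap p = {p_bootstrap:.4f}")
```

The close agreement of the two p-values in examples like this is a Monte Carlo analogue of the third-order accuracy claim in the abstract.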

4.
Sample size calculations in the planning of clinical trials depend on good estimates of the model parameters involved. When these estimates carry a high degree of uncertainty, it is advantageous to reestimate the sample size after an internal pilot study. For non-inferiority trials with a binary outcome, we compare the type I error rate and power of fixed-size designs and designs with sample size reestimation. The latter design proves effective in correcting the sample size and power of the tests when the nuisance parameters are misspecified under the former design.
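A minimal sketch of the mechanism, assuming the standard Wald-type non-inferiority sample size formula (not the specific procedure evaluated in the paper) and hypothetical rates:

```python
# Sample size reestimation after an internal pilot in a non-inferiority
# trial with binary outcome. Hedged sketch; the nuisance parameter is the
# overall response rate, re-estimated at interim.
import numpy as np
from scipy.stats import norm

def n_per_group(p, margin, alpha=0.025, power=0.8):
    """n per arm to show pE > pC - margin, assuming equal true rates p."""
    z = norm.ppf(1 - alpha) + norm.ppf(power)
    return int(np.ceil(2 * p * (1 - p) * (z / margin) ** 2))

margin = 0.10
n_plan = n_per_group(0.70, margin)            # planning-stage guess of the rate

rng = np.random.default_rng(3)
pilot = rng.binomial(1, 0.85, size=n_plan)    # true rate differs from the guess
p_hat = pilot.mean()                          # pooled interim response rate
n_final = n_per_group(p_hat, margin)

print(f"planned n/arm = {n_plan}, interim rate = {p_hat:.2f}, final n/arm = {n_final}")
```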

5.
Liang Li, Bo Hu, Tom Greene. Biometrics 2009;65(3):737-745
In many longitudinal clinical studies, the level and progression rate of repeatedly measured biomarkers on each subject quantify the severity of the disease and that subject's susceptibility to progression of the disease. It is of scientific and clinical interest to relate such quantities to a later time-to-event clinical endpoint such as patient survival. This is usually done with a shared parameter model, in which the longitudinal biomarker data and the survival outcome of each subject are assumed to be conditionally independent given subject-level severity or susceptibility (also called frailty in statistical terms). In this article, we study the case where the conditional distribution of the longitudinal data is modeled by a linear mixed-effects model and the conditional distribution of the survival data is given by a Cox proportional hazards model. We allow unknown regression coefficients and time-dependent covariates in both models. The proposed estimators are maximizers of an exact correction to the joint log likelihood with the frailties eliminated as nuisance parameters, an idea that originated in the correction of covariate measurement error in measurement error models. The corrected joint log likelihood is shown to be asymptotically concave and leads to consistent and asymptotically normal estimators. Unlike most published methods for joint modeling, the proposed estimation procedure does not rely on distributional assumptions about the frailties. The proposed method was studied in simulations and applied to a data set from the Hemodialysis Study.

6.
Atrial fibrillation (AF) is an abnormal heart rhythm characterized by a rapid and irregular heartbeat, with or without perceivable symptoms. In clinical practice, the electrocardiogram (ECG) is often used for diagnosis of AF. Since AF often occurs as recurrent episodes of varying frequency and duration, and only episodes present at the time of the ECG can be detected, AF is often underdiagnosed when a limited number of repeated ECGs are used. In studies evaluating the efficacy of AF ablation surgery, each patient undergoes multiple ECGs and the AF status at the time of each ECG is recorded. The objective of this paper is to estimate the marginal proportions of patients with or without AF in a population, which are important measures of the efficacy of the treatment. The underdiagnosis problem is addressed by a three-class mixture regression model in which a patient's probabilities of having no AF, paroxysmal AF, or permanent AF are modeled by auxiliary baseline covariates in a nested logistic regression. A binomial regression model is specified conditional on a subject being in the paroxysmal AF group. The model parameters are estimated by the Expectation-Maximization (EM) algorithm. These parameters are themselves nuisance parameters for the purposes of this research, but the estimators of the marginal proportions of interest can be expressed as functions of the data and these nuisance parameters, and their variances can be estimated by the sandwich method. We examine the performance of the proposed methodology in simulations and two real data applications.
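The EM step is easy to illustrate on a deliberately simplified version of this model with no covariates (my own reduction, not the paper's nested logistic specification): class 0 patients never show AF on an ECG, class 2 patients always do, and class 1 (paroxysmal) patients show AF on each ECG with probability theta:

```python
# EM for a simplified three-class AF mixture without covariates.
# Class 0 = no AF (never detected), class 1 = paroxysmal AF (each of k
# ECGs positive with prob theta), class 2 = permanent AF (always positive).
import numpy as np
from scipy.stats import binom

def em_af(y, k, n_iter=500):
    """y: positives out of k ECGs per patient. Returns (pi, theta)."""
    pi, theta = np.array([1/3, 1/3, 1/3]), 0.5
    for _ in range(n_iter):
        # E-step: responsibilities for the three latent classes
        lik = np.column_stack([
            (y == 0).astype(float),   # class 0: all ECGs negative
            binom.pmf(y, k, theta),   # class 1: paroxysmal
            (y == k).astype(float),   # class 2: all ECGs positive
        ]) * pi
        r = lik / lik.sum(axis=1, keepdims=True)
        # M-step: update class probabilities and the detection probability
        pi = r.mean(axis=0)
        theta = (r[:, 1] * y).sum() / (r[:, 1].sum() * k)
    return pi, theta

rng = np.random.default_rng(5)
n, k = 2000, 6
cls = rng.choice(3, size=n, p=[0.5, 0.3, 0.2])
y = np.where(cls == 0, 0, np.where(cls == 2, k, rng.binomial(k, 0.4, n)))
pi, theta = em_af(y, k)
print(f"P(any AF) = {pi[1] + pi[2]:.3f} (truth 0.5), theta = {theta:.3f}")
```

The marginal proportion with AF is pi[1] + pi[2], recovering the quantity of interest despite the fact that a naive estimate (the fraction of patients with any positive ECG) would be biased downward.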

7.
Clinicians are often interested in the effect of covariates on survival probabilities at prespecified study times. Because different factors can be associated with the risk of short- and long-term failure, a flexible modeling strategy is pursued. Given a set of multiple candidate working models, an objective methodology is proposed that aims to construct consistent and asymptotically normal estimators of regression coefficients and average prediction error for each working model, free from the nuisance censoring variable. It requires the conditional distribution of censoring given covariates to be modeled. The model selection strategy uses step-up or step-down multiple hypothesis testing procedures that control either the proportion of false positives or the generalized familywise error rate when comparing models based on estimates of average prediction error. The problem can in fact be cast as a missing data problem, where augmented inverse probability weighted complete-case estimators of regression coefficients and prediction error can be used (Tsiatis, 2006, Semiparametric Theory and Missing Data). A simulation study and an analysis of a recent AIDS trial are provided.

8.
Englert S, Kieser M. Biometrics 2012;68(3):886-892
Phase II trials in oncology are usually conducted as single-arm two-stage designs with binary endpoints. Currently available adaptive design methods are tailored to comparative studies with continuous test statistics. Direct transfer of these methods to discrete test statistics results in conservative procedures and, therefore, in a loss of power. We propose a method based on the conditional error function principle that directly accounts for the discreteness of the outcome. We show how the method can be used to construct new phase II designs that are more efficient than currently applied designs and that allow flexible mid-course design modifications. The proposed method is illustrated with a variety of frequently used phase II designs.
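The building block of such methods is the discrete conditional error function of a two-stage design. The sketch below (hypothetical design parameters, not taken from the paper) computes it for a Simon-type single-arm design under H0:

```python
# Conditional error function of a single-arm two-stage design with a
# binary endpoint, evaluated under H0: p = p0. Illustrative sketch.
from scipy.stats import binom

# Stage sizes and critical values (Simon-type): stop for futility if
# stage-1 responses s1 <= r1; reject H0 at the end if total responses > r.
n1, n2, r1, r, p0 = 15, 28, 3, 12, 0.2

def conditional_error(s1):
    """P(reject H0 at the final analysis | stage-1 responses s1) under p0."""
    if s1 <= r1:
        return 0.0                   # trial already stopped for futility
    return binom.sf(r - s1, n2, p0)  # P(S2 > r - s1)

# Overall type I error = expectation of the conditional error over stage 1.
alpha = sum(binom.pmf(s1, n1, p0) * conditional_error(s1) for s1 in range(n1 + 1))
print(f"type I error of the design: {alpha:.4f}")
for s1 in (4, 6, 8):
    print(f"A({s1}) = {conditional_error(s1):.4f}")
```

A mid-course modification preserves the type I error as long as the conditional error of the modified second stage does not exceed A(s1), which is the principle the abstract refers to.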

9.
We extend the methodology for family-based tests of association and linkage to allow for both variation in the phenotypes of subjects and incorporation of covariates into general score tests of association. We use standard association models for a phenotype and any number of predictors. We then construct a score statistic using likelihoods for the distribution of phenotype given genotype. The distribution of the score is computed as a function of offspring genotypes, conditional on parental genotypes and trait values for offspring and parents. This approach provides a natural extension of the transmission/disequilibrium test to any phenotype and to multiple genes or environmental factors, and it allows the study of gene-gene and gene-environment interaction. When the trait varies among subjects or when covariates are included in the association model, the score statistic depends on one or more nuisance parameters. We suggest two approaches for obtaining parameter estimates: (1) choosing the estimate that minimizes the variance of the test statistic and (2) maximizing the statistic over a nuisance parameter and using a corrected P value. We apply our methods to a sample of families with attention-deficit/hyperactivity disorder and provide examples of how covariates and gene-environment and gene-gene interactions can be incorporated.

10.
We consider longitudinal studies in which the outcome observed over time is binary and the covariates of interest are categorical. With no missing responses or covariates, one specifies a multinomial model for the responses given the covariates and uses maximum likelihood to estimate the parameters. Unfortunately, incomplete data in the responses and covariates are a common occurrence in longitudinal studies. Here we assume the missing data are missing at random (Rubin, 1976, Biometrika 63, 581-592). Since all of the missing data (responses and covariates) are categorical, a useful technique for obtaining maximum likelihood parameter estimates is the EM algorithm by the method of weights proposed in Ibrahim (1990, Journal of the American Statistical Association 85, 765-769). In using the EM algorithm with missing responses and covariates, one specifies the joint distribution of the responses and covariates. Here we treat the parameters of the covariate distribution as nuisance parameters. In data sets where the percentage of missing data is high, the estimates of the nuisance parameters can lead to highly unstable estimates of the parameters of interest. We propose a conditional model for the covariate distribution that has several modeling advantages for the EM algorithm and provides a reduction in the number of nuisance parameters, thus yielding more stable estimates in finite samples.

11.
In multiple testing, strong control of the familywise error rate (FWER) may be unnecessarily stringent in some situations, such as bioinformatic studies. An alternative approach, discussed by Hommel and Hoffmann (1988) and Lehmann and Romano (2005), is to control the generalized familywise error rate (gFWER), the probability of incorrectly rejecting more than m hypotheses. This article presents the generalized partitioning principle as a systematic technique for constructing gFWER-controlling tests that can take the joint distribution of the test statistics into account. The paper is structured as follows. We first review the classical partitioning principle, indicating its conditioning nature. Then the generalized partitioning principle is introduced, with a set of sufficient conditions that allows it to be executed as a computationally more feasible step-down test. Finally, we show the importance of having some knowledge of the distribution of the observations in multiple testing. In particular, we show that step-down permutation tests require an assumption on the joint distribution of the observations in order to control the familywise error rate.
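For reference, the best-known gFWER-controlling step-down test is the Lehmann-Romano procedure, which uses only marginal p-values; a minimal sketch follows (the paper's partitioning construction can additionally exploit the joint distribution, which this sketch does not):

```python
# Lehmann-Romano step-down procedure controlling the k-FWER, the
# probability of k or more false rejections (gFWER with m = k - 1).
import numpy as np

def lehmann_romano_stepdown(pvals, k=2, alpha=0.05):
    """Return a boolean rejection vector controlling the k-FWER at alpha."""
    m = len(pvals)
    order = np.argsort(pvals)
    p_sorted = np.asarray(pvals)[order]
    # Critical values (0-based index i): k*alpha/m while i + 1 <= k,
    # then k*alpha/(m + k - (i + 1)) afterwards.
    crit = np.array([k * alpha / (m + k - max(i + 1, k)) for i in range(m)])
    below = p_sorted <= crit
    n_reject = m if below.all() else int(np.argmin(below))  # stop at first failure
    reject = np.zeros(m, dtype=bool)
    reject[order[:n_reject]] = True
    return reject

pvals = [0.001, 0.004, 0.012, 0.020, 0.09, 0.4, 0.7]
print(lehmann_romano_stepdown(pvals, k=2, alpha=0.05))
```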

12.
In an active-controlled trial, the experimental treatment can be declared non-inferior to the control if the confidence interval for the difference excludes a fixed pre-specified margin. Recently, some articles have discussed an alternative method in which the data from the current study and from placebo-controlled studies of the active control are combined into a single test statistic to test whether a fixed fraction of the effect of the active control is preserved. It has been shown that, conditional on nuisance parameters from the active-controlled study, a fixed margin can be defined that is operationally equivalent to this latter method. In this article, we discuss statistical properties associated with these approaches, specifically the interim monitoring boundaries and the level of evidence.
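A minimal sketch of the "synthesis" statistic and the operationally equivalent fixed margin, assuming a Wald-type normal approximation and entirely hypothetical effect estimates:

```python
# Synthesis test for effect preservation in an active-controlled trial:
# tests H0: (muT - muC) <= -(1 - f) * (muC - muP), i.e., less than a
# fraction f of the active control's effect over placebo is retained.
# Hedged sketch with made-up inputs.
import numpy as np
from scipy.stats import norm

d_TC, se_TC = -0.8, 0.9   # current trial: treatment-minus-control estimate
d_CP, se_CP = 4.0, 1.1    # historical studies: control-minus-placebo estimate
f, alpha = 0.5, 0.025

s = np.sqrt(se_TC**2 + (1 - f)**2 * se_CP**2)
z = (d_TC + (1 - f) * d_CP) / s
print(f"synthesis z = {z:.3f}, one-sided p = {norm.sf(z):.4f}")

# Conditional on the historical estimate, the fixed margin giving the
# same rejection region for d_TC is:
z_a = norm.ppf(1 - alpha)
margin = (1 - f) * d_CP - z_a * (s - se_TC)
print(f"operationally equivalent fixed margin = {margin:.3f}")
```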

13.
Satten GA, Carroll RJ. Biometrics 2000;56(2):384-388
We consider methods for analyzing categorical regression models when some covariates (Z) are completely observed but other covariates (X) are missing for some subjects. When data on X are missing at random (i.e., when the probability that X is observed does not depend on the value of X itself), we present a likelihood approach for the observed data that allows the same nuisance parameters to be eliminated in a conditional analysis as when the data are complete. An example of a matched case-control study is used to demonstrate our approach.

14.
The traditional approach to 'exact' small-sample interval estimation of the odds ratio for binomial, Poisson, or multinomial samples uses the conditional distribution to eliminate nuisance parameters. This approach can be very conservative. For two independent binomial samples, we study an unconditional approach with overall confidence level guaranteed to be at least the nominal level. With small samples this interval tends to be shorter and to have coverage probabilities nearer the nominal level.
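The unconditional approach rests on maximizing the rejection probability over the nuisance parameter rather than conditioning it away. A sketch of a Barnard-type unconditional p-value for two binomials (my own illustration; the paper's interval is obtained by inverting tests of this kind across odds-ratio values):

```python
# Unconditional exact p-value for H0: p1 = p2 with two independent
# binomials, maximizing over the common nuisance probability p.
import numpy as np
from scipy.stats import binom

def pooled_z(x1, n1, x2, n2):
    """Score (pooled) z statistic for comparing two proportions."""
    p = (x1 + x2) / (n1 + n2)
    se = np.sqrt(p * (1 - p) * (1 / n1 + 1 / n2))
    return (x1 / n1 - x2 / n2) / se if se > 0 else 0.0

def unconditional_pvalue(x1, n1, x2, n2, n_grid=999):
    z_obs = pooled_z(x1, n1, x2, n2)
    zs = np.array([[pooled_z(a, n1, b, n2) for b in range(n2 + 1)]
                   for a in range(n1 + 1)])
    extreme = zs >= z_obs                 # one-sided ordering by the z statistic
    best = 0.0
    for p in np.linspace(0.001, 0.999, n_grid):  # sup over the nuisance p
        prob = np.outer(binom.pmf(np.arange(n1 + 1), n1, p),
                        binom.pmf(np.arange(n2 + 1), n2, p))
        best = max(best, prob[extreme].sum())
    return best

print(f"unconditional exact p = {unconditional_pvalue(7, 10, 2, 10):.4f}")
```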

15.
Many late-phase clinical trials recruit subjects at multiple study sites. This introduces a hierarchical structure into the data that can result in a loss of power compared to a more homogeneous single-center trial. Building on a recently proposed approach to sample size determination, we suggest a sample size recalculation procedure for multicenter trials with continuous endpoints. The procedure estimates nuisance parameters at interim from noncomparative data and recalculates the required sample size based on these estimates. In contrast to other sample size calculation methods for multicenter trials, our approach assumes a mixed effects model and does not rely on balanced data within centers. It is therefore advantageous, especially for sample size recalculation at interim. We illustrate the proposed methodology with a study evaluating a diabetes management system. Monte Carlo simulations are carried out to evaluate the operating characteristics of the sample size recalculation procedure using comparative as well as noncomparative data, assessing their dependence on parameters such as between-center heterogeneity, residual variance of the observations, treatment effect size, and number of centers. We compare two different estimators of between-center heterogeneity, an unadjusted and a bias-adjusted estimator, both based on quadratic forms. For the proposed unadjusted estimator, the type I error probability and statistical power are close to their nominal levels for all parameter combinations considered in our simulation study, whereas the adjusted estimator exhibits some type I error rate inflation. Overall, the sample size recalculation procedure can be recommended to mitigate risks arising from misspecified nuisance parameters at the planning stage.
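A hedged sketch of the interim step: a method-of-moments (one-way ANOVA) quadratic-form estimate of the between-center variance from noncomparative, unbalanced data, followed by a crude sample size recalculation. This is an unadjusted estimator only; the paper's bias-adjusted variant and the full mixed-model power calculation are not reproduced:

```python
# Interim nuisance-parameter estimation for a multicenter trial:
# variance components from noncomparative data, then recalculation.
import numpy as np
from scipy.stats import norm

def variance_components(groups):
    """groups: list of 1-D arrays, one per center. Returns (sigma2_c, sigma2_e)."""
    k = len(groups)
    n_j = np.array([len(g) for g in groups])
    N = n_j.sum()
    grand = np.concatenate(groups).mean()
    msw = sum(((g - g.mean()) ** 2).sum() for g in groups) / (N - k)
    msb = sum(n * (g.mean() - grand) ** 2 for n, g in zip(n_j, groups)) / (k - 1)
    n0 = (N - (n_j ** 2).sum() / N) / (k - 1)  # effective center size (unbalanced)
    sigma2_c = max(0.0, (msb - msw) / n0)      # E[MSB] = sigma2_e + n0 * sigma2_c
    return sigma2_c, msw

def n_per_arm(delta, sigma2_total, alpha=0.05, power=0.8):
    # Crude: treats the total variance as the per-observation variance.
    z = norm.ppf(1 - alpha / 2) + norm.ppf(power)
    return int(np.ceil(2 * sigma2_total * (z / delta) ** 2))

rng = np.random.default_rng(11)
centers = [rng.normal(rng.normal(0, 0.5), 1.0, size=rng.integers(8, 20))
           for _ in range(12)]                 # interim, noncomparative data
s2c, s2e = variance_components(centers)
print(f"sigma2_center = {s2c:.3f}, sigma2_resid = {s2e:.3f}, "
      f"recalculated n/arm = {n_per_arm(0.4, s2c + s2e)}")
```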

16.
Outcome-dependent sampling (ODS) schemes can be a cost-effective way to enhance study efficiency. The case-control design has been widely used in epidemiologic studies. However, when the outcome is measured on a continuous scale, dichotomizing the outcome can lead to a loss of efficiency. Recent epidemiologic studies have used ODS schemes in which, in addition to an overall random sample, a number of supplemental samples are collected based on a continuous outcome variable. We consider a semiparametric empirical likelihood inference procedure in which the underlying distribution of the covariates is treated as a nuisance parameter and left unspecified. The proposed estimator has asymptotic normality properties. The likelihood ratio statistic using the semiparametric empirical likelihood function has Wilks-type properties in that, under the null, it asymptotically follows a chi-square distribution and is independent of the nuisance parameters. Our simulation results indicate that, for data obtained under an ODS design, the semiparametric empirical likelihood estimator is more efficient than conditional likelihood and probability-weighted pseudolikelihood estimators, and that ODS designs (along with the proposed estimator) can produce more efficient estimates than simple random sample designs of the same size. We apply the proposed method to analyze a data set from the Collaborative Perinatal Project (CPP), an ongoing environmental epidemiologic study, to assess the relationship between maternal polychlorinated biphenyl (PCB) level and children's IQ test performance.

17.
Takeuchi (1969) provides a uniformly most powerful (UMP) one-sided test for the location parameter of the two-parameter exponential model when the scale parameter is unknown. The power of his similar size-α test, however, depends on the unknown scale parameter. In this case, and in more general situations where a sufficient statistic exists for the nuisance parameter, the theory of generalized Thompson distributions, more specifically the Thompsonization of a test statistic (Laurent, 1959, 1972), provides a UMP test whose power does not depend on the nuisance parameter. Examples of application of this general nuisance-parameter-free test procedure include the truncated exponential, the inverse Gaussian, and the geometric distributions.

18.
Kolassa JE, Tanner MA. Biometrics 1999;55(1):246-251
This article presents an algorithm for approximate frequentist conditional inference on two or more parameters for any regression model in the generalized linear model (GLIM) family. We thereby extend highly accurate inference beyond the cases of logistic regression and contingency tables implemented in commercially available software. The method makes use of the double saddlepoint approximations of Skovgaard (1987, Journal of Applied Probability 24, 875-887) and Jensen (1992, Biometrika 79, 693-703) to the conditional cumulative distribution function of a sufficient statistic given the remaining sufficient statistics. This approximation is then used in conjunction with noniterative Monte Carlo methods to generate a sample from a distribution that approximates the joint distribution of the sufficient statistics associated with the parameters of interest, conditional on the observed values of the sufficient statistics associated with the nuisance parameters. This algorithm is an alternative to the approach of Kolassa and Tanner (1994, Journal of the American Statistical Association 89, 697-702), in which a Markov chain is generated whose equilibrium distribution, under certain regularity conditions, approximates the joint distribution of interest; there, the Gibbs sampler was used in conjunction with univariate conditional distribution function approximations. The method of this paper does not require the construction and simulation of a Markov chain, thus avoiding the need to develop regularity conditions under which the algorithm converges and the need for the data analyst to check convergence of the particular chain. Examples involving logistic and truncated Poisson regression are presented.

19.
Freidlin B. Biometrics 1999;55(1):264-267
By focusing on a confidence interval for a nuisance parameter, Berger and Boos (1994, Journal of the American Statistical Association 89, 1012-1016) proposed new unconditional tests. In particular, they showed that, for a 2 x 2 table, this procedure is generally more powerful than Fisher's exact test. This paper utilizes and extends their approach to obtain unconditional tests for combining several 2 x 2 tables and for testing trend and homogeneity in a 2 x K table. The unconditional procedures are compared to the conditional ones by reanalyzing some published biomedical data.
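The Berger-Boos device replaces the supremum over the whole nuisance parameter space with a supremum over a (1 - gamma) confidence set, plus gamma. A sketch for a single 2 x 2 table with a one-sided pooled-z ordering (my own illustration, not the paper's extension to multiple tables):

```python
# Berger-Boos unconditional p-value for a 2x2 table: maximize over a
# (1 - gamma) confidence interval for the common success probability,
# then add gamma. The result is a valid p-value (Berger & Boos, 1994).
import numpy as np
from scipy.stats import beta, binom

def pooled_z(x1, n1, x2, n2):
    p = (x1 + x2) / (n1 + n2)
    se = np.sqrt(p * (1 - p) * (1 / n1 + 1 / n2))
    return (x1 / n1 - x2 / n2) / se if se > 0 else 0.0

def berger_boos_pvalue(x1, n1, x2, n2, gamma=0.001):
    # Clopper-Pearson (1 - gamma) CI for the pooled nuisance probability.
    x, n = x1 + x2, n1 + n2
    lo = beta.ppf(gamma / 2, x, n - x + 1) if x > 0 else 0.0
    hi = beta.ppf(1 - gamma / 2, x + 1, n - x) if x < n else 1.0
    z_obs = pooled_z(x1, n1, x2, n2)
    zs = np.array([[pooled_z(a, n1, b, n2) for b in range(n2 + 1)]
                   for a in range(n1 + 1)])
    extreme = zs >= z_obs
    best = 0.0
    for p in np.linspace(max(lo, 1e-9), min(hi, 1 - 1e-9), 400):
        prob = np.outer(binom.pmf(np.arange(n1 + 1), n1, p),
                        binom.pmf(np.arange(n2 + 1), n2, p))
        best = max(best, prob[extreme].sum())
    return min(1.0, best + gamma)

print(f"Berger-Boos p = {berger_boos_pvalue(7, 10, 2, 10):.4f}")
```

Restricting the maximization to the confidence set is what recovers power relative to a full unconditional supremum while preserving validity.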

20.
We consider hypothesis testing in a clinical trial with an interim treatment selection. Recently, unconditional and conditional procedures for selecting one treatment as the winner have been proposed for the case where the mean responses are approximately normal. In this paper, we generalize both procedures to multi-winner cases. The distributions of the test statistics are obtained and step-down approaches are proposed. We prove that both the unconditional and conditional procedures strongly control the familywise error rate. We give a brief discussion of power comparisons.
