共查询到20条相似文献,搜索用时 9 毫秒
1.
2.
3.
4.
K.-D. Wernecke J. Haerting G. Kalb E. Stuerzebecher 《Biometrical journal. Biometrische Zeitschrift》1989,31(3):289-296
An algorithm for model selection in discrimination with categorical variables is presented. It is based on four models applied hierarchically and linked with a build-up procedure of feature-selection. The choice of models and features is ensured by a consequent cross-validation. Results of an application in medical diagnostics are described. 相似文献
5.
Kernel density estimation with spherical data 总被引:9,自引:0,他引:9
6.
Anne‐Laure Boulesteix 《Biometrical journal. Biometrische Zeitschrift》2016,58(3):652-673
Automated variable selection procedures, such as backward elimination, are commonly employed to perform model selection in the context of multivariable regression. The stability of such procedures can be investigated using a bootstrap‐based approach. The idea is to apply the variable selection procedure on a large number of bootstrap samples successively and to examine the obtained models, for instance, in terms of the inclusion of specific predictor variables. In this paper, we aim to investigate a particular important problem affecting this method in the case of categorical predictor variables with different numbers of categories and to give recommendations on how to avoid it. For this purpose, we systematically assess the behavior of automated variable selection based on the likelihood ratio test using either bootstrap samples drawn with replacement or subsamples drawn without replacement from the original dataset. Our study consists of extensive simulations and a real data example from the NHANES study. Our main result is that if automated variable selection is conducted on bootstrap samples, variables with more categories are substantially favored over variables with fewer categories and over metric variables even if none of them have any effect. Importantly, variables with no effect and many categories may be (wrongly) preferred to variables with an effect but few categories. We suggest the use of subsamples instead of bootstrap samples to bypass these drawbacks. 相似文献
7.
Tu D 《Biometrical journal. Biometrische Zeitschrift》2007,49(3):474-483
In cancer clinical trials, it is often of interest in estimating the ratios of hazard rates at some specific time points during the study from two independent populations. In this paper, we consider nonparametric confidence interval procedures for the hazard ratio based on kernel estimates for the hazard rates with under-smoothing bandwidths. Two methods are used to derive the confidence intervals: one based on the asymptotic normality of the ratio of the kernel estimates for the hazard rates in two populations and another through Fieller's Theorem. The performances of the proposed confidence intervals are evaluated through Monte-Carlo simulations and applied to the analysis of data from a clinical trial on early breast cancer. 相似文献
8.
In some infectious disease studies and 2‐step treatment studies, 2 × 2 table with structural zero could arise in situations where it is theoretically impossible for a particular cell to contain observations or structural void is introduced by design. In this article, we propose a score test of hypotheses pertaining to the marginal and conditional probabilities in a 2 × 2 table with structural zero via the risk/rate difference measure. Score test‐based confidence interval will also be outlined. We evaluate the performance of the score test and the existing likelihood ratio test. Our empirical results evince the similar and satisfactory performance of the two tests (with appropriate adjustments) in terms of coverage probability and expected interval width. Both tests consistently perform well from small‐ to moderate‐sample designs. The score test however has the advantage that it is only undefined in one scenario while the likelihood ratio test can be undefined in many scenarios. We illustrate our method by a real example from a two‐step tuberculosis skin test study. 相似文献
9.
An automatic bandwidth selector for kernel density estimation 总被引:4,自引:0,他引:4
10.
Melanoma incidence has increased throughout the world over the past 25 years. A surrogate for the severity of melanoma is the Breslow thickness of the lesions. Data on melanoma, including Breslow thickness, were collected in 1978-1980 and 1988-1990 from the Tasmania Tumor Registry. We use a density ratio model to quantify the change of melanoma by Breslow thickness. In this model, the ratio of two densities is assumed to have a known form up to a parameter, but the underlying densities are not modeled. This model includes the length bias sampling model as a special case. The Kolmogorov-Smirnov test statistic is used to test the correctness of the density ratio model. Model-based cumulative distribution estimation is studied. Methodology developed in this article is applied to the Tasmania Tumor Registry data. 相似文献
11.
Likelihood methods for the discrimination problem 总被引:1,自引:0,他引:1
12.
Gerhard Tutz 《Biometrical journal. Biometrische Zeitschrift》1991,33(5):519-527
Direct kernels, due to LAUDER (1983), as an alternative to the indirect kernel method in discriminant analysis are considered. It is shown that direct kernels may be based on any kernel function known in discrete density estimation. The choice of smoothing parameters is based on general loss functions and a family of loss functions which are specific for the discrimination problem is introduced. Examples with distance dependent and distance independent smoothing parameters are given to illustrate the applicability. 相似文献
13.
14.
We assessed complementary log–log (CLL) regression as an alternative statistical model for estimating multivariable‐adjusted prevalence ratios (PR) and their confidence intervals. Using the delta method, we derived an expression for approximating the variance of the PR estimated using CLL regression. Then, using simulated data, we examined the performance of CLL regression in terms of the accuracy of the PR estimates, the width of the confidence intervals, and the empirical coverage probability, and compared it with results obtained from log–binomial regression and stratified Mantel–Haenszel analysis. Within the range of values of our simulated data, CLL regression performed well, with only slight bias of point estimates of the PR and good confidence interval coverage. In addition, and importantly, the computational algorithm did not have the convergence problems occasionally exhibited by log–binomial regression. The technique is easy to implement in SAS (SAS Institute, Cary, NC), and it does not have the theoretical and practical issues associated with competing approaches. CLL regression is an alternative method of binomial regression that warrants further assessment. 相似文献
15.
Multivariate binary discrimination by the kernel method 总被引:10,自引:0,他引:10
16.
J. E. Higgins 《Biometrical journal. Biometrische Zeitschrift》1981,23(2):185-198
A model is discussed for incorporating information from a time-dependent covariable (an intervening event) and covariables independent of time into the analysis of survival data. In the model, it is assumed that individuals are potentially subject to two paths to failure, one including the intervening event and the other not. Additional assumptions are that failure times associated with the two paths are independent and that the time to failure subsequent to the intervening event is dependent on the intervening event time. Allowing the underlying hazard rates for the model to follow a WEIBULL form, use of the model and methods for fitting and hypothesis testing are illustrated by application to a follow-up study involving industrial workers where disability retirement was the intervening event. Extensions of the model to accommodate grouped survival data are presented. 相似文献
17.
18.
We describe existing tests and introduce two new tests concerning the value of a survival function. These tests may be used to construct a confidence interval for the survival probability at a given time or for a quantile of the survival distribution. Simulation studies show that error rates can differ substantially from their nominal values, particularly at survival probabilities close to zero or one. We recommend our new constrained bootstrap test for its good overall performance. 相似文献
19.
