首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
In this work, the development of an Artificial Neural Network (ANN) based soft estimator is reported for the estimation of static-nonlinearity associated with the transducers. Under the realm of ANN based transducer modeling, only two neural models have been suggested to estimate the static-nonlinearity associated with the transducers with quite successful results. The first existing model is based on the concept of a functional link artificial neural network (FLANN) trained with mu-LMS (Least Mean Squares) learning algorithm. The second one is based on the architecture of a single layer linear ANN trained with alpha-LMS learning algorithm. However, both these models suffer from the problem of slow convergence (learning). In order to circumvent this problem, it is proposed to synthesize the direct model of transducers using the concept of a Polynomial-ANN (polynomial artificial neural network) trained with Levenberg-Marquardt (LM) learning algorithm. The proposed Polynomial-ANN oriented transducer model is implemented based on the topology of a single-layer feed-forward back-propagation-ANN. The proposed neural modeling technique provided an extremely fast convergence speed with increased accuracy for the estimation of transducer static nonlinearity. The results of convergence are very stimulating with the LM learning algorithm.  相似文献   

2.
A nonparametric, robust density estimation method is explored for the analysis of right-angle distances from a transect line to the objects sighted. The method is based on the FOURIER series expansion of a probability density function over an interval. With only mild assumptions, a general population density estimator of wide applicability is obtained.  相似文献   

3.
This work proposes a sequential modelling approach using an artificial neural network (ANN) to develop four independent multivariate models that are able to predict the dynamics of biochemical oxygen demand (BOD), chemical oxygen demand (COD), suspended solid (SS), and total nitrogen (TN) removal in a wastewater treatment plant (WWTP). Suitable structures of ANN models were automatically and conveniently optimized by a genetic algorithm rather than the conventional trial and error method. The sequential modelling approach, which is composed of two parts, a process disturbance estimator and a process behaviour predictor, was also presented to develop multivariate dynamic models. In particular, the process disturbance estimator was first employed to estimate the influent quality. The process behaviour predictor then sequentially predicted the effluent quality based on the estimated influent quality from the process disturbance estimator with other process variables. The efficiencies of the developed ANN models with a sequential modelling approach were demonstrated with a practical application using a data set collected from a full-scale WWTP during 2 years. The results show that the ANN with the sequential modelling approach successfully developed multivariate dynamic models of BOD, COD, SS, and TN removal with satisfactory estimation and prediction capability. Thus, the proposed method could be used as a powerful tool for the prediction of complex and nonlinear WWTP performance.  相似文献   

4.
Partial AUC estimation and regression   总被引:2,自引:0,他引:2  
Dodd LE  Pepe MS 《Biometrics》2003,59(3):614-623
Accurate diagnosis of disease is a critical part of health care. New diagnostic and screening tests must be evaluated based on their abilities to discriminate diseased from nondiseased states. The partial area under the receiver operating characteristic (ROC) curve is a measure of diagnostic test accuracy. We present an interpretation of the partial area under the curve (AUC), which gives rise to a nonparametric estimator. This estimator is more robust than existing estimators, which make parametric assumptions. We show that the robustness is gained with only a moderate loss in efficiency. We describe a regression modeling framework for making inference about covariate effects on the partial AUC. Such models can refine knowledge about test accuracy. Model parameters can be estimated using binary regression methods. We use the regression framework to compare two prostate-specific antigen biomarkers and to evaluate the dependence of biomarker accuracy on the time prior to clinical diagnosis of prostate cancer.  相似文献   

5.
MOTIVATION: Data representation and encoding are essential for classification of protein sequences with artificial neural networks (ANN). Biophysical properties are appropriate for low dimensional encoding of protein sequence data. However, in general there is no a priori knowledge of the relevant properties for extraction of representative features. RESULTS: An adaptive encoding artificial neural network (ACN) for recognition of sequence patterns is described. In this approach parameters for sequence encoding are optimized within the same process as the weight vectors by an evolutionary algorithm. The method is applied to the prediction of signal peptide cleavage sites in human secretory proteins and compared with an established predictor for signal peptides. CONCLUSION: Knowledge of physico-chemical properties is not necessary for training an ACN. The advantage is a low dimensional data representation leading to computational efficiency, easy evaluation of the detected features, and high prediction accuracy. Availability: A cleavage site prediction server is located at the Humboldt University http://itb.biologie.hu-berlin.de/ approximately jo/sig-cleave/ACNpredictor.cgi Contact: jo@itb.hu-berlin.de; berndj@zedat.fu-berlin.de  相似文献   

6.
Advances in technology provide new diagnostic tests for early detection of disease. Frequently, these tests have continuous outcomes. One popular method to summarize the accuracy of such a test is the Receiver Operating Characteristic (ROC) curve. Methods for estimating ROC curves have long been available. To examine covariate effects, Pepe (1997, 2000) and Alonzo and Pepe (2002) proposed distribution-free approaches based on a parametric regression model for the ROC curve. Cai and Pepe (2002) extended the parametric ROC regression model by allowing an arbitrary non-parametric baseline function. In this paper, while we follow the same semi-parametric setting as in that paper, we highlight a new estimator that offers several improvements over the earlier work: superior efficiency, the ability to estimate the covariate effects without estimating the non-parametric baseline function and easy implementation with standard software. The methodology is applied to a case control dataset where we evaluate the accuracy of the prostate-specific antigen as a biomarker for early detection of prostate cancer. Simulation studies suggest that the new estimator under the semi-parametric model, while always being more robust, has efficiency that is comparable to or better than the Alonzo and Pepe (2002) estimator from the parametric model.  相似文献   

7.
Fouling and cleaning in heat exchangers are severe and costly (up to 0.3% of gross national product) issues in dairy and food processing. Therefore, reducing cleaning time and cost is urgently needed. In this study, two classification methods [artificial neural network (ANN) and support vector machine (SVM)] for detecting protein and mineral fouling presence and absence based on ultrasonic measurements were presented and compared. ANN is based on a multilayer perceptron feed forward neural network, whereas SVM is based on clustering between fouling and no fouling using a hyperplane. When both fouling types (1239 datasets) were combined, ANN showed an accuracy of 71.9% while SVM displayed an accuracy of 97.6%. Separate fouling detection of mineral/protein fouling by ANN/SVM was comparable: dependent on fouling type detection accuracies of 100% (protein fouling, ANN and SVM), and 98.2% (SVM), and 93.5% (ANN) for mineral fouling was reached. It was shown that it was possible to detect fouling presence and absence offline in a static setup using ultrasonic measurements in combination with a classification method. This study proved the applicability of combining classification methods and fouling measurements to take a step toward reducing cleaning costs and time.  相似文献   

8.
BACKGROUND: Artificial neural networks (ANNs) have been shown to be valuable in the analysis of analytical flow cytometric (AFC) data in aquatic ecology. Automated extraction of clusters is an important first stage in deriving ANN training data from field samples, but AFC data pose a number of challenges for many types of clustering algorithm. The fuzzy k-means algorithm recently has been extended to address nonspherical clusters with the use of scatter matrices. Four variants were proposed, each optimizing a different measure of clustering "goodness." METHODS: With AFC data obtained from marine phytoplankton species in culture, the four fuzzy k-means algorithm variants were compared with each other and with another multivariate clustering algorithm based on critical distances currently used in flow cytometry. RESULTS: One of the algorithm variants (adaptive distances, also known as the Gustafson--Kessel algorithm) was found to be robust and reliable, whereas the others showed various problems. CONCLUSIONS: The adaptive distances algorithm was superior in use to the clustering algorithms against which it was tested, but the problem of automatic determination of the number of clusters remains to be addressed.  相似文献   

9.
This study developed an artificial neural network (ANN) to estimate the growth of microorganisms during a fermentation process. The ANN relies solely on the cumulative consumption of alkali and the buffer capacity, which were measured on-line from the on/off control signal and pH values through automatic pH control. The two input variables were monitored on-line from a series of different batch cultivations and used to train the ANN to estimate biomass. The ANN was refined by optimizing the network structure and by adopting various algorithms for its training. The software estimator successfully generated growth profiles that showed good agreement with the measured biomass of separate batch cultures carried out between at 25 and 35_C.  相似文献   

10.
He X  Yang Z  Tsien JZ 《PloS one》2011,6(5):e20002
Humans can categorize objects in complex natural scenes within 100-150 ms. This amazing ability of rapid categorization has motivated many computational models. Most of these models require extensive training to obtain a decision boundary in a very high dimensional (e.g., ~6,000 in a leading model) feature space and often categorize objects in natural scenes by categorizing the context that co-occurs with objects when objects do not occupy large portions of the scenes. It is thus unclear how humans achieve rapid scene categorization.To address this issue, we developed a hierarchical probabilistic model for rapid object categorization in natural scenes. In this model, a natural object category is represented by a coarse hierarchical probability distribution (PD), which includes PDs of object geometry and spatial configuration of object parts. Object parts are encoded by PDs of a set of natural object structures, each of which is a concatenation of local object features. Rapid categorization is performed as statistical inference. Since the model uses a very small number (~100) of structures for even complex object categories such as animals and cars, it requires little training and is robust in the presence of large variations within object categories and in their occurrences in natural scenes. Remarkably, we found that the model categorized animals in natural scenes and cars in street scenes with a near human-level performance. We also found that the model located animals and cars in natural scenes, thus overcoming a flaw in many other models which is to categorize objects in natural context by categorizing contextual features. These results suggest that coarse PDs of object categories based on natural object structures and statistical operations on these PDs may underlie the human ability to rapidly categorize scenes.  相似文献   

11.
基于人工神经网络的天然林生物量遥感估测   总被引:5,自引:0,他引:5  
基于Landsat TM遥感图像, 以吉林省汪清天然林区为例, 应用B-P神经网络建立了森林生物量非线性遥感模型系统. 除采用遥感数据外, 该系统还引入了地形因子(海拔、坡度、坡向、立地类型等)作为模型自变量. 通过压缩输入数据和增强网络训练学习算法等措施, 对标准B-P神经网络进行了增强. 模型仿真结果表明:增强型B-P神经网络具有收敛速度快和自学习、自适应功能强的特点, 能最大限度地利用样本集的先验知识, 自动提取合理的模型, 模型预测结果能真实合理地反映实际情况. 针叶林、阔叶林和针阔混交林的生物量遥感模型系统仿真结果的平均相对误差分别为-1.47%、2.38%和3.56%, 平均相对误差绝对值分别为6.33%、8.46%和8.91%, 预估效果较理想. 应用该模型系统生成了研究区的森林生物量定量分布图, 其总体精度为88.04%.  相似文献   

12.
In this paper, a novel Artificial Neural-Network (ANN) based multi-sensor multi-band adaptive signal-processing scheme is described for enhancing acoustic-speech corrupted by real noise and reverberation. Numerically robust adaptation-algorithms are employed for the ANN based sub-band filters; and, new simulation experiments are reported using real-reverberant automobile data which demonstrate that the proposed speech-enhancement system is capable of outperforming conventional linear filtering-based wide-band and multi-band noise-cancellation schemes.  相似文献   

13.
A covariance estimator for GEE with improved small-sample properties   总被引:2,自引:0,他引:2  
Mancl LA  DeRouen TA 《Biometrics》2001,57(1):126-134
In this paper, we propose an alternative covariance estimator to the robust covariance estimator of generalized estimating equations (GEE). Hypothesis tests using the robust covariance estimator can have inflated size when the number of independent clusters is small. Resampling methods, such as the jackknife and bootstrap, have been suggested for covariance estimation when the number of clusters is small. A drawback of the resampling methods when the response is binary is that the methods can break down when the number of subjects is small due to zero or near-zero cell counts caused by resampling. We propose a bias-corrected covariance estimator that avoids this problem. In a small simulation study, we compare the bias-corrected covariance estimator to the robust and jackknife covariance estimators for binary responses for situations involving 10-40 subjects with equal and unequal cluster sizes of 16-64 observations. The bias-corrected covariance estimator gave tests with sizes close to the nominal level even when the number of subjects was 10 and cluster sizes were unequal, whereas the robust and jackknife covariance estimators gave tests with sizes that could be 2-3 times the nominal level. The methods are illustrated using data from a randomized clinical trial on treatment for bone loss in subjects with periodontal disease.  相似文献   

14.
The problem of predicting the enzymes and non-enzymes from the protein sequence information is still an open problem in bioinformatics. It is further becoming more important as the number of sequenced information grows exponentially over time. We describe a novel approach for predicting the enzymes and non-enzymes from its amino-acid sequence using artificial neural network (ANN). Using 61 sequence derived features alone we have been able to achieve 79 percent correct prediction of enzymes/non-enzymes (in the set of 660 proteins). For the complete set of 61 parameters using 5-fold cross-validated classification, ANN model reveal a superior model (accuracy = 78.79 plus or minus 6.86 percent, Q(pred) = 74.734 plus or minus 17.08 percent, sensitivity = 84.48 plus or minus 6.73 percent, specificity = 77.13 plus or minus 13.39 percent). The second module of ANN is based on PSSM matrix. Using the same 5-fold cross-validation set, this ANN model predicts enzymes/non-enzymes with more accuracy (accuracy = 80.37 plus or minus 6.59 percent, Q(pred) = 67.466 plus or minus 12.41 percent, sensitivity = 0.9070 plus or minus 3.37 percent, specificity = 74.66 plus or minus 7.17 percent).  相似文献   

15.
Radial basis function (RBF) artificial neural network (ANN) and response surface methodology (RSM) were used to build a predictive model of the combined effects of independent variables (pH, temperature, inoculum volume) for extracellular protease production from a newly isolated Pseudomonas sp. The optimum operating conditions obtained from the quadratic form of the RSM and ANN models were pH 7.6, temperature 38 °C, and inoculum volume of 1.5 with 58.5 U/ml of predicted protease activity within 24 h of incubation. The normalized percentage mean squared error obtained from ANN and RSM models were 0.05 and 0.1%, respectively. The results demonstrated an higher prediction accuracy of ANN compared to RSM. This superiority of ANN over other multi factorial approaches could make this estimation technique a very helpful tool for fermentation monitoring and control.  相似文献   

16.
Causal approaches based on the potential outcome framework providea useful tool for addressing noncompliance problems in randomizedtrials. We propose a new estimator of causal treatment effectsin randomized clinical trials with noncompliance. We use theempirical likelihood approach to construct a profile randomsieve likelihood and take into account the mixture structurein outcome distributions, so that our estimator is robust toparametric distribution assumptions and provides substantialfinite-sample efficiency gains over the standard instrumentalvariable estimator. Our estimator is asymptotically equivalentto the standard instrumental variable estimator, and it canbe applied to outcome variables with a continuous, ordinal orbinary scale. We apply our method to data from a randomizedtrial of an intervention to improve the treatment of depressionamong depressed elderly patients in primary care practices.  相似文献   

17.
Cryptosporidium parvum is a coccidian protozoan parasite capable of infecting a variety of mammalian hosts, and can cause gastro-enteric disease within humans. C. parvum oocysts were stained with varying concentrations of 4', 6 diamidino-2-phenylindole (DAPI). After microscopic observation, objects of interest were captured using a CCD color digital camera. The microscopic images were classified based on their DAPI stain properties as either DAPI positive or negative. Individual oocysts were cropped, converted to grayscale, applied to a binary threshold filter, and were further processed into a numerical data array. DAPI positive and negative images (100 each) were randomly removed for artificial neural network (ANN) testing. The remaining image data were used for ANN training using a commercially available software program. After training experimentation, a final network design was implemented possessing 95 input, 400 hidden, and 2 output neurons. Additional control image sets (ranging from 165 to 119 images) were collected to better ascertain ANN performance. These images consisted of DAPI positive oocysts and two types of DAPI negative images (either formalin treated oocysts or algal cultures). Selected ANN correctly identified, as a range, 82.4–93.8% of the DAPI positive oocysts, 97–98.2% of the DAPI negative oocysts, and 52.9–57% of the DAPI negative algal cells. The control image sets were unique data, never presented during ANN training. Because of this, combined with the high number of correct image identifications for certain image sets, ANN technology may provide a means to identify C. parvum oocysts through automated analysis.  相似文献   

18.
The concept of balanced sampling is applied to prediction in finite samples using model based inference procedures. Necessary and sufficient conditions are derived for a general linear model with arbitrary covariance structure to yield the expansion estimator as the best linear unbiased predictor for the mean. The analysis is extended to produce a robust estimator for the mean squared error under balanced sampling and the results are discussed in the context of statistical genetics where appropriate sampling produces simple efficient and robust genetic predictors free from unnecessary genetic assumptions.  相似文献   

19.
The use of non-invasive genetic sampling to estimate population size in elusive or rare species is increasing. The data generated from this sampling differ from traditional mark-recapture data in that individuals may be captured multiple times within a session or there may only be a single sampling event. To accommodate this type of data, we develop a method, named capwire, based on a simple urn model containing individuals of two capture probabilities. The method is evaluated using simulations of an urn and of a more biologically realistic system where individuals occupy space, and display heterogeneous movement and DNA deposition patterns. We also analyse a small number of real data sets. The results indicate that when the data contain capture heterogeneity the method provides estimates with small bias and good coverage, along with high accuracy and precision. Performance is not as consistent when capture rates are homogeneous and when dealing with populations substantially larger than 100. For the few real data sets where N is approximately known, capwire's estimates are very good. We compare capwire's performance to commonly used rarefaction methods and to two heterogeneity estimators in program capture: Mh-Chao and Mh-jackknife. No method works best in all situations. While less precise, the Chao estimator is very robust. We also examine how large samples should be to achieve a given level of accuracy using capwire. We conclude that capwire provides an improved way to estimate N for some DNA-based data sets.  相似文献   

20.
Estimation of population size with missing zero-class is an important problem that is encountered in epidemiological assessment studies. Fitting a Poisson model to the observed data by the method of maximum likelihood and estimation of the population size based on this fit is an approach that has been widely used for this purpose. In practice, however, the Poisson assumption is seldom satisfied. Zelterman (1988) has proposed a robust estimator for unclustered data that works well in a wide class of distributions applicable for count data. In the work presented here, we extend this estimator to clustered data. The estimator requires fitting a zero-truncated homogeneous Poisson model by maximum likelihood and thereby using a Horvitz-Thompson estimator of population size. This was found to work well, when the data follow the hypothesized homogeneous Poisson model. However, when the true distribution deviates from the hypothesized model, the population size was found to be underestimated. In the search of a more robust estimator, we focused on three models that use all clusters with exactly one case, those clusters with exactly two cases and those with exactly three cases to estimate the probability of the zero-class and thereby use data collected on all the clusters in the Horvitz-Thompson estimator of population size. Loss in efficiency associated with gain in robustness was examined based on a simulation study. As a trade-off between gain in robustness and loss in efficiency, the model that uses data collected on clusters with at most three cases to estimate the probability of the zero-class was found to be preferred in general. In applications, we recommend obtaining estimates from all three models and making a choice considering the estimates from the three models, robustness and the loss in efficiency.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号