首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 109 毫秒
1.
In this paper, we present an effective and efficient diagnosis system based on particle swarm optimization (PSO) enhanced fuzzy k-nearest neighbor (FKNN) for Parkinson's disease (PD) diagnosis. In the proposed system, named PSO–FKNN, both the continuous version and binary version of PSO were used to perform the parameter optimization and feature selection simultaneously. On the one hand, the neighborhood size k and the fuzzy strength parameter m in FKNN classifier are adaptively specified by the continuous PSO. On the other hand, binary PSO is utilized to choose the most discriminative subset of features for prediction. The effectiveness of the PSO–FKNN model has been rigorously evaluated against the PD data set in terms of classification accuracy, sensitivity, specificity and the area under the receiver operating characteristic (ROC) curve (AUC). Compared to the existing methods in previous studies, the proposed system has achieved the highest classification accuracy reported so far via 10-fold cross-validation analysis, with the mean accuracy of 97.47%. Promisingly, the proposed diagnosis system might serve as a new candidate of powerful tools for diagnosing PD with excellent performance.  相似文献   

2.
The focus of research in swarm intelligence has been largely on the algorithmic side with relatively little attention being paid to the study of problems and the behaviour of algorithms in relation to problems. When a new algorithm or variation on an existing algorithm is proposed in the literature, there is seldom any discussion or analysis of algorithm weaknesses and on what kinds of problems the algorithm is expected to fail. Fitness landscape analysis is an approach that can be used to analyse optimisation problems. By characterising problems in terms of fitness landscape features, the link between problem types and algorithm performance can be studied. This article investigates a number of measures for analysing the ability of a search process to improve fitness on a particular problem (called evolvability in literature but referred to as searchability in this study to broaden the scope to non-evolutionary-based search techniques). A number of existing fitness landscape analysis techniques originally proposed for discrete problems are adapted to work in continuous search spaces. For a range of benchmark problems, the proposed searchability measures are viewed alongside performance measures for a traditional global best particle swarm optimisation (PSO) algorithm. Empirical results show that no single measure can be used as a predictor of PSO performance, but that multiple measures of different fitness landscape features can be used together to predict PSO failure.  相似文献   

3.
This paper presents a new direct method for estimating the average center of rotation (CoR). An existing least-squares (LS) solution has been shown by previous works to have reduced accuracy for data with small range of motion (RoM). Alternative methods proposed to improve the CoR estimation use iterative algorithms. However, in this paper we show that with a carefully chosen normalization scheme, constrained least-squares solutions can perform as well as iterative approaches, even for challenging problems with significant noise and small RoM. In particular, enforcing the normalization constraint avoids poor fits near plane singularities that can affect the existing LS method. Our formulation has an exact solution, accounts for multiple markers simultaneously, and does not depend on manually-adjusted parameters. Simulation tests compare the method to four published CoR estimation techniques. The results show that the new approach has the accuracy of the iterative methods as well as the short computation time and repeatability of a least-squares solution. In addition, application of the new method to experimental motion capture data of the thumb carpometacarpal (CMC) joint yielded a more plausible CoR location compared to the previously reported LS solution and required less time than all four alternative techniques.  相似文献   

4.
This study aims at optimizing the culture conditions (agitation speed, temperature and pH) of the Pleuromutilin production by Pleurotus mutilus. A hybrid methodology including a central composite design (CCD), an artificial neural network (ANN), and a particle swarm optimization algorithm (PSO) was used. Specifically, the CCD and ANN were used for conducting experiments and modeling the non-linear process, respectively. The PSO was used for two purposes: Replacing the standard back propagation in training the ANN (PSONN) and optimizing the process. In comparison to the response surface methodology (RSM) and to the Bayesian regularization neural network (BRNN), PSONN model has shown the highest modeling ability. Under this hybrid approach (PSONN-PSO), the optimum levels of culture conditions were: 242 rpm agitation speed; temperature 26.88 and pH 6.06. A production of 10,074 ± 500 ??g/g, which was in very good agreement with the prediction (10,149 ??g/g), was observed in verification experiment. The hybrid PSONN-PSO gave a yield of 27.5% greater than that obtained by the hybrid BRNN-PSO. This work shows that the combination of PSONN with the generic PSO algorithm has a good predictability and a good accuracy for bio-process optimization. This hybrid approach is sufficiently general and thus can be helpful for modeling and optimization of other industrial bio-processes.  相似文献   

5.
In recent years, significant effort has been given to predicting protein functions from protein interaction data generated from high throughput techniques. However, predicting protein functions correctly and reliably still remains a challenge. Recently, many computational methods have been proposed for predicting protein functions. Among these methods, clustering based methods are the most promising. The existing methods, however, mainly focus on protein relationship modeling and the prediction algorithms that statically predict functions from the clusters that are related to the unannotated proteins. In fact, the clustering itself is a dynamic process and the function prediction should take this dynamic feature of clustering into consideration. Unfortunately, this dynamic feature of clustering is ignored in the existing prediction methods. In this paper, we propose an innovative progressive clustering based prediction method to trace the functions of relevant annotated proteins across all clusters that are generated through the progressive clustering of proteins. A set of prediction criteria is proposed to predict functions of unannotated proteins from all relevant clusters and traced functions. The method was evaluated on real protein interaction datasets and the results demonstrated the effectiveness of the proposed method compared with representative existing methods.  相似文献   

6.
The inhibitory effects of tetrahydrolipstatin (THL) on the hydrolytic activity of human pancreatic lipase (HPL) and T. lanuginosa lipase (TLL) on various lipidic substrates ‘poisoned’ with THL as previously described was studied, using either the pH-stat, monomolecular film or oil drop technique.Prior to adding lipase (method C), an ethanolic solution of THL was injected in a tributyrin (TC4) or a purified soybean oil (PSO) emulsion prepared in a pH-stat vessel. Under these conditions, THL was found to be a potent HPL inhibitor. After being dissolved in the pure triglyceride phase (method D), THL also strongly inhibited HPL. However, with TC4 as substrate TLL was efficiently inhibited by THL only when method C was used and not method D. The very different inhibitory effects on HPL and TLL recorded with method D and PSO as substrate were confirmed using the monomolecular film and oil drop techniques.With a monomolecular film of dicaprin (di-C10) as substrate, 1 molecule of THL embedded in 400 000 molecules of di-C10 sufficed to reduce the HPL activity to half of its initial value.HPL was therefore efficiently inhibited by THL with all the methods and substrates tested here. Paradoxically, TLL was inhibited by THL molecules transiently present in the aqueous phase and not by the THL molecules present at the triglyceride/water interface. It should therefore be stressed that the inhibitory effects of THL on each lipase depend strongly on the method and the substrate used.  相似文献   

7.
This paper proposes a fault diagnosis methodology for a gear pump based on the ensemble empirical mode decomposition (EEMD) method and the Bayesian network. Essentially, the presented scheme is a multi-source information fusion based methodology. Compared with the conventional fault diagnosis with only EEMD, the proposed method is able to take advantage of all useful information besides sensor signals. The presented diagnostic Bayesian network consists of a fault layer, a fault feature layer and a multi-source information layer. Vibration signals from sensor measurement are decomposed by the EEMD method and the energy of intrinsic mode functions (IMFs) are calculated as fault features. These features are added into the fault feature layer in the Bayesian network. The other sources of useful information are added to the information layer. The generalized three-layer Bayesian network can be developed by fully incorporating faults and fault symptoms as well as other useful information such as naked eye inspection and maintenance records. Therefore, diagnostic accuracy and capacity can be improved. The proposed methodology is applied to the fault diagnosis of a gear pump and the structure and parameters of the Bayesian network is established. Compared with artificial neural network and support vector machine classification algorithms, the proposed model has the best diagnostic performance when sensor data is used only. A case study has demonstrated that some information from human observation or system repair records is very helpful to the fault diagnosis. It is effective and efficient in diagnosing faults based on uncertain, incomplete information.  相似文献   

8.
An important goal of DNA microarray research is to develop tools to diagnose cancer more accurately based on the genetic profile of a tumor. There are several existing techniques in the literature for performing this type of diagnosis. Unfortunately, most of these techniques assume that different subtypes of cancer are already known to exist. Their utility is limited when such subtypes have not been previously identified. Although methods for identifying such subtypes exist, these methods do not work well for all datasets. It would be desirable to develop a procedure to find such subtypes that is applicable in a wide variety of circumstances. Even if no information is known about possible subtypes of a certain form of cancer, clinical information about the patients, such as their survival time, is often available. In this study, we develop some procedures that utilize both the gene expression data and the clinical data to identify subtypes of cancer and use this knowledge to diagnose future patients. These procedures were successfully applied to several publicly available datasets. We present diagnostic procedures that accurately predict the survival of future patients based on the gene expression profile and survival times of previous patients. This has the potential to be a powerful tool for diagnosing and treating cancer.  相似文献   

9.
To predict rice blast, many machine learning methods have been proposed. As the quality and quantity of input data are essential for machine learning techniques, this study develops three artificial neural network (ANN)-based rice blast prediction models by combining two ANN models, the feed-forward neural network (FFNN) and long short-term memory (LSTM), with diverse input datasets, and compares their performance. The Blast_Weather_FFNN model had the highest recall score (66.3%) for rice blast prediction. This model requires two types of input data: blast occurrence data for the last 3 years and weather data (daily maximum temperature, relative humidity, and precipitation) between January and July of the prediction year. This study showed that the performance of an ANN-based disease prediction model was improved by applying suitable machine learning techniques together with the optimization of hyperparameter tuning involving input data. Moreover, we highlight the importance of the systematic collection of long-term disease data.  相似文献   

10.
In this work, the development of an Artificial Neural Network (ANN) based soft estimator is reported for the estimation of static-nonlinearity associated with the transducers. Under the realm of ANN based transducer modeling, only two neural models have been suggested to estimate the static-nonlinearity associated with the transducers with quite successful results. The first existing model is based on the concept of a functional link artificial neural network (FLANN) trained with mu-LMS (Least Mean Squares) learning algorithm. The second one is based on the architecture of a single layer linear ANN trained with alpha-LMS learning algorithm. However, both these models suffer from the problem of slow convergence (learning). In order to circumvent this problem, it is proposed to synthesize the direct model of transducers using the concept of a Polynomial-ANN (polynomial artificial neural network) trained with Levenberg-Marquardt (LM) learning algorithm. The proposed Polynomial-ANN oriented transducer model is implemented based on the topology of a single-layer feed-forward back-propagation-ANN. The proposed neural modeling technique provided an extremely fast convergence speed with increased accuracy for the estimation of transducer static nonlinearity. The results of convergence are very stimulating with the LM learning algorithm.  相似文献   

11.
The authors discuss the potentialities of CT in the diagnosis of endophytic stomach cancer. They proposed a method for CT of the stomach based on a pneumatic study of its lumen via a nasogastric probe with graded inflation and repeated CT imaging. The major CT-semiotics of endophytic stomach cancer were obtained. This method should be employed in combination with the existing routine methods of stomach cancer diagnosis, and its efficacy is in direct relation to the quality of preliminary routine x-ray and endoscopic investigations.  相似文献   

12.
BACKGROUND: A variety of methods for prediction of peptide binding to major histocompatibility complex (MHC) have been proposed. These methods are based on binding motifs, binding matrices, hidden Markov models (HMM), or artificial neural networks (ANN). There has been little prior work on the comparative analysis of these methods. MATERIALS AND METHODS: We performed a comparison of the performance of six methods applied to the prediction of two human MHC class I molecules, including binding matrices and motifs, ANNs, and HMMs. RESULTS: The selection of the optimal prediction method depends on the amount of available data (the number of peptides of known binding affinity to the MHC molecule of interest), the biases in the data set and the intended purpose of the prediction (screening of a single protein versus mass screening). When little or no peptide data are available, binding motifs are the most useful alternative to random guessing or use of a complete overlapping set of peptides for selection of candidate binders. As the number of known peptide binders increases, binding matrices and HMM become more useful predictors. ANN and HMM are the predictive methods of choice for MHC alleles with more than 100 known binding peptides. CONCLUSION: The ability of bioinformatic methods to reliably predict MHC binding peptides, and thereby potential T-cell epitopes, has major implications for clinical immunology, particularly in the area of vaccine design.  相似文献   

13.
Xiong Y  Liu J  Wei DQ 《Proteins》2011,79(2):509-517
Proteins that interact with DNA play vital roles in all mechanisms of gene expression and regulation. In order to understand these activities, it is crucial to analyze and identify DNA-binding residues on DNA-binding protein surfaces. Here, we proposed two novel features B-factor and packing density in combination with several conventional features to characterize the DNA-binding residues in a well-constructed representative dataset of 119 protein-DNA complexes from the Protein Data Bank (PDB). Based on the selected features, a prediction model for DNA-binding residues was constructed using support vector machine (SVM). The predictor was evaluated using a 5-fold cross validation on above dataset of 123 DNA-binding proteins. Moreover, two independent datasets of 83 DNA-bound protein structures and their corresponding DNA-free forms were compiled. The B-factor and packing density features were statistically analyzed on these 83 pairs of holo-apo proteins structures. Finally, we developed the SVM model to accurately predict DNA-binding residues on protein surface, given the DNA-free structure of a protein. Results showed here indicate that our method represents a significant improvement of previously existing approaches such as DISPLAR. The observation suggests that our method will be useful in studying protein-DNA interactions to guide consequent works such as site-directed mutagenesis and protein-DNA docking.  相似文献   

14.

Background

CpG islands have been demonstrated to influence local chromatin structures and simplify the regulation of gene activity. However, the accurate and rapid determination of CpG islands for whole DNA sequences remains experimentally and computationally challenging.

Methodology/Principal Findings

A novel procedure is proposed to detect CpG islands by combining clustering technology with the sliding-window method (PSO-based). Clustering technology is used to detect the locations of all possible CpG islands and process the data, thus effectively obviating the need for the extensive and unnecessary processing of DNA fragments, and thus improving the efficiency of sliding-window based particle swarm optimization (PSO) search. This proposed approach, named ClusterPSO, provides versatile and highly-sensitive detection of CpG islands in the human genome. In addition, the detection efficiency of ClusterPSO is compared with eight CpG island detection methods in the human genome. Comparison of the detection efficiency for the CpG islands in human genome, including sensitivity, specificity, accuracy, performance coefficient (PC), and correlation coefficient (CC), ClusterPSO revealed superior detection ability among all of the test methods. Moreover, the combination of clustering technology and PSO method can successfully overcome their respective drawbacks while maintaining their advantages. Thus, clustering technology could be hybridized with the optimization algorithm method to optimize CpG island detection.

Conclusion/Significance

The prediction accuracy of ClusterPSO was quite high, indicating the combination of CpGcluster and PSO has several advantages over CpGcluster and PSO alone. In addition, ClusterPSO significantly reduced implementation time.  相似文献   

15.
The purpose of this study was to develop a method of classifying cancers to specific diagnostic categories based on their gene expression signatures using artificial neural networks (ANNs). We trained the ANNs using the small, round blue-cell tumors (SRBCTs) as a model. These cancers belong to four distinct diagnostic categories and often present diagnostic dilemmas in clinical practice. The ANNs correctly classified all samples and identified the genes most relevant to the classification. Expression of several of these genes has been reported in SRBCTs, but most have not been associated with these cancers. To test the ability of the trained ANN models to recognize SRBCTs, we analyzed additional blinded samples that were not previously used for the training procedure, and correctly classified them in all cases. This study demonstrates the potential applications of these methods for tumor diagnosis and the identification of candidate targets for therapy.  相似文献   

16.
This work proposes a sequential modelling approach using an artificial neural network (ANN) to develop four independent multivariate models that are able to predict the dynamics of biochemical oxygen demand (BOD), chemical oxygen demand (COD), suspended solid (SS), and total nitrogen (TN) removal in a wastewater treatment plant (WWTP). Suitable structures of ANN models were automatically and conveniently optimized by a genetic algorithm rather than the conventional trial and error method. The sequential modelling approach, which is composed of two parts, a process disturbance estimator and a process behaviour predictor, was also presented to develop multivariate dynamic models. In particular, the process disturbance estimator was first employed to estimate the influent quality. The process behaviour predictor then sequentially predicted the effluent quality based on the estimated influent quality from the process disturbance estimator with other process variables. The efficiencies of the developed ANN models with a sequential modelling approach were demonstrated with a practical application using a data set collected from a full-scale WWTP during 2 years. The results show that the ANN with the sequential modelling approach successfully developed multivariate dynamic models of BOD, COD, SS, and TN removal with satisfactory estimation and prediction capability. Thus, the proposed method could be used as a powerful tool for the prediction of complex and nonlinear WWTP performance.  相似文献   

17.
This paper investigates fault diagnosis in batch processes and presents a comparative study of feature extraction and classification techniques applied to a specific biotechnological case study: the fermentation process model by Birol et al. (Comput Chem Eng 26:1553–1565, 2002), which is a benchmark for advanced batch processes monitoring, diagnosis and control. Fault diagnosis is achieved using four approaches on four different process scenarios based on the different levels of noise so as to evaluate their effects on the performance. Each approach combines a feature extraction method, either multi-way principal component analysis (MPCA) or multi-way independent component analysis (MICA), with a classification method, either artificial neural network (ANN) or support vector machines (SVM). The performance obtained by the different approaches is assessed and discussed for a set of simulated faults under different scenarios. One of the faults (a loss in mixing power) could not be detected due to the minimal effect of mixing on the simulated data. The remaining faults could be easily diagnosed and the subsequent discussion provides practical insight into the selection and use of the available techniques to specific applications. Irrespective of the classification algorithm, MPCA renders better results than MICA, hence the diagnosis performance proves to be more sensitive to the selection of the feature extraction technique.  相似文献   

18.
Several methods to detect faults have been developed in various fields, mainly in chemical and process engineering. However, minimal practical guidelines exist for their selection and application. This work presents an index that allows for evaluating monitoring and diagnosis performance of fault detection methods, which takes into account several characteristics, such as false alarms, false acceptance, and undesirable switching from correct detection to non-detection during a fault event. The usefulness of the index to process engineering is demonstrated first by application to a simple example. Then, it is used to compare five univariate fault detection methods (Shewhart, EWMA, and residuals of EWMA) applied to the simulated results of the Benchmark Simulation Model No. 1 long-term (BSM1_LT). The BSM1_LT, provided by the IWA Task Group on Benchmarking of Control Strategies, is a simulation platform that allows for creating sensor and actuator faults and process disturbances in a wastewater treatment plant. The results from the method comparison using BSM1_LT show better performance to detect a sensor measurement shift for adaptive methods (residuals of EWMA) and when monitoring the actuator signals in a control loop (e.g., airflow). Overall, the proposed index is able to screen fault detection methods.  相似文献   

19.
Particle Swarm Optimization (PSO) is a stochastic optimization approach that originated from simulations of bird flocking, and that has been successfully used in many applications as an optimization tool. Estimation of distribution algorithms (EDAs) are a class of evolutionary algorithms which perform a two-step process: building a probabilistic model from which good solutions may be generated and then using this model to generate new individuals. Two distinct research trends that emerged in the past few years are the hybridization of PSO and EDA algorithms and the parallelization of EDAs to exploit the idea of exchanging the probabilistic model information. In this work, we propose the use of a cooperative PSO/EDA algorithm based on the exchange of heterogeneous probabilistic models. The model is heterogeneous because the cooperating PSO/EDA algorithms use different methods to sample the search space. Three different exchange approaches are tested and compared in this work. In all these approaches, the amount of information exchanged is adapted based on the performance of the two cooperating swarms. The performance of the cooperative model is compared to the existing state-of-the-art PSO cooperative approaches using a suite of well-known benchmark optimization functions.  相似文献   

20.
In recent years the use of fuzzy clustering techniques in medical diagnosis is increasing steadily, because of the effectiveness of fuzzy clustering techniques in recognizing the systems in the medical database to help medical experts in diagnosing diseases. This study focuses on clustering lung cancer dataset into three types of cancers which are leading cause of cancer death in the world. This paper invents effective fuzzy clustering techniques by incorporating hyper tangent kernel function, and entropy methods for analyzing the Lung Cancer database to assist physician in diagnosing lung cancer. Further this paper proposes an algorithm to initialize the cluster centers to speed up the process of the algorithms. The effectiveness of the proposed methods has been proved through the experimental works on synthetic dataset, Wine dataset and IRIS dataset in terms of running time, number of iterations, visual segmentation effects and clustering accuracy. And then this paper proposes the proposed method on Lung cancer database to divide it into three types of lung cancers. In addition this paper proves the superiority of the proposed methods by comparing the obtained classes with reference classes through Error Matrix.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号