Similar articles
Found 20 similar articles; search took 46 ms.
1.
A decision-in, decision-out fusion architecture can be used to fuse the outputs of multiple classifiers from different diagnostic sources. In this paper, Dempster-Shafer Theory (DST) is used to fuse classification results for breast cancer data from two different sources: gene-expression patterns in peripheral blood cells and Fine-Needle Aspirate Cytology (FNAc) data. Each source is classified individually by a Support Vector Machine (SVM) with linear, polynomial and Radial Basis Function (RBF) kernels. The output beliefs of the classifiers for both data sources are combined to arrive at one final decision. Dynamic uncertainty assessment is based on class differentiation of the breast cancer. Experimental results show that the proposed breast cancer data fusion methodology outperforms single classification models.
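
The fusion step described above can be illustrated with Dempster's rule of combination. This is a minimal sketch, not the authors' implementation: the two-class frame (benign, malignant), the mass values, and the names `dempster_combine`, `m_blood` and `m_fna` are assumptions chosen for illustration, and the abstract does not say how the SVM outputs are converted into belief masses.

```python
from itertools import product


def dempster_combine(m1, m2):
    """Combine two mass functions (dict: frozenset -> mass) with Dempster's rule."""
    combined = {}
    conflict = 0.0
    for (a, mass_a), (b, mass_b) in product(m1.items(), m2.items()):
        intersection = a & b
        if intersection:
            # Agreeing evidence: the product of masses supports the intersection.
            combined[intersection] = combined.get(intersection, 0.0) + mass_a * mass_b
        else:
            # Disjoint hypotheses: the product of masses counts as conflict.
            conflict += mass_a * mass_b
    if conflict >= 1.0:
        raise ValueError("sources are in total conflict; the rule is undefined")
    # Renormalise by the non-conflicting mass.
    return {h: m / (1.0 - conflict) for h, m in combined.items()}


# Hypothetical frame of discernment and classifier beliefs (illustrative values).
BENIGN = frozenset({"benign"})
MALIGNANT = frozenset({"malignant"})
UNCERTAIN = BENIGN | MALIGNANT  # mass assigned to "either class"

m_blood = {MALIGNANT: 0.6, BENIGN: 0.1, UNCERTAIN: 0.3}  # gene-expression classifier
m_fna = {MALIGNANT: 0.7, BENIGN: 0.2, UNCERTAIN: 0.1}    # FNAc classifier

fused = dempster_combine(m_blood, m_fna)
decision = max((BENIGN, MALIGNANT), key=lambda h: fused.get(h, 0.0))
```

With these illustrative masses, both sources lean towards the malignant class, so the fused mass on that class is larger than either source assigns on its own, while the mass left on the uncertain set shrinks.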

2.
Prostate cancer is the most common cancer in men over 50 years of age, and it has been shown that nuclear magnetic resonance spectra are sensitive enough to distinguish normal and cancer tissues. In this paper, we propose a technique for classifying spectra from magnetic resonance spectroscopy. We studied automatic classification with and without quantification of metabolite signals. The dataset comprises 22 patient datasets with biopsy-proven cancer, from which we extracted 2464 spectra from the whole prostate, of which 1062 were localised in the peripheral zone. The spectra were manually classified into 3 categories (undetermined, healthy and pathologic) by a spectroscopist with 4 years' experience in clinical spectroscopy of prostate cancer. We used different preprocessing methods (module, phase correction only, phase correction and baseline correction) as input for a Support Vector Machine and a Multilayer Perceptron, and we compared the results with those from the expert. If we classify only healthy and pathologic spectra, we reach a total error rate of 4.51%. However, if we classify all spectra (undetermined, healthy and pathologic), the total error rate rises to 11.49%. We show that the best results are obtained using the pre-processed spectra without quantification as input for the classifiers, and we confirm that Support Vector Machines are more efficient than Multilayer Perceptrons in processing high-dimensional data.

3.
4.
Phenotypic Up-regulated Gene Support Vector Machine (PUGSVM) is a cancer Biomedical Informatics Grid (caBIG) analytical tool for multiclass gene selection and classification. PUGSVM addresses the problems of imbalanced class separability, small sample size and high gene space dimensionality: multiclass gene markers are defined by the union of one-versus-everyone phenotypic upregulated genes and used by a well-matched one-versus-rest support vector machine. PUGSVM provides a simple yet more accurate strategy to identify statistically reproducible mechanistic marker genes for characterization of heterogeneous diseases. AVAILABILITY: http://www.cbil.ece.vt.edu/caBIG-PUGSVM.htm.

5.
The cancer classification problem is one of the most challenging problems in bioinformatics. The data provided by the Netherlands Cancer Institute consist of 295 breast cancer patients: 101 patients with distant metastases and 194 patients without. We investigate combinations of feature sets based on kernel methods to classify patients with and without distant metastases. The single data set is compared with three data integration strategies and with weighted data integration strategies based on kernel methods. The Least Squares Support Vector Machine (LS-SVM) is chosen as the classifier because it can handle very high-dimensional features such as microarray data. The experimental results show that weighted late integration and the use of microarray data alone perform almost equally well. In this case, the data integration strategy is not always better than using a single data set. Classification performance depends strongly on the features used to represent the object.
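
One common way to weight data sources in a kernel method is to take a weighted sum of per-source kernel matrices before training the classifier. The sketch below shows that idea only; the abstract does not specify the integration formulas used in the study, so the linear kernel, the weights, the toy feature values and the names `linear_kernel` and `combine_kernels` are all assumptions.

```python
def linear_kernel(samples):
    """Gram matrix of pairwise dot products for a list of feature vectors."""
    return [
        [sum(a * b for a, b in zip(xi, xj)) for xj in samples]
        for xi in samples
    ]


def combine_kernels(kernels, weights):
    """Weighted element-wise sum of equally sized kernel matrices."""
    if len(kernels) != len(weights):
        raise ValueError("need exactly one weight per kernel")
    n = len(kernels[0])
    return [
        [sum(w * k[i][j] for k, w in zip(kernels, weights)) for j in range(n)]
        for i in range(n)
    ]


# Two hypothetical views of the same two patients (toy values).
clinical = [[1.0, 0.0], [0.0, 1.0]]
microarray = [[1.0, 2.0], [3.0, 4.0]]

k_clinical = linear_kernel(clinical)      # [[1, 0], [0, 1]]
k_microarray = linear_kernel(microarray)  # [[5, 11], [11, 25]]

# Assumed weights: 25% clinical, 75% microarray.
k_combined = combine_kernels([k_clinical, k_microarray], [0.25, 0.75])
```

The combined matrix can then be passed to any kernel classifier that accepts a precomputed kernel. A weighted sum with non-negative weights of valid kernels is itself a valid kernel, which is why this form of integration is convenient.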

6.
A growing body of evidence concerning estrogen effects cannot be explained by the classic model of hormone action, which involves binding to estrogen receptors (ERs) ERalpha and ERbeta and the interaction of the steroid-receptor complex with specific DNA sequences associated with target genes. Using c-fos proto-oncogene expression as an early molecular sensor of estrogen action in ERalpha-positive MCF7 and ER-negative SKBR3 breast cancer cells, we have discovered that 17beta-estradiol (E2) and the two major phytoestrogens, genistein and quercetin, stimulate c-fos expression through ERalpha as well as in an ER-independent manner via the G protein-coupled receptor homologue GPR30. The c-fos response is repressed in GPR30-expressing SKBR3 cells transfected with an antisense oligonucleotide against GPR30 and reconstituted in GPR30-deficient MDA-MB 231 and BT-20 breast cancer cells transfected with a GPR30 expression vector. GPR30-dependent activation of ERK1/2 by E2 and phytoestrogens occurs via a Gbetagamma-associated pertussis toxin-sensitive pathway that requires both Src-related and EGF receptor tyrosine kinase activities. The ability of E2 and phytoestrogens to regulate the expression of growth-related genes such as c-fos even in the absence of ER has interesting implications for understanding breast cancer progression.

7.
IRBM, 2023, 44(3): 100749
Objective: Breast cancer is the most widespread and invasive cancer type among women. Globally, it is the second leading cause of cancer mortality among women, after lung cancer. This has led researchers to focus on developing effective Computer-Aided Detection (CAD) methodologies for classifying such deadly cancer types. To improve the survival rate and enable earlier diagnosis, an improved CAD methodology that integrates deep learning with metaheuristic and classification algorithms is proposed for the severity classification of breast cancer. Material and Methods: The work aims to classify the severities present in digital mammogram images. The publicly available MIAS, INbreast, and WDBC databases are used for evaluation. Transfer learning is employed to extract features. The novelty of the work lies in improving the classification performance of the weighted k-nearest neighbor (wKNN) algorithm by using particle swarm optimization (PSO), the dragonfly optimization algorithm (DFOA), and the crow-search optimization algorithm (CSOA) as transformation techniques, i.e., transforming non-linear input features into minimal linearly separable feature vectors. Results: The results are compared with those of the Gaussian Naïve Bayes and linear Support Vector Machine algorithms; the proposed CSOA-wKNN attains the highest classification accuracy, with 84.35% for MIAS, 83.19% for INbreast, and 97.36% for WDBC. Conclusion: The results indicate that the proposed Computer-Aided Diagnosis (CAD) tool is robust for the severity classification of breast cancer.
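
For readers unfamiliar with the base classifier, a distance-weighted k-nearest-neighbor vote can be sketched as follows. This is a generic illustration under stated assumptions: inverse-distance weighting, Euclidean distance, the toy feature values and the function name `weighted_knn_predict` are not taken from the paper, and the metaheuristic feature transformation (PSO, DFOA, CSOA) is not reproduced here.

```python
import math
from collections import defaultdict


def weighted_knn_predict(train_x, train_y, query, k=3, eps=1e-9):
    """Predict a label by an inverse-distance-weighted vote of the k nearest points."""
    # Sort all training points by Euclidean distance to the query.
    neighbours = sorted(
        (math.dist(x, query), label) for x, label in zip(train_x, train_y)
    )
    votes = defaultdict(float)
    for distance, label in neighbours[:k]:
        # Closer neighbours contribute more; eps avoids division by zero.
        votes[label] += 1.0 / (distance + eps)
    return max(votes, key=votes.get)


# Toy two-dimensional feature vectors with hypothetical labels.
train_x = [(0.0, 0.0), (0.1, 0.2), (1.0, 1.0), (0.9, 1.1), (1.2, 0.8)]
train_y = ["benign", "benign", "malignant", "malignant", "malignant"]

near_origin = weighted_knn_predict(train_x, train_y, (0.2, 0.1), k=3)
near_cluster = weighted_knn_predict(train_x, train_y, (1.0, 0.9), k=3)
```

Unlike a plain majority vote, the weighting lets one very close neighbour outweigh several distant ones, which is the property an optimised feature transformation would aim to exploit.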

8.
9.
In this paper, we propose a new hybrid method, Co-ABC, which combines the Correlation-based Feature Selection method with the Artificial Bee Colony algorithm to select a small number of relevant genes for accurate classification of gene expression profiles. Co-ABC consists of three cooperating stages. The first stage filters noisy and redundant genes in high-dimensional domains by applying the Correlation-based Feature Selection (CFS) filter method. In the second stage, the Artificial Bee Colony (ABC) algorithm is used to select the informative and meaningful genes. In the third stage, we adopt a Support Vector Machine (SVM) as the classifier, using the genes preselected in the second stage. The overall performance of the proposed Co-ABC algorithm was evaluated using six binary and multi-class cancer gene expression datasets. In addition, to demonstrate the efficiency of Co-ABC, we compare it with previously known related methods. Two of these methods were re-implemented for the sake of a fair comparison using the same parameters: Co-GA, which is CFS combined with a genetic algorithm (GA), and Co-PSO, which is CFS combined with a particle swarm optimization (PSO) algorithm. The experimental results show that the proposed Co-ABC algorithm achieves accurate classification performance using a small number of predictive genes. This indicates that Co-ABC is an efficient approach for biomarker gene discovery using cancer gene expression profiles.

10.
11.
Wang X. Genomics, 2012, 99(2): 90-95
Two-gene classifiers have attracted broad interest for their simplicity and practicality. Most existing two-gene classification algorithms involve exhaustive search, which makes them time-inefficient. In this study, we propose two new two-gene classification algorithms that use a simple univariate gene selection strategy and construct simple classification rules based on optimal cut-points for the two selected genes. We detect the optimal cut-point using the information entropy principle. We applied the two-gene classification models to eleven cancer gene expression datasets and compared their classification performance with that of established two-gene classification models, such as the top-scoring pairs model and the greedy pairs model, as well as standard methods including Diagonal Linear Discriminant Analysis, k-Nearest Neighbor, Support Vector Machine and Random Forest. These comparisons indicate that the performance of our two-gene classifiers is comparable to or better than that of the compared models.
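
Selecting a cut-point by the information entropy principle can be sketched for a single gene as follows: every midpoint between consecutive distinct expression values is tried, and the one that minimises the size-weighted class entropy of the two resulting groups is kept. This is the standard entropy-based discretisation criterion, offered as an assumption about what the abstract means; the function names, labels and expression values are hypothetical.

```python
import math


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    result = 0.0
    for cls in set(labels):
        p = labels.count(cls) / n
        result -= p * math.log2(p)
    return result


def best_cut_point(values, labels):
    """Return (cut, score): the threshold minimising the weighted class entropy."""
    pairs = sorted(zip(values, labels))
    n = len(pairs)
    best_cut, best_score = None, float("inf")
    for i in range(1, n):
        low, high = pairs[i - 1][0], pairs[i][0]
        if low == high:
            continue  # no threshold can separate identical values
        left = [label for _, label in pairs[:i]]
        right = [label for _, label in pairs[i:]]
        score = (len(left) * entropy(left) + len(right) * entropy(right)) / n
        if score < best_score:
            best_cut, best_score = (low + high) / 2, score
    return best_cut, best_score


# Toy expression values for one gene across six samples.
expression = [1.0, 1.5, 2.0, 5.0, 5.5, 6.0]
classes = ["normal", "normal", "normal", "tumor", "tumor", "tumor"]

cut, score = best_cut_point(expression, classes)
```

A two-gene rule in this spirit would apply one such threshold per selected gene and combine the two binary outcomes, which avoids searching over all gene pairs.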

12.
Yuan Z, Burrage K, Mattick JS. Proteins, 2002, 48(3): 566-570
A Support Vector Machine learning system has been trained to predict protein solvent accessibility from the primary structure. Different kernel functions and sliding window sizes have been explored to determine how they affect prediction performance. Using a cut-off threshold of 15% that splits the dataset evenly (an equal number of exposed and buried residues), this method achieved a prediction accuracy of 70.1% for single sequence input and 73.9% for multiple alignment sequence input. The prediction of three or more states of solvent accessibility was also studied and compared with other methods. The prediction accuracies are better than, or comparable to, those obtained by other methods such as neural networks, Bayesian classification, multiple linear regression, and information theory. In addition, our results suggest that this system may be combined with other prediction methods to achieve more reliable results, and that the Support Vector Machine method is a very useful tool for biological sequence analysis.
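
A sliding-window encoding of a protein sequence, of the kind usually fed to such a classifier, can be sketched as below. The one-hot encoding, the padding symbol, the window size of 5 and the names `one_hot` and `window_features` are assumptions for illustration; the abstract states only that several window sizes were explored.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
PAD = "-"                             # placeholder beyond the sequence ends
ALPHABET = AMINO_ACIDS + PAD


def one_hot(residue):
    """One-hot vector of length 21; raises ValueError for unknown symbols."""
    vector = [0] * len(ALPHABET)
    vector[ALPHABET.index(residue)] = 1
    return vector


def window_features(sequence, window=5):
    """Encode each residue by the one-hot vectors of its surrounding window."""
    if window % 2 == 0:
        raise ValueError("window size must be odd so it has a central residue")
    half = window // 2
    padded = PAD * half + sequence + PAD * half
    features = []
    for i in range(len(sequence)):
        row = []
        for residue in padded[i:i + window]:
            row.extend(one_hot(residue))
        features.append(row)
    return features


# One feature row per residue, each of length window * 21.
rows = window_features("MKVLA", window=5)
```

Each row describes the central residue together with its neighbours, so the classifier can label that residue as exposed or buried from local sequence context. Larger windows add context at the cost of higher dimensionality.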

13.
While agents targeting estrogen receptors are most effective in post-surgical adjuvant therapy for human breast cancers expressing estrogen receptors, breast cancers lacking estrogen receptors are clinically serious because they are highly malignant and resistant to the usual anti-cancer drugs, including estrogen receptor antagonists and DNA-breaking agents. Here, we found that MX-1, a human breast cancer cell line lacking estrogen receptors, exhibited higher AP-1 activity and expressed higher levels of c-Jun, c-Fos, and Fra-1 than conventional estrogen receptor-positive human breast cancer cell lines. The prenylphenol antibiotic ascochlorin suppressed the AP-1 activity of MX-1 cells and selectively killed MX-1 cells, partly through induction of apoptosis. Our results suggest that AP-1 is an effective clinical target molecule for the treatment of estrogen receptor-negative human breast cancer.

14.
MOTIVATION: High-density DNA microarrays measure the activities of several thousand genes simultaneously, and gene expression profiles have recently been used for cancer classification. This new approach promises to give better therapeutic measurements to cancer patients by diagnosing cancer types with improved accuracy. The Support Vector Machine (SVM) is one of the classification methods successfully applied to cancer diagnosis problems. However, its optimal extension to more than two classes was not obvious, which might limit its application to multiple tumor types. We briefly introduce the Multicategory SVM (MSVM), a recently proposed extension of the binary SVM, and apply it to multiclass cancer diagnosis problems. RESULTS: Its applicability is demonstrated on the leukemia data (Golub et al., 1999) and the small round blue cell tumors of childhood data (Khan et al., 2001). The comparable classification accuracy shown in these applications and its flexibility render the MSVM a viable alternative to other classification methods. SUPPLEMENTARY INFORMATION: http://www.stat.ohio-state.edu/~yklee/msvm.htm

15.
In this paper, the recently developed Extreme Learning Machine (ELM) is used for direct multicategory classification problems in the cancer diagnosis area. ELM avoids problems commonly faced by iterative learning methods, such as local minima, improper learning rates and overfitting, and completes training very quickly. We have evaluated the multicategory classification performance of ELM on three benchmark microarray datasets for cancer diagnosis: the GCM dataset, the Lung dataset and the Lymphoma dataset. The results indicate that ELM produces comparable or better classification accuracies with reduced training time and implementation complexity compared to artificial neural network methods, such as conventional back-propagation ANN and Linder's SANN, and Support Vector Machine methods, such as SVM-OVO and Ramaswamy's SVM-OVA. ELM also achieves better accuracies for classification of individual categories.

16.
17.
Most physiological and biological processes are regulated by endogenous circadian rhythms under the control of both a master clock, which acts systemically, and individual cellular clocks, which act at the single-cell level. The cellular clock is based on a network of core clock genes, which drive the circadian expression of non-clock genes involved in many cellular processes. Circadian deregulation of gene expression has emerged to be as important as deregulation of estrogen signaling in breast tumorigenesis. Whether there is a mutual deregulation of circadian and hormone signaling is the question that we address in this study. Here we show that, upon entrainment by serum shock, cultured human mammary epithelial cells maintain an inner circadian oscillator, with key clock genes oscillating in a circadian fashion. In the same cells, the expression of the estrogen receptor α (ERA) gene also oscillates in a circadian fashion. In contrast, ERA-positive and ERA-negative breast cancer epithelial cells show disruption of the inner clock. Further, ERA-positive breast cancer cells do not display circadian oscillation of ERA expression. Our findings suggest that estrogen signaling could be affected not only in ERA-negative breast cancer, but also in ERA-positive breast cancer due to lack of circadian availability of ERA. Entrainment of the inner clock of breast epithelial cells, by taking into consideration the biological time component, provides a novel tool to test mechanistically whether defective circadian mechanisms can affect hormone signaling relevant to breast cancer. Key words: circadian rhythm, clock genes, estrogen receptor alpha (ERA), breast cancer cells, entrainment, serum shock

18.
Automatic text categorization is one of the key techniques in information retrieval and data mining. Classification is usually time-consuming when the training dataset is large and high-dimensional. Many methods have been proposed to solve this problem, but few achieve satisfactory efficiency. In this paper, we present a method that combines the Latent Dirichlet Allocation (LDA) algorithm and the Support Vector Machine (SVM). LDA is first used to generate a reduced-dimensional topic representation that serves as the features in the vector space model (VSM). It reduces the number of features dramatically while keeping the necessary semantic information. The SVM is then employed to classify the data based on the generated features. We evaluate the algorithm on the 20 Newsgroups and Reuters-21578 datasets. The experimental results show that classification based on the proposed LDA+SVM model achieves high performance in terms of precision, recall and F1 measure, and does so in a much shorter time. Our process improves greatly upon previous work in this field and shows strong potential for a streamlined classification process across a wide range of applications.

19.
20.