Search results for "Feature selection"
showing 10 items of 139 documents
Search strategies for ensemble feature selection in medical diagnostics
2003
The goal of this paper is to propose, evaluate, and compare four search strategies for ensemble feature selection, and to consider their application to medical diagnostics, with a focus on the problem of the classification of acute abdominal pain. Ensembles of learnt models constitute one of the main current directions in machine learning and data mining. Ensembles allow us to get higher accuracy, sensitivity, and specificity, which are often not achievable with single models. One technique, which proved to be effective for ensemble construction, is feature selection. Lately, several strategies for ensemble feature selection were proposed, including random subspacing, hill-climbing-based se…
Detecting global and local hippocampal shape changes in Alzheimer's disease using statistical shape models.
2012
Item does not contain fulltext The hippocampus is affected at an early stage in the development of Alzheimer's disease (AD). With the use of structural magnetic resonance (MR) imaging, we can investigate the effect of AD on the morphology of the hippocampus. The hippocampal shape variations among a population can be usually described using statistical shape models (SSMs). Conventional SSMs model the modes of variations among the population via principal component analysis (PCA). Although these modes are representative of variations within the training data, they are not necessarily discriminative on labeled data or relevant to the differences between the subpopulations. We use the shape des…
Computational Methods in Developing Quantitative Structure-Activity Relationships (QSAR): A Review
2006
Virtual filtering and screening of combinatorial libraries have recently gained attention as methods complementing the high-throughput screening and combinatorial chemistry. These chemoinformatic techniques rely heavily on quantitative structure-activity relationship (QSAR) analysis, a field with established methodology and successful history. In this review, we discuss the computational methods for building QSAR models. We start with outlining their usefulness in high-throughput screening and identifying the general scheme of a QSAR model. Following, we focus on the methodologies in constructing three main components of QSAR model, namely the methods for describing the molecular structure …
Analysis of compatibility between lighting devices and descriptive features using Parzen’s kernel: application to flaw inspection by artificial vision
2000
We present a supervised method, developed for industrial inspections by artificial vision, to obtain an adapted combination of descriptive features and a lighting device. This method must be implemented under real-time constraints and therefore a minimal number of features must be selected. The method is based on the assessment of the discrimination power of many descriptive features. The objective is to select the combination of descriptive features and lighting system best able to discriminate flawed classes from defect-free classes. In the first step, probability densities are computed for flawed and defect-free classes and for each tested combination. The discrimination power of the fea…
Assessment of the statistical significance of classifications in infrared spectroscopy based diagnostic models.
2014
Fourier transform infrared (IR) spectroscopy in combination with multivariate data analysis is a versatile tool that can be applied to disease diagnosis. However, a rigorous validation of the obtained models is necessary in order to obtain robust results. This work evaluates the advantages of the use of permutation testing for determining the statistical significance of the misclassification errors obtained from IR based diagnostic models through cross validation (CV). The model performance, estimated by CV, is compared to a distribution of CV-performance values obtained using randomly permuted class labels. The distribution of ‘random CV-values’ is considered as a null distribution and use…
Optimization of the KNN Supervised Classification Algorithm as a Support Tool for the Implantation of Deep Brain Stimulators in Patients with Parkins…
2019
Deep Brain Stimulation (DBS) of the Subthalamic Nuclei (STN) is the most used surgical treatment to improve motor skills in patients with Parkinson&rsquo
Quantification of the heterogeneity of prognostic cellular biomarkers in ewing sarcoma using automated image and random survival forest analysis
2014
Driven by genomic somatic variation, tumour tissues are typically heterogeneous, yet unbiased quantitative methods are rarely used to analyse heterogeneity at the protein level. Motivated by this problem, we developed automated image segmentation of images of multiple biomarkers in Ewing sarcoma to generate distributions of biomarkers between and within tumour cells. We further integrate high dimensional data with patient clinical outcomes utilising random survival forest (RSF) machine learning. Using material from cohorts of genetically diagnosed Ewing sarcoma with EWSR1 chromosomal translocations, confocal images of tissue microarrays were segmented with level sets and watershed algorithm…
Support vector machine classification of brain metastasis and radiation necrosis based on texture analysis in MRI
2015
Purpose To develop a classification model using texture features and support vector machine in contrast-enhanced T1-weighted images to differentiate between brain metastasis and radiation necrosis. Methods Texture features were extracted from 115 lesions: 32 of them previously diagnosed as radiation necrosis, 23 as radiation-treated metastasis and 60 untreated metastases; including a total of 179 features derived from six texture analysis methods. A feature selection technique based on support vector machine was used to obtain a subset of features that provide optimal performance. Results The highest classification accuracy evaluated over test sets was achieved with a subset of ten features…
Hyperspectral system for early detection of rottenness caused by Penicillium digitatum in mandarins
2008
Abstract Nowadays, the detection of fruit infected with Penicillium sp. fungi on packing lines is carried out manually under ultraviolet illumination. Ultraviolet sources induce visible fluorescence of essential oils, present in the skin of citrus and which are released by the action of fungi, thus increasing the contrast between sound and rotten skin. This work analyses a set of techniques aimed at detecting rotten citrus without the use of UV lighting. The techniques used include hyperspectral image acquisition, pre-processing and calibration, feature selection and segmentation using linear and non-linear methods for classification of fruits. Different methods such as correlation analysis…
The Prediction of Human Intestinal Absorption Based on the Molecular Structure
2014
Human Intestinal Absorption (HIA) has been modeled many times by using classification models. However, regression models are scarce. Here, Artificial Neural Networks (ANNs) are implemented for this purpose. A dataset of structurally diverse chemicals with their respective experimental HIA were used to design robust, true predictive and widespread applicable ANN models. An input variables pool was made up of structural invariants calculated by using either Dragon or our software Desmol 1. The selection of best variables was performed following three steps using the entire dataset of molecules. Firstly, variables poorly correlated with the experimental data were eliminated. Secondly, input va…