Search results for "feature selection"

showing 10 items of 139 documents

Hybrid descriptive-inferential method for key feature selection in prostate cancer radiomics

2021

In healthcare industry 4.0, a big role is played by radiomics. Radiomics concerns the extraction and analysis of quantitative information not visible to the naked eye, even by expert operators, from biomedical images. Radiomics involves the management of digital images as data matrices, with the aim of extracting a number of morphological and predictive variables, named features, using automatic or semi-automatic methods. Multidisciplinary methods as machine learning and deep learning are fully involved in this field. However, the large number of features requires efficient and effective core methods for their selection, in order to avoid bias or misinterpretations problems. In this work, t…

business.industryComputer sciencefeature selection image analysis prostate cancer radiomicsFeature selectionManagement Science and Operations Researchmedicine.diseaseMachine learningcomputer.software_genreprostate cancerGeneral Business Management and AccountingProstate cancerRadiomicsimage analysisradiomicsModeling and SimulationFeature selectionmedicineKey (cryptography)Artificial intelligencebusinesscomputer
researchProduct

<title>Expanding context against weighted voting of classifiers</title>

2000

In the paper we propose a new method to integrate the predictions of multiple classifiers for Data Mining and Machine Learning tasks. The method assumes that each classifier stands in it's own context, and the contexts are partially ordered. The order is defined by monotonous quality function that maps each context to the value from the interval [0,1]. The classifier that has the context with better quality is supposed to predict better than the classifier from worse quality. The objective is to generate the opinion of `virtual' classifier that stands in the context with quality equal to 1. This virtual classifier must have the best accuracy of predictions due to the best context. To do thi…

business.industryComputer sciencemedia_common.quotation_subjectWeighted votingFeature selectionQuadratic classifiercomputer.software_genreMachine learningInformation extractionComputingMethodologies_PATTERNRECOGNITIONKnowledge extractionVotingMargin classifierArtificial intelligencebusinesscomputerClassifier (UML)media_commonSPIE Proceedings
researchProduct

Diversity in search strategies for ensemble feature selection

2005

Ensembles of learnt models constitute one of the main current directions in machine learning and data mining. Ensembles allow us to achieve higher accuracy, which is often not achievable with single models. It was shown theoretically and experimentally that in order for an ensemble to be effective, it should consist of base classifiers that have diversity in their predictions. One technique, which proved to be effective for constructing an ensemble of diverse base classifiers, is the use of different feature subsets, or so-called ensemble feature selection. Many ensemble feature selection strategies incorporate diversity as an objective in the search for the best collection of feature subse…

business.industryContext (language use)Feature selectionMachine learningcomputer.software_genreEnsemble learningMeasure (mathematics)Random subspace methodEnsembles of classifiersComputingMethodologies_PATTERNRECOGNITIONHardware and ArchitectureFeature (computer vision)Signal ProcessingArtificial intelligenceData miningbusinesscomputerSoftwareSelection (genetic algorithm)Information SystemsMathematics
researchProduct

<title>Distance functions in dynamic integration of data mining techniques</title>

2000

One of the most important directions in the improvement of data mining and knowledge discovery is the integration of multiple data mining techniques. An integration method needs to be able either to evaluate and select the most appropriate data mining technique or to combine two or more techniques efficiently. A recent integration method for the dynamic integration of multiple data mining techniques is based on the assumption that each of the data mining techniques is the best one inside a certain subarea of the whole domain area. This method uses an instance-based learning approach to collect information about the competence areas of the mining techniques and applies a distance function to…

business.industryData stream miningComputer scienceFeature selectionMachine learningcomputer.software_genreData modelingInformation extractionKnowledge extractionMetric (mathematics)Artificial intelligenceData miningbusinesscomputerInformation integrationData integrationSPIE Proceedings
researchProduct

Quality based classification of gasoline samples by ATR-FTIR spectrometry using spectral feature selection with quadratic discriminant analysis

2013

Abstract A chemometric approach has been developed for characterization of gasoline samples regarding their quality. Attenuated total reflectance – infrared spectrometric data were processed by genetic algorithm (GA) and successive projection algorithm (SPA) feature selection techniques, being employed as an initial step prior to apply a discriminative tool. It was aimed to classify the fuel samples according to their quality passed/failed data. Chemometric predictive procedures were developed using quadratic discriminant analysis (QDA) combined with GA and SPA as a feature subset and feature selection strategy. Results showed 93.3% and 95.6% accuracy for SPA-QDA and GA-QDA models respectiv…

business.industryGeneral Chemical EngineeringOrganic ChemistryAnalytical chemistryEnergy Engineering and Power TechnologyPattern recognitionFeature selectionQuadratic classifierMass spectrometryFuel TechnologyDiscriminative modelFeature (computer vision)Genetic algorithmArtificial intelligenceGasolinebusinessDykstra's projection algorithmMathematicsFuel
researchProduct

Analyse spectrale et texturale de données à haute résolution pour la détection automatique des maladies de la vigne

2019

‘Flavescence dorée’ is a contagious and incurable disease present on the vine leaves. In order to contain the infection, the regulations require growers to control each of the vine rows and to remove the suspect vine plants. This monitoring is done on foot during the harvest and mobilizes many people during a strategic period for viticulture. In order to solve this problem, the DAMAV project (Automatic detection of Vine Diseases) aims to develop a solution for automated detection of vine diseases using a micro-drone. The goal is to offer a turnkey solution for wine growers. This tool will allow the search for potential foci, and then more generally any type of vine diseases detectable on th…

capteur multispectralmultispectral sensor[SDV]Life Sciences [q-bio]indices de végétationalgorithmes génétiquesgrapevine diseases detectiondétection des maladies de la vignegenetic algorithms[SDV] Life Sciences [q-bio]successive projections algorithmfeature selectionclassificationalgorithmes de projections successivesvegetation indicesanalyse de texturesélection de caractéristiquestexture analysis
researchProduct

Linear Feature Extraction for Ranking

2018

We address the feature extraction problem for document ranking in information retrieval. We then propose LifeRank, a Linear feature extraction algorithm for Ranking. In LifeRank, we regard each document collection for ranking as a matrix, referred to as the original matrix. We try to optimize a transformation matrix, so that a new matrix (dataset) can be generated as the product of the original matrix and a transformation matrix. The transformation matrix projects high-dimensional document vectors into lower dimensions. Theoretically, there could be very large transformation matrices, each leading to a new generated matrix. In LifeRank, we produce a transformation matrix so that the generat…

dimension reductionComputer scienceFeature extractionMathematicsofComputing_NUMERICALANALYSISFeature selectiontiedonhakujärjestelmät02 engineering and technologyLibrary and Information SciencesRanking (information retrieval)Matrix (mathematics)Transformation matrix020204 information systemsalgoritmit0202 electrical engineering electronic engineering information engineeringtiedonhakulearning to rankbusiness.industryfeature extractionPattern recognitionkoneoppiminenPattern recognition (psychology)Benchmark (computing)020201 artificial intelligence & image processingLearning to rankArtificial intelligencebusinessInformation Systems
researchProduct

Prototyping Crop Traits Retrieval Models for CHIME: Dimensionality Reduction Strategies Applied to PRISMA Data

2022

In preparation for new-generation imaging spectrometer missions and the accompanying unprecedented inflow of hyperspectral data, optimized models are needed to generate vegetation traits routinely. Hybrid models, combining radiative transfer models with machine learning algorithms, are preferred, however, dealing with spectral collinearity imposes an additional challenge. In this study, we analyzed two spectral dimensionality reduction methods: principal component analysis (PCA) and band ranking (BR), embedded in a hybrid workflow for the retrieval of specific leaf area (SLA), leaf area index (LAI), canopy water content (CWC), canopy chlorophyll content (CCC), the fraction of absorbed photo…

feature selectionCHIMEactive learningGeneral Earth and Planetary Scienceshybrid methodPRISMAprincipal component analysibiochemical and biophysical traitGaussian process regressionPRISMA; CHIME; hybrid methods; biochemical and biophysical traits; Gaussian process regression; active learning; principal component analysis; feature selectionRemote Sensing
researchProduct

Evaluating similarity measures for gaze patterns in the context of representational competence in physics education

2018

The competent handling of representations is required for understanding physics' concepts, developing problem-solving skills, and achieving scientific expertise. Using eye-tracking methodology, we present the contributions of this paper as follows: We first investigated the preferences of students with the different levels of knowledge; experts, intermediates, and novices, in representational competence in the domain of physics problem-solving. It reveals that experts more likely prefer to use vector than other representations. Besides, a similar tendency of table representation usage was observed in all groups. Also, diagram representation has been used less than others. Secondly, we evalu…

graafinen esitysPhysics educationrepresentational competenceFeature selection02 engineering and technologycomputer.software_genresilmänliikkeetfeature selection0202 electrical engineering electronic engineering information engineeringta516fysiikkaCompetence (human resources)ta113eye-trackingbusiness.industry05 social sciences050301 education020207 software engineeringsimilarity measuresMutual informationLevenshtein distanceGazekatseEye trackingongelmanratkaisugaze patternsArtificial intelligencebusinessphysics0503 educationMaximal information coefficientcomputerNatural language processingProceedings of the 2018 ACM Symposium on Eye Tracking Research & Applications
researchProduct

A New Feature Selection Methodology for K-mers Representation of DNA Sequences

2015

DNA sequence decomposition into k-mers and their frequency counting, defines a mapping of a sequence into a numerical space by a numerical feature vector of fixed length. This simple process allows to compare sequences in an alignment free way, using common similarities and distance functions on the numerical codomain of the mapping. The most common used decomposition uses all the substrings of a fixed length k making the codomain of exponential dimension. This obviously can affect the time complexity of the similarity computation, and in general of the machine learning algorithm used for the purpose of sequence analysis. Moreover, the presence of possible noisy features can also affect the…

k-mers DNA sequence similarity feature selection DNA sequence classification.Settore INF/01 - InformaticaComputer scienceSequence analysisbusiness.industryFeature vectorPattern recognitionFeature selectionDNA sequencingSubstringExponential functionArtificial intelligencebusinessAlgorithmTime complexity
researchProduct