Search results for "feature selection"

showing 10 items of 139 documents

An Improved Decision System for URL Accesses Based on a Rough Feature Selection Technique

2015

Corporate security is usually one of the matters in which companies invest more resources, since the loss of information directly translates into monetary losses. Security issues might have an origin in external attacks or internal security failures, but an important part of the security breaches is related to the lack of awareness that the employees have with regard to the use of the Web. In this work we have focused on the latter problem, describing the improvements to a system able to detect anomalous and potentially insecure situations that could be dangerous for a company. This system was initially conceived as a better alternative to what are known as black/white lists. These lists co…

Information retrievalInternal securityComputer scienceDecision systemFeature (computer vision)String (computer science)Computational intelligenceFeature selectionRough setCorporate security
researchProduct

Assessment of workflow feature selection on forest LAI prediction with sentinel-2A MSI, landsat 7 ETM+ and Landsat 8 OLI

2020

The European Space Agency (ESA)’s Sentinel-2A (S2A) mission is providing time series that allow the characterisation of dynamic vegetation, especially when combined with the National Aeronautics and Space Administration (NASA)/United States Geological Survey (USGS) Landsat 7 (L7) and Landsat 8 (L8) missions. Hybrid retrieval workflows combining non-parametric Machine Learning Regression Algorithms (MLRAs) and vegetation Radiative Transfer Models (RTMs) were proposed as fast and accurate methods to infer biophysical parameters such as Leaf Area Index (LAI) from these data streams. However, the exact design of optimal retrieval workflows is rarely discussed. In this study, the impact of…

Leaf area index (LAI)010504 meteorology & atmospheric sciencesComputer scienceScienceMultispectral image0211 other engineering and technologiesFeature selection02 engineering and technology01 natural sciencesCropLaboratory of Geo-information Science and Remote SensingMachine learningRadiative transferBosecologie en BosbeheerLaboratorium voor Geo-informatiekunde en Remote SensingForestLeaf area indexDiscrete anisotropic radiative transfer (DART) model021101 geological & geomatics engineering0105 earth and related environmental sciencesRemote sensingQInversion (meteorology)Vegetation15. Life on landPE&RCForest Ecology and Forest ManagementVegetation radiative transfer modelNoiseFeature (computer vision)Thematic MapperGeological surveyGeneral Earth and Planetary SciencesSentinel-2Remote Sensing
researchProduct

Application of machine learning techniques to analyse the effects of physical exercise in ventricular fibrillation

2014

This work presents the application of machine learning techniques to analyse the influence of physical exercise in the physiological properties of the heart, during ventricular fibrillation. To this end, different kinds of classifiers (linear and neural models) are used to classify between trained and sedentary rabbit hearts. The use of those classifiers in combination with a wrapper feature selection algorithm allows to extract knowledge about the most relevant features in the problem. The obtained results show that neural models outperform linear classifiers (better performance indices and a better dimensionality reduction). The most relevant features to describe the benefits of physical …

MaleComputer scienceHealth InformaticsPhysical exerciseFeature selectionMachine learningcomputer.software_genreElectrocardiographyKnowledge extractionArtificial IntelligencePhysical Conditioning AnimalmedicineAnimalsExtreme learning machinebusiness.industryDimensionality reductionWork (physics)Signal Processing Computer-Assistedmedicine.diseaseComputer Science ApplicationsCor MalaltiesPhysical FitnessMultilayer perceptronVentricular fibrillationVentricular FibrillationEnginyeria biomèdicaArtificial intelligenceRabbitsbusinesscomputer
researchProduct

Transforming RNA-Seq Data to Improve the Performance of Prognostic Gene Signatures

2014

Gene expression measurements have successfully been used for building prognostic signatures, i.e for identifying a short list of important genes that can predict patient outcome. Mostly microarray measurements have been considered, and there is little advice available for building multivariable risk prediction models from RNA-Seq data. We specifically consider penalized regression techniques, such as the lasso and componentwise boosting, which can simultaneously consider all measurements and provide both, multivariable regression models for prediction and automated variable selection. However, they might be affected by the typical skewness, mean-variance-dependency or extreme values of RNA-…

MaleGene Expressionlcsh:Medicinecomputer.software_genreBioinformaticslcsh:ScienceExtreme value theoryMultidisciplinaryMultivariable calculusStatisticsRegression analysisGenomicsPrognosisKidney NeoplasmsNeoplasm ProteinsLeukemia Myeloid AcuteMedicineProbability distributionFemaleSequence AnalysisAlgorithmsResearch ArticleStatistical DistributionsRiskBoosting (machine learning)Clinical Research DesignFeature selectionBiostatisticsBiologyMachine learningMolecular GeneticsGenome Analysis ToolsCovariateHumansStatistical MethodsGene PredictionBiologyCarcinoma Renal CellProbabilityClinical GeneticsSequence Analysis RNAbusiness.industrylcsh:RPersonalized MedicineModelingComputational BiologyProbability TheorySurvival AnalysisSkewnessMultivariate AnalysisRNAlcsh:QArtificial intelligenceGenome Expression AnalysisTranscriptomebusinesscomputerMathematicsPLoS ONE
researchProduct

General method for automated feature extraction and selection and its application for gender classification and biomechanical knowledge discovery of …

2020

Modern technologies enable to capture multiple biomechanical parameters often resulting in relational data. The current work proposes a generally applicable method comprising automated feature extraction, ensemble feature selection and classification to best capture the potentials of the data also for generating new biomechanical knowledge. Its benefits are demonstrated in the concrete biomechanically and medically relevant use case of gender classification based on spinal data for stance and gait. Very good results for accuracy were obtained using gait data. Dynamic movements of the lumbar spine in sagittal and frontal plane and of the pelvis in frontal plane best map gender differences.

MaleRelational databaseComputer science0206 medical engineeringFeature extractionPostureBiomedical EngineeringBioengineeringFeature selection02 engineering and technology03 medical and health sciencesAutomation0302 clinical medicineGait (human)Knowledge extractionmedicineHumansGaitComputingMethodologies_COMPUTERGRAPHICSSex Characteristicsbusiness.industryWork (physics)Reproducibility of ResultsPattern recognition030229 sport sciencesGeneral MedicineKnowledge Discovery020601 biomedical engineeringSagittal planeComputer Science ApplicationsBiomechanical PhenomenaHuman-Computer Interactionmedicine.anatomical_structureComputingMethodologies_PATTERNRECOGNITIONCoronal planeFemaleArtificial intelligencebusinessAlgorithms
researchProduct

Choline PET/CT Features to Predict Survival Outcome in High Risk Prostate Cancer Restaging: A Preliminary Machine-Learning Radiomics Study

2020

Background Radiomic features are increasingly utilized to evaluate tumor heterogeneity in PET imaging but to date its role has not been investigated for Cho-PET in prostate cancer. The potential application of radiomics features analysis using a machine-learning radiomics algorithm was evaluated to select 18F-Cho PET/CT imaging features to predict disease progression in PCa. Methods We retrospectively analyzed high-risk PCa patients who underwent restaging 18F-Cho PET/CT from November 2013 to May 2018. 18F-Cho PET/CT studies and related structures containing volumetric segmentations were imported in the "CGITA" toolbox to extract imaging features from each lesion. A Machine-learning model h…

Malemedicine.medical_specialtyn artificial intelligence model demonstrated to be feasible and able to select a panel of 18F-Cho PET/CT features with valuable association with PCa patients' outcome.business.industryProstatic NeoplasmsFeature selectionPet imagingCholine pet ctmedicine.diseaseTumor heterogeneitySurvival outcomeCholineMachine LearningProstate cancerRadiomicsFeature (computer vision)Artificial IntelligencePositron Emission Tomography Computed TomographyMedicineHumansRadiology Nuclear Medicine and imagingRadiologybusinessRetrospective Studies
researchProduct

Accelerating Causal Inference and Feature Selection Methods through G-Test Computation Reuse

2021

This article presents a novel and remarkably efficient method of computing the statistical G-test made possible by exploiting a connection with the fundamental elements of information theory: by writing the G statistic as a sum of joint entropy terms, its computation is decomposed into easily reusable partial results with no change in the resulting value. This method greatly improves the efficiency of applications that perform a series of G-tests on permutations of the same features, such as feature selection and causal inference applications because this decomposition allows for an intensive reuse of these partial results. The efficiency of this method is demonstrated by implementing it as…

Markov blanketMarkov blanketComputer sciencecomputation reuseConditional mutual informationComputationSciencePhysicsQC1-999QGeneral Physics and AstronomyContext (language use)Feature selectionInformation theoryAstrophysicsJoint entropyArticleG-testQB460-466feature selectionCausal inferencecausal inferenceAlgorithminformation theoryEntropy
researchProduct

Variable selection for the determination of total polar materials in fried oils by near infrared spectroscopy

2018

Total polar materials (TPM) content is considered as the best indicator and the most common parameter to check the quality of deep-frying oils. The development of simpler and quicker analytical techniques than the available methods to monitor oil quality in restaurants and fried food outlets is an important topic related to the human health. This paper reports a comparison of the variable selection of near infrared (NIR) spectra by multiple linear regression (MLR-NIR) with partial least squares (PLS-NIR) models for the quantification of TPM in fried vegetable oils. The use of PLS-NIR offers an alternative in laboratory bench equipment for the determination of TPM in oils employed for fryin…

Materials scienceTEC010401 analytical chemistryNear-infrared spectroscopyAnalytical chemistryFeature selection04 agricultural and veterinary sciences040401 food science01 natural sciences0104 chemical sciences0404 agricultural biotechnologyPartial least squares regressionPolarNear infrared radiationSpectroscopyJournal of Near Infrared Spectroscopy
researchProduct

Sensitivity Maps of the Hilbert-Schmidt Independence Criterion

2018

Abstract Kernel dependence measures yield accurate estimates of nonlinear relations between random variables, and they are also endorsed with solid theoretical properties and convergence rates. Besides, the empirical estimates are easy to compute in closed form just involving linear algebra operations. However, they are hampered by two important problems: the high computational cost involved, as two kernel matrices of the sample size have to be computed and stored, and the interpretability of the measure, which remains hidden behind the implicit feature map. We here address these two issues. We introduce the sensitivity maps (SMs) for the Hilbert–Schmidt independence criterion (HSIC). Sensi…

Mathematical optimization0211 other engineering and technologiesFeature selection02 engineering and technology010501 environmental sciences01 natural sciencesMeasure (mathematics)Kernel methodKernel (statistics)Linear algebraApplied mathematicsSensitivity (control systems)Random variableSoftwareIndependence (probability theory)021101 geological & geomatics engineering0105 earth and related environmental sciencesMathematicsApplied Soft Computing
researchProduct

Use of Guided Regularized Random Forest for Biophysical Parameter Retrieval

2018

This paper introduces a feature selection method based on random forest -the Guided Regularized Random Forest (GRRF)- which can be used in classification and regression tasks. The method is based on the regularization of the information gain in the random forest nodes to obtain a subset of relevant and non-redundant features. The proposed method is used as a preliminary step In the process of retrieving biophysical parameters from a hyperspectral image. Preliminary experiments show that we can reduce the RMSE of the retrievals by around 7% for the Leaf Area Index and around 8% for the fraction of vegetation cover when compared to the results using random forest features.

Mean squared error22/3 OA procedurebusiness.industryComputer scienceFeature extractionHyperspectral images0211 other engineering and technologiesHyperspectral imagingPattern recognitionFeature selection02 engineering and technologyBiophysical parameter retrievalRegularization (mathematics)RegressionRandom forestFeature selection0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingArtificial intelligenceLeaf area indexbusinessRandom forest021101 geological & geomatics engineeringIGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium
researchProduct