Search results for "Statistics - Machine Learning"

showing 10 items of 90 documents

Implicit differentiation for fast hyperparameter selection in non-smooth convex learning

2022

International audience; Finding the optimal hyperparameters of a model can be cast as a bilevel optimization problem, typically solved using zero-order techniques. In this work we study first-order methods when the inner optimization problem is convex but non-smooth. We show that the forward-mode differentiation of proximal gradient descent and proximal coordinate descent yield sequences of Jacobians converging toward the exact Jacobian. Using implicit differentiation, we show it is possible to leverage the non-smoothness of the inner problem to speed up the computation. Finally, we provide a bound on the error made on the hypergradient when the inner optimization problem is solved approxim…

FOS: Computer and information sciencesbilevel optimizationComputer Science - Machine Learninghyperparameter selec- tionMachine Learning (stat.ML)[MATH.MATH-OC] Mathematics [math]/Optimization and Control [math.OC]generalized linear modelsMachine Learning (cs.LG)Convex optimizationStatistics - Machine Learning[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]Optimization and Control (math.OC)FOS: Mathematics[MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC]hyperparameter optimizationLassoMathematics - Optimization and Control[MATH.MATH-ST] Mathematics [math]/Statistics [math.ST]
researchProduct

Dimensionality Reduction via Regression in Hyperspectral Imagery

2015

This paper introduces a new unsupervised method for dimensionality reduction via regression (DRR). The algorithm belongs to the family of invertible transforms that generalize Principal Component Analysis (PCA) by using curvilinear instead of linear features. DRR identifies the nonlinear features through multivariate regression to ensure the reduction in redundancy between he PCA coefficients, the reduction of the variance of the scores, and the reduction in the reconstruction error. More importantly, unlike other nonlinear dimensionality reduction methods, the invertibility, volume-preservation, and straightforward out-of-sample extension, makes DRR interpretable and easy to apply. The pro…

FOS: Computer and information sciencesbusiness.industryDimensionality reductionComputer Vision and Pattern Recognition (cs.CV)Feature extractionNonlinear dimensionality reductionDiffusion mapComputer Science - Computer Vision and Pattern RecognitionPattern recognitionMachine Learning (stat.ML)CollinearityReduction (complexity)Statistics - Machine LearningSignal ProcessingPrincipal component analysisArtificial intelligenceElectrical and Electronic EngineeringbusinessMathematicsCurse of dimensionality
researchProduct

A Unified SVM Framework for Signal Estimation

2013

This paper presents a unified framework to tackle estimation problems in Digital Signal Processing (DSP) using Support Vector Machines (SVMs). The use of SVMs in estimation problems has been traditionally limited to its mere use as a black-box model. Noting such limitations in the literature, we take advantage of several properties of Mercer's kernels and functional analysis to develop a family of SVM methods for estimation in DSP. Three types of signal model equations are analyzed. First, when a specific time-signal structure is assumed to model the underlying system that generated the data, the linear signal model (so called Primal Signal Model formulation) is first stated and analyzed. T…

FOS: Computer and information sciencesbusiness.industryNoise (signal processing)Computer scienceApplied MathematicsSpectral density estimationArray processingPattern recognitionMachine Learning (stat.ML)Statistics - ApplicationsSupport vector machineKernel (linear algebra)Kernel methodComputational Theory and MathematicsStatistics - Machine LearningArtificial IntelligenceSignal ProcessingApplications (stat.AP)Computer Vision and Pattern RecognitionArtificial intelligenceElectrical and Electronic EngineeringStatistics Probability and UncertaintybusinessDigital signal processingReproducing kernel Hilbert space
researchProduct

Study design in causal models

2012

The causal assumptions, the study design and the data are the elements required for scientific inference in empirical research. The research is adequately communicated only if all of these elements and their relations are described precisely. Causal models with design describe the study design and the missing data mechanism together with the causal structure and allow the direct application of causal calculus in the estimation of the causal effects. The flow of the study is visualized by ordering the nodes of the causal diagram in two dimensions by their causal order and the time of the observation. Conclusions whether a causal or observational relationship can be estimated from the collect…

FOS: Computer and information sciencesdesignstructural equation modelG.362A01 62-09 62F99 62D05 62P10 62K99 68T30graphical modelMachine Learning (stat.ML)G.2.2Statistics - ApplicationsG.3; G.2.2Methodology (stat.ME)missing dataStatistics - Machine LearningkausaliteettiApplications (stat.AP)epidemiologiaStatistics - Methodology
researchProduct

Optimization of anemia treatment in hemodialysis patients via reinforcement learning

2013

Objective: Anemia is a frequent comorbidity in hemodialysis patients that can be successfully treated by administering erythropoiesis-stimulating agents (ESAs). ESAs dosing is currently based on clinical protocols that often do not account for the high inter- and intra-individual variability in the patient's response. As a result, the hemoglobin level of some patients oscillates around the target range, which is associated with multiple risks and side-effects. This work proposes a methodology based on reinforcement learning (RL) to optimize ESA therapy. Methods: RL is a data-driven approach for solving sequential decision-making problems that are formulated as Markov decision processes (MDP…

MaleFOS: Computer and information sciencesMathematical optimizationDarbepoetin alfaComputer scienceAnemiaComputer Science - Artificial Intelligencemedicine.medical_treatmentMedicine (miscellaneous)Machine Learning (stat.ML)Outcome (game theory)Decision Support TechniquesMachine Learning (cs.LG)Renal DialysisArtificial IntelligenceStatistics - Machine LearningmedicineHumansReinforcement learningDosingAgedProtocol (science)Patient SelectionAnemiaHemoglobin AMiddle Agedmedicine.diseaseMarkov ChainsComputer Science - LearningArtificial Intelligence (cs.AI)Chronic DiseaseHematinicsKidney Failure ChronicFemaleHemodialysisMarkov decision processReinforcement PsychologyAlgorithmsmedicine.drug
researchProduct

Inference of Spatio-Temporal Functions over Graphs via Multi-Kernel Kriged Kalman Filtering

2018

Inference of space-time varying signals on graphs emerges naturally in a plethora of network science related applications. A frequently encountered challenge pertains to reconstructing such dynamic processes, given their values over a subset of vertices and time instants. The present paper develops a graph-aware kernel-based kriged Kalman filter that accounts for the spatio-temporal variations, and offers efficient online reconstruction, even for dynamically evolving network topologies. The kernel-based learning framework bypasses the need for statistical information by capitalizing on the smoothness that graph signals exhibit with respect to the underlying graph. To address the challenge o…

Signal Processing (eess.SP)FOS: Computer and information sciencesComputer Science - Machine LearningComputational complexity theoryComputer scienceInferenceMachine Learning (stat.ML)Network scienceMultikernel02 engineering and technologyNetwork topologyLinear spanMachine Learning (cs.LG)Kernel (linear algebra)Matrix (mathematics)Statistics - Machine LearningFOS: Electrical engineering electronic engineering information engineering0202 electrical engineering electronic engineering information engineeringElectrical and Electronic EngineeringElectrical Engineering and Systems Science - Signal Processing020206 networking & telecommunicationsKalman filterSignal Processing020201 artificial intelligence & image processingLaplace operatorAlgorithm
researchProduct

Online Topology Identification from Vector Autoregressive Time Series

2019

Causality graphs are routinely estimated in social sciences, natural sciences, and engineering due to their capacity to efficiently represent the spatiotemporal structure of multivariate data sets in a format amenable for human interpretation, forecasting, and anomaly detection. A popular approach to mathematically formalize causality is based on vector autoregressive (VAR) models and constitutes an alternative to the well-known, yet usually intractable, Granger causality. Relying on such a VAR causality notion, this paper develops two algorithms with complementary benefits to track time-varying causality graphs in an online fashion. Their constant complexity per update also renders these a…

Signal Processing (eess.SP)FOS: Computer and information sciencesTheoretical computer scienceComputer scienceEstimatorMachine Learning (stat.ML)020206 networking & telecommunicationsRegret02 engineering and technologyCausalitySynthetic dataCausality (physics)Autoregressive modelGranger causalityStatistics - Machine LearningSignal ProcessingFOS: Electrical engineering electronic engineering information engineering0202 electrical engineering electronic engineering information engineeringAnomaly detectionElectrical and Electronic EngineeringTime seriesElectrical Engineering and Systems Science - Signal Processing
researchProduct

A fast and recursive algorithm for clustering large datasets with k-medians

2012

Clustering with fast algorithms large samples of high dimensional data is an important challenge in computational statistics. Borrowing ideas from MacQueen (1967) who introduced a sequential version of the $k$-means algorithm, a new class of recursive stochastic gradient algorithms designed for the $k$-medians loss criterion is proposed. By their recursive nature, these algorithms are very fast and are well adapted to deal with large samples of data that are allowed to arrive sequentially. It is proved that the stochastic gradient algorithm converges almost surely to the set of stationary points of the underlying loss criterion. A particular attention is paid to the averaged versions, which…

Statistics and ProbabilityClustering high-dimensional dataFOS: Computer and information sciencesMathematical optimizationhigh dimensional dataMachine Learning (stat.ML)02 engineering and technologyStochastic approximation01 natural sciencesStatistics - Computation010104 statistics & probabilityk-medoidsStatistics - Machine Learning[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]stochastic approximation0202 electrical engineering electronic engineering information engineeringComputational statisticsrecursive estimatorsAlmost surely[ MATH.MATH-ST ] Mathematics [math]/Statistics [math.ST]0101 mathematicsCluster analysisComputation (stat.CO)Mathematicsaveragingk-medoidsRobbins MonroApplied MathematicsEstimator[STAT.TH]Statistics [stat]/Statistics Theory [stat.TH]stochastic gradient[ STAT.TH ] Statistics [stat]/Statistics Theory [stat.TH]MedoidComputational MathematicsComputational Theory and Mathematicsonline clustering020201 artificial intelligence & image processingpartitioning around medoidsAlgorithm
researchProduct

Fusing optical and SAR time series for LAI gap filling with multioutput Gaussian processes

2019

The availability of satellite optical information is often hampered by the natural presence of clouds, which can be problematic for many applications. Persistent clouds over agricultural fields can mask key stages of crop growth, leading to unreliable yield predictions. Synthetic Aperture Radar (SAR) provides all-weather imagery which can potentially overcome this limitation, but given its high and distinct sensitivity to different surface properties, the fusion of SAR and optical data still remains an open challenge. In this work, we propose the use of Multi-Output Gaussian Process (MOGP) regression, a machine learning technique that learns automatically the statistical relationships among…

Synthetic aperture radarFOS: Computer and information sciencesComputer Science - Machine LearningTeledetecció010504 meteorology & atmospheric sciencesMean squared error0208 environmental biotechnologySoil ScienceFOS: Physical sciencesMachine Learning (stat.ML)02 engineering and technology01 natural sciencesArticlelaw.inventionMachine Learning (cs.LG)symbols.namesakelawStatistics - Machine LearningFOS: Electrical engineering electronic engineering information engineeringComputers in Earth SciencesRadarLeaf area indexCluster analysisGaussian process0105 earth and related environmental sciencesRemote sensingMathematicsImage and Video Processing (eess.IV)Processos estocàsticsGeologyElectrical Engineering and Systems Science - Image and Video ProcessingSensor fusionRegression020801 environmental engineeringPhysics - Data Analysis Statistics and ProbabilitysymbolsData Analysis Statistics and Probability (physics.data-an)Imatges Processament
researchProduct

Enhancing identification of causal effects by pruning

2018

Causal models communicate our assumptions about causes and effects in real-world phe- nomena. Often the interest lies in the identification of the effect of an action which means deriving an expression from the observed probability distribution for the interventional distribution resulting from the action. In many cases an identifiability algorithm may return a complicated expression that contains variables that are in fact unnecessary. In practice this can lead to additional computational burden and increased bias or inefficiency of estimates when dealing with measurement error or missing data. We present graphical criteria to detect variables which are redundant in identifying causal effe…

päättelyFOS: Computer and information sciencesalgorithmcausal modelMachine Learning (stat.ML)Machine Learning (cs.LG)Computer Science - Learningleikkaus (kasvit)koneoppiminenStatistics - Machine Learningidentiafiabilityalgoritmitkausaliteetticausal inferencetunnistaminen
researchProduct