Search results for "Hyperparameter"

showing 10 items of 22 documents

Optimizing Gaussian Process Regression for Image Time Series Gap-Filling and Crop Monitoring

2020

Image processing entered the era of artificial intelligence, and machine learning algorithms emerged as attractive alternatives for time series data processing. Satellite image time series processing enables crop phenology monitoring, such as the calculation of start and end of season. Among the promising algorithms, Gaussian process regression (GPR) proved to be a competitive time series gap-filling algorithm with the advantage of, as developed within a Bayesian framework, providing associated uncertainty estimates. Nevertheless, the processing of time series images becomes computationally inefficient in its standard per-pixel usage, mainly for GPR training rather than the fitting step. To…

010504 meteorology & atmospheric sciencesMean squared errorComputer science0211 other engineering and technologiesImage processing02 engineering and technologycomputer.software_genre01 natural scienceslcsh:AgricultureKrigingTime series021101 geological & geomatics engineering0105 earth and related environmental sciences2. Zero hungerHyperparameterPixelSeries (mathematics)lcsh:SGaussian processes regressionSatellite Image Time SeriesData miningtime seriesSentinel-2optimizationAgronomy and Crop Sciencecomputercrop monitoringphenology indicatorsAgronomy

researchProduct

Adjusted bat algorithm for tuning of support vector machine parameters

2016

Support vector machines are powerful and often used technique of supervised learning applied to classification. Quality of the constructed classifier can be improved by appropriate selection of the learning parameters. These parameters are often tuned using grid search with relatively large step. This optimization process can be done computationally more efficiently and more precisely using stochastic search metaheuristics. In this paper we propose adjusted bat algorithm for support vector machines parameter optimization and show that compared to the grid search it leads to a better classifier. We tested our approach on standard set of benchmark data sets from UCI machine learning repositor…

0209 industrial biotechnologyWake-sleep algorithmActive learning (machine learning)Computer scienceStability (learning theory)Linear classifier02 engineering and technologySemi-supervised learningcomputer.software_genreCross-validationRelevance vector machineKernel (linear algebra)020901 industrial engineering & automationLeast squares support vector machine0202 electrical engineering electronic engineering information engineeringMetaheuristicBat algorithmStructured support vector machinebusiness.industrySupervised learningOnline machine learningParticle swarm optimizationPattern recognitionPerceptronGeneralization errorSupport vector machineKernel methodComputational learning theoryMargin classifierHyperparameter optimization020201 artificial intelligence & image processingData miningArtificial intelligenceHyper-heuristicbusinesscomputer2016 IEEE Congress on Evolutionary Computation (CEC)

researchProduct

A heuristic, iterative algorithm for change-point detection in abrupt change models

2017

Change-point detection in abrupt change models is a very challenging research topic in many fields of both methodological and applied Statistics. Due to strong irregularities, discontinuity and non-smootheness, likelihood based procedures are awkward; for instance, usual optimization methods do not work, and grid search algorithms represent the most used approach for estimation. In this paper a heuristic, iterative algorithm for approximate maximum likelihood estimation is introduced for change-point detection in piecewise constant regression models. The algorithm is based on iterative fitting of simple linear models, and appears to extend easily to more general frameworks, such as models i…

0301 basic medicineStatistics and ProbabilityMathematical optimizationIterative methodHeuristic (computer science)Linear model01 natural sciencesPiecewise constant model Approximate maximum likelihood Model linearization Grid search limitations010104 statistics & probability03 medical and health sciencesComputational MathematicsDiscontinuity (linguistics)030104 developmental biologyHyperparameter optimizationCovariatePiecewise0101 mathematicsStatistics Probability and UncertaintySettore SECS-S/01 - StatisticaChange detectionMathematics

researchProduct

Probabilistic cross-validation estimators for Gaussian process regression

2018

Gaussian Processes (GPs) are state-of-the-art tools for regression. Inference of GP hyperparameters is typically done by maximizing the marginal log-likelihood (ML). If the data truly follows the GP model, using the ML approach is optimal and computationally efficient. Unfortunately very often this is not case and suboptimal results are obtained in terms of prediction error. Alternative procedures such as cross-validation (CV) schemes are often employed instead, but they usually incur in high computational costs. We propose a probabilistic version of CV (PCV) based on two different model pieces in order to reduce the dependence on a specific model choice. PCV presents the benefits from both…

050502 lawHyperparameterMinimum mean square error05 social sciencesProbabilistic logicEstimator01 natural sciencesCross-validation010104 statistics & probabilitysymbols.namesakeKrigingStatisticssymbolsMaximum a posteriori estimation0101 mathematicsGaussian processAlgorithm0505 lawMathematics2017 25th European Signal Processing Conference (EUSIPCO)

researchProduct

RNN- and LSTM-Based Soft Sensors Transferability for an Industrial Process

2021

The design and application of Soft Sensors (SSs) in the process industry is a growing research field, which needs to mediate problems of model accuracy with data availability and computational complexity. Black-box machine learning (ML) methods are often used as an efficient tool to implement SSs. Many efforts are, however, required to properly select input variables, model class, model order and the needed hyperparameters. The aim of this work was to investigate the possibility to transfer the knowledge acquired in the design of a SS for a given process to a similar one. This has been approached as a transfer learning problem from a source to a target domain. The implementation of a transf…

Computational complexity theoryProcess (engineering)Computer sciencesulfur recovery unit02 engineering and technologytransfer learningMachine learningcomputer.software_genrelcsh:Chemical technologyBiochemistryRNNField (computer science)ArticleAnalytical ChemistryDomain (software engineering)0202 electrical engineering electronic engineering information engineeringlcsh:TP1-1185Electrical and Electronic EngineeringInstrumentationsystem identificationHyperparameterbusiness.industry020208 electrical & electronic engineeringdynamical modelsSystem identificationAtomic and Molecular Physics and OpticsNonlinear systemRecurrent neural networksoft sensors020201 artificial intelligence & image processingArtificial intelligenceTransfer of learningbusinessLSTMcomputerDynamical models; LSTM; RNN; Soft sensors; Sulfur recovery unit; System identification; Transfer learningSensors

researchProduct

Optimisation non-lisse pour l'estimation de composants immunitaires cellulaires dans un environnement tumoral

2021

In this PhD proposal we will investigate new regularization methods of inverse problems that provide an absolute quantification of immune cell subpopulations. The mathematical aspect of this PhD proposal is two-fold. The first goal is to enhance the underlying linear model through a more refined construction of the expression matrix. The second goal is, given this linear model, to derive the best possible estimator. These two issues can be treated in a decoupled way, which is the standard for existing methods such as Cibersort, or as a coupled optimization problem (which is known as blind deconvolution in signal processing).

Coordinate descentProblème inverse[INFO.INFO-OH]Computer Science [cs]/Other [cs.OH]Automatic differentiationBiomedical applicationHyperparameters selectionOptimisation non-LisseÉlection de paramètresDifférentiation automatique[INFO.INFO-OH] Computer Science [cs]/Other [cs.OH]Descente de coordonnéesInverse problemApplication biomédicaleNon-Smooth optimization

researchProduct

Hub-Centered Gene Network Reconstruction Using Automatic Relevance Determination

2012

Network inference deals with the reconstruction of biological networks from experimental data. A variety of different reverse engineering techniques are available; they differ in the underlying assumptions and mathematical models used. One common problem for all approaches stems from the complexity of the task, due to the combinatorial explosion of different network topologies for increasing network size. To handle this problem, constraints are frequently used, for example on the node degree, number of edges, or constraints on regulation functions between network components. We propose to exploit topological considerations in the inference of gene regulatory networks. Such systems are often…

Dynamic network analysisTranscription GeneticMicroarraysSciencePosterior probabilityGene regulatory networkBiologycomputer.software_genreBioinformaticsNetwork topology03 medical and health sciences0302 clinical medicineYeastsGeneticsComputer SimulationGene Regulatory NetworksGene NetworksBiology030304 developmental biologyRegulatory NetworksHyperparameter0303 health sciencesMultidisciplinaryModels GeneticSystems BiologyQuantitative Biology::Molecular NetworksCell CycleQRComputational BiologyBayesian networkGene Expression RegulationROC CurveMedicineData miningcomputerAlgorithms030217 neurology & neurosurgeryCombinatorial explosionBiological networkResearch ArticlePLoS ONE

researchProduct

Joint Gaussian Processes for Biophysical Parameter Retrieval

2017

Solving inverse problems is central to geosciences and remote sensing. Radiative transfer models (RTMs) represent mathematically the physical laws which govern the phenomena in remote sensing applications (forward models). The numerical inversion of the RTM equations is a challenging and computationally demanding problem, and for this reason, often the application of a nonlinear statistical regression is preferred. In general, regression models predict the biophysical parameter of interest from the corresponding received radiance. However, this approach does not employ the physical information encoded in the RTMs. An alternative strategy, which attempts to include the physical knowledge, co…

FOS: Computer and information sciencesHyperparameter010504 meteorology & atmospheric sciencesComputer scienceRemote sensing application0211 other engineering and technologiesMachine Learning (stat.ML)Regression analysis02 engineering and technologyInverse problem01 natural sciencesMachine Learning (cs.LG)Data modelingNonparametric regressionComputer Science - Learningsymbols.namesakeStatistics - Machine LearningRadiative transfersymbolsGeneral Earth and Planetary SciencesElectrical and Electronic EngineeringGaussian processAlgorithm021101 geological & geomatics engineering0105 earth and related environmental sciencesIEEE Transactions on Geoscience and Remote Sensing

researchProduct

Randomized Rx For Target Detection

2018

This work tackles the target detection problem through the well-known global RX method. The RX method models the clutter as a multivariate Gaussian distribution, and has been extended to nonlinear distributions using kernel methods. While the kernel RX can cope with complex clutters, it requires a considerable amount of computational resources as the number of clutter pixels gets larger. Here we propose random Fourier features to approximate the Gaussian kernel in kernel RX and consequently our development keep the accuracy of the nonlinearity while reducing the computational cost which is now controlled by an hyperparameter. Results over both synthetic and real-world image target detection…

FOS: Computer and information sciencesHyperparameter020301 aerospace & aeronauticsComputer Science - Machine LearningComputer scienceComputer Vision and Pattern Recognition (cs.CV)0211 other engineering and technologiesComputer Science - Computer Vision and Pattern RecognitionMultivariate normal distribution02 engineering and technologyObject detectionMachine Learning (cs.LG)symbols.namesakeKernel (linear algebra)Kernel method0203 mechanical engineeringKernel (statistics)Gaussian functionsymbolsClutterAnomaly detectionAlgorithm021101 geological & geomatics engineering

researchProduct

An LP-based hyperparameter optimization model for language modeling

2018

In order to find hyperparameters for a machine learning model, algorithms such as grid search or random search are used over the space of possible values of the models hyperparameters. These search algorithms opt the solution that minimizes a specific cost function. In language models, perplexity is one of the most popular cost functions. In this study, we propose a fractional nonlinear programming model that finds the optimal perplexity value. The special structure of the model allows us to approximate it by a linear programming model that can be solved using the well-known simplex algorithm. To the best of our knowledge, this is the first attempt to use optimization techniques to find per…

FOS: Computer and information sciencesMathematical optimizationPerplexityLinear programmingComputer scienceMachine Learning (stat.ML)02 engineering and technology010501 environmental sciences01 natural sciencesTheoretical Computer ScienceNonlinear programmingMachine Learning (cs.LG)Random searchSimplex algorithmSearch algorithmStatistics - Machine Learning0202 electrical engineering electronic engineering information engineeringFOS: MathematicsMathematics - Optimization and Control0105 earth and related environmental sciencesHyperparameterComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Computer Science - LearningHardware and ArchitectureOptimization and Control (math.OC)Hyperparameter optimization020201 artificial intelligence & image processingLanguage modelSoftwareInformation Systems

researchProduct