Search results for "machine learning."

showing 10 items of 1455 documents

Optimized Kernel Entropy Components

2016

This work addresses two main issues of the standard Kernel Entropy Component Analysis (KECA) algorithm: the optimization of the kernel decomposition and the optimization of the Gaussian kernel parameter. KECA roughly reduces to a sorting of the importance of kernel eigenvectors by entropy instead of by variance as in Kernel Principal Components Analysis. In this work, we propose an extension of the KECA method, named Optimized KECA (OKECA), that directly extracts the optimal features retaining most of the data entropy by means of compacting the information in very few features (often in just one or two). The proposed method produces features which have higher expressive power. In particular…

FOS: Computer and information sciences; Computer Networks and Communications; Kernel density estimation; Machine Learning (stat.ML); 02 engineering and technology; Kernel principal component analysis; Machine Learning (cs.LG); Artificial Intelligence; Polynomial kernel; Statistics - Machine Learning; 0202 electrical engineering electronic engineering information engineering; Mathematics; business.industry; 020206 networking & telecommunications; Pattern recognition; Computer Science Applications; Computer Science - Learning; Kernel method; Kernel embedding of distributions; Variable kernel density estimation; Radial basis function kernel; Kernel smoother; 020201 artificial intelligence & image processing; Artificial intelligence; business; Software; IEEE Transactions on Neural Networks and Learning Systems

researchProduct
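
The entropy-based ranking that the KECA abstract above contrasts with variance-based Kernel PCA can be illustrated with a minimal sketch. It covers only standard KECA, not the proposed OKECA optimization, and assumes the usual Renyi quadratic entropy estimate, in which each eigenpair (lambda_i, e_i) of the kernel matrix contributes lambda_i * (1^T e_i)^2. The function names, the Gaussian kernel width `sigma`, and the use of NumPy are assumptions made for illustration and do not come from the paper.

```python
import numpy as np

def gaussian_kernel(X, sigma):
    """Gaussian (RBF) kernel matrix for the rows of X."""
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-np.maximum(d2, 0.0) / (2.0 * sigma**2))

def keca_ranking(K):
    """Rank kernel eigenpairs by their contribution to the entropy estimate."""
    eigvals, eigvecs = np.linalg.eigh(K)      # K is symmetric
    eigvals = np.clip(eigvals, 0.0, None)     # guard against tiny negative values
    # Contribution of eigenpair i: lambda_i * (sum of entries of e_i)^2
    contrib = eigvals * (eigvecs.sum(axis=0) ** 2)
    order = np.argsort(contrib)[::-1]         # largest contribution first
    return order, contrib[order], eigvals[order], eigvecs[:, order]

def keca_features(K, n_components):
    """Training-set features on the top entropy-ranked kernel axes."""
    _, _, lam, E = keca_ranking(K)
    return E[:, :n_components] * np.sqrt(lam[:n_components])
```

Kernel PCA would instead sort by `eigvals` alone; the two orderings differ whenever an eigenvector with a large eigenvalue has entries that nearly sum to zero.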

Simplifying Probabilistic Expressions in Causal Inference

2018

Obtaining a non-parametric expression for an interventional distribution is one of the most fundamental tasks in causal inference. Such an expression can be obtained for an identifiable causal effect by an algorithm or by manual application of do-calculus. Often we are left with a complicated expression which can lead to biased or inefficient estimates when missing data or measurement errors are involved. We present an automatic simplification algorithm that seeks to eliminate symbolically unnecessary variables from these expressions by taking advantage of the structure of the underlying graphical model. Our method is applicable to all causal effect formulas and is readily available in the …

FOS: Computer and information sciences; Computer Science - Artificial Intelligence; graph theory; yksinkertaisuus; simplification; graphical model; Machine Learning (stat.ML); Machine Learning (cs.LG); Computer Science - Learning; probabilistic expression; Artificial Intelligence (cs.AI); Statistics - Machine Learning; kausaliteetti; piirrosmerkit; causal inference; graafit

researchProduct

Anomaly Detection Framework Using Rule Extraction for Efficient Intrusion Detection

2014

Huge datasets in cyber security, such as network traffic logs, can be analyzed using machine learning and data mining methods. However, the amount of collected data is increasing, which makes analysis more difficult. Many machine learning methods have not been designed for big datasets, and consequently are slow and difficult to understand. We address the issue of efficient network traffic classification by creating an intrusion detection framework that applies dimensionality reduction and conjunctive rule extraction. The system can perform unsupervised anomaly detection and use this information to create conjunctive rules that classify huge amounts of traffic in real time. We test the impl…

FOS: Computer and information sciences; Computer Science - Learning; Computer Science - Cryptography and Security; Cryptography and Security (cs.CR); Machine Learning (cs.LG)

researchProduct

Ensembles of Randomized Time Series Shapelets Provide Improved Accuracy while Reducing Computational Costs

2017

Shapelets are discriminative time series subsequences that allow generation of interpretable classification models, which provide faster and generally better classification than the nearest neighbor approach. However, the shapelet discovery process requires the evaluation of all possible subsequences of all time series in the training set, making it extremely computation intensive. Consequently, shapelet discovery for large time series datasets quickly becomes intractable. A number of improvements have been proposed to reduce the training time. These techniques use approximation or discretization and often lead to reduced classification accuracy compared to the exact method. We are proposin…

FOS: Computer and information sciences; Computer Science - Learning; ComputingMethodologies_PATTERNRECOGNITION; Machine Learning (cs.LG)

researchProduct

Renewable Energy Prediction using Weather Forecasts for Optimal Scheduling in HPC Systems

2014

The objective of the GreenPAD project is to use green energy (wind, solar and biomass) to power data centers that run HPC jobs. As part of this, it is important to predict the renewable (wind) energy for efficient scheduling (executing jobs that require more energy when more green energy is available, and vice versa). To predict the wind energy, we first analyze the historical data to find a statistical model that gives the relation between wind energy and weather attributes. Then we use this model with the weather forecast data to predict future green energy availability. Using the green energy prediction obtained from the statistical model we are able…

FOS: Computer and information sciences; Computer Science - Learning; Physics::Atmospheric and Oceanic Physics; Machine Learning (cs.LG)

researchProduct

Retrieval of Case 2 Water Quality Parameters with Machine Learning

2018

Water quality parameters are derived by applying several machine learning regression methods to the Case2eXtreme dataset (C2X). The data are based on Hydrolight in-water radiative transfer simulations at Sentinel-3 OLCI wavebands, and the application is done exclusively for absorbing waters with high concentrations of coloured dissolved organic matter (CDOM). The regression approaches are: regularized linear, random forest, kernel ridge, Gaussian process and support vector regressors. The validation is made with an independent simulation dataset. A comparison with the OLCI Neural Network Swarm (ONSS) is made as well. The best approach is applied to a sample scene and compared with t…

FOS: Computer and information sciences; Computer Science - Machine Learning; 010504 meteorology & atmospheric sciences; 0211 other engineering and technologies; FOS: Physical sciences; 02 engineering and technology; Machine learning; computer.software_genre; 01 natural sciences; Data modeling; Machine Learning (cs.LG); Physics - Geophysics; symbols.namesake; Radiative transfer; Gaussian process; 021101 geological & geomatics engineering; 0105 earth and related environmental sciences; Mathematics; Artificial neural network; business.industry; 6. Clean water; Random forest; Geophysics (physics.geo-ph); Support vector machine; Colored dissolved organic matter; Kernel (statistics); Physics - Data Analysis Statistics and Probability; symbols; Artificial intelligence; business; computer; Data Analysis Statistics and Probability (physics.data-an)

researchProduct

Retrieval of coloured dissolved organic matter with machine learning methods

2017

The coloured dissolved organic matter (CDOM) concentration is the standard measure of humic substance in natural waters. CDOM measurements by remote sensing are calculated using the absorption coefficient (a) at a certain wavelength (e.g. 440 nm). This paper presents a comparison of four machine learning methods for the retrieval of CDOM from remote sensing signals: regularized linear regression (RLR), random forest (RF), kernel ridge regression (KRR) and Gaussian process regression (GPR). Results are compared with the established polynomial regression algorithms. RLR is revealed as the simplest and most efficient method, followed closely by its nonlinear counterpart KRR.

FOS: Computer and information sciences; Computer Science - Machine Learning; 010504 meteorology & atmospheric sciences; 0211 other engineering and technologies; FOS: Physical sciences; 02 engineering and technology; Machine learning; computer.software_genre; 01 natural sciences; Machine Learning (cs.LG); Physics - Geophysics; Kriging; Dissolved organic carbon; Linear regression; 021101 geological & geomatics engineering; 0105 earth and related environmental sciences; Mathematics; Polynomial regression; business.industry; 6. Clean water; Geophysics (physics.geo-ph); Random forest; Nonlinear system; Colored dissolved organic matter; Kernel (statistics); Artificial intelligence; business; computer

researchProduct

Gap Filling of Biophysical Parameter Time Series with Multi-Output Gaussian Processes

2018

In this work we evaluate multi-output (MO) Gaussian Process (GP) models based on the linear model of coregionalization (LMC) for estimation of biophysical parameter variables under a gap filling setup. In particular, we focus on LAI and fAPAR over rice areas. We show how this problem cannot be solved with standard single-output (SO) GP models, and how the proposed MO-GP models are able to successfully predict these variables even in high missing data regimes, by implicitly performing an across-domain information transfer.

FOS: Computer and information sciences; Computer Science - Machine Learning; 010504 meteorology & atmospheric sciences; 0211 other engineering and technologies; FOS: Physical sciences; Machine Learning (stat.ML); 02 engineering and technology; 01 natural sciences; Quantitative Biology - Quantitative Methods; Machine Learning (cs.LG); Data modeling; symbols.namesake; Statistics - Machine Learning; Applied mathematics; Time series; Gaussian process; Quantitative Methods (q-bio.QM); 021101 geological & geomatics engineering; 0105 earth and related environmental sciences; Mathematics; Series (mathematics); Linear model; Probability and statistics; Missing data; FOS: Biological sciences; Physics - Data Analysis Statistics and Probability; symbols; Focus (optics); Data Analysis Statistics and Probability (physics.data-an)

researchProduct

Disentangling Derivatives, Uncertainty and Error in Gaussian Process Models

2020

Gaussian Processes (GPs) are a class of kernel methods that have been shown to be very useful in geoscience applications. They are widely used because they are simple, flexible and provide very accurate estimates for nonlinear problems, especially in parameter retrieval. In addition to a predictive mean function, GPs come equipped with a useful property: the predictive variance function, which provides confidence intervals for the predictions. The GP formulation usually assumes that there is no input noise in the training and testing points, only in the observations. However, this is often not the case in Earth observation problems where an accurate assessment of the instrument error is usually a…

FOS: Computer and information sciences; Computer Science - Machine Learning; 010504 meteorology & atmospheric sciences; Computer science; 0211 other engineering and technologies; Machine Learning (stat.ML); 02 engineering and technology; 01 natural sciences; Machine Learning (cs.LG); symbols.namesake; Statistics - Machine Learning; Gaussian process; 021101 geological & geomatics engineering; 0105 earth and related environmental sciences; Variance function; Propagation of uncertainty; Variance (accounting); Function (mathematics); Confidence interval; Nonlinear system; Noise; Kernel method; 13. Climate action; Kernel (statistics); symbols; Algorithm; IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium

researchProduct
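
The predictive mean and predictive variance functions mentioned in the Gaussian process abstract above can be sketched for the standard formulation, the one that assumes noise only in the observations and none in the inputs. This is a generic textbook GP regression sketch, not the error-propagation method of the paper. The RBF kernel, the parameter names `noise_var` and `length_scale`, and the function names are assumptions made for illustration.

```python
import numpy as np

def rbf(A, B, length_scale=1.0):
    """RBF kernel between the rows of A and the rows of B."""
    d2 = np.sum(A**2, 1)[:, None] + np.sum(B**2, 1)[None, :] - 2.0 * A @ B.T
    return np.exp(-0.5 * np.maximum(d2, 0.0) / length_scale**2)

def gp_predict(X_train, y_train, X_test, noise_var=1e-2, length_scale=1.0):
    """Standard GP regression: predictive mean and latent-function variance."""
    K = rbf(X_train, X_train, length_scale) + noise_var * np.eye(len(X_train))
    K_s = rbf(X_train, X_test, length_scale)
    L = np.linalg.cholesky(K)                          # K = L L^T
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
    mean = K_s.T @ alpha                               # k_*^T (K + s^2 I)^-1 y
    v = np.linalg.solve(L, K_s)
    # Prior variance k(x, x) is 1 for this RBF kernel; the noise term is
    # not added back, so this is the variance of the latent function.
    var = 1.0 - np.sum(v**2, axis=0)
    return mean, var
```

An approximate 95% confidence band under this model is `mean ± 1.96 * np.sqrt(var)`; far from the training data the variance returns to the prior value of 1.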

A Deep Network Approach to Multitemporal Cloud Detection

2018

We present a deep learning model with temporal memory to detect clouds in image time series acquired by the Seviri imager mounted on the Meteosat Second Generation (MSG) satellite. The model provides pixel-level cloud maps with related confidence and propagates information in time via a recurrent neural network structure. With a single model, we are able to outline clouds throughout the year, during both day and night, with high accuracy.

FOS: Computer and information sciences; Computer Science - Machine Learning; 010504 meteorology & atmospheric sciences; Computer science; Feature extraction; 0211 other engineering and technologies; Cloud detection; FOS: Physical sciences; Cloud computing; 02 engineering and technology; Cloud detection; 01 natural sciences; Machine Learning (cs.LG); Laboratory of Geo-information Science and Remote Sensing; Laboratorium voor Geo-informatiekunde en Remote Sensing; 021101 geological & geomatics engineering; 0105 earth and related environmental sciences; Remote sensing; business.industry; Seviri; Deep learning; Deep learning; PE&RC; Physics - Atmospheric and Oceanic Physics; Recurrent neural network; Recurrent neural networks; Atmospheric and Oceanic Physics (physics.ao-ph); Convolutional neural networks; Satellite; Artificial intelligence; business; Network approach; IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium

researchProduct