Search results for "Mining"
showing 10 items of 1730 documents
Analysis of clickstream data with mixture hidden markov models
2021
clickstream data sono un’importante fonte di informazioni per l’ecommerce, sebbene non siano semplici da gestire e convertire queste informazioni in un reale vantaggio competitivo non e un compito banale. In questo articolo, consid- ` eriamo l’applicazione dei mixture hidden Markov model a dati relativi al flusso di clickstream estratti dal portale e-commerce di un’azienda di servizi turistici. Sono stati individuati cluster relativi al comportamento di navigazione degli utenti e alla loro posizione geografica che forniscono indicazioni importanti per lo sviluppo di nuove strategie di business. Clickstream data is an important source of information for businesses, however it is not easy to …
Statistically Validated Networks for evaluating coherence in topic models
2022
Probabilistic topic models have become one of the most widespread machine learning technique for textual analysis purpose. In this framework, Latent Dirichlet Allocation (LDA) gained more and more popularity as a text modelling technique. The idea is that documents are represented as random mixtures over latent topics, where a distribution over words characterizes each topic. Unfortunately, topic models do not guarantee the interpretability of their outputs. The topics learned from the model may be characterized by a set of irrelevant or unchained words, being useless for the interpretation. In the framework of topic quality evaluation, the pairwise semantic cohesion among the top-N most pr…
Deep Completion Autoencoders for Radio Map Estimation
2022
Radio maps provide metrics such as power spectral density for every location in a geographic area and find numerous applications such as UAV communications, interference control, spectrum management, resource allocation, and network planning to name a few. Radio maps are constructed from measurements collected by spectrum sensors distributed across space. Since radio maps are complicated functions of the spatial coordinates due to the nature of electromagnetic wave propagation, model-free approaches are strongly motivated. Nevertheless, all existing schemes for radio occupancy map estimation rely on interpolation algorithms unable to learn from experience. In contrast, this paper proposes a…
Active Learning Methods for Efficient Hybrid Biophysical Variable Retrieval
2016
Kernel-based machine learning regression algorithms (MLRAs) are potentially powerful methods for being implemented into operational biophysical variable retrieval schemes. However, they face difficulties in coping with large training data sets. With the increasing amount of optical remote sensing data made available for analysis and the possibility of using a large amount of simulated data from radiative transfer models (RTMs) to train kernel MLRAs, efficient data reduction techniques will need to be implemented. Active learning (AL) methods enable to select the most informative samples in a data set. This letter introduces six AL methods for achieving optimized biophysical variable estimat…
Causal inference in geosciences with kernel sensitivity maps
2020
Establishing causal relations between random variables from observational data is perhaps the most important challenge in today's Science. In remote sensing and geosciences this is of special relevance to better understand the Earth's system and the complex and elusive interactions between processes. In this paper we explore a framework to derive cause-effect relations from pairs of variables via regression and dependence estimation. We propose to focus on the sensitivity (curvature) of the dependence estimator to account for the asymmetry of the forward and inverse densities of approximation residuals. Results in a large collection of 28 geoscience causal inference problems demonstrate the…
A probabilistic compressive sensing framework with applications to ultrasound signal processing
2019
Abstract The field of Compressive Sensing (CS) has provided algorithms to reconstruct signals from a much lower number of measurements than specified by the Nyquist-Shannon theorem. There are two fundamental concepts underpinning the field of CS. The first is the use of random transformations to project high-dimensional measurements onto a much lower-dimensional domain. The second is the use of sparse regression to reconstruct the original signal. This assumes that a sparse representation exists for this signal in some known domain, manifested by a dictionary. The original formulation for CS specifies the use of an l 1 penalised regression method, the Lasso. Whilst this has worked well in l…
SIGNAL ANALYSIS AND PERFORMANCE EVALUATION OF A VEHICLE CRASH TEST WITH A FIXED SAFETY BARRIER BASED ON HAAR WAVELETS
2011
Author's version of an article published in the journal: International Journal of Wavelets, Multiresolution and Information Processing. Also available from the publisher at: http://dx.doi.org/10.1142/s0219691311003979 This paper deals with the wavelet-based performance analysis of the safety barrier for use in a full-scale test. The test involves a vehicle, a Ford Fiesta, which strikes the safety barrier at a prescribed angle and speed. The vehicle speed before the collision was measured. Vehicle accelerations in three directions at the center of gravity were measured during the collision. The yaw rate was measured with a gyro meter. Using normal speed and high-speed video cameras, the beha…
Adaptive Techniques for Microarray Image Analysis with Related Quality Assessment
2007
We propose novel techniques for microarray image analysis. In particular, we describe an overall pipeline able to solve the most common problems of microarray image analysis. We pro- pose the microarray image rotation algorithm (MIRA) and the statis- tical gridding pipeline (SGRIP) as two advanced modules devoted to restoring the original microarray grid orientation and to detecting, the correct geometrical information about each spot of input mi- croarray, respectively. Both solutions work by making use of statis- tical observations, obtaining adaptive and reliable information about each spot property. They improve the performance of the microarray image segmentation pipeline (MISP) we rec…
Proceedings of MLSP2012 [front matter]
2012
Proba-V cloud detection Round Robin: Validation results and recommendations
2017
This paper discusses results from 12 months of a Round Robin exercise aimed at the inter-comparison of different cloud detection algorithms for Proba-V. Clouds detection is a critical issue for satellite optical remote sensing, since potential errors in cloud masking directly translates into significant uncertainty in the retrieved downstream geophysical products. Cloud detection is particularly challenging for Proba-V due to the presence of a limited number of spectral bands and the lack of thermal infrared bands. The main objective of the project was the inter-comparison of several cloud detection algorithms for Proba-V over a wide range of surface types and environmental conditions. Prob…