Search results for "Random forest"
showing 10 items of 121 documents
Assessing Spillover Effects of Spatial Policies with Semiparametric Zero-Inflated Models and Random Forests
2021
The aim of this work is to estimate the variation over time of the spatial spillover effects of a public policy that was devoted to boost rural development in France over the period 1993–2002. At a micro data level, it is often observed that the dependent variable, such as local employment in a municipality, does not vary along time, so that we face a kind of zero inflated phenomenon that cannot be dealt with a classical continuous response model or propensity score approaches. We consider two recent non parametric techniques that are able to deal with that estimation issue. The first approach consists in fitting two generalized additive models to estimate both the probability of no variati…
An efficient data model for energy prediction using wireless sensors
2019
International audience; Energy prediction is in high importance for smart homes and smart cities, since it helps reduce power consumption and provides better energy and cost savings. Many algorithms have been used for predicting energy consumption using data collected from Internet of Things (IoT) devices and wireless sensors. In this paper, we propose a system based on Multilayer Perceptron (MLP) to predict energy consumption of a building using collected information (e.g., light energy, day of the week, humidity, temperature, etc.) from a Wireless Sensor Network (WSN). We compare our system against four other classification algorithms, namely: Linear Regression (LR), Support Vector Machin…
2020
Piping erosion is one form of water erosion that leads to significant changes in the landscape and environmental degradation. In the present study, we evaluated piping erosion modeling in the Zarandieh watershed of Markazi province in Iran based on random forest (RF), support vector machine (SVM), and Bayesian generalized linear models (Bayesian GLM) machine learning algorithms. For this goal, due to the importance of various geo-environmental and soil properties in the evolution and creation of piping erosion, 18 variables were considered for modeling the piping erosion susceptibility in the Zarandieh watershed. A total of 152 points of piping erosion were recognized in the study area that…
Modifiable Lifestyle Factors and Cognitive Function in Older People: A Cross-Sectional Observational Study
2019
Background: The development of evidence-based interventions for delaying or preventing cognitive impairment is an important challenge. Most previous studies using self-report questionnaires face problems with reliability and consistency due to recall bias or misclassification among older people. Therefore, objective measurement of lifestyle components is needed to confirm the relationships between lifestyle factors and cognitive function. Aims: The current study examined the relationship between lifestyle factors collected with wearable sensors and cognitive function among community-dwelling older people using machine learning. Methods: In total, 855 participants (mean age: 73.8 years) wore…
Growing stock volume from multi-temporal landsat imagery through google earth engine
2019
Growing stock volume (GSV) is one of the most important variables for.forest management and is traditionally- estimated from ground measurements. These measurements are expensive and therefore sparse and hard to maintain in time on a regular basis. Remote sensing data combined with national forest inventories constitute a helpful tool to estimate and map forest attributes. However, most studies on GSV estimation from remote sensing data focus on small forest areas with a single or only a few species. The current study aims to map GSV in peninsular Spain, a rather large and very heterogeneous area. Around 50 000 wooded land plots from the Third Spanish National Forest Inventory (NFI3) were u…
Comparative Study of Several Machine Learning Algorithms for Classification of Unifloral Honeys
2021
Unifloral honeys are highly demanded by honey consumers, especially in Europe. To ensure that a honey belongs to a very appreciated botanical class, the classical methodology is palynological analysis to identify and count pollen grains. Highly trained personnel are needed to perform this task, which complicates the characterization of honey botanical origins. Organoleptic assessment of honey by expert personnel helps to confirm such classification. In this study, the ability of different machine learning (ML) algorithms to correctly classify seven types of Spanish honeys of single botanical origins (rosemary, citrus, lavender, sunflower, eucalyptus, heather and forest honeydew) was investi…
Computer-aided detection of cerebral microbleeds in susceptibility-weighted imaging.
2014
Susceptibility-weighted imaging (SWI) is recognized as the preferred MRI technique for visualizing cerebral vasculature and related pathologies such as cerebral microbleeds (CMBs). Manual identification of CMBs is time-consuming, has limited reliability and reproducibility, and is prone to misinterpretation. In this paper, a novel computer-aided microbleed detection technique based on machine learning is presented: First, spherical-like objects (potential CMB candidates) with their corresponding bounding boxes were detected using a novel multi-scale Laplacian of Gaussian technique. A set of robust 3-dimensional Radon- and Hessian-based shape descriptors within each bounding box were then ex…
Machine learning techniques to estimate the degree of binder activity of reclaimed asphalt pavement
2022
Part of this research was funded by the project RTI2018-096224-J-I00 that has been cofounded by the Spanish Ministry of Science and Innovation, inside the National Program for Fostering Excellence in Scientific and Technical Research, National Subprogram of Knowledge Generation, 2018 call, in the framework of the Spanish National Plan for Scientific and Technical Research and Innovation 2017-2020, and by the European Union, through the European Regional Development Fund, with the main objective of Promoting technological development, innovation and quality research. Part of this work was financially supported by the Italian Ministry of University and Research with the research Grant PRIN 20…
A comparison of ensemble algorithms for item-weighted Label Ranking
2023
Label Ranking (LR) is a non-standard supervised classification method with the aim of ranking a finite collection of labels according to a set of predictor variables. Traditional LR models assume indifference among alternatives. However, misassigning the ranking position of a highly relevant label is frequently regarded as more severe than failing to predict a trivial label. Moreover, switching two similar alternatives should be considered less severe than switching two different ones. Therefore, efficient LR classifiers should be able to take into account the similarities and individual weights of the items to be ranked. The contribution of this paper is to formulate and compare flexible i…
Robust estimation of mean electricity consumption curves by sampling for small areas in presence of missing values
2017
In this thesis, we address the problem of robust estimation of mean or total electricity consumption curves by sampling in a finite population for the entire population and for small areas. We are also interested in estimating mean curves by sampling in presence of partially missing trajectories.Indeed, many studies carried out in the French electricity company EDF, for marketing or power grid management purposes, are based on the analysis of mean or total electricity consumption curves at a fine time scale, for different groups of clients sharing some common characteristics.Because of privacy issues and financial costs, it is not possible to measure the electricity consumption curve of eac…