Search results for "Regression"
showing 10 items of 2619 documents
Image retrieval system for citizen services using penalized logistic regression models
2020
This paper describes a procedure to deal with large image collections obtained by smart city services based on interaction with citizens providing pictures. The semantic gap between the low-level image features and represented concepts and situations has been addressed using image retrieval techniques. A relevance feedback procedure is proposed for Content-Based Image Retrieval (CBIR) based on the modelling of user responses. One of the novelties of the proposal is that the feedback learning procedure can use the information that citizens themselves can provide when using these services.The proposed algorithm considers the probability of an image belonging to the set of those sought by the …
Renewable energy growth and the financial performance of electric utilities: A panel data study
2017
Electric utilities are under pressure to increase clean energy production. Although the adoption of renewable energy can improve the utilities' environmental performance, a fundamental question is if it also pays in economic terms. Building on the natural-resource-based view of the firm, we answer this question using two data analysis methods. First, we carry out a regression analysis of panel data from 66 large electric utilities covering the period 2005–2014, applying both a fixed and random effects estimator. Subsequently, we use the Granger causality test to explore possible causality links. Our results show a negative correlation at the firm level between renewable energy increase and …
“Natural wine” consumers and interest in label information: An analysis of willingness to pay in a new Italian wine market segment
2019
Abstract Increasing public attention to issues of health and environmental sustainability has contributed to a growing consumer demand for “natural” food and drinks. As has been observed, this trend has also affected the wine market, leading to the spread of so-called “natural wine”. According to the literature, consumers who are aware of the social and environmental impact of their consumption choices pay more attention to the information displayed on the label as a tool to reduce the risk associated with their purchase. This study seeks to identify which consumers are willing to pay for natural wine and to understand what information on the label influences their choice. This study is one…
District heating networks: enhancement of the efficiency
2019
International audience; During the decades the district heating's (DH) advantages (more cost-efficient heat generation and reduced air pollution) overcompensated the additional costs of transmission and distribution of the centrally produced thermal energy to consumers. Rapid increase in the efficiency of low-power heaters, development of separated low heat density areas in cities reduce the competitiveness of the large centralized DH systems in comparison with the distributed cluster-size networks and even local heating. Reduction of transmission costs, enhancement of the network efficiency by optimization of the design of the DH networks become a critical issue. The methodology for determ…
Autonomous ultrasonic inspection using Bayesian optimisation and robust outlier analysis
2020
The use of robotics is beginning to play a key role in automating the data collection process in Non Destructive Testing (NDT). Increasing the use of automation quickly leads to the gathering of large quantities of data, which makes it inefficient, perhaps even infeasible, for a human to parse the information contained in them. This paper presents a solution to this problem by making the process of NDT data acquisition an autonomous one as opposed to an automatic one. In order to achieve this, the robotic data acquisition task is treated as an optimisation problem, where one seeks to find locations with the highest indication of damage. The resulting algorithm combines damage detection tech…
VARIABLE SELECTION FOR NOISY DATA APPLIED IN PROTEOMICS
2014
International audience; The paper proposes a variable selection method for pro-teomics. It aims at selecting, among a set of proteins, those (named biomarkers) which enable to discriminate between two groups of individuals (healthy and pathological). To this end, data is available for a cohort of individuals: the biological state and a measurement of concentrations for a list of proteins. The proposed approach is based on a Bayesian hierarchical model for the dependencies between biological and instrumental variables. The optimal selection function minimizes the Bayesian risk, that is to say the selected set of variables maximizes the posterior probability. The two main contributions are: (…
Input Selection Methods for Soft Sensor Design: A Survey
2020
Soft Sensors (SSs) are inferential models used in many industrial fields. They allow for real-time estimation of hard-to-measure variables as a function of available data obtained from online sensors. SSs are generally built using industries historical databases through data-driven approaches. A critical issue in SS design concerns the selection of input variables, among those available in a candidate dataset. In the case of industrial processes, candidate inputs can reach great numbers, making the design computationally demanding and leading to poorly performing models. An input selection procedure is then necessary. Most used input selection approaches for SS design are addressed in this …
Do Randomized Algorithms Improve the Efficiency of Minimal Learning Machine?
2020
Minimal Learning Machine (MLM) is a recently popularized supervised learning method, which is composed of distance-regression and multilateration steps. The computational complexity of MLM is dominated by the solution of an ordinary least-squares problem. Several different solvers can be applied to the resulting linear problem. In this paper, a thorough comparison of possible and recently proposed, especially randomized, algorithms is carried out for this problem with a representative set of regression datasets. In addition, we compare MLM with shallow and deep feedforward neural network models and study the effects of the number of observations and the number of features with a special dat…
Nonlinear statistical retrieval of surface emissivity from IASI data
2017
Emissivity is one of the most important parameters to improve the determination of the troposphere properties (thermodynamic properties, aerosols and trace gases concentration) and it is essential to estimate the radiative budget. With the second generation of infrared sounders, we can estimate emissivity spectra at high spectral resolution, which gives us a global view and long-term monitoring of continental surfaces. Statistically, this is an ill-posed retrieval problem, with as many output variables as inputs. We here propose nonlinear multi-output statistical regression based on kernel methods to estimate spectral emissivity given the radiances. Kernel methods can cope with high-dimensi…
Exploring relationships between grid cell size and accuracy for debris-flow susceptibility models: a test in the Giampilieri catchment (Sicily, Italy)
2016
Debris flows are among the most hazardous phenomena in nature, requiring the preparation of suscep- tibility models in order to cope with this severe threat. The aim of this research was to verify whether a grid cell-based susceptibility model was capable of predicting the debris- flow initiation sites in the Giampilieri catchment (10 km2), which was hit by a storm on the 1st October 2009, resulting in more than one thousand landslides. This kind of event is to be considered as recurrent in the area as attested by historical data. Therefore, predictive models have been prepared by using forward stepwise binary logistic regression (BLR), a landslide inventory and a set of geo- environmental …