Search results for "random forest"
showing 10 items of 121 documents
Assessment of mental stress through the analysis of physiological signals acquired from wearable devices
2019
Mental stress is a physiological state that directly correlates to the quality of life of individuals. Generally speaking, but especially true for disabled or elderly subjects, the assessment of such condition represents a very strong indicator correlated to the difficulties, and, in some case, to the frustration that derives from the execution of a task that results troublesome to be accomplished. This article describes a novel procedure for the assessment of the mental stress level through the use of low invasive wireless wearable devices. The information contained in electrocardiogram, respiratory signal, blood volume pulse, and electroencephalogram was extracted to set up an estimator f…
Quantification of the heterogeneity of prognostic cellular biomarkers in ewing sarcoma using automated image and random survival forest analysis
2014
Driven by genomic somatic variation, tumour tissues are typically heterogeneous, yet unbiased quantitative methods are rarely used to analyse heterogeneity at the protein level. Motivated by this problem, we developed automated image segmentation of images of multiple biomarkers in Ewing sarcoma to generate distributions of biomarkers between and within tumour cells. We further integrate high dimensional data with patient clinical outcomes utilising random survival forest (RSF) machine learning. Using material from cohorts of genetically diagnosed Ewing sarcoma with EWSR1 chromosomal translocations, confocal images of tissue microarrays were segmented with level sets and watershed algorithm…
Post-processing of Pixel and Object-Based Land Cover Classifications of Very High Spatial Resolution Images
2020
The state of the art is plenty of classification methods. Pixel-based methods include the most traditional ones. Although these achieved high accuracy when classifying remote sensing images, some limits emerged with the advent of very high-resolution images that enhanced the spectral heterogeneity within a class. Therefore, in the last decade, new classification methods capable of overcoming these limits have undergone considerable development. Within this research, we compared the performances of an Object-based and a Pixel-Based classification method, the Random Forests (RF) and the Object-Based Image Analysis (OBIA), respectively. Their ability to quantify the extension and the perimeter…
Machine Learning Regression Approaches for Colored Dissolved Organic Matter (CDOM) Retrieval with S2-MSI and S3-OLCI Simulated Data
2018
The colored dissolved organic matter (CDOM) variable is the standard measure of humic substance in waters optics. CDOM is optically characterized by its spectral absorption coefficient, a C D O M at at reference wavelength (e.g., ≈ 440 nm). Retrieval of CDOM is traditionally done using bio-optical models. As an alternative, this paper presents a comparison of five machine learning methods applied to Sentinel-2 and Sentinel-3 simulated reflectance ( R r s ) data for the retrieval of CDOM: regularized linear regression (RLR), random forest regression (RFR), kernel ridge regression (KRR), Gaussian process regression (GPR) and support vector machines (SVR). Two different datasets of radiative t…
Random forests, a novel approach for discrimination of fish populations using parasites as biological tags.
2008
Due to the complexity of host-parasite relationships, discrimination between fish populations using parasites as biological tags is difficult. This study introduces, to our knowledge for the first time, random forests (RF) as a new modelling technique in the application of parasite community data as biological markers for population assignment of fish. This novel approach is applied to a dataset with a complex structure comprising 763 parasite infracommunities in population samples of Atlantic cod, Gadus morhua, from the spawning/feeding areas in five regions in the North East Atlantic (Baltic, Celtic, Irish and North seas and Icelandic waters). The learning behaviour of RF is evaluated in …
Minimal learning machine in hyperspectral imaging classification
2020
A hyperspectral (HS) image is typically a stack of frames, where each frame represents the intensity of a different wavelength of light. Each spatial pixel has a spectrum. In the classification of the HS image, each spectrum is classified pixel-by-pixel. In some of the real-time applications, the amount of the HS image data causes performance challenges. Those issues relate to the platforms (e.g. drones) payload restrictions, the issues of the available energy and to the complexity of the machine learning models. In this study, we introduce the minimal learning machine (MLM) as a computationally cheap training and classification machine learning method for the hyperspectral imaging classificatio…
Privacy Preserving Client/Vertical-Servers Classification
2019
We present a novel client/vertical-servers architecture for hybrid multi-party classification problem. The model consists of clients whose attributes are distributed on multiple servers and remain secret during training and testing. Our solution builds privacy-preserving random forests and completes them with a special private set intersection protocol that provides a central commodity server with anonymous conditional statistics. Subsequently, the private set intersection protocol can be used to privately classify the queries of new clients using the commodity server’s statistics. The proviso is that the commodity server must not collude with other parties. In cases where this restriction …
Feature selection on a dataset of protein families: from exploratory data analysis to statistical variable importance
2016
Proteins are characterized by several typologies of features (structural, geometrical, energy). Most of these features are expected to be similar within a protein family. We are interested to detect which features can identify proteins that belong to a family, as well as to define the boundaries among families. Some features are redundant: they could generate noise in identifying which variables are essential as a fingerprint and, consequently, if they are related or not to a function of a protein family. We defined an original approach to analyze protein features for defining their relationships and peculiarities within protein families. A multistep approach has been mainly performed in R …
<strong>Machine Learning and Atom-Based Quadratic Indices for Proteasome Inhibition Prediction </strong>
2015
The atom-based quadratic indices are used in this work together with some machine learning techniques that includes: support vector machine, artificial neural network, random forest and k-nearest neighbor. This methodology is used for the development of two quantitative structure-activity relationship (QSAR) studies for the prediction of proteasome inhibition. A first set consisting of active and non-active classes was predicted with model performances above 85% and 80% in training and validation series, respectively. These results provided new approaches on proteasome inhibitor identification encouraged by virtual screenings procedures. .
<strong>Predicting Proteasome Inhibition using Atomic Weighted Vector and Machine Learning</strong>
2018
Ubiquitin/Proteasome System (UPS) is a highly regulated mechanism of intracellular protein degradation and turnover. Through the concerted actions of a series of enzymes, proteins are marked for proteasomal degradation by being linked to the polypeptide co-factor, ubiquitin. The UPS participates in a wide array of biological functions such as antigen presentation, regulation of gene transcription and the cell cycle, and activation of NF-κB. Some researchers have applied QSAR method and machine learning in the study of proteasome inhibition (EC50(µmol/L)), such as: the analysis of proteasome inhibition prediction, in the prediction of multi-target inhibitors of UPP and in the prediction of p…