Search results for "Support Vector Machine"
showing 10 items of 306 documents
LEGO-based generalized set of two linear algebraic 3D bio-macro-molecular descriptors: Theory and validation by QSARs
2019
Novel 3D protein descriptors based on bilinear, quadratic and linear algebraic maps in R^n are proposed. The latter employs the kth 2-tuple (dis)similarity matrix to codify information related to covalent and non-covalent interactions in these biopolymers. The calculation of the inter-amino acid distances is generalized by using several dissimilarity coefficients, where normalization procedures based on the simple stochastic and mutual probability schemes are applied. A new local-fragment approach based on amino acid-types and amino acid-groups is proposed to characterize regions of interest in proteins. Topological and geometric macromolecular cutoffs are defined using local and…
Defining classifier regions for WSD ensembles using word space features
2006
Based on recent evaluation of word sense disambiguation (WSD) systems [10], disambiguation methods have reached a standstill. In [10] we showed that it is possible to predict the best system for a target word using word features and that, using this 'optimal ensembling method', more accurate WSD ensembles can be built (3-5% over Senseval state of the art systems with the same amount of possible potential remaining). In the interest of developing more accurate ensembles, we here define the strong regions for three popular and effective classifiers used for the WSD task (Naive Bayes – NB, Support Vector Machine – SVM, Decision Rules – D) using word features (word grain, amount of positive and neg…
Building an Optimal WSD Ensemble Using Per-Word Selection of Best System
2006
In Senseval workshops for evaluating WSD systems [1,4,9], no one system or system type (classifier algorithm, type of system ensemble, extracted feature set, lexical knowledge source etc.) has been discovered that resolves all ambiguous words into their senses in a superior way. This paper presents a novel method for selecting the best system for a target word based on readily available word features (number of senses, average amount of training per sense, dominant sense ratio). Applied to Senseval-3 and Senseval-2 English lexical sample state-of-the-art systems, a net gain of approximately 2.5 – 5.0% (respectively) in average precision per word over the best base system is achieved. The method c…
Cell state prediction through distributed estimation of transmit power
2019
Determining the state of each cell, for instance, cell outages, in a densely deployed cellular network is a difficult problem. Several prior studies have used minimization of drive test (MDT) reports to detect cell outages. In this paper, we propose a two-step process. First, using the MDT reports, we estimate the serving base station’s transmit power for each user. Second, we learn summary statistics of estimated transmit power for various network states and use these to classify the network state on test data. Our approach is able to achieve an accuracy of 96% on an NS-3 simulation dataset. Decision tree, random forest and SVM classifiers were able to achieve a classification accuracy of…
Attention-based Model for Evaluating the Complexity of Sentences in English Language
2020
The automation of text complexity evaluation (ATCE) is an emerging problem which has been tackled by means of different methodologies. We present an effective deep learning-based solution which leverages both Recurrent Neural Networks and the Attention mechanism. The developed system is capable of classifying sentences written in the English language by analysing their syntactical and lexical complexity. An accurate test phase has been carried out, and the system has been compared with a baseline tool based on the Support Vector Machine. This paper represents an extension of a previous deep learning model, which allows showing the suitability of Neural Networks to evaluate sentence complexity in tw…
Remote sensing image segmentation by active queries
2012
Active learning deals with developing methods that select examples that may express data characteristics in a compact way. For remote sensing image segmentation, the selected samples are the most informative pixels in the image so that classifiers trained with reduced active datasets become faster and more robust. Strategies for intelligent sampling have been proposed with model-based heuristics aiming at the search of the most informative pixels to optimize the model's performance. Unlike standard methods that concentrate on model optimization, here we propose a method inspired by the cluster assumption that holds in most remote sensing data. Starting from a complete hierarchical descri…
Discovering single classes in remote sensing images with active learning
2012
When dealing with supervised target detection, the acquisition of labeled samples is one of the most critical phases: the samples must be representative of the class of interest, but must also be found among a vast majority of non-target examples. Moreover, the efficiency of the search is also an issue, since the samples labeled as background are not used by target detectors such as the support vector data description (SVDD). In this work we propose a competitive and effective approach to identify the most relevant training samples for one-class classification based on the use of an active learning strategy. The SVDD classifier is first trained with insufficient target examples. It is t…
Improving active learning methods using spatial information
2011
The active learning process represents an interesting solution to the problem of training sample collection for the classification of remote sensing images. In this work, we propose a criterion based on spatial information that can be used in combination with a spectral criterion in order to improve the selection of training samples. Experimental results obtained on a very high resolution image show the effectiveness of regularization in the spatial domain and open challenging perspectives for terrain campaign planning. © 2011 IEEE.
Dosage individualization of erythropoietin using a profile-dependent support vector regression
2003
The external administration of recombinant human erythropoietin is the chosen treatment for those patients with secondary anemia due to chronic renal failure in periodic hemodialysis. The objective of this paper is to carry out an individualized prediction of the EPO dosage to be administered to those patients. The high cost of this medication, its side-effects and the phenomenon of potential resistance which some individuals suffer all justify the need for a model which is capable of optimizing dosage individualization. A group of 110 patients and several patient factors were used to develop the models. The support vector regressor (SVR) is benchmarked with the classical multilayer percept…
Prediction of the hemoglobin level in hemodialysis patients using machine learning techniques
2013
Highlights: Different prediction algorithms were used to predict Hb levels in CRF patients. Prediction errors in the validation cohorts of patients were around 0.6 g/dl. Difficulty to obtain lower errors due to the measuring machine precision (0.2 g/dl). Relevance analysis of features has been applied for each predictor. Patients who suffer from chronic renal failure (CRF) tend to suffer from an associated anemia as well. Therefore, it is essential to know the hemoglobin (Hb) levels in these patients. The aim of this paper is to predict the hemoglobin (Hb) value using a database of European hemodialysis patients provided by Fresenius Medical Care (FMC) for improving the treatment of this kind of …