Search results for "Training set"
showing 10 items of 68 documents
Bagging and Boosting with Dynamic Integration of Classifiers
2000
One approach in classification tasks is to use machine learning techniques to derive classifiers using learning instances. The co-operation of several base classifiers as a decision committee has succeeded to reduce classification error. The main current decision committee learning approaches boosting and bagging use resampling with the training set and they can be used with different machine learning techniques which derive base classifiers. Boosting uses a kind of weighted voting and bagging uses equal weight voting as a combining method. Both do not take into account the local aspects that the base classifiers may have inside the problem space. We have proposed a dynamic integration tech…
A new paradigm for pattern classification: Nearest Border Techniques
2013
Published version of a chapter in the book: AI 2013: Advances in Artificial Intelligence. Also available from the publisher at: http://dx.doi.org/10.1007/978-3-319-03680-9_44 There are many paradigms for pattern classification. As opposed to these, this paper introduces a paradigm that has not been reported in the literature earlier, which we shall refer to as the Nearest Border (NB) paradigm. The philosophy for developing such a NB strategy is as follows: Given the training data set for each class, we shall first attempt to create borders for each individual class. After that, we advocate that testing is accomplished by assigning the test sample to the class whose border it lies closest to…
On the Online Classification of Data Streams Using Weak Estimators
2016
In this paper, we propose a novel online classifier for complex data streams which are generated from non-stationary stochastic properties. Instead of using a single training model and counters to keep important data statistics, the introduced online classifier scheme provides a real-time self-adjusting learning model. The learning model utilizes the multiplication-based update algorithm of the Stochastic Learning Weak Estimator (SLWE) at each time instant as a new labeled instance arrives. In this way, the data statistics are updated every time a new element is inserted, without requiring that we have to rebuild its model when changes occur in the data distributions. Finally, and most impo…
Combining conjunctive rule extraction with diffusion maps for network intrusion detection
2013
Network security and intrusion detection are important in the modern world where communication happens via information networks. Traditional signature-based intrusion detection methods cannot find previously unknown attacks. On the other hand, algorithms used for anomaly detection often have black box qualities that are difficult to understand for people who are not algorithm experts. Rule extraction methods create interpretable rule sets that act as classifiers. They have mostly been combined with already labeled data sets. This paper aims to combine unsupervised anomaly detection with rule extraction techniques to create an online anomaly detection framework. Unsupervised anomaly detectio…
A Comparison of Advanced Regression Algorithms for Quantifying Urban Land Cover
2014
Quantitative methods for mapping sub-pixel land cover fractions are gaining increasing attention, particularly with regard to upcoming hyperspectral satellite missions. We evaluated five advanced regression algorithms combined with synthetically mixed training data for quantifying urban land cover from HyMap data at 3.6 and 9 m spatial resolution. Methods included support vector regression (SVR), kernel ridge regression (KRR), artificial neural networks (NN), random forest regression (RFR) and partial least squares regression (PLSR). Our experiments demonstrate that both kernel methods SVR and KRR yield high accuracies for mapping complex urban surface types, i.e., rooftops, pavements, gras…
Emulation of 2D Hydrodynamic Flood Simulations at Catchment Scale Using ANN and SVR
2021
Two-dimensional (2D) hydrodynamic models are one of the most widely used tools for flood modeling practices and risk estimation. The 2D models provide accurate results
Improving the k-NCN classification rule through heuristic modifications
1998
Abstract This paper presents an empirical investigation of the recently proposed k-Nearest Centroid Neighbours ( k -NCN) classification rule along with two heuristic modifications of it. These alternatives make use of both proximity and geometrical distribution of the prototypes in the training set in order to estimate the class label of a given sample. The experimental results show that both alternatives give significantly better classification rates than the k -Nearest Neighbours rule, basically due to the properties of the plain k -NCN technique.
Artificial neural networks for predicting dorsal pressures on the foot surface while walking
2012
In this work, artificial neural networks (ANNs) are proposed to predict the dorsal pressure over the foot surface exerted by the shoe upper while walking. A model that is based on the multilayer perceptron (MLP) is used since it can provide a single equation to model the exerted pressure for all the materials used as shoe uppers. Five different models are produced, one model for each one of the four subjects under study and an overall model for the four subjects. The inputs to the neural model include the characteristics of the material and the positions during a whole step of 14 pressure sensors placed on the foot surface. The goal is to find models with good generalization capabilities, (…
Generating Winter Wheat Global Crop Calendars in the Framework of Worldcereal
2021
In this study we present a methodology to develop a global winter wheat crop calendar based on the existing crop calendar products from FAO and GEOGLAM Crop Monitor in the framework of the WorldCereal project. It is based on integrating both datasets by building on the accuracy from Crop Monitor and the spatial resolution from the Food and Agriculture Organization of the United Nations (FAO). Additionally, given the global extent of WorldCereal and the gaps that both products present at global scale, we simulated the crop calendars in those areas not covered by any of the products. To do so, we integrated a Regression-Kriging model considering as training data the calendars derived from bot…
FABC: Retinal Vessel Segmentation Using AdaBoost
2010
This paper presents a method for automated vessel segmentation in retinal images. For each pixel in the field of view of the image, a 41-D feature vector is constructed, encoding information on the local intensity structure, spatial properties, and geometry at multiple scales. An AdaBoost classifier is trained on 789 914 gold standard examples of vessel and nonvessel pixels, then used for classifying previously unseen images. The algorithm was tested on the public digital retinal images for vessel extraction (DRIVE) set, frequently used in the literature and consisting of 40 manually labeled images with gold standard. Results were compared experimentally with those of eight algorithms as we…