Search results for "Training set"
showing 10 items of 68 documents
Silicon Nanowire Sensors Enable Diagnosis of Patients via Exhaled Breath
2016
Two of the biggest challenges in medicine today are the need to detect diseases in a noninvasive manner and to differentiate between patients using a single diagnostic tool. The current study targets these two challenges by developing a molecularly modified silicon nanowire field effect transistor (SiNW FET) and showing its use in the detection and classification of many disease breathprints (lung cancer, gastric cancer, asthma, and chronic obstructive pulmonary disease). The fabricated SiNW FETs are characterized and optimized based on a training set that correlates their sensitivity and selectivity toward volatile organic compounds (VOCs) linked with the various disease breathprints. The b…
2014
Large data set classification is widely used in many industrial applications. It is a challenging task to classify large data sets efficiently, accurately, and robustly, as large data sets always contain numerous instances with a high dimensional feature space. To deal with this problem, in this paper we present an online Logdet divergence based metric learning (LDML) model that exploits the power of metric learning. We first generate a Mahalanobis matrix by learning from the training data with the LDML model. Meanwhile, we propose a compressed representation for the high dimensional Mahalanobis matrix to reduce the computational complexity in each iteration. The final Mahalanobis mat…
Path relinking and GRG for artificial neural networks
2006
Artificial neural networks (ANN) have been widely used for both classification and prediction. This paper is focused on the prediction problem in which an unknown function is approximated. ANNs can be viewed as models of real systems, built by tuning parameters known as weights. In training the net, the problem is to find the weights that optimize its performance (i.e., to minimize the error over the training set). Although the most popular method for training these networks is back propagation, other optimization methods such as tabu search or scatter search have been successfully applied to solve this problem. In this paper we propose a path relinking implementation to solve the neural ne…
Online Metric Learning Methods Using Soft Margins and Least Squares Formulations
2012
Online metric learning using margin maximization has been introduced as a way to learn appropriate dissimilarity measures in an efficient way when information as pairs of examples is given to the learning system in a progressive way. These schemes have several practical advantages with regard to global ones in which a training set needs to be processed. On the other hand, they may suffer from a poor performance depending on the quality of the examples and the particular tuning or other implementation details. This paper formulates several online metric learning alternatives using a passive-aggressive schema. A new formulation of the online problem using least squares is also introduced. The…
Ranking of Brain Tumour Classifiers Using a Bayesian Approach
2009
This study presents a ranking for classifiers using a Bayesian perspective. This ranking framework is able to evaluate the performance of the models to be compared when they are inferred from different sets of data. It also takes into account the performance obtained on samples not used during the training of the classifiers. Besides, this ranking assigns a prior to each model based on a measure of similarity of the training data to a test case. An evaluation consisting of ranking brain tumour classifiers is presented. These multilayer perceptron classifiers are trained with 1H magnetic resonance spectroscopy (MRS) signals following a multiproject multicenter evaluation approach. We demonstr…
Bagging, bumping, multiview, and active learning for record linkage with empirical results on patient identity data
2011
Record linkage or deduplication deals with the detection and deletion of duplicates in and across files. For this task, this paper introduces and evaluates two new machine-learning methods (bumping and multiview) together with bagging, a tree-based ensemble-approach. Whereas bumping represents a tree-based approach as well, multiview is based on the combination of different methods and the semi-supervised learning principle. After providing a theoretical background of the methods, initial empirical results on patient identity data are given. In the empirical evaluation, we calibrate the methods on three different kinds of training data. The results show that the smallest training data set, …
New tyrosinase inhibitors selected by atomic linear indices-based classification models
2005
In the present report, the use of atom-based linear indices for finding functions that discriminate between tyrosinase inhibitor compounds and inactive ones is presented. In this sense, discriminant models were applied and globally good classifications of 93.51% and 92.46% were observed for the best non-stochastic and stochastic linear indices models, respectively, in the training set. The external prediction sets had accuracies of 91.67% and 89.44%. In addition, these fitted models were used in the screening of new cycloartane compounds isolated from herbal plants. Good agreement is shown between the theoretical and experimental results. These results provide a tool that can be used i…
A genetic algorithm approach to purify the classifier training labels for the analysis of remote sensing imagery
2017
This paper proposes a Genetic Algorithm (GA) approach to clean a given classifier training set for remote sensing image analysis. Starting from an initial set of training data, the new method, called GA-Training Label Purifying (GA-TLP), selects significant training samples using GAs in order to maximize the classifier accuracy. This means retaining the most informative samples and removing the uncertain, redundant, and misclassified ones. As a result of the selection process, we can obtain a purified training set. The proposed model is implemented and evaluated using a LANDSAT 7 ETM+ image. The experimental results confirm the effectiveness of the proposed approach.
Survival Prediction in Intrahepatic Cholangiocarcinoma: A Proof of Concept Study Using Artificial Intelligence for Risk Assessment
2021
Several scoring systems have been devised to objectively predict survival for patients with intrahepatic cholangiocellular carcinoma (ICC) and support treatment stratification, but they have failed external validation. The aim of the present study was to improve prognostication using an artificial intelligence-based approach. We retrospectively identified 417 patients with ICC who were referred to our tertiary care center between 1997 and 2018. Of these, 293 met the inclusion criteria. Established risk factors served as input nodes for an artificial neural network (ANN). We compared the performance of the trained model to the most widely used conventional scoring system, the Fudan score. Pr…
Identification of parameters of dynamic Preisach model by neural networks
2008
In this paper, an approach for identifying the parameters of the dynamic Preisach model is presented. The fundamental idea of this method is to identify the parameters of a material by using a neural network trained on a collection of hysteresis curves whose Preisach model is known. After a brief description of the dynamic Preisach model, the neural network that has been used is introduced. The construction of the training data set is illustrated. Finally, the effectiveness of the method is tested on both numerical and experimental data.