Search results for "Training set"

showing 10 items of 68 documents

Silicon Nanowire Sensors Enable Diagnosis of Patients via Exhaled Breath

2016

Two of the biggest challenges in medicine today are the need to detect diseases in a noninvasive manner and to differentiate between patients using a single diagnostic tool. The current study targets these two challenges by developing a molecularly modified silicon nanowire field effect transistor (SiNW FET) and showing its use in the detection and classification of many disease breathprints (lung cancer, gastric cancer, asthma, and chronic obstructive pulmonary disease). The fabricated SiNW FETs are characterized and optimized based on a training set that correlate their sensitivity and selectivity toward volatile organic compounds (VOCs) linked with the various disease breathprints. The b…

Lung DiseasesSiliconVolatile Organic CompoundsMaterials scienceTraining setNanowiresGeneral EngineeringGeneral Physics and AstronomyPulmonary diseaseNanotechnology02 engineering and technology010402 general chemistry021001 nanoscience & nanotechnology01 natural sciencesAsthma3. Good health0104 chemical sciencesBreath TestsOthersHumansGeneral Materials Science0210 nano-technologySilicon nanowiresBiomedical engineering
researchProduct

2014

Large data sets classification is widely used in many industrial applications. It is a challenging task to classify large data sets efficiently, accurately, and robustly, as large data sets always contain numerous instances with high dimensional feature space. In order to deal with this problem, in this paper we present an online Logdet divergence based metric learning (LDML) model by making use of the powerfulness of metric learning. We firstly generate a Mahalanobis matrix via learning the training data with LDML model. Meanwhile, we propose a compressed representation for high dimensional Mahalanobis matrix to reduce the computation complexity in each iteration. The final Mahalanobis mat…

Mahalanobis distanceTraining setApplied MathematicsFeature vectorHigh dimensionalcomputer.software_genreComputation complexityData miningBenchmark dataClassifier (UML)computerAlgorithmAnalysisMathematicsAbstract and Applied Analysis
researchProduct

Path relinking and GRG for artificial neural networks

2006

Artificial neural networks (ANN) have been widely used for both classification and prediction. This paper is focused on the prediction problem in which an unknown function is approximated. ANNs can be viewed as models of real systems, built by tuning parameters known as weights. In training the net, the problem is to find the weights that optimize its performance (i.e., to minimize the error over the training set). Although the most popular method for training these networks is back propagation, other optimization methods such as tabu search or scatter search have been successfully applied to solve this problem. In this paper we propose a path relinking implementation to solve the neural ne…

Mathematical optimizationInformation Systems and ManagementTraining setGeneral Computer ScienceArtificial neural networkComputer sciencebusiness.industryManagement Science and Operations ResearchSolverIndustrial and Manufacturing EngineeringBackpropagationEvolutionary computationTabu searchNonlinear programmingSearch algorithmModeling and SimulationArtificial intelligencebusinessMetaheuristicEuropean Journal of Operational Research
researchProduct

Online Metric Learning Methods Using Soft Margins and Least Squares Formulations

2012

Online metric learning using margin maximization has been introduced as a way to learn appropriate dissimilarity measures in an efficient way when information as pairs of examples is given to the learning system in a progressive way. These schemes have several practical advantages with regard to global ones in which a training set needs to be processed. On the other hand, they may suffer from a poor performance depending on the quality of the examples and the particular tuning or other implementation details. This paper formulates several online metric learning alternatives using a passive-aggressive schema. A new formulation of the online problem using least squares is also introduced. The…

Mathematical optimizationTraining setbusiness.industrymedia_common.quotation_subjectMachine learningcomputer.software_genreLeast squaresSchema (genetic algorithms)Margin maximizationMetric (mathematics)Learning methodsQuality (business)Artificial intelligencebusinesscomputerMathematicsmedia_common
researchProduct

Ranking of Brain Tumour Classifiers Using a Bayesian Approach

2009

This study presents a ranking for classifers using a Bayesian perspective. This ranking framework is able to evaluate the performance of the models to be compared when they are inferred from different sets of data. It also takes into account the performance obtained on samples not used during the training of the classifiers. Besides, this ranking assigns a prior to each model based on a measure of similarity of the training data to a test case. An evaluation consisting of ranking brain tumour classifiers is presented. These multilayer perceptron classifiers are trained with 1H magnetic resonance spectroscopy (MRS) signals following a multiproject multicenter evaluation approach. We demonstr…

Measure (data warehouse)Training setComputer sciencebusiness.industryPerspective (graphical)Bayesian probabilityPattern recognitionMachine learningcomputer.software_genreRanking (information retrieval)Random subspace methodSimilarity (network science)Multilayer perceptronArtificial intelligencebusinesscomputer
researchProduct

Bagging, bumping, multiview, and active learning for record linkage with empirical results on patient identity data

2011

Record linkage or deduplication deals with the detection and deletion of duplicates in and across files. For this task, this paper introduces and evaluates two new machine-learning methods (bumping and multiview) together with bagging, a tree-based ensemble-approach. Whereas bumping represents a tree-based approach as well, multiview is based on the combination of different methods and the semi-supervised learning principle. After providing a theoretical background of the methods, initial empirical results on patient identity data are given. In the empirical evaluation, we calibrate the methods on three different kinds of training data. The results show that the smallest training data set, …

Patient Identification SystemsTraining setComputer scienceActive learning (machine learning)business.industryHealth InformaticsEmpirical Researchcomputer.software_genreMachine learningComputer Science ApplicationsTask (project management)Set (abstract data type)Tree (data structure)Artificial IntelligenceIdentity (object-oriented programming)HumansBumpingMedical Record LinkageArtificial intelligenceData miningbusinesscomputerSoftwareRecord linkageComputer Methods and Programs in Biomedicine
researchProduct

New tyrosinase inhibitors selected by atomic linear indices-based classification models.

2005

In the present report, the use of the atom-based linear indices for finding functions that discriminate between the tyrosinase inhibitor compounds and inactive ones is presented. In this sense, discriminant models were applied and globally good classifications of 93.51% and 92.46% were observed for non-stochastic and stochastic linear indices best models, respectively, in the training set. The external prediction sets had accuracies of 91.67% and 89.44%. In addition, these fitted models were used in the screening of new cycloartane compounds isolated from herbal plants. A good behavior is shown between the theoretical and experimental results. These results provide a tool that can be used i…

Quantitative structure–activity relationshipMolecular modelStereochemistryTyrosinaseClinical BiochemistryMolecular ConformationPharmaceutical ScienceQuantitative Structure-Activity RelationshipBiochemistrySensitivity and SpecificityChemometricsDrug DiscoveryComputer SimulationEnzyme InhibitorsMolecular BiologyTraining setChemistryMonophenol MonooxygenaseOrganic ChemistryLinear discriminant analysisTriterpenesDiscriminantModels ChemicalTopological indexMolecular MedicineBiological systemBioorganicmedicinal chemistry letters
researchProduct

A genetic algorithm approach to purify the classifier training labels for the analysis of remote sensing imagery

2017

This paper proposes a Genetic Algorithm (GA) approach to clean a given classifier training set for remote sensing image analysis. Starting from an initial set of training data, the new method called GA-Training Label Purifying (GA-TLP) consists of the significant training sample selection using GAs in order to maximize the classifier accuracy. This means to retain the most informative samples and to remove the uncertain, redundant, and misclassified ones. As a result of the selection process, we can obtain a purified training set. The proposed model is implemented and evaluated using a LANDSAT 7 ETM+ image. The experimental results confirm the effectiveness of the proposed approach.

Sample selectionSupport vector machineTraining set020204 information systemsGenetic algorithm0211 other engineering and technologies0202 electrical engineering electronic engineering information engineering02 engineering and technologyClassifier (UML)021101 geological & geomatics engineeringRemote sensing2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS)
researchProduct

Survival Prediction in Intrahepatic Cholangiocarcinoma: A Proof of Concept Study Using Artificial Intelligence for Risk Assessment

2021

Several scoring systems have been devised to objectively predict survival for patients with intrahepatic cholangiocellular carcinoma (ICC) and support treatment stratification, but they have failed external validation. The aim of the present study was to improve prognostication using an artificial intelligence-based approach. We retrospectively identified 417 patients with ICC who were referred to our tertiary care center between 1997 and 2018. Of these, 293 met the inclusion criteria. Established risk factors served as input nodes for an artificial neural network (ANN). We compared the performance of the trained model to the most widely used conventional scoring system, the Fudan score. Pr…

Scoring systemTertiary careArticle03 medical and health sciences0302 clinical medicineintrahepatic cholangiocarcinomaMedicinesurvival predictionIntrahepatic Cholangiocarcinomarisk scoringTraining setFudan scoreArtificial neural networkbusiness.industryRExternal validationGeneral Medicineartificial intelligencemachine learningCholangiocellular carcinoma030220 oncology & carcinogenesisMedicine030211 gastroenterology & hepatologyArtificial intelligencebusinessRisk assessmentartificial neural networkJournal of Clinical Medicine
researchProduct

Identification of parameters of dynamic Preisach model by neural networks

2008

In this paper, an approach that allows to identify the parameters of dynamic Preisach model is presented. The fundamental idea of this method is to identify the parameters of a material by using a neural network trained by a collection of hysteresis curves, whose Preisach model is known. After a brief description of dynamic Preisach Model, the neural network that has been used is introduced. The construction of the training data set is illustrated. Finally, the effectiveness of the method is tested on both numerical as well as experimental data.

Set (abstract data type)HysteresisIdentification (information)Training setArtificial neural networkComputer scienceGeneral Physics and AstronomyExperimental dataMagnetic hysteresisAlgorithm
researchProduct