Search results for "Bumping"

showing 1 items of 1 documents

Bagging, bumping, multiview, and active learning for record linkage with empirical results on patient identity data

2011

Record linkage or deduplication deals with the detection and deletion of duplicates in and across files. For this task, this paper introduces and evaluates two new machine-learning methods (bumping and multiview) together with bagging, a tree-based ensemble-approach. Whereas bumping represents a tree-based approach as well, multiview is based on the combination of different methods and the semi-supervised learning principle. After providing a theoretical background of the methods, initial empirical results on patient identity data are given. In the empirical evaluation, we calibrate the methods on three different kinds of training data. The results show that the smallest training data set, …

Patient Identification SystemsTraining setComputer scienceActive learning (machine learning)business.industryHealth InformaticsEmpirical Researchcomputer.software_genreMachine learningComputer Science ApplicationsTask (project management)Set (abstract data type)Tree (data structure)Artificial IntelligenceIdentity (object-oriented programming)HumansBumpingMedical Record LinkageArtificial intelligenceData miningbusinesscomputerSoftwareRecord linkageComputer Methods and Programs in Biomedicine
researchProduct