Search results for "ComputingMethodologies_PATTERNRECOGNITION"
showing 10 items of 296 documents
Bag of words representation and SVM classifier for timber knots detection on color images
2015
Knots, as well as their density, have a huge impact on the mechanical properties of wood boards. This paper addresses the issue of their automatic detection. An image processing pipeline is developed that combines low-level processing (contrast enhancement, thresholding, mathematical morphology) with a bag-of-words approach. We propose an SVM classification based on features obtained by SURF descriptors on RGB images, followed by a dictionary created using the bag-of-words approach. Our method was tested on color images from two different datasets with a total of 640 knots. The mean recall (true positive) rate achieved was 92% and 97% for a single dictionary (built only on samples …
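The bag-of-words step mentioned in this abstract can be illustrated with a minimal sketch. It assumes the dictionary (the visual words) has already been built, for example by clustering training descriptors, which the abstract does not specify. The function names, the 2-D toy descriptors, and the Euclidean nearest-word rule are all hypothetical choices for illustration. The SURF extraction and the SVM itself are omitted.

```python
# Minimal bag-of-words quantization sketch (illustrative, not the paper's code).
# Assumption: `dictionary` is a precomputed list of visual words; real SURF
# descriptors have many more dimensions than the 2-D toy vectors used here.

def nearest_word(descriptor, dictionary):
    """Return the index of the dictionary word closest to the descriptor
    (squared Euclidean distance)."""
    return min(
        range(len(dictionary)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(descriptor, dictionary[i])),
    )

def bow_histogram(descriptors, dictionary):
    """Count how often each visual word is the nearest one, then normalize
    so the histogram sums to 1 (all zeros if there are no descriptors)."""
    counts = [0] * len(dictionary)
    for descriptor in descriptors:
        counts[nearest_word(descriptor, dictionary)] += 1
    total = sum(counts)
    if total == 0:
        return [0.0] * len(dictionary)
    return [c / total for c in counts]

# Toy usage: three visual words, four local descriptors from one image region.
dictionary = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
descriptors = [(0.1, 0.0), (0.9, 1.2), (1.1, 0.8), (4.0, 6.0)]
histogram = bow_histogram(descriptors, dictionary)
```

The resulting fixed-length histogram is the kind of feature vector that a classifier such as an SVM can consume, regardless of how many descriptors each region produced.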
Why is this an anomaly? Explaining anomalies using sequential explanations
2022
In most applications, anomaly detection operates in an unsupervised mode by looking for outliers, hoping that they are anomalies. Unfortunately, most anomaly detectors do not come with explanations about which features make a detected outlier point anomalous. Human analysts must therefore manually browse through each detected outlier point’s feature space to obtain the subset of features that will help them determine whether the point is genuinely anomalous or not. This paper introduces sequential explanation (SE) methods that sequentially explain to the analyst which features make the detected outlier anomalous. We present two methods for computing SEs called the outlier and…
Intelligent system for material quality control using impact-echo testing
2008
This paper introduces an intelligent system to discern the quality of materials inspected by the impact-echo technique. The system includes a hardware setup to inspect parallelepiped-shaped materials and a procedure to classify the material depending on its quality condition. Four levels of classification with different degrees of knowledge about the material defects are addressed: material condition, kind of defect, defect orientation, and defect dimension. The number of classes (material qualities) in the lowest classification level is 12. The procedure is applied to signals coming from 3D finite element simulations and lab experiments with aluminium specimens. The classification procedure…
Mining Interpretable Rules for Sentiment and Semantic Relation Analysis Using Tsetlin Machines
2020
Tsetlin Machines (TMs) are an interpretable pattern recognition approach that captures patterns with high discriminative power from data. Patterns are represented as conjunctive clauses in propositional logic, produced using bandit-learning in the form of Tsetlin Automata. In this work, we propose a TM-based approach to two common Natural Language Processing (NLP) tasks, viz. Sentiment Analysis and Semantic Relation Categorization. By performing frequent itemset mining on the patterns produced, we show that they follow existing expert-verified rule-sets or lexicons. Further, our comparison with other widely used machine learning techniques indicates that the TM approach helps maintain inter…
Detection of TV commercials
2004
This paper presents a system that labels TV shots either as commercial or program shots. The system uses two observations: logo presence and shot duration. These observations are modeled using HMMs, and a Viterbi decoder is finally used for shot labeling. The system has been tested on several hours of real video, achieving more than 99% correct labeling.
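The decoding step described in this abstract can be sketched as a small Viterbi decoder. This is a simplified illustration under stated assumptions, not the paper's system: it uses a single two-state HMM, binarizes shot duration into short/long, treats the two observations as independent given the state, and every probability value below is made up for the example.

```python
import math

STATES = ("program", "commercial")

# All probabilities are illustrative assumptions, not values from the paper.
START = {"program": 0.7, "commercial": 0.3}
TRANS = {
    "program": {"program": 0.9, "commercial": 0.1},
    "commercial": {"program": 0.2, "commercial": 0.8},
}
P_LOGO = {"program": 0.95, "commercial": 0.05}   # P(logo visible | state)
P_SHORT = {"program": 0.2, "commercial": 0.8}    # P(shot is short | state)

def emission(state, logo, short):
    """P(observation | state), assuming logo and duration are independent."""
    p_logo = P_LOGO[state] if logo else 1 - P_LOGO[state]
    p_short = P_SHORT[state] if short else 1 - P_SHORT[state]
    return p_logo * p_short

def viterbi(observations):
    """Return the most likely state sequence for a list of (logo, short) pairs.
    Scores are kept in log space to avoid underflow on long sequences."""
    if not observations:
        return []
    scores = {
        s: math.log(START[s]) + math.log(emission(s, *observations[0]))
        for s in STATES
    }
    backpointers = []
    for obs in observations[1:]:
        new_scores, pointers = {}, {}
        for s in STATES:
            # Best predecessor state for reaching s at this step.
            best_prev = max(STATES, key=lambda p: scores[p] + math.log(TRANS[p][s]))
            new_scores[s] = (
                scores[best_prev]
                + math.log(TRANS[best_prev][s])
                + math.log(emission(s, *obs))
            )
            pointers[s] = best_prev
        backpointers.append(pointers)
        scores = new_scores
    # Backtrack from the best final state.
    state = max(STATES, key=lambda s: scores[s])
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path

# Toy usage: each shot is (logo_present, is_short).
shots = [(True, False), (True, False), (False, True), (False, True), (True, False)]
labels = viterbi(shots)
```

Because transitions between states are penalized, the decoder smooths over isolated noisy observations instead of labeling each shot independently.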
Dynamic integration of multiple data mining techniques in a knowledge discovery management system
1999
One of the most important directions in the improvement of data mining and knowledge discovery is the integration of multiple classification techniques in an ensemble of classifiers. An integration technique should be able to estimate and select the most appropriate component classifiers from the ensemble. We present two variations of an advanced dynamic integration technique with two distance metrics. The technique is one variation of the stacked generalization method, with the assumption that each of the component classifiers is the best one inside a certain subarea of the entire domain area. Our technique includes two phases: the learning phase and the application phase. During the learnin…
Breast Ultra-Sound image segmentation: an optimization approach based on super-pixels and high-level descriptors
2015
Breast cancer is the second most common cancer and the leading cause of cancer death among women. Medical imaging has become an indispensable tool for its diagnosis and follow-up. During the last decade, the medical community has promoted incorporating Ultra-Sound (US) screening into the standard routine. The main reason for using US imaging is its capability to differentiate benign from malignant masses, when compared to other imaging techniques. The increasing usage of US imaging encourages the development of Computer Aided Diagnosis (CAD) systems applied to Breast Ultra-Sound (BUS) images. However, accurate delineations of the lesions and structures of the b…
Text Classification Using “Anti”-Bayesian Quantile Statistics-Based Classifiers
2016
The problem of Text Classification (TC) has been studied for decades, and this problem is particularly interesting because the features are derived from syntactic or semantic indicators, while the classification, in and of itself, is based on statistical Pattern Recognition (PR) strategies. Thus, all the recorded TC schemes work using the fundamental paradigm that once the statistical features are inferred from the syntactic/semantic indicators, the classifiers themselves are the well-established ones such as the Bayesian, the Naïve Bayesian, the SVM etc. and those that are neural or fuzzy. In this paper, we shall demonstrate that by virtue of the skewed distributions of the features, one …
Spam classification for online discussions
2010
Master's thesis in information and communication technology 2010 – Universitetet i Agder, Grimstad. Traditionally, spam message filtering systems are built by integrating content-based analysis technologies developed from experience in dealing with e-mail spam. Recently, a new style of information has appeared on the Internet, the social media platform, which also expands the space for Internet abusers. In this thesis, we not only evaluated the traditional content-based approaches to classify spam messages, but also investigated the possibility of integrating context-based technology with content-based approaches to classify spam messages. We built spam classifiers using Novelty detec…
Metadata record for: Comprehensive dataset of shotgun metagenomes from oxygen stratified freshwater lakes and ponds
2021
This dataset contains key characteristics about the data described in the Data Descriptor Comprehensive dataset of shotgun metagenomes from oxygen stratified freshwater lakes and ponds. Contents: 1. human readable metadata summary table in CSV format 2. machine readable metadata file in JSON format