Search results for "computer.software_genre"
showing 10 items of 3858 documents
BIOfid dataset: publishing a German gold standard for named entity recognition in historical biodiversity literature
2019
The Specialized Information Service Biodiversity Research (BIOfid) has been launched to mobilize valuable biological data from printed literature hidden in German libraries for over the past 250 years. In this project, we annotate German texts converted by OCR from historical scientific literature on the biodiversity of plants, birds, moths and butterflies. Our work enables the automatic extraction of biological information previously buried in the mass of papers and volumes. For this purpose, we generated training data for the tasks of Named Entity Recognition (NER) and Taxa Recognition (TR) in biological documents. We use this data to train a number of leading machine learning tools and c…
Reproducing kernel hilbert spaces regression methods for genomic assisted prediction of quantitative traits.
2008
Abstract Reproducing kernel Hilbert spaces regression procedures for prediction of total genetic value for quantitative traits, which make use of phenotypic and genomic data simultaneously, are discussed from a theoretical perspective. It is argued that a nonparametric treatment may be needed for capturing the multiple and complex interactions potentially arising in whole-genome models, i.e., those based on thousands of single-nucleotide polymorphism (SNP) markers. After a review of reproducing kernel Hilbert spaces regression, it is shown that the statistical specification admits a standard mixed-effects linear model representation, with smoothing parameters treated as variance components.…
A Coclustering Approach for Mining Large Protein-Protein Interaction Networks
2012
Several approaches have been presented in the literature to cluster Protein-Protein Interaction (PPI) networks. They can be grouped in two main categories: those allowing a protein to participate in different clusters and those generating only nonoverlapping clusters. In both cases, a challenging task is to find a suitable compromise between the biological relevance of the results and a comprehensive coverage of the analyzed networks. Indeed, methods returning high accurate results are often able to cover only small parts of the input PPI network, especially when low-characterized networks are considered. We present a coclustering-based technique able to generate both overlapping and nonove…
The Application of Machine Learning Algorithms to the Analysis of Electromyographic Patterns From Arthritic Patients
2009
The main aim of our study was to investigate the possibility of applying machine learning techniques to the analysis of electromyographic patterns (EMG) collected from arthritic patients during gait. The EMG recordings were collected from the lower limbs of patients with arthritis and compared with those of healthy subjects (CO) with no musculoskeletal disorder. The study involved subjects suffering from two forms of arthritis, viz, rheumatoid arthritis (RA) and hip osteoarthritis (OA). The analysis of the data was plagued by two problems which frequently render the analysis of this type of data extremely difficult. One was the small number of human subjects that could be included in the in…
BED: A new dataset for EEG-based biometrics
2021
Various recent research works have focused on the use of electroencephalography (EEG) signals in the field of biometrics. However, advances in this area have somehow been limited by the absence of a common testbed that would make it possible to easily compare the performance of different proposals. In this work, we present a data set that has been specifically designed to allow researchers to attempt new biometric approaches that use EEG signals captured by using relatively inexpensive consumer-grade devices. The proposed data set has been made publicly accessible and can be downloaded from https://doi.org/10.5281/zenodo.4309471 . It contains EEG recordings and responses from 21 individuals…
A new method for morphometric analysis of opal phytoliths from plants.
2014
Micro-morphometry has substantially gained ground in the field of phytolith analysis, but the comparability of results is limited due to the use of different methods. This paper presents a new, user-friendly method based on open-source software (FIJI) that is proposed as a step towards the introduction of a standard method. After obtaining a mask of a phytolith by making a digital drawing, 27 commonly used variables of size and shape are measured automatically. This method is not only useful for phytolith analysis, but may also be used for other fields of morphometric research. Users can furthermore customize the software tool when additional variables are required.
Boosting Design Space Explorations with Existing or Automatically Learned Knowledge
2012
During development, processor architectures can be tuned and configured by many different parameters. For benchmarking, automatic design space explorations (DSEs) with heuristic algorithms are a helpful approach to find the best settings for these parameters according to multiple objectives, e.g. performance, energy consumption, or real-time constraints. But if the setup is slightly changed and a new DSE has to be performed, it will start from scratch, resulting in very long evaluation times. To reduce the evaluation times we extend the NSGA-II algorithm in this article, such that automatic DSEs can be supported with a set of transformation rules defined in a highly readable format, the fuz…
Evaluation of Record Linkage Methods for Iterative Insertions
2009
Summary Objectives: There have been many developments and applications of mathematical methods in the context of record linkage as one area of interdisciplinary research efforts. However, comparative evaluations of record linkage methods are still underrepresented. In this paper improvements of the Fellegi-Sunter model are compared with other elaborated classification methods in order to direct further research endeavors to the most promising methodologies. Methods: The task of linking records can be viewed as a special form of object identification. We consider several non-stochastic methods and procedures for the record linkage task in addition to the Fellegi-Sunter model and perform an e…
Bagging and Boosting with Dynamic Integration of Classifiers
2000
One approach in classification tasks is to use machine learning techniques to derive classifiers using learning instances. The co-operation of several base classifiers as a decision committee has succeeded to reduce classification error. The main current decision committee learning approaches boosting and bagging use resampling with the training set and they can be used with different machine learning techniques which derive base classifiers. Boosting uses a kind of weighted voting and bagging uses equal weight voting as a combining method. Both do not take into account the local aspects that the base classifiers may have inside the problem space. We have proposed a dynamic integration tech…
Real-time flaw detection on a complex object: comparison of results using classification with a support vector machine, boosting, and hyperrectangle-…
2006
We present a classification work performed on industrial parts using artificial vision, a support vector machine (SVM), boost- ing, and a combination of classifiers. The object to be controlled is a coated heater used in television sets. Our project consists of detect- ing anomalies under manufacturer production, as well as in classi- fying the anomalies among 20 listed categories. Manufacturer speci- fications require a minimum of ten inspections per second without a decrease in the quality of the produced parts. This problem is ad- dressed by using a classification system relying on real-time ma- chine vision. To fulfill both real-time and quality constraints, three classification algorit…