Search results for "Knowledge extraction"
showing 10 items of 58 documents
Reactome graph database: Efficient access to complex pathway data
2018
Reactome is a free, open-source, open-data, curated and peer-reviewed knowledgebase of biomolecular pathways. One of its main priorities is to provide easy and efficient access to its high quality curated data. At present, biological pathway databases typically store their contents in relational databases. This limits access efficiency because there are performance issues associated with queries traversing highly interconnected data. The same data in a graph database can be queried more efficiently. Here we present the rationale behind the adoption of a graph database (Neo4j) as well as the new ContentService (REST API) that provides access to these data. The Neo4j graph database and its qu…
Knowledge Discovery from Network Logs
2015
Modern communications networks are complex systems, which facilitates malicious behavior. Dynamic web services are vulnerable to unknown intrusions, but traditional cyber security measures are based on fingerprinting. Anomaly detection differs from fingerprinting in that it finds events that differ from the baseline traffic. The anomaly detection methodology can be modelled with the knowledge discovery process. Knowledge discovery is a high-level term for the whole process of deriving actionable knowledge from databases. This article presents the theory behind this approach, and showcases research that has produced network log analysis tools and methods. peerReviewed
Motivic Pattern Extraction in Symbolic Domain
2008
This chapter offers an overview of computational research in motivic pattern extraction. The central questions underlying the topic, concerning the formalization of the motivic structures, the matching strategies and the filtering of the results, have been addressed in various ways. A detailed analysis of these problems leads to the proposal of a new methodology, which will be developed throughout the study. One main conclusion of this review is that the problems cannot be tackled using purely mathematic or geometric heuristics or classical engineering tools, but require also a detailed understanding of the multiple constraints derived by the underlying cognitive context.
Intrusion Detection with Interpretable Rules Generated Using the Tsetlin Machine
2020
The rapid deployment in information and communication technologies and internet-based services have made anomaly based network intrusion detection ever so important for safeguarding systems from novel attack vectors. To this date, various machine learning mechanisms have been considered to build intrusion detection systems. However, achieving an acceptable level of classification accuracy while preserving the interpretability of the classification has always been a challenge. In this paper, we propose an efficient anomaly based intrusion detection mechanism based on the Tsetlin Machine (TM). We have evaluated the proposed mechanism over the Knowledge Discovery and Data Mining 1999 (KDD’99) …
Discovering representative models in large time series databases
2004
The discovery of frequently occurring patterns in a time series could be important in several application contexts. As an example, the analysis of frequent patterns in biomedical observations could allow to perform diagnosis and/or prognosis. Moreover, the efficient discovery of frequent patterns may play an important role in several data mining tasks such as association rule discovery, clustering and classification. However, in order to identify interesting repetitions, it is necessary to allow errors in the matching patterns; in this context, it is difficult to select one pattern particularly suited to represent the set of similar ones, whereas modelling this set with a single model could…
An introduction to knowledge computing
2014
This paper deals with the challenges related to self-management and evolution of massive knowledge collections. We can assume that a self-managed knowledge graph needs a kind of a hybrid of: an explicit declarative self-knowledge (as knowledge about own properties and capabilities) and an explicit procedural self-knowledge (as knowledge on how to utilize own properties and the capabilities for the self-management).We offer an extension to a traditional RDF model of describing knowledge graphs according to the Semantic Web standards so that it will also allow to a knowledge entity to autonomously perform or query from remote services different computational executions needed. We also introdu…
Analysis of ventricular fibrillation signals using feature selection methods
2012
Feature selection methods in machine learning models are a powerful tool to knowledge extraction. In this work they are used to analyse the intrinsic modifications of cardiac response during ventricular fibrillation due to physical exercise. The data used are two sets of registers from isolated rabbit hearts: control (G1: without physical training), and trained (G2). Four parameters were extracted (dominant frequency, normalized energy, regularity index and number of occurrences). From them, 18 features were extracted. This work analyses the relevance of each feature to classify the records in G1 and G2 using Logistic Regression, Multilayer Perceptron and Extreme Learning Machine. Three fea…
Towards MKDA: A Knowledge Discovery Assistant for Researches in Medicine
2007
Nowadays doctors are generating a huge amount of raw data. These data, analyzed with data mining techniques, could be sources of new knowledge. Unluckily such tasks need skilled data analysts, and not so much researchers in Medicine are also data mining experts. In this paper we present a web based system for knowledge discovery assistance in Medicine able to advice a medical researcher in this kind of tasks. The user must define only the experiment specifications in a formal language we have defined. The system GUI helps users in their composition. Then the system plans a Knowledge Discovery Process (KDP) on the basis of rules in a knowledge base. Finally the system executes the KDP and pr…
Knowledge Acquisition Based on Semantic Balance of Internal and External Knowledge
1999
This paper presents a strategy to handle incomplete knowledge during acquisition process. The goal of this research is to develop formal tools that benefit the law of semantic balance. The assumption is used that a situation inside the object’s boundary in some world should be in balance with a situation outside it. It means that continuous cognition of an object aspires to a complete knowledge about it and knowledge about internal structure of the object will be in balance with knowledge about relationships of the object with other objects in its environment. It is supposed that one way to discover incompleteness of knowledge about some object is to measure and compare knowledge about its …
<title>Dynamic integration of multiple data mining techniques in a knowledge discovery management system</title>
1999
One of the most important directions in improvement of data mining and knowledge discovery, is the integration of multiple classification techniques of an ensemble of classifiers. An integration technique should be able to estimate and select the most appropriate component classifiers from the ensemble. We present two variations of an advanced dynamic integration technique with two distance metrics. The technique is one variation of the stacked generalization method, with an assumption that each of the component classifiers is the best one, inside a certain sub area of the entire domain area. Our technique includes two phases: the learning phase and the application phase. During the learnin…