Search results for "Data mining"
showing 10 items of 907 documents
A text based indexing system for mammographic image retrieval and classification
2014
Abstract In modern medical systems huge amount of text, words, images and videos are produced and stored in ad hoc databases. Medical community needs to extract precise information from that large amount of data. Currently ICT approaches do not provide a methodology for content-based medical images retrieval and classification. On the other hand, from the Internet of Things (IoT) perspective, the ICT medical data can be produced by several devices. Produced data complies with all Big Data features and constraints. The IoT guidelines put at the center of the system a new smart software to manage and transform Big Data in a new understanding form. This paper describes a text based indexing sy…
Aligning Relational Schema and OWL Ontologies with Hidden Markov Model
2016
The problem of bridging the gap between relational schema and ontologies is actively investigated in the Semantic Web and business communities. The main motivations are the OBDA scenario, where a domain ontology allows to hidden the technical details of the db to end-users; and the persistent storage of ontologies in db for facilitating search and retrieval keeping the benefits of DBMSs such as security and integrity. In these cases, the ABox is usually stored into a db, and the TBox is maintained in an ontology; for this reason, schema alignment is a more significant problem than the instance matching one. The use of manual mappings is hard and expensive, especially for large representatio…
Semantic web service discovery system for road traffic information services
2015
Create a multi-agent platform for a traveller information system (FIPA standards).Extend Paulucci algorithm with the use of seven similarity measures.Weight the similarity measure according to semantic relation and parameter nature.Improved running-time with a filtering pre-process for non-functional parameters.Improved the recall by measuring the sibling relationship concepts. We describe a multi-agent platform for a traveller information system, allowing travellers to find the road traffic information web service (WSs) that best fits their requirements. After studying existing proposals for discovery of semantic WS, we implemented a hybrid matching algorithm, which is described in detail …
Combining content extraction heuristics
2008
The main text content of an HTML document on the WWW is typically surrounded by additional contents, such as navigation menus, advertisements, link lists or design elements. Content Extraction (CE) is the task to identify and extract the main content. Ongoing research has spawned several CE heuristics of different quality. However, so far only the Crunch framework combines several heuristics to improve its overall CE performance. Since Crunch, though, many new algorithms have been formulated. The CombinE system is designed to test, evaluate and optimise combinations of CE heuristics. Its aim is to develop CE systems which yield better and more reliable extracts of the main content of a web …
Heuristic Method to Improve Systematic Collection of Terminology
2016
In this paper, we propose an experimental tool for analysis and graphical representation of glossaries. The original heuristic algorithms and analysis methods incorporated into the tool appeared to be useful to improve the quality of the glossaries. The tool was used for analysis of ISTQB Standard Glossary of Terms Used in Software Testing. There are instances of problems found in ISTQB glossary related to its consistency, completeness, and correctness described in the paper.
A NSGA Based Approach for Content Based Image Retrieval
2013
The purpose of CBIR Content Based Image Retrieval systems is to allow users to retrieve pictures related to a semantic concept of their interest, when no other information but the images themselves is available. Commonly, a series of images are presented to the user, who judges on their relevance. Several different models have been proposed to help the construction of interactive systems based on relevance feedback. Some of these models consider that an optimal query point exists, and focus on adapting the similarity measure and moving the query point so that it appears close to the relevant results and far from those which are non-relevant. This implies a strong causality between the low l…
Export of Relational Databases to RDF Databases by Model Transformations
2011
The Semantic Web is a Web of Data. To fulfill this web with data we need methods how to transfer business data from existing relational databases. In most cases, textual mapping languages are used for the specification of correspondences between relational DB schema and OWL ontology. These languages generally are rather awkward and not well-suited for the specification of mappings in cases when there is a substantial semantic gap between the source ER schema and the target OWL ontology. At the same time specification of mappings is a classical use case for graphical model transformation languages. In our previous work [10] we have proposed a new, model transformation-based method for the sp…
A Semantic Model to Query Spatial–Temporal Data
2013
There is a growing need for the study of spatial–temporal objects and their relationships. A common approach for this task is the use of relational databases, which unfortunately do not allow inference. In this research, we introduce a new approach that uses the concept of a “continuum” together with ontologies and semantic Web technologies. The continuum allows us to define parent–child relationships between representations of objects. It also allows us to compare the evolution of two different objects and establish the relationships between them along time. Our approach is based on the four-dimensional fluent, which is extended to obtain spatial–temporal qualitative information from the a…
Text mining of biomedical literature: doing well, but we could be doing better.
2015
Information Decomposition in Multivariate Systems: Definitions, Implementation and Application to Cardiovascular Networks
2016
The continuously growing framework of information dynamics encompasses a set of tools, rooted in information theory and statistical physics, which allow to quantify different aspects of the statistical structure of multivariate processes reflecting the temporal dynamics of complex networks. Building on the most recent developments in this field, this work designs a complete approach to dissect the information carried by the target of a network of multiple interacting systems into the new information produced by the system, the information stored in the system, and the information transferred to it from the other systems; information storage and transfer are then further decomposed into amou…