Search results for "Trie"
showing 10 items of 4468 documents
A systematic analysis of duplicate records in Scopus
2015
In recent years, the Web of Science Core Collection and Scopus databases have become primary sources for conducting studies that evaluate scientific investigations. Such studies require that duplicate records be excluded to avoid errors of overrepresentation. In this line, we identify duplicate records in Scopus and examine their origins. Identifying journals with duplicate records in Scopus, selecting and downloading bibliographic journal records, and identifying and analyzing the duplicate records is the methodology adopted. Duplicate records are found when articles published in a journal are incorrectly mapped by Scopus to this journal and to a different journal from the same publisher a…
Aligning Relational Schema and OWL Ontologies with Hidden Markov Model
2016
The problem of bridging the gap between relational schema and ontologies is actively investigated in the Semantic Web and business communities. The main motivations are the OBDA scenario, where a domain ontology allows to hidden the technical details of the db to end-users; and the persistent storage of ontologies in db for facilitating search and retrieval keeping the benefits of DBMSs such as security and integrity. In these cases, the ABox is usually stored into a db, and the TBox is maintained in an ontology; for this reason, schema alignment is a more significant problem than the instance matching one. The use of manual mappings is hard and expensive, especially for large representatio…
Context-sensitive text mining with fitness leveling Genetic Algorithm
2015
Contextual processing is a great challenge for information retrieval study - the most approved techniques include scanning content of HTML web pages, user supported metadata analysis, automatic inference grounded on knowledge base, or content-oriented digital documents analysis. We propose a meta-heuristic by making use of Genetic Algorithms for Contextual Search (GACS) built on genetic programming (GP) and custom fitness leveling function to optimize contextual queries in exact search that represents unstructured phrases generated by the user. Our findings show that the queries built with GACS can significantly optimize the retrieval process.
On Keyword-Based Ad-Hoc Querying of Hospital Data Stored in Semistar Data Ontologies
2018
Abstract This paper sketches a possible solution to the problem of the currently growing necessity in various domains for domain experts to be able to query the database of the organization in a convenient manner. The paper focuses on the domain of hospital management where the normal practice is to involve a programmer as an intermediary between the managers and the database. This is an error-prone and cumbersome solution. The decision-making process of domain experts would hugely benefit if they could retrieve the information from the database themselves. There have been attempts to develop natural language-based query languages for this exact purpose, but the ultimate goal of the simplic…
A framework for context-sensitive metadata description
2006
Expectations regarding the new generation of Web depend on the success of Semantic Web technology. Resource Description Framework (RDF) is a basis for explicit and machine-readable representation of semantics. However RDF is not suitable for describing dynamic and context-sensitive resources (eg. processes). We present the Context Description Framework (CDF) as an extension of the RDF by adding a 'TrueInContext' component to the basic RDF triple ('subject-predicate-object'), and consider contextual value as a container of RDF statements. We also add a probabilistic component, which allows multilevel contextual dependence descriptions as well as presumes possibility for Bayesian reasoning wi…
From decoding a graph to processing a multimodal message: Interacting with data visualization in the news media
2020
Abstract Data visualisation – in the forms of graphs, charts, and maps – represents a text type growing in prevalence and impact in many cultural domains; education, journalism, business, PR, and more. Research on data visualisation reception is scarce, particularly that related to interactive and dynamic forms of data visualisation in digital media. Taking an approach inspired by grounded theory, in this article I investigate the ways in which young students interact with data visualisations found in digital news media. Combining observations from reading sessions with ten in-depth interviews, I investigate how the informants read, interpreted, and responded emotionally to data visualisati…
<title>Combining multiple image descriptions for browsing and retrieval</title>
2000
Retrieving images form large collections using image content is an important problem, in this multimedia age. A quick content-based visual access to the stored image is capital for efficient navigation through image collections. In this paper we introduce several techniques which characterize color homogeneous object and their spatial relationships for efficient content-based image retrieval. We present a region growing technique for efficient color homogeneous objects segmentation and extend the 2D string to an accurate description of spatial information and relationships. In order to improve content-based image retrieval, our method emphasized several objectives, such as: automated extrac…
Content Code Blurring: A New Approach to Content Extraction
2008
Most HTML documents on the world wide web contain far more than the article or text which forms their main content. Navigation menus, functional and design elements or commercial banners are typical examples of additional contents. Content extraction is the process of identifying the main content and/or removing the additional contents. We introduce content code blurring, a novel content extraction algorithm. As the main text content is typically a long, homogeneously formatted region in a web document, the aim is to identify exactly these regions in an iterative process. Comparing its performance with existing content extraction solutions we show thatfor most documents content code blurrin…
Semantic web service discovery system for road traffic information services
2015
Create a multi-agent platform for a traveller information system (FIPA standards).Extend Paulucci algorithm with the use of seven similarity measures.Weight the similarity measure according to semantic relation and parameter nature.Improved running-time with a filtering pre-process for non-functional parameters.Improved the recall by measuring the sibling relationship concepts. We describe a multi-agent platform for a traveller information system, allowing travellers to find the road traffic information web service (WSs) that best fits their requirements. After studying existing proposals for discovery of semantic WS, we implemented a hybrid matching algorithm, which is described in detail …
Semantic Portal for Legislative Information
2006
Semantic portals enabled by Semantic Web technologies have been suggested to provide a point of access to an integrated body of information about some domain. In the area of e-Government there are multiple possible domains for semantic portals, one of them being legislative work. In this paper we propose a semantic portal based on a rich metadata repository to support the retrieval of legislative information. The portal provides process oriented semantic browsing capabilities. A prototype of the portal has been implemented for the retrieval of Finnish legislative information.