Search results for "computer.software_genre"
showing 10 items of 3858 documents
Unifying Access to Heterogeneous Document Databases through Contextual Metadata
2011
Document databases available on the Internet carry massive information resources. For a person needing a piece of information on a specific domain, however, finding that piece is often quite problematic, even when a representative collection of databases is available on the domain. The languages used in the content, the names of document types, their structures, the ways documents are organized and their retrieval techniques often vary across the databases. The databases containing legal information on the Internet offer a typical example. For finding relevant documents and for being able to interpret the content of the documents correctly, the user may need information about the contex…
Automated Creation of Expert Systems with the InteKRator Toolbox
2021
Expert systems have a long tradition in both medical informatics and artificial intelligence research. Traditionally, such systems are created by implementing knowledge provided by experts in a system that can be queried for answers. To automatically generate such knowledge directly from data, the lightweight InteKRator toolbox will be introduced here, which combines knowledge representation and machine learning approaches. The learned knowledge is represented in the form of rules with exceptions that can be inspected and that are easily comprehensible. An inference module allows for the efficient answering of queries, while at the same time offering the possibility of providing explanation…
A Semantic Layer on Semi-structured Data Sources for Intuitive Chatbots
2009
The main limits of chatbot technology are related to the building of their knowledge representation and to their rigid information retrieval and dialogue capabilities, usually based on simple "pattern matching rules". The analysis of distributional properties of words in a text corpus allows the creation of semantic spaces in which natural language elements can be represented and compared. This space can be interpreted as a "conceptual" space where the axes represent the latent primitive concepts of the analyzed corpus. The presented work aims at exploiting the properties of a data-driven semantic/conceptual space built using semi-structured data sources freely available on the web, like Wikipedia. Thi…
A Comparison of Language Identification Approaches on Short, Query-Style Texts
2010
In a multi-language Information Retrieval setting, knowledge of the language of a user query is important for further processing. Hence, we compare the performance of some typical approaches for language detection on very short, query-style texts. The results show that an accuracy of more than 80% can already be achieved for single words; for slightly longer texts we even observed accuracy values close to 100%.
Comparing Translation and Post-editing: An Annotation Schema for Activity Units
2016
The current chapter introduces an annotation schema of TPR data that categorises post-editing behaviour into five different classes and compares general-language and domain-specific English-to-German translation and post-editing with respect to production times, key-logging (text production activity and text elimination activity) and eye-tracking data (total reading times on source text and on target text). The results support the hypothesis that post-editing is faster than translation from scratch for both domain-specific and non-domain-specific text types. When key-logging and eye-tracking data are taken into consideration, domain-specific texts require more effort when translating from s…
Meter for the Quantitative Analysis of Newspaper Sport Material
2016
This article presents a meter for the quantitative analysis of newspaper sport material. The meter makes it possible to measure and classify newspaper sport material in detail. The meter has three levels. The selected level depends on the research purpose and desired measurement accuracy. Measurement can focus on a certain level, or all levels can be used together. Individual variables can also be utilized at a certain level. The three levels with respective level units of observation are: 1) articles, photos, and graphics; 2) sets of articles; and 3) sets of data materials. The use of each level is presented in the article. The article also contains a summary of the newspaper sport materia…
A P2P Architecture for Multimedia Content Retrieval
2006
The retrieval facilities of most Peer-to-Peer (P2P) systems are limited to queries based on unique identifiers or small sets of keywords. This approach can be highly labor-intensive and inconsistent. In this paper we investigate a scenario where a huge amount of multimedia resources are shared in a P2P network, by means of efficient content-based image and video retrieval functionalities. The challenge in such systems is to limit the number of sent messages, maximizing the usefulness of each peer contacted in the query process. We achieve this goal by the adoption of a novel algorithm for routing user queries. The proposed approach exploits compact representations of multimedia resources sh…
A Taxonomy as a Vehicle for Learning
2009
In this article, we describe the development of a classification system providing a framework for analysis of, and communication about, a subgroup of learning objects. The objects we consider are highly visual, animated, interactive, and mathematics-related, and we call them VaniMaps. Secondly, we discuss the use of the system. In the first phase, the development was based on literature studies and discussions on examples of VaniMaps. In the second phase, the classification system was tested by students and their responses were analyzed to identify possible improvements. Now, the system is developed further based on experience gained while using it for different purposes. We see several pos…
A NSGA Based Approach for Content Based Image Retrieval
2013
The purpose of Content Based Image Retrieval (CBIR) systems is to allow users to retrieve pictures related to a semantic concept of their interest, when no other information but the images themselves is available. Commonly, a series of images is presented to the user, who judges their relevance. Several different models have been proposed to help the construction of interactive systems based on relevance feedback. Some of these models consider that an optimal query point exists, and focus on adapting the similarity measure and moving the query point so that it appears close to the relevant results and far from those which are non-relevant. This implies a strong causality between the low l…
An Extended Data Object-driven Approach to Data Quality Evaluation: Contextual Data Quality Analysis
2019
This research is an extension of a data object-driven approach to data quality evaluation that allows data object quality to be analysed in the scope of multiple data objects. The previously presented approach was used to analyse one particular data object, mainly focusing on syntactic analysis. The extension means that the primary data object quality can be analysed against an unlimited number of secondary data objects. This opportunity allows a more comprehensive, in-depth contextual data object analysis. The given analysis was applied to open data sets, comparing previously obtained results with the results of applying the extended approach, underlining the importance and benefits of the given exten…