Search results for " processing"
showing 10 items of 7549 documents
Extracting Semantic Knowledge from Unstructured Text Using Embedded Controlled Language
2016
Nowadays, most of the data on the Web is still in the form of unstructured text. Knowledge extraction from unstructured text is highly desirable but extremely challenging due to the inherent ambiguity of natural language. In this article, we present an architecture of an information extraction system based on the concept of Embedded Controlled Language that allows for extracting formal semantic knowledge from an unstructured text corpus. Moreover, the presented approach has a potential to support multilingual input and output.
Semantic retrieval: an approach to representing, searching and summarising text documents
2011
Nowadays, the internet is the major source of information for millions of people. There are many search tools available on the net but finding appropriate text information is still difficult. The retrieval efficiency of the presently used systems cannot be significantly improved: ‘bag of words’ interpretation causes losing semantics of texts. We applied the functional approach to represent English text documents. It allows taking into account semantic relations between words when indexing documents and use ordinary English sentences as queries to a search engine. The proposed retrieval mechanisms return only highly relevant documents. They make it possible to generate content-aware summarie…
Automatic building of a visual interface for content-based multiresolution retrieval of paleontology images
2001
In this article we present research work in the field of content-based image retrieval in large databases applied to the paleontology image database of the Universite´ de Bourgogne, Dijon, France, called ‘‘TRANS’TYFIPAL.’’ Our indexing method is based on multiresolution decomposition of database images using wavelets. For each family of paleontology images we try to find a model image that represents it. The K-means automatic classification algorithm divides the space of parameters into several clusters. A model image for each cluster is computed from the wavelet transform of each image of the cluster. Then a search tree is built to offer users a graphic interface for retrieving images. So …
SHREC 2020: Retrieval of digital surfaces with similar geometric reliefs
2020
Abstract This paper presents the methods that have participated in the SHREC’20 contest on retrieval of surface patches with similar geometric reliefs and the analysis of their performance over the benchmark created for this challenge. The goal of the context is to verify the possibility of retrieving 3D models only based on the reliefs that are present on their surface and to compare methods that are suitable for this task. This problem is related to many real world applications, such as the classification of cultural heritage goods or the analysis of different materials. To address this challenge, it is necessary to characterize the local ”geometric pattern” information, possibly forgetti…
Searching Silk Fabrics by Images Leveraging on Knowledge Graph and Domain Expert Rules
2021
The production of European silk textile is an endangered intangible cultural heritage. Digital tools can nowadays be developed to help preserving it, or even to make it more accessible for the public and the fashion industry. In this paper, we propose an image-based retrieval tool that leverages on a knowledge graph describing the silk textile production as well as rules formulated by experts of this domain. Out of several possible similarity scenarios, two have proven to work best and have been integrated into an exploratory search engine.
Towards a natural language-based interface for querying hospital data
2018
There is a growing necessity in various domains for non-programmers to be able to retrieve information gathered about the operation of the organization and stored in its databases. This information could hugely benefit the decision making process of the managers of the institution, but it is not often exploited due to the complexity of extracting the information from the existing data. In this paper we sketch a way how that information could be managed by the domain experts themselves by the means of a natural language-based query language that works upon data stored in the ontology. Our experiments show that the proposed approach is indeed easy-to-use by our target end-users - managers and…
A Semantic Layer on Semi-structured Data Sources for Intuitive Chatbots
2009
The main limits of chatbot technology are related to the building of their knowledge representation and to their rigid information retrieval and dialogue capabilities, usually based on simple "pattern matching rules". The analysis of distributional properties of words in a texts corpus allows the creation of semantic spaces where represent and compare natural language elements. This space can be interpreted as a "conceptual" space where the axes represent the latent primitive concepts of the analyzed corpus. The presented work aims at exploiting the properties of a data-driven semantic/conceptual space built using semi-structured data sources freely available on the web, like Wikipedia. Thi…
A Comparison of Language Identification Approaches on Short, Query-Style Texts
2010
In a multi-language Information Retrieval setting, the knowledge about the language of a user query is important for further processing. Hence, we compare the performance of some typical approaches for language detection on very short, query-style texts. The results show that already for single words an accuracy of more than 80% can be achieved, for slightly longer texts we even observed accuracy values close to 100%.
Enriching Didactic Similarity Measures of Concept Maps by a Deep Learning Based Approach
2021
Concept maps are significant tools able to support several tasks in the educational area such as curriculum design, knowledge organization and modeling, students' assessment and many others. They are also successfully used in learning activities in which students have to represent domain knowledge according to teacher's assignment. In this context, the development of Learning Analytics approaches would benefit of methods that automatically compare concept maps. Detecting concept maps similarities is relevant to identify how the same concepts are used in different knowledge representations. Algorithms for comparing graphs have been extensively studied in the literature, but they do not appea…
FrameNet CNL: A Knowledge Representation and Information Extraction Language
2014
The paper presents a FrameNet-based information extraction and knowledge representation framework, called FrameNet-CNL. The framework is used on natural language documents and represents the extracted knowledge in a tailor-made Frame-ontology from which unambiguous FrameNet-CNL paraphrase text can be generated automatically in multiple languages. This approach brings together the fields of information extraction and CNL, because a source text can be considered belonging to FrameNet-CNL, if information extraction parser produces the correct knowledge representation as a result. We describe a state-of-the-art information extraction parser used by a national news agency and speculate that Fram…