Search results for "Semantic similarity"
showing 10 items of 38 documents
Ontology-based service matching and discovery
2011
In this paper we consider ontologies as knowledge structures that specify attributes of services, their properties and relations among them to enable finding semantic similarity between service descriptions and service requests. Ontologies reflect semantic relationships between concepts represented by attributes in service descriptions and service requests. We use knowledge from ontologies to enhance both user service requests and service descriptions by adding concepts that are not present in the original descriptions, and use them in the comparison process. This results in more precise matching since we also consider implicit concepts. Thus services and requests that do not contain exact m…
A Semantic Similarity Measure for the SIMS Framework
2008
The amount of currently available digital information grows rapidly. Relevant information is often spread over different information sources. An efficient and flexible framework to allow users to satisfy effectively their information needs is required. The work presented in this paper describes SIMS (Semantic Information Management System), a reference architecture for a framework performing semantic annotation, search and retrieval of information from multiple sources. The work presented in this paper focuses on a specific SIMS module, the SIMS Semantic Content Navigator, proposing an algorithm and the related implementation to calculate a semantic similarity measure inside an OWL …
An A* Based Semantic Tokenizer for Increasing the Performance of Semantic Applications
2013
Semantic Applications (SAs) make use of ontologies and their performance can depend on the syntactic labels of the modeled entities; even if several approaches have been devised to formalize ontologies, no formal approaches have been devised for naming their constituents, which look like long word concatenations without any particular separation. We present a novel semantic tokenizer that finds the sub-words through an application of the A* based search algorithm; the A* functions rely on a set of linguistic criteria and on the meta-cognitive perspective of the activity of reading.
An ontology-based retrieval system for mammographic reports
2015
In the healthcare domain it can be useful to compare unstructured free-text clinical reports in order to enable the search for similar and/or relevant clinical cases. In data mining and text analysis tasks, cosine similarity is usually used for text comparison purposes. It is usually performed by computing the standard document vector cosine similarity between the two vectors representing the report pair under analysis. In this paper a novel system based on text pre-processing techniques and modelled medical knowledge, using an improved radiological ontology, is proposed. Medical terms organized in a hierarchical tree can assess semantic similarity relationships between unstructured repo…
A Combined Fuzzy Semantic Similarity Measure In Owl Ontologies
2008
An algorithm is presented in this paper to calculate a semantic similarity measure inside an OWL ontology. The formulation is based on a combined measure taking into account the two most important aspects involved in the similarity computation. These are the structural properties of a concept, and the information content inside the ontology. We define a fuzzy system to blend these information sources with a training process over some ontologies. Finding a similarity measure between concepts of an ontology is a fundamental topic to accomplish information exchange on the Web. Through this measure it is possible to perform sophisticated queries over the web where the user is able to request co…
Automatic Illustration of Short Texts via Web Images
2015
In this paper we propose a totally unsupervised and automatic illustration method, which aims to find on the Web a set of images to illustrate the content of an input short text. The text is modelled as a semantic space and a set of relevant keywords is extracted. We compare and discuss different methods to create semantic representations by keyword extraction. Keywords are used to query the Google Image Search engine for a list of relevant images. We also extract information from the Web pages that include the retrieved images, to create an Image Semantic Space, which is compared to the Text Semantic Space in order to rank the list of retrieved images. Tests showed that our method achieves v…
Syntagmatic and Paradigmatic Associations in Information Retrieval
2003
It is shown that unconscious associative processes taking place in the memory of a searcher during the formulation of a search query in information retrieval — such as the production of free word associations and the generation of synonyms — can be simulated using statistical models that analyze the distribution of words in large text corpora. The free word associations as produced by subjects on presentation of stimulus words can be predicted by applying first-order statistics to the frequencies of word co-occurrences as observed in texts. The generation of synonyms can also be conducted on co-occurrence data but requires second-order statistics. Both approaches are compared and validated …
Learning Similarity Scores by Using a Family of Distance Functions in Multiple Feature Spaces
2017
There exist a large number of distance functions that allow one to measure similarity between feature vectors and thus can be used for ranking purposes. When multiple representations of the same object are available, distances in each representation space may be combined to produce a single similarity score. In this paper, we present a method to build such a similarity ranking out of a family of distance functions. Unlike other approaches that aim to select the best distance function for a particular context, we use several distances and combine them in a convenient way. To this end, we adopt a classical similarity learning approach and face the problem as a standard supervised machine lea…
Toward Approximate GML Retrieval Based on Structural and Semantic Characteristics
2010
GML is emerging as the new standard for representing geographic information in GISs on the Web, allowing the encoding of structurally and semantically rich geographic data in self describing XML-based geographic entities. In this study, we address the problem of approximate querying and ranked results for GML data and provide a method for GML query evaluation. Our method consists of two main contributions. First, we propose a tree model for representing GML queries and data collections. Then, we introduce a GML retrieval method based on the concept of tree edit distance as an efficient means for comparing semi-structured data. Our approach allows the evaluation of bo…
Qualifying semantic graphs using model checking
2011
Semantic interoperability problems have found their solutions using languages and techniques from the Semantic Web. The proliferation of ontologies and meta-information has improved the understanding of information and the relevance of search engine responses. However, the construction of semantic graphs is a source of numerous errors of interpretation or modeling and scalability remains a major problem. The processing of large semantic graphs is a limit to the use of semantics in current information systems. The work presented in this paper is part of new research at the border of two areas: the Semantic Web and model checking. This line of research concerns t…