Search results for " similarity"
showing 10 items of 126 documents
Alignment free Dissimilarities for sequence classification
2015
One way to represent a DNA sequence is to break it down into substrings of length L, called L-tuples, and count the occurrence of each L-tuple in the sequence. This representation maps a sequence to a numerical feature vector of fixed length, which allows sequence similarity to be measured in an alignment-free way simply by applying dissimilarity functions between vectors. This work presents a benchmark study of 4 alignment-free dissimilarity functions between sequences, computed on their L-tuple representation, for the purpose of sequence classification. In our experiments, we have tested the classes of geometric-based, correlation-based and information-based …
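The L-tuple representation described in this abstract can be sketched in a few lines. The snippet below is only an illustration under stated assumptions: the function names `ltuple_counts` and `euclidean` are hypothetical, the alphabet is assumed to be `ACGT`, and Euclidean distance is used as one example of a geometric-based dissimilarity, not necessarily one of the four functions benchmarked in the paper.

```python
from collections import Counter
from itertools import product
import math


def ltuple_counts(seq, L, alphabet="ACGT"):
    """Map a sequence to a fixed-length vector of overlapping L-tuple counts.

    The vector has len(alphabet) ** L entries, ordered lexicographically,
    so vectors from different sequences are directly comparable.
    """
    counts = Counter(seq[i:i + L] for i in range(len(seq) - L + 1))
    return [counts["".join(t)] for t in product(alphabet, repeat=L)]


def euclidean(u, v):
    """Euclidean distance between two count vectors (illustrative choice)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


# Hypothetical toy sequences; no alignment is computed at any point.
vec_a = ltuple_counts("ACGTACGT", 2)
vec_b = ltuple_counts("ACGTTTTT", 2)
distance = euclidean(vec_a, vec_b)
```

Because every sequence is mapped to a vector of the same length, any vector dissimilarity can be swapped in for `euclidean` without changing the representation step.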
A Semantic Similarity Measure for the SIMS Framework
2008
The amount of currently available digital information grows rapidly. Relevant information is often spread over different information sources. An efficient and flexible framework to allow users to satisfy effectively their information needs is required. The work presented in this paper describes SIMS (Semantic Information Management System), a reference architecture for a framework performing semantic annotation, search and retrieval of information from multiple sources. The work presented in this paper focuses on a specific SIMS module, the SIMS Semantic Content Navigator, proposing an algorithm and the related implementation to calculate a semantic similarity measure inside an OWL …
An A* Based Semantic Tokenizer for Increasing the Performance of Semantic Applications
2013
Semantic Applications (SAs) make use of ontologies, and their performance can depend on the syntactic labels of the modeled entities. Although several approaches have been devised to formalize ontologies, no formal approaches have been devised for naming their constituents, which appear as long word concatenations without any particular separation. We present a novel semantic tokenizer that finds the sub-words through an application of the A*-based search algorithm; the A* functions rely on a set of linguistic criteria and on the meta-cognitive perspective of the activity of reading.
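To make the idea of splitting a concatenated label with A* concrete, here is a minimal sketch. It does not reproduce the paper's linguistic criteria or its meta-cognitive functions, which the abstract does not specify. As stand-in assumptions, the cost is simply the number of sub-words, the heuristic is the remaining length divided by the longest vocabulary word (rounded up), and `astar_tokenize` and the toy vocabulary are hypothetical.

```python
import heapq


def astar_tokenize(label, vocabulary):
    """Split a concatenated label into the fewest vocabulary words using A*.

    State: a position in the label. Edge: a vocabulary word starting there.
    Returns a list of words, or None if no segmentation exists.
    """
    if not vocabulary:
        return None
    label = label.lower()
    n = len(label)
    max_len = max(len(w) for w in vocabulary)

    def heuristic(pos):
        # Admissible lower bound: each word covers at most max_len characters.
        return -(-(n - pos) // max_len)  # ceiling division

    frontier = [(heuristic(0), 0, 0, [])]  # (f, g, position, words so far)
    best_g = {0: 0}
    while frontier:
        _, g, pos, words = heapq.heappop(frontier)
        if pos == n:
            return words
        if g > best_g.get(pos, float("inf")):
            continue  # stale queue entry
        for end in range(pos + 1, min(n, pos + max_len) + 1):
            word = label[pos:end]
            if word in vocabulary and g + 1 < best_g.get(end, float("inf")):
                best_g[end] = g + 1
                heapq.heappush(
                    frontier,
                    (g + 1 + heuristic(end), g + 1, end, words + [word]),
                )
    return None


# Hypothetical ontology label and toy vocabulary.
tokens = astar_tokenize("hasSubClass", {"has", "sub", "class", "subclass", "ha"})
```

A richer cost function (for example, one that scores word frequency or morphological plausibility) could replace the unit cost per word, provided the heuristic remains a lower bound on it.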
An ontology-based retrieval system for mammographic reports
2015
In the healthcare domain it can be useful to compare unstructured free-text clinical reports in order to enable the search for similar and/or relevant clinical cases. In data mining and text analysis tasks, cosine similarity is usually used for text comparison. It is usually computed as the standard document vector cosine similarity between the two vectors representing the report pair under analysis. In this paper a novel system based on text pre-processing techniques and modelled medical knowledge, using an improved radiological ontology, is proposed. Medical terms organized in a hierarchical tree can assess semantic similarity relationships between unstructured repo…
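The baseline mentioned in this abstract, cosine similarity between document vectors, can be illustrated as follows. This is only a sketch of the standard measure, not of the ontology-based system the paper proposes. The function name `cosine_similarity`, the whitespace tokenization, and the raw term counts are assumptions made for the example.

```python
from collections import Counter
import math


def cosine_similarity(text_a, text_b):
    """Cosine similarity between raw term-count vectors of two texts.

    Tokenization is a plain lowercase whitespace split (an assumption);
    real report comparison would need proper pre-processing.
    """
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text has no direction to compare
    return dot / (norm_a * norm_b)


# Hypothetical report fragments.
score = cosine_similarity(
    "no suspicious mass in left breast",
    "suspicious mass in right breast",
)
```

Note that this measure treats "left" and "right" as unrelated tokens, which is the kind of limitation that motivates adding ontology-derived term relationships.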
A Combined Fuzzy Semantic Similarity Measure In Owl Ontologies
2008
An algorithm is presented in this paper to calculate a semantic similarity measure inside an OWL ontology. The formulation is based on a combined measure taking into account the two most important aspects involved in the similarity computation. These are the structural properties of a concept, and the information content inside the ontology. We define a fuzzy system to blend these information sources with a training process over some ontologies. Finding a similarity measure between concepts of an ontology is a fundamental topic to accomplish information exchange on the Web. Through this measure it is possible to perform sophisticated queries over the web where the user is able to request co…
Automatic Illustration of Short Texts via Web Images
2015
In this paper we propose a totally unsupervised and automatic illustration method, which aims to find on the Web a set of images to illustrate the content of an input short text. The text is modelled as a semantic space and a set of relevant keywords is extracted. We compare and discuss different methods to create semantic representations by keyword extraction. Keywords are used to query the Google Image Search engine for a list of relevant images. We also extract information from the Web pages that include the retrieved images, to create an Image Semantic Space, which is compared to the Text Semantic Space in order to rank the list of retrieved images. Tests showed that our method achieves v…
Normalised compression distance and evolutionary distance of genomic sequences: comparison of clustering results
2009
Genomic sequences are usually compared using evolutionary distance, a procedure that implies the alignment of the sequences. Alignment of long sequences is a time-consuming procedure and the resulting dissimilarity is not a metric. Recently, the normalised compression distance was introduced as a method to calculate the distance between two generic digital objects and it seems a suitable way to compare genomic strings. In this paper, the clustering and the non-linear mapping obtained using the evolutionary distance and the compression distance are compared, in order to understand whether the two sets of distances are similar.
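The normalised compression distance mentioned above is commonly defined as NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C is the compressed length. The sketch below applies that formula with `zlib` as the compressor. The choice of compressor is an assumption (the abstract does not name one), and the helper names `compressed_size` and `ncd` are hypothetical.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed input (maximum compression)."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalised compression distance between two byte strings.

    Values near 0 suggest the inputs share most of their structure;
    values near 1 suggest they share little. With a real compressor
    the result is only an approximation and can slightly exceed 1.
    """
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


# Hypothetical toy sequences; no alignment is needed.
seq_a = b"ACGT" * 200
seq_b = b"ACGA" * 200
d = ncd(seq_a, seq_b)
```

General-purpose compressors such as zlib have a limited window size, so for long genomic sequences a compressor with a larger context would likely be a better fit.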
Analogy
2020
Analogy is a mode of reasoning that is employed in problem solving, logic, science and art. The scheme of analogical reasoning is centred on the detection of similarity or common features across domains. Copi and Cohen (2005), Keynes (1921), Carnap (1980) suggested what analogical reasoning consists of. De Finetti (1938) proposed an alternative treatment of analogy as inference on what is invariant across statistical distributions of distinct event kinds. In problem solving theory, cognitive models of the content and the structural mapping of analogy have been built. Science and art have provided important test beds for models.
A method for quantifying atrial fibrillation organization based on wave-morphology similarity
2002
A new method for quantifying the organization of single bipolar electrograms recorded in the human atria during atrial fibrillation (AF) is presented. The algorithm relies on the comparison between pairs of local activation waves (LAWs) to estimate their morphological similarity, and returns a regularity index (ρ) which measures the extent of repetitiveness over time of the detected activations. The database consisted of endocardial data from a multipolar basket catheter during AF and intraatrial recordings during atrial flutter. The index showed maximum regularity (ρ=1) for all atrial flutter episodes and decreased significantly when increasing AF complexity as defined by W…
Morphological Similarity and Ecological Overlap in Two Rotifer Species
2013
Co-occurrence of cryptic species raises theoretically relevant questions regarding their coexistence and ecological similarity. Given their great morphological similitude and close phylogenetic relationship (i.e., niche retention), these species will have similar ecological requirements and are expected to have strong competitive interactions. This raises the problem of finding the mechanisms that may explain the coexistence of cryptic species and challenges the conventional view of coexistence based on niche differentiation. The cryptic species complex of the rotifer Brachionus plicatilis is an excellent model to study these questions and to test hypotheses regarding ecological differentia…