Search results for "similarity"
showing 10 items of 126 documents
A novel XML document structure comparison framework based-on sub-tree commonalities and label semantics
2012
XML similarity evaluation has become a central issue in the database and information communities, its applications ranging over document clustering, version control, data integration and ranked retrieval. Various algorithms for comparing hierarchically structured data, XML documents in particular, have been proposed in the literature. Most of them make use of techniques for finding the edit distance between tree structures, XML documents being commonly modeled as Ordered Labeled Trees. Yet, a thorough investigation of current approaches led us to identify several similarity aspects, i.e., sub-tree related structural and semantic similarities, which are not sufficient…
XML document-grammar comparison: related problems and applications
2011
DOI: 10.2478/s13537-011-0005-1. XML document comparison is becoming an ever more popular research issue due to the increasingly abundant use of XML. Likewise, a growing interest fosters the development of XML grammar matching and comparison, due to the proliferation of heterogeneous XML data sources, particularly on the Web. Nonetheless, the process of comparing XML documents with XML grammars, i.e., XML document and grammar similarity evaluation, has not yet received the attention it deserves. In this paper, we provide an overview of existing research related to XML document/grammar comparison, presenting the background and discussing the various techniques related to th…
Connected pathway relative permeability from pore-scale imaging of imbibition
2016
Pore-scale images obtained from a synchrotron-based X-ray computed micro-tomography (µCT) imbibition experiment in sandstone rock were used to conduct Navier–Stokes flow simulations on the connected pathways of water and oil phases. The resulting relative permeability was compared with steady-state Darcy-scale imbibition experiments on 5 cm large twin samples from the same outcrop sandstone material. While the relative permeability curves display a large degree of similarity, the endpoint saturations for the µCT data are 10% in saturation units higher than the experimental data. However, the two datasets match well when normalizing to the mobile saturation range. The agreement is p…
Are calanco landforms similar to river basins?
2017
In the past, badlands have often been considered ideal field laboratories for studying landscape evolution because of their geometrical similarity to larger fluvial systems. For a given hydrological process, no scientific proof exists that badlands can be considered a model of river basin prototypes. In this paper, the measurements carried out on 45 Sicilian calanchi, a type of badlands that appears as a small-scale hydrographic unit, are used to establish their morphological similarity with river systems whose data are available in the literature. At first the geomorphological similarity is studied by identifying the dimensionless groups, which can assume the same value or a scaled one in…
Fuzzy environmental analogy index to develop environmental similarity maps for designing air quality monitoring networks on a large-scale
2019
All activities aimed at studying the primary causes and effects of air pollution cannot disregard the fact that it is necessary to have an optimal air quality monitoring network for assessing population exposure to air pollution and predicting the magnitude of the health risks. In the framework of a cooperation between the ARPA Sicilia Organization and the Department of Engineering, University of Palermo, research was performed to develop an innovative methodology useful for defining environmental similarity maps aimed at supporting the design of air quality monitoring networks at the regional scale. This approach is based on a new index called the fuzzy environmental analogy index (FEAI) b…
Semantic HMC for Big Data Analysis
2014
Analyzing Big Data can help corporations to improve their efficiency. In this work we present a new vision to derive Value from Big Data using a Semantic Hierarchical Multi-label Classification called Semantic HMC, based on a non-supervised Ontology learning process. We also propose a Semantic HMC process, using scalable Machine-Learning techniques and Rule-based reasoning.
Quantum GestART: Identifying and Applying Correlations between Mathematics, Art, and Perceptual Organization
2020
Mathematics can help analyze the arts and inspire new artwork. Mathematics can also help make transformations from one artistic medium to another, considering exceptions and choices, as well as artists' individual and unique contributions. We propose a method based on diagrammatic thinking and quantum formalism. We exploit decompositions of complex forms into a set of simple shapes, discretization of complex images, and Dirac notation, imagining a world of "prototypes" that can be connected to obtain a fine or coarse-graining approximation of a given visual image. Visual prototypes are exchanged with auditory ones, and the information (position, size) characterizing visual prototypes is con…
TESTING SIMILARITY COEFFICIENTS FOR ANALYSIS OF THE FOSSIL RECORD USING CLUSTERING METHODS: THE PALAEOZOIC FLORA AS A STUDY CASE
2020
This paper reports a global methodological approach based on the similarity and clustering methods of the Palaeozoic plant fossil record using a comparative approach between two similarity measures: the Jaccard and Raup-Crick Coefficients. The results show that although the Raup-Crick Coefficients clearly have the potential for providing more robust results, the consequences of the extinction processes are better reflected in the similarity analysis based on the Jaccard Coefficients. On the other hand, the cluster analysis based on the UPGMA algorithm shows four robust clusters and reveals new evidence for the singularity of Mississippian flora. Finally, the results obtained reveal that similari…
Adapted Transfer of Distance Measures for Quantitative Structure-Activity Relationships and Data-Driven Selection of Source Datasets
2012
Quantitative structure–activity relationships are regression models relating chemical structure to biological activity. Such models allow predictions to be made for toxicologically relevant endpoints, which constitute the target outcomes of experiments. The task is often tackled by instance-based methods, which are all based on the notion of chemical (dis-)similarity. Our starting point is the observation by Raymond and Willett that the two families of chemical distance measures, fingerprint-based and maximum common subgraph-based measures, provide orthogonal information about chemical similarity. This paper presents a novel method for finding suitable combinations of them, called adapted tran…
Prediction of the difficulty level in a standardized reading comprehension test : contributions from cognitive psychology and psychometrics
2013
This research seeks to identify possible predictors of the difficulty level of the reading comprehension items used in a standardized psychometric test for admission to a university. Several possible predictors of difficulty level were proposed, namely: propositional density, negations, syntactic structure, vocabulary difficulty, presence of emphasis elements (typographically highlighted words), item abstraction, and the degree of similarity between the correct option and the text relevant to solving the item. Using the Linear Logistic Latent Trait Model, it was found that the number of propositions, the syntactic structure and, …