Search results for " Information systems"
showing 10 items of 940 documents
A novel XML document structure comparison framework based-on sub-tree commonalities and label semantics
2012
International audience; XML similarity evaluation has become a central issue in the database and information communities, its applications ranging over document clustering, version control, data integration and ranked retrieval. Various algorithms for comparing hierarchically structured data, XML documents in particular, have been proposed in the literature. Most of them make use of techniques for finding the edit distance between tree structures, XML documents being commonly modeled as Ordered Labeled Trees. Yet, a thorough investigation of current approaches led us to identify several similarity aspects, i.e., sub-tree related structural and semantic similarities, which are not sufficient…
XML document-grammar comparison: related problems and applications
2011
10.2478/s13537-011-0005-1; International audience; XML document comparison is becoming an ever more popular research issue due to the increasingly abundant use of XML. Likewise, a growing interest fosters the development of XML grammar matching and comparison, due to the proliferation of heterogeneous XML data sources, particularly on the Web. Nonetheless, the process of comparing XML documents with XML grammars, i.e., XML document and grammar similarity evaluation, has not yet received the attention it deserves. In this paper, we provide an overview on existing research related to XML document/grammar comparison, presenting the background and discussing the various techniques related to th…
An overview on XML similarity: Background, current trends and future directions
2009
In recent years, XML has been established as a major means for information management, and has been broadly utilized for complex data representation (e.g. multimedia objects). Owing to an unparalleled increasing use of the XML standard, developing efficient techniques for comparing XML-based documents becomes essential in the database and information retrieval communities. In this paper, we provide an overview of XML similarity/comparison by presenting existing research related to XML similarity. We also detail the possible applications of XML comparison processes in various fields, ranging over data warehousing, data integration, classification/clustering and XML querying, and discuss some…
Extensible User-Based XML Grammar Matching
2009
International audience; XML grammar matching has found considerable interest recently due to the growing number of heterogeneous XML documents on the web and the increasing need to integrate, and consequently search and retrieve XML data originated from different data sources. In this paper, we provide an approach for automatic XML grammar matching and comparison aiming to minimize the amount of user effort required to perform the match task. We propose an open framework based on the concept of tree edit distance, integrating different matching criterions so as to capture XML grammar element semantic and syntactic similarities, cardinality and alternativeness constraints, as well as data-ty…
Diagnosis of Incipient Bearing Faults using Convolutional Neural Networks
2019
The majority of faults occurring in rotating electrical machinery is attributed to bearings. To reduce downtime, it is desired to apply various diagnostic methods so that bearing degradation can be detected in good time prior to a complete failure. The work presented in this paper utilizes a data-driven machine learning approach based on convolutional neural networks (CNNs) in order to diagnose different types of bearing faults. A one-dimensional CNN is trained on vibration signals and compared to a two-dimensional CNN trained in time-frequency domain using continuous wavelet transform (CWT). The proposed method is demonstrated on data collected from run-to-failure tests.The results show th…
Trading off accuracy for efficiency by randomized greedy warping
2016
Dynamic Time Warping (DTW) is a widely used distance measure for time series data mining. Its quadratic complexity requires the application of various techniques (e.g. warping constraints, lower-bounds) for deployment in real-time scenarios. In this paper we propose a randomized greedy warping algorithm for finding similarity between time series instances. We show that the proposed algorithm outperforms the simple greedy approach and also provides very good time series similarity approximation consistently, as compared to DTW. We show that the Randomized Time Warping (RTW) can be used in place of DTW as a fast similarity approximation technique by trading some classification accuracy for ve…
On the classification of dynamical data streams using novel “Anti-Bayesian” techniques
2018
Abstract The classification of dynamical data streams is among the most complex problems encountered in classification. This is, firstly, because the distribution of the data streams is non-stationary, and it changes without any prior “warning”. Secondly, the manner in which it changes is also unknown. Thirdly, and more interestingly, the model operates with the assumption that the correct classes of previously-classified patterns become available at a juncture after their appearance. This paper pioneers the use of unreported novel schemes that can classify such dynamical data streams by invoking the recently-introduced “Anti-Bayesian” (AB) techniques. Contrary to the Bayesian paradigm, tha…
A cloud based architecture to support Electronic Health Record.
2014
We introduce a novel framework of electronic healthcare enabled by a Cloud platform able to host both Hospital Information Systems (HIS) and Electronic Medical Record (EMR) systems and implement an innovative model of Electronic Health Record (EHR) that is not only patient-oriented but also supports a better governance of the whole healthcare system. The proposed EHR model adopts the state of the art of the Cloud technologies, being able to join the different clinical data of the patient stored within the HISs and EMRs either placed into a local Data Center or hosted into a Cloud Platform enabling new directions of data analysis.
Water erosion prediction by stochastic and empirical models in the Mediterranean: A case study in Northern Sicily (Italy)
2012
The present thesis aimed to explore the methodological advantages as well as limitations in applying different modelling approaches to predict water soil erosion in Mediterranean environments. The research was accomplished in the central northern part of Sicily (Italy), considering this region to be representative of Mediterranean environmental conditions. In this region soil degradation problems, due to water erosion are becoming more and more serious. Consequently, defining models being able to predict erosion susceptibility and to discriminate environmental factors causing erosion is important to protect soil resources. The prediction of the spatial distribution of soil erosion processes…
Mapping floristic diversity: a case study in Sicily
2021
This paper presents an operational methodology to map and analyze the floristic richness of “target species” in Natura 2000 sites, making use of G.I.S. tools and procedures. A Floristic diversity map (scale 1:50,000), covering an area of 612 km2, was produced by a team of experts as part of the management plans of “Madonie Mountains” Sites of Community Importance (SCIs), located in Sicily (Italy). The primary grid map represents the richness of “target species”, which include species of Community interests, taxa on the National Red List, endemic and threatened, species protected under International Conventions, taxa of phytogeographic importance. Secondary data frames include a three-dimens…