0000000001151336
AUTHOR
Caetano Traina
Adding Knowledge Extracted by Association Rules into Similarity Queries
International audience; In this paper, we propose new techniques to improve the quality of similarity queries over image databases performing association rule mining over textual descriptions and automatically extracted features of the image content. Based on the knowledge mined, each query posed is rewritten in order to better meet the user expectations. We propose an extension of SQL aimed at exploring mining processes over complex data, generating association rules that extract semantic information from the textual description superimposed to the extracted features, thereafter using them to rewrite the queries. As a result, the system obtains results closer to the user expectation than i…
Integrating user preference to similarity queries over medical images datasets
International audience; Large amounts of images from medical exams are being stored in databases, so developing retrieval techniques is an important research problem. Retrieval based on the image visual content is usually better than using textual descriptions, as they seldom gives every nuances that the user may be interested in. Content-based image retrieval employs the similarity among images for retrieval. However, similarity is evaluated using numeric methods, and they often orders the images by similarity in a way rather distinct from the user's intention. In this paper, we propose a technique to allow expressing the user's preference over attributes associated to the images, so simil…
CLEARMiner: a new algorithm for mining association patterns on heterogeneous time series from climate data
International audience; Recently, improvements in sensor technology contributed to increasing in spatial data acquisition. The use of remote sensing in many countries and states, where agricultural business is a large part of their gross income, can provide a valuable source to improve their economy. The combination of climate and remote sensing data can reveal useful information, which can help researchers to monitor and estimate the production of agricultural crops. Data mining techniques are the main tools to analyze and extract relationships and patterns. In this context, this paper presents a new algorithm for mining association patterns in Geo-referenced databases of climate and satel…
Identifying Algebraic Properties to Support Optimization of Unary Similarity Queries
International audience; Abstract. Conventional operators for data retrieval are either based on exact matching or on total order relationship among elements. Neither ofthem is appropriate to manage complex data, such as multimedia data, time series and genetic sequences. In fact, the most meaningful way tocompare complex data is by similarity. However, the Relational Algebra, employed in the Relational Database Management Systems (RDBMS),cannot express similarity criteria. In order to address this issue, we provide here an extension of the Relational Algebra, aimed at representingsimilarity queries in algebraic expressions. This paper identies fundamental properties to allow the integration…
XML document-grammar comparison: related problems and applications
10.2478/s13537-011-0005-1; International audience; XML document comparison is becoming an ever more popular research issue due to the increasingly abundant use of XML. Likewise, a growing interest fosters the development of XML grammar matching and comparison, due to the proliferation of heterogeneous XML data sources, particularly on the Web. Nonetheless, the process of comparing XML documents with XML grammars, i.e., XML document and grammar similarity evaluation, has not yet received the attention it deserves. In this paper, we provide an overview on existing research related to XML document/grammar comparison, presenting the background and discussing the various techniques related to th…