Search results for "RETRIEVAL"
showing 10 items of 1176 documents
Some Results Using Different Approaches to Merge Visual and Text-Based Features in CLEF’08 Photo Collection
2009
This paper describes the participation of the MIRACLE team in the ImageCLEF Photographic Retrieval task of CLEF 2008. We succeeded in submitting 41 runs. As in previous MIRACLE team campaigns [5, 6], which used different software, the results from text-based retrieval are better than those from content-based retrieval. Our main aim was to experiment with several merging approaches to fuse text-based and content-based retrieval results. One of the three merging algorithms improved on the text-based baseline, even though visual results are lower than textual ones.
Cover Feature: Research Data in Chemistry – Results of the first NFDI4Chem Community Survey (Z. Anorg. Allg. Chem. 23‐24/2020)
2020
Institutionalism, cultural institutions and cultural policy in the Nordic countries
2010
In the article our aim is to analyse theoretically the questions: (1) what is the relevance of institutional approach in research about cultural policy and cultural institutions, and (2) how do the ...
BUCC Shared Task: Cross-Language Document Similarity
2015
We summarise the organisation and results of the first shared task aimed at detecting the most similar texts in a large multilingual collection. The dataset of the shared task was based on Wikipedia dumps with interlanguage links, with further filtering to ensure comparability of the paired articles. The eleven system runs we received were evaluated using the TREC evaluation metrics. Parallel corpora of original texts with their translations have provided the basis for multilingual NLP applications since the beginning of the 1990s. The relative scarcity of such resources led to greater attention to comparable (=less parallel) resources to mine information about possible translations…
RDF* Graph Database as Interlingua for the TextWorld Challenge
2019
This paper briefly describes the top-scoring submission to the First TextWorld Problems: A Reinforcement and Language Learning Challenge. To alleviate the partial observability problem characteristic of TextWorld games, we split the Agent into two independent components, Observer and Actor, which communicate only via the Interlingua of the RDF* graph database. The RDF* graph database serves as the “world model” memory, incrementally updated by the Observer via FrameNet-informed Natural Language Understanding techniques, and is used by the Actor for efficient exploration and planning of the game Action sequences. We find that the deep-learning approach works best for the Observer componen…
Multi-data models translations in interoperable information systems
1996
Interoperation of heterogeneous and autonomous information systems has traditionally been hampered by semantic differences in their data models. In this paper, we address the problem by defining a methodology called TIME, which is based on an extensible meta model. Its key features are: a set of meta-types which can be used to represent the syntax and the semantics of data modeling concepts, a knowledge base of transformation rules that map a meta-type into other meta-types, and an inference engine which uses the transformation rules to translate schema from source to target models. The extensibility of the meta-model is achieved by organizing the meta-types into a generalization hierarchy …
Reducing the Human Effort in Text Line Segmentation for Historical Documents
2021
Labeling the layout of historical documents to prepare training data for machine learning techniques is an arduous task that requires great human effort. A draft of the layout can be obtained with a document layout analysis (DLA) system and then corrected by the user with less effort than labeling from scratch. In this paper we study an iterative process in which the user supervises and corrects the draft only for the pages automatically selected by the DLA system, with the aim of reducing the required human effort. The results obtained show that similar DLA quality can be achieved by reducing the number of pages that the user has to annotate and that the accumulated h…
A relevance feedback CBIR algorithm based on fuzzy sets
2008
CBIR (content-based image retrieval) systems attempt to allow users to perform searches in large picture repositories. In most existing CBIR systems, images are represented by vectors of low level features. Searches in these systems are usually based on distance measurements defined in terms of weighted combinations of the low level features. This paper presents a novel approach to combining features when using multi-image queries consisting of positive and negative selections. A fuzzy set is defined so that the degree of membership of each image in the repository to this fuzzy set is related to the user's interest in that image. Positive and negative selections are then used to determine t…
On some flaws of university rankings: The example of the SCImago report
2012
Using France as our main example, we show there is much scope for improving the SCImago ranking. We detect problems of nomenclature, double affiliation, aggregation, and bias toward large publicly funded research organizations. The output per scholar is more important than the output per organization. The examples we cite suggest that only detailed knowledge of the situation can help in addressing the issue and take us beyond any automatic reading of the metadata.
Does relevance matter to data mining research?
2008
Data mining (DM) and knowledge discovery are intelligent tools that help to accumulate and process data and make use of it. We review several existing frameworks for DM research that originate from different paradigms. These DM frameworks mainly address various DM algorithms for the different steps of the DM process. Recent research has shown that many real-world problems require integration of several DM algorithms from different paradigms in order to produce a better solution elevating the importance of practice-oriented aspects also in DM research. In this chapter we strongly emphasize that DM research should also take into account the relevance of research, not only the rigor of it. Und…