Search results for "Indexing"
showing 10 items of 94 documents
A Lack of Attribution: Closing the Citation Gap Through a Reform of Citation and Indexing Practices
2012
Fauna Europaea: Coleoptera 2 (excl. series Elateriformia, Scarabaeiformia, Staphyliniformia and superfamily Curculionoidea)
2015
Fauna Europaea provides a public web-service with an index of scientific names (including synonyms) of all living European land and freshwater animals, their geographical distribution at country level (up to the Urals, excluding the Caucasus region), and some additional information. The Fauna Europaea project covers about 230,000 taxonomic names, including 130,000 accepted species and 14,000 accepted subspecies, which is much more than the originally projected number of 100,000 species. This represents a huge effort by more than 400 contributing specialists throughout Europe and is a unique (standard) reference suitable for many users in science, government, industry, nature conservation an…
Exceptional Pattern Discovery
2017
This chapter is devoted to a discussion on exceptional pattern discovery, namely on scenarios, contexts, and techniques concerning the mining of patterns which are so rare or so frequent to be considered as exceptional and, then, of interest for an expert to shed lights on the domain. Frequent patterns have found broad applications in areas like association rule mining, indexing, and clustering [1, 20, 23]. The application of frequent patterns in classification also achieved some success in the classification of relational data [6, 13, 14, 19, 25], text [15], and graphs [7]. The part is organized as follows. First, the frequent pattern mining on classical datasets is presented. This is not …
Fragments of peer review: A quantitative analysis of the literature (1969-2015)
2018
This paper examines research on peer review between 1969 and 2015 by looking at records indexed from the Scopus database. Although it is often argued that peer review has been poorly investigated, we found that the number of publications in this field doubled from 2005. A half of this work was indexed as research articles, a third as editorial notes and literature reviews and the rest were book chapters or letters. We identified the most prolific and influential scholars, the most cited publications and the most important journals in the field. Co-authorship network analysis showed that research on peer review is fragmented, with the largest group of co-authors including only 2.1% of the wh…
Citations and metrics of journals discontinued from Scopus for publication concerns: the GhoS(t)copus Project
2020
Background: Scopus is a leading bibliometric database. It contains a large part of the articles cited in peer-reviewed publications. The journals included in Scopus are periodically re-evaluated to ensure they meet indexing criteria and some journals might be discontinued for 'publication concerns'. Previously published articles may remain indexed and can be cited. Their metrics have yet to be studied. This study aimed to evaluate the main features and metrics of journals discontinued from Scopus for publication concerns, before and after their discontinuation, and to determine the extent of predatory journals among the discontinued journals. Methods: We surveyed the list of discontinued jo…
Reverse-safe data structures for text indexing
2021
We introduce the notion of reverse-safe data structures. These are data structures that prevent the reconstruction of the data they encode (i.e., they cannot be easily reversed). A data structure D is called z-reverse-safe when there exist at least z datasets with the same set of answers as the ones stored by D. The main challenge is to ensure that D stores as many answers to useful queries as possible, is constructed efficiently, and has size close to the size of the original dataset it encodes. Given a text of length n and an integer z, we propose an algorithm which constructs a z-reverse-safe data structure that has size O(n) and answers pattern matching queries of length at most d optim…
Priming index of the Spanish word fragments from the Dasí, Soler, and Ruiz (2004) database
2007
Word-fragment completion is a frequently used test in implicit memory research. A database of 196 Spanish fragments was recently published (Dasi, Soler, & Ruiz, 2004) in which the fragments were described for indices, such as difficulty, familiarity, frequency, number of meanings, and so on (www.psychonomic.org/archive). In this work, a new index, thepriming index, is described for the same 196 fragments. This index is calculated for each fragment by subtracting the difficulty index (the proportion of correct completion when the fragment is not studied) from the proportion of correct completion when the fragment is studied, and it means the capacity of an item to be primed. In order to dete…
Folksonomijas analīze sociālajā vietnē Instagram
2018
Folksonomija ir lietotāju brīvi izvēlētu atslēgvārdu kopa, un kā informācijas apstrādes un organizēšanas veids internetā mūsdienās ir plaši izplatīta. Pētījuma mērķis ir izpētīt sociālās vietnes Instagram folksonomiju, analizēt to struktūru, izmantojot kategoriju veidošanas teorijas un Sāras Šatfordes-Leinas (Sara Shatford-Layne) attēlu indeksēšanas modeli. Ar aptaujas palīdzību noskaidrot lietotāju paradumus sociālās vietnes Instagram lietošanā un atslēgvārdu piešķiršanā. Pētījuma mērķa sasniegšanai izmantota gadījuma analīze. Pēc noteiktiem kritērijiem izveidota 30 Instagram fotogrāfiju izlase, kurai veikta pievienoto atslēgvārdu kontentanalīze. Kā arī tika veikta Instagram lietotāju apta…
Video preprocessing for audiovisual indexing
2003
We address the problem of detecting shots of subjects that are interviewed in news sequences. This is useful since usually these kinds of scenes contain important and reusable information that can be used for other news programs. In a previous paper, we presented a technique based on a priori knowledge of the editing techniques used in news sequences which allowed a fast search of news stories (see Albiol, A. et al., 3rd Int. Conf. on Audio and Video-based Biometric Person Authentication, p.366-71, 2001). We now present a new shot descriptor technique which improves the previous search results by using a simple, yet efficient, algorithm, based on the information contained in consecutive fra…
A big data approach for sequences indexing on the cloud via burrows wheeler transform
2020
Indexing sequence data is important in the context of Precision Medicine, where large amounts of "omics"data have to be daily collected and analyzed in order to categorize patients and identify the most effective therapies. Here we propose an algorithm for the computation of Burrows Wheeler transform relying on Big Data technologies, i.e., Apache Spark and Hadoop. Our approach is the first that distributes the index computation and not only the input dataset, allowing to fully benefit of the available cloud resources. Copyright © 2020 for this paper by its authors.