Search results for "Trie"
showing 10 items of 4468 documents
CoCoDat: a database system for organizing and selecting quantitative data on single neurons and neuronal microcircuitry.
2004
We present a novel database system for organizing and selecting quantitative experimental data on single neurons and neuronal microcircuitry that has proven useful for reference-keeping, experimental planning and computational modelling. Building on our previous experience with large neuroscientific databases, the system takes into account the diversity and method-dependence of single cell and microcircuitry data and provides tools for entering and retrieving published data without a priori interpretation or summarizing. Data representation is based on the framework suggested by biophysical theory and enables flexible combinations of data on membrane conductances, ionic and synaptic current…
A methodology to assess the intrinsic discriminative ability of a distance function and its interplay with clustering algorithms for microarray data …
2013
Abstract Background Clustering is one of the most well known activities in scientific investigation and the object of research in many disciplines, ranging from statistics to computer science. Following Handl et al., it can be summarized as a three step process: (1) choice of a distance function; (2) choice of a clustering algorithm; (3) choice of a validation method. Although such a purist approach to clustering is hardly seen in many areas of science, genomic data require that level of attention, if inferences made from cluster analysis have to be of some relevance to biomedical research. Results A procedure is proposed for the assessment of the discriminative ability of a distance functi…
The Elephant in the Machine: Proposing a New Metric of Data Reliability and its Application to a Medical Case to Assess Classification Reliability
2020
In this paper, we present and discuss a novel reliability metric to quantify the extent a ground truth, generated in multi-rater settings, as a reliable basis for the training and validation of machine learning predictive models. To define this metric, three dimensions are taken into account: agreement (that is, how much a group of raters mutually agree on a single case)
¿Cómo funciona el sistema de innovación del sector cerámico español?
2013
[EN]: In this article we apply the functions of innovation systems framework to assess its appropriateness to characterise the innovation activity of the tile industry in Castellón. This framework is based on idea that a well functioning innovation system requires that a number of key activities take place. If this occurs innovative output is higher. Our analysis provides a deeper understanding of the role of innovation as a strategic option in a mature industry in the context of globalisation. By applying this new theoretical approach to study innovation and highlighting the functions that the system requires, we shown the constraints, inertias, challenges and opportunities that the innova…
Missing values in deduplication of electronic patient data
2011
Data deduplication refers to the process in which records referring to the same real-world entities are detected in datasets such that duplicated records can be eliminated. The denotation ‘record linkage’ is used here for the same problem.1 A typical application is the deduplication of medical registry data.2 3 Medical registries are institutions that collect medical and personal data in a standardized and comprehensive way. The primary aims are the creation of a pool of patients eligible for clinical or epidemiological studies and the computation of certain indices such as the incidence in order to oversee the development of diseases. The latter task in particular requires a database in wh…
Vectors of Pairwise Item Preferences
2019
Neural embedding has been widely applied as an effective category of vectorization methods in real-world recommender systems. However, its exploration of users’ explicit feedback on items, to create good quality user and item vectors is still limited. Existing neural embedding methods only consider the items that are accessed by the users, but neglect the scenario when a user gives high or low rating to a particular item. In this paper, we propose Pref2Vec, a method to generate vector representations of pairwise item preferences, users and items, which can be directly utilized for machine learning tasks. Specifically, Pref2Vec considers users’ pairwise item preferences as elementary units. …
Men's doubles professional tennis on hard courts: Game structure and point ending characteristics
2019
Despite the great tradition and importance of the doubles game in professional tennis, no literature has analysed to date the performance of professional players. Therefore, the information on the characteristics of the game, or the tactics related to how the points are won in doubles play is scarce. The objective of this study has been to describe the basic characteristics of the structure of the doubles game, and to establish how the points finish in doubles professional tennis played on hard courts. Thirty-four ATP doubles matches played in 2018 were analysed, which included a total of 40 professional players. As per the game structure, the results showed that, in comparison to the singl…
Research on Vocabulary Sizes and Codebook Universality
2014
Published version of an article in the journal: Abstract and Applied Analysis. Also available from the publisher at: http://dx.doi.org/10.1155/2014/697245 Open Access Codebook is an effective image representation method. By clustering in local image descriptors, a codebook is shown to be a distinctive image feature and widely applied in object classification. In almost all existing works on codebooks, the building of the visual vocabulary follows a basic routine, that is, extracting local image descriptors and clustering with a user-designated number of clusters. The problem with this routine lies in that building a codebook for each single dataset is not efficient. In order to deal with th…
Supplementary material from Competition between strains of Borrelia afzelii inside the rodent host and the tick vector
2018
Supplementary material supporting the paper
Flavonoid constituents of Stachys aegyptiaca
1991
International audience