Search results for "Trie"

showing 10 items of 4468 documents

CoCoDat: a database system for organizing and selecting quantitative data on single neurons and neuronal microcircuitry.

2004

We present a novel database system for organizing and selecting quantitative experimental data on single neurons and neuronal microcircuitry that has proven useful for reference-keeping, experimental planning and computational modelling. Building on our previous experience with large neuroscientific databases, the system takes into account the diversity and method-dependence of single cell and microcircuitry data and provides tools for entering and retrieving published data without a priori interpretation or summarizing. Data representation is based on the framework suggested by biophysical theory and enables flexible combinations of data on membrane conductances, ionic and synaptic current…

Computer sciencecomputer.internet_protocolRelational databaseModels NeurologicalAction PotentialsInformation Storage and Retrievalcomputer.software_genreMachine learningExternal Data RepresentationData retrievalAnimalsComputer SimulationLayer (object-oriented design)NeuronsDatabasebusiness.industryGeneral NeuroscienceExperimental dataRatsData sharingScalabilityDatabase Management SystemsArtificial intelligenceNeural Networks ComputerNerve NetbusinesscomputerXMLJournal of neuroscience methods

researchProduct

A methodology to assess the intrinsic discriminative ability of a distance function and its interplay with clustering algorithms for microarray data …

2013

Abstract Background Clustering is one of the most well known activities in scientific investigation and the object of research in many disciplines, ranging from statistics to computer science. Following Handl et al., it can be summarized as a three step process: (1) choice of a distance function; (2) choice of a clustering algorithm; (3) choice of a validation method. Although such a purist approach to clustering is hardly seen in many areas of science, genomic data require that level of attention, if inferences made from cluster analysis have to be of some relevance to biomedical research. Results A procedure is proposed for the assessment of the discriminative ability of a distance functi…

Computer sciencecomputer.software_genreBiochemistrysymbols.namesakeDiscriminative modelStructural BiologyCluster AnalysisRelevance (information retrieval)Cluster analysisMolecular BiologyOligonucleotide Array Sequence AnalysisClustering discriminative ability of a distance function external validation indicesSettore INF/01 - InformaticaResearchApplied MathematicsMutual informationPearson product-moment correlation coefficientComputer Science ApplicationsHierarchical clusteringEuclidean distanceRange (mathematics)Metric (mathematics)symbolsData miningTranscriptomecomputerAlgorithmsBMC Bioinformatics

researchProduct

The Elephant in the Machine: Proposing a New Metric of Data Reliability and its Application to a Medical Case to Assess Classification Reliability

2020

In this paper, we present and discuss a novel reliability metric to quantify the extent a ground truth, generated in multi-rater settings, as a reliable basis for the training and validation of machine learning predictive models. To define this metric, three dimensions are taken into account: agreement (that is, how much a group of raters mutually agree on a single case)

Computer sciencekneeMachine learningcomputer.software_genrelcsh:TechnologyTask (project management)lcsh:Chemistry03 medical and health sciencesMagnetic resonance imaging0302 clinical medicine0504 sociologyGeneral Materials Science030212 general & internal medicinelcsh:QH301-705.5InstrumentationCompetence (human resources)MRNetReliability (statistics)Fluid Flow and Transfer ProcessesGround truthreliabilityBasis (linear algebra)Point (typography)lcsh:Tbusiness.industryComputer Science::Information RetrievalProcess Chemistry and Technology05 social sciencesGeneral Engineering050401 social sciences methodslcsh:QC1-999Computer Science ApplicationsInter-rater reliabilitymachine learninglcsh:Biology (General)lcsh:QD1-999lcsh:TA1-2040inter-rater agreementArtificial intelligenceMetric (unit)lcsh:Engineering (General). Civil engineering (General)businessground truthcomputerlcsh:PhysicsApplied Sciences

researchProduct

¿Cómo funciona el sistema de innovación del sector cerámico español?

2013

[EN]: In this article we apply the functions of innovation systems framework to assess its appropriateness to characterise the innovation activity of the tile industry in Castellón. This framework is based on idea that a well functioning innovation system requires that a number of key activities take place. If this occurs innovative output is higher. Our analysis provides a deeper understanding of the role of innovation as a strategic option in a mature industry in the context of globalisation. By applying this new theoretical approach to study innovation and highlighting the functions that the system requires, we shown the constraints, inertias, challenges and opportunities that the innova…

Computer sciencemedia_common.quotation_subjectIndustria cerámicaFunctional approachSystem functionsContext (language use)Análisis funcionalIndustrial and Manufacturing Engineeringlcsh:TP785-869innovation systemsGlobalizationOrder (exchange)TaverneFunction (engineering)Industrial organizationmedia_commonFlexibility (engineering)system functionsInnovation systemInnovacions tecnològiqueslcsh:Clay industries. Ceramics. GlassMechanics of MaterialsSistemas de innovaciónCeràmicatile industryTile industryInnovation systemsCeramics and CompositesKey (cryptography)

researchProduct

Missing values in deduplication of electronic patient data

2011

Data deduplication refers to the process in which records referring to the same real-world entities are detected in datasets such that duplicated records can be eliminated. The denotation ‘record linkage’ is used here for the same problem.1 A typical application is the deduplication of medical registry data.2 3 Medical registries are institutions that collect medical and personal data in a standardized and comprehensive way. The primary aims are the creation of a pool of patients eligible for clinical or epidemiological studies and the computation of certain indices such as the incidence in order to oversee the development of diseases. The latter task in particular requires a database in wh…

Computer sciencemedia_common.quotation_subjectInferenceHealth InformaticsAmbiguityPatient dataMissing datacomputer.software_genreResearch and ApplicationsRegressionNeoplasmsStatisticsData deduplicationElectronic Health RecordsHumansData miningImputation (statistics)Medical Record LinkageRegistriescomputerRecord linkagemedia_common

researchProduct

Vectors of Pairwise Item Preferences

2019

Neural embedding has been widely applied as an effective category of vectorization methods in real-world recommender systems. However, its exploration of users’ explicit feedback on items, to create good quality user and item vectors is still limited. Existing neural embedding methods only consider the items that are accessed by the users, but neglect the scenario when a user gives high or low rating to a particular item. In this paper, we propose Pref2Vec, a method to generate vector representations of pairwise item preferences, users and items, which can be directly utilized for machine learning tasks. Specifically, Pref2Vec considers users’ pairwise item preferences as elementary units. …

Computer scienceneuraalilaskentaInitialization02 engineering and technology010501 environmental sciencesRecommender systemMachine learningcomputer.software_genre01 natural sciences0202 electrical engineering electronic engineering information engineeringvectorizationPreference (economics)Independence (probability theory)0105 earth and related environmental sciencesbusiness.industryComputer Science::Information RetrievalsuosittelujärjestelmätConditional probabilityneural embeddingVectorization (mathematics)Benchmark (computing)020201 artificial intelligence & image processingPairwise comparisonArtificial intelligencebusinesscomputer

researchProduct

Men's doubles professional tennis on hard courts: Game structure and point ending characteristics

2019

Despite the great tradition and importance of the doubles game in professional tennis, no literature has analysed to date the performance of professional players. Therefore, the information on the characteristics of the game, or the tactics related to how the points are won in doubles play is scarce. The objective of this study has been to describe the basic characteristics of the structure of the doubles game, and to establish how the points finish in doubles professional tennis played on hard courts. Thirty-four ATP doubles matches played in 2018 were analysed, which included a total of 40 professional players. As per the game structure, the results showed that, in comparison to the singl…

Computer sciencetacticsPhysical Therapy Sports Therapy and Rehabilitation010501 environmental sciences01 natural sciences03 medical and health sciencesProfessional players0302 clinical medicinedoublesGame structureEducación Física y DeportivaRelevance (information retrieval)performance analysislcsh:Sports medicineSet (psychology)0105 earth and related environmental sciencesStructure (mathematical logic)Point (typography)Performance analysisOffensiveComputingMilieux_PERSONALCOMPUTING030229 sport sciencesprofessional playersDoubleslcsh:RC1200-1245Mathematical economicshuman activitiesTactics

researchProduct

Research on Vocabulary Sizes and Codebook Universality

2014

Published version of an article in the journal: Abstract and Applied Analysis. Also available from the publisher at: http://dx.doi.org/10.1155/2014/697245 Open Access Codebook is an effective image representation method. By clustering in local image descriptors, a codebook is shown to be a distinctive image feature and widely applied in object classification. In almost all existing works on codebooks, the building of the visual vocabulary follows a basic routine, that is, extracting local image descriptors and clustering with a user-designated number of clusters. The problem with this routine lies in that building a codebook for each single dataset is not efficient. In order to deal with th…

ComputingMethodologies_PATTERNRECOGNITIONArticle SubjectApplied Mathematicslcsh:MathematicsInformationSystems_INFORMATIONSTORAGEANDRETRIEVALComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONVDP::Technology: 500::Information and communication technology: 550Analysis; Applied Mathematicslcsh:QA1-939Analysis

researchProduct

Supplementary material from Competition between strains of Borrelia afzelii inside the rodent host and the tick vector

2018

Supplementary material supporting the paper

ComputingMethodologies_SIMULATIONANDMODELINGInformationSystems_INFORMATIONSTORAGEANDRETRIEVALComputingMethodologies_DOCUMENTANDTEXTPROCESSINGComputingMilieux_COMPUTERSANDEDUCATIONComputerApplications_COMPUTERSINOTHERSYSTEMS

researchProduct

Flavonoid constituents of Stachys aegyptiaca

1991

International audience

ComputingMilieux_MISCELLANEOUS[SDV.BV.PEP] Life Sciences [q-bio]/Vegetal Biology/Phytopathology and phytopharmacy[SDV.BV.PEP]Life Sciences [q-bio]/Vegetal Biology/Phytopathology and phytopharmacySPECTROMETRIE UV

researchProduct