Search results for "Data mining"

showing 10 items of 907 documents

Grapes: a method and a SAS program for graphical representations of assessor performances

1994

GRAPES computes individual and global analyses of variance for sensory profiling data consisting of several sessions in which all the panelists gave scores to all the products for a number of attributes. The fitted model takes into account the session effect. GRAPES summarizes the results by means of graphical assessor scatterplots that make it possible to check and compare panelist performance, such as the way the scale is used, reliability, discrimination power and agreement with the panel. In addition, GRAPES detects the outliers for each of these criteria. The usefulness of GRAPES for the panel leader will be demonstrated using texture and flavor profiling of 4 restructured steaks …

0303 health sciences; 030309 nutrition & dietetics; Computer science; business.industry; [SDV]Life Sciences [q-bio]; Computer aid; TEST DE CONSOMMATION; 04 agricultural and veterinary sciences; computer.software_genre; 040401 food science; Sensory analysis; Sensory Systems; [SDV] Life Sciences [q-bio]; 03 medical and health sciences; 0404 agricultural biotechnology; Outlier; Profiling (information science); Data mining; Artificial intelligence; Graphics; business; computer; ComputingMilieux_MISCELLANEOUS; Food Science

researchProduct

Low-cost scalable discretization, prediction and feature selection for complex systems

2019

The introduced data-driven tool allows simultaneous feature selection, model inference, and marked cost and quality gains.

0303 health sciences; Multidisciplinary; 010504 meteorology & atmospheric sciences; Discretization; Computer science; Data classification; Probabilistic logic; Complex system; SciAdv r-articles; Feature selection; computer.software_genre; 01 natural sciences; 03 medical and health sciences; Range (mathematics); Scalability; Data mining; Cluster analysis; Algorithm; computer; Research Articles; Mathematics; Research Article; 030304 developmental biology; 0105 earth and related environmental sciences

researchProduct

Unlock ways to share data on peer review

2020

Peer review is the defining feature of scholarly communication. In a 2018 survey of more than 11,000 researchers, 98% said that they considered peer review important or extremely important for ensuring the quality and integrity of scholarly communication.

0303 health sciences; Multidisciplinary; business.industry; 05 social sciences; data mining; Public relations; 050905 science studies; Research management; Bibliometrics; Scientometrics; Research Integrity; 03 medical and health sciences; Work (electrical); Publishing; peer review data mining; peer review; Sociology; 0509 other social sciences; business; 030304 developmental biology

researchProduct

SORT-CC: A procedure for the statistical treatment of free sorting data

2008

A statistical approach for the analysis of free sorting data is discussed. In a first stage, the sorting data from each subject are arranged into a dataset consisting of indicator variables which reflect the membership of the stimuli in the groups formed by the subject under consideration. Thereafter, an appropriate standardization is applied to these data and a three-way statistical method, namely Common Components and Specific Weights Analysis, is performed on the datasets thus obtained. This makes it possible to take account of the individual differences among the subjects and to depict graphical displays showing the relationships among the stimuli on the one han…

0303 health sciences; Nutrition and Dietetics; Multivariate analysis; Basis (linear algebra); Standardization; 030309 nutrition & dietetics; Computer science; TRI; Sorting; 04 agricultural and veterinary sciences; [SDV.IDA] Life Sciences [q-bio]/Food engineering; computer.software_genre; 040401 food science; SORTING DATA; 03 medical and health sciences; 0404 agricultural biotechnology; Dummy variable; Three way; [SDV.IDA]Life Sciences [q-bio]/Food engineering; sort; Statistical analysis; MULTIVARIALE ANALYS; Data mining; computer; Food Science

researchProduct

Hyperion

2019

Indexes are essential in data management systems to increase the speed of data retrievals. Widespread data structures to provide fast and memory-efficient indexes are prefix tries. Implementations like Judy, ART, or HOT optimize their internal alignments for cache and vector unit efficiency. While these measures usually improve the performance substantially, they can have a negative impact on memory efficiency. In this paper we present Hyperion, a trie-based main-memory key-value store achieving extreme space efficiency. In contrast to other data structures, Hyperion does not depend on CPU vector units, but scans the data structure linearly. Combined with a custom memory allocator, Hyperion…

0303 health sciences; Range query (data structures); Computer science; Data structure; computer.software_genre; Search tree; 03 medical and health sciences; Memory management; Trie; Memory footprint; Data mining; Cache; computer; 030304 developmental biology; Proceedings of the 2019 International Conference on Management of Data

researchProduct

Cell state prediction through distributed estimation of transmit power

2019

Determining the state of each cell, for instance, cell outages, in a densely deployed cellular network is a difficult problem. Several prior studies have used minimization of drive test (MDT) reports to detect cell outages. In this paper, we propose a two-step process. First, using the MDT reports, we estimate the serving base station’s transmit power for each user. Second, we learn summary statistics of estimated transmit power for various network states and use these to classify the network state on test data. Our approach is able to achieve an accuracy of 96% on an NS-3 simulation dataset. Decision tree, random forest and SVM classifiers were able to achieve a classification accuracy of…

050101 languages & linguistics; Computer science; 05 social sciences; Process (computing); Decision tree; 5G-tekniikka; 02 engineering and technology; matkaviestinverkot; Transmitter power output; computer.software_genre; Random forest; cell outage detection; Support vector machine; Base station; machine learning; koneoppiminen; 0202 electrical engineering electronic engineering information engineering; Cellular network; 5G cellular networks; 020201 artificial intelligence & image processing; 0501 psychology and cognitive sciences; Data mining; computer; Test data

researchProduct

Reverse-safe data structures for text indexing

2021

We introduce the notion of reverse-safe data structures. These are data structures that prevent the reconstruction of the data they encode (i.e., they cannot be easily reversed). A data structure D is called z-reverse-safe when there exist at least z datasets with the same set of answers as the ones stored by D. The main challenge is to ensure that D stores as many answers to useful queries as possible, is constructed efficiently, and has size close to the size of the original dataset it encodes. Given a text of length n and an integer z, we propose an algorithm which constructs a z-reverse-safe data structure that has size O(n) and answers pattern matching queries of length at most d optim…

050101 languages & linguistics; Computer science; data structure; 02 engineering and technology; privacy; Set (abstract data type); combinatoric; 0202 electrical engineering electronic engineering information engineering; 0501 psychology and cognitive sciences; Pattern matching; Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni; algorithm; Settore INF/01 - Informatica; 05 social sciences; Search engine indexing; INF/01 - INFORMATICA; data mining; Data structure; Matrix multiplication; combinatorics; Exponent; 020201 artificial intelligence & image processing; data structure; algorithm; combinatorics; de Bruijn graph; data mining; privacy; Algorithm; Adversary model; de Bruijn graph; Integer (computer science)

researchProduct

An Internet-based program for depression using activity and physiological sensors: efficacy, expectations, satisfaction, and ease of use

2016

Cristina Botella,1,2 Adriana Mira,1 Inés Moragrega,2,3 Azucena García-Palacios,1,2 Juana Bretón-López,1,2 Diana Castilla,1,2 Antonio Riera López del Amo,1 Carla Soler,1 Guadalupe Molinari,1 Soledad Quero,1,2 Verónica Guillén-Botella,2,3 Ignacio Miralles,1,2 Sara Nebot,1 Berenice Serrano,1,2 Dennis Majoe,4 Mariano Alcañiz,2,5 Rosa María Baños2,3 1Department of Basic, Clinical Psychology and Psychobiology, Universitat Jaume, Castellón, Spain; 2CIBER Physiopathology of Obesity and Nutrition, CIBERobn, Instituto de Salud Carlos III, Santiago de Compostela, Spain; 3Department o…

050103 clinical psychology; medicine.medical_specialty; Neuropsychiatric Disease and Treatment; medicine.medical_treatment; Population; efficacy; computer.software_genre; sensors; 03 medical and health sciences; 0302 clinical medicine; Intervention (counseling); depression; ease of use; efficacy; Internet; sensors; satisfaction; Medicine; 0501 psychology and cognitive sciences; Stress measures; education; Depression (differential diagnoses); Original Research; education.field_of_study; Internet; business.industry; 05 social sciences; satisfaction; Actigraphy; Usability; 030227 psychiatry; 3. Good health; Cognitive behavioral therapy; depression; Physical therapy; Anxiety; Data mining; medicine.symptom; business; computer; ease of use

researchProduct

Deliberation favours social efficiency by making people disregard their relative shares: evidence from USA and India

2017

Groups make decisions on both the production and the distribution of resources. These decisions typically involve a tension between increasing the total level of group resources (i.e. social efficiency) and distributing these resources among group members (i.e. individuals' relative shares). This is the case because the redistribution process may destroy part of the resources, thus resulting in socially inefficient allocations. Here we apply a dual-process approach to understand the cognitive underpinnings of this fundamental tension. We conducted a set of experiments to examine the extent to which different allocation decisions respond to intuition or deliberation. In a newly developed app…

1001; Physics - Physics and Society; Dual-process model; 42; media_common.quotation_subject; Distribution (economics); FOS: Physical sciences; Physics and Society (physics.soc-ph); Social efficiency; computer.software_genre; Time pressure; Corrections; 050105 experimental psychology; dual process models; Microeconomics; intuition; deliberation; Psychology and Cognitive Neuroscience; 0502 economics and business; Production (economics); 0501 psychology and cognitive sciences; 050207 economics; equality; Robustness (economics); Set (psychology); [SHS.ECO] Humanities and Social Sciences/Economics and Finance; lcsh:Science; dual-process models; media_common; Multidisciplinary; Cognitive Reflection Test; business.industry; 05 social sciences; Cognition; 14; Redistribution (cultural anthropology); Deliberation; [SHS.ECO]Humanities and Social Sciences/Economics and Finance; efficiency; Trait; [SHS.GESTION]Humanities and Social Sciences/Business administration; lcsh:Q; Data mining; Psychology; business; [SHS.GESTION] Humanities and Social Sciences/Business administration; computer; Intuition; Research Article

researchProduct

Interpretability of Recurrent Neural Networks in Remote Sensing

2020

In this work we propose the use of Long Short-Term Memory (LSTM) Recurrent Neural Networks for multivariate time series of satellite data for crop yield estimation. Recurrent nets allow exploiting the temporal dimension efficiently, but interpretability is hampered by the typically overparameterized models. The focus of the study is to understand LSTM models by looking at the distribution of the hidden units, the impact of increasing network complexity, and the relative importance of the input covariates. We extracted time series of three variables describing the soil-vegetation status in agro-ecosystems (soil moisture, VOD and EVI) from optical and microwave satellites, as well as available in si…

2. Zero hunger; Multivariate statistics; Network complexity; 010504 meteorology & atmospheric sciences; Computer science; 0211 other engineering and technologies; 02 engineering and technology; 15. Life on land; computer.software_genre; 01 natural sciences; Recurrent neural network; Dimension (vector space); Redundancy (engineering); Relevance (information retrieval); Data mining; Time series; Water content; computer; 021101 geological & geomatics engineering; 0105 earth and related environmental sciences; Interpretability; IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium

researchProduct