Search results for "Information Systems"

showing 10 items of 1926 documents

Pruning Incremental Linear Model Trees with Approximate Lookahead

2014

Incremental linear model trees with approximate lookahead are fast, but produce overly large trees. This is due to non-optimal splitting decisions boosted by a possibly unlimited number of examples obtained from a data source. To keep the processing speed high and the tree complexity low, appropriate incremental pruning techniques are needed. In this paper, we introduce a pruning technique for the class of incremental linear model trees with approximate lookahead on stationary data sources. Experimental results show that the advantage of approximate lookahead in terms of processing speed can be further improved by producing much smaller and consequently more explanatory, less memory consumi…

Stationary processComputational Theory and MathematicsComputer scienceLinear modelPruning (decision trees)AlgorithmTree (graph theory)Computer Science ApplicationsInformation SystemsData modelingIEEE Transactions on Knowledge and Data Engineering
researchProduct

Ranking coherence in topic models using statistically validated networks

2023

Probabilistic topic models have become one of the most widespread machine learning techniques in textual analysis. Topic discovering is an unsupervised process that does not guarantee the interpretability of its output. Hence, the automatic evaluation of topic coherence has attracted the interest of many researchers over the last decade, and it is an open research area. This article offers a new quality evaluation method based on statistically validated networks (SVNs). The proposed probabilistic approach consists of representing each topic as a weighted network of its most probable words. The presence of a link between each pair of words is assessed by statistically validating their co-oc…

Statistically Validated NetworksTopic coherenceText MiningProbabilistic Topic modelLibrary and Information SciencesInformation SystemsJournal of Information Science
researchProduct

The Psychological Science Accelerator’s COVID-19 rapid-response dataset

2023

Funder: Amazon Web Services (AWS) Imagine Grant

Statistics and Probability223 participants with varying completion rates. Participants completed the survey from 111 geopolitical regions in 44 unique languages/dialects. The anonymized dataset described here is provided in both raw and processed formats to facilitate re-use and further analyses. The dataset offers secondary analytic opportunities to explore copingBF Psychology230 Affective NeuroscienceHealth Behaviorand demographic information for each participant. Each participant started the study with the same general questions and then was randomized to complete either one longer experiment or two shorter experiments. Data were provided by 73Message framingDiseasesLibrary and Information Sciences:Ciências Sociais::Psicologia [Domínio/Área Científica]geographical and cultural context characterizationHV Social pathology. Social and public welfare. CriminologypandemiatEducationa general questionnaire examining health prevention behaviors and COVID-19 experienceddc:150SDG 3 - Good Health and Well-beingRA0421 Public health. Hygiene. Preventive MedicineSurveys and QuestionnairesAdaptation PsychologicalyleiskartoituksetHumansPendienteHealth behaviorsPandemicsframingBehaviour Change and Well-beingEmotion regulationSelf-determination messagingand self-determination across a diverseCOVID-19kansainvälinen vertailuResearch dataComputer Science Applicationswhich can be merged with other time-sampled or geographic data.cognitive reappraisalsglobal sample obtained at the onset of the COVID-19 pandemicterveyskäyttäytyminenIn response to the COVID-19 pandemic/dk/atira/pure/sustainabledevelopmentgoals/good_health_and_well_beingand autonomy framing manipulations on behavioral intentions and affective measures. The data collected (April to October 2020) included specific measures for each experimental studyStatistics Probability and UncertaintyPeople’s healthtutkimusaineistosurvey-tutkimusDatasetInformation Systemsthe Psychological Science Accelerator coordinated three large-scale psychological studies to examine the effects of loss-gain framing
researchProduct

A multi-local optimization algorithm

1998

The development of efficient algorithms that provide all the local minima of a function is crucial to solve certain subproblems in many optimization methods. A “multi-local” optimization procedure using inexact line searches is presented, and numerical experiments are also reported. An application of the method to a semi-infinite programming procedure is included.

Statistics and ProbabilityContinuous optimizationMathematical optimizationInformation Systems and ManagementMeta-optimizationManagement Science and Operations ResearchSemi-infinite programmingMaxima and minimaVector optimizationModeling and SimulationDiscrete Mathematics and CombinatoricsRandom optimizationMulti-swarm optimizationAlgorithmMetaheuristicMathematicsTop
researchProduct

The Serial Property and Restricted Balanced Contributions in discrete cost sharing problems

2006

We show that the Serial Poperty and Restricted Balanced Contributions characterize the subsidy-free serial cost sharing method (Moulin (1995)) in discrete cost allocation problems.

Statistics and ProbabilityCost allocationMathematical optimizationInformation Systems and ManagementProperty (philosophy)Computer scienceModeling and SimulationMoulinDiscrete Mathematics and CombinatoricsCost sharingManagement Science and Operations ResearchShapley valueTOP
researchProduct

On the analysis of a random walk-jump chain with tree-based transitions and its applications to faulty dichotomous search

2018

Random Walks (RWs) have been extensively studied for more than a century [1]. These walks have traditionally been on a line, and the generalizations for two and three dimensions, have been by extending the random steps to the corresponding neighboring positions in one or many of the dimensions. Among the most popular RWs on a line are the various models for birth and death processes, renewal processes and the gambler’s ruin problem. All of these RWs operate “on a discretized line”, and the walk is achieved by performing small steps to the current-state’s neighbor states. Indeed, it is this neighbor-step motion that renders their analyses tractable. When some of the transitions are to non-ne…

Statistics and ProbabilityCurrent (mathematics)Learning systemsRandom walk jumpsDichotomous searches02 engineering and technologyState (functional analysis)Random walkTime reversibilityBirth–death process020202 computer hardware & architectureChain (algebraic topology)020204 information systemsModeling and SimulationLine (geometry)Controlled random walks0202 electrical engineering electronic engineering information engineeringJumpStatistical physicsTime reversibilitiesMathematics
researchProduct

WOODIV, a database of occurrences, functional traits, and phylogenetic data for all Euro-Mediterranean trees

2021

Trees play a key role in the structure and function of many ecosystems worldwide. In the Mediterranean Basin, forests cover approximately 22% of the total land area hosting a large number of endemics (46 species). Despite its particularities and vulnerability, the biodiversity of Mediterranean trees is not well known at the taxonomic, spatial, functional, and genetic levels required for conservation applications. The WOODIV database fills this gap by providing reliable occurrences, four functional traits (plant height, seed mass, wood density, and specific leaf area), and sequences from three DNA-regions (rbcL, matK, and trnH-psbA), together with modelled occurrences and a phylogeny for all…

Statistics and ProbabilityData DescriptorDatabases FactualMediterranean RegionConservation biologySettore BIO/02 - Botanica SistematicaScienceQBiodiversityLibrary and Information SciencesTreesComputer Science ApplicationsEducationBiogeographySettore BIO/03 - Botanica Ambientale E Applicata[SDE]Environmental SciencesForestCommunity ecologyStatistics Probability and UncertaintyForest ecologyEcosystemPhylogenyInformation Systems
researchProduct

Spanish electoral archive. SEA database

2021

This paper introduces the SEA database (acronym for Spanish Electoral Archive). SEA brings together the most complete public repository available to date on Spanish election outcomes. SEA holds all the results recorded from the electoral processes of General (1979–2019), Regional (1989–2021), Local (1979–2019) and European Parliamentary (1987–2019) elections held in Spain since the restoration of democracy in the late 70 s, in addition to other data sets with electoral content. The data are offered for free and is presented in a homogeneous and friendly format. Most of the databases are available for download with data from various electoral levels, including from the ballot box level. This…

Statistics and ProbabilityData DescriptorHistoryDownloadSciencemedia_common.quotation_subject0211 other engineering and technologiesInference02 engineering and technologyLibrary and Information Sciencescomputer.software_genre01 natural sciencesEducation010104 statistics & probabilitySociologyVotingPolitical scienceAcronymSociety0101 mathematicsmedia_commonDatabaseQPolitics021107 urban & regional planningTurnoutDemocracyComputer Science ApplicationsMetadataBallotGovernmentEconomia Mètodes estadísticsStatistics Probability and UncertaintycomputerInformation SystemsScientific Data
researchProduct

A database for the monitoring of thermal anomalies over the Amazon forest and adjacent intertropical oceans

2015

AbstractAdvances in information technologies and accessibility to climate and satellite data in recent years have favored the development of web-based tools with user-friendly interfaces in order to facilitate the dissemination of geo/biophysical products. These products are useful for the analysis of the impact of global warming over different biomes. In particular, the study of the Amazon forest responses to drought have recently received attention by the scientific community due to the occurrence of two extreme droughts and sustained warming over the last decade. Thermal Amazoni@ is a web-based platform for the visualization and download of surface thermal anomalies products over the Ama…

Statistics and ProbabilityData DescriptorRainforestDatabases FactualDownloadOceans and SeasBiomeRainforestLibrary and Information SciencesGlobal WarmingEducationEffects of global warmingServerBaseline (configuration management)Global warmingTropical ecologyComputer Science ApplicationsOceanographyClimatologyEnvironmental scienceSatelliteForest ecologyStatistics Probability and UncertaintyClimate-change impactsSoftwareInformation SystemsScientific Data
researchProduct

Galaxy LIMS for next-generation sequencing.

2013

Abstract Summary: We have developed a laboratory information management system (LIMS) for a next-generation sequencing (NGS) laboratory within the existing Galaxy platform. The system provides lab technicians standard and customizable sample information forms, barcoded submission forms, tracking of input sample quality, multiplex-capable automatic flow cell design and automatically generated sample sheets to aid physical flow cell preparation. In addition, the platform provides the researcher with a user-friendly interface to create a request, submit accompanying samples, upload sample quality measurements and access to the sequencing results. As the LIMS is within the Galaxy platform, the …

Statistics and ProbabilityDatabasebusiness.industryComputer scienceSample (material)Interface (computing)High-Throughput Nucleotide Sequencingcomputer.software_genreBiochemistryDNA sequencingComputer Science ApplicationsWorkflowWorld Wide WebComputational MathematicsUser-Computer InterfaceSoftwareComputational Theory and MathematicsbusinessMolecular BiologycomputerSoftwareInformation SystemsBioinformatics (Oxford, England)
researchProduct