Search results for " Probability"

showing 10 items of 2176 documents

The effect of automated taxa identification errors on biological indices

2017

In benthic macroinvertebrate biomonitoring systems, the target is to determine the status of ecosystems based on several biological indices. To increase cost-efficiency, computer-based taxa identification for image data has recently been developed. Taxa identification errors can, however, have strong effects on the indices and thus on the determination of the ecological status. In order to shift the biomonitoring process towards automated expert systems, we need a clear understanding on the bias caused by automation. In this paper, we examine eleven classification methods in the case of macroinvertebrate image data and show how their classification errors propagate into different biological…

Computer science02 engineering and technologycomputer.software_genre01 natural sciencesSimilarity010104 statistics & probabilityArtificial IntelligenceBiomonitoring0202 electrical engineering electronic engineering information engineeringEcosystem0101 mathematicssimilarityta218Invertebrateta112General Engineeringerror propagation [diversity]Computer Science ApplicationssamanlaisuusTaxondiversity: error propagationBenthic zonebiomonitoringidentification020201 artificial intelligence & image processingIdentification (biology)Data miningSpecies richnessclassification errorcomputerExpert Systems with Applications

researchProduct

Optimizing Kernel Ridge Regression for Remote Sensing Problems

2018

Kernel methods have been very successful in remote sensing problems because of their ability to deal with high dimensional non-linear data. However, they are computationally expensive to train when a large amount of samples are used. In this context, while the amount of available remote sensing data has constantly increased, the size of training sets in kernel methods is usually restricted to few thousand samples. In this work, we modified the kernel ridge regression (KRR) training procedure to deal with large scale datasets. In addition, the basis functions in the reproducing kernel Hilbert space are defined as parameters to be also optimized during the training process. This extends the n…

Computer science0211 other engineering and technologiesHyperspectral imagingContext (language use)Basis function02 engineering and technology01 natural sciencesData set010104 statistics & probabilityKernel (linear algebra)Kernel methodKernel (statistics)Radial basis function kernel0101 mathematics021101 geological & geomatics engineeringReproducing kernel Hilbert spaceRemote sensingIGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium

researchProduct

Bayesian inference in Markovian queues

1994

This paper is concerned with the Bayesian analysis of general queues with Poisson input and exponential service times. Joint posterior distribution of the arrival rate and the individual service rate is obtained from a sample consisting inn observations of the interarrival process andm complete service times. Posterior distribution of traffic intensity inM/M/c is also obtained and the statistical analysis of the ergodic condition from a decision point of view is discussed.

Computer scienceBayesian probabilityErgodicityPosterior probabilityManagement Science and Operations ResearchBayesian inferencePoisson distributionComputer Science ApplicationsExponential functionTraffic intensitysymbols.namesakeComputational Theory and MathematicsStatisticssymbolsApplied mathematicsErgodic theoryQueueing Systems

researchProduct

Efficient linear fusion of partial estimators

2018

Abstract Many signal processing applications require performing statistical inference on large datasets, where computational and/or memory restrictions become an issue. In this big data setting, computing an exact global centralized estimator is often either unfeasible or impractical. Hence, several authors have considered distributed inference approaches, where the data are divided among multiple workers (cores, machines or a combination of both). The computations are then performed in parallel and the resulting partial estimators are finally combined to approximate the intractable global estimator. In this paper, we focus on the scenario where no communication exists among the workers, de…

Computer scienceBayesian probabilityInferenceAsymptotic distribution02 engineering and technology01 natural sciences010104 statistics & probability[INFO.INFO-TS]Computer Science [cs]/Signal and Image ProcessingArtificial Intelligence0202 electrical engineering electronic engineering information engineeringStatistical inferenceFusion rules0101 mathematicsElectrical and Electronic EngineeringComputingMilieux_MISCELLANEOUSMinimum mean square errorApplied MathematicsConstrained optimizationEstimator020206 networking & telecommunicationsComputational Theory and MathematicsSignal ProcessingComputer Vision and Pattern RecognitionStatistics Probability and Uncertainty[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processingAlgorithmDigital Signal Processing

researchProduct

Adaptive Importance Sampling: The past, the present, and the future

2017

A fundamental problem in signal processing is the estimation of unknown parameters or functions from noisy observations. Important examples include localization of objects in wireless sensor networks [1] and the Internet of Things [2]; multiple source reconstruction from electroencephalograms [3]; estimation of power spectral density for speech enhancement [4]; or inference in genomic signal processing [5]. Within the Bayesian signal processing framework, these problems are addressed by constructing posterior probability distributions of the unknowns. The posteriors combine optimally all of the information about the unknowns in the observations with the information that is present in their …

Computer scienceBayesian probabilityPosterior probabilityInference02 engineering and technologyMachine learningcomputer.software_genre01 natural sciences010104 statistics & probabilityMultidimensional signal processing[INFO.INFO-TS]Computer Science [cs]/Signal and Image ProcessingPrior probability0202 electrical engineering electronic engineering information engineering0101 mathematicsElectrical and Electronic EngineeringComputingMilieux_MISCELLANEOUSbusiness.industryApplied Mathematics020206 networking & telecommunicationsApproximate inferenceSignal ProcessingProbability distributionArtificial intelligencebusinessAlgorithmcomputer[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processingImportance sampling

researchProduct

A New Simple Computational Method of Simultaneous Constructing and Comparing Confidence Intervals of Shortest Length and Equal Tails for Making Effic…

2021

A confidence interval is a range of values that provides the user with useful information about how accurately a statistic estimates a parameter. In the present paper, a new simple computational method is proposed for simultaneous constructing and comparing confidence intervals of shortest length and equal tails in order to make efficient decisions under parametric uncertainty. This unified computational method provides intervals in several situations that previously required separate analysis using more advanced methods and tables for numerical solutions. In contrast to the Bayesian approach, the proposed approach does not depend on the choice of priors and is a novelty in the theory of st…

Computer scienceBayesian probabilityPrior probabilityProbability distributionQuantile functionPivotal quantityAlgorithmConfidence intervalParametric statisticsQuantile

researchProduct

On the Computation of Symmetrized M-Estimators of Scatter

2016

This paper focuses on the computational aspects of symmetrized Mestimators of scatter, i.e. the multivariate M-estimators of scatter computed on the pairwise differences of the data. Such estimators do not require a location estimate, and more importantly, they possess the important block and joint independence properties. These properties are needed, for example, when solving the independent component analysis problem. Classical and recently developed algorithms for computing the M-estimators and the symmetrized M-estimators are discussed. The effect of parallelization is considered as well as new computational approach based on using only a subset of pairwise differences. Efficiencies and…

Computer scienceComputation05 social sciencesEstimatorMultivariate normal distributionM-estimators01 natural sciencesIndependent component analysisscatter010104 statistics & probabilityScatter matrix0502 economics and businessPairwise comparison0101 mathematicsAlgorithmIndependence (probability theory)050205 econometrics Block (data storage)

researchProduct

A Bayesian unified framework for risk estimation and cluster identification in small area health data analysis.

2020

Many statistical models have been proposed to analyse small area disease data with the aim of describing spatial variation in disease risk. In this paper, we propose a Bayesian hierarchical model that simultaneously allows for risk estimation and cluster identification. Our model formulation assumes that there is an unknown number of risk classes and small areas are assigned to a risk class by means of independent allocation variables. Therefore, areas within each cluster are assumed to share a common risk but they may be geographically separated. The posterior distribution of the parameter representing the number of risk classes is estimated using a novel procedure that combines its prior …

Computer scienceEpidemiologyPathology and Laboratory Medicine01 natural sciencesGeographical locations010104 statistics & probabilityChickenpoxMathematical and Statistical TechniquesStatisticsMedicine and Health SciencesPublic and Occupational Health0303 health sciencesMultidisciplinarySimulation and ModelingQREuropeIdentification (information)Medical MicrobiologySmall-Area AnalysisViral PathogensVirusesPhysical SciencesMedicinePathogensAlgorithmsResearch ArticleHerpesvirusesScienceBayesian probabilityPosterior probabilityBayesian MethodDisease SurveillanceDisease clusterResearch and Analysis MethodsRisk AssessmentMicrobiologyVaricella Zoster Virus03 medical and health sciencesRisk classPrior probabilityCovariateBayesian hierarchical modelingHumansEuropean Union0101 mathematicsMicrobial Pathogens030304 developmental biologyBiology and life sciencesOrganismsStatistical modelBayes TheoremProbability TheoryProbability DistributionMarginal likelihoodConvolutionSpainPeople and placesDNA virusesMathematical FunctionsMathematicsPloS one

researchProduct

Register data in sample allocations for small-area estimation

2018

The inadequate control of sample sizes in surveys using stratified sampling and area estimation may occur when the overall sample size is small or auxiliary information is insufficiently used. Very small sample sizes are possible for some areas. The proposed allocation based on multi-objective optimization uses a small-area model and estimation method and semi-collected empirical data annually collected empirical data. The assessment of its performance at the area and at the population levels is based on design-based sample simulations. Five previously developed allocations serve as references. The model-based estimator is more accurate than the design-based Horvitz–Thompson estimator and t…

Computer scienceGeneral MathematicsGeography Planning and DevelopmentPopulationSample (statistics)01 natural sciences010104 statistics & probabilitySmall area estimationmodel-based EBLUP0502 economics and businessSampling designStatisticsrekisteritotanta0101 mathematicseducation050205 econometrics DemographyEstimationta113education.field_of_studyta112kaupparekisteritauxiliary and proxy data05 social sciencesEstimatortrade-off between areas and populationmonitavoiteoptimointiStratified samplingkohdentaminenmulti-objective optimizationSample size determinationGeneral Agricultural and Biological SciencesperformanceMathematical Population Studies

researchProduct

Adaptive Population Importance Samplers: A General Perspective

2016

Importance sampling (IS) is a well-known Monte Carlo method, widely used to approximate a distribution of interest using a random measure composed of a set of weighted samples generated from another proposal density. Since the performance of the algorithm depends on the mismatch between the target and the proposal densities, a set of proposals is often iteratively adapted in order to reduce the variance of the resulting estimator. In this paper, we review several well-known adaptive population importance samplers, providing a unified common framework and classifying them according to the nature of their estimation and adaptive procedures. Furthermore, we interpret the underlying motivation …

Computer scienceMatemáticasMonte Carlo methodPopulation02 engineering and technologyMachine learningcomputer.software_genre01 natural sciences010104 statistics & probability[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing0202 electrical engineering electronic engineering information engineering0101 mathematicseducationComputingMilieux_MISCELLANEOUSeducation.field_of_studybusiness.industryEstimator020206 networking & telecommunicationsStatistical classificationRandom measureMonte Carlo integrationData miningArtificial intelligencebusinessParticle filtercomputer[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processingImportance sampling

researchProduct