Search results for "sampling"

showing 10 items of 788 documents

On the stability and ergodicity of adaptive scaling Metropolis algorithms

2011

The stability and ergodicity properties of two adaptive random walk Metropolis algorithms are considered. The both algorithms adjust the scaling of the proposal distribution continuously based on the observed acceptance probability. Unlike the previously proposed forms of the algorithms, the adapted scaling parameter is not constrained within a predefined compact interval. The first algorithm is based on scale adaptation only, while the second one incorporates also covariance adaptation. A strong law of large numbers is shown to hold assuming that the target density is smooth enough and has either compact support or super-exponentially decaying tails.

Statistics and ProbabilityStochastic approximationMathematics - Statistics TheoryStatistics Theory (math.ST)Law of large numbersMultiple-try Metropolis01 natural sciencesStability (probability)010104 statistics & probabilityModelling and Simulation65C40 60J27 93E15 93E35Adaptive Markov chain Monte CarloFOS: Mathematics0101 mathematicsScalingMetropolis algorithmMathematicsta112Applied Mathematics010102 general mathematicsRejection samplingErgodicityProbability (math.PR)ta111CovarianceRandom walkMetropolis–Hastings algorithmModeling and SimulationAlgorithmStabilityMathematics - ProbabilityStochastic Processes and their Applications
researchProduct

Asymptotic efficiency of the calibration estimator in a high-dimensional data setting

2022

Abstract In a finite population sampling survey, auxiliary information is commonly used to improve the Horvitz-Thompson estimators and calibration has been extensively used by national statistical agencies over the last decades for that purpose. This method enables to make estimators consistent with known totals of auxiliary variables and to reduce variance if the calibration variables are explanatory for the variable of interest. Nowadays, it is not unusual anymore to have high-dimensional auxiliary data sets and adding too much additional calibration variables may increase the variance of calibration estimators. We study in this paper the asymptotic efficiency of the calibration estimator…

Statistics and ProbabilityVariance inflation factorAuxiliary variablesVariable (computer science)Calibration (statistics)Applied MathematicsStatisticsEstimatorVariance (accounting)Statistics Probability and UncertaintyPopulation samplingMathematicsJournal of Statistical Planning and Inference
researchProduct

Estimation of total electricity consumption curves by sampling in a finite population when some trajectories are partially unobserved

2019

International audience; Millions of smart meters that are able to collect individual load curves, that is, electricity consumption time series, of residential and business customers at fine scale time grids are now deployed by electricity companies all around the world. It may be complex and costly to transmit and exploit such a large quantity of information, therefore it can be relevant to use survey sampling techniques to estimate mean load curves of specific groups of customers. Data collection, like every mass process, may undergo technical problems at every point of the metering and collection chain resulting in missing values. We consider imputation approaches (linear interpolation, k…

Statistics and Probabilityconstructionkernel smoothingPopulationSurvey samplingimputation01 natural sciences010104 statistics & probability[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]0502 economics and businessStatisticsImputation (statistics)0101 mathematicseducationsurvey samplingfunctional data050205 econometrics Mathematicsconfidence bandsConsumption (economics)Estimationeducation.field_of_studymissing completely at randombusiness.industry05 social sciencesprincipal analysis by conditional estimationSampling (statistics)[MATH.MATH-PR]Mathematics [math]/Probability [math.PR]nearest neighboursKernel smoothervariance-estimationElectricityStatistics Probability and Uncertaintybusinessvariance approximation
researchProduct

A Bayesian comparison of cluster, strata, and random samples

1999

When sampling from finite populations, simple random sampling (SRS) is rarely used in practice, due to either high cost or information to be gained from more efficient designs. Bayesian hierarchical models are a natural framework to model the non-randomness in the sample. This paper concentrates on the effects that the design has on inference about characteristics of the finite population, and makes a critical comparison among some common designs.

Statistics and Probabilityeducation.field_of_studyApplied MathematicsBayesian probabilityPopulationSampling (statistics)Sample (statistics)Simple random sampleStratified samplingsymbols.namesakeStatisticssymbolsCluster samplingStatistics Probability and UncertaintyeducationMathematicsGibbs samplingJournal of Statistical Planning and Inference
researchProduct

Using Complex Surveys to Estimate theL1-Median of a Functional Variable: Application to Electricity Load Curves

2012

Mean proles are widely used as indicators of the electricity consumption habits of customers. Currently, Electricit e De France (EDF), estimates class load proles by using point-wise mean function. Unfortunately, it is well known that the mean is highly sensitive to the presence of outliers, such as one or more consumers with unusually high-levels of consumption. In this paper, we propose an alternative to the mean prole: the L1-median prole which is more robust. When dealing with large datasets of functional data (load curves for example), survey sampling approaches are useful for estimating the median prole and avoid storing all of the data. We propose here estimators of the median trajec…

Statistics and Probabilityeducation.field_of_studyComputer sciencePopulationEstimatorSurvey samplingSampling (statistics)Simple random sampleStratified samplingHorvitz–Thompson estimatorOutlierStatisticsStatistics Probability and UncertaintyeducationInternational Statistical Review
researchProduct

A Random Field Approach to Transect Counts of Wildlife Populations

1991

Line transect counting of a wildlife population is considered a sampling from a planar marked point process, where the marks describe the detectability of the animals. Sampling properties of transect counts and a new density estimator are derived from a counting process, which is a shot-noise field induced by the marked point process. A general formula for the sampling variance of a transect is derived and applied to compare five common types of transects. Some stereological connections of transect sampling and density estimators are shown.

Statistics and Probabilityeducation.field_of_studyRandom fieldCounting processCovariance functionPopulationSampling (statistics)EstimatorGeneral MedicineDensity estimationStatisticsStatistics Probability and UncertaintyeducationTransectMathematicsBiometrical Journal
researchProduct

On implementation of the Gibbs sampler for estimating the accuracy of multiple diagnostic tests

2010

Implementation of the Gibbs sampler for estimating the accuracy of multiple binary diagnostic tests in one population has been investigated. This method, proposed by Joseph, Gyorkos and Coupal, makes use of a Bayesian approach and is used in the absence of a gold standard to estimate the prevalence, the sensitivity and specificity of medical diagnostic tests. The expressions that allow this method to be implemented for an arbitrary number of tests are given. By using the convergence diagnostics procedure of Raftery and Lewis, the relation between the number of iterations of Gibbs sampling and the precision of the estimated quantiles of the posterior distributions is derived. An example conc…

Statistics and Probabilityeducation.field_of_studygastroesophageal reflux diseaseBayesian probabilityPopulationGold standard (test)Settore FIS/03 - Fisica Della MateriaGibbs sampler; Bayesian analysis; convergence diagnostics; diagnostic tests; gastroesophageal reflux diseaseSettore MED/01 - Statistica MedicaData setsymbols.namesakediagnostic testGibbs samplerConvergence (routing)Statisticsconvergence diagnosticsymbolsSensitivity (control systems)Statistics Probability and UncertaintyeducationAlgorithmBayesian analysiQuantileMathematicsGibbs samplingJournal of Applied Statistics
researchProduct

Bayesian Modeling of Sequential Discoveries

2022

We aim at modelling the appearance of distinct tags in a sequence of labelled objects. Common examples of this type of data include words in a corpus or distinct species in a sample. These sequential discoveries are often summarised via accumulation curves, which count the number of distinct entities observed in an increasingly large set of objects. We propose a novel Bayesian method for species sampling modelling by directly specifying the probability of a new discovery, therefore allowing for flexible specifications. The asymptotic behavior and finite sample properties of such an approach are extensively studied. Interestingly, our enlarged class of sequential processes includes highly tr…

Statistics and Probabilitylajistokartoitusspecies sampling modelslogistic regressionbayesilainen menetelmäaccumulation curvesotantaStatistics Probability and Uncertaintydirichlet processtilastolliset mallitpoisson-binomial distribution
researchProduct

Contributed discussion on article by Pratola

2016

The author should be commended for his outstanding contribution to the literature on Bayesian regression tree models. The author introduces three innovative sampling approaches which allow for efficient traversal of the model space. In this response, we add a fourth alternative.

Statistics and Probabilitymodel selectionMarkov Chain Monte Carlo (MCMC)Bayesian regression treeComputer scienceBig dataBayesian regression tree (BRT) modelsComputingMilieux_LEGALASPECTSOFCOMPUTINGbirth–death processMachine learningcomputer.software_genreSequential Monte Carlo methods01 natural sciencespopulation Markov chain Monte Carlo010104 statistics & probabilitysymbols.namesakebig data0502 economics and businessBayesian Regression Trees (BART)0101 mathematics050205 econometrics Bayesian treed regressionMultiple Try Metropolis algorithmsINFERÊNCIA ESTATÍSTICAbusiness.industryApplied MathematicsModel selection05 social sciencesRejection samplingData scienceVariable-order Bayesian networkTree (data structure)Tree traversalMarkov chain Monte Carlocontinuous time Markov processsymbolsArtificial intelligencebusinessBayesian linear regressioncommunication-freecomputerGibbs samplingBayesian Analysis
researchProduct

Systematic handling of missing data in complex study designs : experiences from the Health 2000 and 2011 Surveys

2016

We present a systematic approach to the practical and comprehensive handling of missing data motivated by our experiences of analyzing longitudinal survey data. We consider the Health 2000 and 2011 Surveys (BRIF8901) where increased non-response and non-participation from 2000 to 2011 was a major issue. The model assumptions involved in the complex sampling design, repeated measurements design, non-participation mechanisms and associations are presented graphically using methodology previously defined as a causal model with design, i.e. a functional causal model extended with the study design. This tool forces the statistician to make the study design and the missing-data mechanism explicit…

Statistics and Probabilitymultiple imputationComputer sciencecomputer.software_genre01 natural sciences010104 statistics & probability03 medical and health sciences0302 clinical medicinenon-responseSampling design030212 general & internal medicine0101 mathematicsCausal modelta112Clinical study designInverse probability weightingSampling (statistics)non-participationMissing dataData sciencedoubly robust methodsSurvey data collectionData miningStatistics Probability and Uncertaintycomputerinverse probability weightingStatisticiancausal model with designJournal of Applied Statistics
researchProduct