Search results for "missing"

showing 10 items of 174 documents

Regression with Imputed Covariates: A Generalized Missing Indicator Approach

2011

A common problem in applied regression analysis is that covariate values may be missing for some observations but imputed values may be available. This situation generates a trade-off between bias and precision: the complete cases are often disarmingly few, but replacing the missing observations with the imputed values to gain precision may lead to bias. In this paper we formalize this trade-off by showing that one can augment the regression model with a set of auxiliary variables so as to obtain, under weak assumptions about the imputations, the same unbiased estimator of the parameters of interest as complete-case analysis. Given this augmented model, the bias-precision trade-off may then…

Set (abstract data type)Reduction (complexity)Relation (database)Bias of an estimatorStatisticsCovariateSettore SECS-P/05 - EconometriaStatistics::MethodologyRegression analysisMissing dataRegressionMathematicsSSRN Electronic Journal

researchProduct

WEIGHTS AND IMPUTATIONS

2019

This chapter provides a description of the weighting and imputation strategies used to address problems of unit nonresponse, sample attrition and item nonresponse in the seventh wave of SHARE.

Settore SECS-P/05 - EconometriaWeights Imputations nonresponse errors attrition missing data.

researchProduct

A Generalized Missing-Indicator Approach to Regression with Imputed Covariates

2011

We consider estimation of a linear regression model using data where some covariate values are missing but imputations are available to fill in the missing values. This situation generates a tradeoff between bias and precision when estimating the regression parameters of interest. Using only the subsample of complete observations does not cause bias but may imply a substantial loss of precision because the complete cases may be too few. On the other hand, filling in the missing values with imputations may cause bias. We provide the new Stata command gmi, which handles such tradeoff by using either model reduction or Bayesian model averaging techniques in the context of the generalized miss…

Settore SECS-P/05Computer scienceSettore SECS-P/05 - EconometriaMissing dataBayesian inferenceRegressiongmi missing covariates imputation bias–precision tradeoff model reduction model averagingMathematics (miscellaneous)CovariateLinear regressionStatisticsEconometricsStatistics::MethodologyImputation (statistics)Settore SECS-P/01 - Economia PoliticaThe Stata Journal: Promoting communications on statistics and Stata

researchProduct

EOFs for gap filling in multivariate air quality data: a FDA approach

2010

Missing values are a common concern in spatiotemporal data sets. During recent years a great number of methods have been developed for gap filling. One of the emerging approaches is based on the Empirical Orthogonal Function (EOF) methodology, applied mainly on raw and univariate data sets presenting irregular missing patterns. In this paper EOF is carried out on a multivariate space-time data set, related to concentrations of pollutants recorded at different sites, after denoising raw data by FDA approach. Some performance indicators are computed on simulated incomplete data sets with also long gaps in order to show that the EOF reconstruction appears to be an improved procedure especially…

Settore SECS-S/01 - StatisticaFDA EOF missing data gap filling

researchProduct

Air quality and integration of short-term and long-term pollutant data

2008

Modelling PM10 is an important problem in statistical methodology, above all to explain the PM10 behaviour in space and time, since it has been linked to many adverse effects on human and environmental health. But the large spatial variability of the main traffic-related pollutants, and in particular here the PM10, implies the impossibility of obtaining from the data of the fixed stations a complete pictures of the atmospheric pollution in the urban areas. Information from fixed monitoring stations (long-term measurements) are therefore integrated with the ones deriving from mobile station (short-term measurements). Short-term measurements are incomplete and so it is necessary to integrate …

Settore SECS-S/01 - StatisticaPollution short-term series PM10 missing values single imputation method

researchProduct

Physics-aware Gaussian processes in remote sensing

2018

Abstract Earth observation from satellite sensory data poses challenging problems, where machine learning is currently a key player. In recent years, Gaussian Process (GP) regression has excelled in biophysical parameter estimation tasks from airborne and satellite observations. GP regression is based on solid Bayesian statistics, and generally yields efficient and accurate parameter estimates. However, GPs are typically used for inverse modeling based on concurrent observations and in situ measurements only. Very often a forward model encoding the well-understood physical relations between the state vector and the radiance observations is available though and could be useful to improve pre…

Signal Processing (eess.SP)FOS: Computer and information sciences010504 meteorology & atmospheric sciences0211 other engineering and technologies02 engineering and technologyStatistics - Applications01 natural sciencessymbols.namesakeFOS: Electrical engineering electronic engineering information engineeringApplications (stat.AP)Electrical Engineering and Systems Science - Signal ProcessingGaussian processGaussian process emulator021101 geological & geomatics engineering0105 earth and related environmental sciencesbusiness.industryEstimation theoryBayesian optimizationState vectorMissing dataBayesian statisticssymbolsGlobal Positioning SystembusinessAlgorithmSoftwareApplied Soft Computing

researchProduct

Measurements of Higgs boson production and couplings in diboson final states with the ATLAS detector at the LHC

2013

We acknowledge the support of ANPCyT, Argentina; YerPhI, Armenia; ARC, Australia; BMWF and FWF, Austria; ANAS, Azerbaijan; SSTC, Belarus; CNPq and FAPESP, Brazil; NSERC, NRC and CFI, Canada; CERN; CONICYT, Chile; CAS, MOST and NSFC, China; COLCIENCIAS, Colombia; MSMT CR, MPO CR and VSC CR, Czech Republic; DNRF, DNSRC and Lundbeck Foundation, Denmark; EPLANET, ERC and NSRF, European Union; IN2P3-CNRS, CEA-DSM/IRFU, France; GNSF, Georgia; BMBF, DFG, HGF, MPG and AvH Foundation, Germany; GSRT and NSRF, Greece; ISF, MINERVA, GIF, DIP and Benoziyo Center, Israel; INFN, Italy; MEXT and JSPS, Japan; CNRST, Morocco; FOM and NWO, Netherlands; BRF and RCN, Norway; MNiSW, Poland; GRICES and FCT, Portu…

Standard Modeldilepton: mass spectrumCiencias Físicas01 natural sciences7. Clean energySettore FIS/04 - Fisica Nucleare e SubnucleareHigh Energy Physics - ExperimentHiggs particle: hadroproduction//purl.org/becyt/ford/1 [https]High Energy Physics - Experiment (hep-ex)vector boson: fusion[PHYS.HEXP]Physics [physics]/High Energy Physics - Experiment [hep-ex]QCBosonPhysicsHIGGS BOSONLarge Hadron Collidervector boson: pair productiontransverse energy: missing-energy4. EducationATLAS experimentSettore FIS/01 - Fisica SperimentaleATLAS3. Good healthMassless particleCERN LHC CollHiggs particle: massPhysical SciencesComputingMethodologies_DOCUMENTANDTEXTPROCESSINGHiggs boson7000: 8000 GeV-cmsFísica nuclearAtlasLhcNeutrinoHiggs particle: decay modesParticle Physics - ExperimentCIENCIAS NATURALES Y EXACTASp p: scatteringNuclear and High Energy PhysicsParticle physicsmass spectrum: (4lepton)530 PhysicsCiências Naturais::Ciências Físicas:Ciências Físicas [Ciências Naturais]FOS: Physical sciencesddc:500.2ATLASdetector; LHC; Higgsbosonproduction; diboson530Massless ParticlesNnlo QCDNuclear physics0103 physical sciencesFysikddc:530High Energy Physics010306 general physicsTransverse-MomentumCondensed Matter::Quantum GasesHiggs particle: couplingScience & Technologyhep-ex010308 nuclear & particles physicsHigh Energy Physics::PhenomenologyFísicaQCD CorrectionsFermion//purl.org/becyt/ford/1.3 [https]Hadron CollidersDiboson ProductionAstronomíavector boson: leptonic decayHADRON-HADRON COLLISIONSProton-Proton CollisionsRoot-S=7 TevHiggs particle: hadroproduction ; Higgs particle: coupling ; vector boson: fusion ; p p: scattering ; CERN LHC Coll ; ATLAS ; Higgs particle: decay modes ; vector boson: pair production ; vector boson: leptonic decay ; mass spectrum: two-photon ; mass spectrum: (4lepton) ; dilepton: mass spectrum ; transverse energy: missing-energy ; Higgs particle: mass ; experimental results ; 7000: 8000 GeV-cmsExperimental High Energy PhysicsHigh Energy Physics::ExperimentCross-Sectionsmass spectrum: two-photonexperimental resultsLeptonBroken Symmetries

researchProduct

Estimating person parameters via item response model and simple sum score in small samples with few polytomous items: A simulation study

2018

Background The Item Response Theory (IRT) is becoming increasingly popular for item analysis. Theoretical considerations and simulation studies suggest that parameter estimates will become precise only by utilizing many items in large samples. Method A simulation study focusing on a single scale was performed on data with (a) n = 40, 60, 80, 120, 200, 300, 500, and 900 cases utilizing (b) 4, 8, 16, or 32 items. The items were (c) symmetrically distributed vs. skew (skewness 0, 1, and 2). Item loadings were (d) homogeneous vs. heterogeneous. Item loadings were (e) low vs. high. Half of the items had (f) a correlated error or not. The number of answering categories (g) was four vs. five. A to…

Statistics and ProbabilityAnalysis of VarianceScale (ratio)EpidemiologyItem analysisSkewPolytomous Rasch modelMissing data01 natural sciences010104 statistics & probability03 medical and health sciences0302 clinical medicineSimple (abstract algebra)SkewnessSample SizeStatisticsItem response theoryHumansRegression AnalysisComputer Simulation030212 general & internal medicine0101 mathematicsCorrelation of DataMathematicsStatistics in Medicine

researchProduct

Forecasting time series with missing data using Holt's model

2009

This paper deals with the prediction of time series with missing data using an alternative formulation for Holt's model with additive errors. This formulation simplifies both the calculus of maximum likelihood estimators of all the unknowns in the model and the calculus of point forecasts. In the presence of missing data, the EM algorithm is used to obtain maximum likelihood estimates and point forecasts. Based on this application we propose a leave-one-out algorithm for the data transformation selection problem which allows us to analyse Holt's model with multiplicative errors. Some numerical results show the performance of these procedures for obtaining robust forecasts.

Statistics and ProbabilityApplied MathematicsAutocorrelationExponential smoothingLinear modelData transformation (statistics)EstimatorMissing dataExpectation–maximization algorithmStatisticsStatistics Probability and UncertaintyAdditive modelAlgorithmMathematicsJournal of Statistical Planning and Inference

researchProduct

Correcting for non-ignorable missingness in smoking trends

2015

Data missing not at random (MNAR) is a major challenge in survey sampling. We propose an approach based on registry data to deal with non-ignorable missingness in health examination surveys. The approach relies on follow-up data available from administrative registers several years after the survey. For illustration we use data on smoking prevalence in Finnish National FINRISK study conducted in 1972-1997. The data consist of measured survey information including missingness indicators, register-based background information and register-based time-to-disease survival data. The parameters of missingness mechanism are estimable with these data although the original survey data are MNAR. The u…

Statistics and ProbabilityBackground informationFOS: Computer and information sciencesta112Test data generationComputer scienceSurvey samplingnon-participationta3142Smoking prevalenceBayesian inferenceMissing dataStatistics - Applicationsregistry dataMethodology (stat.ME)missing dataStatisticsSurvey data collectionRegistry dataApplications (stat.AP)Statistics Probability and Uncertaintysurvey samplingStatistics - Methodologysmoking prevalencehealth examination survey

researchProduct