Search results for "IMPUTATION"

showing 7 items of 57 documents

Bayesian models for data missing not at random in health examination surveys

2018

In epidemiological surveys, data missing not at random (MNAR) due to survey nonresponse may potentially lead to a bias in the risk factor estimates. We propose an approach based on Bayesian data augmentation and survival modelling to reduce the nonresponse bias. The approach requires additional information based on follow-up data. We present a case study of smoking prevalence using FINRISK data collected between 1972 and 2007 with a follow-up to the end of 2012 and compare it to other commonly applied missing at random (MAR) imputation approaches. A simulation experiment is carried out to study the validity of the approaches. Our approach appears to reduce the nonresponse bias substantially…

Statistics and ProbabilityFOS: Computer and information sciencesmedicine.medical_specialtymultiple imputationComputer scienceBayesian probability01 natural sciencesStatistics - Applicationssurvival analysisfollow-up dataMethodology (stat.ME)010104 statistics & probability03 medical and health sciencesHealth examination0302 clinical medicineEpidemiologyStatisticsmedicineApplications (stat.AP)030212 general & internal medicine0101 mathematicsSurvival analysisStatistics - MethodologyBayes estimatorta112elinaika-analyysiRisk factor (computing)Bayesian estimation3. Good healthhealth examination surveysStatistics Probability and UncertaintyMissing not at randomdata augmentation
researchProduct

Estimation of total electricity consumption curves by sampling in a finite population when some trajectories are partially unobserved

2019

International audience; Millions of smart meters that are able to collect individual load curves, that is, electricity consumption time series, of residential and business customers at fine scale time grids are now deployed by electricity companies all around the world. It may be complex and costly to transmit and exploit such a large quantity of information, therefore it can be relevant to use survey sampling techniques to estimate mean load curves of specific groups of customers. Data collection, like every mass process, may undergo technical problems at every point of the metering and collection chain resulting in missing values. We consider imputation approaches (linear interpolation, k…

Statistics and Probabilityconstructionkernel smoothingPopulationSurvey samplingimputation01 natural sciences010104 statistics & probability[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]0502 economics and businessStatisticsImputation (statistics)0101 mathematicseducationsurvey samplingfunctional data050205 econometrics Mathematicsconfidence bandsConsumption (economics)Estimationeducation.field_of_studymissing completely at randombusiness.industry05 social sciencesprincipal analysis by conditional estimationSampling (statistics)[MATH.MATH-PR]Mathematics [math]/Probability [math.PR]nearest neighboursKernel smoothervariance-estimationElectricityStatistics Probability and Uncertaintybusinessvariance approximation
researchProduct

Systematic handling of missing data in complex study designs : experiences from the Health 2000 and 2011 Surveys

2016

We present a systematic approach to the practical and comprehensive handling of missing data motivated by our experiences of analyzing longitudinal survey data. We consider the Health 2000 and 2011 Surveys (BRIF8901) where increased non-response and non-participation from 2000 to 2011 was a major issue. The model assumptions involved in the complex sampling design, repeated measurements design, non-participation mechanisms and associations are presented graphically using methodology previously defined as a causal model with design, i.e. a functional causal model extended with the study design. This tool forces the statistician to make the study design and the missing-data mechanism explicit…

Statistics and Probabilitymultiple imputationComputer sciencecomputer.software_genre01 natural sciences010104 statistics & probability03 medical and health sciences0302 clinical medicinenon-responseSampling design030212 general & internal medicine0101 mathematicsCausal modelta112Clinical study designInverse probability weightingSampling (statistics)non-participationMissing dataData sciencedoubly robust methodsSurvey data collectionData miningStatistics Probability and Uncertaintycomputerinverse probability weightingStatisticiancausal model with designJournal of Applied Statistics
researchProduct

Structure and determinants of production in Textile-Clothing-Leather-Skins (TCLS) craft industry in Benin: a study based on investigations of the val…

2014

Lieu et date de la conférence modifiées. Prétoria 31Juillet au 2 Août 2014 (Au lieu de Kigali -Rwanda); International audience; Sustainable economic growth in Benin requires a better understand of the informalsector, which contributes to two-thirds of GDP. Particularly, craft industry and TCLSsubsector is one of informal activity sector to be handled. The objectives of this work wereto identify the structure and factors that determine the production in TCLS craft industry.This study was based on a survey carried out in February 2011 on value creationdata in craft industry of TCLS in Benin. Lack of data induced by informal activities wascircumvented using an imputation method. A Multiple Cor…

[STAT.AP]Statistics [stat]/Applications [stat.AP][SHS.STAT]Humanities and Social Sciences/Methods and statisticsimputation methodinstrumental variablesEndogeneity[STAT.AP] Statistics [stat]/Applications [stat.AP]SMEDLS[SHS.STAT] Humanities and Social Sciences/Methods and statisticsinformel sector[SHS.ECO]Humanities and Social Sciences/Economics and FinanceCrafts[SHS.ECO] Humanities and Social Sciences/Economics and Finance
researchProduct

Analysis and evaluation of cell imputation

2008

incomplete datasovelluksettilastotiedeanalyysicell imputationregressionimputointiarviointi
researchProduct

CLUSTERING INCOMPLETE SPECTRAL DATA WITH ROBUST METHODS

2018

Abstract. Missing value imputation is a common approach for preprocessing incomplete data sets. In case of data clustering, imputation methods may cause unexpected bias because they may change the underlying structure of the data. In order to avoid prior imputation of missing values the computational operations must be projected on the available data values. In this paper, we apply a robust nan-K-spatmed algorithm to the clustering problem on hyperspectral image data. Robust statistics, such as multivariate medians, are more insensitive to outliers than classical statistics relying on the Gaussian assumptions. They are, however, computationally more intractable due to the lack of closed-for…

lcsh:Applied optics. PhotonicsMultivariate statisticsComputer scienceGaussianCorrelation clusteringRobust statisticsspectral datacomputer.software_genrelcsh:Technologysymbols.namesakeCURE data clustering algorithmImputation (statistics)interpolointiCluster analysisK-meansnan-K-spatmedlcsh:Tk-means clusteringlcsh:TA1501-1820robust statistical methodsMissing dataData setlcsh:TA1-2040OutliersymbolsData mininglcsh:Engineering (General). Civil engineering (General)computerclustering
researchProduct

Factors Influencing Teachers’ Use of ICT in Class: Evidence from a Multilevel Logistic Model

2022

Information and Communication Technologies (ICTs) have become a key factor in the educational context, especially in the aftermath of the COVID-19 pandemic, and, correctly implemented, can help to improve academic performance. The aim of this research was to analyse the factors that influence teachers’ decisions to use ICT more- or less frequently to carry out tasks and exercises in their classes. To this end, we estimated a multilevel logistic model with census data from the individualized evaluation of students of the Community of Madrid (Spain) carried out at the end of the 2018–2019 academic year in primary and secondary education. Additionally, we applied multiple imputation techniques…

multiple imputationTecnologia de la informacióICTlogistic regressionGeneral MathematicsICT; logistic regression; multilevel or hierarchical model; multiple imputation; teachingComputer Science (miscellaneous)ComputingMilieux_COMPUTERSANDEDUCATIONUNESCO::CIENCIAS ECONÓMICASmultilevel or hierarchical modelEngineering (miscellaneous)teachingMathematics; Volume 10; Issue 5; Pages: 799
researchProduct