6533b871fe1ef96bd12d206d

RESEARCH PRODUCT

Penalization and data reduction of auxiliary variables in survey sampling

Muhammad Ahmed Shehzad

subject

Estimateur assisté par un modèleModel-assisted estimatorRégression ridge[ MATH.MATH-GM ] Mathematics [math]/General Mathematics [math.GM]Calage sur composantes principalesPenalized calibration[MATH.MATH-GM] Mathematics [math]/General Mathematics [math.GM]Estimateur basé sur un modèleSurvey sampling[MATH.MATH-GM]Mathematics [math]/General Mathematics [math.GM]Ridge regressionCalage pénaliséModel-based estimatorColinéaritéEstimateur de Horvitz-ThompsonHorvitz-Thompson estimatorSondageMulticollinearityPrincipal component calibration

description

Survey sampling techniques are quite useful in a way to estimate population parameterssuch as the population total when the large dimensional auxiliary data setis available. This thesis deals with the estimation of population total in presenceof ill-conditioned large data set.In the first chapter, we give some basic definitions that will be used in thelater chapters. The Horvitz-Thompson estimator is defined as an estimator whichdoes not use auxiliary variables. Along with, calibration technique is defined toincorporate the auxiliary variables for sake of improvement in the estimation ofpopulation totals for a fixed sample size.The second chapter is a part of a review article about ridge regression estimationas a remedy for the multicollinearity. We give a detailed review ofthe model-based, design-based and model-assisted scenarios for ridge estimation.These estimates give improved results in terms of MSE compared to the leastsquared estimates. Penalized calibration is also defined under survey sampling asan equivalent estimation technique to the ridge regression in the classical statisticscase. Simulation results confirm the improved estimation compared to theHorvitz-Thompson estimator.Another solution to the ill-conditioned large auxiliary data is given in terms ofprincipal components analysis in chapter three. Principal component regression isdefined and its use in survey sampling is explored. Some new types of principalcomponent calibration techniques are proposed such as calibration on the secondmoment of principal component variables, partial principal component calibrationand estimated principal component calibration to estimate a population total. Applicationof these techniques on real data advocates the use of these data reductiontechniques for the improved estimation of population totals

https://tel.archives-ouvertes.fr/tel-00812880/document