Search results for "methodology."

showing 10 items of 822 documents

Design-based estimation for geometric quantiles with application to outlier detection

2010

Geometric quantiles are investigated using data collected from a complex survey. Geometric quantiles are an extension of univariate quantiles in a multivariate set-up that uses the geometry of multivariate data clouds. A very important application of geometric quantiles is the detection of outliers in multivariate data by means of quantile contours. A design-based estimator of geometric quantiles is constructed and used to compute quantile contours in order to detect outliers in both multivariate data and survey sampling set-ups. An algorithm for computing geometric quantile estimates is also developed. Under broad assumptions, the asymptotic variance of the quantile estimator is derived an…

Statistics and ProbabilityStatistics::TheoryTheoryofComputation_COMPUTATIONBYABSTRACTDEVICESStatistics::ApplicationsComputingMethodologies_SIMULATIONANDMODELINGApplied MathematicsMathematicsofComputing_NUMERICALANALYSISUnivariateInformationSystems_DATABASEMANAGEMENTEstimatorStatistics::ComputationQuantile regressionHorvitz–Thompson estimatorComputational MathematicsDelta methodComputational Theory and MathematicsTheoryofComputation_ANALYSISOFALGORITHMSANDPROBLEMCOMPLEXITYOutlierConsistent estimatorStatisticsStatistics::MethodologyMathematicsQuantileComputational Statistics & Data Analysis
researchProduct

Nonlinear parametric quantile models

2020

Quantile regression is widely used to estimate conditional quantiles of an outcome variable of interest given covariates. This method can estimate one quantile at a time without imposing any constraints on the quantile process other than the linear combination of covariates and parameters specified by the regression model. While this is a flexible modeling tool, it generally yields erratic estimates of conditional quantiles and regression coefficients. Recently, parametric models for the regression coefficients have been proposed that can help balance bias and sampling variability. So far, however, only models that are linear in the parameters and covariates have been explored. This paper …

Statistics and ProbabilityStatistics::Theoryquantile regressionEpidemiologyparametric010501 environmental sciences01 natural sciencesquantile regression coefficients models010104 statistics & probabilityOutcome variableHealth Information ManagementCovariateEconometricsHumansStatistics::MethodologyComputer Simulation0101 mathematicsChild0105 earth and related environmental sciencesParametric statisticsMathematicsModels StatisticalForced oscillation technique integrated loss function parametric quantile regression quantile regression coefficients models Child Computer Simulation Humans Regression Analysis Models Statistical Nonlinear DynamicsStatistics::ComputationQuantile regressionNonlinear systemNonlinear Dynamicsintegrated loss functionRegression AnalysisQuantileStatistical Methods in Medical Research
researchProduct

An extended continuous mapping theorem for outer almost sure weak convergence

2019

International audience; We prove an extended continuous mapping theorem for outer almost sure weak convergence in a metric space, a notion that is used in bootstrap empirical processes theory. Then we make use of those results to establish the consistency of several bootstrap procedures in empirical likelihood theory for functional parameters.

Statistics and ProbabilityWeak convergence010102 general mathematicsContinuous mapping theorem16. Peace & justiceEmpirical measure01 natural sciences010104 statistics & probabilityMetric spaceEmpirical likelihoodConsistency (statistics)[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]Applied mathematicsStatistics::Methodology0101 mathematicsStatistics Probability and UncertaintyMathematics
researchProduct

Multivariate Nonparametric Tests

2004

Multivariate nonparametric statistical tests of hypotheses are described for the one-sample location problem, the several-sample location problem and the problem of testing independence between pairs of vectors. These methods are based on affine-invariant spatial sign and spatial rank vectors. They provide affine-invariant multivariate generalizations of the univariate sign test, signed-rank test, Wilcoxon rank sum test, Kruskal–Wallis test, and the Kendall and Spearman correlation tests. While the emphasis is on tests of hypotheses, certain references to associated affine-equivariant estimators are included. Pitman asymptotic efficiencies demonstrate the excellent performance of these meth…

Statistics and Probabilityeducation.field_of_studyMultivariate statisticsspatial signWilcoxon signed-rank testGeneral MathematicsRank (computer programming)PopulationNonparametric statisticsUnivariaterobustnessSpearman's rank correlation coefficientspatial rankPitman efficiencyStatisticsAffine invarianceEconometricsSign testStatistics::MethodologyStatistics Probability and UncertaintyeducationMathematics
researchProduct

Deducing self-interaction in eye movement data using sequential spatial point processes

2016

Eye movement data are outputs of an analyser tracking the gaze when a person is inspecting a scene. These kind of data are of increasing importance in scientific research as well as in applications, e.g. in marketing and man-machine interface planning. Thus the new areas of application call for advanced analysis tools. Our research objective is to suggest statistical modelling of eye movement sequences using sequential spatial point processes, which decomposes the variation in data into structural components having interpretation. We consider three elements of an eye movement sequence: heterogeneity of the target space, contextuality between subsequent movements, and time-dependent behaviou…

Statistics and ProbabilitymallintaminenFOS: Computer and information sciencesrecurrenceComputer sciencestochastic geometrylikelihoodcoverageVariation (game tree)Management Monitoring Policy and Lawheterogeneous media01 natural sciences050105 experimental psychologyPoint processMethodology (stat.ME)010104 statistics & probabilitysilmänliikkeetStatistical inference0501 psychology and cognitive sciences0101 mathematicsComputers in Earth SciencesStatistics - Methodologytietojärjestelmätstokastiset prosessitta112self-interacting random walkbusiness.industry05 social sciencesEye movementPattern recognitionStatistical modelRandom walkkatseenseurantakatseArtificial intelligenceGeometric modelingbusinessStochastic geometry
researchProduct

Additional file 4 of Development and validation of prediction model to estimate 10-year risk of all-cause mortality using modern statistical learning…

2021

Additional file 4. Distributions of the variables at baseline before and after multiple imputations.

Statistics::ApplicationsStatistics::MethodologyQuantitative Biology::GenomicsComputer Science::Operating Systems
researchProduct

On the Ambiguous Consequences of Omitting Variables

2015

This paper studies what happens when we move from a short regression to a long regression (or vice versa), when the long regression is shorter than the data-generation process. In the special case where the long regression equals the data-generation process, the least-squares estimators have smaller bias (in fact zero bias) but larger variances in the long regression than in the short regression. But if the long regression is also misspecified, the bias may not be smaller. We provide bias and mean squared error comparisons and study the dependence of the differences on the misspecification parameter.

Statistics::Machine LearningStatistics::TheoryC51C52BiasMisspecificationLeast-squares estimatorsddc:330Statistics::MethodologyC13Mean squared errorOmitted variablesStatistics::Computation
researchProduct

A semiparametric approach to estimate reference curves for biophysical properties of the skin

2006

Reference curves which take one covariable into account such as the age, are often required in medicine, but simple systematic and efficient statistical methods for constructing them are lacking. Classical methods are based on parametric fitting (polynomial curves). In this chapter, we describe a new methodology for the estimation of reference curves for data sets, based on nonparametric estimation of conditional quantiles. The derived method should be applicable to all clinical or more generally biological variables that are measured on a continuous quantitative scale. To avoid the curse of dimensionality when the covariate is multidimensional, a new semiparametric approach is proposed. Th…

Statistics::TheoryKernel density estimationcomputer.software_genre01 natural sciences010104 statistics & probability0502 economics and businessCovariateSliced inverse regressionApplied mathematicsStatistics::MethodologySemiparametric regression0101 mathematics[SHS.ECO] Humanities and Social Sciences/Economics and Finance050205 econometrics MathematicsParametric statisticsDimensionality reduction05 social sciencesNonparametric statistics[ SDV.SPEE ] Life Sciences [q-bio]/Santé publique et épidémiologie[SHS.ECO]Humanities and Social Sciences/Economics and Finance3. Good health[SDV.SPEE] Life Sciences [q-bio]/Santé publique et épidémiologie[SDV.SPEE]Life Sciences [q-bio]/Santé publique et épidémiologieC140;C630Data miningcomputerQuantile
researchProduct

Dynamic copula models for the spark spread

2011

We propose a non-symmetric copula to model the evolution of electricity and gas prices by a bivariate non-Gaussian autoregressive process. We identify the marginal dynamics as driven by normal inverse Gaussian processes, estimating them from a series of observed UK electricity and gas spot data. We estimate the copula by modeling the difference between the empirical copula and the independent copula. We then simulate the joint process and price options written on the spark spread. We find that option prices are significantly influenced by the copula and the marginal distributions, along with the seasonality of the underlying prices.

Statistics::TheoryMathematical financeCopula (linguistics)Statistics::Other StatisticsBivariate analysisLévy processStatistics::ComputationInverse Gaussian distributionsymbols.namesakeAutoregressive modelSpark spreadStatisticsEconometricssymbolsEconomicsStatistics::MethodologyMarginal distributionGeneral Economics Econometrics and FinanceFinanceQuantitative Finance
researchProduct

On the ambiguous consequences of omitting variables

2015

This paper studies what happens when we move from a short regression to a long regression (or vice versa), when the long regression is shorter than the data-generation process. In the special case where the long regression equals the data-generation process, the least-squares estimators have smaller bias (in fact zero bias) but larger variances in the long regression than in the short regression. But if the long regression is also misspecified, the bias may not be smaller. We provide bias and mean squared error comparisons and study the dependence of the differences on the misspecification parameter.

Statistics::TheoryMean squared errorjel:C52Regression dilutionjel:C51Local regressionjel:C13Regression analysisOmitted-variable biasCross-sectional regressionStatistics::ComputationOmitted variables Misspecification Least-squares estimators Bias Mean squared errorStatistics::Machine LearningStatisticsEconometricsStatistics::MethodologyRegression diagnosticNonlinear regressionMathematics
researchProduct