Search results for "Statistics::Theory"

showing 10 items of 56 documents

Varying-coefficient functional linear regression models

2008

This article considers a generalization of functional linear regression in which an additional real variable smoothly influences the functional coefficient. We thus define a varying-coefficient regression model for functional data. We propose two estimators, based respectively on conditional functional principal regression and on local penalized regression splines, and prove their pointwise consistency. Using one-day-ahead prediction of ozone concentration in the city of Toulouse, we check the ability of such nonlinear functional approaches to produce competitive estimates.

Statistics and Probability; Polynomial regression; Statistics::Theory; Proper linear model; Multivariate adaptive regression splines; 010504 meteorology & atmospheric sciences; Local regression; 01 natural sciences; 62G05 (62G20 62M20); Statistics::Computation; Nonparametric regression; Statistics::Machine Learning; 010104 statistics & probability; Linear regression; Statistics; Statistics::Methodology; 0101 mathematics; Segmented regression; Regression diagnostic; ComputingMilieux_MISCELLANEOUS; 0105 earth and related environmental sciences; Mathematics
researchProduct

Estimating growth charts via nonparametric quantile regression: a practical framework with application in ecology.

2013

We discuss a practical and effective framework for estimating reference growth charts via regression quantiles. Inequality constraints are used to ensure both monotonicity and non-crossing of the estimated quantile curves, and penalized splines are employed to model the nonlinear growth patterns with respect to age. A companion R package is presented and the relevant code discussed to encourage dissemination and application of the proposed methods.

Statistics and Probability; Settore BIO/07 - Ecologia; Statistics::Theory; Ecology (disciplines); Nonparametric statistics; Monotonic function; Regression; Statistics::Computation; Quantile regression; Nonlinear system; R package; Statistics; Econometrics; Statistics::Methodology; Growth charts Nonparametric regression quantiles Penalized splines P. oceanica modelling R software; Statistics Probability and Uncertainty; Settore SECS-S/01 - Statistica; General Environmental Science; Mathematics; Quantile
researchProduct

The Induced Smoothed lasso: A practical framework for hypothesis testing in high dimensional regression.

2020

This paper focuses on hypothesis testing in lasso regression, when one is interested in judging the statistical significance of the regression coefficients in a regression equation involving many covariates. To obtain reliable p-values, we propose a new lasso-type estimator relying on the idea of induced smoothing, which makes it relatively easy to obtain an appropriate covariance matrix and Wald statistic. Simulation experiments reveal that our approach performs well compared with recent inferential tools in the lasso framework. Two real data analyses are presented to illustrate the proposed framework in practice.

Statistics and Probability; Statistics::Theory; Induced smoothing; Epidemiology; Computer science; Feature selection; Wald test; 01 natural sciences; asthma research; Statistics::Machine Learning; 010104 statistics & probability; 03 medical and health sciences; Health Information Management; Lasso (statistics); Linear regression; sparse models; Statistics::Methodology; Computer Simulation; 0101 mathematics; sandwich formula; 030304 developmental biology; Statistical hypothesis testing; 0303 health sciences; Covariance matrix; lung function; Regression analysis; Statistics::Computation; sparse model; Research Design; Algorithm; Smoothing; variable selection; Statistical methods in medical research
researchProduct

Selecting the tuning parameter in penalized Gaussian graphical models

2019

Penalized inference of Gaussian graphical models is a way to assess the conditional independence structure in multivariate problems. In this setting, the conditional independence structure, corresponding to a graph, is related to the choice of the tuning parameter, which determines the model complexity or degrees of freedom. There has been little research on the degrees of freedom for penalized Gaussian graphical models. In this paper, we propose an estimator of the degrees of freedom in $\ell_1$-penalized Gaussian graphical models. Specifically, we derive an estimator inspired by the generalized information criterion and propose to use this estimator as the bias term for two informatio…

Statistics and Probability; Statistics::Theory; Kullback–Leibler divergence; Kullback-Leibler divergence; Computer science; Gaussian; Information Criteria; 010103 numerical & computational mathematics; Model complexity; Model selection; 01 natural sciences; Theoretical Computer Science; 010104 statistics & probability; symbols.namesake; Statistics::Machine Learning; Generalized information criterion; Entropy (information theory); Statistics::Methodology; Graphical model; 0101 mathematics; Penalized Likelihood Kullback-Leibler Divergence Model Complexity Model Selection Generalized Information Criterion.; Model selection; Estimator; Statistics::Computation; Computational Theory and Mathematics; Conditional independence; symbols; Penalized likelihood; Statistics Probability and Uncertainty; Settore SECS-S/01 - Statistica; Algorithm; Statistics and Computing
researchProduct

Clusters of effects curves in quantile regression models

2018

In this paper, we propose a new method for finding similarity of effects based on quantile regression models. Clustering of effects curves (CEC) techniques are applied to quantile regression coefficients, which are one-to-one functions of the order of the quantile. We adopt the quantile regression coefficients modeling (QRCM) framework to describe the functional form of the coefficient functions by means of parametric models. The proposed method can be utilized to cluster the effect of covariates with a univariate response variable, or to cluster a multivariate outcome. We report simulation results, comparing our approach with the existing techniques. The idea of combining CEC with QRCM per…

Statistics and Probability; Statistics::Theory; Multivariate statistics; 05 social sciences; Univariate; Functional data analysis; 01 natural sciences; Quantile regression; Quantile regression coefficients modeling Multivariate analysis Functional data analysis Curves clustering Variable selection; 010104 statistics & probability; Computational Mathematics; 0502 economics and business; Parametric model; Covariate; Statistics::Methodology; Applied mathematics; 0101 mathematics; Statistics Probability and Uncertainty; Cluster analysis; Settore SECS-S/01 - Statistica; 050205 econometrics; Mathematics; Quantile
researchProduct

Tests and estimates of shape based on spatial signs and ranks

2009

Nonparametric procedures for testing and estimation of the shape matrix in the case of multivariate elliptic distribution are considered. Testing for sphericity is an important special case. The tests and estimates are based on the spatial sign and rank covariance matrices. The estimates based on the spatial sign covariance matrix and symmetrized spatial sign covariance matrix are Tyler's [A distribution-free M-estimator of multivariate scatter, Ann. Statist. 15 (1987), pp. 234–251] shape matrix and Dümbgen's [On Tyler's M-functional of scatter in high dimension, Ann. Inst. Statist. Math. 50 (1998), pp. 471–491] shape matrix, respectively. The test based on the spatial sign covariance m…

Statistics and Probability; Statistics::Theory; Rank (linear algebra); Covariance matrix; Nonparametric statistics; Covariance; Estimation of covariance matrices; Scatter matrix; Statistics; Statistics::Methodology; Sign test; Statistics Probability and Uncertainty; moniulotteiset merkki- ja jarjestysluvut; Mathematics; Sign (mathematics); Journal of Nonparametric Statistics
researchProduct

Design-based estimation for geometric quantiles with application to outlier detection

2010

Geometric quantiles are investigated using data collected from a complex survey. Geometric quantiles are an extension of univariate quantiles in a multivariate set-up that uses the geometry of multivariate data clouds. A very important application of geometric quantiles is the detection of outliers in multivariate data by means of quantile contours. A design-based estimator of geometric quantiles is constructed and used to compute quantile contours in order to detect outliers in both multivariate data and survey sampling set-ups. An algorithm for computing geometric quantile estimates is also developed. Under broad assumptions, the asymptotic variance of the quantile estimator is derived an…

Statistics and Probability; Statistics::Theory; TheoryofComputation_COMPUTATIONBYABSTRACTDEVICES; Statistics::Applications; ComputingMethodologies_SIMULATIONANDMODELING; Applied Mathematics; MathematicsofComputing_NUMERICALANALYSIS; Univariate; InformationSystems_DATABASEMANAGEMENT; Estimator; Statistics::Computation; Quantile regression; Horvitz–Thompson estimator; Computational Mathematics; Delta method; Computational Theory and Mathematics; TheoryofComputation_ANALYSISOFALGORITHMSANDPROBLEMCOMPLEXITY; Outlier; Consistent estimator; Statistics; Statistics::Methodology; Mathematics; Quantile; Computational Statistics & Data Analysis
researchProduct

Nonlinear parametric quantile models

2020

Quantile regression is widely used to estimate conditional quantiles of an outcome variable of interest given covariates. This method can estimate one quantile at a time without imposing any constraints on the quantile process other than the linear combination of covariates and parameters specified by the regression model. While this is a flexible modeling tool, it generally yields erratic estimates of conditional quantiles and regression coefficients. Recently, parametric models for the regression coefficients have been proposed that can help balance bias and sampling variability. So far, however, only models that are linear in the parameters and covariates have been explored. This paper …

Statistics and Probability; Statistics::Theory; quantile regression; Epidemiology; parametric; 010501 environmental sciences; 01 natural sciences; quantile regression coefficients models; 010104 statistics & probability; Outcome variable; Health Information Management; Covariate; Econometrics; Humans; Statistics::Methodology; Computer Simulation; 0101 mathematics; Child; 0105 earth and related environmental sciences; Parametric statistics; Mathematics; Models Statistical; Forced oscillation technique integrated loss function parametric quantile regression quantile regression coefficients models Child Computer Simulation Humans Regression Analysis Models Statistical Nonlinear Dynamics; Statistics::Computation; Quantile regression; Nonlinear system; Nonlinear Dynamics; integrated loss function; Regression Analysis; Quantile; Statistical Methods in Medical Research
researchProduct

On the Ambiguous Consequences of Omitting Variables

2015

This paper studies what happens when we move from a short regression to a long regression (or vice versa), when the long regression is shorter than the data-generation process. In the special case where the long regression equals the data-generation process, the least-squares estimators have smaller bias (in fact zero bias) but larger variances in the long regression than in the short regression. But if the long regression is also misspecified, the bias may not be smaller. We provide bias and mean squared error comparisons and study the dependence of the differences on the misspecification parameter.

Statistics::Machine Learning; Statistics::Theory; C51; C52; Bias; Misspecification; Least-squares estimators; ddc:330; Statistics::Methodology; C13; Mean squared error; Omitted variables; Statistics::Computation
researchProduct
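
As a rough illustration of the point made in the abstract of "On the Ambiguous Consequences of Omitting Variables", the sketch below computes the large-sample bias of least-squares coefficients from the textbook omitted-variable formula. The covariance matrix, the coefficient values, and the helper name `population_ols_bias` are hypothetical choices made for this example and do not come from the paper. Regressors are assumed to be zero-mean and uncorrelated with the error term.

```python
import numpy as np


def population_ols_bias(sigma, beta, included):
    """Large-sample bias of OLS when only the `included` regressors are used.

    Assumes (for illustration only) zero-mean regressors with covariance
    `sigma`, true coefficients `beta`, and an error uncorrelated with all
    regressors. The OLS limit is Sigma_SS^{-1} Sigma_S,all beta.
    """
    included = list(included)
    sigma_ss = sigma[np.ix_(included, included)]  # covariance among included regressors
    sigma_s_all = sigma[included, :]              # covariance of included with all regressors
    limit = np.linalg.solve(sigma_ss, sigma_s_all @ beta)
    return limit - beta[included]


# Hypothetical data-generation process with three regressors.
sigma = np.array([
    [1.0, 0.5, 0.5],
    [0.5, 1.0, 0.0],
    [0.5, 0.0, 1.0],
])
beta = np.array([1.0, 1.0, -1.0])

short_bias = population_ols_bias(sigma, beta, [0])     # regress on x1 only
long_bias = population_ols_bias(sigma, beta, [0, 1])   # regress on x1 and x2; x3 still omitted

print("short regression bias on x1:", short_bias[0])
print("long regression bias on x1: ", long_bias[0])
```

With these illustrative numbers the short regression is unbiased for the first coefficient because the two omitted effects cancel, while the long regression, which still omits the third regressor, has a bias of about -0.67 on that coefficient.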

"Table 14" of "Probing the quantum interference between singly and doubly resonant top-quark production in $pp$ collisions at $\sqrt{s}=13$ TeV with …

2019

The detector-level minimax-m(bl) distribution for events entering the Z+jets control region.

Statistics::Theory; 13000.0; Proton-Proton Scattering; Physics::Instrumentation and Detectors; Astrophysics::High Energy Astrophysical Phenomena; High Energy Physics::Experiment; Cross Section; SIG; P P --> W W b b
researchProduct