Search results for "Statistics::Methodology"
showing 10 items of 71 documents
Nonlinear parametric quantile models
2020
Quantile regression is widely used to estimate conditional quantiles of an outcome variable of interest given covariates. This method can estimate one quantile at a time without imposing any constraints on the quantile process other than the linear combination of covariates and parameters specified by the regression model. While this is a flexible modeling tool, it generally yields erratic estimates of conditional quantiles and regression coefficients. Recently, parametric models for the regression coefficients have been proposed that can help balance bias and sampling variability. So far, however, only models that are linear in the parameters and covariates have been explored. This paper …
An extended continuous mapping theorem for outer almost sure weak convergence
2019
International audience; We prove an extended continuous mapping theorem for outer almost sure weak convergence in a metric space, a notion that is used in bootstrap empirical processes theory. Then we make use of those results to establish the consistency of several bootstrap procedures in empirical likelihood theory for functional parameters.
Multivariate Nonparametric Tests
2004
Multivariate nonparametric statistical tests of hypotheses are described for the one-sample location problem, the several-sample location problem and the problem of testing independence between pairs of vectors. These methods are based on affine-invariant spatial sign and spatial rank vectors. They provide affine-invariant multivariate generalizations of the univariate sign test, signed-rank test, Wilcoxon rank sum test, Kruskal–Wallis test, and the Kendall and Spearman correlation tests. While the emphasis is on tests of hypotheses, certain references to associated affine-equivariant estimators are included. Pitman asymptotic efficiencies demonstrate the excellent performance of these meth…
Additional file 4 of Development and validation of prediction model to estimate 10-year risk of all-cause mortality using modern statistical learning…
2021
Additional file 4. Distributions of the variables at baseline before and after multiple imputations.
On the Ambiguous Consequences of Omitting Variables
2015
This paper studies what happens when we move from a short regression to a long regression (or vice versa), when the long regression is shorter than the data-generation process. In the special case where the long regression equals the data-generation process, the least-squares estimators have smaller bias (in fact zero bias) but larger variances in the long regression than in the short regression. But if the long regression is also misspecified, the bias may not be smaller. We provide bias and mean squared error comparisons and study the dependence of the differences on the misspecification parameter.
A semiparametric approach to estimate reference curves for biophysical properties of the skin
2006
Reference curves which take one covariable into account such as the age, are often required in medicine, but simple systematic and efficient statistical methods for constructing them are lacking. Classical methods are based on parametric fitting (polynomial curves). In this chapter, we describe a new methodology for the estimation of reference curves for data sets, based on nonparametric estimation of conditional quantiles. The derived method should be applicable to all clinical or more generally biological variables that are measured on a continuous quantitative scale. To avoid the curse of dimensionality when the covariate is multidimensional, a new semiparametric approach is proposed. Th…
Dynamic copula models for the spark spread
2011
We propose a non-symmetric copula to model the evolution of electricity and gas prices by a bivariate non-Gaussian autoregressive process. We identify the marginal dynamics as driven by normal inverse Gaussian processes, estimating them from a series of observed UK electricity and gas spot data. We estimate the copula by modeling the difference between the empirical copula and the independent copula. We then simulate the joint process and price options written on the spark spread. We find that option prices are significantly influenced by the copula and the marginal distributions, along with the seasonality of the underlying prices.
On the ambiguous consequences of omitting variables
2015
This paper studies what happens when we move from a short regression to a long regression (or vice versa), when the long regression is shorter than the data-generation process. In the special case where the long regression equals the data-generation process, the least-squares estimators have smaller bias (in fact zero bias) but larger variances in the long regression than in the short regression. But if the long regression is also misspecified, the bias may not be smaller. We provide bias and mean squared error comparisons and study the dependence of the differences on the misspecification parameter.
On the sign recovery by LASSO, thresholded LASSO and thresholded Basis Pursuit Denoising
2020
Basis Pursuit (BP), Basis Pursuit DeNoising (BPDN), and LASSO are popular methods for identifyingimportant predictors in the high-dimensional linear regression model Y = Xβ + ε. By definition, whenε = 0, BP uniquely recovers β when Xβ = Xb and β different than b implies L1 norm of β is smaller than the L1 norm of b (identifiability condition). Furthermore, LASSO can recover the sign of β only under a much stronger irrepresentability condition. Meanwhile, it is known that the model selection properties of LASSO can be improved by hard-thresholdingits estimates. This article supports these findings by proving that thresholded LASSO, thresholded BPDNand thresholded BP recover the sign of β in …
The Role of Covariance Matrix Forecasting Method in the Performance of Minimum-Variance Portfolios
2014
Providing a more accurate covariance matrix forecast can substantially improve the performance of optimized portfolios. Using out-of-sample tests, in this paper, we evaluate alternative covariance matrix forecasting methods by looking at (1) their forecast accuracy, (2) their ability to track the volatility of the minimum-variance portfolio, and (3) their ability to keep the volatility of the minimum-variance portfolio at a target level. We find large differences between the methods. Our results suggest that shrinkage of the sample covariance matrix improves neither the forecast accuracy nor the performance of minimum-variance portfolios. In contrast, switching from the sample covariance ma…