Search results for "Estimation"
showing 10 items of 924 documents
Lead-time and overdiagnosis estimation in neuroblastoma screening.
2003
In Germany, neuroblastoma is the most frequent extracranial solid childhood tumour. Its properties made it seem an ideal candidate for screening. A German trial assessed the effect of screening at one year of age from 1995-2001 in a nationwide project. We present here the methods developed for the estimation of lead-time and overdiagnosis in this project. Follow up on 1.5 million screened children and 2.1 million control children is currently available until June 2002. Ascertainment of control cohort cases and false negative cases is complete up to this date. A method for determining an empirical lead-time distribution and overdiagnosis estimate from comparing the age specific incidences in…
Affine equivariant multivariate rank methods
2003
The classical multivariate statistical methods (MANOVA, principal component analysis, multivariate multiple regression, canonical correlation, factor analysis, etc.) assume that the data come from a multivariate normal distribution and the derivations are based on the sample covariance matrix. The conventional sample covariance matrix and consequently the standard multivariate techniques based on it are, however, highly sensitive to outlying observations. In the paper a new, more robust and highly efficient, approach based on an affine equivariant rank covariance matrix is proposed and outlined. Affine equivariant multivariate rank concept is based on the multivariate Oja (Statist. Probab. …
An approximation to maximum likelihood estimates in reduced models
1990
SUMMARY An approximation to the maximum likelihood estimates of the parameters in a model can be obtained from the corresponding estimates and information matrices in an extended model, i.e. a model with additional parameters. The approximation is close provided that the data are consistent with the first model. Applications are described to log linear models for discrete data, to models for multivariate normal distributions with special covariance matrices and to mixed discrete-continuous models.
Parameter orthogonality and conditional profile likelihood: the exponential power function case
1999
Orthogonality, according to Fisher’s metrics, between the parameters of a probability density function, as well as giving rise to a series of statistical implications, makes it possible to express a function of conditional profile likelihood with better properties than the ordinary profile likelihood function. In the present paper the parameters of exponential power function are made orthogonal and the conditional profile likelihood of the shape parameter p is determined in order to study its properties with reference to p estimation. Moreover, by means of a simulation plan, a comparison is made between the estimates of p obtained from the conditional profile log-likelihood and those obtain…
Tests and estimates of shape based on spatial signs and ranks
2009
Nonparametric procedures for testing and estimation of the shape matrix in the case of multivariate elliptic distribution are considered. Testing for sphericity is an important special case. The tests and estimates are based on the spatial sign and rank covariance matrices. The estimates based on the spatial sign covariance matrix and symmetrized spatial sign covariance matrix are Tyler's [A distribution-free M-estimator of multivariate scatter, Ann. Statist. 15 (1987), pp. 234–251] shape matrix and and Dümbgen's [On Tyler's M-functional of scatter in high dimension, Ann. Inst. Statist. Math. 50 (1998), pp. 471–491] shape matrix, respectively. The test based on the spatial sign covariance m…
Eleccion de variables en regresion lineal un problema de decision
1986
A general structure for the problem of selection of variables in regression is proposed using the decision theory framework. In particular, some results for the choice of the best linear normal homocedastic model are obtained when the main purpose is either to specify the predictive distribution over the response variable or to obtain a point estimate of it. A comparison of our results with the most widespread classical ones is presented
What subject matter questions motivate the use of machine learning approaches compared to statistical models for probability prediction?
2014
This is a discussion of the following papers: "Probability estimation with machine learning methods for dichotomous and multicategory outcome: Theory" by Jochen Kruppa, Yufeng Liu, Gerard Biau, Michael Kohler, Inke R. Konig, James D. Malley, and Andreas Ziegler; and "Probability estimation with machine learning methods for dichotomous and multicategory outcome: Applications" by Jochen Kruppa, Yufeng Liu, Hans-Christian Diener, Theresa Holste, Christian Weimar, Inke R. Konig, and Andreas Ziegler.
Estimation of total electricity consumption curves by sampling in a finite population when some trajectories are partially unobserved
2019
International audience; Millions of smart meters that are able to collect individual load curves, that is, electricity consumption time series, of residential and business customers at fine scale time grids are now deployed by electricity companies all around the world. It may be complex and costly to transmit and exploit such a large quantity of information, therefore it can be relevant to use survey sampling techniques to estimate mean load curves of specific groups of customers. Data collection, like every mass process, may undergo technical problems at every point of the metering and collection chain resulting in missing values. We consider imputation approaches (linear interpolation, k…
A Random Field Approach to Transect Counts of Wildlife Populations
1991
Line transect counting of a wildlife population is considered a sampling from a planar marked point process, where the marks describe the detectability of the animals. Sampling properties of transect counts and a new density estimator are derived from a counting process, which is a shot-noise field induced by the marked point process. A general formula for the sampling variance of a transect is derived and applied to compare five common types of transects. Some stereological connections of transect sampling and density estimators are shown.
A semiparametric approach to estimate reference curves for biophysical properties of the skin
2006
Reference curves which take one covariable into account such as the age, are often required in medicine, but simple systematic and efficient statistical methods for constructing them are lacking. Classical methods are based on parametric fitting (polynomial curves). In this chapter, we describe a new methodology for the estimation of reference curves for data sets, based on nonparametric estimation of conditional quantiles. The derived method should be applicable to all clinical or more generally biological variables that are measured on a continuous quantitative scale. To avoid the curse of dimensionality when the covariate is multidimensional, a new semiparametric approach is proposed. Th…