Search results for " Regression"

showing 10 items of 1835 documents

Model-Assisted Estimation Through Random Forests in Finite Population Sampling

2021

In surveys, the interest lies in estimating finite population parameters such as population totals and means. In most surveys, some auxiliary information is available at the estimation stage. This information may be incorporated in the estimation procedures to increase their precision. In this article, we use random forests (RFs) to estimate the functional relationship between the survey variable and the auxiliary variables. In recent years, RFs have become attractive as National Statistical Offices have now access to a variety of data sources, potentially exhibiting a large number of observations on a large number of variables. We establish the theoretical properties of model-assisted proc…

Statistics and ProbabilityEstimationFOS: Computer and information sciences0303 health scienceseducation.field_of_studyPopulationAstrophysics::Cosmology and Extragalactic Astrophysics01 natural sciencesPopulation samplingNonparametric regressionRandom forestMethodology (stat.ME)010104 statistics & probability03 medical and health sciencesVariance estimationStatisticsQuantitative Biology::Populations and EvolutionSurvey data collectionStage (hydrology)0101 mathematicsStatistics Probability and UncertaintyeducationStatistics - Methodology030304 developmental biologyMathematics
researchProduct

A Software Tool for the Exponential Power Distribution: The normalp Package

2005

In this paper we present the normalp package, a package for the statistical environment R that has a set of tools for dealing with the exponential power distribution. In this package there are functions to compute the density function, the distribution function and the quantiles from an exponential power distribution and to generate pseudo-random numbers from the same distribution. Moreover, methods concerning the estimation of the distribution parameters are described and implemented. It is also possible to estimate linear regression models when we assume the random errors distributed according to an exponential power distribution. A set of functions is designed to perform simulation studi…

Statistics and ProbabilityExponential distributionTheoretical computer scienceComputer scienceAsymptotic distributionDistribution fittingLaplace distributionExponential familyGamma distributionStatistics Probability and UncertaintyNatural exponential familyProbability integral transformAlgorithmlcsh:Statisticslcsh:HA1-4737exponential power distribution R estimation linear regressionSoftwareJournal of Statistical Software
researchProduct

Subject-specific odds ratios in binomial GLMMs with continuous response

2007

In a regression context, the dichotomization of a continuous outcome variable is often motivated by the need to express results in terms of the odds ratio, as a measure of association between the response and one or more risk factors. Starting from the recent work of Moser and Coombs (Odds ratios for a continuous outcome variable without dichotomizing, Statistics in Medicine, 2004, 23, 1843-1860), in this article we explore in a mixed model framework the possibility of obtaining odds ratio estimates from a regression linear model without the need of dichotomizing the response variable. It is shown that the odds ratio estimators derived from a linear mixed model outperform those from a binom…

Statistics and ProbabilityGeneral linear modelProper linear modelDichotomizingBinomial regressionLinear modelLogistic regressionOdds ratioEfficiencyRandom effects modelLogistic regressionGeneralized linear mixed modelRandom effectStatisticsEconometricsDiagnostic odds ratioStatistics Probability and UncertaintySettore SECS-S/01 - StatisticaMathematics
researchProduct

Methods and Tools for Bayesian Variable Selection and Model Averaging in Normal Linear Regression

2018

In this paper, we briefly review the main methodological aspects concerned with the application of the Bayesian approach to model choice and model averaging in the context of variable selection in regression models. This includes prior elicitation, summaries of the posterior distribution and computational strategies. We then examine and compare various publicly available R-packages, summarizing and explaining the differences between packages and giving recommendations for applied users. We find that all packages reviewed (can) lead to very similar results, but there are potentially important differences in flexibility and efficiency of the packages.

Statistics and ProbabilityGeneral linear modelProper linear modelbusiness.industryComputer science05 social sciencesPosterior probabilityRegression analysisFeature selectionMachine learningcomputer.software_genre01 natural sciences010104 statistics & probabilityBayesian multivariate linear regression0502 economics and businessLinear regressionEconometricsArtificial intelligence050207 economics0101 mathematicsStatistics Probability and UncertaintyBayesian linear regressionbusinesscomputerInternational Statistical Review
researchProduct

dglars: An R Package to Estimate Sparse Generalized Linear Models

2014

dglars is a publicly available R package that implements the method proposed in Augugliaro, Mineo, and Wit (2013), developed to study the sparse structure of a generalized linear model. This method, called dgLARS, is based on a differential geometrical extension of the least angle regression method proposed in Efron, Hastie, Johnstone, and Tibshirani (2004). The core of the dglars package consists of two algorithms implemented in Fortran 90 to efficiently compute the solution curve: a predictor-corrector algorithm, proposed in Augugliaro et al. (2013), and a cyclic coordinate descent algorithm, proposed in Augugliaro, Mineo, and Wit (2012). The latter algorithm, as shown here, is significan…

Statistics and ProbabilityGeneralized linear modelEXPRESSIONMathematical optimizationTISSUESFortrancyclic coordinate descent algorithmdgLARSFeature selectionDANTZIG SELECTORpredictor-corrector algorithmLIKELIHOODLEAST ANGLE REGRESSIONsparse modelsDifferential (infinitesimal)differential geometrylcsh:Statisticslcsh:HA1-4737computer.programming_languageMathematicsLeast-angle regressionExtension (predicate logic)Expression (computer science)generalized linear modelsBREAST-CANCER RISKVARIABLE SELECTIONDifferential geometrydifferential geometry generalized linear models dgLARS predictor-corrector algorithm cyclic coordinate descent algorithm sparse models variable selection.MARKERSHRINKAGEStatistics Probability and UncertaintyHAPLOTYPESSettore SECS-S/01 - StatisticacomputerAlgorithmSoftware
researchProduct

Extended differential geometric LARS for high-dimensional GLMs with general dispersion parameter

2018

A large class of modeling and prediction problems involves outcomes that belong to an exponential family distribution. Generalized linear models (GLMs) are a standard way of dealing with such situations. Even in high-dimensional feature spaces GLMs can be extended to deal with such situations. Penalized inference approaches, such as the $$\ell _1$$ or SCAD, or extensions of least angle regression, such as dgLARS, have been proposed to deal with GLMs with high-dimensional feature spaces. Although the theory underlying these methods is in principle generic, the implementation has remained restricted to dispersion-free models, such as the Poisson and logistic regression models. The aim of this…

Statistics and ProbabilityGeneralized linear modelMathematical optimizationGeneralized linear modelsPredictor-€“corrector algorithmGeneralized linear model02 engineering and technologyPoisson distributionDANTZIG SELECTOR01 natural sciencesCross-validationHigh-dimensional inferenceTheoretical Computer Science010104 statistics & probabilitysymbols.namesakeExponential familyLEAST ANGLE REGRESSION0202 electrical engineering electronic engineering information engineeringApplied mathematicsStatistics::Methodology0101 mathematicsCROSS-VALIDATIONMathematicsLeast-angle regressionLinear model020206 networking & telecommunicationsProbability and statisticsVARIABLE SELECTIONEfficient estimatorPredictor-corrector algorithmComputational Theory and MathematicsDispersion paremeterLINEAR-MODELSsymbolsSHRINKAGEStatistics Probability and UncertaintySettore SECS-S/01 - StatisticaStatistics and Computing
researchProduct

Differential geometric least angle regression: a differential geometric approach to sparse generalized linear models

2013

Summary Sparsity is an essential feature of many contemporary data problems. Remote sensing, various forms of automated screening and other high throughput measurement devices collect a large amount of information, typically about few independent statistical subjects or units. In certain cases it is reasonable to assume that the underlying process generating the data is itself sparse, in the sense that only a few of the measured variables are involved in the process. We propose an explicit method of monotonically decreasing sparsity for outcomes that can be modelled by an exponential family. In our approach we generalize the equiangular condition in a generalized linear model. Although the …

Statistics and ProbabilityGeneralized linear modelSparse modelMathematical optimizationGeneralized linear modelsVariable selectionPath following algorithmEquiangular polygonGeneralized linear modelLASSODANTZIG SELECTORsymbols.namesakeExponential familyLasso (statistics)Sparse modelsDifferential geometryInformation geometryCOORDINATE DESCENTFisher informationERRORMathematicsLeast-angle regressionLeast angle regressionGeneralized degrees of freedomsymbolsSHRINKAGEStatistics Probability and UncertaintySimple linear regressionInformation geometrySettore SECS-S/01 - StatisticaAlgorithmCovariance penalty theory
researchProduct

A differential-geometric approach to generalized linear models with grouped predictors

2016

We propose an extension of the differential-geometric least angle regression method to perform sparse group inference in a generalized linear model. An efficient algorithm is proposed to compute the solution curve. The proposed group differential-geometric least angle regression method has important properties that distinguish it from the group lasso. First, its solution curve is based on the invariance properties of a generalized linear model. Second, it adds groups of variables based on a group equiangularity condition, which is shown to be related to score statistics. An adaptive version, which includes weights based on the Kullback-Leibler divergence, improves its variable selection fea…

Statistics and ProbabilityGeneralized linear modelStatistics::TheoryMathematical optimizationProper linear modelGeneral MathematicsORACLE PROPERTIESGeneralized linear modelSPARSITYGeneralized linear array model01 natural sciencesGeneralized linear mixed modelCONSISTENCY010104 statistics & probabilityScore statistic.LEAST ANGLE REGRESSIONLinear regressionESTIMATORApplied mathematicsDifferential geometry0101 mathematicsDivergence (statistics)MathematicsVariance functionDifferential-geometric least angle regressionPATH ALGORITHMApplied MathematicsLeast-angle regressionScore statistic010102 general mathematicsAgricultural and Biological Sciences (miscellaneous)Group lassoGROUP SELECTIONStatistics Probability and UncertaintyGeneral Agricultural and Biological SciencesSettore SECS-S/01 - Statistica
researchProduct

Premature conclusions about the signal‐to‐noise ratio in structural equation modeling research : A commentary on Yuan and Fang (2023)

2023

In a recent article published in this journal, Yuan and Fang (British Journal of Mathematical and Statistical Psychology, 2023) suggest comparing structural equation modeling (SEM), also known as covariance-based SEM (CB-SEM), estimated by normal-distribution-based maximum likelihood (NML), to regression analysis with (weighted) composites estimated by least squares (LS) in terms of their signal-to-noise ratio (SNR). They summarize their findings in the statement that “[c]ontrary to the common belief that CB-SEM is the preferred method for the analysis of observational data, this article shows that regression analysis via weighted composites yields parameter estimates with much smaller stan…

Statistics and ProbabilityHenseler-Ogasawara specificationeffect sizetilastomenetelmätpartial least squares structural equation modelingGeneral MedicinerakenneyhtälömallitregressioanalyysiArts and Humanities (miscellaneous)sum scorescovariance-based structural equation modelingcomposite modelregression analysis with weighted compositesfactor score regressionGeneral Psychology
researchProduct

Accounting for previous events to model and predict traffic accidents at the road segment level: A study in Valencia (Spain)

2022

Abstract Predicting the occurrence of traffic accidents is essential for establishing preventive measures and reducing the impact of traffic accidents. In particular, it is fundamental to make predictions using fine spatio-temporal units. In this paper, the daily risk of traffic accident occurrence across the road network of Valencia (Spain) is modeled through logistic regression models. The spatio-temporal dependence between the observations is accounted for through the inclusion of lagged binary covariates representing the previous occurrence of a traffic accident within a spatio-temporal window centered at each combination of day and segment of the network. A temporal distance of 28 days…

Statistics and ProbabilityIndex (economics)Temporal distanceTraffic accidentNames of the days of the weekCovariateStatisticsStatistical and Nonlinear PhysicsMatthews correlation coefficientLogistic regressionMathematicsPhysica A: Statistical Mechanics and its Applications
researchProduct