Search results for " Models"

showing 10 items of 4240 documents

Inferential tools in penalized logistic regression for small and sparse data: A comparative study.

2016

This paper focuses on inferential tools in the logistic regression model fitted by the Firth penalized likelihood. In this context, the Likelihood Ratio statistic is often reported to be the preferred choice as compared to the ‘traditional’ Wald statistic. In this work, we consider and discuss a wider range of test statistics, including the robust Wald, the Score, and the recently proposed Gradient statistic. We compare all these asymptotically equivalent statistics in terms of interval estimation and hypothesis testing via simulation experiments and analyses of two real datasets. We find out that the Likelihood Ratio statistic does not appear the best inferential device in the Firth penal…

Statistics and ProbabilityScore testPRESS statisticEpidemiologyStatistics as TopicScoreWald testLogistic regression01 natural sciences010104 statistics & probability03 medical and health sciences0302 clinical medicineHealth Information ManagementStatisticsEconometricsHumans030212 general & internal medicine0101 mathematicsStatisticMathematicsLikelihood FunctionsModels StatisticalLogistic regression firth penalized likelihood sandwich formula score statistic gradient statisticLogistic ModelsLikelihood-ratio testData Interpretation StatisticalSample SizeAncillary statisticSettore SECS-S/01 - StatisticaStatistical methods in medical research
researchProduct

A spatial analysis of Italian unemployment differences

2008

Using spatial econometric models, this paper focuses attention on the spatial structure of provincial unemployment disparities of Italian provinces for the year 2003. On the basis of findings from the economic literature and of the available socio-economic data, various model specifications including supply- and demand-side variables are tested. Further we use ESDA analysis as equivalent to integration analysis on time series; therefore it is applied on each variable, dependent and independent, involved in the statistical model. The suggestions of ESDA lead us to the most adequate statistical model, which estimates indicate that there is a significant degree of neighbouring effect (i.e. pos…

Statistics and ProbabilitySpatial correlationSpatial structuremedia_common.quotation_subjectESDAStatistical modelSDG 8 - Decent Work and Economic Growth/dk/atira/pure/sustainabledevelopmentgoals/decent_work_and_economic_growthMunicipal levelEconometric modelVariable (computer science)Spatial modelsUnemploymentEconometricsEconomicsCommon spatial patternRegional unemploymentStatistics Probability and Uncertaintymedia_common
researchProduct

The Induced Smoothed lasso: A practical framework for hypothesis testing in high dimensional regression.

2020

This paper focuses on hypothesis testing in lasso regression, when one is interested in judging statistical significance for the regression coefficients in the regression equation involving a lot of covariates. To get reliable p-values, we propose a new lasso-type estimator relying on the idea of induced smoothing which allows to obtain appropriate covariance matrix and Wald statistic relatively easily. Some simulation experiments reveal that our approach exhibits good performance when contrasted with the recent inferential tools in the lasso framework. Two real data analyses are presented to illustrate the proposed framework in practice.

Statistics and ProbabilityStatistics::TheoryInduced smoothingEpidemiologyComputer scienceFeature selectionWald test01 natural sciencesasthma researchStatistics::Machine Learning010104 statistics & probability03 medical and health sciencesHealth Information ManagementLasso (statistics)Linear regressionsparse modelsStatistics::MethodologyComputer Simulation0101 mathematicssandwich formula030304 developmental biologyStatistical hypothesis testing0303 health sciencesCovariance matrixlung functionRegression analysisStatistics::Computationsparse modelResearch DesignAlgorithmSmoothingvariable selectionStatistical methods in medical research
researchProduct

Nonlinear parametric quantile models

2020

Quantile regression is widely used to estimate conditional quantiles of an outcome variable of interest given covariates. This method can estimate one quantile at a time without imposing any constraints on the quantile process other than the linear combination of covariates and parameters specified by the regression model. While this is a flexible modeling tool, it generally yields erratic estimates of conditional quantiles and regression coefficients. Recently, parametric models for the regression coefficients have been proposed that can help balance bias and sampling variability. So far, however, only models that are linear in the parameters and covariates have been explored. This paper …

Statistics and ProbabilityStatistics::Theoryquantile regressionEpidemiologyparametric010501 environmental sciences01 natural sciencesquantile regression coefficients models010104 statistics & probabilityOutcome variableHealth Information ManagementCovariateEconometricsHumansStatistics::MethodologyComputer Simulation0101 mathematicsChild0105 earth and related environmental sciencesParametric statisticsMathematicsModels StatisticalForced oscillation technique integrated loss function parametric quantile regression quantile regression coefficients models Child Computer Simulation Humans Regression Analysis Models Statistical Nonlinear DynamicsStatistics::ComputationQuantile regressionNonlinear systemNonlinear Dynamicsintegrated loss functionRegression AnalysisQuantileStatistical Methods in Medical Research
researchProduct

Building up adjusted indicators of students' evaluation of university courses using generalized item response models

2012

This article advances a proposal for building up adjusted composite indicators of the quality of university courses from students’ assessments. The flexible framework of Generalized Item Response Models is adopted here for controlling the sources of heterogeneity in the data structure that make evaluations across courses not directly comparable. Specifically, it allows us to: jointly model students’ ratings to the set of items which define the quality of university courses; explicitly consider the dimensionality of the items composing the evaluation form; evaluate and remove the effect of potential confounding factors which may affect students’ evaluation; model the intra-cluster variabilit…

Statistics and ProbabilityStructure (mathematical logic)Computer sciencemedia_common.quotation_subjectadjusted indicators explanatory item response models multidimensional latent traits multilevel models evaluation of university courses potential confounding factorsRegression analysisData structureAffect (psychology)Multilevel dataComputingMilieux_COMPUTERSANDEDUCATIONEconometricsMathematics educationQuality (business)Settore SECS-S/05 - Statistica SocialeStatistics Probability and UncertaintySet (psychology)Settore SECS-S/01 - Statisticamedia_commonCurse of dimensionality
researchProduct

Understanding the determinants of volatility clustering in terms of stationary Markovian processes

2016

Abstract Volatility is a key variable in the modeling of financial markets. The most striking feature of volatility is that it is a long-range correlated stochastic variable, i.e. its autocorrelation function decays like a power-law τ − β for large time lags. In the present work we investigate the determinants of such feature, starting from the empirical observation that the exponent β of a certain stock’s volatility is a linear function of the average correlation of such stock’s volatility with all other volatilities. We propose a simple approach consisting in diagonalizing the cross-correlation matrix of volatilities and investigating whether or not the diagonalized volatilities still kee…

Statistics and ProbabilityVolatility clusteringVolatility Econophysics Long-range correlation Stochastic processes First passage timeStochastic volatilityProbability density functionCondensed Matter PhysicsSABR volatility model01 natural sciencesSettore FIS/07 - Fisica Applicata(Beni Culturali Ambientali Biol.e Medicin)010305 fluids & plasmasHeston modelFinancial models with long-tailed distributions and volatility clustering0103 physical sciencesForward volatilityEconometricsVolatility (finance)010306 general physicsMathematics
researchProduct

Segmented relationships to model erosion of regression effect in Cox regression

2010

In this article we propose a parsimonious parameterisation to model the so-called erosion of the covariate effect in the Cox model, namely a covariate effect approaching to zero as the follow-up time increases. The proposed parameterisation is based on the segmented relationship where proper constraints are set to accomodate for the erosion. Relevant hypothesis testing is discussed. The approach is illustrated on two historical datasets in the survival analysis literature, and some simulation studies are presented to show how the proposed framework leads to a test for a global effect with good power as compared with alternative procedures. Finally, possible generalisations are also present…

Statistics and ProbabilitybreakpointEpidemiologyProportional hazards modelLiver Cirrhosis BiliaryErosion (morphology)Lupus NephritisSet (abstract data type)Segmented regressionHealth Information ManagementNonlinear DynamicsRegression toward the meanCox modelCovariateStatisticsEconometricsHumansComputer SimulationSettore SECS-S/05 - Statistica SocialeSettore SECS-S/01 - Statisticaerosion of effectStatistical hypothesis testingMathematicsFollow-Up StudiesProportional Hazards Models
researchProduct

cglasso: An R Package for Conditional Graphical Lasso Inference with Censored and Missing Values

2023

Sparse graphical models have revolutionized multivariate inference. With the advent of high-dimensional multivariate data in many applied fields, these methods are able to detect a much lower-dimensional structure, often represented via a sparse conditional independence graph. There have been numerous extensions of such methods in the past decade. Many practical applications have additional covariates or suffer from missing or censored data. Despite the development of these extensions of sparse inference methods for graphical models, there have been so far no implementations for, e.g., conditional graphical models. Here we present the general-purpose package cglasso for estimating sparse co…

Statistics and Probabilityconditional Gaussian graphical modelscglasso conditional Gaussian graphical models glasso high-dimensionality sparsity censoring missing dataglassosparsityhigh-dimensionalityconditional Gaussian graphical models glasso high-dimensionality sparsity censoring missing datacglassomissing datacensoringStatistics Probability and UncertaintySettore SECS-S/01 - StatisticaSoftware
researchProduct

Bioclimatic atlas of the terrestrial Arctic

2023

AbstractThe Arctic is the region on Earth that is warming at the fastest rate. In addition to rising means of temperature-related variables, Arctic ecosystems are affected by increasingly frequent extreme weather events causing disturbance to Arctic ecosystems. Here, we introduce a new dataset of bioclimatic indices relevant for investigating the changes of Arctic terrestrial ecosystems. The dataset, called ARCLIM, consists of several climate and event-type indices for the northern high-latitude land areas > 45°N. The indices are calculated from the hourly ERA5-Land reanalysis data for 1950–2021 in a spatial grid of 0.1 degree (~9 km) resolution. The indices are provided in three subsets…

Statistics and Probabilityhiilidioksidiarctic regionmeltingclimate changeswarmingPhysiologyEventsrainfallLibrary and Information SciencesklimatologiaEducationeliömaantiedeSnowilmastoSpecies distribution modelsVariabilityClimate-changeclimate1172 Environmental sciencesbiogeographyarktinen aluetemperaturecarbon dioxidesulaminenclimatologyilmastonmuutoksetecosystems (ecology)ekologiaComputer Science Applicationsekosysteemit (ekologia)sademääräclimate changeImpactsSea-icelämpötilaStatistics Probability and UncertaintyTrendslämpeneminenInformation Systemsclimate-change ecology
researchProduct

Bayesian Modeling of Sequential Discoveries

2022

We aim at modelling the appearance of distinct tags in a sequence of labelled objects. Common examples of this type of data include words in a corpus or distinct species in a sample. These sequential discoveries are often summarised via accumulation curves, which count the number of distinct entities observed in an increasingly large set of objects. We propose a novel Bayesian method for species sampling modelling by directly specifying the probability of a new discovery, therefore allowing for flexible specifications. The asymptotic behavior and finite sample properties of such an approach are extensively studied. Interestingly, our enlarged class of sequential processes includes highly tr…

Statistics and Probabilitylajistokartoitusspecies sampling modelslogistic regressionbayesilainen menetelmäaccumulation curvesotantaStatistics Probability and Uncertaintydirichlet processtilastolliset mallitpoisson-binomial distribution
researchProduct