Search results for " Models"
showing 10 items of 4240 documents
Inferential tools in penalized logistic regression for small and sparse data: A comparative study.
2016
This paper focuses on inferential tools in the logistic regression model fitted by the Firth penalized likelihood. In this context, the Likelihood Ratio statistic is often reported to be the preferred choice as compared to the ‘traditional’ Wald statistic. In this work, we consider and discuss a wider range of test statistics, including the robust Wald, the Score, and the recently proposed Gradient statistic. We compare all these asymptotically equivalent statistics in terms of interval estimation and hypothesis testing via simulation experiments and analyses of two real datasets. We find out that the Likelihood Ratio statistic does not appear the best inferential device in the Firth penal…
A spatial analysis of Italian unemployment differences
2008
Using spatial econometric models, this paper focuses attention on the spatial structure of provincial unemployment disparities of Italian provinces for the year 2003. On the basis of findings from the economic literature and of the available socio-economic data, various model specifications including supply- and demand-side variables are tested. Further we use ESDA analysis as equivalent to integration analysis on time series; therefore it is applied on each variable, dependent and independent, involved in the statistical model. The suggestions of ESDA lead us to the most adequate statistical model, which estimates indicate that there is a significant degree of neighbouring effect (i.e. pos…
The Induced Smoothed lasso: A practical framework for hypothesis testing in high dimensional regression.
2020
This paper focuses on hypothesis testing in lasso regression, when one is interested in judging statistical significance for the regression coefficients in the regression equation involving a lot of covariates. To get reliable p-values, we propose a new lasso-type estimator relying on the idea of induced smoothing which allows to obtain appropriate covariance matrix and Wald statistic relatively easily. Some simulation experiments reveal that our approach exhibits good performance when contrasted with the recent inferential tools in the lasso framework. Two real data analyses are presented to illustrate the proposed framework in practice.
Nonlinear parametric quantile models
2020
Quantile regression is widely used to estimate conditional quantiles of an outcome variable of interest given covariates. This method can estimate one quantile at a time without imposing any constraints on the quantile process other than the linear combination of covariates and parameters specified by the regression model. While this is a flexible modeling tool, it generally yields erratic estimates of conditional quantiles and regression coefficients. Recently, parametric models for the regression coefficients have been proposed that can help balance bias and sampling variability. So far, however, only models that are linear in the parameters and covariates have been explored. This paper …
Building up adjusted indicators of students' evaluation of university courses using generalized item response models
2012
This article advances a proposal for building up adjusted composite indicators of the quality of university courses from students’ assessments. The flexible framework of Generalized Item Response Models is adopted here for controlling the sources of heterogeneity in the data structure that make evaluations across courses not directly comparable. Specifically, it allows us to: jointly model students’ ratings to the set of items which define the quality of university courses; explicitly consider the dimensionality of the items composing the evaluation form; evaluate and remove the effect of potential confounding factors which may affect students’ evaluation; model the intra-cluster variabilit…
Understanding the determinants of volatility clustering in terms of stationary Markovian processes
2016
Abstract Volatility is a key variable in the modeling of financial markets. The most striking feature of volatility is that it is a long-range correlated stochastic variable, i.e. its autocorrelation function decays like a power-law τ − β for large time lags. In the present work we investigate the determinants of such feature, starting from the empirical observation that the exponent β of a certain stock’s volatility is a linear function of the average correlation of such stock’s volatility with all other volatilities. We propose a simple approach consisting in diagonalizing the cross-correlation matrix of volatilities and investigating whether or not the diagonalized volatilities still kee…
Segmented relationships to model erosion of regression effect in Cox regression
2010
In this article we propose a parsimonious parameterisation to model the so-called erosion of the covariate effect in the Cox model, namely a covariate effect approaching to zero as the follow-up time increases. The proposed parameterisation is based on the segmented relationship where proper constraints are set to accomodate for the erosion. Relevant hypothesis testing is discussed. The approach is illustrated on two historical datasets in the survival analysis literature, and some simulation studies are presented to show how the proposed framework leads to a test for a global effect with good power as compared with alternative procedures. Finally, possible generalisations are also present…
cglasso: An R Package for Conditional Graphical Lasso Inference with Censored and Missing Values
2023
Sparse graphical models have revolutionized multivariate inference. With the advent of high-dimensional multivariate data in many applied fields, these methods are able to detect a much lower-dimensional structure, often represented via a sparse conditional independence graph. There have been numerous extensions of such methods in the past decade. Many practical applications have additional covariates or suffer from missing or censored data. Despite the development of these extensions of sparse inference methods for graphical models, there have been so far no implementations for, e.g., conditional graphical models. Here we present the general-purpose package cglasso for estimating sparse co…
Bioclimatic atlas of the terrestrial Arctic
2023
AbstractThe Arctic is the region on Earth that is warming at the fastest rate. In addition to rising means of temperature-related variables, Arctic ecosystems are affected by increasingly frequent extreme weather events causing disturbance to Arctic ecosystems. Here, we introduce a new dataset of bioclimatic indices relevant for investigating the changes of Arctic terrestrial ecosystems. The dataset, called ARCLIM, consists of several climate and event-type indices for the northern high-latitude land areas > 45°N. The indices are calculated from the hourly ERA5-Land reanalysis data for 1950–2021 in a spatial grid of 0.1 degree (~9 km) resolution. The indices are provided in three subsets…
Bayesian Modeling of Sequential Discoveries
2022
We aim at modelling the appearance of distinct tags in a sequence of labelled objects. Common examples of this type of data include words in a corpus or distinct species in a sample. These sequential discoveries are often summarised via accumulation curves, which count the number of distinct entities observed in an increasingly large set of objects. We propose a novel Bayesian method for species sampling modelling by directly specifying the probability of a new discovery, therefore allowing for flexible specifications. The asymptotic behavior and finite sample properties of such an approach are extensively studied. Interestingly, our enlarged class of sequential processes includes highly tr…