Search results for "Overdispersion"

showing 10 items of 14 documents

Can bayesian models play a role in dental caries epidemiology? Evidence from an application to the BELCAP data set

2012

Objectives The aim of this study was to show the potential of Bayesian analysis in statistical modelling of dental caries data. Because of the bounded nature of the dmft (DMFT) index, zero-inflated binomial (ZIB) and beta-binomial (ZIBB) models were considered. The effects of incorporating prior information available about the parameters of models were also shown. Methods The data set used in this study was the Belo Horizonte Caries Prevention (BELCAP) study (Bohning et al. (1999)), consisting of five variables collected among 797 Brazilian school children designed to evaluate four programmes for reducing caries. Only the eight primary molar teeth were considered in the data set. A data aug…

Malebounded dataBest fittingBayesian probabilityDeviance (statistics)informative priorDental CariesSettore MED/42 - Igiene Generale E ApplicataSettore MED/01 - Statistica MedicaOverdispersionPrior probabilityStatisticsHumansMedicineChildGeneral DentistryBayesian analysidmftDMF Indexbusiness.industryBelo Horizonte Caries Preventionzero-inflated betabinomialCaries epidemiologyPublic Health Environmental and Occupational HealthBayes TheoremStatistical modelRegressionzero-inflated binomialFemalebusinessAlgorithmsBrazilBayesian analysis; Belo Horizonte Caries Prevention; bounded data; dmft; informative prior; zero-inflated betabinomial; zero-inflated binomialCommunity Dentistry and Oral Epidemiology
researchProduct

Overdispersion tests in count-data analysis.

2008

Count data are commonly assumed to have a Poisson distribution, especially when there is no diagnostic procedure for checking this assumption. However, count data rarely fit the restrictive assumptions of the Poisson distribution. The violation of much of such assumptions commonly results in overdispersion, which invalidates the Poisson distribution. Undetected overdispersion may entail important misleading inferences, so its detection is essential. In this study, different overdispersion diagnostic tests are evaluated through two simulation studies. In Exp. 1, the nominal error rate is compared under different sample sizes and Λ conditions. Analysis shows a remarkable performance of the χ…

Word error rateDiagnostic testModels PsychologicalPoisson distributionStatistical powersymbols.namesakeQuasi-likelihoodOverdispersionSample size determinationStatisticssymbolsHumansPsychologyPsychologyGeneral PsychologyCount dataPsychological reports
researchProduct

Degree course change and student performance: a mixed-effect approach

2015

This paper focuses on students credits earning speed over time and its determinants, dealing with the huge percentage of students who do not take the degree within the legal duration in the Italian University System. A new indicator for the performance of the student career is proposed on real data, concerning the cohort of students enrolled at a Faculty of the University of Palermo (followed for 7 years). The new indicator highlights a typical zero-inflated distribution and suggests to investigate the effect of the degree course (DC) change on the student career. A mixed-effect model for overdispersed data is considered, with the aim of taking into account the individual variability as wel…

Statistics and ProbabilityMixed modelMotion chart05 social sciences050301 education01 natural sciencesCourse (navigation)Degree (temperature)010104 statistics & probabilityOverdispersionCohortComputingMilieux_COMPUTERSANDEDUCATIONMathematics educationuniversity credits expected years to the graduation overdispersion ZIP model longitudinal data motion chartSettore SECS-S/05 - Statistica Sociale0101 mathematicsStatistics Probability and UncertaintyDuration (project management)PsychologySettore SECS-S/01 - Statistica0503 educationUniversity system
researchProduct

Empirical Bayes improves assessments of diversity and similarity when overdispersion prevails in taxonomic counts with no covariates

2019

Abstract The assessment of diversity and similarity is relevant in monitoring the status of ecosystems. The respective indicators are based on the taxonomic composition of biological communities of interest, currently estimated through the proportions computed from sampling multivariate counts. In this work we present a novel method to estimate the taxonomic composition able to work even with a single sample and no covariates, when data are affected by overdispersion. The presence of overdispersion in taxonomic counts may be the result of significant environmental factors which are often unobservable but influence communities. Following the empirical Bayes approach, we combine a Bayesian mo…

0106 biological sciencesMultivariate statisticsBiological dataEmpirical Bayesian estimationEcologyTaxonomic compositionGeneral Decision SciencesEnvironmental monitoring010501 environmental sciencesBayesian inference010603 evolutionary biology01 natural sciencesBiodiversity assessment; Dirichlet-Multinomial model; Empirical Bayesian estimation; Environmental monitoring; Taxonomic compositionMarginal likelihoodBayes' theoremOverdispersionStatisticsTaxonomic rankDirichlet-Multinomial modelBiodiversity assessmentEcology Evolution Behavior and Systematics0105 earth and related environmental sciencesEmpirical Bayes methodMathematics
researchProduct

Modeling accident risk at the road level through zero-inflated negative binomial models: A case study of multiple road networks

2021

Abstract This paper presents a case study carried out in multiple cities of the Valencian Community (Spain) to determine the effect of sociodemographic and road characteristics on traffic accident risk. The analyzes are performed at the road segment level, considering the linear network representing the road structure of each city as a spatial lattice. The number of accidents observed in each road segment from 2010 to 2019 is taken as the response variable, and a zero-inflated modeling approach is considered. Count overdispersion and spatial dependence are also accounted for. Despite the complexity and sparsity of the data, the fitted models performed considerably well, with few exceptions.…

Statistics and ProbabilityComputer sciencespatial dependence0208 environmental biotechnologyAccident riskMagnitude (mathematics)Distribution (economics)02 engineering and technologyManagement Monitoring Policy and Law01 natural sciencestraffic accidents010104 statistics & probabilityOverdispersionCovariateStatisticsZero-inflated model0101 mathematicsComputers in Earth SciencesSpatial dependencelattice structurebusiness.industryIntegrated Nested Laplace Approximationzero-inflated model020801 environmental engineeringVariable (computer science)linear networksbusiness
researchProduct

Estimating the safety performance function for urban unsignalized four-legged one-way intersections in Palermo, Italy

2014

Abstract Starting from consideration that urban intersections are sites with promise for safety and operational improvements, the paper describes the steps taken to develop a crash predictive model for estimating the safety performance of urban unsignalized intersections located in Palermo, Italy. The focus is on unsignalized four-legged one-way intersections widespread in Italian downtowns. The sample considered in the study consist of 92 intersections in Palermo, Italy. For the study were collected crashes occurred in the sites during the years 2006-2012, geometric design and functional characteristics and traffic flow. Results showed that data were overdispersed and NB1 distributed. In o…

overdispersionCrashSample (statistics)safety performance functionsSafety performance functions urban intersection overdispersion correlationTransport engineeringurban intersectionGeometric designGeographyOverdispersionlcsh:TA1-2040correlationSettore ICAR/04 - Strade Ferrovie Ed AeroportiPerformance functionlcsh:Engineering (General). Civil engineering (General)Generalized estimating equationCivil and Structural Engineering
researchProduct

Assessment of Susceptibility Risk Factors for ADHD in Imaging Genetic Studies

2019

Objective: ADHD consists of a count of symptoms that often presents heterogeneity due to overdispersion and excess of zeros. Statistical inference is usually based on a dichotomous outcome that is underpowered. The main goal of this study was to determine a suited probability distribution to analyze ADHD symptoms in Imaging Genetic studies. Method: We used two independent population samples of children to evaluate the consistency of the standard probability distributions based on count data for describing ADHD symptoms. Results: We showed that the zero-inflated negative binomial (ZINB) distribution provided the best power for modeling ADHD symptoms. ZINB reveals a genetic variant, rs273342…

MaleGenotypeImaging geneticsPopulationNegative binomial distributionPolymorphism Single NucleotideADHD symptomsImaging Genetics03 medical and health sciencesImaging Three-Dimensional0302 clinical medicineOverdispersionRisk FactorsStatisticsmental disordersDevelopmental and Educational PsychologyStatistical inferenceHumansGenetic Predisposition to Disease0501 psychology and cognitive sciencesGenetic TestingLongitudinal StudiesPoisson DistributionProspective Studiesp-valueMAPRE2Childeducationchildhoodzero-inflated negative binomialeducation.field_of_studyModels Statisticalbasal ganglia perivascular volumes05 social sciencesMagnetic Resonance Imagingcount dataVirchow-Robin spaceBinomial DistributionClinical PsychologyAttention Deficit Disorder with HyperactivityChild PreschoolProbability distributionFemalePsychology030217 neurology & neurosurgery050104 developmental & child psychologyCount data
researchProduct

Challenges in dental statistics: data and modelling

2022

The aim of this work is to present the reflections and proposals derived from the first Workshop of the SISMEC STATDENT working group on statistical methods and applications in dentistry, held in Ancona (Italy) on 28th September 2011. STATDENT began as a forum of comparison and discussion for statisticians working in the field of dental research in order to suggest new and improve existing biostatistical and clinical epidemiological methods. During the meeting, we dealt with very important topics of statistical methodology for the analysis of dental data, covering the analysis of hierarchically structured and over-dispersed data, the issue of calibration and reproducibility, as well as some…

lcsh:R5-920Dental researchbusiness.industryCalibration (statistics)lcsh:Public aspects of medicinelcsh:RA1-1270Epidemiological methodMultilevel modelingExaminer reliabilityField (computer science)ReproducibilityOverdispersionSettore MED/01 - Statistica MedicaSurvey methodologyDmftDmft; Multilevel modeling; Zero-inflation; Overdispersion; Reproducibility; Examiner reliabilityDental EpidemiologyStatisticsMedicinebusinesslcsh:Medicine (General)Zero-inflation
researchProduct

Spatial analysis of traffic accidents near and between road intersections in a directed linear network.

2019

Although most of the literature on traffic safety analysis has been developed over areal zones, there is a growing interest in using the specific road structure of the region under investigation, which is known as a linear network in the field of spatial statistics. The use of linear networks entails several technical complications, ranging from the accurate location of traffic accidents to the definition of covariates at a spatial micro-level. Therefore, the primary goal of this study was to display a detailed analysis of a dataset of traffic accidents recorded in Valencia (Spain), which were located into a linear network representing more than 30 km of urban road structure corresponding t…

050210 logistics & transportationModels StatisticalComputer science05 social sciencesKernel density estimationPublic Health Environmental and Occupational HealthNegative binomial distributionAccidents TrafficHuman Factors and ErgonomicsRangingSpatial heterogeneityLinear networkSpatio-Temporal AnalysisOverdispersionSpain0502 economics and businessStatisticsCovariateHumans0501 psychology and cognitive sciencesBuilt EnvironmentSafety Risk Reliability and QualitySpatial analysis050107 human factorsAccident; analysis and prevention
researchProduct

A Bayesian Sequential Look at u-Control Charts

2005

We extend the usual implementation of u-control charts (uCCs) in two ways. First, we overcome the restrictive (and often inadequate) assumptions of the Poisson model; next, we eliminate the need for the questionable base period by using a sequential procedure. We use empirical Bayes(EB) and Bayes methods and compare them with the traditional frequentist implementation. EB methods are somewhat easy to implement, and they deal nicely with extra-Poisson variability (and, at the same time, informally check the adequacy of the Poisson assumption). However, they still need the base period. The sequential, full Bayes approach, on the other hand, also avoids this drawback of traditional u-charts. T…

Statistics and ProbabilityApplied MathematicsBayesian probabilityPoisson distributioncomputer.software_genreStatistical process controlsymbols.namesakeBayes' theoremOverdispersionFrequentist inferenceModeling and SimulationPrior probabilitysymbolsControl chartData miningcomputerMathematicsTechnometrics
researchProduct