Search results for "variable"

showing 10 items of 1674 documents

Moderating effects of subgroups in linear models

1989

SUMMARY Possibilities for moderating effects of a subgrouping variable on strength or direction of an association have been much discussed by social scientists but have not been given satisfactory statistical formulations. The results concern directed measures of associations in linear models containing just three variables. Some key words: Analysis of covariance; Analysis of variance; cG-distribution; Conditional independence; Graphical chain model; Parallel regressions; Yule-Simpson paradox. 1. INTRODUCTION Linear models are commonly used as a framework to estimate and test how a continuous response variable depends on potential influencing variables. This paper is concerned with the situ…

Statistics and ProbabilityAnalysis of covarianceeducation.field_of_studyApplied MathematicsGeneral MathematicsPopulationLinear modelContext (language use)ModerationAgricultural and Biological Sciences (miscellaneous)Conditional independenceStatisticsEconometricsStatistics Probability and UncertaintyGeneral Agricultural and Biological ScienceseducationRandom variableMathematicsVariable (mathematics)Biometrika
researchProduct

A Monte Carlo study comparing PIV, ULS and DWLS in the estimation of dichotomous confirmatory factor analysis

2012

We conducted a Monte Carlo study to investigate the performance of the polychoric instrumental variable estimator (PIV) in comparison to unweighted least squares (ULS) and diagonally weighted least squares (DWLS) in the estimation of a confirmatory factor analysis model with dichotomous indicators. The simulation involved 144 conditions (1,000 replications per condition) that were defined by a combination of (a) two types of latent factor models, (b) four sample sizes (100, 250, 500, 1,000), (c) three factor loadings (low, moderate, strong), (d) three levels of non-normality (normal, moderately, and extremely non-normal), and (e) whether the factor model was correctly specified or misspecif…

Statistics and ProbabilityArts and Humanities (miscellaneous)Sample size determinationMonte Carlo methodStatisticsInstrumental variable estimatorGeneral MedicinePolychoric correlationLeast squaresGeneral PsychologyConfirmatory factor analysisFactor analysisMathematicsBritish Journal of Mathematical and Statistical Psychology
researchProduct

Asymptotic optimality of myopic information-based strategies for Bayesian adaptive estimation

2016

This paper presents a general asymptotic theory of sequential Bayesian estimation giving results for the strongest, almost sure convergence. We show that under certain smoothness conditions on the probability model, the greedy information gain maximization algorithm for adaptive Bayesian estimation is asymptotically optimal in the sense that the determinant of the posterior covariance in a certain neighborhood of the true parameter value is asymptotically minimal. Using this result, we also obtain an asymptotic expression for the posterior entropy based on a novel definition of almost sure convergence on "most trials" (meaning that the convergence holds on a fraction of trials that converge…

Statistics and ProbabilityAsymptotic analysisMathematical optimizationPosterior probabilityBayesian probabilityMathematics - Statistics TheoryStatistics Theory (math.ST)050105 experimental psychologydifferential entropyDifferential entropyactive data selection03 medical and health sciences0302 clinical medicineactive learningFOS: Mathematics0501 psychology and cognitive sciencescost of observationdecision theoryMathematicsD-optimalityBayes estimatorSequential estimation05 social sciencesBayesian adaptive estimationAsymptotically optimal algorithmConvergence of random variablesasymptotic optimalitysequential estimation030217 neurology & neurosurgery
researchProduct

Cluster-Localized Sparse Logistic Regression for SNP Data

2012

The task of analyzing high-dimensional single nucleotide polymorphism (SNP) data in a case-control design using multivariable techniques has only recently been tackled. While many available approaches investigate only main effects in a high-dimensional setting, we propose a more flexible technique, cluster-localized regression (CLR), based on localized logistic regression models, that allows different SNPs to have an effect for different groups of individuals. Separate multivariable regression models are fitted for the different groups of individuals by incorporating weights into componentwise boosting, which provides simultaneous variable selection, hence sparse fits. For model fitting, th…

Statistics and ProbabilityBoosting (machine learning)Computer scienceMultivariable calculusComputational BiologyHigh-Throughput Nucleotide SequencingFeature selectionRegression analysisModels TheoreticalLogistic regressioncomputer.software_genrePolymorphism Single NucleotideRegressionComputational MathematicsLogistic ModelsData Interpretation StatisticalGeneticsCluster AnalysisHumansData miningCluster analysisMolecular BiologyUnit-weighted regressioncomputerGenome-Wide Association StudyStatistical Applications in Genetics and Molecular Biology
researchProduct

Random Logistic Maps II. The Critical Case

2003

Let (X n )∞ 0 be a Markov chain with state space S=[0,1] generated by the iteration of i.i.d. random logistic maps, i.e., X n+1=C n+1 X n (1−X n ),n≥0, where (C n )∞ 1 are i.i.d. random variables with values in [0, 4] and independent of X 0. In the critical case, i.e., when E(log C 1)=0, Athreya and Dai(2) have shown that X n → P 0. In this paper it is shown that if P(C 1=1)<1 and E(log C 1)=0 then (i) X n does not go to zero with probability one (w.p.1) and in fact, there exists a 0<β<1 and a countable set ▵⊂(0,1) such that for all x∈A≔(0,1)∖▵, P x (X n ≥β for infinitely many n≥1)=1, where P x stands for the probability distribution of (X n )∞ 0 with X 0=x w.p.1. A is a closed set for (X n…

Statistics and ProbabilityCombinatoricsDiscrete mathematicsDistribution (mathematics)Multivariate random variableInitial distributionGeneral MathematicsZero (complex analysis)Random elementProbability distributionStatistics Probability and UncertaintyRandom variableMathematicsJournal of Theoretical Probability
researchProduct

A Unified Approach to Likelihood Inference on Stochastic Orderings in a Nonparametric Context

1998

Abstract For data in a two-way contingency table with ordered margins, we consider various hypotheses of stochastic orders among the conditional distributions considered by rows and show that each is equivalent to requiring that an invertible transformation of the vectors of conditional row probabilities satisfies an appropriate set of linear inequalities. This leads to the construction of a general algorithm for maximum likelihood estimation under multinomial sampling and provides a simple framework for deriving the asymptotic distribution of log-likelihood ratio tests. The usual stochastic ordering and the so called uniform and likelihood ratio orderings are considered as special cases. I…

Statistics and ProbabilityCombinatoricsIndependent and identically distributed random variablesLinear inequalityTransformation (function)Likelihood-ratio testAsymptotic distributionApplied mathematicsConditional probability distributionStatistics Probability and UncertaintyStochastic orderingStatistical hypothesis testingMathematicsJournal of the American Statistical Association
researchProduct

Fast and universal estimation of latent variable models using extended variational approximations

2022

AbstractGeneralized linear latent variable models (GLLVMs) are a class of methods for analyzing multi-response data which has gained considerable popularity in recent years, e.g., in the analysis of multivariate abundance data in ecology. One of the main features of GLLVMs is their capacity to handle a variety of responses types, such as (overdispersed) counts, binomial and (semi-)continuous responses, and proportions data. On the other hand, the inclusion of unobserved latent variables poses a major computational challenge, as the resulting marginal likelihood function involves an intractable integral for non-normally distributed responses. This has spurred research into a number of approx…

Statistics and ProbabilityComputational Theory and Mathematicsmultivariate abundance datamuuttujatlaplace approximationmulti-response dataordinationStatistics Probability and Uncertaintyvariational approximationsgeneralized linear latent variable modelsestimointiTheoretical Computer ScienceStatistics and Computing
researchProduct

Blind Source Separation Based on Joint Diagonalization in R: The Packages JADE and BSSasymp

2017

Blind source separation (BSS) is a well-known signal processing tool which is used to solve practical data analysis problems in various fields of science. In BSS, we assume that the observed data consists of linear mixtures of latent variables. The mixing system and the distributions of the latent variables are unknown. The aim is to find an estimate of an unmixing matrix which then transforms the observed data back to latent sources. In this paper we present the R packages JADE and BSSasymp. The package JADE offers several BSS methods which are based on joint diagonalization. Package BSSasymp contains functions for computing the asymptotic covariance matrices as well as their data-based es…

Statistics and ProbabilityComputer scienceJADE (programming language)02 engineering and technologyLatent variableMachine learningcomputer.software_genre01 natural sciencesBlind signal separation010104 statistics & probabilityMatrix (mathematics)nonstationary source separationMixing (mathematics)0202 electrical engineering electronic engineering information engineeringsecond order source separation0101 mathematicslcsh:Statisticslcsh:HA1-4737computer.programming_languageta113Signal processingta112matematiikkamultivariate time seriesmathematicsbusiness.industryEstimator020206 networking & telecommunicationsriippumattomien komponenttien analyysiindependent component analysis; multivariate time series; nonstationary source separation; performance indices; second order source separationIndependent component analysisperformance indicesstatisticsindependent component analysisArtificial intelligenceStatistics Probability and UncertaintybusinesscomputerAlgorithmSoftwareJournal of Statistical Software
researchProduct

Modeling accident risk at the road level through zero-inflated negative binomial models: A case study of multiple road networks

2021

Abstract This paper presents a case study carried out in multiple cities of the Valencian Community (Spain) to determine the effect of sociodemographic and road characteristics on traffic accident risk. The analyzes are performed at the road segment level, considering the linear network representing the road structure of each city as a spatial lattice. The number of accidents observed in each road segment from 2010 to 2019 is taken as the response variable, and a zero-inflated modeling approach is considered. Count overdispersion and spatial dependence are also accounted for. Despite the complexity and sparsity of the data, the fitted models performed considerably well, with few exceptions.…

Statistics and ProbabilityComputer sciencespatial dependence0208 environmental biotechnologyAccident riskMagnitude (mathematics)Distribution (economics)02 engineering and technologyManagement Monitoring Policy and Law01 natural sciencestraffic accidents010104 statistics & probabilityOverdispersionCovariateStatisticsZero-inflated model0101 mathematicsComputers in Earth SciencesSpatial dependencelattice structurebusiness.industryIntegrated Nested Laplace Approximationzero-inflated model020801 environmental engineeringVariable (computer science)linear networksbusiness
researchProduct

Testing Goodness-of-Fit with the Kernel Density Estimator: GoFKernel

2015

To assess the goodness-of-fit of a sample to a continuous random distribution, the most popular approach has been based on measuring, using either L∞ - or L2 -norms, the distance between the null hypothesis cumulative distribution function and the empirical cumulative distribution function. Indeed, as far as I know, almost all the tests currently available in R related to this issue (ks.test in package stats, ad.test in package ADGofTest, and ad.test, ad2.test, ks.test, v.test and w2.test in package truncgof) use one of these two distances on cumulative distribution functions. This paper (i) proposes dgeometric.test, a new implementation of the test that measures the discrepancy between a s…

Statistics and ProbabilityCumulative distribution functionKernel density estimationProbability density functionKolmogorov–Smirnov testEmpirical distribution functionsymbols.namesakeGoodness of fitStatisticssymbolsStatistics Probability and UncertaintyNull hypothesisRandom variablelcsh:Statisticslcsh:HA1-4737SoftwareMathematicsJournal of Statistical Software
researchProduct