Search results for "hypothesis testing"
showing 10 items of 124 documents
Rejection odds and rejection ratios: A proposal for statistical practice in testing hypotheses
2016
Much of science is (rightly or wrongly) driven by hypothesis testing. Even in situations where the hypothesis testing paradigm is correct, the common practice of basing inferences solely on p-values has been under intense criticism for over 50 years. We propose, as an alternative, the use of the odds of a correct rejection of the null hypothesis to incorrect rejection. Both pre-experimental versions (involving the power and Type I error) and post-experimental versions (depending on the actual data) are considered. Implementations are provided that range from depending only on the p-value to consideration of full Bayesian analysis. A surprise is that all implementations -- even the full Baye…
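As a rough illustration of the pre-experimental version described in this abstract, the ratio of correct to incorrect rejections can be sketched from the power and the Type I error alone. The function names and the `prior_odds` parameter below are hypothetical, introduced only for this sketch; the abstract does not give the authors' own notation or implementation.

```python
def rejection_ratio(power: float, alpha: float) -> float:
    """Ratio of P(reject H0 | H1 true) to P(reject H0 | H0 true).

    power: probability of rejecting H0 when the alternative is true.
    alpha: Type I error, i.e. probability of rejecting H0 when H0 is true.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must lie in (0, 1]")
    if not 0.0 <= power <= 1.0:
        raise ValueError("power must lie in [0, 1]")
    return power / alpha


def rejection_odds(power: float, alpha: float, prior_odds: float = 1.0) -> float:
    """Odds of a correct to an incorrect rejection.

    prior_odds is an assumption of this sketch: the prior odds of the
    alternative to the null, multiplied into the rejection ratio.
    """
    if prior_odds < 0.0:
        raise ValueError("prior_odds must be non-negative")
    return prior_odds * rejection_ratio(power, alpha)
```

For example, a design with power 0.8 and Type I error 0.05 gives a rejection ratio of about 16; if the alternative is assumed to be ten times less likely a priori than the null (`prior_odds=0.1`), the odds drop to about 1.6.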
Adaptive trial design: a general methodology for censored time to event data.
2008
Adaptive designs allow a clinical trial design to be changed according to interim findings without inflating the type I error. The Inverse Normal method can be considered as an adaptive generalization of classical group sequential designs. The use of the Inverse Normal method for censored survival data was demonstrated only for the logrank statistic. However, the logrank statistic is inefficient in the presence of nuisance covariates affecting survival. We demonstrate how the Inverse Normal method can be applied to Cox regression analysis. The required independence between test statistics of the different stages of the trial can be obtained by two different approaches. One is using the indepen…
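The Inverse Normal method mentioned in this abstract combines stage-wise p-values through their normal quantiles. A minimal sketch of that combination step follows, assuming independent one-sided p-values and weights fixed before the trial starts. The function name is hypothetical, and the sketch covers neither the Cox regression application nor the group sequential stopping boundaries.

```python
import math
from statistics import NormalDist


def inverse_normal_combination(p_values, weights):
    """Combine independent one-sided stage-wise p-values.

    Returns (z, p), where z is the weighted sum of the normal quantiles
    of 1 - p_k, scaled so that z is standard normal under the null, and
    p is the corresponding one-sided combined p-value.
    """
    if len(p_values) != len(weights) or not p_values:
        raise ValueError("need one weight per p-value")
    if any(not 0.0 < p < 1.0 for p in p_values):
        raise ValueError("p-values must lie strictly between 0 and 1")
    std_normal = NormalDist()
    # Scale by the Euclidean norm so the weights need not be pre-normalized.
    norm = math.sqrt(sum(w * w for w in weights))
    if norm == 0.0:
        raise ValueError("weights must not all be zero")
    z = sum(w * std_normal.inv_cdf(1.0 - p)
            for p, w in zip(p_values, weights)) / norm
    return z, 1.0 - std_normal.cdf(z)
```

With equal weights, two stages that each yield p = 0.025 combine to a z-value of roughly 2.77, which is stronger evidence than either stage alone.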
V: p-values: what they say and what they do not …
2002
Both an extensive data description and an explicit assessment of a study result's statistical significance should be presented in the result section of a clinical trial report. Whereas the description illustrates the order and clinical relevance of the study findings, the statistical significance describes its generalizability to patients not included in the clinical trial: Despite the random recruitment of patients into a trial, the study results may fail to represent clinical reality (for example the trial might show falsely positive efficacy findings, whereas in "clinical reality" efficacy appears rather limited). A p value measures the statistical significance of a study result -- the s…
Computation Cluster Validation in the Big Data Era
2017
Data-driven class discovery, i.e., the inference of cluster structure in a dataset, is a fundamental task in Data Analysis, in particular for the Life Sciences. We provide a tutorial on the most common approaches used for that task, focusing on methodologies for the prediction of the number of clusters in a dataset. Although the methods that we present are general in terms of the data for which they can be used, we offer a case study relevant for Microarray Data Analysis.
Bayesian hypothesis testing: A reference approach
2002
For any probability model M={p(x|θ, ω), θ∈Θ, ω∈Ω} assumed to describe the probabilistic behaviour of data x∈X, it is argued that testing whether or not the available data are compatible with the hypothesis H0={θ=θ0} is best considered as a formal decision problem on whether to use (a0), or not to use (a1), the simpler probability model (or null model) M0={p(x|θ0, ω), ω∈Ω}, where the loss difference L(a0, θ, ω) − L(a1, θ, ω) is proportional to the amount of information δ(θ0, ω), which would be lost if the simplified model M0 were used as a proxy for the assumed model M. For any prior distribution π(θ, ω), the appropriate normative solution is obtained by rejecting the null model M0 wh…
Influence of Background Knowledge and Language Proficiency on Comprehension of Domain-Specific Texts by University Students
2019
This paper presents the results of a quantitative study that explores two factors contributing to reading comprehension of domain-specific texts, namely level of language proficiency and background knowledge. Overall, 32 students participated in the study by taking two custom-designed reading comprehension tests. The test scores were further analyzed using SPSS statistical software. The results of statistical tests revealed differences between study groups as well as the effects of compensation. More precisely, the most proficient group scored higher on almost all tests and completed the tests more quickly than the remaining groups. The statistical tools used to test the data showed th…
Testing for goodness rather than lack of fit of continuous probability distributions.
2021
The vast majority of testing procedures presented in the literature as goodness-of-fit tests fail to accomplish what the term promises. Actually, a significant result of such a test indicates that the true distribution underlying the data differs substantially from the assumed model, whereas the true objective is usually to establish that the model fits the data sufficiently well. Meeting that objective requires carrying out a testing procedure for a problem in which the statement that the deviations between model and true distribution are small plays the role of the alternative hypothesis. Testing procedures of this kind, for which the term tests for equivalence has been coined in sta…
Statistics-driven Development of OBD Systems: An Overview
2006
Automotive on-board diagnostic (OBD) systems are designed to keep critical components under control during vehicle functioning, and to alert the driver in case of severe malfunctions. OBD systems aimed at reducing polluting emissions are mandatory on new motor vehicles. Some research projects conducted in cooperation between universities and the automotive industry have been quite successful in terms of knowledge advancement and industrial gain. An updated overview of the adopted methodologies and the results obtained is given in this article. Such results can be valuable for both theorists and practitioners, since they witness the use of statistics as a powerful catalyst of technical progress…
Finding condensed descriptions for multi-dimensional data.
1976
We describe two programs that may be used to find condensed descriptions for data available in a contingency table or in a covariance matrix in the case that these data follow a multinomial or a multivariate normal distribution, respectively. The programs perform a stepwise model search among multiplicative models by computing appropriate likelihood-ratio test statistics.
Inference for the interclass correlation in familial data using small sample asymptotics
2012
Inference on the parent-offspring correlation coefficient is an important problem in the analysis of familial data, and point estimates and likelihood based inference are available in the literature. In this work, corrections for the signed log-likelihood ratio test statistics are proposed, based on small sample asymptotics, in order to achieve accurate small sample performance. The corrected statistic can be used for hypothesis testing as well as for interval estimation.