Search results for "tilastomenetelmät"
showing 10 items of 43 documents
How to reach optimal estimates of confidence intervals in microscopic counting of phytoplankton?
2021
Abstract Present practices in the microscopic counting of phytoplankton to estimate the reliability of results rely on the assumption of a random distribution of taxa in sample preparations. In contrast to that and in agreement with the literature, we show that aggregated distribution is common and can lead to over-optimistic confidence intervals, if estimated according to the shortcut procedure of Lund et al. based on the number of counted cells. We found a good linear correlation between the distribution independent confidence intervals for medians and those for parametric statistics so that 95% confidence intervals can be approximated by using a correction factor of 1.4. Instead, the rec…
On the statistics of pairs of logarithms of integers
2022
We study the statistics of pairs of logarithms of positive integers at various scalings, either with trivial weights or with weights given by the Euler function, proving the existence of pair correlation functions. We prove that at the linear scaling, which is not the usual scaling by the inverse of the average gap, the pair correlations exhibit a level repulsion similar to radial distribution functions of fluids. We prove total loss of mass phenomena at superlinear scalings, and constant nonzero asymptotic behavior at sublinear scalings. The case of Euler weights has applications to the pair correlation of the lengths of common perpendicular geodesic arcs from the maximal Margulis cusp nei…
Requirement analysis for an artificial intelligence model for the diagnosis of the COVID-19 from chest X-ray data
2021
There are multiple papers published about different AI models for the COVID-19 diagnosis with promising results. Unfortunately according to the reviews many of the papers do not reach the level of sophistication needed for a clinically usable model. In this paper I go through multiple review papers, guidelines, and other relevant material in order to generate more comprehensive requirements for the future papers proposing a AI based diagnosis of the COVID-19 from chest X-ray data (CXR). Main findings are that a clinically usable AI needs to have an extremely good documentation, comprehensive statistical analysis of the possible biases and performance, and an explainability module.
Can visualization alleviate dichotomous thinking? Effects of visual representations on the cliff effect
2021
Common reporting styles for statistical results in scientific articles, such as $p$ p -values and confidence intervals (CI), have been reported to be prone to dichotomous interpretations, especially with respect to the null hypothesis significance testing framework. For example when the $p$ p -value is small enough or the CIs of the mean effects of a studied drug and a placebo are not overlapping, scientists tend to claim significant differences while often disregarding the magnitudes and absolute differences in the effect sizes. This type of reasoning has been shown to be potentially harmful to science. Techniques relying on the visual estimation of the strength of evidence have been recom…
Statistical models and inference for spatial point patterns with intensity-dependent marks
2009
Marketing or methodology? : Exposing the fallacies of PLS with simple demonstrations
2023
Purpose Over the past 20 years, partial least squares (PLS) has become a popular method in marketing research. At the same time, several methodological studies have demonstrated problems with the technique but have had little impact on its use in marketing research practice. This study aims to present some of these criticisms in a reader-friendly way for non-methodologists. Design/methodology/approach Key critiques of PLS are summarized and demonstrated using existing data sets in easily replicated ways. Recommendations are made for assessing whether PLS is a useful method for a given research problem. Findings PLS is fundamentally just a way of constructing scale scores for regression. PL…
Rejoinder: fractures in the edifice of PLS
2023
Purpose This study aims to provide a response to the commentary by Yuan on the paper “Marketing or Methodology” in this issue of EJM. Design/methodology/approach Conceptual argument and statistical discussion. Findings The authors find that some of Yuan’s arguments are incorrect, or unclear. Further, rather than contradicting the authors’ conclusions, the material provided by Yuan in his commentary actually provides additional reasons to avoid partial least squares (PLS) in marketing research. As such, Yuan’s commentary is best understood as additional evidence speaking against the use of PLS in real-world research. Research limitations/implications This rejoinder, coupled with Yuan’s comm…
Optimization of spodumene identification by statistical approach for laser-induced breakdown spectroscopy data of lithium pegmatite ores
2021
Mapping with laser-induced breakdown spectroscopy (LIBS) can offer more than just the spatial distribution of elements: the rich spectral information also enables mineral recognition. In the present study, statistical approaches were used for the recognition of the spodumene from lithium pegmatite ores. A broad spectral range (280–820 nm) with multiple lines was first used to establish the methods based on vertex component analysis (VCA) and K-means and DBSCAN clusterings. However, with a view to potential on-site applications, the dimensions of the datasets must be reduced in order to accomplish fast analysis. Therefore, the capability of the methods in mineral identification was tested wi…
CLEASE: a versatile and user-friendly implementation of cluster expansion method
2018
Materials exhibiting a substitutional disorder such as multicomponent alloys and mixed metal oxides/oxyfluorides are of great importance in many scientific and technological sectors. Disordered materials constitute an overwhelmingly large configurational space, which makes it practically impossible to be explored manually using first-principles calculations such as density functional theory due to the high computational costs. Consequently, the use of methods such as cluster expansion (CE) is vital in enhancing our understanding of the disordered materials. CE dramatically reduces the computational cost by mapping the first-principles calculation results on to a Hamiltonian which is much fa…
Importance sampling type estimators based on approximate marginal Markov chain Monte Carlo
2020
We consider importance sampling (IS) type weighted estimators based on Markov chain Monte Carlo (MCMC) targeting an approximate marginal of the target distribution. In the context of Bayesian latent variable models, the MCMC typically operates on the hyperparameters, and the subsequent weighting may be based on IS or sequential Monte Carlo (SMC), but allows for multilevel techniques as well. The IS approach provides a natural alternative to delayed acceptance (DA) pseudo-marginal/particle MCMC, and has many advantages over DA, including a straightforward parallelisation and additional flexibility in MCMC implementation. We detail minimal conditions which ensure strong consistency of the sug…