Search results for "Statistica"
showing 10 items of 5969 documents
Bivariate logistic models for the analysis of the students' University "success"
2012
We analyze the students’ success at University by considering their performance in terms of both “qualitative performance”, measured by their grade average, and “quantitative performance”, measured by University Credits accumulated. To jointly model both marginal and association relationships with covariates, the analysis has been carried out by fitting a bivariate ordered logistic model (BOLM), in a nonparametric fashion, by penalized maximum likelihood estimation. The advantages of such model are in terms of parsimony and parameters interpretation, while preserving goodness-of-fit. The application regards an engineering student (ES) cohort from the University of Palermo.
Boosting for ranking data: an extension to item weighting
2021
Gli alberi decisionali sono una tecnica predittiva di machine learning particolarmente diffusa, utilizzata per prevedere delle variabili discrete (classificazione) o continue (regressione). Gli algoritmi alla base di queste tecniche sono intuitivi e interpretabili, ma anche instabili. Infatti, per rendere la classificazione più affidabile si `e soliti combinare l’output di più alberi. In letteratura, sono stati proposti diversi approcci per classificare ranking data attraverso gli alberi decisionali, ma nessuno di questi tiene conto ne dell’importanza, ne delle somiglianza dei singoli elementi di ogni ranking. L’obiettivo di questo articolo `e di proporre un’estensione ponderata del metodo …
Multi-Destination Trips and Tourism Statistics: Empirical Evidences in Sicily
2012
Abstract The knowledge of the actual magnitude and main features of tourism flows in a given destination is an essential prerequisite for the evaluation of tourism impacts and externalities. Indeed, many pleasure trips are often characterized by the visit to more than a single destination. Although the topic is well-documented in literature, the empirical results are limited to a few pioneering studies. The lack may be attributable to the failure of tourism organizations to collect data on multi-destination trip behaviour. This can be seen, for example, in the system of European statistics on tourism (according to the Council Directive 95/57 EC), where information on the average number of v…
Comment on "Estimating average annual per cent change in trend analysis"
2010
We discuss some issues relevant to paper of Clegg and co-authors published in Statistics in Medicine; 28, 3670-3682. Emphasis is on computation of the variance of the sum of products of two estimates, slopes and breakpoints.
The determination of maturity stages in male elasmobranchs (Chondrichthyes) using a segmented regression of clasper length on total length
2013
A novel statistical method for estimating the stages of maturity in male sharks and skates based on a segmented regression (SRM) is proposed. We hypothesize that this method is able to find the transition points in the three-phase relationship between total length (TL) and clasper length (CL). We applied an SRM to TL–CL data of nine species, from large pelagic sharks (e.g., Carcharhinus falciformis) to small coastal skates (e.g., Rioraja agassizi), captured in the southwestern Atlantic and northeastern Pacific. As expected, SRM detected two breakpoints, defining three maturity stages (immature, maturing, and mature), in six out of nine species. For three species, it was not possible to fin…
Learning-based multiresolution transforms with application to image compression
2013
In Harten's framework, multiresolution transforms are defined by predicting finer resolution levels of information from coarser ones using an operator, called prediction operator, and defining details (or wavelet coefficients) that are the difference between the exact and predicted values. In this paper we use tools of statistical learning in order to design a more accurate prediction operator in this framework based on a training sample, resulting in multiresolution decompositions with enhanced sparsity. In the case of images, we incorporate edge detection techniques in the design of the prediction operator in order to avoid Gibbs phenomenon. Numerical tests are presented showing that the …
贝叶斯因子及其在JASP中的实现
2018
Statistical inference plays a critical role in modern scientific research, however, the dominant method for statistical inference in science, null hypothesis significance testing (NHST), is often misunderstood and misused, which leads to unreproducible findings. To address this issue, researchers propose to adopt the Bayes factor as an alternative to NHST. The Bayes factor is a principled Bayesian tool for model selection and hypothesis testing, and can be interpreted as the strength for both the null hypothesis H0 and the alternative hypothesis H1 based on the current data. Compared to NHST, the Bayes factor has the following advantages: it quantifies the evidence that the data provide for…
Automating statistical diagrammatic representations with data characterization
2017
The search for an efficient method to enhance data cognition is especially important when managing data from multidimensional databases. Open data policies have dramatically increased not only the volume of data available to the public, but also the need to automate the translation of data into efficient graphical representations. Graphic automation involves producing an algorithm that necessarily contains inputs derived from the type of data. A set of rules are then applied to combine the input variables and produce a graphical representation. Automated systems, however, fail to provide an efficient graphical representation because they only consider either a one-dimensional characterizat…
Multiple Classifiers and Data Fusion for Robust Diagnosis of Gearbox Mixed Faults
2019
Detection and isolation of single and mixed faults in a gearbox are very important to enhance the system reliability, lifetime, and service availability. This paper proposes a hybrid learning algorithm, consisting of multilayer perceptron (MLP)- and convolutional neural network (CNN)-based classifiers, for diagnosis of gearbox mixed faults. Domain knowledge features are required to train the MLP classifier, while the CNN classifier can learn features itself, allowing to reduce the required knowledge features for the counterpart. Vibration data from an experimental setup with gearbox mixed faults is used to validate the effectiveness of the algorithms and compare them with conventional metho…
Comprehensive Strategy for Proton Chemical Shift Prediction: Linear Prediction with Nonlinear Corrections
2014
A fast 3D/4D structure-sensitive procedure was developed and assessed for the chemical shift prediction of protons bonded to sp3carbons, which poses the maybe greatest challenge in the NMR spectral parameter prediction. The LPNC (Linear Prediction with Nonlinear Corrections) approach combines three well-established multivariate methods viz. the principal component regression (PCR), the random forest (RF) algorithm, and the k nearest neighbors (kNN) method. The role of RF is to find nonlinear corrections for the PCR predicted shifts, while kNN is used to take full advantage of similar chemical environments. Two basic molecular models were also compared and discussed: in the MC model the desc…