RESEARCH PRODUCT

P-Value, Confidence Intervals, and Statistical Inference: A New Dataset of Misinterpretation

Ziyang Lyu, Kaiping Peng, Chuan-peng Hu

subject

Keywords: confidence intervals (CIs); misinterpretation; P-value; statistical inference; replication crisis; Statistics; Psychology; General Psychology; Social Psychology; Social and Behavioral Sciences; Bayes factor; Significance testing; Task force; Confidence interval; Null hypothesis

Classification codes: 05 social sciences; 0501 psychology and cognitive sciences; 050105 experimental psychology; 050109 social psychology; FOS: Psychology; lcsh:Psychology; lcsh:BF1-990

bepress: bepress|Social and Behavioral Sciences; bepress|Social and Behavioral Sciences|Psychology|Social Psychology; bepress|Social and Behavioral Sciences|Psychology|Personality and Social Contexts

PsyArXiv: PsyArXiv|Social and Behavioral Sciences; PsyArXiv|Social and Behavioral Sciences|Social and Personality Psychology

Subcategories under PsyArXiv|Social and Behavioral Sciences|Social and Personality Psychology: Intragroup Processes; Social Cognition; Personality and Creativity; Theories of Personality; Moral Behavior; Testing and Assessment; Self-regulation; Motivational Behavior; Prejudice and Discrimination; Well-being; Social Influence; Affect and Emotion Regulation; Social Well-being; Intergroup Processes; Self and Social Identity; Attitudes and Persuasion; Politics; Individual Differences; Nonverbal Behavior; Interventions; Narrative Research; Diversity; Genetic factors; Interpersonal Relationships; Personality and Situations; Personality Processes; Impression Formation; Violence and Aggression; Disability; Achievement and Status; Prosocial Behavior; Self-esteem; Sexuality; Cultural Differences; Trait Theory; Religion and Spirituality

description

Statistical inference has been essential to science since the twentieth century (Salsburg, 2001). Since its introduction into science, null hypothesis significance testing (NHST), in which the P-value serves as the index of “statistical significance,” has been the most widely used statistical method in psychology (Sterling et al., 1995; Cumming et al., 2007), as well as in other fields (Wasserstein and Lazar, 2016). However, surveys have consistently shown that researchers in psychology may not be able to interpret the P-value and related statistical procedures correctly (Oakes, 1986; Haller and Krauss, 2002; Hoekstra et al., 2014; Badenes-Ribera et al., 2016). Even worse, these misinterpretations may lead to abuse of the P-value, for example, P-hacking (Simmons et al., 2011; John et al., 2012). To counter these misinterpretations and abuses, researchers have proposed many solutions: complementing NHST with estimation-based statistics (Wilkinson and the Task Force on Statistical Inference, 1999; Cumming, 2014), lowering the threshold for “significance” (Benjamin et al., 2017), banning NHST and related procedures altogether (Trafimow and Marks, 2015), and using the Bayes factor (Wagenmakers et al., 2011, 2017). Of these solutions, estimation-based statistics have been adopted by several mainstream psychological journals. One reason is that the confidence intervals (CIs) of estimation-based statistics support better statistical inference, though they do not guarantee it (Coulson et al., 2010). However, the first step toward change is to know the extent to which people in the field misinterpret these statistical indices and how such misinterpretations lead to abuse of these statistical procedures in research. Here we introduce raw data, available to anyone interested in examining how students and researchers misinterpret P-values and CIs, as well as how NHST and CIs influence the interpretation of research results. Part of the results was reported in our previous Chinese-language paper (Hu et al., 2016).

DOI: 10.3389/fpsyg.2018.00868
https://repository.publisso.de/resource/frl:6424562