The Datafication of Hate: Expectations and Challenges in Automated Hate Speech Monitoring.

RESEARCH PRODUCT

Salla-Maaria Laaksonen, Jesse Haapoja, Teemu Kinnunen, Matti Nelimarkka, Reeta Pöyhtäri

subject

Big Data; Computer science; hate speech; social media; 518 Media and communications; monitoring; 050801 communication & media studies; Social issues; 0508 media and communications; politics; data science; Artificial Intelligence; algorithms; 050602 political science & public administration; Computer Science (miscellaneous); algorithmic system; Action research; Objectivity (science); Original Research; lcsh:T58.5-58.64; Datafication; Social phenomenon; lcsh:Information technology; text mining; 05 social sciences; Citizen journalism; 16. Peace & justice; 113 Computer and information sciences; 0506 political science; machine learning; Neutrality; Information Systems

description

Laaksonen, S-M.; Haapoja, J.; Kinnunen, T.; Nelimarkka, M. & Pöyhtäri, R. (2020, accepted). The Datafication of Hate: Expectations and Challenges in Automated Hate Speech Monitoring. Frontiers in Big Data: Data Mining and Management / Critical Data and Algorithm Studies. doi:10.3389/fdata.2020.00003

Hate speech has been identified as a pressing problem in society, and several automated approaches have been designed to detect and prevent it. This paper reports and reflects upon an action research setting consisting of a multi-organizational collaboration conducted during the Finnish municipal elections in 2017, wherein a technical infrastructure was designed to automatically monitor candidates' social media updates for hate speech. The setting allowed us to engage in a twofold investigation.

First, the collaboration offered a unique view for exploring how hate speech emerges as a technical problem. The project developed an adequately performing algorithmic solution using supervised machine learning. We tested the performance of various feature extraction and machine learning methods and ended up using a combination of Bag-of-Words feature extraction with Support-Vector Machines. However, an automated approach required heavy simplification, such as using rudimentary scales for classifying hate speech and relying on word-based approaches, while in reality hate speech is a linguistic and social phenomenon with various tones and forms.

Second, the action-research-oriented setting allowed us to observe affective responses, such as the hopes, dreams, and fears related to machine learning technology. Based on participatory observations, project artifacts and documents, interviews with project participants, and online reactions to the detection project, we identified participants' aspirations for effective automation as well as for the neutrality and objectivity they expected an algorithmic system to introduce. However, the participants expressed more critical views toward the system after the monitoring process.

Our findings highlight how the powerful expectations related to technology can easily end up dominating a project dealing with a contested, topical social issue. We conclude by discussing the problematic aspects of datafying hate and suggesting some practical implications for hate speech recognition.

Peer reviewed
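The description names a combination of Bag-of-Words feature extraction with Support-Vector Machines. The sketch below illustrates that general technique only and is not the project's implementation. The training sentences, labels, function names, and hyperparameters (`epochs`, `lam`, `eta`) are all assumptions made for illustration, and the linear SVM is trained with a plain constant-step subgradient method on the hinge loss instead of a production solver.

```python
from collections import Counter

# Toy, invented training data (assumption): +1 = flagged, -1 = not flagged.
TRAIN = [
    ("they are vermin and must go", 1),
    ("we need more bike lanes", -1),
    ("get rid of those vermin", 1),
    ("vote for better schools", -1),
]


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return text.lower().split()


def build_vocabulary(texts):
    """Map each distinct token to a column index, in order of first appearance."""
    vocab = {}
    for text in texts:
        for token in tokenize(text):
            vocab.setdefault(token, len(vocab))
    return vocab


def bag_of_words(text, vocab):
    """Sparse count vector {column index: count}; unknown tokens are dropped."""
    counts = Counter(tokenize(text))
    return {vocab[t]: c for t, c in counts.items() if t in vocab}


def train_linear_svm(vectors, labels, n_features, epochs=50, lam=0.01, eta=0.1):
    """Minimise L2-regularised hinge loss by constant-step subgradient descent."""
    w = [0.0] * n_features
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(vectors, labels):
            margin = y * (sum(w[i] * v for i, v in x.items()) + b)
            # Regularisation step: shrink all weights (the bias is not shrunk).
            w = [wi * (1.0 - eta * lam) for wi in w]
            # Hinge-loss step: only examples inside the margin move the model.
            if margin < 1.0:
                for i, v in x.items():
                    w[i] += eta * y * v
                b += eta * y
    return w, b


vocab = build_vocabulary(text for text, _ in TRAIN)
vectors = [bag_of_words(text, vocab) for text, _ in TRAIN]
labels = [label for _, label in TRAIN]
w, b = train_linear_svm(vectors, labels, len(vocab))


def predict(text):
    """Return +1 (flagged) or -1 (not flagged) from the sign of the SVM score."""
    x = bag_of_words(text, vocab)
    score = sum(w[i] * v for i, v in x.items()) + b
    return 1 if score > 0 else -1
```

Because the features are word counts, the classifier only sees which words occur and how often. This makes concrete the simplification the description points out: tone, context, and word order are invisible to a word-based model.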

year	journal	country	edition	language
2020-02-05

10.3389/fdata.2020.00003 https://trepo.tuni.fi/handle/10024/136074