Workshop Paper

Enabling Informational Autonomy through Explanation of Content Moderation: UI Design for Hate Speech Detection

Document type

Text/Workshop Paper

Date

2022

Publisher

Gesellschaft für Informatik e.V.

Abstract

AI-based content moderation, and hate speech detection in particular, has been researched mainly with a focus on natural language processing, classification algorithms, and benchmark data. Less attention has been paid to how these classification systems are later integrated into tools that support users in an application task. In this paper, we review existing tools and prototypes. Furthermore, we design and implement an online user interface for explainability. The system is connected to a neural network classifier based on the HASOC benchmark. The interface allows users to enter messages, observe classification decisions, and see similar messages as an explanation. It supports social media users who want to observe how well AI systems for content moderation, and hate speech detection tools in particular, perform. A qualitative evaluation with experts showed that our system can help bridge the gap between humans and AI.
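
The idea of explaining a classification decision by showing similar messages can be illustrated with a minimal sketch. It is not the authors' implementation: the paper uses a neural network classifier, whereas this example substitutes a plain bag-of-words cosine similarity. The function names, the label strings, and the toy messages are all hypothetical.

```python
import math
import re
from collections import Counter


def vectorize(text):
    """Turn a message into a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def explain(message, labelled, k=2):
    """Return the k labelled messages most similar to `message`.

    Each result is a (similarity, text, label) tuple, most similar first.
    """
    query = vectorize(message)
    scored = [(cosine(query, vectorize(text)), text, label)
              for text, label in labelled]
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored[:k]


# Invented toy data; labels are placeholders, not taken from HASOC.
LABELLED = [
    ("you people are worthless idiots", "offensive"),
    ("what a lovely day for a walk", "not offensive"),
    ("those idiots should just shut up", "offensive"),
    ("thanks for sharing this article", "not offensive"),
]

if __name__ == "__main__":
    for score, text, label in explain("shut up you idiots", LABELLED):
        print(f"{score:.2f}  [{label}]  {text}")
```

Showing the retrieved neighbours together with their labels lets a user judge whether a decision rests on plausible comparisons. A real system would presumably compare learned representations rather than raw word counts.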

Description

Sontheimer, Lukas; Schäfer, Johannes; Mandl, Thomas (2022): Enabling Informational Autonomy through Explanation of Content Moderation: UI Design for Hate Speech Detection. Mensch und Computer 2022 - Workshopband. DOI: 10.18420/muc2022-mci-ws12-260. Bonn: Gesellschaft für Informatik e.V. MCI-WS12: UCAI 2022: Workshop on User-Centered Artificial Intelligence. Darmstadt. September 4-7, 2022.
