CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech

Yi-Ling Chung, Elizaveta Kuzmenko, Serra Sinem Tekiroglu, Marco Guerini


Abstract
Although there is an unprecedented effort to provide adequate responses in terms of laws and policies to hate content on social media platforms, dealing with hatred online is still a tough problem. Tackling hate speech in the standard way of content deletion or user suspension may be charged with censorship and overblocking. One alternate strategy, that has received little attention so far by the research community, is to actually oppose hate content with counter-narratives (i.e. informed textual responses). In this paper, we describe the creation of the first large-scale, multilingual, expert-based dataset of hate-speech / counter-narrative pairs. This dataset has been built with the effort of more than 100 operators from three different NGOs that applied their training and expertise to the task. Together with the collected data we also provide additional annotations about expert demographics, hate and response type, and data augmentation through translation and paraphrasing. Finally, we provide initial experiments to assess the quality of our data.
Anthology ID:
P19-1271
Volume:
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
Month:
July
Year:
2019
Address:
Florence, Italy
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
2819–2829
URL:
https://aclanthology.org/P19-1271
DOI:
10.18653/v1/P19-1271
Cite (ACL):
Yi-Ling Chung, Elizaveta Kuzmenko, Serra Sinem Tekiroglu, and Marco Guerini. 2019. CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 2819–2829, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech (Chung et al., ACL 2019)
PDF:
https://aclanthology.org/P19-1271.pdf
Video:
https://vimeo.com/384740828
Code:
marcoguerini/CONAN
Data:
CONAN, Hate Speech