[2311.09812] Large Language Models for Propaganda Span Annotation



By Maram Hasanain and 2 other authors

Abstract: The use of propagandistic techniques in online content has increased in recent years, with the aim of manipulating online audiences. Efforts to automatically detect and debunk such content have addressed various modeling scenarios. These include determining whether the content (text, image, or multimodal) (i) is propagandistic, (ii) employs one or more propagandistic techniques, and (iii) includes techniques with identifiable spans. Significantly more research effort has been devoted to the first two scenarios than to the third. Therefore, in this study, we focus on the task of detecting propagandistic textual spans. Specifically, we investigate whether large language models (LLMs), such as GPT-4, can effectively perform the task. Moreover, we study the potential of employing the model to collect more cost-effective annotations. Our experiments use a large-scale in-house dataset consisting of annotations from human annotators with varying expertise levels. The results suggest that providing more information to the model as prompts improves its performance compared to human annotations. Moreover, our work is the first to show the potential of utilizing LLMs to develop annotated datasets for this specific task by prompting the model with annotations from human annotators with limited expertise. We plan to make the collected span-level labels from multiple annotators, including GPT-4, available to the community.

Submission history

From: Maram Hasanain
[v1] Thu, 16 Nov 2023 11:37:54 UTC (4,600 KB)
[v2] Sun, 14 Jan 2024 06:32:09 UTC (4,604 KB)


