Title: Developing A Multilabel Corpus for the Quality Assessment of Online Political Talk
Authors: Jaidka, Kokil 
Issue Date: 24-Jun-2022
Publisher: European Language Resources Association (ELRA)
Citation: Jaidka, Kokil (2022-06-24). Developing A Multilabel Corpus for the Quality Assessment of Online Political Talk. Language Resources and Evaluation Conference 13 (1) : 5503-5510. ScholarBank@NUS Repository.
Abstract: This paper motivates and presents the Twitter Deliberative Politics dataset, a corpus of political tweets labeled for their deliberative characteristics. The corpus was randomly sampled from replies to US congressmen and congresswomen. It is expected to be useful to a general community of computational linguists, political scientists, and social scientists interested in the study of online political expression, computer-mediated communication, and political deliberation. The data sampling and annotation methods are discussed, and classic machine learning approaches are evaluated for their predictive performance on the different deliberative facets. The paper concludes with a discussion of future work aimed at developing dictionaries for the quality assessment of online political talk in English. The dataset and a demo dashboard are available at
Source Title: Language Resources and Evaluation Conference
Appears in Collections: Staff Publications

Files in This Item:
File: 2022.lrec-1.589.pdf
Description: Published version
Size: 1.03 MB
Format: Adobe PDF



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.