https://doi.org/10.25540/WVM0-4RNX
Title: | The National University of Singapore SMS Corpus |
Creators: | Chen, T.; Kan, Min-Yen |
NUS Contact: | Min-Yen Kan |
Subject: | Corpus creation; Crowdsourcing; Mechanical Turk; SMS corpus; Chinese; English |
DOI: | doi:10.25540/WVM0-4RNX |
Description: | This is a corpus of SMS (Short Message Service) messages collected for research at the Department of Computer Science at the National University of Singapore. This dataset consists of 67,093 SMS messages taken from the corpus on March 9, 2015. The messages largely originate from Singaporeans, mostly students attending the University. They were collected from volunteers who were made aware that their contributions would be made publicly available. The data collectors opportunistically gathered as much metadata about the messages and their senders as possible, so as to enable different types of analyses. This corpus was collected by Tao Chen and Min-Yen Kan. If you use this data, please cite the associated paper; see the Citation field for details. |
Related Publications: | 10635/77836 |
Citation: | When using this data, please cite both the original publication and the dataset. |
License: | Attribution 4.0 International http://creativecommons.org/licenses/by/4.0/ |
Appears in Collections: | Staff Dataset |
Files in This Item:
File | Description | Size | Format | Access Settings |
---|---|---|---|---|
README.md | | 1.43 kB | Unknown | OPEN |
smsCorpus_en_sql_2015.03.09_all.zip | | 2.04 MB | ZIP | OPEN |
smsCorpus_en_xml_2015.03.09_all.zip | | 2.36 MB | ZIP | OPEN |
smsCorpus_zh_sql_2015.03.09.zip | | 978.75 kB | ZIP | OPEN |
smsCorpus_zh_xml_2015.03.09.zip | | 1.18 MB | ZIP | OPEN |
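
As a first look at one of the XML archives listed above, the following minimal sketch counts the element tags found in every `.xml` member of a ZIP file. This page does not describe the XML schema, so the code makes no assumption about element or attribute names. The function name `count_xml_tags` is hypothetical and is not part of the dataset, and the sketch assumes the archive has already been downloaded to a local path.

```python
import zipfile
import xml.etree.ElementTree as ET
from collections import Counter


def count_xml_tags(zip_path):
    """Count element tags across every .xml member of a ZIP archive.

    Hypothetical helper: it assumes nothing about the corpus schema and
    only reports which tags occur and how often.
    """
    counts = Counter()
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            # Skip anything that is not an XML file.
            if not name.lower().endswith(".xml"):
                continue
            with archive.open(name) as member:
                # iterparse streams the member instead of loading it whole.
                for _event, element in ET.iterparse(member, events=("end",)):
                    counts[element.tag] += 1
                    # Free the element's contents once it has been counted.
                    element.clear()
    return counts
```

Calling `count_xml_tags("smsCorpus_en_xml_2015.03.09_all.zip")` on a downloaded copy and printing `counts.most_common()` should reveal which tag corresponds to an individual message, which can then be checked against the 67,093 messages stated in the description and against README.md.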