Please use this identifier to cite or link to this item: https://doi.org/10.25540/WVM0-4RNX
DC Field: Value
dc.title: The National University of Singapore SMS Corpus
dc.contributor.author: Chen, T.
dc.contributor.author: Kan Min-Yen
dc.date.accessioned: 2017-11-09T01:09:07Z
dc.date.available: 2015-03-09
dc.date.issued: 2015-03-09
dc.identifier.citation: Chen, T., Kan Min-Yen (2015-03-09). The National University of Singapore SMS Corpus. ScholarBank@NUS Repository. [Dataset]. https://doi.org/10.25540/WVM0-4RNX
dc.identifier.relatedcitation: Tao Chen and Min-Yen Kan (2013). Creating a Live, Public Short Message Service Corpus: The NUS SMS Corpus. Language Resources and Evaluation, 47(2), pages 299-355. URL: https://link.springer.com/article/10.1007%2Fs10579-012-9197-9
dc.identifier.uri: http://scholarbank.nus.edu.sg/handle/10635/137343
dc.identifier.uri: https://doi.org/10.25540/WVM0-4RNX
dc.description.abstract: This is a corpus of SMS (Short Message Service) messages collected for research at the Department of Computer Science at the National University of Singapore. This dataset consists of 67,093 SMS messages taken from the corpus on March 9, 2015. The messages largely originate from Singaporeans, mostly students attending the University. They were collected from volunteers who were made aware that their contributions would be made publicly available. The data collectors opportunistically gathered as much metadata about the messages and their senders as possible, so as to enable different types of analyses.
  This corpus was collected by Tao Chen and Min-Yen Kan. If you use this data, please ensure the following paper is cited. For more details, please refer to the Citation field.
  Tao Chen and Min-Yen Kan (2013). Creating a Live, Public Short Message Service Corpus: The NUS SMS Corpus. Language Resources and Evaluation, 47(2), pages 299-355. URL: https://link.springer.com/article/10.1007%2Fs10579-012-9197-9
dc.rights: Attribution 4.0 International
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/
dc.subject: Corpus creation
dc.subject: Crowdsourcing
dc.subject: Mechanical turk
dc.subject: SMS corpus
dc.subject: Chinese
dc.subject: English
dc.type: Dataset
dc.contributor.department: DEPT OF COMPUTER SCIENCE
dc.description.doi: doi:10.25540/WVM0-4RNX
dc.type.dataset.md
dc.type.dataset.zip
dc.type.dataset.zip
dc.type.dataset.zip
dc.type.dataset.zip
dc.relation.item: 10635/77836
dc.description.contactprofile: KAN MIN-YEN
Appears in Collections: Staff Dataset

Files in This Item:
File                                   Size        Format    Access Settings
README.md                              1.43 kB     Unknown   OPEN
smsCorpus_en_sql_2015.03.09_all.zip    2.04 MB     ZIP       OPEN
smsCorpus_en_xml_2015.03.09_all.zip    2.36 MB     ZIP       OPEN
smsCorpus_zh_sql_2015.03.09.zip        978.75 kB   ZIP       OPEN
smsCorpus_zh_xml_2015.03.09.zip        1.18 MB     ZIP       OPEN

Page view(s): 3,176 (checked on Oct 14, 2021)
Download(s): 939 (checked on Oct 14, 2021)


This item is licensed under a Creative Commons License.