Please use this identifier to cite or link to this item: https://doi.org/10.25542/BC41-IY5R
Title: CoNLL-2014 Shared Task: Grammatical Error Correction
Creators: Ng Hwee Tou 
Wu Siew Mei 
Ted Briscoe
Christian Hadiwinoto
Raymond Hendy Susanto
Christopher Bryant
NUS Contact: Hwee Tou Ng
Siew Mei Wu
Subject: Natural Language Processing
Natural Langauge Learning
Grammatical error detection
Grammatical error correction
DOI: doi:10.25542/BC41-IY5R
Description: 

The NUS Corpus of Learner English (NUCLE) was collected in a collaboration project between the National University of Singapore (NUS) Natural Language Processing (NLP) Group led by Prof. Hwee Tou Ng and the NUS Centre for English Language Communication (CELC) led by Prof. Siew Mei Wu. The work was carried out as part of the PhD thesis research of Daniel Dahlmeier at the NUS NLP Group.

The corpus consists of about 1,400 essays written by university students at the National University of Singapore on a wide range of topics, such as environmental pollution, healthcare, etc. It contains over one million words which are completely annotated with error tags and corrections. All annotations have been performed by professional English instructors at the NUS CELC.

The corpus is distributed under the standard NUS licensing agreement the terms and conditions for which are provided below. Please read the terms and conditions carefully.

To download and reuse this dataset, please refer to instructions on https://doi.org/10.25542/BC41-IY5R.

Related Publication:

  • Ng, Hwee Tou, & Wu, Siew Mei, & Briscoe, Ted, & Hadiwinoto, Christian, & Susanto, Raymond Hendy, & Bryant, Christopher (2014). The CoNLL-2014 Shared Task on Grammatical Error Correction. In Proceedings of the Eighteenth Conference on Computational Natural Language Learning: Shared Task (CoNLL-2014 Shared Task). Baltimore, Maryland.
Related Publications: http://www.comp.nus.edu.sg/~nlp/conll14st/CoNLLST01.pdf
Citation: When using this data, please cite the original publication and also the dataset.
  • Ng, Hwee Tou, & Wu, Siew Mei, & Briscoe, Ted, & Hadiwinoto, Christian, & Susanto, Raymond Hendy, & Bryant, Christopher (2014). The CoNLL-2014 Shared Task on Grammatical Error Correction. In Proceedings of the Eighteenth Conference on Computational Natural Language Learning: Shared Task (CoNLL-2014 Shared Task). Baltimore, Maryland.
  • Ng Hwee Tou, Wu Siew Mei, Ted Briscoe, Christian Hadiwinoto, Raymond Hendy Susanto, Christopher Bryant (2017-11-06). CoNLL-2014 Shared Task: Grammatical Error Correction. ScholarBank@NUS Repository. [Dataset]. https://doi.org/10.25542/BC41-IY5R
License: Please refer to the document "nucle_license.pdf" and relevant information on https://doi.org/10.25542/BC41-IY5R.
Appears in Collections:Staff Dataset

Show full item record
Files in This Item:
File Description SizeFormatAccess Settings 
nucle_license.pdfLicense Form45.08 kBAdobe PDF

OPEN

View/Download
official_submissions.tar.gzCorrected system outputs of 12 participating teams567.25 kBUnknown

OPEN

View/Download
m2scorer.tar.gzOfficial Scorer (version 3.2)22.3 kBUnknown

OPEN

View/Download
conll14st-test-data.tar.gzAnnotated Test Data. To download and reuse this dataset, please refer to instructions on https://doi.org/10.25542/BC41-IY5R.628.4 kBUnknown

OPEN

View/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.