|Title:||CoNLL-2014 Shared Task: Grammatical Error Correction|
|Creators:||NG HWEE TOU |
Wu Siew Mei
Raymond Hendy Susanto
|NUS Contact:||NG HWEE TOU |
Wu Siew Mei
|Subject:||Natural Language Processing|
Natural Langauge Learning
Grammatical error detection
Grammatical error correction
The NUS Corpus of Learner English (NUCLE) was collected in a collaboration project between the National University of Singapore (NUS) Natural Language Processing (NLP) Group led by Prof. Hwee Tou Ng and the NUS Centre for English Language Communication (CELC) led by Prof. Siew Mei Wu. The work was carried out as part of the PhD thesis research of Daniel Dahlmeier at the NUS NLP Group.
The corpus consists of about 1,400 essays written by university students at the National University of Singapore on a wide range of topics, such as environmental pollution, healthcare, etc. It contains over one million words which are completely annotated with error tags and corrections. All annotations have been performed by professional English instructors at the NUS CELC.
The corpus is distributed under the standard NUS licensing agreement the terms and conditions for which are provided below. Please read the terms and conditions carefully.
To download and reuse this dataset, please refer to instructions on https://doi.org/10.25542/BC41-IY5R.
|Citation:||When using this data, please cite the original publication and also the dataset.|
|License:||Please refer to the document "nucle_license.pdf" and relevant information on https://doi.org/10.25542/BC41-IY5R.|
|Appears in Collections:||Staff Dataset|
Show full item record
Files in This Item:
|nucle_license.pdf||License Form||45.08 kB||Adobe PDF|
|official_submissions.tar.gz||Corrected system outputs of 12 participating teams||567.25 kB||Unknown|
|m2scorer.tar.gz||Official Scorer (version 3.2)||22.3 kB||Unknown|
|conll14st-test-data.tar.gz||Annotated Test Data. To download and reuse this dataset, please refer to instructions on https://doi.org/10.25542/BC41-IY5R.||628.4 kB||Unknown|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.