Please use this identifier to cite or link to this item:
https://doi.org/10.1007/s00799-018-0242-1
Title: | Neural ParsCit: a deep learning-based reference string parser | Authors: | NIKHILA PRASAD A.N. Kaur, M KAN MIN-YEN |
Issue Date: | 19-May-2018 | Publisher: | Springer Science and Business Media LLC | Citation: | NIKHILA PRASAD A.N., Kaur, M, KAN MIN-YEN (2018-05-19). Neural ParsCit: a deep learning-based reference string parser. International Journal on Digital Libraries 19 (4) : 323-337. ScholarBank@NUS Repository. https://doi.org/10.1007/s00799-018-0242-1 | Abstract: | We present a deep learning approach for the core digital libraries task of parsing bibliographic reference strings. We deploy the state-of-the-art long short-term memory (LSTM) neural network architecture, a variant of a recurrent neural network to capture long-range dependencies in reference strings. We explore word embeddings and character-based word embeddings as an alternative to handcrafted features. We incrementally experiment with features, architectural configurations, and the diversity of the dataset. Our final model is an LSTM-based architecture, which layers a linear chain conditional random field (CRF) over the LSTM output. In extensive experiments in both English in-domain (computer science) and out-of-domain (humanities) test cases, as well as multilingual data, our results show a significant gain (p< 0.01) over the reported state-of-the-art CRF-only-based parser. | Source Title: | International Journal on Digital Libraries | URI: | https://scholarbank.nus.edu.sg/handle/10635/194766 | ISSN: | 1432-5012 1432-1300 |
DOI: | 10.1007/s00799-018-0242-1 |
Appears in Collections: | Staff Publications Elements |
Show full item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
neural-parscit-deep.pdf | Accepted version | 440.15 kB | Adobe PDF | OPEN | Post-print | View/Download |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.