Automatic generation of labelled data for word sense disambiguation

Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/13822

Title:	Automatic generation of labelled data for word sense disambiguation
Authors:	WANG YUNYAN
Keywords:	word sense disambiguation, manually labelled data, synonyms, hypernyms, AQUAINT, K-nearest
Issue Date:	4-May-2004
Citation:	WANG YUNYAN (2004-05-04). Automatic generation of labelled data for word sense disambiguation. ScholarBank@NUS Repository.
Abstract:	In this thesis, we proposed and evaluated a method for performing word sense disambiguation. Unlike commonly used machine learning methods, the proposed method does not use manually labeled data for training classifiers in order to perform word sense disambiguation. In this method, we first extract the instances that the Synonyms or Hyprnyms appear from the AQUAINT collection using Managing Gigabytes. Compare their feature with feature of the instance to be predicted using K-nearest neighbors belong to is selected as the predicted sense. We evaluated the method on the nouns of the SENSEVAL-1 English Trainable Sample Task and SENSEVAL-2 English Lexical Sample Task and showed that the method performed well relative to the predictor that used the most common sense of the word as identified by WordNet as prediction.
URI:	http://scholarbank.nus.edu.sg/handle/10635/13822
Appears in Collections:	Master's Theses (Open)

File	Description	Size	Format	Access Settings	Version
Thesis.pdf		509.3 kB	Adobe PDF	OPEN	None	View/Download

Check