Query segmentation based on eigenspace similarity
Zhang, C. ; Sun, N. ; Hu, X. ; Huang, T. ; Chua, T.-S.
Zhang, C.
Hu, X.
Huang, T.
Citations
Altmetric:
Alternative Title
Abstract
Query segmentation is essential to query processing. It aims to tokenize query words into several semantic segments and help the search engine to improve the precision of retrieval. In this paper, we present a novel unsupervised learning approach to query segmentation based on principal eigenspace similarity of query-word-frequency matrix derived from web statistics. Experimental results show that our approach could achieve superior performance of 35.8% and 17.7% in F-measure over the two baselines respectively, i.e. MI (Mutual Information) approach and EM optimization approach. © 2009 ACL and AFNLP.
Keywords
Source Title
ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf.
Publisher
Series/Report No.
Collections
Rights
Date
2009
DOI
Type
Conference Paper