Please use this identifier to cite or link to this item: https://doi.org/10.1109/ICDM.2011.44
Title: Cross Domain Random Walk for query intent pattern mining from search engine log
Authors: Gu, S.
Yan, J.
Ji, L.
Yan, S. 
Huang, J.
Liu, N.
Chen, Y.
Chen, Z.
Keywords: Query intent pattern
Random walk
Semi-supervised learning
Transfer learning
Issue Date: 2011
Source: Gu, S., Yan, J., Ji, L., Yan, S., Huang, J., Liu, N., Chen, Y., Chen, Z. (2011). Cross Domain Random Walk for query intent pattern mining from search engine log. Proceedings - IEEE International Conference on Data Mining, ICDM: 221-230. ScholarBank@NUS Repository. https://doi.org/10.1109/ICDM.2011.44
Abstract: Understanding the search intents of users through their condensed short queries has attracted much attention in both academia and industry. The search intents of users are generally assumed to be associated with various query patterns, such as "MobileName price", where "MobileName" could be any named entity of a mobile phone model and the pattern indicates that the user intends to buy a mobile phone. However, discovering query intent patterns for general search is challenging, mainly because it is difficult to collect sufficient training data for learning query patterns across a large number of searchable domains. In this work, we propose the Cross Domain Random Walk (CDRW) algorithm, a semi-supervised method for discovering query intent patterns across different domains from search engine click-through log data. Starting with some manually tagged seed queries in one or more independent domains, CDRW takes the query patterns as a bridge and propagates the transition probability across domains to collect query intent patterns among different domains, based on the assumption that "users who have similar intent in different but similar domains will have high probability to share similar query patterns across domains". Unlike classical random walk algorithms, CDRW walks across different domains to disseminate the shared knowledge in a transfer learning manner. Extensive experimental results on real log data from a commercial search engine validate the effectiveness and efficiency of the proposed algorithm. © 2011 IEEE.
Source Title: Proceedings - IEEE International Conference on Data Mining, ICDM
URI: http://scholarbank.nus.edu.sg/handle/10635/69765
ISBN: 9780769544083
ISSN: 15504786
DOI: 10.1109/ICDM.2011.44
Appears in Collections: Staff Publications

Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.