Please use this identifier to cite or link to this item: https://doi.org/10.1145/1989323.1989391
Title: TI: An efficient indexing mechanism for real-time search on tweets
Authors: Chen, C.
Li, F.
Ooi, B.C. 
Wu, S. 
Keywords: index
ranking
real-time search
Issue Date: 2011
Citation: Chen, C.,Li, F.,Ooi, B.C.,Wu, S. (2011). TI: An efficient indexing mechanism for real-time search on tweets. Proceedings of the ACM SIGMOD International Conference on Management of Data : 649-660. ScholarBank@NUS Repository. https://doi.org/10.1145/1989323.1989391
Abstract: Real-time search dictates that new contents be made available for search immediately following their creation. From the database perspective, this requirement may be quite easily met by creating an up-to-date index for the contents and measuring search quality by the time gap between insertion time and availability of the index. This approach, however, poses new challenges for micro-blogging systems where thousands of concurrent users may upload their micro-blogs or tweets simultaneously. Due to the high update and query loads, conventional approaches would either fail to index the huge amount of newly created contents in real time or fall short of providing a scalable indexing service. In this paper, we propose a tweet index called the TI (Tweet Index), an adaptive indexing scheme for microblogging systems such as Twitter. The intuition of the TI is to index the tweets that may appear as a search result with high probability and delay indexing some other tweets. This strategy significantly reduces the indexing cost without compromising the quality of the search results. In the TI, we also devise a new ranking scheme by combining the relationship between the users and tweets. We group tweets into topics and update the ranking of a topic dynamically. The experiments on a real Twitter dataset confirm the efficiency of the TI. © 2011 ACM.
Source Title: Proceedings of the ACM SIGMOD International Conference on Management of Data
URI: http://scholarbank.nus.edu.sg/handle/10635/41822
ISBN: 9781450306614
ISSN: 07308078
DOI: 10.1145/1989323.1989391
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.