Indexing Evolving Events from Tweet Streams

Cai, Hongyun, Huang, Zi, Srivastava, Divesh and Zhang, Qing (2015) Indexing Evolving Events from Tweet Streams. IEEE Transactions on Knowledge and Data Engineering, 27 11: 3001-3015. doi:10.1109/TKDE.2015.2445773


Author Cai, Hongyun
Huang, Zi
Srivastava, Divesh
Zhang, Qing
Title Indexing Evolving Events from Tweet Streams
Journal name IEEE Transactions on Knowledge and Data Engineering   Check publisher's open access policy
ISSN 1041-4347
1558-2191
Publication date 2015-11
Year available 2015
Sub-type Article (original research)
DOI 10.1109/TKDE.2015.2445773
Open Access Status Not Open Access
Volume 27
Issue 11
Start page 3001
End page 3015
Total pages 15
Place of publication Piscataway, NJ United States
Publisher Institute of Electrical and Electronics Engineers
Collection year 2016
Language eng
Formatted abstract
Tweet streams provide a variety of real-life and real-time information on social events that dynamically change over time. Although social event detection has been actively studied, how to efficiently monitor evolving events from continuous tweet streams remains open and challenging. One common approach for event detection from text streams is to use single-pass incremental clustering. However, this approach does not track the evolution of events, nor does it address the issue of efficient monitoring in the presence of a large number of events. In this paper, we capture the dynamics of events using four event operations (create, absorb, split, and merge), which can be effectively used to monitor evolving events. Moreover, we propose a novel event indexing structure, called Multi-layer Inverted List (MIL), to manage dynamic event databases for the acceleration of large-scale event search and update. We thoroughly study the problem of nearest neighbour search using MIL based on upper bound pruning, along with incremental index maintenance. Extensive experiments have been conducted on a large-scale real-life tweet dataset. The results demonstrate the promising performance of our event indexing and monitoring methods on both efficiency and effectiveness.
Keyword Event indexing
multi-layer inverted list
event evolution
Twitter
System
Q-Index Code C1
Q-Index Status Confirmed Code
Institutional Status UQ

Document type: Journal Article
Sub-type: Article (original research)
Collections: Official 2016 Collection
School of Information Technology and Electrical Engineering Publications
 
Versions
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 1 times in Thomson Reuters Web of Science Article | Citations
Scopus Citation Count Cited 1 times in Scopus Article | Citations
Google Scholar Search Google Scholar
Created: Sun, 08 Nov 2015, 00:17:38 EST by System User on behalf of School of Information Technol and Elec Engineering