
Proceedings Paper

Torpedo: topic periodicity discovery from text data
Author(s): Jingjing Wang; Hongbo Deng; Jiawei Han

Paper Abstract

Although history may not repeat itself, many human activities are inherently periodic, recurring daily, weekly, monthly, yearly or following some other periods. Such recurring activities may not repeat the same set of keywords, but they do share similar topics. Thus it is interesting to mine topic periodicity from text data instead of just looking at the temporal behavior of a single keyword/phrase. Some previous preliminary studies in this direction prespecify a periodic temporal template for each topic. In this paper, we remove this restriction and propose a simple yet effective framework Torpedo to mine periodic/recurrent patterns from text, such as news articles, search query logs, research papers, and web blogs. We first transform text data into topic-specific time series by a time dependent topic modeling module, where each of the time series characterizes the temporal behavior of a topic. Then we use time series techniques to detect periodicity. Hence we both obtain a clear view of how topics distribute over time and enable the automatic discovery of periods that are inherent in each topic. Theoretical and experimental analyses demonstrate the advantage of Torpedo over existing work.
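The second stage described above (detecting periodicity in a topic-specific time series) can be illustrated with a minimal sketch. It assumes the topic modeling stage has already produced one value per time step for a topic. The abstract does not say which time series technique Torpedo uses, so the periodogram peak below is a generic stand-in, not the authors' method. The function name `dominant_period` and the synthetic series `topic_share` are hypothetical and exist only for illustration.

```python
import numpy as np


def dominant_period(series):
    """Return the period (in time steps) of the strongest periodogram peak.

    Generic illustration only; not the detection method from the paper.
    """
    x = np.asarray(series, dtype=float)
    x = x - x.mean()                        # remove the constant (zero-frequency) component
    power = np.abs(np.fft.rfft(x)) ** 2     # periodogram: squared magnitude per frequency bin
    freqs = np.fft.rfftfreq(len(x), d=1.0)  # frequencies in cycles per time step
    k = 1 + np.argmax(power[1:])            # strongest bin, skipping zero frequency
    return 1.0 / freqs[k]


# Hypothetical topic time series: 140 daily values with a weekly cycle plus small noise.
rng = np.random.default_rng(0)
days = np.arange(140)
topic_share = 0.2 + 0.1 * np.cos(2 * np.pi * days / 7) + 0.01 * rng.standard_normal(140)

period = dominant_period(topic_share)
print(f"estimated period: {period:.2f} days")
```

On this synthetic weekly signal the estimate should come out at about 7 days. Real topic series are noisier and may carry several periods at once, so a single peak pick like this is only a starting point.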

Paper Details

Date Published: 27 May 2015
PDF: 10 pages
Proc. SPIE 9499, Next-Generation Analyst III, 94990A (27 May 2015); doi: 10.1117/12.2180097
Jingjing Wang, Univ. of Illinois at Urbana-Champaign (United States)
Hongbo Deng, Yahoo! Inc. (United States)
Jiawei Han, Univ. of Illinois at Urbana-Champaign (United States)

Published in SPIE Proceedings Vol. 9499:
Next-Generation Analyst III
Barbara D. Broome; Timothy P. Hanratty; David L. Hall; James Llinas, Editor(s)
