Netinfo Security ›› 2015, Vol. 15 ›› Issue (10): 46-52.doi: 10.3969/j.issn.1671-1122.2015.10.007

Previous Articles     Next Articles

Research on Microblog Hot Topic Detection Method Based on Term Energy Change

LIN Si-juan1,2(), LIN Bo-gang1,2, XU Wei1,2, YANG Yang1,2   

  1. 1. College of Mathematics and Computer Science, Fuzhou University, Fuzhou Fujian 350108, China
    2. Key Lab of Information Security of Network System in Fujian Province, Fuzhou Fujian 350108, China
  • Received:2015-07-22 Online:2015-10-01 Published:2015-11-04

Abstract:

With the popularity of microblog, hot topic detection on microblog has been a hot area of research. Regarding the instantaneity of microblog as a point of penetration, the paper proposes a method of hot topic detection based on change of term energy by studying the change of term energy at different time domain. Based on traditional topic aging theory, the method divides all microblog data into different microblog windows, and introduces the concept of acceleration in physics, which uses the acceleration of terms to describe the change of the speed of the terms in the adjacent window. The paper combines the term acceleration and term weight into a compound weight to quantize term energy better. The paper uses double-conditional probability context similarity computing method based on single-conditional probability, and adds document distribution similarity to decrease the probability of topic confusion. The experiments show that the method is effective and stable in robustness. Compared with single-conditional probability context similarity model, the modified context similarity model has better clustering effect in different keyword detection methods.

Key words: hot topic detection, term energy, acceleration, context similarity

CLC Number: