信息网络安全 ›› 2014, Vol. 14 ›› Issue (9): 17-21.doi: 10.3969/j.issn.1671-1122.2014.09.004

• Orginal Article • Previous Articles     Next Articles

A Method of Discriminating Microblog Topic Position based on the Text Classification with Correlation of Subject

WANG Ming-yuan, JIA Yan, ZHOU Bin, HUANG Jiu-ming   

  1. College of Computer, National University of Defense Technology, Changsha Hunan 410073, China
  • Received:2014-08-06 Online:2014-09-01

Abstract: How to discriminate accurately the microblog topic position is one of the highlights in the short essay mining. This paper proposes a method based on the text classification with correlation of subject, which can discriminate users for the topic who is to support or oppose. The correlation of subject often leads to the text that have greatly different features. The method first obtain the topic keywords by extraction technology and mutual information, then classify the text to conversation corpus with the correlation of subject, at last adopt different method to analyze the comprehensive microblog topic position. The experimental results show that the method of correlated adopting machine learning and the uncorrelated adopting dictionary can greatly improve the discrimination accuracy. On this basis, we construct a model, can be used for the relevant government departments to monitor the Internet public opinion and business evaluate the products market, etc.

Key words: microblog, topic, position, correlation, naive-bayes