Netinfo Security ›› 2025, Vol. 25 ›› Issue (2): 281-294.doi: 10.3969/j.issn.1671-1122.2025.02.009

Previous Articles     Next Articles

Research on Offensive Language Detection in Social Networks Based on Emotion-Assisted Multi-Task Learning

JIN Di1,2,3, REN Hao1,2,3, TANG Rui1,2,3, CHEN Xingshu1,2,3, WANG Haizhou1,2,3()   

  1. 1. School of Cyber Science and Engineering, Sichuan University, Chengdu 610065, China
    2. Key Laboratory of Data Protection and Intelligent Management, Ministry of Education, Chengdu 610065, China
    3. China Cyber Science Research Institute, Sichuan University, Chengdu 610065, China
  • Received:2024-12-10 Online:2025-02-10 Published:2025-03-07

Abstract:

With the rapid development of the Internet and mobile Internet technologies, more and more people are eager to obtain information and express their views and opinions on social networks. However, in recent years, social networks have been flooded with an increasing amount of offensive language and other undesirable comments, leading to the proliferation of online violence. Currently, research on offensive language detection is mostly concentrated in the English language field, with few studies focused on offensive language detection in Chinese. To address this issue, this thesis collected a large amount of tweet data from the Sina Weibo platform and annotated the data according to established rules to construct a Chinese offensive language dataset. Then, statistical features, including sentiment features, content features, and communication features, were extracted. Finally, a multi-task learning-based offensive language detection model was constructed. The auxiliary task of sentiment analysis was introduced to improve the detection performance of the model by leveraging the high correlation between the two tasks. Experimental results show that the model proposed in this thesis outperforms other commonly used detection methods for offensive language detection. The research provides methods and ideas for future offensive language detection on social networks.

Key words: offensive language, multi-task learning, social networks, deep learning

CLC Number: