Netinfo Security ›› 2017, Vol. 17 ›› Issue (10): 42-49.doi: 10.3969/j.issn.1671-1122.2017.10.007

• Orginal Article • Previous Articles     Next Articles

Research on Imbalanced Abnormal Data Classification Algorithm Based on Active Learning

Bo WANG, Huaibin WANG   

  1. School of Computer and Communication Engineering, Tianjin University of Technology, Tianjin 300384, China
  • Received:2017-06-28 Online:2017-10-10 Published:2020-05-12

Abstract:

Network security is facing increasingly complex challenges. With the diversification of attack methods and types, the extent of damage is also increasing; network protection requirements have been from a single passive approach to data fusion of active network technology under the situation awareness. Therefore, for the study of abnormal data classification is still very important. However, the traditional classification algorithm in the face of unbalanced data, only consider the algorithm accuracy, ignoring the classification effect of the minority class, thus easily lead to attacks and vulnerabilities of false positives, and for the new type of abnormal recognition efficiency is not ideal. Aiming at the above problems, firstly, this paper uses the sampling method of active learning algorithm to improve the learning efficiency in a large number of samples; then, the classification algorithm is improved based on the idea of the combination classifier, and the classification accuracy of the algorithm is increased by using the misclassification cost function; finally, the feasibility and effectiveness of the proposed method are verified by comparing the proposed method with the traditional method.

Key words: network security, imbalanced classification, active learning, cost function, combination classification

CLC Number: