Netinfo Security ›› 2017, Vol. 17 ›› Issue (11): 50-54.doi: 10.3969/j.issn.1671-1122.2017.11.008

• Orginal Article • Previous Articles     Next Articles

Research on an Intrusion Detection Algorithm Based on PCA and Random-forest Classification

Weining LIN1,2, Mingzhi CHEN1,2(), Yunqing ZHAN3, Chuanbao LIU1,2   

  1. 1.College of Mathematics and Computer Science, Fuzhou University, Fuzhou Fujian 350108, China
    2. Key Lab of Information Security of Network System in Fujian Province, Fuzhou Fujian 350108, China
    3. Electrical Power Research Institute of State Grid Fujian Electric Power Limited Company, Fuzhou Fujian 350007, China
  • Received:2017-08-15 Online:2017-11-20 Published:2020-05-12

Abstract:

Due to the low accuracy of existing intrusion detection methods, this paper proposes an intrusion detection algorithm based on PCA (principle component analysis) and random-forest classification. The idea of the algorithm is to clean the training data before classifying. Firstly, PCA is used to decompose the dataset and reduce noises. Then random-forest classifier is used to classify and train the processed data. The experiment uses machine learning library based on Python called scikit-learn and 20% NSL-KDD dataset. Experimental results show that compared with the commonly used intrusion detection technologies based on machine learning, the intrusion detection algorithm proposed in this paper can improve the detection accuracy more effectively.

Key words: machine learning, intrusion detection, PCA, random-forest

CLC Number: