Netinfo Security ›› 2015, Vol. 15 ›› Issue (5): 69-76.doi: 10.3969/j.issn.1671-1122.2015.05.011

Previous Articles     Next Articles

Research on P2P Traffic Identification Under the High Speed Network

MU Zheng1(), WU Jin1, XU Shu-juan2   

  1. 1. CNCERT/CC-LN, Shenyang Liaoning 110036, China
    2. China Mobile Group Liaoning co., ltd, Shenyang Liaoning 110179, China
  • Received:2015-04-09 Online:2015-05-10 Published:2018-07-16

Abstract:

Network traffic classification refers to classify the flow which mixed with a variety of applications in accordance with the protocol which flow used. Network traffic classification has been one of the hot spot in all walks of life. Research on network traffic classification can provide the basis to design the next generation of high performance network protocol, can provide the gist for network management and network traffic scheduling, can provide support for network attack protection and traffic cleaning. This paper analyzes the network traffic identification methods in nowadays mainstream sand summarized the flow classification technology development present situation and research results. According to current situation of rapid growth of P2P traffic and the characteristics of high-speed network traffic, focuses on the binary classification method of P2P traffic which under the high-speed networks. In this paper, data stream can be divided into P2P traffic and the normal network traffic by the traffic identification methods based on the behavior of the transport layer firstly, and dynamically generate marked characteristic of the training set according to the results of the classification. Effectively avoids the sample set which is used by the classification algorithm is not accurate because of the network data flow changed. And then put forward a kind of P2P traffic identification method based on C4.5 decision tree, this method only need to calculate several packets which in a data flow, and then the network traffic identification is completed. Don’t need to concern of single or double direction of the data flow, data encryption, etc. Experiments show that the recognition accuracy of this method is high; the computational complexity is low, suitable for high-speed network traffic identification.

Key words: high speed network, P2P, traffic identification

CLC Number: