Netinfo Security ›› 2016, Vol. 16 ›› Issue (8): 74-80.doi: 10.3969/j.issn.1671-1122.2016.08.012

• Orginal Article • Previous Articles     Next Articles

Research and Design on Malware Detection System Based on N-gram Algorithm

Jiawang ZHANG(), Yanwei LI   

  1. National Computer Network Emergency Response Technical Team Coordination Center of China, Beijing 100029, China
  • Received:2016-06-10 Online:2016-08-20 Published:2020-05-13

Abstract:

It is difficult to detect malware detection of unknown malicious programs, Aiming at solving this problem, this paper proposes an approach for extracting the dynamic features of malicious code semantics. This method extracts the permissions and API features of Android application to set up the semantic feature sequence with the N-gram algorithm. With screening of the feature sequence, the behavior sequence becomes more representative. First, in order to increase the effectiveness of the characteristics, analysis of experienced malware experts for each Android API function in SDK to add the corresponding weights, and the use of frequency and the weight value of each element of the N-gram sequence characteristics of re-calculated values in order to build a N-gram series model improved. Then, using a variety of machine learning algorithms for classification and detection, verify its effectiveness. The experimental results show that the improved N-gram algorithm and features in this paper can effectively detect malicious programs under Android platform.

Key words: machine learning, malicious code detection, N-gram, Android application

CLC Number: