Netinfo Security ›› 2016, Vol. 16 ›› Issue (9): 64-68.doi: 10.3969/j.issn.1671-1122.2016.09.013

• Orginal Article • Previous Articles     Next Articles

Research on Features Selection in Malware Clustering

Yi WANG(), Yong TANG, Zexin LU, Xin YU   

  1. School of Computer Science, National University of Defense Technology, Changsha Hunan 410073, China
  • Received:2016-07-25 Online:2016-09-20 Published:2020-05-13

Abstract:

The increment of malware has exploded in recent years. As a result, using cluster algorithm to detect malware families has received the favors of security vendors. Malware clustering is the task of converging sample that has similar behavior or structure in the same group (called a cluster), and features selection plays a vital role in malware clustering. Firstly this paper discusses carefully the common features used in existing study of malware clustering and compares these features with each other. The most of existing works focus on the clustering based on single feature vector, while single feature vector is not capable of describing all the characteristics of malware. To solve this problem, then multi feature vector pairs are proposed to cluster malware. Also, according to the clustering results, the specific indexes are defined to evaluate the selected feature vectors. Finally, combining with DBSCAN clustering algorithm, several feature vectors and their combinations are selected to test. The result shows that multi feature vector pairs are superior to single feature vector in identifying malware families.

Key words: features selection, malware, cluster analysis

CLC Number: