Netinfo Security ›› 2016, Vol. 16 ›› Issue (9): 64-68.doi: 10.3969/j.issn.1671-1122.2016.09.013
• Orginal Article • Previous Articles Next Articles
Yi WANG(), Yong TANG, Zexin LU, Xin YU
Received:
Online:
Published:
Abstract:
The increment of malware has exploded in recent years. As a result, using cluster algorithm to detect malware families has received the favors of security vendors. Malware clustering is the task of converging sample that has similar behavior or structure in the same group (called a cluster), and features selection plays a vital role in malware clustering. Firstly this paper discusses carefully the common features used in existing study of malware clustering and compares these features with each other. The most of existing works focus on the clustering based on single feature vector, while single feature vector is not capable of describing all the characteristics of malware. To solve this problem, then multi feature vector pairs are proposed to cluster malware. Also, according to the clustering results, the specific indexes are defined to evaluate the selected feature vectors. Finally, combining with DBSCAN clustering algorithm, several feature vectors and their combinations are selected to test. The result shows that multi feature vector pairs are superior to single feature vector in identifying malware families.
Key words: features selection, malware, cluster analysis
CLC Number:
TP309
Yi WANG, Yong TANG, Zexin LU, Xin YU. Research on Features Selection in Malware Clustering[J]. Netinfo Security, 2016, 16(9): 64-68.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://netinfo-security.org/EN/10.3969/j.issn.1671-1122.2016.09.013
http://netinfo-security.org/EN/Y2016/V16/I9/64