Netinfo Security ›› 2025, Vol. 25 ›› Issue (4): 564-577.doi: 10.3969/j.issn.1671-1122.2025.04.005

Previous Articles     Next Articles

Research on Malicious Websites Assessment Method Based on Multidimensional Features and PageRank Optimization

WANG Fangyuan1(), LIAN Zhichao1, LI Qianmu1, GU Huanhuan1, ZHAO Qian2   

  1. 1. School of Cyber Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
    2. Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
  • Received:2024-11-28 Online:2025-04-10 Published:2025-04-25

Abstract:

With the rapid development of internet technology, cybersecurity threats have become increasingly severe. Malicious websites, serving as primary carriers of cyberattacks, pose significant threats to user information security and digital asset safety through phishing scams, malware distribution, and other means. The purpose of this paper was to enhance the accuracy of malicious websites assessment, taking malicious websites as the research object, covering the research scope of multi-dimensional feature analysis and PageRank algorithm optimization. This study employed various research methods and theories, including domain name feature analysis, registration information inquiry, domain name inclusion search, traffic behavior analysis, content quality assessment, user behavior data collection, and time decay factor integration. This paper combined natural language processing technology, machine learning algorithms and time decay mechanisms, proposed a comprehensive malicious websites assessment system and verified its effectiveness in improving the accuracy of malicious websites assessment. Experimental results demonstrate that this method achieves a comprehensive accuracy rate to 99.99%, which represents a significant improvement over traditional methods. The research findings presented in this paper provide robust support for cybersecurity protection and remain significant for constructing a safer and more trustworthy online environment.

Key words: malicious websites assessment, PageRank algorithm, content quality assessment, user behavior data, time decay factor

CLC Number: