Netinfo Security ›› 2025, Vol. 25 ›› Issue (12): 1847-1862.doi: 10.3969/j.issn.1671-1122.2025.12.002

Previous Articles     Next Articles

Research on Protocol Fuzzing Technology Guided by Large Language Models

YANG Liqun1,2,3(), LI Zhen1, WEI Chaoren1, YAN Zhimin1, QIU Yongxin1   

  1. 1. School of Cyber Science and Technology, Beihang University, Beijing 100191, China
    2. School of Civil Aviation Safety Engineering, Civil Aviation Flight University of China, Guanghan 618307, China
    3. Engineering Research Center of Key Technologies for All-Electric General Aviation Aircraft of Sichuan Province, Guanghan 618307, China
  • Received:2025-08-23 Online:2025-12-10 Published:2026-01-06
  • Contact: YANG Liqun E-mail:lqyang@buaa.edu.cn

Abstract:

Security vulnerabilities in network protocol software occur frequently and pose serious threats to cyberspace security. Gray-box protocol fuzzing tools, such as AFLNet, have improved vulnerability detection by introducing coverage feedback and state modeling mechanisms. However, constrained by a persistent “semantic barrier”, these tools struggle to comprehend protocol syntax structures and contextual logic, resulting in limited testing efficiency. In recent years, large language models have demonstrated exceptional generalization and comprehension capabilities in tasks such as semantic modeling, contextual reasoning, and code generation, providing a promising pathway to overcome this barrier. This paper proposed LPF (LLMProFuzz), a protocol fuzzing framework guided by large language models, which addressed the limitations of traditional methods from three perspectives: firstly, automatically extracting protocol syntax templates through few-shot prompt engineering; secondly, designing a seed enrichment mechanism based on historical vulnerability characteristics to generate high-value initial cases that cover boundary and exceptional scenarios; thirdly, introducing a structure-aware mutation location selection strategy to increase the proportion of effective test cases. Experimental results on representative protocol stacks, including HTTP, FTP, and RTSP, demonstrate that LPF significantly outperforms baseline tools such as AFLNet and StateAFL in terms of code coverage, state coverage, and testing efficiency.

Key words: large language models, network protocol, fuzzing, structure-aware mutation, prompt engineering

CLC Number: