Netinfo Security ›› 2025, Vol. 25 ›› Issue (2): 177-193.doi: 10.3969/j.issn.1671-1122.2025.02.001

Previous Articles     Next Articles

Advances in Watermarking Techniques for Large Language Models

QIN Zhongyuan1, WANG Tiantian1, LIU Weiqiang2, ZHANG Qunfang2   

  1. 1. School of Cyber Science and Engineering, Southeast University, Nanjing 211102, China
    2. Artillery and Air-Defence Institute Nanjing Campus, Nanjing 211132, China
  • Received:2024-04-08 Online:2025-02-10 Published:2025-03-07

Abstract:

Currently Large Language Model (LLM) has achieved remarkable results in the fields of text generation, machine translation and sentiment analysis. In order to protect the model dataset and parameter copyrights, prevent unauthorized copying and use, and verify the authenticity of messages, watermarking techniques are needed to ensure the security and trustworthiness of LLM. According to the different points in time when LLM operates, this paper categorized the current watermarking techniques into three types, watermarks embedded in model training, watermarks inserted in the inference phase and additional watermarks after text generation. For the robustness, confidentiality and effectiveness needs of watermarking, this paper also organized the evaluation metrics of watermarking techniques and reviewed the existing anti-watermarking attacks. This paper provides a comprehensive overview of LLM watermarking techniques with the aim of further promoting their development and application.

Key words: large language model, text watermarking, weight protection, AI forensics

CLC Number: