Netinfo Security ›› 2025, Vol. 25 ›› Issue (7): 1163-1171.doi: 10.3969/j.issn.1671-1122.2025.07.014

Previous Articles     Next Articles

Research on Semantic Intelligent Recognition Algorithms for Meteorological Data Based on Large Language Models

FENG Wei, XIAO Wenming(), TIAN Zheng, LIANG Zhongjun, JIANG Bin   

  1. National Meteorological Information Centre, Beijing 100081, China
  • Received:2025-03-03 Online:2025-07-10 Published:2025-08-07
  • Contact: XIAO Wenming E-mail:xiaowm@cma.gov.cn

Abstract:

Meteorological data, as a typical spatiotemporal big data, faces severe data security challenges while empowering economic and social development. Addressing current issues in meteorological data security monitoring, such as insufficient semantic understanding, low accuracy in data feature recognition, and poor generalization capability, this study proposed an intelligent semantic recognition framework for meteorological data based on large language models. By constructing high-quality training datasets and domain knowledge bases, integrating Retrieval-Augmented Generation (RAG) with LoRA lightweight model technology, applying Chain-of-Thought (CoT) fine-tuning, and selecting PPO as the reinforcement learning algorithms to continuously optimize the recognition performance of the meteorological data security model. Experimental results demonstrate that this method effectively improves the accuracy of meteorological data feature recognition.

Key words: large language models, data security, semantic intelligent recognition, RAG, CoT

CLC Number: