Netinfo Security ›› 2021, Vol. 21 ›› Issue (11): 9-16.doi: 10.3969/j.issn.1671-1122.2021.11.002

Previous Articles     Next Articles

Research on Premise Selection Technology Based on Machine Learning Classification Algorithm

XIONG Yan1, CHENG Chuanhu2(), WU Jianshuang3, HUANG Wenchao1   

  1. 1. Department of Computer Science and Technology, University of Science and Technology of China, Hefei 230026, China
    2. Department of Cyberspace Security, University of Science and Technology of China, Hefei 230026, China
    3. Hefei Tianwei Information Security Technology Co., Ltd, Hefei 230009, China
  • Received:2021-06-21 Online:2021-11-10 Published:2021-11-24
  • Contact: CHENG Chuanhu E-mail:cchpersonal@163.com

Abstract:

Premise selection is the key technology to improve the success rate of automatic theorem provf. It can choose the lemma which is most likely to prove the current conjecture successfully according to the relevance of the proving goal. However, the relevance of the lemmas recommended by the existing premise selection algorithm is not high, and the automatic proof ability of the theorem cannot be further improved. To solve the above problems, a combination algorithm based on machine learning classification is proposed. The scheme starts from the relationship between formula structure and symbols, extracts effective feature vector set, and introduces LDA topics extraction techniques on the basis of k-nearest neighbor algorithm and naive Bayes algorithm to further capture the correlation between symbols and dependencies, which makes the final combination algorithm more accurate. Experimental results show that this method has higher accuracy than that the existing premise selection algorithm, and can effectively improve the success rate of automatic theorem provf.

Key words: automatic theorem proving, premise selection, LDA topics extraction, Coq proof assistant

CLC Number: