Netinfo Security ›› 2023, Vol. 23 ›› Issue (2): 54-63.doi: 10.3969/j.issn.1671-1122.2023.02.007

Previous Articles     Next Articles

A Scheme of Optimizing Deep Learning Model Using Bi-ADMM

XU Zhanyang, CHENG Luofei(), CHENG Jianchun, XU Xiaolong   

  1. School of Software, Nanjing University of Information Science and Technology, Nanjing 210044, China
  • Received:2022-10-21 Online:2023-02-10 Published:2023-02-28
  • Contact: CHENG Luofei E-mail:20201221009@nuist.edu.cn

Abstract:

ADMM is widely used in the field of traditional machine learning model optimization, and it has solved some deep learning optimization problems, and its performance in deep learning optimization has exceeded most of the gradient-based optimization algorithms. Compared with ADMM, Bi-ADMM converges faster and it is more stable. This paper proposed a optimization scheme (dlBi-ADMM) to optimize deep learning problem, and used an accelerated proximal gradient algorithm to optimize coupled variables to reduce the complexity of matrix inversion operations. Then, it provided the specific function of the optimization subproblem for each variable in detail. Finally, experiments show that the optimization results of the dlBi-ADMM algorithm proposed in this paper can improve the accuracy of the model more than the results of the dlADMM optimization, and the dlBi-ADMM algorithm performs better than the dlADMM algorithm in time efficiency.

Key words: deep learning, ADMM, dlADMM, Bi-ADMM, accelerated proximal gradient algorithms

CLC Number: