Netinfo Security ›› 2019, Vol. 19 ›› Issue (2): 53-59. doi: 10.3969/j.issn.1671-1122.2019.02.007


Research on a Disk Data Synchronization Method Based on Directory Hash Tree

Shuai LI1, Xiaojie LIU2, Bing XU1

  1. College of Computer Science, Sichuan University, Chengdu, Sichuan 610041, China
  2. College of Cybersecurity, Sichuan University, Chengdu, Sichuan 610041, China
  • Received: 2018-11-15 Online: 2019-02-10 Published: 2020-05-11

Abstract:

With the widespread adoption of cloud computing, cloud data security has become increasingly important, and disaster recovery backup is one of its key areas. Most mainstream cloud platforms currently use the Rsync synchronization algorithm for disaster recovery backup. Rsync is an efficient file synchronization algorithm, but in the new cloud storage environment data is mostly backed up at the disk level. When applied to disk data that is large in volume and has a complicated partition and directory structure, Rsync is inefficient both at identifying unchanged files and at synchronizing newly added files. To address this problem, this paper proposes a disk data synchronization method based on a directory hash tree. The directory hash tree keeps the same topology as the original disk directory tree, which lets the method quickly determine which files are identical and which differ. Files that differ are then synchronized with Rsync, and newly added files are copied in full. Experimental results show that the proposed method synchronizes disk data more efficiently than Rsync used alone.
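The comparison step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the hash function, the node layout, or how deletions are handled. The sketch assumes SHA-256, represents a directory as a nested Python dict (with `bytes` values standing in for file contents), and uses the hypothetical names `HashNode`, `build_hash_tree`, and `classify`. Each directory digest is derived from its children's names and digests, so two subtrees with equal digests can be skipped without visiting their files.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class HashNode:
    digest: str
    is_dir: bool = False
    children: dict = field(default_factory=dict)  # stays empty for files


def build_hash_tree(entry):
    """Build a hash tree with the same shape as the directory.

    Assumption: a directory is a nested dict and a file is a bytes object.
    """
    if isinstance(entry, bytes):
        # File node: hash of a type tag plus the content.
        return HashNode(hashlib.sha256(b"F" + entry).hexdigest())
    children = {name: build_hash_tree(child) for name, child in entry.items()}
    h = hashlib.sha256(b"D")
    for name in sorted(children):
        # Directory digest covers each child's name and digest.
        h.update(name.encode() + b"\0" + children[name].digest.encode())
    return HashNode(h.hexdigest(), True, children)


def classify(src, dst, path=""):
    """Return (changed, added) file paths, skipping unchanged subtrees."""
    changed, added = [], []
    if dst is not None and src.is_dir == dst.is_dir and src.digest == dst.digest:
        return changed, added  # identical subtree or file: nothing to do
    if not src.is_dir:
        # A file with no file counterpart on the backup side is new.
        target = added if dst is None or dst.is_dir else changed
        target.append(path)
        return changed, added
    dst_children = dst.children if dst is not None and dst.is_dir else {}
    for name in sorted(src.children):
        c, a = classify(src.children[name], dst_children.get(name), f"{path}/{name}")
        changed += c
        added += a
    return changed, added


source = {"a": {"x.txt": b"1", "y.txt": b"2"}, "b": {"z.txt": b"3"}, "new.txt": b"n"}
backup = {"a": {"x.txt": b"1", "y.txt": b"old"}, "b": {"z.txt": b"3"}}
changed, added = classify(build_hash_tree(source), build_hash_tree(backup))
print(changed, added)
```

In this sketch the `changed` list corresponds to the files that would be handed to Rsync for delta transfer, and the `added` list to the files that would be copied in full. Files that exist only on the backup side are not reported here.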

Key words: cloud storage, data disaster recovery backup, data synchronization, Rsync, directory hash tree

CLC Number: