Netinfo Security ›› 2015, Vol. 15 ›› Issue (2): 66-70. doi: 10.3969/j.issn.1671-1122.2015.02.011

• Theoretical Research •

Research on Digital Fingerprinting Generation Scheme and Key Algorithm

LIU Wen-long, LI Hui, JIN Dong-xun

  1. School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received: 2014-11-25 Online: 2015-02-10 Published: 2015-07-05
  • About the authors:

    LIU Wen-long (刘文龙, born 1991), male, from Shandong, master's student; main research interest: digital fingerprinting technology. LI Hui (李晖, born 1970), female, from Jilin, associate professor, Ph.D.; main research interests: cryptography and its applications, information security, and wireless communication security. JIN Dong-xun (金东勋, born 1988), male, from Jilin, master's student; main research interest: wireless communication security.

  • Funding:
    National Natural Science Foundation of China [61370195]

Abstract:

As an emerging digital copyright protection technology, digital fingerprinting plays an important role in plagiarism detection, and the fingerprint generation algorithm directly determines the performance of a digital fingerprinting scheme. This article describes three fingerprint generation algorithms commonly used in digital fingerprinting schemes (MD5, SHA1, and the Rabin fingerprint algorithm), introduces their basic principles, and tests their performance experimentally. To measure the performance differences among the three algorithms, test files ranging in size from 20 KB to 20 MB were constructed. The test files were first preprocessed to remove irrelevant characters. Fingerprints were then generated from the processed text using a block-based strategy, and the efficiency of fingerprint generation was measured under different block strategies and different file sizes. The results show that the hash functions (MD5, SHA1) perform better when generating fingerprints for large files, while for small files the hash functions and the Rabin fingerprint algorithm perform the same. These results provide experimental support for selecting a fingerprint generation algorithm when designing new fingerprinting schemes.

Key words: digital fingerprinting, hash function, Rabin fingerprint algorithm, plagiarism detection
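
The pipeline outlined in the abstract (preprocess, split into blocks, fingerprint each block, time the run) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the cleaning rule in `preprocess`, the fixed-size blocking in `split_blocks`, the block sizes, and the fixed degree-64 polynomial `RABIN_POLY` are all hypothetical choices, since the abstract does not specify them. The Rabin variant shown is the plain polynomial-remainder form over GF(2), without a sliding window.

```python
import hashlib
import re
import time

# Degree-64 polynomial x^64 + x^4 + x^3 + x + 1 over GF(2), commonly tabulated
# as irreducible. Assumption: Rabin's scheme draws a random irreducible
# polynomial; a fixed one keeps the sketch short.
RABIN_POLY = (1 << 64) | 0b11011


def preprocess(text):
    """Drop non-alphanumeric characters and lowercase (assumed cleaning rule)."""
    return re.sub(r"[\W_]+", "", text).lower().encode("utf-8")


def split_blocks(data, block_size):
    """Cut the byte string into consecutive fixed-size blocks."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def rabin_fingerprint(block, poly=RABIN_POLY):
    """Read the block as a polynomial over GF(2) and reduce it modulo poly."""
    fp = 0
    for byte in block:
        fp = (fp << 8) | byte  # append 8 more coefficient bits
        while fp.bit_length() >= poly.bit_length():
            # Cancel the leading term; XOR is subtraction in GF(2).
            fp ^= poly << (fp.bit_length() - poly.bit_length())
    return fp


def fingerprints(data, block_size, algorithm):
    """Return one fingerprint per block for 'md5', 'sha1', or 'rabin'."""
    blocks = split_blocks(data, block_size)
    if algorithm == "rabin":
        return [rabin_fingerprint(b) for b in blocks]
    return [hashlib.new(algorithm, b).hexdigest() for b in blocks]


def time_generation(data, block_size, algorithm):
    """Wall-clock seconds needed to fingerprint every block once."""
    start = time.perf_counter()
    fingerprints(data, block_size, algorithm)
    return time.perf_counter() - start


if __name__ == "__main__":
    sample = preprocess("The quick brown fox jumps over the lazy dog. " * 500)
    for name in ("md5", "sha1", "rabin"):
        for size in (64, 512):
            print(name, size, f"{time_generation(sample, size, name):.4f} s")
```

Timings from this sketch should not be compared with the reported results: `hashlib` runs compiled code, whereas the Rabin loop here is pure Python, so the gap it shows reflects the implementation rather than the algorithms themselves.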
