Netinfo Security ›› 2019, Vol. 19 ›› Issue (4): 73-81.doi: 10.3969/j.issn.1671-1122.2019.04.009

Previous Articles     Next Articles

Frequent Itemsets Mining Algorithm for Privacy Protection

Chen JIANG1,2(), Geng YANG1,2, Yunlu BAI1,3, Junmei MA4   

  1. 1. College of Computer Science and Software, Nanjing University of Posts and Telecommunications, Nanjing Jiangsu 210023, China
    2. Jiangsu Key Laboratory of Bigdata Security Intelligent Processing, Nanjing Jiangsu 210023, China
    3. College of Information Engineering, Nanjing University of Chinese Medicine, Nanjing Jiangsu 210023, China
    4. Unit 31436 of PLA, Shenyang Liaoning 110805, China
  • Received:2018-12-05 Online:2019-04-10 Published:2020-05-11

Abstract:

A variety of differentially private FIM algorithms have been proposed. However, current solutions for this problem cannot well balance privacy and data utility over large-scale data. This paper proposes a new differentially private FIM algorithm(TrunSuper). This algorithm truncates the transaction datasets to reduce the dimension, and sorts the items in decreasing order, then eliminates the items with less support. In this way, it can reduce the information loss of the frequent itemsets. This paper also theoretically proves that TrunSuper can produce reasonably accurate results while satisfying differential privacy. Experiments on several real datasets shows that TrunSuper performs better than other previous solutions.

Key words: frequent itemsets mining, differential privacy, transaction truncating, Laplace mechanism

CLC Number: