信息网络安全 ›› 2023, Vol. 23 ›› Issue (8): 86-98.doi: 10.3969/j.issn.1671-1122.2023.08.008

• 技术研究 • 上一篇    下一篇

面向多源异构数据融合的隐私集合求交研究

丁江1, 张国艳1,2, 魏子重3, 王梅1,4()   

  1. 1.山东大学网络空间安全学院,青岛 266237
    2.山东区块链研究院,济南 250102
    3.山东浪潮科学研究院有限公司,济南 250101
    4.泉城实验室,济南 250100
  • 收稿日期:2023-01-30 出版日期:2023-08-10 发布日期:2023-08-08
  • 通讯作者: 王梅 E-mail:wangmeiz@sdu.edu.cn
  • 作者简介:丁江(1998—),男,安徽,硕士研究生,主要研究方向为隐私信息检索|张国艳(1977—),女,山东,副教授,博士,主要研究方向为区块链及隐私计算|魏子重(1984—),男,山东,高级工程师,硕士,主要研究方向为密码协议及计算机安全|王梅(1990—),女,山东,助理研究员,博士,主要研究方向为隐私集合求交集及安全认证
  • 基金资助:
    国家重点研发计划(2022YFB2702800);山东省自然科学基金(ZR2023MF045);山东省自然科学基金青年项目(ZR2023QF088);青岛市自然科学基金原创探索类项目(23-2-1-152-zyyd-jch)

Multi-Source Heterogeneous Data Collaboration via Private Set Intersection

DING Jiang1, ZHANG Guoyan1,2, WEI Zichong3, WANG Mei1,4()   

  1. 1. School of Cyber Science and Technology, Shandong University, Qingdao 266237, China
    2. Shandong Institute of Blockchain, Jinan 250102, China
    3. Inspur Academy of Science and Technology, Jinan 250101, China
    4. Quancheng Laboratory, Jinan 250100, China
  • Received:2023-01-30 Online:2023-08-10 Published:2023-08-08
  • Contact: WANG Mei E-mail:wangmeiz@sdu.edu.cn

摘要:

多源异构数据融合的痛点在于数据的低价值密度性和分散性。数据的多源异构性增加了数据聚合的难度,导致数据价值极度零散,使得数据融合方法面对多源异构大数据无的放矢,无法有效关联零散价值的数据。隐私集合求交(Private Set Intersection,PSI)不但可以使数据方放心提供数据,还可以将多源异构数据价值有效融合,是挖掘有效数据开展数据融合工作的新工具。为此,文章针对异构数据的整合、数据的多源以及大规模数据的并行处理3类问题,给出多源异构数据融合的3个新思路。

关键词: 隐私集合求交, 情报分析

Abstract:

The main point of multi-source heterogeneous data fusion is the low value density and dispersion of data. The multi-source heterogeneity of data increases the difficulty of data aggregation, leading to extreme fragmentation of data value, making data fusion methods to face multi-source heterogeneous big data with no target, and unable to effectively correlate data with fragmented value. Private Set Intersection (PSI) not only enables data providers to provide data with peace of mind, but also effectively integrates the value of heterogeneous data from multiple sources, and mines effective data to carry out data fusion work as a new tool. To this end, the article gave three new ideas for the fusion of heterogeneous data from multiple sources with respect to three types of problems: integration of heterogeneous data, multiple sources of data, and parallel processing of large-scale data.

Key words: private set intersection, intelligence analysis

中图分类号: