Netinfo Security ›› 2017, Vol. 17 ›› Issue (5): 28-36.doi: 10.3969/j.issn.1671-1122.2017.05.005

• Orginal Article • Previous Articles     Next Articles

A Survey of Open Source Software for Big Data Governance and Security

Wenjie WANG, Baiqing HU, Chi LIU()   

  1. School of Software, Beijing Institute of Technology, Beijing 100081, China
  • Received:2017-03-11 Online:2017-05-20 Published:2020-05-12

Abstract:

With the development of Internet technology, the amount of data increase exponentially. This data is no longer easily to be controlled by the owner which is different from the traditional technology. Therefore, big data security and privacy has become a hot issue. Big data security and governance is one of the most popular research fields to solve the data security and data privacy. This paper introduces the basic concepts of data security and governance first, and then talks about open source framework, including Apache Falcon, Apache Atlas, Apache Ranger, Apache Sentry and Kerberos. Apache Falcon and Apache Atlas can perform data lifecycle management, including data collection, data processing, data backup and data cleansing, for big data platforms, as well as for fine scheduling of components of big data platforms. Apache Ranger and Apache Sentry can fine grained authorization to do a specific action or operation and provide a central audit server. Kerberos is mainly used for big data platform for the authority of the framework of certification, and maintain security of the big data platform.

Key words: big data, security, governance, open source

CLC Number: