Netinfo Security ›› 2016, Vol. 16 ›› Issue (3): 59-63.doi: 10.3969/j.issn.1671-1122.2016.03.010

• Orginal Article • Previous Articles     Next Articles

Research and Design on the Storage Model for RDF Data Based on HBase

Yuanyuan WANG1(), Xiaodan LV1, Qi HU1, Hongchuan WU2   

  1. 1. College of Computer Science and Technology, Guizhou University, Guiyang Guizhou 550025, China
    2. College of Big Data and Information Engineering, Guizhou University, Guiyang Guizhou 550025, China
  • Received:2016-01-18 Online:2016-03-25 Published:2020-05-13

Abstract:

Aim ing at the storage of RDF data, this paper proposes an effective storage scheme based on the Rowkey and the distributed database HBase , which mainly use of the characteristics of HBase and RDF data. The method uses the classic BKDRHash algorithm to hash the predicate, and looks the hash value and the predicate as the primary key to enforce the data storage. Effective setting the Rowkey of HBase not only avoid the phenomenon of node accumulation, the use of BKDRHash algorithm also ensures the integrity of the data. In order to prove the validity of this storage mode, the experiment is to use MapReduce to load the data into HBase in parallel ways with the internal storage format HFile file. Experiments show that, for such a storage model, when the data quantity is large, the data loading can achieve better performance. The paper mainly uses the LUBM test set to carry on the simulation experiment, and it proves that the scheme is effective.

Key words: semantic Web, RDF data, HBase, MapReduce, HFile file

CLC Number: