Netinfo Security ›› 2017, Vol. 17 ›› Issue (8): 39-44.doi: 10.3969/j.issn.1671-1122.2017.08.006

• Orginal Article • Previous Articles     Next Articles

A Secondary Index Scheme of Big Data in HBase Based on Solr

Wenxian WANG1,2, Xingshu CHEN1,2, Haizhou WANG1,2(), Xiaosong WU2   

  1. 1.Cybersecurity Research Institute, Sichuan University, Chengdu Sichuan 610065, China
    2.Network and Trusted Computing Institute, College of Computer, Sichuan University, Chengdu Sichuan 610065, China
  • Received:2017-06-26 Online:2017-08-20 Published:2020-05-12

Abstract:

For the problem that HBase cannot provide secondary indexes and Huawei hindex scheme is difficult to meet the project demand of retrieval speed, designed an HBase secondary index scheme named SIHBase (Solr Indexing HBase) based on Solr. The scheme uses HBase Coprocessor to achieve the related callback function for creating, changing, deleting operations of the data tables and inserting, updating, deleting recovering operations of data. Thus, it can create and manage secondary indexes in Solr for HBase automatically and ensure the consistency of the data and index. The scheme has favorable generality and can create index for multi-column data of multiple tables in the meantime. And then extended the client-side function of HBase, increased the direct query interface of Solr, using efficient, flexible and diversified retrieval functions which Solr provided to realize quick retrieval for the mass data of HBase. Finally, a contrast experiment about query performance with hindex show that SIHBase was turned out to be much faster than hindex in query speed.

Key words: HBase, secondary index, Solr, quick retrieval

CLC Number: