信息网络安全 ›› 2015, Vol. 15 ›› Issue (1): 51-55.doi: 10.3969/j.issn.1671-1122.2015.01.009

• • 上一篇    下一篇

关系型数据库到HBase的转换设计

李青云(), 余文   

  1. 北京邮电大学计算机学院,北京100876
  • 收稿日期:2014-11-10 出版日期:2015-01-10 发布日期:2015-07-05
  • 作者简介:

    作者简介: 李青云(1990-),男,湖南,硕士研究生,主要研究方向:计算机应用;余文(1964-),男,甘肃,副教授,博士,主要研究方向:高性能计算、分子计算、神经网络和软计算。

  • 基金资助:
    国家自然科学基金[11272066]

Transition Design of Relational Database into HBase

Qing-yun LI(), Wen YU   

  1. School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received:2014-11-10 Online:2015-01-10 Published:2015-07-05

摘要:

随着信息技术的飞速发展,近年来出现了很多技术浪潮。云计算、物联网这些技术引起了无数学者的关注。在云计算盛行之前,互联网与移动终端的快速普及带来了爆发式的增长。毫无疑问,云计算的出现和移动互联网的普及推动了大数据这个热点技术的发展。拥有巨大的、有价值的数据和大数据技术会给企业带来成倍的收益。越来越多的企业关注和尝试大数据,并开始投入使用。大数据拥有超大规模、非结构化、种类多等特性。大数据存储技术和传统的关系型数据库管理系统(relational database management system,RDBMS)也是完全不同的技术。对大多数企业而言,之前的系统都是使用的传统的关系型数据库,这和新兴的并在大数据领域举足轻重的NoSQL具有很大的差别。如何完成从关系型数据到这种数据的迁移以及如何设计新的数据模式来符合新的存储系统成为设计者们研究的方向。文章介绍了一种在大数据里有着举足轻重的分布式存储系统HBase,主要讨论和研究了传统数据库到HBase的转换设计,并以一个例子详细介绍了如何更好地将RDBMS上的数据转换成符合HBase的数据模式,以此满足HBase的特性和大数据的要求。

关键词: 大数据, HBase, 数据库, 转换设计

Abstract:

With the rapid development of information technology, in recent years a lot of new technology emerged. These technology attracted the attention of numerous scholars such as cloud computing and Internet of things. Before the cloud was not in vogue, the Internet and the rapid popularization of mobile terminals have brought explosive growth in data volume. There is no doubt that the emergence of cloud computing and mobile Internet popularity contributed to the Big data hotspots. The huge, valuable data and Big data technology have will bring enterprise double the revenue. More and more enterprises and try to Big data, and began to put into use. Big data has a very large scale, unstructured, variety and other characteristics, and it’s storage technology and traditional relational database management system (RDBMS) is completely different technologies. As for most of the enterprises, systems used employed traditional relational database, which is greatly different from the newly and important NoSQL. Now, it’s the designers’ task to figure out how to complete the transformation from traditional data to this kind of data, and design a new data model to conform to the new storage system. In this paper, we introduced a distributed storage system that very important for Big Data - HBase, which is a kind of storage system based on open source implementation of BigTable designed by Google engineers. And then discussed how to better transform the data on the RDBMS to date conformed to the HBase data model, so as to meet the requirements of the characteristic of HBase and requirement of the Big Data.

Key words: big data, HBase, database, transition design

中图分类号: