09 - hbase_overview.pdf - HBASE OVERVIEW http\/www.tutorialspoint.com\/hbase\/hbase_overview.htm Copyright \u00a9 tutorialspoint.com Since 1970 RDBMS is the

09 - hbase_overview.pdf - HBASE OVERVIEW...

This preview shows page 1 - 2 out of 4 pages.

Copyright © tutorialspoint.comHBASE - OVERVIEWHBASE - OVERVIEWSince 1970, RDBMS is the solution for data storage and maintenance related problems. After theadvent of big data, companies realized the benefit of processing big data and started opting forsolutions like Hadoop.Hadoop uses distributed file system for storing big data, and MapReduce to process it. Hadoopexcels in storing and processing of huge data of various formats such as arbitrary, semi-, or evenunstructured.Limitations of HadoopHadoop can perform only batch processing, and data will be accessed only in a sequentialmanner. That means one has to search the entire dataset even for the simplest of jobs.A huge dataset when processed results in another huge data set, which should also be processedsequentially. At this point, a new solution is needed to access any point of data in a single unit oftime randomaccess.Hadoop Random Access DatabasesApplications such as HBase, Cassandra, couchDB, Dynamo, and MongoDB are some of thedatabases that store huge amounts of data and access the data in a random manner.What is HBase?HBase is a distributed column-oriented database built on top of the Hadoop file system. It is anopen-source project and is horizontally scalable.
Background image
Image of page 2

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture