The Differences between Pig, Hive, and.pptx - The Differences between Pig Hive and HBase Both Pig and Hive are high-level languages that compile to Map

The Differences between Pig, Hive, and.pptx - The...

This preview shows page 1 - 2 out of 15 pages.

The Differences between Pig, Hive, and HBase Both Pig and Hive are high-level languages that compile to Map Reduce. HBase is a completely different game: it allows Hadoop to support lookups/transactions on key/value pairs. HBase allows you to (1) do quick random lookups, versus scan all of data sequentially, (2) do insert/update/delete from middle, not just add/append. The differences between Pig and Hive are significant. Specifically: Pig doesn't require underlying structure to the data, Hive does imply structure via a metastore. This has its pros and cons. It allows Pig to be more suitable for ETL tasks where the input data is still a mish-mash and you want to convert it to be structured. On the other hand, Hive's metastore provides a dictionary that lets you easily see what columns exist in which tables, which can be very handy. Pig is a new language, easy to learn if you know languages similar to Perl. Hive is a subset of SQL with very simple variations to enable map-reduce-like computation. If you come from a SQL background you will find Hive QL extremely easy to pickup (many of your SQL queries
Image of page 1
Image of page 2

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture