Hadoop Hive Introduction




Hadoop Hive Overview: Hadoop Hive is very similar to Apache Pig. It lets you create tables and load external files into them using SQL, and it then turns those SQL statements into MapReduce jobs, which would otherwise have to be written in Java. Java is a verbose language, so using Pig or Hive is simpler. Some have said that Hadoop Hive is a data warehouse tool (Bluntly put, […]


Steps to change the Hadoop Hive default metastore from Derby DB to MySQL DB




Step 1: Install and start MySQL.
Step 2: Configure the MySQL service and connector. Download the mysql-connector-java-5.0.5.jar file and copy it to the $HIVE_HOME/lib directory.
Step 3: Create the database and user. Create a metastore_db database in MySQL as the root user:
$ mysql -u root -p
Enter password:
mysql> CREATE […]


Hadoop Hive Interview Questions And Answers




What is Hadoop Hive? Hadoop Hive is a data warehouse system for Hadoop that facilitates easy data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop-compatible file systems. Hive was originally developed at Facebook. It is now a top-level Apache project with many contributors. Users need to concentrate only on the top […]


HBase Integration with Hadoop Hive




In this post, we discuss the setup needed for HBase integration with Hive. We then test the integration by creating some test HBase tables from the Hive shell, populating them from another Hive table, and finally verifying the contents in the HBase table. Reasons to use Hadoop Hive […]


Hadoop Hive UDF Part 2: Custom GenericUDF in Hive (NVL2)




1.0. What’s in this blog? In my previous blog on creating custom UDFs in Hadoop Hive, I covered a basic sample UDF. This blog covers creating a generic UDF that mimics the same NVL2 functionality covered in the previous blog. It includes sample data, Java code for creating the UDF, expected results, […]
