Hadoop Hive Introduction




Hadoop Hive Overview Hadoop Hive is very similar to Apache Pig. What it does is let you create tables and load external files into tables using SQL. Then it creates MapReduce jobs in Java.  Java is a very wordy language so using Pig and Hive is simpler. Some have said that Hadoop Hive is a data warehouse tool (Bluntly put, […]

Read more

Sqoop Interview Questions And Answers




Sqoop Interview Questions And Answers What is Sqoop? Sqoop is an open source project that enables data transfer from non-hadoop source to hadoop source. It can be remembered as SQL to Hadoop -> SQOOP. It allows user to specify the source and target location inside the Hadoop.  

Read more

HBASE Interview Questions And Answers




HBASE Interview Questions And Answers What is HBase? Hbase is Column-Oriented , Open-Source, Multidimensional, Distributed database. It run on the top of HDFS Why we use Habse? Hbase provide random read and write, Need to do thousand of operation per second on large data set. List the main component of HBase? Zookeeper Catalog Tables Master RegionServer Region How many Operational […]

Read more

Hadoop Hive Interview Questions And Answers




Hadoop Hive Interview Questions And Answers What is Hadoop Hive? Hadoop Hive is a data warehouse system for Hadoop that facilitates easy data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop compatible file systems. Hive was originally developed at Facebook. It’s now a Hadoop subproject with many contributors. Users need to concentrate only on the top […]

Read more

Flume Data Collection into HBase




Flume Data Collection into HBase We will discuss about collection of data into HBase directly through flume agent. In our previous posts under flume category, we have covered setup of flume agents for file roll, logger and HDFS sink types. In this, we are going to explore the details of HBase sink and its setup with live example. As we […]

Read more

Sqoop Interview Questions and Answers for Experienced




Sqoop Interview Questions and Answers for Experienced In this post we will provide some practical Sqoop Interview Questions and Answers for experienced hadoop developers. Sqoop Interview Questions and Answers for Experienced 1. What is Sqoop? Sqoop is an open source tool that enables users to transfer bulk data between Hadoop eco system and relational databases. 2. What are the relational databases […]

Read more

Hadoop Interview Questions And Answers




Hadoop Interview Questions And Answers 1. What does commodity Hardware in Hadoop world mean? ( D ) a) Very cheap hardware b) Industry standard hardware c) Discarded hardware d) Low specifications Industry grade hardware 2. Which of the following are NOT big data problem(s)? ( D) a) Parsing 5 MB XML file every 5 minutes b) Processing IPL tweet sentiments […]

Read more

HBase Integration with Hadoop Hive




HBase Integration with Hadoop Hive In this post, we will discuss about the setup needed for HBase Integration with Hive and we will test this integration with the creation of some test hbase tables from hive shell and populate the contents of it from another hive table and finally verify these contents in hbase table. Reasons to use Hadoop Hive […]

Read more

HBase Shell Commands in Practice




HBase Shell Commands in Practice In Our previous posts we have seen HBase Overview and HBase Installation, now it is the time to practice some Hbase Shell Commands to get familiarize with HBase. We will test a few Hbase shell commands in this post. HBase Shell Usage Quote all names in HBase Shell such as table and column names. Commas […]

Read more

Java Installation on Ubuntu




Java Installation on Ubuntu Below is the Installation Procedure for Oracle Java Installation on Ubuntu: Java Installation on Ubuntu: Download latest JDK version which is later than 1.6 from Oracle Site. In this installation, we used JDK 8 version. Download jdk-*-linux-x64.tar.gz zipped binary tarball for Linux 64 bit machine. Here ‘*’ refers to jdk version number for example 8 in our case. Copy […]

Read more
1 2 3