HBASE Interview Questions And Answers




HBASE Interview Questions And Answers What is HBase? Hbase is Column-Oriented , Open-Source, Multidimensional, Distributed database. It run on the top of HDFS Why we use Habse? Hbase provide random read and write, Need to do thousand of operation per second on large data set. List the main component of HBase? Zookeeper Catalog Tables Master RegionServer Region How many Operational […]

Read more

Flume Data Collection into HBase




Flume Data Collection into HBase We will discuss about collection of data into HBase directly through flume agent. In our previous posts under flume category, we have covered setup of flume agents for file roll, logger and HDFS sink types. In this, we are going to explore the details of HBase sink and its setup with live example. As we […]

Read more

HBase Integration with Hadoop Hive




HBase Integration with Hadoop Hive In this post, we will discuss about the setup needed for HBase Integration with Hive and we will test this integration with the creation of some test hbase tables from hive shell and populate the contents of it from another hive table and finally verify these contents in hbase table. Reasons to use Hadoop Hive […]

Read more

HBase Shell Commands in Practice




HBase Shell Commands in Practice In Our previous posts we have seen HBase Overview and HBase Installation, now it is the time to practice some Hbase Shell Commands to get familiarize with HBase. We will test a few Hbase shell commands in this post. HBase Shell Usage Quote all names in HBase Shell such as table and column names. Commas […]

Read more

Flume Data Collection into HBase




Flume Data Collection into HBase We will discuss about collection of data into HBase directly through flume agent. In our previous posts under flume category, we have covered setup of flume agents for file roll, logger and HDFS sink types. In this, we are going to explore the details of HBase sink and its setup with live example. As we […]

Read more

Hbase Daemons in Pseudo Distribution Mode




Hbase Daemons in Pseudo Distribution Mode In Hbase cluster, we can start hbase daemons with start-hbase.sh command or $ hbase-daemon.sh (start | stop | restart | autorestart) (master | zookeeper | regionserver) But in pseudo distribution mode (hbase.cluster.distributed=false), only HMaster daemon will be triggered but not the HRegionServer daemon or HQuorumPeer daemon. When we start the daemons with start-hbase.sh or […]

Read more

Hbase Installation in Fully Distribution Mode




Hbase Installation in Fully Distribution Mode This post is a continuation for previous post on Hbase Installation. In the previous we have discussed about Hbase installation in pseudo distribution mode and in this post we will learn how to install and configure Hbase in fully distribution mode. Prerequisites: JDK 1.6 or later versions of Java installed on each data node machine […]

Read more

HBase Installation in Pseudo Distribution Mode




HBase Installation in Pseudo Distribution Mode This post describes the procedure for HBase Installation on Ubuntu Machine in pseudo distributed mode using HDFS configuration. Prerequisites: Java is one of the main prerequisite. JDK 1.6 or later versions of Java installation is required to run HBase. Hadoop 1 or Hadoop 2 installed on pseudo distributed or fully distributed cluster. HBase Installation […]

Read more

HBase Installation in Pseudo Distribution Mode




HBase Installation in Pseudo Distribution Mode This post describes the procedure for HBase Installation on Ubuntu Machine in pseudo distributed mode using HDFS configuration. Prerequisites: Java is one of the main prerequisite. JDK 1.6 or later versions of Java installation is required to run HBase. Hadoop 1 or Hadoop 2 installed on pseudo distributed or fully distributed cluster. HBase Installation […]

Read more

Data Collection from HTTP Client into HBase




Data Collection from HTTP Client into HBase This post provides a proof of concept of data collection from HTTP client into HBase. In this post, we will setup a flume agent with HTTP Source, JDBC Channel and AsyncHBase Sink. Initially we concentrate on POC of HTTP client data collection into HBase and at the end of this post we will […]

Read more