Steps to change hadoop hive default metastore Derby DB to MySQL DB




Steps to change hadoop hive default metastore Derby DB to MySQL DB Step 1: Install and start MySQL Step 2: Configure the MySQL Service and Connector Download mysql-connector-java-5.0.5.jar file and copy it to $HIVE_HOME/lib directory. Step 3: Create the Database and User Create a metastore_db database in MySQL database using root user $ mysql -u root -p Enter password: mysql> CREATE […]

Read more

How To Setup Hadoop SSH Configuration In Linux Ubantu




How To Setup Hadoop SSH Configuration In Linux Ubantu Hadoop requires SSH access to manage its nodes, i.e. remote machines plus your local machine if you want to use Hadoop on it (which is what we want to do in this short tutorial). For our single-node setup of Hadoop, we therefore need to configure SSH access to localhost for the hduser user we […]

Read more

How To Setup Hadoop SSH Configuration In Linux CentOs




How To Setup Hadoop SSH Configuration In Linux CentOs Hadoop requires SSH access to manage its nodes, i.e. remote machines plus your local machine if you want to use Hadoop on it (which is what we want to do in this short tutorial). For our single-node setup of Hadoop, we therefore need to configure SSH access to localhost for the hduser user […]

Read more

What is Apache Zookeeper and It Uses




Apache ZooKeeper Apache ZooKeeper is a service for coordinating processes of distributed applications ZooKeeper provides a distributed configration,synchorization service and naming registry for distributed systems. Distributed applications use Zookeeper to store and mediate updates to important configuration information. ZooKeeper provides a very simple interface and services. Apache ZooKeeper brings these key benefits: Fast. ZooKeeper is especially fast with workloads where reads […]

Read more

Adding New node to a running Hadoop cluster




Adding New node to a running Hadoop cluster   In talking about Hadoop cluster, first we need to define two terms: cluster and node. A cluster is a collection of nodes. A node is a process running on a virtual or physical machine or in a container. We say process because a code would be running other programs beside Hadoop. […]

Read more

Sqoop Interview Questions And Answers




Sqoop Interview Questions And Answers What is Sqoop? Sqoop is an open source project that enables data transfer from non-hadoop source to hadoop source. It can be remembered as SQL to Hadoop -> SQOOP. It allows user to specify the source and target location inside the Hadoop.  

Read more

HBASE Interview Questions And Answers




HBASE Interview Questions And Answers What is HBase? Hbase is Column-Oriented , Open-Source, Multidimensional, Distributed database. It run on the top of HDFS Why we use Habse? Hbase provide random read and write, Need to do thousand of operation per second on large data set. List the main component of HBase? Zookeeper Catalog Tables Master RegionServer Region How many Operational […]

Read more

BIg data hadoop Mapreduce Java Programs




BIg data hadoop Mapreduce Java Programs BIg data hadoop Mapreduce Java Programs In big data hadoop we have so many components like Mapreduce,pig,hive,sqoop,hbase and many more.Generally so many companies are giving good preference for Mapreduce. BIg data hadoop Mapreduce Java Programs We can write Mapreduce in different different languages like c++ Java Python Perl Ruby R language The above all […]

Read more

Hadoop Hive Interview Questions And Answers




Hadoop Hive Interview Questions And Answers What is Hadoop Hive? Hadoop Hive is a data warehouse system for Hadoop that facilitates easy data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop compatible file systems. Hive was originally developed at Facebook. It’s now a Hadoop subproject with many contributors. Users need to concentrate only on the top […]

Read more

Flume Data Collection into HBase




Flume Data Collection into HBase We will discuss about collection of data into HBase directly through flume agent. In our previous posts under flume category, we have covered setup of flume agents for file roll, logger and HDFS sink types. In this, we are going to explore the details of HBase sink and its setup with live example. As we […]

Read more
1 2 3 8