Hadoop HBase

How to Import CSV File into HBase using importtsv

So far we have discussed HBase Create Table, How to insert data in HBase table and Update the records in HBase table.

In this tutorial, we will discuss how to import CSV File into HBase table using importtsv command.

Import CSV File into HBase using importtsvImport CSV File into HBase using importtsv

You can load bulk data in HBase table using this method as well. Many times in data analytics we receive the requirement where we have to load a CSV file into HBase table, and in such scenarios, this tutorial on import CSV data in HBase would be very helpful.

Load CSV data to HBase Table

Here are the simple steps to create a table and load a CSV file into that table.

• Login to the HBase shell
• Go to HBase shell by typing the below-

hbase shell

• Create an HBase table by using the below command or follow HBase create table command-

create ‘sensor’,’temp’,’vibration’,’pressure’

• Verify the table using either list or scan command

Now you will have to load the data and for which you need to come out of HBase shell.

Load CSV Data

Let’s say you have a CSV file named sample.csv in HDFS and you may want to keep in some HBase directory. For this use SCP command-

scp hbase.csv root@sandbox.hortonworks.com:/home/hbase

Now put it in HDFS using the below command-

hadoop dfs -copyFromLocal sample.csv /tmp

Now again move to HBase shell and follow the below command to load CSV data into HBase table-

hbase>
hbase org.apache.hadoop.hbase.mapreduce.ImportTsv -Dimporttsv.separator=, –
Dimporttsv.columns=”HBASE_ROW_KEY,id,temp:in,temp:out,vibration,pressure:in,pressure:out”
sensor hdfs://sandbox.hortonworks.com:/tmp/hbase.csv

Once the MapReduce job is executed, you can scan the table and find the details. You will see the data loaded into HBase table.

Also, you should note here that, ImportTsv command leaves a massive amount of log files in this location- /var/logs and so you should have enough space in the cluster.

Conclusion

These were the easiest method to import csv file into HBase table using importtsv command efficiently.

Do try to load CSV file into HBase table using importtsv and let us know for any difficulty.

There are many more methods to load a CSV file into HBase table, and we will discuss those in our coming blog posts.

Leave a Comment