Sunday, February 23, 2014

HBase Backup

HBase backup methods fall into two categories: online and offline.

Online Backup:

Online backup can in turn be done in three ways:
Replication: this method needs a second cluster, which holds a replicated copy of the data from the first cluster.
Hadoop/HBase Export command: this runs a MapReduce job that writes a table's contents to files in HDFS, either on the same cluster or on another Hadoop cluster. It does not require any downtime for backing up or exporting data.
With this method the data is exported to the cluster, and to restore it you load the exported files back with the Import command.
CopyTable: this is also an online backup method. It copies a table from one cluster to another cluster or to the same cluster.
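
As a rough sketch of what the Export, Import and CopyTable calls look like, the small Python helper below only builds the command lines and prints them; it does not run anything. The table name, the HDFS path and the ZooKeeper address are made-up placeholders, and the exact options available may differ between HBase versions, so check the usage message of each tool on your own cluster first.

```python
import shlex

# Package that holds the HBase MapReduce tools (Export, Import, CopyTable).
HBASE_MR = "org.apache.hadoop.hbase.mapreduce"


def export_cmd(table, output_dir):
    """Command that dumps a table to files under output_dir in HDFS."""
    return ["hbase", f"{HBASE_MR}.Export", table, output_dir]


def import_cmd(table, input_dir):
    """Command that loads a previous Export back into an existing table."""
    return ["hbase", f"{HBASE_MR}.Import", table, input_dir]


def copytable_cmd(table, new_name=None, peer_adr=None):
    """Command that copies a table, optionally renamed or to another cluster."""
    cmd = ["hbase", f"{HBASE_MR}.CopyTable"]
    if new_name:
        cmd.append(f"--new.name={new_name}")
    if peer_adr:
        # Expected form: <zookeeper quorum>:<client port>:<znode parent>
        cmd.append(f"--peer.adr={peer_adr}")
    cmd.append(table)
    return cmd


if __name__ == "__main__":
    # Hypothetical names, used only for illustration.
    commands = [
        export_cmd("mytable", "/backup/mytable"),
        import_cmd("mytable", "/backup/mytable"),
        copytable_cmd("mytable", new_name="mytable_copy"),
        copytable_cmd("mytable", peer_adr="zk-backup:2181:/hbase"),
    ]
    for cmd in commands:
        print(shlex.join(cmd))
```

Note that Import expects the target table to exist already, so create it with the same column families before restoring.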

Offline Backup:
DistCp: this is a file system level backup. It copies a directory from HDFS to another location on the same cluster or to another cluster.
copyToLocal: this is a less reliable way of copying directories from HDFS to a local backup drive. If there is a large amount of data, you need a lot of Hadoop tuning for the copy to finish successfully.

Offline backup methods are full shutdown backup methods. To copy the HBase files you need to stop your HBase cluster first, because while the cluster is online its files are continuously moved, modified and changed, and a copy taken in that state may fail.
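
The stop, copy, start order described above can be sketched as a small plan builder. This is only an illustration under a few assumptions: the HBase `stop-hbase.sh` and `start-hbase.sh` scripts are on the PATH, the HBase root directory is `/hbase` (check `hbase.rootdir` in your configuration), and the NameNode addresses and backup paths are hypothetical. The function only returns the commands; running them is left to you.

```python
import shlex


def offline_backup_plan(src_dir, dest_dir, use_local_copy=False):
    """Return the ordered commands for a full-shutdown backup of src_dir."""
    if use_local_copy:
        # Less reliable option: pull the directory down to a local drive.
        copy = ["hadoop", "fs", "-copyToLocal", src_dir, dest_dir]
    else:
        # File system level copy within HDFS or to another cluster.
        copy = ["hadoop", "distcp", src_dir, dest_dir]
    # HBase must be stopped before the copy and started again afterwards.
    return [["stop-hbase.sh"], copy, ["start-hbase.sh"]]


if __name__ == "__main__":
    # Hypothetical cluster addresses and paths, for illustration only.
    plan = offline_backup_plan(
        "hdfs://nn1:8020/hbase",
        "hdfs://nn2:8020/backup/hbase",
    )
    for step in plan:
        print(shlex.join(step))
```

Keeping the copy between the stop and start steps is the whole point of the offline method: nothing changes the files under the HBase directory while they are being copied.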


