How to Execute HBase Commands from Shell Script? – Examples

Shell scripting is one of the widely used scripting language to automate day to day activities. Usually, Linux shells are interactive, they accept command as input from users and execute them. However, it will become repetitive as you have to type in all commands each time on terminal. Instead, you can bundle those commands in shell script. In this article, we will check how to execute HBase Commands from Shell Script with an example. Why Shell Script is Required? There are many reasons to use Linux shell scripting: It helps…

Continue ReadingHow to Execute HBase Commands from Shell Script? – Examples
Comments Off on How to Execute HBase Commands from Shell Script? – Examples

Hadoop – Export Hive Data with Quoted Values into Flat File and Example

In general, quoted values are values which are enclosed in single or double quotation marks. Usually, quoted values files are system generated where each and every fields in flat files is either enclosed in SINGLE or DOUBLE quotation mark. In this article, we will check how to export Hadoop Hive data with quoted values into flat file such as CSV file format. Quoted Value File Overview In the quoted values files, values are enclosed in quotation mark in case there is a embedded delimiter. For example, comma separated values file…

Continue ReadingHadoop – Export Hive Data with Quoted Values into Flat File and Example
Comments Off on Hadoop – Export Hive Data with Quoted Values into Flat File and Example

Hadoop and Netezza Comparison – Netezza vs Hadoop

Hadoop and Netezza are basically used with the Bigdata i.e. huge volume of the data. Netezza is used for analytics purpose i.e. OLAP application and Hadoop is used in batch processing. In this article, we will check Hadoop and Netezza Comparison - Netezza vs Hadoop. These two systems have their own advantages and disadvantages. We will try to compare the features and difference between Hadoop and Netezza. Read: Netezza and Redshift Comparison - Netezza vs Redshift Hadoop Features Following are some of the feature on Hadoop ecosystem: In the Hadoop ecosystem…

Continue ReadingHadoop and Netezza Comparison – Netezza vs Hadoop
2 Comments

Sqoop Command with Secure Password

Sqoop commands allows you to exchange the data between Hadoop and relational databases such as Netezza, Oracle etc. Sqoop required the password to connect to various databases and of course it has to be secured. In this article, we will discuss on various ways to execute the Sqoop Command with Secure Password. Read: Sqoop import Relational Database Table into HBase Table Import data using Sqoop Export data using Sqoop Sqoop Architecture – Mappers with No Reducers Sqoop Command with Secure Password Below are the some of the methods that we can…

Continue ReadingSqoop Command with Secure Password
Comments Off on Sqoop Command with Secure Password

Hadoop Hive Cumulative Sum, Average and Example

Latest version of Hive HQL supports the window analytics functions. You can make use of the Hadoop Hive Analytic functions to calculate the cumulative sum or running sum and cumulative average. Sum and Average analytical functions are used along with window options to calculate the Hadoop Hive Cumulative Sum or running sum. Hadoop Hive Cumulative Sum, Average Syntax: Below are the Syntax for Apache Hive Cumulative SUM, AVG analytic functions. You can use these function within query you have requirement to calculate cumulative SUM or AVG. SUM([DISTINCT | ALL] expression)…

Continue ReadingHadoop Hive Cumulative Sum, Average and Example
2 Comments

Run Hive Script File Passing Parameter and Working Example

Hive is used for batch and interactive SQL queries. Variable Substitution allows for tasks such as separating environment-specific configuration variables from code. The variable substitution is very important when you are calling the HQL scripts from shell or Python. You can pass the values to query that you are calling. In this article, we will see how to run Hive script file passing parameter to it. We also see the working examples. Run Hive Script File Passing Parameter You can use the set and use that variable within the script.…

Continue ReadingRun Hive Script File Passing Parameter and Working Example
Comments Off on Run Hive Script File Passing Parameter and Working Example

Hive String Functions and Examples

In this article, we will discuss on the various Hive string functions and usage. The HQL string functions are similar to the SQL string functions. Hive String Functions The string functions in Hive are listed below: Read: Apache Hive Extract Function Alternative and Examples Apache Hive group_concat Alternative and Example Hadoop Hive Regular Expression Functions and Examples Hadoop Hive Date Functions and Examples Hive concat (string A, string B,...) Function This Hive built-in strig function cocatenates all the given strings: hive> select CONCAT('concat','->','demo'); OK concat->demo Hive substr(string, int start, int…

Continue ReadingHive String Functions and Examples
Comments Off on Hive String Functions and Examples

Hadoop Hive Table Dynamic Partition and Examples

Partition in Hive is used for the better performance. Hive supports the single or multi column partition. You can manually add the partition to the Hive tables or Hive can dynamically partition. You can choose either methods based on your needs. In this article, we will discuss about the Hadoop Hive table dynamic partition and demonstrate using examples. Hadoop Hive Table Dynamic Partition In Hadoop Hive, data is stored as files on HDFS, whenever you partition the table in Hive, it creates sub directories within main directory using the partition…

Continue ReadingHadoop Hive Table Dynamic Partition and Examples
Comments Off on Hadoop Hive Table Dynamic Partition and Examples

Commonly used Hadoop Hive Commands and Examples

If you are already familiar with the SQL then Hive command syntax are easy to understand. In this article, we will discuss on the commonly used Hadoop Hive commands. Read: Cloudera Impala Generate Sequence Numbers without UDF Netezza ROWNUM Pseudo Column Alternative Run Impala SQL Script File Passing argument and Working Example  An Introduction to Hadoop Cloudera Impala Architecture Commonly used Hadoop Hive commands Below are the most commonly used Hadoop Hive commands: Hive Create Database A database is a collection of namespace in Hive. Below is the syntax to…

Continue ReadingCommonly used Hadoop Hive Commands and Examples
Comments Off on Commonly used Hadoop Hive Commands and Examples

Basic Hadoop HDFS Filesystem Operations With Examples

There are many interfaces to HDFS available, but the command line (CLI) is one of the simplest and, to many developers, the most familiar interface. You can perform most advanced and basic Hadoop HDFS filesystem operations using CLI. Basic Hadoop HDFS Filesystem Operations The when Hadoop HDFS filesystem is set, you can do all of the basic HDFS filesystem operations, such as reading files, creating directories, moving files, deleting data, and listing directories. You can also perform the advance Hadoop HDFS filesystem operations such as updates, administrator from command line.…

Continue ReadingBasic Hadoop HDFS Filesystem Operations With Examples
Comments Off on Basic Hadoop HDFS Filesystem Operations With Examples