The purpose of this course is to provide the participant with a solid understanding of Apache Hive, the open-source data warehouse infrastructure built on top of Hadoop. Hive handles petabytes of data, allowing analysis of large data sets and providing indexes to accelerate queries. It structures data and uses an SQL-like language called HiveQL.
An internal compiler translates HiveQL (HQL) queries into MapReduce, Apache Tez, or Spark jobs. Hive works with several storage types, such as ORC, HBase, and plain text. It offers built-in functions and supports user-defined functions (UDFs) for manipulating dates, strings, and other data. Hive resembles a relational database in its query language and table-based storage model, but its design differs: it is built for batch analysis of data stored in Hadoop rather than for transactional workloads. The Apache Hive training program opens the door to Hadoop for those who do not know Java.
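To make the translation step more concrete, here is a minimal Python sketch of how an aggregate query such as `SELECT dept, COUNT(*) FROM employees GROUP BY dept` could be broken into map, shuffle, and reduce phases. The table, its columns, and all function names are hypothetical and exist only for illustration. Hive's own compiler builds an optimized execution plan and is not implemented this way.

```python
from collections import defaultdict

# Hypothetical rows of an "employees" table: (name, dept).
ROWS = [
    ("asha", "sales"),
    ("ben", "engineering"),
    ("chen", "sales"),
    ("dina", "engineering"),
    ("eli", "sales"),
]


def map_phase(rows):
    """Emit one (key, 1) pair per row, keyed by the GROUP BY column."""
    for _name, dept in rows:
        yield dept, 1


def shuffle_phase(pairs):
    """Collect all emitted values under their key, as happens between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the values for each key, which corresponds to COUNT(*)."""
    return {key: sum(values) for key, values in grouped.items()}


def count_by_dept(rows):
    """Run the three phases in order over an in-memory list of rows."""
    return reduce_phase(shuffle_phase(map_phase(rows)))


if __name__ == "__main__":
    print(count_by_dept(ROWS))
```

In a real cluster the map and reduce phases run in parallel on many machines; the sketch runs them in sequence only to show what each phase contributes.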
About Apache Hive
Objective
The Apache Hive training program begins with instructions on installing and configuring Hive. It then moves on to the basic architecture and components of the Hive system, including the metastore, driver, query compiler, and Thrift server, and describes the job each one performs. It also discusses the Hive data units: databases, tables, partitions, and buckets.
It covers the primitive and complex data types that Hive can handle. It explains in detail how to work with tables, load data into them, write queries, insert data, and manage outputs. It then discusses HQL and its capabilities beyond SQL, with examples to clarify each point. It wraps up with Hive UDFs.
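As a rough illustration of two of the data units mentioned above, the Python sketch below models how a partition maps to a `column=value` directory and how a row could be assigned to a bucket by hashing its key modulo the number of buckets. It is a simplified model under stated assumptions: the paths, column names, and function names are hypothetical, the hash of an integer key is assumed to be the integer itself, and the hashing scheme Hive actually uses depends on the column type and the Hive version.

```python
from pathlib import PurePosixPath


def partition_dir(table_dir, partition_spec):
    """Build a partition directory path using the column=value naming convention.

    partition_spec is an ordered list of (column, value) pairs.
    """
    path = PurePosixPath(table_dir)
    for column, value in partition_spec:
        path = path / f"{column}={value}"
    return str(path)


def bucket_for(int_key, num_buckets):
    """Pick a bucket for an integer key: non-negative hash modulo bucket count.

    Assumption: the hash of an integer is the integer itself. The mask keeps
    the value non-negative, as it would for a 32-bit signed hash.
    """
    return (int_key & 0x7FFFFFFF) % num_buckets


if __name__ == "__main__":
    # Hypothetical table location and partition columns.
    print(partition_dir("/warehouse/sales", [("country", "IN"), ("year", 2024)]))
    # Hypothetical customer IDs spread over 4 buckets.
    for customer_id in (7, 10, 12):
        print(customer_id, "-> bucket", bucket_for(customer_id, 4))
```

Partitioning narrows a query to the directories that match its filter, while bucketing spreads the rows within a partition across a fixed number of files.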
Who should take this course?
Familiarity with Linux and SQL queries is recommended for the course. Basic knowledge of Core Java will also help in understanding how Hive works. The course has no formal prerequisites, and registration is open to everyone.
Course Content
Introduction
- Hadoop
- What is Hive?
- Features of Hive
- Architecture of Hive
- Working of Hive
HIVE Installation
- Verifying Java Installation
- Verifying Hadoop Installation
- Downloading Hive
- Installing Hive
- Configuring Hive
- Downloading and Installing Apache Derby
- Configuring Metastore of Hive
- Verifying Hive Installation
HIVE Data Types
- Column Types
- Literals
- Null Value
- Complex Types
Create Database
- Create Database Statement
DROP Database
- Drop Database Statement
CREATE Table
- Create Table Statement
- Load Data Statement
ALTER Table
- Alter Table Statement
- Rename To… Statement
- Change Statement
- Add Columns Statement
- Replace Statement
DROP Table
- Drop Table Statement
Partitioning
- Adding a Partition
- Renaming a Partition
- Dropping a Partition
BUILT-IN OPERATORS
- Relational Operators
- Arithmetic Operators
- Logical Operators
- Complex Operators
BUILT-IN FUNCTIONS
- Built-In Functions
- Aggregate Functions
Views and Indexes
- Creating a View
- Example
- Dropping a View
- Creating an Index
- Example
- Dropping an Index
HIVEQL SELECT…WHERE
- Syntax
- Example
HIVEQL SELECT…ORDER BY
- Syntax
- Example
HIVEQL GROUP BY
- Syntax
- Example
HIVEQL JOINS
- Syntax
- Example
- JOIN
- LEFT OUTER JOIN
- RIGHT OUTER JOIN
- FULL OUTER JOIN