Hive is a data warehouse infrastructure superimposed on Hadoop that provides data summary, query, and analysis capabilities. Hive comes with an SQL-like language called HiveQL. Participants will load data into Hive, and then use HiveQL to build a query of the data set.
Coaching hours: 4
Participants can expect to spend approximately 12-15 hours beyond the coaching hours on completing this module.
What’s in it for you?
- Understand how to exploit Hive in Hadoop.
- Use basic Linux commands to load data into Hive.
- Read the data using Hive.
- Build basic queries using HiveQL.
- Save query results.