Master Big Data Ingestion and Analytics with Flume, Sqoop, Hive and Spark
Video description
Complete course on Sqoop, Flume, and Hive: Great for CCA175 and Hortonworks Spark Certification preparation
About This Video
Learn Sqoop, Flume, and Hive and successfully achieve CCA175 and Hortonworks Spark Certification
Understand the Hadoop Distributed File System (HDFS), along with exploring Hadoop commands to work effectively with HDFS
In Detail
In this course, you will start by learning about the …
Master Big Data Ingestion and Analytics with Flume, Sqoop, Hive and Spark
Video description
Complete course on Sqoop, Flume, and Hive: Great for CCA175 and Hortonworks Spark Certification preparation
About This Video
Learn Sqoop, Flume, and Hive and successfully achieve CCA175 and Hortonworks Spark Certification
Understand the Hadoop Distributed File System (HDFS), along with exploring Hadoop commands to work effectively with HDFS
In Detail
In this course, you will start by learning about the Hadoop Distributed File System (HDFS) and the most common Hadoop commands required to work with HDFS. Next, you'll be introduced to Sqoop Import, which will help you gain insights into the lifecycle of the Sqoop command and how to use the import command to migrate data from MySQL to HDFS, and from MySQL to Hive.
In addition to this, you will get up to speed with Sqoop Export for migrating data effectively, along with using Apache Flume to ingest data. As you progress, you will delve into Apache Hive, external and managed tables, working with different files, and Parquet and Avro. Toward the concluding section, you will focus on Spark DataFrames and Spark SQL.
By the end of this course, you will have gained comprehensive insights into big data ingestion and analytics with Flume, Sqoop, Hive, and Spark.
GroupByKey/ Group people based on Birthday months
00:05:54
ReduceByKey / Total Number of students in each Subject
00:06:44
SortByKey / Sort students based on their rollno
00:06:03
MapPartition / MapPartitionWithIndex
00:06:20
Change number of Partitions
00:03:34
Join / Join email address based on customer name
00:03:06
Spark Actions
00:06:06
Chapter 8 : Spark RDD Practice
Scala Tuples
00:03:05
Extract Error Logs from log files
00:10:23
Frequency of word in Text File
00:08:35
Population of each City
00:03:53
Orders placed by Customers
00:09:21
Movie Average Rating greater than 3
00:07:04
Chapter 9 : Spark Dataframes & Spark SQL
Dataframe Intro
00:02:17
Dafaframe from Json Files
00:04:46
Dataframe from Parquet Files
00:01:41
Dataframe from CSV Files
00:08:05
Dataframe from Avro/XML Files
00:04:54
Working with Different Compressions
00:06:34
DataFrame API Part1
00:04:51
DataFrame API Part2
00:06:24
Spark SQL
00:01:33
Working with Hive Tables in Spark
00:01:29
Start your Free Trial Self paced Go to the Course We have partnered with providers to bring you collection of courses, When you buy through links on our site, we may earn an affiliate commission from provider.
This site uses cookies. By continuing to use this website, you agree to their use.I Accept