Microsoft Azure Data Lake Storage Service (Gen1 and Gen2)
Video description
Learn to ingest, process, and export data in Azure Data Lake Storage Service Gen1 and Gen2 using Databricks and HDInsight
About This Video
Discover Microsoft Azure Data Lake
Learn to use Azure Databricks and HDInsight to process data in ADLS
Explore data lifecycle and architecture around Data Lake
In Detail
Azure Data Lake Storage Gen2 (ADLS) is a cloud-based repository for both structured and unstructured data. For example, you can use it to store anything from documents and images to social media streams. A common and effective pattern for big data processing is to store your data in ADLS and then process it on Azure Databricks using Spark, an engine that is typically faster than Hadoop MapReduce for in-memory workloads.
This is a comprehensive hands-on course for anyone interested in Azure's big data analytics services. Through worked examples, you will learn to import data into ADLS, access it securely, and analyze it using Azure Databricks and Azure HDInsight. You will also learn how to monitor and optimize your Data Lake storage. The course provides an end-to-end demonstration that gives you a clear understanding of Data Lake.
By the end of this course, you will be able to ingest, process, and export data using Databricks and HDInsight. You will have a solid understanding of Microsoft Azure Data Lake Storage Service (Gen1 and Gen2) and its features and properties, which will help you in your professional work.
Audience
This course is for anyone interested in Azure's big data analytics services. It also suits Microsoft Azure data engineers, database and BI developers, database administrators, data analysts, and similar profiles.
A basic understanding of data warehouses and databases in general will help you follow this course.
Demo: Azure Blob Storage to Data Lake Gen2 Using Data Factory
Demo: SQL Server to Data Lake Gen2 Using Data Factory
Demo: Amazon S3 to Data Lake Gen2 Using Data Factory
Chapter 5 : Data Flow Around Data Lake
Data Flow Around Data Lake
Data Lake and Transient Clusters
Chapter 6 : Azure Data Lake Processing Through Databricks
Demo Overview
Demo: Provision Databricks, Clusters, and Workbook
Demo: Mount Data Lake to Databricks DBFS
Demo: Explore, Analyze, Clean, Transform, and Load Data
Chapter 7 : Azure Data Lake Processing Through HDInsight
Demo Overview
Create Azure Data Lake Storage Gen2 (Source) and SQL Server (Destination)
What is Managed Identity
Add Managed Identity to Gen2 and Database Accounts
Create HDInsight Interactive Query Cluster
Ambari Overview and UI
Ingest Dataset into Data Lake Storage
Data Extraction with Hive
Data Transformation with Hive
Data Export Using Sqoop
Summary
Chapter 8 : Security Layers in Data Lake
Introduction
Storage Access Keys
SAS - Shared Access Signature
Azure Active Directory
Access Control List (ACL)
Firewalls and Virtual Networks
Encryption in Transit
Encryption at Rest
Advanced Threat Protection
Chapter 9 : Data Lake Monitoring and Optimization
Activity Log
Demo: Activity Logs
Metrics
Demo: Metrics
Demo: Insights
Demo: Alerts
Diagnostic Settings
Demo: Diagnostic Settings
Optimization
Chapter 10 : Practice Tests and Bonus
Delete Resources