Microsoft Azure Data Lake Storage Online Course
This hands-on course provides a complete introduction to Azure Data Lake Storage Gen2 (ADLS), Microsoft’s cloud-based repository for structured and unstructured data. You’ll learn how ADLS can store diverse data types—from documents and images to social media streams—and see how it integrates with big data processing tools like Azure Databricks and HDInsight. Through practical examples, you’ll import data into ADLS, securely access and analyze it, and explore methods to optimize and monitor your storage. The course offers an end-to-end walkthrough of data ingestion, processing, and exporting using Spark on Databricks and HDInsight.
By the end, you’ll have a clear understanding of both ADLS Gen1 and Gen2 and their features and capabilities, so you can manage and analyze big data effectively in Azure.
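As a small illustration of how data in a Gen2 account is typically addressed from Spark on Databricks or HDInsight, the sketch below builds an abfss:// URI that points at a file in a container. It is not taken from the course material: the helper name abfss_uri and the account, container, and file names are hypothetical, chosen only to show the URI layout.

```python
def abfss_uri(account: str, container: str, path: str = "") -> str:
    """Build an abfss:// URI for a path inside an ADLS Gen2 container."""
    # Gen2 accounts expose the hierarchical namespace through the "dfs" endpoint.
    base = f"abfss://{container}@{account}.dfs.core.windows.net"
    # Normalize the path so the result never contains a doubled slash.
    clean = path.strip("/")
    return f"{base}/{clean}" if clean else f"{base}/"


# Hypothetical account, container, and file names, used only for illustration.
uri = abfss_uri("examplelake", "raw", "/sales/2024/orders.csv")
print(uri)  # abfss://raw@examplelake.dfs.core.windows.net/sales/2024/orders.csv
```

In a Spark notebook, a URI of this form would usually be passed to a reader such as spark.read.csv(uri), assuming access to the storage account has already been configured.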
Who should take this Course?
This course is designed for data engineers, cloud professionals, and analysts who want to master big data analytics on Azure. It’s also valuable for IT professionals, developers, and students looking to gain practical, hands-on experience with Azure Data Lake Storage, Databricks, and HDInsight. Anyone interested in building scalable big data solutions in the cloud will benefit from this course.
What you will learn
Explore Data Lake optimization strategies
Learn to monitor the performance of your Data Lake
Explore different tools and scenarios to ingest data into Data Lake
Discover the five layers of security to protect Data Lake
Explore data security options and configure them using the Azure portal
Learn to monitor Azure Storage Service through Metrics
Course Outline
Course Introduction
Course Introduction
Introduction to Azure Cloud Computing
Create Azure Free Subscription
Azure Portal Overview
Azure Services Overview
Resource Management Group and Subscription
Resource Groups
Tagging
Delete Resources and Set Budget
Introduction to Azure Data Lake
Problem Statement
What is Data Lake?
Data Lake Versus Hadoop
How Data Lake Gen2 Evolved
Azure Data Lake Versus Azure Blob storage
Provision Azure Data Lake Gen2 Account
Azure Data Lake Gen2 Account Overview
Hierarchical Namespace
Other Data Lake Gen2 Features
Data Ingestion
Tools to Ingest Data in Data Lake
Demo: Ingest Using Portal and Storage Explorer
Demo: Ingest Data Using AzCopy
Demo: Azure Blob Storage to Data Lake Gen2 Using Data Factory
Demo: SQL Server to Data Lake Gen2 Using Data Factory
Demo: Amazon S3 to Data Lake Gen2 Using Data Factory
Data Flow Around Data Lake
Data Flow Around Data Lake
Data Lake and Transient Clusters
Azure Data Lake Processing Through Databricks
Demo Overview
Demo: Provision Databricks, Clusters, and Workbook
Demo: Mount Data Lake to Databricks DBFS
Demo: Explore, Analyze, Clean, Transform, and Load Data
Azure Data Lake Processing Through HDInsight
Demo Overview
Create Azure Data Lake Storage Gen2 (Source) and SQL Server (Destination)
What is Managed Identity
Add Managed Identity to Gen2 and Database Accounts