logo

Microsoft Azure Course

course overview

download outline

Select Country and City to View dates & book now

Overview

Learn how to harness the power of Apache Spark and powerful clusters running on the Azure Databricks platform to run large data engineering workloads in the cloud.

Outline

Module 1: Explore Azure Databricks
Azure Databricks is a cloud service that provides a scalable platform for data analytics using Apache Spark.
  • Introduction
  • Get started with Azure Databricks
  • Identify Azure Databricks workloads
  • Understand key concepts
  • Exercise - Explore Azure Databricks
  • Knowledge check
  • Summary
Module 2: Use Apache Spark in Azure Databricks
Azure Databricks is built on Apache Spark and enables data engineers and analysts to run Spark jobs to transform, analyze and visualize data at scale.
  • Introduction
  • Get to know Spark
  • Create a Spark cluster
  • Use Spark in notebooks
  • Use Spark to work with data files
  • Visualize data
  • Exercise - Use Spark in Azure Databricks
  • Knowledge check
  • Summary
Module 3: Use Delta Lake in Azure Databricks
Delta Lake is an open source relational storage area for Spark that you can use to implement a data lakehouse architecture in Azure Databricks.
  • Introduction
  • Get Started with Delta Lake
  • Create Delta Lake tables
  • Create and query catalog tables
  • Use Delta Lake for streaming data
  • Exercise - Use Delta Lake in Azure Databricks
  • Knowledge check
  • Summary
Module 4: Use SQL Warehouses in Azure Databricks
Azure Databricks provides SQL Warehouses that enable data analysts to work with data using familiar relational SQL queries.
  • Introduction
  • Get started with SQL Warehouses
  • Create databases and tables
  • Create queries and dashboards
  • Exercise - Use a SQL Warehouse in Azure Databricks
  • Knowledge check
  • Summary
Module 5: Run Azure Databricks Notebooks with Azure Data Factory
Using pipelines in Azure Data Factory to run notebooks in Azure Databricks enables you to automate data engineering processes at cloud scale.
  • Introduction
  • Understand Azure Databricks notebooks and pipelines
  • Create a linked service for Azure Databricks
  • Use a Notebook activity in a pipeline
  • Use parameters in a notebook
  • Exercise - Run an Azure Databricks Notebook with Azure Data Factory
  • Knowledge check
  • Summary

Talk to an expert

Thinking about Onsite?

If you need training for 3 or more people, you should ask us about onsite training. Putting aside the obvious location benefit, content can be customised to better meet your business objectives and more can be covered than in a public classroom. Its a cost effective option. One on one training can be delivered too, at reasonable rates.

Submit an enquiry from any page on this site and let us know you are interested in the requirements box, or simply mention it when we contact you.

All $ prices are in USD unless it’s a NZ or AU date

SPVC = Self Paced Virtual Class

LVC = Live Virtual Class

Please Note: All courses are availaible as Live Virtual Classes

Trusted by over 1/2 million students in 15 countries

Our clients have included prestigious national organisations such as Oxford University Press, multi-national private corporations such as JP Morgan and HSBC, as well as public sector institutions such as the Department of Defence and the Department of Health.