MONSTER ACTIVITY
Intended Audience: Citizen Data Engineer, BI Developers, Data Analysts, Analytics Engineers
Level: 200-300 (Intermediate)
Skills: Low Code Data transformation, Data modeling, Data analysis, SQL
Duration: 9:00 – 17:00 (including 60min lunch and 2*15min coffee break)
Trainer: Tom Martens (Microsoft Data Platform MVP, Speaker, Book Author, Datamonster)
Prerequisites (some are softer, one is hard):
· Basic understanding of data modeling (star schema or dimensional modeling)
· Ability to access your Fabric-enabled workspace in your Power BI/Fabric tenant to create and query Fabric items like lakehouses, delta parquet tables, and semantic models from your laptop. The Fabric-enabled workspace can either reside inside a corporate Fabric or private tenant.
· If you do not have access to your own Fabric-enabled workspace, this will be provided by datamonster e.V.
· Bring your own laptop (no special programs are required), WIFI access is a must
· First experience with programming languages like R or Python (helpful but not required)
Course Outcome:
· Create and Execute notebooks inside Microsoft Fabric
· Use Python to transform and shape data
· Use Python to write to and read from delta tables
· Use Python to move data to the various stages of a lakehouse
· Understand the benefits of a medallion architecture
Abstract
This course introduces tackling data engineering tasks using Python and notebooks within Microsoft Fabric. In addition to an introduction to Python, modern lakehouse architecture is explained. You’ll be introduced to the skills necessary to succeed as a data engineer and how to apply these skills practically using Microsoft Fabric!
This course includes practical examples that any Citizen Data Engineer can immediately apply in their day-to-day job and many valuable tips related explicitly to tackling typical data engineering tasks.
Even though Microsoft Fabric is used during the course, using Python for data engineering with Spark can be combined with many other platforms that harness the power of Spark.
Agenda
1. Introduction
a. Course Overview
b. Goals and outcomes
c. Setting the expectations
2. Microsoft Fabric workspaces and notebooks
a. What makes a Fabric-enabled workspace unique: about Spark runtimes and environments
b. Introduction to notebooks
3. The relationship of lakehouses and notebooks
a. Creating a lakehouse
b. The default lakehouse of a notebook
4. Adding data to the lakehouse
a. Understanding methods to ingest data to the lakehouse
b. Inspecting data located in the lakehouse
5. Introduction to Python
a. What is Python
b. Python Variables
c. Python Statements
d. Python Control Structures
e. Python Data Structures
6. Introduction to the Delta Parquet format
a. What is the Delta Parquet file format
b. What is a Spark dataframe
7. Creating delta parquet tables using PySpark
a. What is PySpark
b. Writing data as a delta table
8. What is a delta tables, and why it’s crucial to modern lakehouses
a. Modern Lakehouse and the Medallion architecture
b. Delta Tables in the context of spark and RDD’s
c. A spark dataframe is not the same as a Pandas dataframe
9. After some concepts now up to the data shaping using PySpark
a. Shaping data using PySpark, including method chaining and user-defined functions
b. Introduction to the medallion architecture
c. Using the UPSERT method to move data between the different stages of a medallion architecture
10. Create a semantic model using direct lake
a. Understand different storage modes (Import, DirectQuery, Direct Lake)
b. Default semantic model vs custom semantic models
11. Inspecting a semantic model using Sempy
b. Use Sempy to inspect a semantic model
12. Orchestration of notebooks
a. Automate notebook execution
b. Data pipelines and other methods
13. A little Quiz and Closing
a. Quiz
b. Closing
-------------------------------------------------------------------------------------
Parkmöglichkeiten
Eigene Parkplätze der oh22information services GmbH
Alternative Parkmöglichkeiten:
Parkplatz am Palastweiher
Anfahrt mit öffentlichen Verkehrsmitteln
Hotelempfehlungen
Maritim Hotel Königswinter
Storyhotel Bergischer Hof Königswinter
Mo | Di | Mi | Do | Fr | Sa | So |
---|---|---|---|---|---|---|
26
|
27
|
28
|
29
|
30
|
31
|
1
|
|
|
|
|
|
|
|
2
|
3
|
4
|
5
|
6
|
7
|
8
|
|
|
|
|
|
|
|
9
|
10
|
11
|
12
|
13
|
14
|
15
|
|
|
|
|
|
|
|
16
|
17
|
18
|
19
|
20
|
21
|
22
|
|
|
|
|
|
|
|
23
|
24
|
25
|
26
|
27
|
28
|
29
|
|
|
|
|
|
|
|
30
|
1
|
2
|
3
|
4
|
5
|
6
|
|
|
|
|
|
|
|