Microsoft Fabric: Spark Configuration & Delta Tables
Skillsoft issued completion badges are earned based on viewing the percentage required or receiving a passing score when assessment is required. Spark is essential both in Microsoft Fabric and for the DP-600 certification test. In this course, you’ll learn how to create Python scripts for Spark batch jobs in Fabric, writing ETL transformations and modifying them for batch execution. You’ll configure and monitor Spark batch jobs, analyze job logs, use the Spark History Server, and learn how to schedule jobs with retry policies.
Next, you’ll configure starter pools and explore high concurrency sessions. You’ll customize Spark settings in Fabric, and create custom Spark pools and environments, linking them to notebooks. After that, you’ll focus on Delta tables, working with version history and table contents.
You’ll use the DESCRIBE HISTORY command to analyze table versions and explore time travel by viewing and restoring data to specific versions or timestamps with SparkSQL and PySpark. Finally, you’ll explore the differences between managed and external Delta tables.
This course is part of a series that prepares learners for Exam DP-600: Implementing Analytics Solutions Using Microsoft Fabric.