SlideShare a Scribd company logo
2
Most read
4
Most read
10
Most read
WIFI SSID:SparkAISummit | Password: UnifiedAnalytics
Mark Kromer
Sr. Azure Data Program Manager
Microsoft
ETL Made Easy with Azure
Data Factory & Azure
Databricks
#UnifiedAnalytics #SparkAISummit
Azure Data Factory
Cloud ETL Patterns with ADF
3#UnifiedAnalytics #SparkAISummit
Nightly ETL Data Loads Code-free
Slowly Changing Dimension Scenario
Load Star Schema DW Scenario
Data Lake Data Science Scenario
Azure Data Factory
Workflow Data Pipelines/Control Flow
9#UnifiedAnalytics #SparkAISummit
ETL Made Easy with Azure Data Factory and Azure Databricks
ETL Made Easy with Azure Data Factory and Azure Databricks
ETL Made Easy with Azure Data Factory and Azure Databricks
Azure Data Factory
Mapping Data Flows
13#UnifiedAnalytics #SparkAISummit
What is ADF Mapping Data Flow?
• Transform Data, At Scale, in the
Cloud, Zero-Code
– Cloud-first, scale-out ELT
– Code-free dataflow pipelines
• Serverless scale-out transformation
execution engine
• Maximum Productivity for Data
Engineers
– Does NOT require understanding of
Spark / Scala / Python / Java
• Resilient Data Transformation
Flows
– Built for big data scenarios with
unstructured data requirements
– Operationalize with Data Factory
scheduling, control flow and
monitoring
Code-free Data Transformation At Scale
• Does not require understanding of Spark, Big Data Execution
Engines, Clusters, Scala, Python …
• Focus on building business logic and data transformation
– Data cleansing
– Aggregation
– Data conversions
– Data prep
– Data exploration
Transformation Function Expression Language
Debug Data Flows with Data Preview and Data
Sampling
Deep Monitoring Introspection of Data Transformations
DON’T FORGET TO RATE
AND REVIEW THE SESSIONS
SEARCH SPARK + AI SUMMIT

More Related Content

PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r1)
PDF
Databricks Delta Lake and Its Benefits
PDF
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
PDF
Introducing Databricks Delta
PDF
Data Mesh for Dinner
PPTX
Data Lakehouse Symposium | Day 4
PPTX
NOVA SQL User Group - Azure Synapse Analytics Overview - May 2020
PDF
Data lineage and observability with Marquez - subsurface 2020
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Databricks Delta Lake and Its Benefits
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Introducing Databricks Delta
Data Mesh for Dinner
Data Lakehouse Symposium | Day 4
NOVA SQL User Group - Azure Synapse Analytics Overview - May 2020
Data lineage and observability with Marquez - subsurface 2020

What's hot (20)

PDF
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
PDF
Intro to Delta Lake
PDF
Change Data Feed in Delta
PDF
Modernizing to a Cloud Data Architecture
PDF
Introduction SQL Analytics on Lakehouse Architecture
PDF
Introduction to Azure Data Lake
PPTX
Building an Effective Data Warehouse Architecture
PDF
Lakehouse in Azure
PDF
Making Data Timelier and More Reliable with Lakehouse Technology
PPTX
PPTX
Microsoft Azure Databricks
PDF
Owning Your Own (Data) Lake House
PPTX
Modern Data Architecture
PPTX
Building Modern Data Platform with Microsoft Azure
PDF
Introduction to Azure Data Factory
PDF
The Hidden Value of Hadoop Migration
PPTX
Databricks Fundamentals
PPTX
Azure Synapse Analytics Overview (r2)
PDF
Moving to Databricks & Delta
PDF
SQL Analytics Powering Telemetry Analysis at Comcast
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Intro to Delta Lake
Change Data Feed in Delta
Modernizing to a Cloud Data Architecture
Introduction SQL Analytics on Lakehouse Architecture
Introduction to Azure Data Lake
Building an Effective Data Warehouse Architecture
Lakehouse in Azure
Making Data Timelier and More Reliable with Lakehouse Technology
Microsoft Azure Databricks
Owning Your Own (Data) Lake House
Modern Data Architecture
Building Modern Data Platform with Microsoft Azure
Introduction to Azure Data Factory
The Hidden Value of Hadoop Migration
Databricks Fundamentals
Azure Synapse Analytics Overview (r2)
Moving to Databricks & Delta
SQL Analytics Powering Telemetry Analysis at Comcast
Ad

Similar to ETL Made Easy with Azure Data Factory and Azure Databricks (20)

PPTX
SQL Saturday Redmond 2019 ETL Patterns in the Cloud
PPTX
Azure Data Factory ETL Patterns in the Cloud
PPTX
Azure Data Factory for Azure Data Week
PDF
Building an Enterprise Data Platform with Azure Databricks to Enable Machine ...
PDF
Mapping Data Flows in Azure Data Factory 1st Edition Mark Kromer
PDF
Mapping Data Flows in Azure Data Factory 1st Edition Mark Kromer
PDF
Azure Data Engineer Course | Azure Data Engineer Training
PPTX
Azure Data Factory Data Flows Training (Sept 2020 Update)
PDF
DevOps for Applications in Azure Databricks: Creating Continuous Integration ...
PPTX
Microsoft Azure BI Solutions in the Cloud
PPTX
Big Data Analytics in the Cloud with Microsoft Azure
PDF
Free Demo on #Microsoft #SQLServer & #T-SQL with #Azure from SQL School
PDF
Life is but a Stream
PPTX
Mapping Data Flows Training deck Q1 CY22
PPTX
ADF Demo_ppt.pptx
PDF
Massive-Scale Entity Resolution Using the Power of Apache Spark and Graph
PPTX
Azure Data Factory Data Flows Training v005
PDF
Mapping Manager Brochure
PPTX
1- Introduction of Azure data factory.pptx
PPTX
Data Lake ETL in the Cloud with ADF
SQL Saturday Redmond 2019 ETL Patterns in the Cloud
Azure Data Factory ETL Patterns in the Cloud
Azure Data Factory for Azure Data Week
Building an Enterprise Data Platform with Azure Databricks to Enable Machine ...
Mapping Data Flows in Azure Data Factory 1st Edition Mark Kromer
Mapping Data Flows in Azure Data Factory 1st Edition Mark Kromer
Azure Data Engineer Course | Azure Data Engineer Training
Azure Data Factory Data Flows Training (Sept 2020 Update)
DevOps for Applications in Azure Databricks: Creating Continuous Integration ...
Microsoft Azure BI Solutions in the Cloud
Big Data Analytics in the Cloud with Microsoft Azure
Free Demo on #Microsoft #SQLServer & #T-SQL with #Azure from SQL School
Life is but a Stream
Mapping Data Flows Training deck Q1 CY22
ADF Demo_ppt.pptx
Massive-Scale Entity Resolution Using the Power of Apache Spark and Graph
Azure Data Factory Data Flows Training v005
Mapping Manager Brochure
1- Introduction of Azure data factory.pptx
Data Lake ETL in the Cloud with ADF
Ad

More from Databricks (20)

PPTX
DW Migration Webinar-March 2022.pptx
PPTX
Data Lakehouse Symposium | Day 1 | Part 1
PPT
Data Lakehouse Symposium | Day 1 | Part 2
PPTX
Data Lakehouse Symposium | Day 2
PDF
Democratizing Data Quality Through a Centralized Platform
PDF
Learn to Use Databricks for Data Science
PDF
Why APM Is Not the Same As ML Monitoring
PDF
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
PDF
Stage Level Scheduling Improving Big Data and AI Integration
PDF
Simplify Data Conversion from Spark to TensorFlow and PyTorch
PDF
Scaling your Data Pipelines with Apache Spark on Kubernetes
PDF
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
PDF
Sawtooth Windows for Feature Aggregations
PDF
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
PDF
Re-imagine Data Monitoring with whylogs and Spark
PDF
Raven: End-to-end Optimization of ML Prediction Queries
PDF
Processing Large Datasets for ADAS Applications using Apache Spark
PDF
Massive Data Processing in Adobe Using Delta Lake
PDF
Machine Learning CI/CD for Email Attack Detection
PDF
Jeeves Grows Up: An AI Chatbot for Performance and Quality
DW Migration Webinar-March 2022.pptx
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 2
Democratizing Data Quality Through a Centralized Platform
Learn to Use Databricks for Data Science
Why APM Is Not the Same As ML Monitoring
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Stage Level Scheduling Improving Big Data and AI Integration
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Sawtooth Windows for Feature Aggregations
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Re-imagine Data Monitoring with whylogs and Spark
Raven: End-to-end Optimization of ML Prediction Queries
Processing Large Datasets for ADAS Applications using Apache Spark
Massive Data Processing in Adobe Using Delta Lake
Machine Learning CI/CD for Email Attack Detection
Jeeves Grows Up: An AI Chatbot for Performance and Quality

Recently uploaded (20)

PDF
[EN] Industrial Machine Downtime Prediction
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PPTX
Managing Community Partner Relationships
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
Computer network topology notes for revision
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
Introduction to the R Programming Language
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PDF
annual-report-2024-2025 original latest.
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
[EN] Industrial Machine Downtime Prediction
SAP 2 completion done . PRESENTATION.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
.pdf is not working space design for the following data for the following dat...
STUDY DESIGN details- Lt Col Maksud (21).pptx
Supervised vs unsupervised machine learning algorithms
STERILIZATION AND DISINFECTION-1.ppthhhbx
Managing Community Partner Relationships
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Data_Analytics_and_PowerBI_Presentation.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Computer network topology notes for revision
Reliability_Chapter_ presentation 1221.5784
Galatica Smart Energy Infrastructure Startup Pitch Deck
Introduction to the R Programming Language
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
annual-report-2024-2025 original latest.
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...

ETL Made Easy with Azure Data Factory and Azure Databricks