SlideShare a Scribd company logo
8
Most read
9
Most read
13
Most read
WIFI SSID:SparkAISummit | Password: UnifiedAnalytics
Premal Shah, Microsoft
Creating continuous integration pipelines on
Azure using Azure Databricks and Azure DevOps
#UnifiedAnalytics #SparkAISummit
What is DevOps?
3#UnifiedAnalytics #SparkAISummit
What is DevOps?
4#UnifiedAnalytics #SparkAISummit
People. Process. Products.
DevOps is the union of
people, process, and
products to enable
continuous delivery of
value to your end users.
“
”
Build
&
Test
Continuous
Delivery
Deploy
Operate
Monitor
&
Learn
Plan
&
Track
Develop
Donovan Brown, MSFT PM
What is Azure Databricks?
5#UnifiedAnalytics #SparkAISummit
Increase productivity
Build on a secure, trusted cloud
Scale without limits
Built with your needs in mind
Enterprise grade Azure security
Native integration with Azure services
Live collaboration
Enterprise-grade SLAs
E2E data pipelines using ADF
Integrated billing
Azure DevOps
6#UnifiedAnalytics #SparkAISummit
7#UnifiedAnalytics #SparkAISummit
Azure DevOps: Choose what you like
Any Language, Any Platform
8#UnifiedAnalytics #SparkAISummit
Azure
Databricks
Dev WS
Push
notebooks
to Azure
Devops
Azure
DevOps
Repo
Build Pipeline
Artifact
Release
Pipeline
Deploy
Notebook
to staging
Azure
Databricks
Staging WS
Execute
Tests
Deploy
Notebook
to prod
Azure
Databricks
Prod WS
Implementation in Azure Databricks Notebooks
9#UnifiedAnalytics #SparkAISummit
Azure
Databricks
Dev WS
Azure
DevOps
Repo
Build Pipeline
Artifact
Release
Pipeline
Run with
staging
cluster
Azure
Databricks
Staging WS
Execute
Tests
Run with
Prod
cluster
Azure
Databricks
Prod WS
Implementation in IDE (PyCharm, IntelliJ)
DB Connect
Azure Databricks REST API/CLI
• Provides an easy-to-use interface to the Azure
Databricks platform. CLI (open source project) is
built on top of the REST APIs
– Workspace API
• Deploy notebooks from Azure DevOps to Azure Databricks
– DBFS API
• Deploy libraries from Azure DevOps to Azure Databricks
– Jobs API
• Execute notebooks and Spark code once deployed
10#UnifiedAnalytics #SparkAISummit
Demo
#UnifiedAnalytics #SparkAISummit
DevOps for ML: Goals
• Repeatability of model creation & behavior
• Evaluation of model predictions
• Managing different model versions and files
• Operationalization of the model
• Monitoring of training and scoring pipelines
12#UnifiedAnalytics #SparkAISummit
13#UnifiedAnalytics #SparkAISummit
Demo
#UnifiedAnalytics #SparkAISummit
Summary
• Two approaches
• Implementation in Azure Notebooks
• Implementation in IDE
• Azure DevOps to build CI/CD pipelines (you can selectively use)
• REST/CLI APIs
• Model CI/CD on Azure
• Azure Databricks: Data preparation and model training
• Azure ML: Model deployment and management
• Azure DevOps: CI/CD pipeline
15#UnifiedAnalytics #SparkAISummit
Call to action
• Build a CI/CD pipeline using Azure DevOps
• Azure Databricks documentation
• Azure DevOps pipelines
• Incorporate DevOps in your Azure Databricks
implementation
16#UnifiedAnalytics #SparkAISummit
THANK YOU!
“It is not the answer that enlightens, but the
question”
Eugene Ionesco
#UnifiedAnalytics #SparkAISummit
DON’T FORGET TO RATE
AND REVIEW THE SESSIONS
SEARCH SPARK + AI SUMMIT

More Related Content

What's hot (20)

PDF
Microservices for Application Modernisation
Ajay Kumar Uppal
 
PDF
IaC on AWS Cloud
Bhuvaneswari Subramani
 
PPTX
Microservices Architecture - Bangkok 2018
Araf Karsh Hamid
 
PDF
AZ-204 : Implement Azure security
AzureEzy1
 
PPTX
Terraform on Azure
Mithun Shanbhag
 
PPTX
Microservices Architecture & Testing Strategies
Araf Karsh Hamid
 
PDF
Microservices architecture overview v3
Dmitry Skaredov
 
PPTX
Hashicorp Corporate and Product Overview
Stenio Ferreira
 
PPTX
Microsoft Azure Technical Overview
gjuljo
 
PDF
AWS IAM
Diego Pacheco
 
PDF
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
Henning Jacobs
 
PDF
Simplifying Model Management with MLflow
Databricks
 
PDF
Azure DevOps Presentation
InCycleSoftware
 
PPSX
Microservices Architecture - Cloud Native Apps
Araf Karsh Hamid
 
PPTX
Introduction to Apache Kafka
Jeff Holoman
 
PPSX
Agile, User Stories, Domain Driven Design
Araf Karsh Hamid
 
PDF
Cloud-Native Security
VMware Tanzu
 
PDF
Grand tour of Azure API Management.pdf
Sherman37
 
PPTX
Using Azure DevOps to continuously build, test, and deploy containerized appl...
Adrian Todorov
 
PPTX
Azure - Identity as a service
BizTalk360
 
Microservices for Application Modernisation
Ajay Kumar Uppal
 
IaC on AWS Cloud
Bhuvaneswari Subramani
 
Microservices Architecture - Bangkok 2018
Araf Karsh Hamid
 
AZ-204 : Implement Azure security
AzureEzy1
 
Terraform on Azure
Mithun Shanbhag
 
Microservices Architecture & Testing Strategies
Araf Karsh Hamid
 
Microservices architecture overview v3
Dmitry Skaredov
 
Hashicorp Corporate and Product Overview
Stenio Ferreira
 
Microsoft Azure Technical Overview
gjuljo
 
AWS IAM
Diego Pacheco
 
Optimizing Kubernetes Resource Requests/Limits for Cost-Efficiency and Latenc...
Henning Jacobs
 
Simplifying Model Management with MLflow
Databricks
 
Azure DevOps Presentation
InCycleSoftware
 
Microservices Architecture - Cloud Native Apps
Araf Karsh Hamid
 
Introduction to Apache Kafka
Jeff Holoman
 
Agile, User Stories, Domain Driven Design
Araf Karsh Hamid
 
Cloud-Native Security
VMware Tanzu
 
Grand tour of Azure API Management.pdf
Sherman37
 
Using Azure DevOps to continuously build, test, and deploy containerized appl...
Adrian Todorov
 
Azure - Identity as a service
BizTalk360
 

Similar to DevOps for Applications in Azure Databricks: Creating Continuous Integration Pipelines on Azure Using Azure Databricks and Azure DevOps (20)

PDF
Building an Enterprise Data Platform with Azure Databricks to Enable Machine ...
Databricks
 
PDF
Scaling ML-Based Threat Detection For Production Cyber Attacks
Databricks
 
PDF
Connecting the Dots: Integrating Apache Spark into Production Pipelines
Databricks
 
PDF
Accelerating Machine Learning on Databricks Runtime
Databricks
 
PDF
Databricks: What We Have Learned by Eating Our Dog Food
Databricks
 
PDF
Industrializing Machine Learning on an Enterprise Azure Platform with Databri...
Databricks
 
PPTX
Databricks on AWS.pptx
Wasm1953
 
PDF
Databricks with R: Deep Dive
Databricks
 
PDF
Self-Service Apache Spark Structured Streaming Applications and Analytics
Databricks
 
PDF
Big Data Adavnced Analytics on Microsoft Azure
Mark Tabladillo
 
PPTX
Deep Learning Technical Pitch Deck
Nicholas Vossburg
 
PDF
AI on Spark for Malware Analysis and Anomalous Threat Detection
Databricks
 
PDF
Azure databricks c sharp corner toronto feb 2019 heather grandy
Nilesh Shah
 
PPTX
Global AI Bootcamp Madrid - Azure Databricks
Alberto Diaz Martin
 
PDF
How Azure Databricks helped make IoT Analytics a Reality with Janath Manohara...
Databricks
 
PPTX
Machine Learning and AI
James Serra
 
PDF
MLflow and Azure Machine Learning—The Power Couple for ML Lifecycle Management
Databricks
 
PPTX
Azure Databricks - An Introduction 2019 Roadshow.pptx
pascalsegoul
 
PPTX
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Alberto Diaz Martin
 
PDF
Spark Operator—Deploy, Manage and Monitor Spark clusters on Kubernetes
Databricks
 
Building an Enterprise Data Platform with Azure Databricks to Enable Machine ...
Databricks
 
Scaling ML-Based Threat Detection For Production Cyber Attacks
Databricks
 
Connecting the Dots: Integrating Apache Spark into Production Pipelines
Databricks
 
Accelerating Machine Learning on Databricks Runtime
Databricks
 
Databricks: What We Have Learned by Eating Our Dog Food
Databricks
 
Industrializing Machine Learning on an Enterprise Azure Platform with Databri...
Databricks
 
Databricks on AWS.pptx
Wasm1953
 
Databricks with R: Deep Dive
Databricks
 
Self-Service Apache Spark Structured Streaming Applications and Analytics
Databricks
 
Big Data Adavnced Analytics on Microsoft Azure
Mark Tabladillo
 
Deep Learning Technical Pitch Deck
Nicholas Vossburg
 
AI on Spark for Malware Analysis and Anomalous Threat Detection
Databricks
 
Azure databricks c sharp corner toronto feb 2019 heather grandy
Nilesh Shah
 
Global AI Bootcamp Madrid - Azure Databricks
Alberto Diaz Martin
 
How Azure Databricks helped make IoT Analytics a Reality with Janath Manohara...
Databricks
 
Machine Learning and AI
James Serra
 
MLflow and Azure Machine Learning—The Power Couple for ML Lifecycle Management
Databricks
 
Azure Databricks - An Introduction 2019 Roadshow.pptx
pascalsegoul
 
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Alberto Diaz Martin
 
Spark Operator—Deploy, Manage and Monitor Spark clusters on Kubernetes
Databricks
 
Ad

More from Databricks (20)

PPTX
DW Migration Webinar-March 2022.pptx
Databricks
 
PPTX
Data Lakehouse Symposium | Day 1 | Part 1
Databricks
 
PPT
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
PPTX
Data Lakehouse Symposium | Day 2
Databricks
 
PPTX
Data Lakehouse Symposium | Day 4
Databricks
 
PDF
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Databricks
 
PDF
Democratizing Data Quality Through a Centralized Platform
Databricks
 
PDF
Learn to Use Databricks for Data Science
Databricks
 
PDF
Why APM Is Not the Same As ML Monitoring
Databricks
 
PDF
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Databricks
 
PDF
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
PDF
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
PDF
Scaling your Data Pipelines with Apache Spark on Kubernetes
Databricks
 
PDF
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Databricks
 
PDF
Sawtooth Windows for Feature Aggregations
Databricks
 
PDF
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Databricks
 
PDF
Re-imagine Data Monitoring with whylogs and Spark
Databricks
 
PDF
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
PDF
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 
PDF
Massive Data Processing in Adobe Using Delta Lake
Databricks
 
DW Migration Webinar-March 2022.pptx
Databricks
 
Data Lakehouse Symposium | Day 1 | Part 1
Databricks
 
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
Data Lakehouse Symposium | Day 2
Databricks
 
Data Lakehouse Symposium | Day 4
Databricks
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Databricks
 
Democratizing Data Quality Through a Centralized Platform
Databricks
 
Learn to Use Databricks for Data Science
Databricks
 
Why APM Is Not the Same As ML Monitoring
Databricks
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Databricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Databricks
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Databricks
 
Sawtooth Windows for Feature Aggregations
Databricks
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Databricks
 
Re-imagine Data Monitoring with whylogs and Spark
Databricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 
Massive Data Processing in Adobe Using Delta Lake
Databricks
 
Ad

Recently uploaded (20)

PPTX
Project_Update_Summary.for the use from PM
Odysseas Lekatsas
 
PPTX
MENU-DRIVEN PROGRAM ON ARUNACHAL PRADESH.pptx
manvi200807
 
PPTX
Indigo dyeing Presentation (2).pptx as dye
shreeroop1335
 
PDF
Informatics Market Insights AI Workforce.pdf
karizaroxx
 
PPTX
Artificial intelligence Presentation1.pptx
SaritaMahajan5
 
PPTX
Daily, Weekly, Monthly Report MTC March 2025.pptx
PanjiDewaPamungkas1
 
PDF
A Web Repository System for Data Mining in Drug Discovery
IJDKP
 
PDF
TCU EVALUATION FACULTY TCU Taguig City 1st Semester 2017-2018
MELJUN CORTES
 
PDF
ilide.info-tg-understanding-culture-society-and-politics-pr_127f984d2904c57ec...
jed P
 
PPTX
Module-2_3-1eentzyssssssssssssssssssssss.pptx
ShahidHussain66691
 
PDF
Kafka Use Cases Real-World Applications
Accentfuture
 
PPTX
covid 19 data analysis updates in our municipality
RhuAyungon1
 
PPT
Reliability Monitoring of Aircrfat commerce
Rizk2
 
PDF
Blood pressure (3).pdfbdbsbsbhshshshhdhdhshshs
hernandezemma379
 
PDF
Datàaaaaaaaaaengineeeeeeeeeeeeeeeeeeeeeee
juadsr96
 
PDF
GOOGLE ADS (1).pdf THE ULTIMATE GUIDE TO
kushalkeshwanisou
 
PPTX
microservices-with-container-apps-dapr.pptx
vjay22
 
PPTX
Natural Language Processing Datascience.pptx
Anandh798253
 
PPTX
Mynd company all details what they are doing a
AniketKadam40952
 
PDF
Exploiting the Low Volatility Anomaly: A Low Beta Model Portfolio for Risk-Ad...
Bradley Norbom, CFA
 
Project_Update_Summary.for the use from PM
Odysseas Lekatsas
 
MENU-DRIVEN PROGRAM ON ARUNACHAL PRADESH.pptx
manvi200807
 
Indigo dyeing Presentation (2).pptx as dye
shreeroop1335
 
Informatics Market Insights AI Workforce.pdf
karizaroxx
 
Artificial intelligence Presentation1.pptx
SaritaMahajan5
 
Daily, Weekly, Monthly Report MTC March 2025.pptx
PanjiDewaPamungkas1
 
A Web Repository System for Data Mining in Drug Discovery
IJDKP
 
TCU EVALUATION FACULTY TCU Taguig City 1st Semester 2017-2018
MELJUN CORTES
 
ilide.info-tg-understanding-culture-society-and-politics-pr_127f984d2904c57ec...
jed P
 
Module-2_3-1eentzyssssssssssssssssssssss.pptx
ShahidHussain66691
 
Kafka Use Cases Real-World Applications
Accentfuture
 
covid 19 data analysis updates in our municipality
RhuAyungon1
 
Reliability Monitoring of Aircrfat commerce
Rizk2
 
Blood pressure (3).pdfbdbsbsbhshshshhdhdhshshs
hernandezemma379
 
Datàaaaaaaaaaengineeeeeeeeeeeeeeeeeeeeeee
juadsr96
 
GOOGLE ADS (1).pdf THE ULTIMATE GUIDE TO
kushalkeshwanisou
 
microservices-with-container-apps-dapr.pptx
vjay22
 
Natural Language Processing Datascience.pptx
Anandh798253
 
Mynd company all details what they are doing a
AniketKadam40952
 
Exploiting the Low Volatility Anomaly: A Low Beta Model Portfolio for Risk-Ad...
Bradley Norbom, CFA
 

DevOps for Applications in Azure Databricks: Creating Continuous Integration Pipelines on Azure Using Azure Databricks and Azure DevOps

  • 1. WIFI SSID:SparkAISummit | Password: UnifiedAnalytics
  • 2. Premal Shah, Microsoft Creating continuous integration pipelines on Azure using Azure Databricks and Azure DevOps #UnifiedAnalytics #SparkAISummit
  • 4. What is DevOps? 4#UnifiedAnalytics #SparkAISummit People. Process. Products. DevOps is the union of people, process, and products to enable continuous delivery of value to your end users. “ ” Build & Test Continuous Delivery Deploy Operate Monitor & Learn Plan & Track Develop Donovan Brown, MSFT PM
  • 5. What is Azure Databricks? 5#UnifiedAnalytics #SparkAISummit Increase productivity Build on a secure, trusted cloud Scale without limits Built with your needs in mind Enterprise grade Azure security Native integration with Azure services Live collaboration Enterprise-grade SLAs E2E data pipelines using ADF Integrated billing
  • 7. 7#UnifiedAnalytics #SparkAISummit Azure DevOps: Choose what you like Any Language, Any Platform
  • 8. 8#UnifiedAnalytics #SparkAISummit Azure Databricks Dev WS Push notebooks to Azure Devops Azure DevOps Repo Build Pipeline Artifact Release Pipeline Deploy Notebook to staging Azure Databricks Staging WS Execute Tests Deploy Notebook to prod Azure Databricks Prod WS Implementation in Azure Databricks Notebooks
  • 9. 9#UnifiedAnalytics #SparkAISummit Azure Databricks Dev WS Azure DevOps Repo Build Pipeline Artifact Release Pipeline Run with staging cluster Azure Databricks Staging WS Execute Tests Run with Prod cluster Azure Databricks Prod WS Implementation in IDE (PyCharm, IntelliJ) DB Connect
  • 10. Azure Databricks REST API/CLI • Provides an easy-to-use interface to the Azure Databricks platform. CLI (open source project) is built on top of the REST APIs – Workspace API • Deploy notebooks from Azure DevOps to Azure Databricks – DBFS API • Deploy libraries from Azure DevOps to Azure Databricks – Jobs API • Execute notebooks and Spark code once deployed 10#UnifiedAnalytics #SparkAISummit
  • 12. DevOps for ML: Goals • Repeatability of model creation & behavior • Evaluation of model predictions • Managing different model versions and files • Operationalization of the model • Monitoring of training and scoring pipelines 12#UnifiedAnalytics #SparkAISummit
  • 15. Summary • Two approaches • Implementation in Azure Notebooks • Implementation in IDE • Azure DevOps to build CI/CD pipelines (you can selectively use) • REST/CLI APIs • Model CI/CD on Azure • Azure Databricks: Data preparation and model training • Azure ML: Model deployment and management • Azure DevOps: CI/CD pipeline 15#UnifiedAnalytics #SparkAISummit
  • 16. Call to action • Build a CI/CD pipeline using Azure DevOps • Azure Databricks documentation • Azure DevOps pipelines • Incorporate DevOps in your Azure Databricks implementation 16#UnifiedAnalytics #SparkAISummit
  • 17. THANK YOU! “It is not the answer that enlightens, but the question” Eugene Ionesco #UnifiedAnalytics #SparkAISummit
  • 18. DON’T FORGET TO RATE AND REVIEW THE SESSIONS SEARCH SPARK + AI SUMMIT