SlideShare a Scribd company logo
Build, Scale, and Deploy Deep
Learning Pipelines with Ease
Tim Hunter (Software Engineer)
Sue Ann Hong (Software Engineer)
Jules S. Damji (Spark Community Evangelist)
July 27, 2017
Agenda
• Logistics
• Databricks Overview
• Build, Scale and Deploy Deep Learning Pipelines with Ease
• Q & A
Logistics
• We can’t hear you…
• Recording will be available...
• Slides will be available...
• Queue up Questions ….
• Orange Button for Tech Support difficulties...
TEAM
About Databricks
Started Spark project (now Apache Spark) at UC Berkeleyin 2009
PRODUCT
Unified Analytics Platform
MISSION
Making Big Data Simple
Accelerate innovation by
unifying data science,
engineering and business.
Unified Analytics
Platform
UNIFIED
INFRASTRUCTURE
UNIFIED
EXPERIENCE
ACROSS TEAMS
UNIFIED
ANALYTIC
WORKFLOWS
The Unified Analytics Platform
About Us
• Sue Ann Hong
• Software engineer @ Databricks
• Ph.D. from CMU in Machine Learning
• Contributor to MLlib
• Author of Deep Learning Pipelines
About Us
• Tim Hunter
• Software engineer @ Databricks
• Ph.D. from UC Berkeley in Machine Learning
• Very early Spark user
• Contributor to MLlib
• Author of Deep Learning Pipelines, TensorFrames and
GraphFrames
Build, Scale, and Deploy Deep
Learning Pipelines with Ease
Tim Hunter (Software Engineer)
Sue Ann Hong (Software Engineer)
July 27, 2017
Today
• Deep Learning at scale made easy: the vision
• Processing images with DL Pipelines
• Building simple Deep Learning models with transfer learning
• Model deployment via SQL
Further advanced topics will be covered in our next webinar.
Deep Learning with ease
What is Deep Learning?
• A set of machine learning techniques that use layers that
transform numerical inputs
• Classification
• Regression
• Arbitrary mapping
• Popular in the 80’s as Neural Networks
• Recently came back thanks to advances in data collection,
computation techniques, and hardware.
Success of Deep Learning
• Tremendous success for applications with complex data
• AlphaGo
• Image interpretation
• Automatictranslation
• Speech recognition
But still requires a lot of effort
• Low level APIs with steep learning curve
• Tedious to distribute computations
• Not well integrated with other enterprise tools
• No exact science around deep learning
• Success requires many engineer-hours
Deep Learning in industry
• Currently limited adoption
• Huge potential beyond the industrial giants
• How do we accelerate the road to massive availability?
A typical Deep Learning workflow
• Load data (images, text, time series, …)
• Interactive work
• Train
• Select an architecture for a neural network
• Optimize the weights of the NN
• Evaluateresults, potentially re-train
• Apply:
• Pass the data through the NN to produce new features or output
How can Spark help?
• A lot of libraries available for Deep Learning in Spark
• TensorFlowOnSpark, BigDL, …
• Goes from simple to very advanced
• See our previous webinar for more detail
• Spark is great at scaling out computations
• Distribute the transforms
• Manage the trainingcomputation
• Spark MLlib Pipelines
• Simple, concise APIto capture the ML workflow
Deep Learning Pipelines:
Deep Learning with Simplicity
• Open-source Databricks library:
https://github.com/databricks/spark-deep-learning
• Focuses on easeof useand integration,without sacrificing
performance
• Scales out common tasks
• Integrates with Spark APIs
• Primary language: Python
Deep Learning Pipelines
• Load data
• Interactive work
• Train
• Evaluate model
• Apply
• Image	loading	in	Spark
• Deploying	models	in	SQL
• Transfer	learning
• Distributed	tuning
• Distributed	prediction
• Pre-trained	models
This
webinar:
✓
✓
✓
✓
Image processing with DL
Pipelines and Databricks
Adds support for images in Spark
• ImageSchema, reader, conversion functions to/from numpy
arrays
• Most of the tools we’ll describe work on ImageSchema columns
from sparkdl import readImages
image_df = readImages(sample_img_dir)
Applying popular models
• Popular pre-trained models accessible through MLlib
Transformers
predictor = DeepImagePredictor(inputCol="image",
outputCol="predicted_labels",
modelName="InceptionV3")
predictions_df = predictor.transform(image_df)
Applying popular models
predictor = DeepImagePredictor(inputCol="image",
outputCol="predicted_labels",
modelName="InceptionV3")
predictions_df = predictor.transform(image_df)
Fast model training via
transfer learning
Example: Identify the James Bond cars
DEMO
Transfer Learning
Transfer Learning
Transfer Learning
Transfer Learning
Transfer Learning
SoftMax
GIANT PANDA 0.9
RED PANDA 0.05
RACCOON 0.01
…
Classifier
Transfer Learning
DeepImageFeaturizer
MLlib primer
• MLlib: the machine learning library included with Spark
• Transformer
• Transforms the data: takes a Spark dataframe and appends a new column
• Estimator
• Produces a model (fit)
• Pipeline: sequence of transformers and estimators
Transfer Learning as a Pipeline
MLlib Pipeline
Image
Loading Preprocessing
Logistic
Regression
DeepImageFeaturizer
DEMO
Sharing and exporting Deep
Learning models
Classifier
Deep	Learning	Model
Model Export and Sharing
Shipping predictors in SQL
Take a trained model / Pipeline, register a SQL UDF usable by
anyone in the organization
In Spark SQL:
registerKerasUDF(”my_object_recognition_function",
keras_model_file="/mymodels/007model.h5")
select image, my_object_recognition_function(image) as objects
from traffic_imgs
Conclusion
Deep Learning without Deep Pockets
• Simple API for Deep Learning, integrated with MLlib
• Scales common tasks with transformers and estimators
• Embeds Deep Learning models in MLlib and SparkSQL
• Early release of Deep Learning Pipelines
https://github.com/databricks/spark-deep-learning
Deep Learning Pipelines - future
In progress
• Hyper-parameter tuning for Keras models
• Official image support in Spark
Potential future work
• Scala API
• Text models
• Support for more backends, e.g. MXNet, PyTorch, BigDL
Resources
Blog posts & webinars — http://databricks.com/blog
• Deep Learning Pipelines
• GPU acceleration in Databricks
• BigDL on Databricks
• Deep Learning and Apache Spark
Docs for Deep Learning on Databricks — http://docs.databricks.com
• Getting started
• Deep Learning Pipelines Example
• Spark integration
Thank You!
Questions?
Happy Sparking & Deep Learning!
UNIFIED ANALYTICS PLATFORM
Try Apache Spark in Databricks!
• Collaborative cloud environment
• Free version (community edition)
DATABRICKS RUNTIME 3.0
• Apache Spark - optimized for the cloud
• Caching and optimization layer - DBIO
• Enterprise security - DBES
Try for free today
databricks.com

More Related Content

PDF
Build, Scale, and Deploy Deep Learning Pipelines with Ease Using Apache Spark
PDF
Build, Scale, and Deploy Deep Learning Pipelines Using Apache Spark
PDF
Build, Scale, and Deploy Deep Learning Pipelines Using Apache Spark
PPTX
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim Hunter
PDF
What's New in Apache Spark 2.3 & Why Should You Care
PDF
Extending Spark's Ingestion: Build Your Own Java Data Source with Jean George...
PDF
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark with Ma...
PPTX
Deep Learning and Streaming in Apache Spark 2.x with Matei Zaharia
Build, Scale, and Deploy Deep Learning Pipelines with Ease Using Apache Spark
Build, Scale, and Deploy Deep Learning Pipelines Using Apache Spark
Build, Scale, and Deploy Deep Learning Pipelines Using Apache Spark
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim Hunter
What's New in Apache Spark 2.3 & Why Should You Care
Extending Spark's Ingestion: Build Your Own Java Data Source with Jean George...
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark with Ma...
Deep Learning and Streaming in Apache Spark 2.x with Matei Zaharia

What's hot (20)

PPTX
From Pipelines to Refineries: scaling big data applications with Tim Hunter
PDF
Composable Parallel Processing in Apache Spark and Weld
PDF
Web-Scale Graph Analytics with Apache® Spark™
PDF
Ray: A Cluster Computing Engine for Reinforcement Learning Applications with ...
PDF
A Tale of Three Tools: Kubernetes, Jsonnet, and Bazel
PDF
Web-Scale Graph Analytics with Apache® Spark™
PDF
A Tale of Three Deep Learning Frameworks: TensorFlow, Keras, & Deep Learning ...
PDF
What's New in Upcoming Apache Spark 2.3
PDF
Recent Developments In SparkR For Advanced Analytics
PDF
Accelerating Data Science with Better Data Engineering on Databricks
PDF
Spark Summit 2016: Connecting Python to the Spark Ecosystem
PDF
Spark DataFrames and ML Pipelines
PDF
Scaling Machine Learning To Billions Of Parameters
PDF
Lessons Learned while Implementing a Sparse Logistic Regression Algorithm in ...
PDF
Deploying Enterprise Deep Learning Masterclass Preview - Enterprise Deep Lea...
PDF
Embrace Sparsity At Web Scale: Apache Spark MLlib Algorithms Optimization For...
PDF
Apache Spark MLlib's Past Trajectory and New Directions with Joseph Bradley
PDF
Microservices and Teraflops: Effortlessly Scaling Data Science with PyWren wi...
PDF
Apache Spark's MLlib's Past Trajectory and new Directions
PDF
Writing Continuous Applications with Structured Streaming Python APIs in Apac...
From Pipelines to Refineries: scaling big data applications with Tim Hunter
Composable Parallel Processing in Apache Spark and Weld
Web-Scale Graph Analytics with Apache® Spark™
Ray: A Cluster Computing Engine for Reinforcement Learning Applications with ...
A Tale of Three Tools: Kubernetes, Jsonnet, and Bazel
Web-Scale Graph Analytics with Apache® Spark™
A Tale of Three Deep Learning Frameworks: TensorFlow, Keras, & Deep Learning ...
What's New in Upcoming Apache Spark 2.3
Recent Developments In SparkR For Advanced Analytics
Accelerating Data Science with Better Data Engineering on Databricks
Spark Summit 2016: Connecting Python to the Spark Ecosystem
Spark DataFrames and ML Pipelines
Scaling Machine Learning To Billions Of Parameters
Lessons Learned while Implementing a Sparse Logistic Regression Algorithm in ...
Deploying Enterprise Deep Learning Masterclass Preview - Enterprise Deep Lea...
Embrace Sparsity At Web Scale: Apache Spark MLlib Algorithms Optimization For...
Apache Spark MLlib's Past Trajectory and New Directions with Joseph Bradley
Microservices and Teraflops: Effortlessly Scaling Data Science with PyWren wi...
Apache Spark's MLlib's Past Trajectory and new Directions
Writing Continuous Applications with Structured Streaming Python APIs in Apac...
Ad

Similar to Build, Scale, and Deploy Deep Learning Pipelines with Ease (20)

PDF
Integrating Deep Learning Libraries with Apache Spark
PPTX
Emiliano Martinez | Deep learning in Spark Slides | Codemotion Madrid 2018
PDF
Deep Learning on Apache® Spark™: Workflows and Best Practices
PDF
Deep Learning on Apache® Spark™ : Workflows and Best Practices
PDF
Deep Learning on Apache® Spark™: Workflows and Best Practices
PPTX
Combining Machine Learning frameworks with Apache Spark
PDF
Build a deep learning pipeline on apache spark for ads optimization
PPTX
Combining Machine Learning Frameworks with Apache Spark
PDF
Data Science and Deep Learning on Spark with 1/10th of the Code with Roope As...
PDF
Data Con LA 2018 - A Tale of DL Frameworks: TensorFlow, Keras, & Deep Learnin...
PDF
A Tale of Three Deep Learning Frameworks: TensorFlow, Keras, and Deep Learnin...
PDF
Deep learning and Apache Spark
PDF
AWS re:Invent Deep Learning: Goin Beyond Machine Learning (BDT311)
PPTX
No BS Guide to Deep Learning in the Enterprise
PDF
Bringing Deep Learning into production
PPTX
BigDL Deep Learning in Apache Spark - AWS re:invent 2017
PDF
Index conf sparkai-feb20-n-pentreath
PDF
Build Deep Learning Applications for Big Data Platforms (CVPR 2018 tutorial)
PPTX
Apache Spark MLlib
PPTX
Practical Distributed Machine Learning Pipelines on Hadoop
Integrating Deep Learning Libraries with Apache Spark
Emiliano Martinez | Deep learning in Spark Slides | Codemotion Madrid 2018
Deep Learning on Apache® Spark™: Workflows and Best Practices
Deep Learning on Apache® Spark™ : Workflows and Best Practices
Deep Learning on Apache® Spark™: Workflows and Best Practices
Combining Machine Learning frameworks with Apache Spark
Build a deep learning pipeline on apache spark for ads optimization
Combining Machine Learning Frameworks with Apache Spark
Data Science and Deep Learning on Spark with 1/10th of the Code with Roope As...
Data Con LA 2018 - A Tale of DL Frameworks: TensorFlow, Keras, & Deep Learnin...
A Tale of Three Deep Learning Frameworks: TensorFlow, Keras, and Deep Learnin...
Deep learning and Apache Spark
AWS re:Invent Deep Learning: Goin Beyond Machine Learning (BDT311)
No BS Guide to Deep Learning in the Enterprise
Bringing Deep Learning into production
BigDL Deep Learning in Apache Spark - AWS re:invent 2017
Index conf sparkai-feb20-n-pentreath
Build Deep Learning Applications for Big Data Platforms (CVPR 2018 tutorial)
Apache Spark MLlib
Practical Distributed Machine Learning Pipelines on Hadoop
Ad

More from Databricks (20)

PPTX
DW Migration Webinar-March 2022.pptx
PPTX
Data Lakehouse Symposium | Day 1 | Part 1
PPT
Data Lakehouse Symposium | Day 1 | Part 2
PPTX
Data Lakehouse Symposium | Day 2
PPTX
Data Lakehouse Symposium | Day 4
PDF
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
PDF
Democratizing Data Quality Through a Centralized Platform
PDF
Learn to Use Databricks for Data Science
PDF
Why APM Is Not the Same As ML Monitoring
PDF
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
PDF
Stage Level Scheduling Improving Big Data and AI Integration
PDF
Simplify Data Conversion from Spark to TensorFlow and PyTorch
PDF
Scaling your Data Pipelines with Apache Spark on Kubernetes
PDF
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
PDF
Sawtooth Windows for Feature Aggregations
PDF
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
PDF
Re-imagine Data Monitoring with whylogs and Spark
PDF
Raven: End-to-end Optimization of ML Prediction Queries
PDF
Processing Large Datasets for ADAS Applications using Apache Spark
PDF
Massive Data Processing in Adobe Using Delta Lake
DW Migration Webinar-March 2022.pptx
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 4
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Democratizing Data Quality Through a Centralized Platform
Learn to Use Databricks for Data Science
Why APM Is Not the Same As ML Monitoring
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Stage Level Scheduling Improving Big Data and AI Integration
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Sawtooth Windows for Feature Aggregations
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Re-imagine Data Monitoring with whylogs and Spark
Raven: End-to-end Optimization of ML Prediction Queries
Processing Large Datasets for ADAS Applications using Apache Spark
Massive Data Processing in Adobe Using Delta Lake

Recently uploaded (20)

PPT
Introduction Database Management System for Course Database
PDF
Understanding Forklifts - TECH EHS Solution
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
PDF
Which alternative to Crystal Reports is best for small or large businesses.pdf
PDF
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 41
PDF
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
PDF
System and Network Administration Chapter 2
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PDF
Digital Systems & Binary Numbers (comprehensive )
PPTX
L1 - Introduction to python Backend.pptx
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PDF
How to Migrate SBCGlobal Email to Yahoo Easily
PPTX
Computer Software and OS of computer science of grade 11.pptx
PPTX
Reimagine Home Health with the Power of Agentic AI​
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
Introduction Database Management System for Course Database
Understanding Forklifts - TECH EHS Solution
Design an Analysis of Algorithms I-SECS-1021-03
Operating system designcfffgfgggggggvggggggggg
2025 Textile ERP Trends: SAP, Odoo & Oracle
Which alternative to Crystal Reports is best for small or large businesses.pdf
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
Internet Downloader Manager (IDM) Crack 6.42 Build 41
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
System and Network Administration Chapter 2
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
Digital Systems & Binary Numbers (comprehensive )
L1 - Introduction to python Backend.pptx
Wondershare Filmora 15 Crack With Activation Key [2025
How to Choose the Right IT Partner for Your Business in Malaysia
How to Migrate SBCGlobal Email to Yahoo Easily
Computer Software and OS of computer science of grade 11.pptx
Reimagine Home Health with the Power of Agentic AI​
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf

Build, Scale, and Deploy Deep Learning Pipelines with Ease

  • 1. Build, Scale, and Deploy Deep Learning Pipelines with Ease Tim Hunter (Software Engineer) Sue Ann Hong (Software Engineer) Jules S. Damji (Spark Community Evangelist) July 27, 2017
  • 2. Agenda • Logistics • Databricks Overview • Build, Scale and Deploy Deep Learning Pipelines with Ease • Q & A
  • 3. Logistics • We can’t hear you… • Recording will be available... • Slides will be available... • Queue up Questions …. • Orange Button for Tech Support difficulties...
  • 4. TEAM About Databricks Started Spark project (now Apache Spark) at UC Berkeleyin 2009 PRODUCT Unified Analytics Platform MISSION Making Big Data Simple
  • 5. Accelerate innovation by unifying data science, engineering and business. Unified Analytics Platform UNIFIED INFRASTRUCTURE UNIFIED EXPERIENCE ACROSS TEAMS UNIFIED ANALYTIC WORKFLOWS
  • 7. About Us • Sue Ann Hong • Software engineer @ Databricks • Ph.D. from CMU in Machine Learning • Contributor to MLlib • Author of Deep Learning Pipelines
  • 8. About Us • Tim Hunter • Software engineer @ Databricks • Ph.D. from UC Berkeley in Machine Learning • Very early Spark user • Contributor to MLlib • Author of Deep Learning Pipelines, TensorFrames and GraphFrames
  • 9. Build, Scale, and Deploy Deep Learning Pipelines with Ease Tim Hunter (Software Engineer) Sue Ann Hong (Software Engineer) July 27, 2017
  • 10. Today • Deep Learning at scale made easy: the vision • Processing images with DL Pipelines • Building simple Deep Learning models with transfer learning • Model deployment via SQL Further advanced topics will be covered in our next webinar.
  • 12. What is Deep Learning? • A set of machine learning techniques that use layers that transform numerical inputs • Classification • Regression • Arbitrary mapping • Popular in the 80’s as Neural Networks • Recently came back thanks to advances in data collection, computation techniques, and hardware.
  • 13. Success of Deep Learning • Tremendous success for applications with complex data • AlphaGo • Image interpretation • Automatictranslation • Speech recognition
  • 14. But still requires a lot of effort • Low level APIs with steep learning curve • Tedious to distribute computations • Not well integrated with other enterprise tools • No exact science around deep learning • Success requires many engineer-hours
  • 15. Deep Learning in industry • Currently limited adoption • Huge potential beyond the industrial giants • How do we accelerate the road to massive availability?
  • 16. A typical Deep Learning workflow • Load data (images, text, time series, …) • Interactive work • Train • Select an architecture for a neural network • Optimize the weights of the NN • Evaluateresults, potentially re-train • Apply: • Pass the data through the NN to produce new features or output
  • 17. How can Spark help? • A lot of libraries available for Deep Learning in Spark • TensorFlowOnSpark, BigDL, … • Goes from simple to very advanced • See our previous webinar for more detail • Spark is great at scaling out computations • Distribute the transforms • Manage the trainingcomputation • Spark MLlib Pipelines • Simple, concise APIto capture the ML workflow
  • 18. Deep Learning Pipelines: Deep Learning with Simplicity • Open-source Databricks library: https://github.com/databricks/spark-deep-learning • Focuses on easeof useand integration,without sacrificing performance • Scales out common tasks • Integrates with Spark APIs • Primary language: Python
  • 19. Deep Learning Pipelines • Load data • Interactive work • Train • Evaluate model • Apply • Image loading in Spark • Deploying models in SQL • Transfer learning • Distributed tuning • Distributed prediction • Pre-trained models This webinar: ✓ ✓ ✓ ✓
  • 20. Image processing with DL Pipelines and Databricks
  • 21. Adds support for images in Spark • ImageSchema, reader, conversion functions to/from numpy arrays • Most of the tools we’ll describe work on ImageSchema columns from sparkdl import readImages image_df = readImages(sample_img_dir)
  • 22. Applying popular models • Popular pre-trained models accessible through MLlib Transformers predictor = DeepImagePredictor(inputCol="image", outputCol="predicted_labels", modelName="InceptionV3") predictions_df = predictor.transform(image_df)
  • 23. Applying popular models predictor = DeepImagePredictor(inputCol="image", outputCol="predicted_labels", modelName="InceptionV3") predictions_df = predictor.transform(image_df)
  • 24. Fast model training via transfer learning
  • 25. Example: Identify the James Bond cars
  • 26. DEMO
  • 32. SoftMax GIANT PANDA 0.9 RED PANDA 0.05 RACCOON 0.01 … Classifier Transfer Learning DeepImageFeaturizer
  • 33. MLlib primer • MLlib: the machine learning library included with Spark • Transformer • Transforms the data: takes a Spark dataframe and appends a new column • Estimator • Produces a model (fit) • Pipeline: sequence of transformers and estimators
  • 34. Transfer Learning as a Pipeline MLlib Pipeline Image Loading Preprocessing Logistic Regression DeepImageFeaturizer
  • 35. DEMO
  • 36. Sharing and exporting Deep Learning models
  • 38. Shipping predictors in SQL Take a trained model / Pipeline, register a SQL UDF usable by anyone in the organization In Spark SQL: registerKerasUDF(”my_object_recognition_function", keras_model_file="/mymodels/007model.h5") select image, my_object_recognition_function(image) as objects from traffic_imgs
  • 40. Deep Learning without Deep Pockets • Simple API for Deep Learning, integrated with MLlib • Scales common tasks with transformers and estimators • Embeds Deep Learning models in MLlib and SparkSQL • Early release of Deep Learning Pipelines https://github.com/databricks/spark-deep-learning
  • 41. Deep Learning Pipelines - future In progress • Hyper-parameter tuning for Keras models • Official image support in Spark Potential future work • Scala API • Text models • Support for more backends, e.g. MXNet, PyTorch, BigDL
  • 42. Resources Blog posts & webinars — http://databricks.com/blog • Deep Learning Pipelines • GPU acceleration in Databricks • BigDL on Databricks • Deep Learning and Apache Spark Docs for Deep Learning on Databricks — http://docs.databricks.com • Getting started • Deep Learning Pipelines Example • Spark integration
  • 44. UNIFIED ANALYTICS PLATFORM Try Apache Spark in Databricks! • Collaborative cloud environment • Free version (community edition) DATABRICKS RUNTIME 3.0 • Apache Spark - optimized for the cloud • Caching and optimization layer - DBIO • Enterprise security - DBES Try for free today databricks.com