Open Source Framework for Deploying Data Science Models and Cloud Based Applications
Noelle Sio, Pivotal Data Science Team
What happened?
What should I do about it?
This is where Data Science comes in
What will happen next?
What Thought Leaders Have In Common
• Large amounts of structured and unstructured data
• Deep personal knowledge of their audience
• Quantified understanding of their products
• Data-driven culture
• User experience optimized by data science
Internal Data Sources
• Viewership
• Advertisements
• Merchandise
• Sales & Finance
Typical External Sources
• Market Research & Competitive Information
• Audience Demographics
Semi/Unstructured Data
• Clickstream
• Social Media
• Content
Data Science Impact
Increase Revenue / Reduce Cost
Business Motivation
• Increase Demand
• Build Brand Equity
• Increase Production Efficiency
• Optimize Ad Spend Efficiency
• Increase Customer Engagement
Data Science Opportunities
• Campaign Optimization
• Marketing Mix Models
• Customer segmentation
• Affinity analysis
• Social media analytics
• Supply/Demand forecasting
Example Use Case: Ratings Prediction
Use Case: Increase ratings across viewer demographics
How:
• Data: Viewership, transcripts and show data combined in a big data platform
• Model: Machine learning used to identify the impact of production decisions on viewership
Insights
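As a rough illustration of "identifying the impact of production decisions on viewership", the sketch below compares the average change in viewers for show segments with and without a given production feature. It is a simple difference of means standing in for the machine learning model the slide mentions, not the team's actual method. The segment records, the feature (an on-screen argument), and the function name are all hypothetical.

```python
from statistics import mean

# Hypothetical per-segment records (assumption, not data from the talk):
# (segment contains an on-screen argument?, change in viewers during the segment)
segments = [
    (True, -1200),
    (True, -800),
    (False, 300),
    (False, -100),
    (True, -1000),
    (False, 100),
]

def average_effect(records):
    """Mean viewer change with the feature minus mean viewer change without it."""
    with_feature = [delta for flag, delta in records if flag]
    without_feature = [delta for flag, delta in records if not flag]
    return mean(with_feature) - mean(without_feature)

effect = average_effect(segments)
print(f"Estimated effect of on-screen arguments: {effect:+.0f} viewers per segment")
```

A real model would control for confounders such as time slot and show, which a raw difference of means does not.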
Models → Insights → Actions
• Models are built to answer business questions, e.g. what makes viewers tune in and tune out?
• Data Scientists interpret models for answers, e.g. on-screen arguments make viewers tune out
• Report, Dashboard, BI Tool, Email, Presentation, Cloud App → End User
A good insight drives action that will generate value for stakeholders
Revisiting Rating Prediction Use Case
Model exposed to end users via cloud application allowing what-if scenario building
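One way to picture what-if scenario building is a scoring function that an application could call with different lever settings. The sketch below assumes a linear ratings model. The baseline, the coefficient values, and the lever names are hypothetical placeholders rather than results from the talk.

```python
# Hypothetical output of a fitted ratings model (illustrative values only).
BASELINE_RATING = 2.0
COEFFICIENTS = {
    "minutes_of_argument": -0.05,
    "guest_appearances": 0.25,
}

def predict_rating(levers):
    """Score one scenario: baseline plus coefficient * value for each lever."""
    unknown = set(levers) - set(COEFFICIENTS)
    if unknown:
        raise KeyError(f"unknown levers: {sorted(unknown)}")
    return BASELINE_RATING + sum(
        COEFFICIENTS[name] * value for name, value in levers.items()
    )

# Compare the current production plan with an alternative scenario.
current = predict_rating({"minutes_of_argument": 10, "guest_appearances": 1})
what_if = predict_rating({"minutes_of_argument": 4, "guest_appearances": 2})
print(f"current={current:.2f} what_if={what_if:.2f} change={what_if - current:+.2f}")
```

Keeping the scoring function separate from the user interface means the same model can be swapped or retrained without changing the application.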
Characteristics Of Actionable Insights
Real-time
Scalable
Social
Relevant
Accessible
Open
Benefits Of Cloud Based Applications
• Service failure or data loss at scale → Resilient, scale-out messaging and processing
• Long innovation cycles → Agile development with cloud based data services
• Poor experience at scale → Low-latency, in-memory computing
Open Source Analytics Ecosystem
Media companies benefit from algorithmic breadth and scalability for building and socializing data science models
MLlib
PL/X
Algorithms Visualization
Best of breed in-memory and in-database tools for an MPP platform
Example Scalable Open Source Platform
Hadoop++: Complementing the Hadoop platform are Data Science modeling tools, such as SQL on Hadoop (e.g. HAWQ), Python/R interfaces to SQL, and Apache Spark.
http://opendataplatform.org/
Apps
Data
Analytics
Leading Media companies are moving towards a platform with Hadoop at the core.
Data Science Pipeline On Hadoop++
• Data Lake (Hadoop++): Structured + Unstructured Data
• MLlib, PL/X
Open Source Framework For Ratings Prediction
• Data Lake: contains structured + unstructured data
• MLlib, PL/X
• Insights and Model Results
• Ratings Predictions, Business Levers
• What-if Scenario Application, hosted on [logo]
Expanding The Framework To Include Impression Forecasting Modeling
• Gather video ads impression stats
• Message Broker
• Ingest → Data Lake
• MLlib, PL/X
• Simulate Ad Server Behavior
• Impression Forecasts, Business Levers
• Business Metrics Dashboard, hosted on [logo]
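The ingest-then-forecast flow on this slide can be sketched in a few lines. The example below uses an in-process `queue.Queue` as a stand-in for a real message broker and a moving average as a stand-in for the forecasting model. The message shape, the counts, and the window size are assumptions made for illustration.

```python
from collections import deque
from queue import Queue

broker = Queue()  # in-process stand-in for a real message broker

# Hypothetical hourly impression counts published by an ad-stats producer.
for count in [1000, 1200, 1100, 1300, 1250]:
    broker.put({"impressions": count})

def forecast_next(q, window=3):
    """Drain the queue and forecast the next value as the mean of the last `window` counts."""
    recent = deque(maxlen=window)  # keeps only the most recent `window` values
    while not q.empty():
        recent.append(q.get()["impressions"])
    if not recent:
        raise ValueError("no messages to forecast from")
    return sum(recent) / len(recent)

forecast = forecast_next(broker)
print(f"Forecast for next hour: {forecast:.0f} impressions")
```

In a deployed system the consumer would typically write the ingested messages to the data lake and a trained model would replace the moving average.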
Measuring Audience Engagement: Workflow
• Twitter Decahose (~55 million tweets/day)
• Source: http, Sink: hdfs
• HDFS
• External Tables (PXF)
• Parallel Parsing of JSON (PL/Python)
• Nightly Cron Jobs
• Topic Analysis through MADlib pLDA
• Unsupervised Sentiment Analysis (PL/Python)
• Hosted on [logo]
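The JSON parsing and sentiment steps of this workflow might look roughly like the plain Python below. It is a sketch only: the slide names PL/Python, where function bodies like these would be registered as database functions and applied to rows in parallel, but the registration step is omitted here. The tiny word lexicon is hypothetical, a lexicon count is just one simple approach that needs no labeled data, and the `id_str` and `text` field names are assumed for the tweet records.

```python
import json

# Tiny illustrative lexicon (assumption); a real system would use a far larger one.
POSITIVE = {"love", "great", "amazing"}
NEGATIVE = {"hate", "boring", "awful"}

def parse_tweet(raw_line):
    """Return the tweet text from one JSON line, or None if the line is unusable."""
    try:
        record = json.loads(raw_line)
    except json.JSONDecodeError:
        return None
    if not isinstance(record, dict):
        return None
    text = record.get("text")
    return text if isinstance(text, str) else None

def sentiment_score(text):
    """Count positive lexicon words minus negative lexicon words."""
    words = [w.strip(".,!?#@").lower() for w in text.split()]
    return sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)

raw_lines = [
    '{"id_str": "1", "text": "I love this show, great finale!"}',
    '{"id_str": "2", "text": "So boring. I hate the new host"}',
    "not valid json",
]
scores = [sentiment_score(t) for t in map(parse_tweet, raw_lines) if t is not None]
print(scores)
```

Returning `None` for malformed lines lets a nightly batch job skip bad records instead of failing on them.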
Key Takeaways
• Blended data sets lead to richer models and more valuable insights
• Turn Data Science models and insights into value-generating actions through data-driven applications
• Open source = power and flexibility
• Platform extensibility is key to supporting Data Science
• Turnkey PaaS is available through CloudFoundry, including infrastructure monitoring, server configuration and scalability
THANK YOU!
