SlideShare a Scribd company logo
Larry Querbach, Devon Energy Corporation
Transforming Devon’s Data Pipeline
with an Open Source Data Hub—
Built on Databricks
#SAISEnt3
About Devon Energy
Devon Energy is a leading
U.S. independent oil and
natural gas exploration and
production company.
• Over 3,000 employees
• $22 billion market cap
• Produces 541,000 BOE
(barrels of oil equivalent)
per day
2#SAISEnt3
Why Big Data and AI at Devon?
Growth is not linear, but exponential
3
Volume
Tow
PAS
Market Intelligence
Integrated Reservoir
Characterization
Reports
General Analysis
Text
Today’s
Challenges
SAP
Material Usage
High Complexity
Low
High
#SAISEnt3
Advanced Analytics is the Next
Data-Driven Step
Advanced analytics is the next step in
the data-driven journey, building on
the successes we have had with
analyzing our data, moving into much
more sophisticated problem solving
and prediction
Good data has enabled many creative
“tools” to be deployed to enable
decision-makers all over the company
to improve performance
Data Management at Devon has
enabled significant bottom-line
benefits and is the foundation for
data-driven decisions
4
Complexity
DATA
Easily
accessible,
consumable
ValuetoDevon
Advanced
Analytics
Clean,
Reliable
Dashboards, Mapping, Mobile
Machine Learning, Artificial Intelligence
Streaming, real-time, structured, “text” or unstructured
#SAISEnt3
Starting Point – Traditional Data Warehouse
Problem
Too Slow to Change
Too Expensive
Inconsistent User
Experience
Inconsistent Data, Delayed
Poor Access in the Field
Too Much IT Required
No Advanced Analytics/AI
What’s Working
Delivering Clean Data
Delivering Integrated Data
Connections to Systems
Based on Requirements
User Driven Analytics
5
Custom Code
Inconsistent Features
Inconsistent Experience
Poor Performance
Limited Mobility
Difficult mobile access
Only some dashboards
Not a native experience
Just mobile web browser
#SAISEnt3
Shifting the Paradigm: Batch to Streaming
The Value
Data is available in one place for both developers and users
Citizen Developers empowered with better data access and tools
Shortened development/deployment cycle
Refresh times reduced or eliminated
Deliver data at the speed of business
The Shift
Move from traditional ETL with its emphasis on batch data movement
Shift to ELT with faster replication of data from systems to the lake
Data transformations no longer single-threaded
Massively parallel processing of transformations
Incorporate streaming data into the lake
6#SAISEnt3
Speed to Market
Problem
Projects were too slow to deliver new features
Features were inconsistent across Projects
Too much time spent on fighting data quality issues
Approach
Leverage the data in the Data Warehouse, already in place
Use an integrated Cloud Platform, not just a set of development tools
Real Incremental Delivery: 1 week of Design and Build, 1 week of Testing and Deployment
Challenges
Complexity and Maturity Levels of the Technologies in Advanced Analytics
Best Data Source was Data Warehouse, temporary dependency and technical debt
Our Best Practices on design, publishing, and technical requirements not fully developed
7#SAISEnt3
Stability and Supportable Platform
Problem
No employees knew how the Complete solution really worked
Problems in the code base were difficult and long to resolve, often Duplicated between areas
Impossible to find Performance issues resulted in constant contention between support organizations
Approach
Minimize complexity by reducing the technologies used in the solution
Enlist Vendor Premier Support and Professional Field Engineers
Partner directly with a strong delivery partner with a proven track record, business acumen is key
Challenges
Adding Cloud technologies require new approaches to Troubleshooting
Deployed to production before the support team was established, Distracting the project team
8#SAISEnt3
User Experience
Problem
Users spend a lot of time in these tools and they have to be Comfortable
Critical process impact, need high levels of Adoption
Learning something new interferes with ability to Deliver Solutions
Approach
Brand the Solutions and the Projects, be clear about the value
Deliver a Modern, Sleek, and Elegant interface
Establish a contented community by leveraging, instead of fighting, Microsoft Excel
Execute with Organizational Change Management and Over Communicate
Challenges
Immature and non-integrated technologies create Experience Inconsistencies
Different Business Areas maturing the leveraging the technology at different rates
Rapidly Evolving technology changes user experience, creating confusion on which products to use
9#SAISEnt3
Data Hub Architecture
The Data Hub reinvents
our Data Warehouse and
Integration landscape.
This allows anyone to
build their own data and
analytics solutions and
share insights.
10#SAISEnt3
What can the Data Hub do?
1111
Make a prediction. Examples include: When will an asset fail? What will my operational costs be
over the next six months? Which supplier invoices are fraudulent?
Find a pattern. Examples include: What are the most common calls we receive about leases?
What are the most prevalent causes for employee dissatisfaction?
Search for data. Examples include: Where do I find data on our suppliers? How can I get sensor
data from the field?
Interact with data. Examples include: I want to combine financial and production data. I need to
filter and aggregate production data.
Mine documents. Examples include: I have handwritten log files I want to search. I have contracts
and invoices I want to put in a database to analyze.
I have data and want to:
I need help finding or processing data:
#SAISEnt3
How we leverage Databricks
Approach
Transform data to create curated Enterprise Data Services and Data Warehouse
Citizen Developers have access to the same tools as IT
Machine Learning/Deep Learning
Benefits
Reduced development cycle time for Enterprise Data Services and Data Warehouse
Citizen Developer created objects migrated, not rebuilt in an ETL tool
Analytics tool replacement/license reduction of 60%
ETL Tool replacement/license reduction of 40%
Overall cost reductions in license costs exceed $1M annually
Challenges
Migration of legacy code
Business expectations exceed our current technical ability, capacity and investment
12#SAISEnt3
Key Learnings in our Journey
Technology does not solve all of your problems
No solution replaces the need for subject matter experts
Innovation is a dirty business – prepare for the ride
Never stop innovating
Approach one domain at a time – but don’t lose sight of the Enterprise
Remember that it takes time to build trust – Don’t force automation before acceptance
Deployments are complicated, proof of concept doesn’t always transition to production
13#SAISEnt3
What is Next?
Necessary Features
User Activity Auditing and Monitoring
Learn how to Embed Data in Applications, deliver REST/OData Services for Developers
Refactoring Technical Debt
Refactor use of the technical platform components for early solutions, based on experiences
Streamline the Publishing Process with the new Publishing Model
Push more data into the Cloud
New Audiences
Beyond the initial business domains, get the entire Value Stream!
Move up to support the Executives, not just the Field Users
Address Support Organizations, not just Exploration and Production
14#SAISEnt3
Questions?
15#SAISEnt3

More Related Content

What's hot (20)

PDF
Leveraging Spark to Democratize Data for Omni-Commerce with Shafaq Abdullah
Databricks
 
PDF
Spark and Hadoop at Production Scale-(Anil Gadre, MapR)
Spark Summit
 
PPTX
Migrating Big Data Workloads to the Cloud
Robert Sanders
 
PDF
Data Driven Decisions at Scale
Databricks
 
PPTX
Cloud and Big Data trends
Sebastien Goasguen
 
PPTX
Getting It Right Exactly Once: Principles for Streaming Architectures
SingleStore
 
PDF
Building the Next-gen Digital Meter Platform for Fluvius
Databricks
 
PPTX
Zero Downtime App Deployment using Hadoop
DataWorks Summit/Hadoop Summit
 
PPTX
How DataStax Enterprise and Azure Make Your Apps Scale from Day 1
DataStax
 
PPTX
Eugene Polonichko "Architecture of modern data warehouse"
Lviv Startup Club
 
PDF
Operationalizing Machine Learning at Scale at Starbucks
Databricks
 
PPTX
CTO View: Driving the On-Demand Economy with Predictive Analytics
SingleStore
 
PDF
Unifying Streaming and Historical Telemetry Data For Real-time Performance Re...
Databricks
 
PPTX
In-Memory Computing Webcast. Market Predictions 2017
SingleStore
 
PDF
Bridging the Completeness of Big Data on Databricks
Databricks
 
PDF
Big Data in Production: Lessons from Running in the Cloud
Jen Aman
 
PDF
DataStax: Making a Difference with Smart Analytics
DataStax Academy
 
PDF
Unlocking Value in Device Data Using Spark: Spark Summit East talk by John La...
Spark Summit
 
PPTX
Webinar: BI in the Sky - The New Rules of Cloud Analytics
SnapLogic
 
PDF
Accion Labs - Big Data Services
Accion Labs, Inc.
 
Leveraging Spark to Democratize Data for Omni-Commerce with Shafaq Abdullah
Databricks
 
Spark and Hadoop at Production Scale-(Anil Gadre, MapR)
Spark Summit
 
Migrating Big Data Workloads to the Cloud
Robert Sanders
 
Data Driven Decisions at Scale
Databricks
 
Cloud and Big Data trends
Sebastien Goasguen
 
Getting It Right Exactly Once: Principles for Streaming Architectures
SingleStore
 
Building the Next-gen Digital Meter Platform for Fluvius
Databricks
 
Zero Downtime App Deployment using Hadoop
DataWorks Summit/Hadoop Summit
 
How DataStax Enterprise and Azure Make Your Apps Scale from Day 1
DataStax
 
Eugene Polonichko "Architecture of modern data warehouse"
Lviv Startup Club
 
Operationalizing Machine Learning at Scale at Starbucks
Databricks
 
CTO View: Driving the On-Demand Economy with Predictive Analytics
SingleStore
 
Unifying Streaming and Historical Telemetry Data For Real-time Performance Re...
Databricks
 
In-Memory Computing Webcast. Market Predictions 2017
SingleStore
 
Bridging the Completeness of Big Data on Databricks
Databricks
 
Big Data in Production: Lessons from Running in the Cloud
Jen Aman
 
DataStax: Making a Difference with Smart Analytics
DataStax Academy
 
Unlocking Value in Device Data Using Spark: Spark Summit East talk by John La...
Spark Summit
 
Webinar: BI in the Sky - The New Rules of Cloud Analytics
SnapLogic
 
Accion Labs - Big Data Services
Accion Labs, Inc.
 

Similar to Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Databricks with Larry Querbach (20)

PDF
Designing the Next Generation Data Lake
Robert Chong
 
PDF
The Cloud Data Lake Early Release Rukmani Gopalan
fossylenea
 
PPTX
Journey to analytics in the cloud
Saama
 
PDF
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
CCG
 
PPTX
[DSC Europe 24] Josip Saban - Buidling cloud data platforms in enterprises
DataScienceConferenc1
 
PDF
Building and running a Data and AI-driven news-media organization(c enrique-o...
Carlos Enrique Ortiz
 
PDF
How 3 trends are shaping analytics and data management
Abhishek Sood
 
PDF
How to make your data scientists happy
Hussain Sultan
 
PDF
DutchMLSchool 2022 - A Data-Driven Company
BigML, Inc
 
PPTX
Driving Network and Marketing Investments at O2 by Focusing on Improving the ...
DataWorks Summit
 
PDF
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
DATAVERSITY
 
PPTX
Unushs susus susujss. Ssuusussjjsjsit 4.pptx
AshishHiwale1
 
PPTX
Explorasi Data untuk Peluang Bisnis dan Pengembangan Karir.pptx
windu19
 
PDF
Data estate modernization feb webinar 2 18 2020
Matthew W. Bowers
 
PDF
Architecting a Platform for Enterprise Use - Strata London 2018
mark madsen
 
PDF
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
mark madsen
 
PDF
iabsg_dataroundtable
Peter Hubert
 
PDF
Gartner EA: The Rise of Data-driven Architectures
LeanIX GmbH
 
PDF
Lunch and Learn: You have the data, now what?
DiUS
 
PPTX
Open Analytics DC April 2012 Meetup
ikanow
 
Designing the Next Generation Data Lake
Robert Chong
 
The Cloud Data Lake Early Release Rukmani Gopalan
fossylenea
 
Journey to analytics in the cloud
Saama
 
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
CCG
 
[DSC Europe 24] Josip Saban - Buidling cloud data platforms in enterprises
DataScienceConferenc1
 
Building and running a Data and AI-driven news-media organization(c enrique-o...
Carlos Enrique Ortiz
 
How 3 trends are shaping analytics and data management
Abhishek Sood
 
How to make your data scientists happy
Hussain Sultan
 
DutchMLSchool 2022 - A Data-Driven Company
BigML, Inc
 
Driving Network and Marketing Investments at O2 by Focusing on Improving the ...
DataWorks Summit
 
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
DATAVERSITY
 
Unushs susus susujss. Ssuusussjjsjsit 4.pptx
AshishHiwale1
 
Explorasi Data untuk Peluang Bisnis dan Pengembangan Karir.pptx
windu19
 
Data estate modernization feb webinar 2 18 2020
Matthew W. Bowers
 
Architecting a Platform for Enterprise Use - Strata London 2018
mark madsen
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
mark madsen
 
iabsg_dataroundtable
Peter Hubert
 
Gartner EA: The Rise of Data-driven Architectures
LeanIX GmbH
 
Lunch and Learn: You have the data, now what?
DiUS
 
Open Analytics DC April 2012 Meetup
ikanow
 
Ad

More from Databricks (20)

PPTX
DW Migration Webinar-March 2022.pptx
Databricks
 
PPTX
Data Lakehouse Symposium | Day 1 | Part 1
Databricks
 
PPT
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
PPTX
Data Lakehouse Symposium | Day 2
Databricks
 
PPTX
Data Lakehouse Symposium | Day 4
Databricks
 
PDF
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Databricks
 
PDF
Democratizing Data Quality Through a Centralized Platform
Databricks
 
PDF
Learn to Use Databricks for Data Science
Databricks
 
PDF
Why APM Is Not the Same As ML Monitoring
Databricks
 
PDF
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Databricks
 
PDF
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
PDF
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
PDF
Scaling your Data Pipelines with Apache Spark on Kubernetes
Databricks
 
PDF
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Databricks
 
PDF
Sawtooth Windows for Feature Aggregations
Databricks
 
PDF
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Databricks
 
PDF
Re-imagine Data Monitoring with whylogs and Spark
Databricks
 
PDF
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
PDF
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 
PDF
Massive Data Processing in Adobe Using Delta Lake
Databricks
 
DW Migration Webinar-March 2022.pptx
Databricks
 
Data Lakehouse Symposium | Day 1 | Part 1
Databricks
 
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
Data Lakehouse Symposium | Day 2
Databricks
 
Data Lakehouse Symposium | Day 4
Databricks
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Databricks
 
Democratizing Data Quality Through a Centralized Platform
Databricks
 
Learn to Use Databricks for Data Science
Databricks
 
Why APM Is Not the Same As ML Monitoring
Databricks
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Databricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Databricks
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Databricks
 
Sawtooth Windows for Feature Aggregations
Databricks
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Databricks
 
Re-imagine Data Monitoring with whylogs and Spark
Databricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 
Massive Data Processing in Adobe Using Delta Lake
Databricks
 
Ad

Recently uploaded (20)

DOCX
Cat_Latin_America_in_World_Politics[1].docx
sales480687
 
PDF
Predicting Titanic Survival Presentation
praxyfarhana
 
PDF
11_L2_Defects_and_Trouble_Shooting_2014[1].pdf
gun3awan88
 
DOCX
Artigo - Playing to Win.planejamento docx
KellyXavier15
 
DOCX
brigada_PROGRAM_25.docx the boys white house
RonelNebrao
 
PPT
Camuflaje Tipos Características Militar 2025.ppt
e58650738
 
PPTX
Presentation by Tariq & Mohammed (1).pptx
AbooddSandoqaa
 
PPTX
english9quizw1-240228142338-e9bcf6fd.pptx
rossanthonytan130
 
PDF
624753984-Annex-A3-RPMS-Tool-for-Proficient-Teachers-SY-2024-2025.pdf
CristineGraceAcuyan
 
PDF
TCU EVALUATION FACULTY TCU Taguig City 1st Semester 2017-2018
MELJUN CORTES
 
PPTX
Indigo dyeing Presentation (2).pptx as dye
shreeroop1335
 
PPTX
Mynd company all details what they are doing a
AniketKadam40952
 
PPTX
Communication_Skills_Class10_Visual.pptx
namanrastogi70555
 
PDF
Informatics Market Insights AI Workforce.pdf
karizaroxx
 
PDF
Digital-Transformation-for-Federal-Agencies.pdf.pdf
One Federal Solution
 
PPTX
美国毕业证范本中华盛顿大学学位证书CWU学生卡购买
Taqyea
 
PPTX
Monitoring Improvement ( Pomalaa Branch).pptx
fajarkunee
 
PPTX
Model Evaluation & Visualisation part of a series of intro modules for data ...
brandonlee626749
 
PDF
CT-2-Ancient ancient accept-Criticism.pdf
DepartmentofEnglishC1
 
PPT
Reliability Monitoring of Aircrfat commerce
Rizk2
 
Cat_Latin_America_in_World_Politics[1].docx
sales480687
 
Predicting Titanic Survival Presentation
praxyfarhana
 
11_L2_Defects_and_Trouble_Shooting_2014[1].pdf
gun3awan88
 
Artigo - Playing to Win.planejamento docx
KellyXavier15
 
brigada_PROGRAM_25.docx the boys white house
RonelNebrao
 
Camuflaje Tipos Características Militar 2025.ppt
e58650738
 
Presentation by Tariq & Mohammed (1).pptx
AbooddSandoqaa
 
english9quizw1-240228142338-e9bcf6fd.pptx
rossanthonytan130
 
624753984-Annex-A3-RPMS-Tool-for-Proficient-Teachers-SY-2024-2025.pdf
CristineGraceAcuyan
 
TCU EVALUATION FACULTY TCU Taguig City 1st Semester 2017-2018
MELJUN CORTES
 
Indigo dyeing Presentation (2).pptx as dye
shreeroop1335
 
Mynd company all details what they are doing a
AniketKadam40952
 
Communication_Skills_Class10_Visual.pptx
namanrastogi70555
 
Informatics Market Insights AI Workforce.pdf
karizaroxx
 
Digital-Transformation-for-Federal-Agencies.pdf.pdf
One Federal Solution
 
美国毕业证范本中华盛顿大学学位证书CWU学生卡购买
Taqyea
 
Monitoring Improvement ( Pomalaa Branch).pptx
fajarkunee
 
Model Evaluation & Visualisation part of a series of intro modules for data ...
brandonlee626749
 
CT-2-Ancient ancient accept-Criticism.pdf
DepartmentofEnglishC1
 
Reliability Monitoring of Aircrfat commerce
Rizk2
 

Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Databricks with Larry Querbach

  • 1. Larry Querbach, Devon Energy Corporation Transforming Devon’s Data Pipeline with an Open Source Data Hub— Built on Databricks #SAISEnt3
  • 2. About Devon Energy Devon Energy is a leading U.S. independent oil and natural gas exploration and production company. • Over 3,000 employees • $22 billion market cap • Produces 541,000 BOE (barrels of oil equivalent) per day 2#SAISEnt3
  • 3. Why Big Data and AI at Devon? Growth is not linear, but exponential 3 Volume Tow PAS Market Intelligence Integrated Reservoir Characterization Reports General Analysis Text Today’s Challenges SAP Material Usage High Complexity Low High #SAISEnt3
  • 4. Advanced Analytics is the Next Data-Driven Step Advanced analytics is the next step in the data-driven journey, building on the successes we have had with analyzing our data, moving into much more sophisticated problem solving and prediction Good data has enabled many creative “tools” to be deployed to enable decision-makers all over the company to improve performance Data Management at Devon has enabled significant bottom-line benefits and is the foundation for data-driven decisions 4 Complexity DATA Easily accessible, consumable ValuetoDevon Advanced Analytics Clean, Reliable Dashboards, Mapping, Mobile Machine Learning, Artificial Intelligence Streaming, real-time, structured, “text” or unstructured #SAISEnt3
  • 5. Starting Point – Traditional Data Warehouse Problem Too Slow to Change Too Expensive Inconsistent User Experience Inconsistent Data, Delayed Poor Access in the Field Too Much IT Required No Advanced Analytics/AI What’s Working Delivering Clean Data Delivering Integrated Data Connections to Systems Based on Requirements User Driven Analytics 5 Custom Code Inconsistent Features Inconsistent Experience Poor Performance Limited Mobility Difficult mobile access Only some dashboards Not a native experience Just mobile web browser #SAISEnt3
  • 6. Shifting the Paradigm: Batch to Streaming The Value Data is available in one place for both developers and users Citizen Developers empowered with better data access and tools Shortened development/deployment cycle Refresh times reduced or eliminated Deliver data at the speed of business The Shift Move from traditional ETL with its emphasis on batch data movement Shift to ELT with faster replication of data from systems to the lake Data transformations no longer single-threaded Massively parallel processing of transformations Incorporate streaming data into the lake 6#SAISEnt3
  • 7. Speed to Market Problem Projects were too slow to deliver new features Features were inconsistent across Projects Too much time spent on fighting data quality issues Approach Leverage the data in the Data Warehouse, already in place Use an integrated Cloud Platform, not just a set of development tools Real Incremental Delivery: 1 week of Design and Build, 1 week of Testing and Deployment Challenges Complexity and Maturity Levels of the Technologies in Advanced Analytics Best Data Source was Data Warehouse, temporary dependency and technical debt Our Best Practices on design, publishing, and technical requirements not fully developed 7#SAISEnt3
  • 8. Stability and Supportable Platform Problem No employees knew how the Complete solution really worked Problems in the code base were difficult and long to resolve, often Duplicated between areas Impossible to find Performance issues resulted in constant contention between support organizations Approach Minimize complexity by reducing the technologies used in the solution Enlist Vendor Premier Support and Professional Field Engineers Partner directly with a strong delivery partner with a proven track record, business acumen is key Challenges Adding Cloud technologies require new approaches to Troubleshooting Deployed to production before the support team was established, Distracting the project team 8#SAISEnt3
  • 9. User Experience Problem Users spend a lot of time in these tools and they have to be Comfortable Critical process impact, need high levels of Adoption Learning something new interferes with ability to Deliver Solutions Approach Brand the Solutions and the Projects, be clear about the value Deliver a Modern, Sleek, and Elegant interface Establish a contented community by leveraging, instead of fighting, Microsoft Excel Execute with Organizational Change Management and Over Communicate Challenges Immature and non-integrated technologies create Experience Inconsistencies Different Business Areas maturing the leveraging the technology at different rates Rapidly Evolving technology changes user experience, creating confusion on which products to use 9#SAISEnt3
  • 10. Data Hub Architecture The Data Hub reinvents our Data Warehouse and Integration landscape. This allows anyone to build their own data and analytics solutions and share insights. 10#SAISEnt3
  • 11. What can the Data Hub do? 1111 Make a prediction. Examples include: When will an asset fail? What will my operational costs be over the next six months? Which supplier invoices are fraudulent? Find a pattern. Examples include: What are the most common calls we receive about leases? What are the most prevalent causes for employee dissatisfaction? Search for data. Examples include: Where do I find data on our suppliers? How can I get sensor data from the field? Interact with data. Examples include: I want to combine financial and production data. I need to filter and aggregate production data. Mine documents. Examples include: I have handwritten log files I want to search. I have contracts and invoices I want to put in a database to analyze. I have data and want to: I need help finding or processing data: #SAISEnt3
  • 12. How we leverage Databricks Approach Transform data to create curated Enterprise Data Services and Data Warehouse Citizen Developers have access to the same tools as IT Machine Learning/Deep Learning Benefits Reduced development cycle time for Enterprise Data Services and Data Warehouse Citizen Developer created objects migrated, not rebuilt in an ETL tool Analytics tool replacement/license reduction of 60% ETL Tool replacement/license reduction of 40% Overall cost reductions in license costs exceed $1M annually Challenges Migration of legacy code Business expectations exceed our current technical ability, capacity and investment 12#SAISEnt3
  • 13. Key Learnings in our Journey Technology does not solve all of your problems No solution replaces the need for subject matter experts Innovation is a dirty business – prepare for the ride Never stop innovating Approach one domain at a time – but don’t lose sight of the Enterprise Remember that it takes time to build trust – Don’t force automation before acceptance Deployments are complicated, proof of concept doesn’t always transition to production 13#SAISEnt3
  • 14. What is Next? Necessary Features User Activity Auditing and Monitoring Learn how to Embed Data in Applications, deliver REST/OData Services for Developers Refactoring Technical Debt Refactor use of the technical platform components for early solutions, based on experiences Streamline the Publishing Process with the new Publishing Model Push more data into the Cloud New Audiences Beyond the initial business domains, get the entire Value Stream! Move up to support the Executives, not just the Field Users Address Support Organizations, not just Exploration and Production 14#SAISEnt3