SlideShare a Scribd company logo
Simplifying AI Integration on
Spark
Hemshankar Sahu
Principal Software Engineer @ Informatica
About Speaker
Hemshankar Sahu
Principal Software Engineer @ Informatica
M. Tech. in Computer Science and Engg. From IIT Roorkee
9+ Years of Experience in IT Industry working as Full Stack Developer and ML Engineer.
Currently working on developing framework to help Integration of Machine Learning Algorithm
and Models into production system.
About Informatica
Enterprise Cloud Data Management leader
9,500+
customers
18 Trillion
cloud transactions
per month
85%
of Fortune 100
5
A Leader in Five
Gartner Magic
Quadrants
Agenda
▪ Context for the Talk
▪ Personas Involved
▪ Informatica On Spark
▪ Problem Details
▪ AI/ML Integration Problems
▪ Solution Details
▪ New Offering: AISR
▪ Simplifying AI/ML integration on Spark
▪ Demo
▪ Deploying, Integration, Auto CI-CD of AI
Solutions
▪ Summary
Context for the Talk
Personas Involved
Data Scientist vs Data Engineers: Personas involved in operationalizing the ML Algorithms
Data Scientist Data Engineer
Tasks Data Exploring, Model Building, Model Training
Data Ingestion, Data Pre-processing,
Transformation and Cleansing
Languages Python, R, Lisp SQL, Scala, Java/Python
Tools Notebook, R Studio, Matlab Spark, Data Engg. Tools (like Informatica)
Libraries Tensorflow, Keres, Pandas, Sickit Learn Hadoop, Spark
Informatica On Spark
Informatica Data Engineering Integration (DEI) Generates Spark Code
Executes On Cluster
Data Engineering Tool which uses Spark as Execution Engine
Same, familiar
Informatica design-time
Informatica Intelligent Cloud
Services
Cloud Data Integration Elastic
Enabling Spark serverless support for auto-scaling and provisioning
Auto-scaling Spark
cluster
Deployed to your
cloud network
Problem Details
AI/ML Integration Issues
Example problem use-case: Collaborating Data Engineers and Data Scientists
Informatica
DEI
Python 2.7
Python 2.7
Python 2.7
Python 3.6Python Developer
Python Developer
R Developer
Python 2.7 Python 2.7
Master
V1
V2
?
?
Spark Cluster
Issues
▪ Team Collaboration Required
▪ Data Scientist and Data Engineer invests time to
collaborate
▪ Manually Deploy the Binaries
▪ Downtime for each new version
▪ No Support for Different Runtimes
Data Science Team Data Engineering Team
V2 V2
Solution Details
New Offering: AISR
▪ Repository of AI Solutions
▪ A Solution is
▪ Code and Metadata
▪ Dependencies
▪ Runtime Details
▪ A Solution can
▪ Be in any language*
▪ With any dependency
▪ Run on GPU**
AI Solutions Repository
* Only Python supported in current release
** Provided hardware are present and drivers are installed, and solution contains the respective code
Runtimes
Tensorflow_Numpy
Sickitlearn_OpenCV
Solutions
Sentiment Analysis
AISR
Generated Code for executing from various platforms
Solution code, can be in any language
Dependencies: Files, installed software etc.
AISR
Image Processing
Image Classification
Image To Text
Example
Based on A General Solutions Repository
Solutions
Repository
CPP
Python
R
Java
DEI
Spark
REST
Java
Simplifying AI/ML integration on Spark
Example use-case solution: Collaborating Data Scientists and Data Engineers
Python 2.7
Python 2.7
Informatica
DEI
Python 3.6
Python Developer
Python Developer
R Developer
Master
V1
V2
AISR
Runtime-1
Runtime-1
Runtime-2
Runtime-3
V1
Runtime
V1
Runtime
V1
Runtime
Cluster
Benefits
▪ Minimum Collaboration
▪ Between Data Scientist and Data Engineer
▪ Auto Deploy of new Version
▪ No Downtime
▪ Multiple Versions Support
▪ Different version of same solution can be used.
▪ Support for Different Runtime
Data Science Team Data Engineering Team
V1
Runtime
V1
Runtime
Demo
Demo Use Case
Easy Collaboration, No Downtime and CI-CD
AISR DEI
Data Scientist Data Engineer
Image
Classification
Simplified Integration In Action
Runtimes
Python + TF + OpenCV
R Eco System
Solutions
Image To Text V1
AI Solutions Repo DEI
Generated Java Code for executing at spark executors
INFA wrapper and Core code, can be in any language
Dependencies: Files, installed software etc.
Object Detection V1
YARN
Spark Job Executor 1 Executor 2
Node 1
Node 2 Node 3
HDFS
CLUSTERInformatica
Data Scientist
Data Engineer
Mapping
Cached Binaries
Spark Job
Demo Recap
▪ Easily Created Solution
▪ Easily added a new AI Solution from Jupyter Notebook
▪ Explored the details of added solution
▪ Deployed and Tested
▪ Added Solution was deployed
▪ Explored various consumption options
▪ Created REST Endpoint and used it for testing
▪ Easily Integrated with Spark
▪ Created a mapping job using Informatica
▪ Created new Transformation to use the Deployed Solution
▪ Ran the mapping on Spark with selected Solution
▪ CI-CD
▪ Retrained the Solution with few clicks
▪ Used the re-trained Solution without any changes or downtime
AISR DEI
Summary
Summary
▪ Data Scientist Vs Data Engineer
▪ Collaboration is challenging and time consuming
▪ Easy Spark Job Creation using DEI
▪ Drag and Drop way of Spark Job Creation
▪ Easy Spark-AI Solution Integration using AISR
▪ Minimum Collaboration
▪ Processing happens at Spark Scale within Spark Cluster
▪ Better performance as compared to other serving platforms.
▪ Inbuilt CI-CD for AI Solutions
▪ No downtime in case Solution upgrades
▪ No changes required from Data Engineering environment
▪ AISR Framework
▪ Based on Generic Solutions Repository Implementation
▪ Partners can develop plugins to add or consume AI Solutions
▪ Overall Production Cost Reduction
Feedback
Your feedback is important to us.
Don’t forget to rate
and review the sessions.

More Related Content

What's hot (20)

PDF
NextGenML
Moldovan Radu Adrian
 
PDF
Vertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
Márton Kodok
 
PDF
Model versioning done right: A ModelDB 2.0 Walkthrough
Manasi Vartak
 
PDF
Scaling ML-Based Threat Detection For Production Cyber Attacks
Databricks
 
PDF
DevOps for Applications in Azure Databricks: Creating Continuous Integration ...
Databricks
 
PDF
MLflow with R
Databricks
 
PDF
[AI] ML Operationalization with Microsoft Azure
Korkrid Akepanidtaworn
 
PDF
Building a Streaming Data Pipeline for Trains Delays Processing
Databricks
 
PDF
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
Jan Kirenz
 
PDF
Moving a Fraud-Fighting Random Forest from scikit-learn to Spark with MLlib, ...
Databricks
 
PDF
Seamless MLOps with Seldon and MLflow
Databricks
 
PDF
Continuous Delivery of Deep Transformer-Based NLP Models Using MLflow and AWS...
Databricks
 
PDF
Using Apache Spark for Predicting Degrading and Failing Parts in Aviation
Databricks
 
PPTX
Google Vertex AI
VikasBisoi
 
PDF
AllThingsOpen 2018 - Deployment Design Patterns (Dan Zaratsian)
dtz001
 
PDF
Vertex AI: Pipelines for your MLOps workflows
Márton Kodok
 
PDF
Hamburg Data Science Meetup - MLOps with a Feature Store
Moritz Meister
 
PDF
MLOps with Kubeflow
Saurabh Kaushik
 
PPTX
DAIS Europe Nov. 2020 presentation on MLflow Model Serving
amesar0
 
PDF
Productionizing Machine Learning with Apache Spark, MLflow and ONNX from the ...
Databricks
 
Vertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
Márton Kodok
 
Model versioning done right: A ModelDB 2.0 Walkthrough
Manasi Vartak
 
Scaling ML-Based Threat Detection For Production Cyber Attacks
Databricks
 
DevOps for Applications in Azure Databricks: Creating Continuous Integration ...
Databricks
 
MLflow with R
Databricks
 
[AI] ML Operationalization with Microsoft Azure
Korkrid Akepanidtaworn
 
Building a Streaming Data Pipeline for Trains Delays Processing
Databricks
 
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
Jan Kirenz
 
Moving a Fraud-Fighting Random Forest from scikit-learn to Spark with MLlib, ...
Databricks
 
Seamless MLOps with Seldon and MLflow
Databricks
 
Continuous Delivery of Deep Transformer-Based NLP Models Using MLflow and AWS...
Databricks
 
Using Apache Spark for Predicting Degrading and Failing Parts in Aviation
Databricks
 
Google Vertex AI
VikasBisoi
 
AllThingsOpen 2018 - Deployment Design Patterns (Dan Zaratsian)
dtz001
 
Vertex AI: Pipelines for your MLOps workflows
Márton Kodok
 
Hamburg Data Science Meetup - MLOps with a Feature Store
Moritz Meister
 
MLOps with Kubeflow
Saurabh Kaushik
 
DAIS Europe Nov. 2020 presentation on MLflow Model Serving
amesar0
 
Productionizing Machine Learning with Apache Spark, MLflow and ONNX from the ...
Databricks
 

Similar to Simplifying AI integration on Apache Spark (20)

PDF
Enabling a hardware accelerated deep learning data science experience for Apa...
Indrajit Poddar
 
PPTX
Data Science and CDSW
Jason Hubbard
 
PPTX
Innovations using PowerAI
Ganesan Narayanasamy
 
PPTX
Scaling Data Science on Big Data
DataWorks Summit
 
PPTX
Artificial Intelligence and Machine Learning and Python FINAL.pptx
masoomsingh0801
 
PDF
AI Scalability for the Next Decade
Paula Koziol
 
PDF
Spark summit 2019 infrastructure for deep learning in apache spark 0425
Wee Hyong Tok
 
PPTX
Northwestern 20181004 v9
home
 
PPTX
Part 1: Introducing the Cloudera Data Science Workbench
Cloudera, Inc.
 
PPTX
Part 2: A Visual Dive into Machine Learning and Deep Learning 

Cloudera, Inc.
 
PDF
The Future of Data Science
DataWorks Summit
 
PPTX
DevOps for AI Apps
Richin Jain
 
PDF
Infrastructure for Deep Learning in Apache Spark
Databricks
 
PDF
Deep Learning Image Processing Applications in the Enterprise
Ganesan Narayanasamy
 
PDF
Ibm coe openpowerailabdubaiwithraptor
Ganesan Narayanasamy
 
PDF
An Update on Scaling Data Science Applications with SparkR in 2018 with Heiko...
Databricks
 
PPTX
Python in Artificial Intelligence and Machine Learning.pptx
chethanhk10
 
PDF
Why Hire Python Developers for AIML What Hiring Managers Need to Know.pdf
Elightwalk Technology PVT. LTD.
 
PDF
SkillsFuture Festival at NUS 2019- Artificial Intelligence for Everyone - A P...
NUS-ISS
 
PPTX
AI Artificial Intelligent-Machine Learning-Deep Learning .pptx
Heba Ali
 
Enabling a hardware accelerated deep learning data science experience for Apa...
Indrajit Poddar
 
Data Science and CDSW
Jason Hubbard
 
Innovations using PowerAI
Ganesan Narayanasamy
 
Scaling Data Science on Big Data
DataWorks Summit
 
Artificial Intelligence and Machine Learning and Python FINAL.pptx
masoomsingh0801
 
AI Scalability for the Next Decade
Paula Koziol
 
Spark summit 2019 infrastructure for deep learning in apache spark 0425
Wee Hyong Tok
 
Northwestern 20181004 v9
home
 
Part 1: Introducing the Cloudera Data Science Workbench
Cloudera, Inc.
 
Part 2: A Visual Dive into Machine Learning and Deep Learning 

Cloudera, Inc.
 
The Future of Data Science
DataWorks Summit
 
DevOps for AI Apps
Richin Jain
 
Infrastructure for Deep Learning in Apache Spark
Databricks
 
Deep Learning Image Processing Applications in the Enterprise
Ganesan Narayanasamy
 
Ibm coe openpowerailabdubaiwithraptor
Ganesan Narayanasamy
 
An Update on Scaling Data Science Applications with SparkR in 2018 with Heiko...
Databricks
 
Python in Artificial Intelligence and Machine Learning.pptx
chethanhk10
 
Why Hire Python Developers for AIML What Hiring Managers Need to Know.pdf
Elightwalk Technology PVT. LTD.
 
SkillsFuture Festival at NUS 2019- Artificial Intelligence for Everyone - A P...
NUS-ISS
 
AI Artificial Intelligent-Machine Learning-Deep Learning .pptx
Heba Ali
 
Ad

More from Databricks (20)

PPTX
DW Migration Webinar-March 2022.pptx
Databricks
 
PPTX
Data Lakehouse Symposium | Day 1 | Part 1
Databricks
 
PPT
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
PPTX
Data Lakehouse Symposium | Day 2
Databricks
 
PPTX
Data Lakehouse Symposium | Day 4
Databricks
 
PDF
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Databricks
 
PDF
Democratizing Data Quality Through a Centralized Platform
Databricks
 
PDF
Learn to Use Databricks for Data Science
Databricks
 
PDF
Why APM Is Not the Same As ML Monitoring
Databricks
 
PDF
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Databricks
 
PDF
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
PDF
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
PDF
Scaling your Data Pipelines with Apache Spark on Kubernetes
Databricks
 
PDF
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Databricks
 
PDF
Sawtooth Windows for Feature Aggregations
Databricks
 
PDF
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Databricks
 
PDF
Re-imagine Data Monitoring with whylogs and Spark
Databricks
 
PDF
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
PDF
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 
PDF
Massive Data Processing in Adobe Using Delta Lake
Databricks
 
DW Migration Webinar-March 2022.pptx
Databricks
 
Data Lakehouse Symposium | Day 1 | Part 1
Databricks
 
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
Data Lakehouse Symposium | Day 2
Databricks
 
Data Lakehouse Symposium | Day 4
Databricks
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Databricks
 
Democratizing Data Quality Through a Centralized Platform
Databricks
 
Learn to Use Databricks for Data Science
Databricks
 
Why APM Is Not the Same As ML Monitoring
Databricks
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Databricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Databricks
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Databricks
 
Sawtooth Windows for Feature Aggregations
Databricks
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Databricks
 
Re-imagine Data Monitoring with whylogs and Spark
Databricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 
Massive Data Processing in Adobe Using Delta Lake
Databricks
 
Ad

Recently uploaded (20)

PPTX
covid 19 data analysis updates in our municipality
RhuAyungon1
 
PDF
ilide.info-tg-understanding-culture-society-and-politics-pr_127f984d2904c57ec...
jed P
 
PPTX
english9quizw1-240228142338-e9bcf6fd.pptx
rossanthonytan130
 
PDF
Exploiting the Low Volatility Anomaly: A Low Beta Model Portfolio for Risk-Ad...
Bradley Norbom, CFA
 
PPTX
727325165-Unit-1-Data-Analytics-PPT-1.pptx
revathi148366
 
PDF
NVIDIA Triton Inference Server, a game-changing platform for deploying AI mod...
Tamanna36
 
PPTX
PPT2 W1L2.pptx.........................................
palicteronalyn26
 
PPTX
Indigo dyeing Presentation (2).pptx as dye
shreeroop1335
 
PDF
Orchestrating Data Workloads With Airflow.pdf
ssuserae5511
 
PPTX
Daily, Weekly, Monthly Report MTC March 2025.pptx
PanjiDewaPamungkas1
 
PPTX
Monitoring Improvement ( Pomalaa Branch).pptx
fajarkunee
 
PPTX
Model Evaluation & Visualisation part of a series of intro modules for data ...
brandonlee626749
 
PPTX
Project_Update_Summary.for the use from PM
Odysseas Lekatsas
 
PDF
IT GOVERNANCE 4-2 - Information System Security (1).pdf
mdirfanuddin1322
 
PPTX
microservices-with-container-apps-dapr.pptx
vjay22
 
PDF
Informatics Market Insights AI Workforce.pdf
karizaroxx
 
PPSX
PPT1_CB_VII_CS_Ch3_FunctionsandChartsinCalc.ppsx
animaroy81
 
PPT
intro to AI dfg fgh gggdrhre ghtwhg ewge
traineramrsiam
 
PDF
Data science AI/Ml basics to learn .pdf
deokhushi04
 
PPT
Reliability Monitoring of Aircrfat commerce
Rizk2
 
covid 19 data analysis updates in our municipality
RhuAyungon1
 
ilide.info-tg-understanding-culture-society-and-politics-pr_127f984d2904c57ec...
jed P
 
english9quizw1-240228142338-e9bcf6fd.pptx
rossanthonytan130
 
Exploiting the Low Volatility Anomaly: A Low Beta Model Portfolio for Risk-Ad...
Bradley Norbom, CFA
 
727325165-Unit-1-Data-Analytics-PPT-1.pptx
revathi148366
 
NVIDIA Triton Inference Server, a game-changing platform for deploying AI mod...
Tamanna36
 
PPT2 W1L2.pptx.........................................
palicteronalyn26
 
Indigo dyeing Presentation (2).pptx as dye
shreeroop1335
 
Orchestrating Data Workloads With Airflow.pdf
ssuserae5511
 
Daily, Weekly, Monthly Report MTC March 2025.pptx
PanjiDewaPamungkas1
 
Monitoring Improvement ( Pomalaa Branch).pptx
fajarkunee
 
Model Evaluation & Visualisation part of a series of intro modules for data ...
brandonlee626749
 
Project_Update_Summary.for the use from PM
Odysseas Lekatsas
 
IT GOVERNANCE 4-2 - Information System Security (1).pdf
mdirfanuddin1322
 
microservices-with-container-apps-dapr.pptx
vjay22
 
Informatics Market Insights AI Workforce.pdf
karizaroxx
 
PPT1_CB_VII_CS_Ch3_FunctionsandChartsinCalc.ppsx
animaroy81
 
intro to AI dfg fgh gggdrhre ghtwhg ewge
traineramrsiam
 
Data science AI/Ml basics to learn .pdf
deokhushi04
 
Reliability Monitoring of Aircrfat commerce
Rizk2
 

Simplifying AI integration on Apache Spark

  • 1. Simplifying AI Integration on Spark Hemshankar Sahu Principal Software Engineer @ Informatica
  • 2. About Speaker Hemshankar Sahu Principal Software Engineer @ Informatica M. Tech. in Computer Science and Engg. From IIT Roorkee 9+ Years of Experience in IT Industry working as Full Stack Developer and ML Engineer. Currently working on developing framework to help Integration of Machine Learning Algorithm and Models into production system.
  • 3. About Informatica Enterprise Cloud Data Management leader 9,500+ customers 18 Trillion cloud transactions per month 85% of Fortune 100 5 A Leader in Five Gartner Magic Quadrants
  • 4. Agenda ▪ Context for the Talk ▪ Personas Involved ▪ Informatica On Spark ▪ Problem Details ▪ AI/ML Integration Problems ▪ Solution Details ▪ New Offering: AISR ▪ Simplifying AI/ML integration on Spark ▪ Demo ▪ Deploying, Integration, Auto CI-CD of AI Solutions ▪ Summary
  • 6. Personas Involved Data Scientist vs Data Engineers: Personas involved in operationalizing the ML Algorithms Data Scientist Data Engineer Tasks Data Exploring, Model Building, Model Training Data Ingestion, Data Pre-processing, Transformation and Cleansing Languages Python, R, Lisp SQL, Scala, Java/Python Tools Notebook, R Studio, Matlab Spark, Data Engg. Tools (like Informatica) Libraries Tensorflow, Keres, Pandas, Sickit Learn Hadoop, Spark
  • 7. Informatica On Spark Informatica Data Engineering Integration (DEI) Generates Spark Code Executes On Cluster Data Engineering Tool which uses Spark as Execution Engine
  • 8. Same, familiar Informatica design-time Informatica Intelligent Cloud Services Cloud Data Integration Elastic Enabling Spark serverless support for auto-scaling and provisioning Auto-scaling Spark cluster Deployed to your cloud network
  • 10. AI/ML Integration Issues Example problem use-case: Collaborating Data Engineers and Data Scientists Informatica DEI Python 2.7 Python 2.7 Python 2.7 Python 3.6Python Developer Python Developer R Developer Python 2.7 Python 2.7 Master V1 V2 ? ? Spark Cluster Issues ▪ Team Collaboration Required ▪ Data Scientist and Data Engineer invests time to collaborate ▪ Manually Deploy the Binaries ▪ Downtime for each new version ▪ No Support for Different Runtimes Data Science Team Data Engineering Team V2 V2
  • 12. New Offering: AISR ▪ Repository of AI Solutions ▪ A Solution is ▪ Code and Metadata ▪ Dependencies ▪ Runtime Details ▪ A Solution can ▪ Be in any language* ▪ With any dependency ▪ Run on GPU** AI Solutions Repository * Only Python supported in current release ** Provided hardware are present and drivers are installed, and solution contains the respective code Runtimes Tensorflow_Numpy Sickitlearn_OpenCV Solutions Sentiment Analysis AISR Generated Code for executing from various platforms Solution code, can be in any language Dependencies: Files, installed software etc. AISR Image Processing Image Classification Image To Text Example Based on A General Solutions Repository Solutions Repository CPP Python R Java DEI Spark REST Java
  • 13. Simplifying AI/ML integration on Spark Example use-case solution: Collaborating Data Scientists and Data Engineers Python 2.7 Python 2.7 Informatica DEI Python 3.6 Python Developer Python Developer R Developer Master V1 V2 AISR Runtime-1 Runtime-1 Runtime-2 Runtime-3 V1 Runtime V1 Runtime V1 Runtime Cluster Benefits ▪ Minimum Collaboration ▪ Between Data Scientist and Data Engineer ▪ Auto Deploy of new Version ▪ No Downtime ▪ Multiple Versions Support ▪ Different version of same solution can be used. ▪ Support for Different Runtime Data Science Team Data Engineering Team V1 Runtime V1 Runtime
  • 14. Demo
  • 15. Demo Use Case Easy Collaboration, No Downtime and CI-CD AISR DEI Data Scientist Data Engineer Image Classification
  • 16. Simplified Integration In Action Runtimes Python + TF + OpenCV R Eco System Solutions Image To Text V1 AI Solutions Repo DEI Generated Java Code for executing at spark executors INFA wrapper and Core code, can be in any language Dependencies: Files, installed software etc. Object Detection V1 YARN Spark Job Executor 1 Executor 2 Node 1 Node 2 Node 3 HDFS CLUSTERInformatica Data Scientist Data Engineer Mapping Cached Binaries Spark Job
  • 17. Demo Recap ▪ Easily Created Solution ▪ Easily added a new AI Solution from Jupyter Notebook ▪ Explored the details of added solution ▪ Deployed and Tested ▪ Added Solution was deployed ▪ Explored various consumption options ▪ Created REST Endpoint and used it for testing ▪ Easily Integrated with Spark ▪ Created a mapping job using Informatica ▪ Created new Transformation to use the Deployed Solution ▪ Ran the mapping on Spark with selected Solution ▪ CI-CD ▪ Retrained the Solution with few clicks ▪ Used the re-trained Solution without any changes or downtime AISR DEI
  • 19. Summary ▪ Data Scientist Vs Data Engineer ▪ Collaboration is challenging and time consuming ▪ Easy Spark Job Creation using DEI ▪ Drag and Drop way of Spark Job Creation ▪ Easy Spark-AI Solution Integration using AISR ▪ Minimum Collaboration ▪ Processing happens at Spark Scale within Spark Cluster ▪ Better performance as compared to other serving platforms. ▪ Inbuilt CI-CD for AI Solutions ▪ No downtime in case Solution upgrades ▪ No changes required from Data Engineering environment ▪ AISR Framework ▪ Based on Generic Solutions Repository Implementation ▪ Partners can develop plugins to add or consume AI Solutions ▪ Overall Production Cost Reduction
  • 20. Feedback Your feedback is important to us. Don’t forget to rate and review the sessions.