SlideShare a Scribd company logo
Driving Insights for Your Digital
Business With Analytics
Srinath Perera (@srinath_perera)
VP – Research, WSO2
Member, Apache Foundation
Let’s do
“Analytics”?
Big Data Washing
• Collect anything that is easy to get
• Aggregate and Group
• Find a complex but pretty chart
• Predict something, but does not
measure it’s quality
• Claim you just got started!
Uber
• A company worth XX
• A taxi company that does not have cars or drivers
Picture by Dan Ruscoe (CC) https://www.flickr.com/photos/druscoe/8031488298
Game Changers
Role of Analytics in Digital Business
New Digital inspired
Products and Revenue
Streams
• New way to do business (e.g.
Uber, Amazon Go)
• Product as a Service (e.g. IoT
Jack hammer, Light as a service)
• Progressive Insurance Gadget
• Sell insights ( Telcos knows
where people are, credit card
companies know what people
buy and their demographics,
navigation apps know traffic)
Get Close to your
Customers
• Use analytics to optimize the
experience
• Predict issues and proactively
handle them ( e.g. reschedule
automatically when flight has
missed)
• Predict churn and act
• Track the brand and manage it
• Target your marketing
Optimizations
• Reduce Fraud
• Logistics, day to day operations
• Analytics for hiring and
Performance appraisal
• Predictive maintenance
• Sales analytics, demand
prediction
• Security and surveillance
Making this real
Conceptual Architecture
• APIs play a key role in
data collection
• Need to respond to
events as fast as
possible
• Incremental Analysis is
key
Role of Analytics in Digital Business
Only DAS, which has everything
Focus is on CEP (
siddhi), our core
differentiator and
80% streaming and
20% batch use
cases
Integrating with
Apache Spark as
oppose to
bundling it in
Data Collection
Points
1. APIs
2. Instrumentations built
into products being used
(e.g. SNMP, JMX)
3. Sensors and custom
instrumentations
4. Log analysis
5. Social networks and
other feeds
Data Collection API
 One Sensor API to publish
events
- REST, Thrift, Java, JMS,
Kafka
- Java clients, java script clients*
 First you define streams (think it
as a infinite table in SQL DB)
 Then publish events via Sensor
API
“Publish once, analyze
anyway you like”
KPIs and their Role
• KPIs (Key Performance Indicators) are numbers
that can give you an idea about performance of
something
– E.g. Countries have them ( GDP, Per Capita
Income, HDI index etc)
• Examples
– Company Revenue
– Lifetime value of a customer
– Revenue per Square foot ( in retail industry)
• Idea is to define them and monitor them. But
defining them is hard work!!
• Often one indicator tells half the story, and you
need several that cover different angles
insert overwrite table BusSpeed
select hour,
average(v) as avgV,
busID
from BusStream group by busID,
getHour(ts);
Batch Analytics
1. For simple analytics, you
can write Spark SQL
(SQL-like)
2. They operates on top of
data streams we
published
3. Run as MapReduce jobs
in Apache Spark
Picture by Dan Ruscoe (CC) https://www.flickr.com/photos/druscoe/8031488298
Lets go Beyond Batch
Incremental Analytics
• Most “Digital business” use cases
are incremental ( data keeps
coming, and results should be
updated)
• Can do just with batch, but slow
and lot of work
• DAS includes set of incremental
operators, works just with
streaming in most cases.
• Incremental ML is not included yet
Real-time: Value of some Insights degrade
Fast!
1. Stock Markets
2. Fraud
3. Surveillance
4. Patient Monitoring
5. Traffic
Real Time Analytics with CEP
Case Study: People Tracking via BLE
• Traffic Monitoring
• Smart retail
• Airport management
Track people through
• BLE via triangulation
• Higher level logic via CEP
Case Study: Realtime Soccer Analysis
Videohttps://www.youtube.com/watch?
v=nRI6buQ0NOM
Machine learning
• Given examples build a program
that matches those examples
• We call that program a “model”
• Major improvements in last few
years (e.g. deeplearning)
Can you “Write a program to drive a Car?”
Predictive Analytics
Machine Learner Wizard is
No More • Machine learner
provided wizard to
build machine
learning models
• Technology is
changing too fast to
keep building such a
Wizard
• We are dropping that
and instead support
models built with
other machine
learning tools
Using ML Models
• We support models built with
following tools
• PMML
• Spark
• We recommend PySpark as
default ( works with DAS)
• Models can be used them
with both WSO2 CEP and
ESB
• Tensorflow, H20 models are
coming ( can do already by
writing an extension)
Case Study: Predict Wait Time in the Airport
• Predicting the time to go
through airport using
location data
• Real-time updates and
events to passengers via the
App
Anomaly Detection
• Find the Odd one
out
• Anomalies by value
though “Clustering”
• Anomalies through time
using Markov Chains
• Detect Problems are
drill in to find details
• Available as a
solution White paper: Fraud Detection
and Prevention: A Data
Analytics ApproachImage "Reading" by Creative Stall (cc), Noun Project
Communicate
What is a Dashboard?
• Think a car dashboard
• It give you idea about
overall system in a glance
• It is boring when all is
good, and grab attention
when something is
wrong
• Support for drill down
and find root cause
• Starts with data in
tabular format
• Map each column
to dimension in
your plot like X,Y,
color, point size,
etc
• Create a chart with
few clicks
Powered by VizGrammer lib
that uses Vaga undneath (see
https://github.com/wso2/Vi
zGrammar)
Gadget Generation Wizard
• When data cross security
domains, there are
security and management
concerns
• APIs ( e.g. WSO2 APIM)
solve these problems
Often data are
accessed through the
network
– Mobile Apps
– Query interfaces
– Data integration
– As a Subscription
Expose data through API
Alerts
• Done through CEP queries
• Notifications ( sent via email,
SMS, Pager etc.)
• Goal is to give you peace of mind
( not having to check all the time)
• They should be specific
• They should be infrequent
• They should have very low false
positives
• Let users control sensitivity
Take the time to
Understand!!
Solutions
Photo by Tim Evanson (CC)
https://www.flickr.com/photos/timevanson/
6830726558
Role of Analytics in Digital Business
Key Differentiators
Thank You!
Questions?

More Related Content

PDF
The Rise of Streaming SQL and Evolution of Streaming Applications
PDF
SoC Keynote:The State of the Art in Integration Technology
PPTX
What Open Data and Open Source can do for Sri Lanka?
PPTX
Machine Learning in the Real World
PPTX
Mastering MapReduce: MapReduce for Big Data Management and Analysis
PDF
Real-Time Machine Learning at Industrial scale (University of Oxford, 9th Oct...
PDF
Predictive Analytics - Big Data & Artificial Intelligence
PDF
How to Apply Machine Learning with R, H20, Apache Spark MLlib or PMML to Real...
The Rise of Streaming SQL and Evolution of Streaming Applications
SoC Keynote:The State of the Art in Integration Technology
What Open Data and Open Source can do for Sri Lanka?
Machine Learning in the Real World
Mastering MapReduce: MapReduce for Big Data Management and Analysis
Real-Time Machine Learning at Industrial scale (University of Oxford, 9th Oct...
Predictive Analytics - Big Data & Artificial Intelligence
How to Apply Machine Learning with R, H20, Apache Spark MLlib or PMML to Real...

What's hot (20)

PDF
Social media analytics using Azure Technologies
PDF
Machine Learning with Big Data using Apache Spark
PDF
Intro to Machine Learning with H2O and AWS
PPTX
Predictive modelling with azure ml
PDF
Data analysis trend 2015 2016 v071
PPTX
Real-time Big Data Analytics: From Deployment to Production
PDF
Retrieving Visually-Similar Products for Shopping Recommendations using Spark...
PDF
Data scientist enablement dse 400 week 8 roadmap
PDF
Azure Machine Learning
PPTX
Big Data Science in Scala
PDF
Rakuten - Recommendation Platform
PPTX
Dealing with uncertainty in fintech using AI
PPTX
Infochimps + CloudCon: Infinite Monkey Theorem
PPTX
Fortune Teller API - Doing Data Science with Apache Spark
PPTX
Fast Data Intelligence in the IoT - real-time data analytics with Spark
PDF
Machine Data Analytics
DOCX
Tools for Unstructured Data Analytics
PPT
Agile Data Science by Russell Jurney_ The Hive_Janruary 29 2014
PDF
02 a holistic approach to big data
PDF
Floods of Twitter Data - StampedeCon 2016
Social media analytics using Azure Technologies
Machine Learning with Big Data using Apache Spark
Intro to Machine Learning with H2O and AWS
Predictive modelling with azure ml
Data analysis trend 2015 2016 v071
Real-time Big Data Analytics: From Deployment to Production
Retrieving Visually-Similar Products for Shopping Recommendations using Spark...
Data scientist enablement dse 400 week 8 roadmap
Azure Machine Learning
Big Data Science in Scala
Rakuten - Recommendation Platform
Dealing with uncertainty in fintech using AI
Infochimps + CloudCon: Infinite Monkey Theorem
Fortune Teller API - Doing Data Science with Apache Spark
Fast Data Intelligence in the IoT - real-time data analytics with Spark
Machine Data Analytics
Tools for Unstructured Data Analytics
Agile Data Science by Russell Jurney_ The Hive_Janruary 29 2014
02 a holistic approach to big data
Floods of Twitter Data - StampedeCon 2016
Ad

Viewers also liked (15)

PDF
Beyond Keyword Search with IBM Watson Explorer Webinar Deck
PDF
Why digital analytics?
PDF
10 Steps to Actionable Analytics for Digital Marketing
PPTX
Falling in love (again) with Digital Analytics in Higher Education (PSEWEB 20...
PPT
Scalable Realtime Analytics with declarative SQL like Complex Event Processin...
PPTX
Digital Marketing Overview
PDF
Earley Executive Roundtable on Data Analytics - Session 2 - Mining Business I...
PDF
Graficos
DOCX
Alisson valdez
PDF
Mercadoindustrial.es Nº 110 Febrero 2017
DOC
Electronics product design companies in bangalore
PPTX
fungal disease in fishes
PDF
FSE 2016: Mobile analytics in context. Keynote Seattle Nov 14 2016
PPTX
Analytics in Digital Industry
PDF
UX is not UI: An introduction to UX
Beyond Keyword Search with IBM Watson Explorer Webinar Deck
Why digital analytics?
10 Steps to Actionable Analytics for Digital Marketing
Falling in love (again) with Digital Analytics in Higher Education (PSEWEB 20...
Scalable Realtime Analytics with declarative SQL like Complex Event Processin...
Digital Marketing Overview
Earley Executive Roundtable on Data Analytics - Session 2 - Mining Business I...
Graficos
Alisson valdez
Mercadoindustrial.es Nº 110 Febrero 2017
Electronics product design companies in bangalore
fungal disease in fishes
FSE 2016: Mobile analytics in context. Keynote Seattle Nov 14 2016
Analytics in Digital Industry
UX is not UI: An introduction to UX
Ad

Similar to Role of Analytics in Digital Business (20)

PDF
WSO2Con USA 2017: Driving Insights for Your Digital Business With Analytics
PDF
[WSO2Con EU 2017] Deriving Insights for Your Digital Business with Analytics
PDF
WSO2Con EU 2015: An Introduction to the WSO2 Data Analytics Platform
PPTX
WSO2 Workshop Sydney 2016 - Analytics
PPTX
Partner webinar presentation aws pebble_treasure_data
PPTX
IT webinar 2016
PDF
WSO2Con ASIA 2016: An Introduction to the WSO2 Analytics Platform
PPTX
Introduction to WSO2 Analytics Platform: 2016 Q2 Update
PPTX
QWC 2014 - A picture worth 1000 words
PPTX
Unushs susus susujss. Ssuusussjjsjsit 4.pptx
PPTX
Analytics what to look for sustaining your growing business-
PPTX
Big Data Pipelines and Machine Learning at Uber
PDF
WSO2Con EU 2016: An Introduction to the WSO2 Analytics Platform
PPTX
Azure Data Explorer deep dive - review 04.2020
PPTX
Neo4j GraphTour New York_EY Presentation_Michael Moore
PPTX
Your Roadmap for An Enterprise Graph Strategy
PDF
How to build your own Delve: combining machine learning, big data and SharePoint
PDF
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
PDF
WSO2 Analytics Platform - The one stop shop for all your data needs
PDF
Summit Australia 2019 - Supercharge PowerPlatform with AI - Dipankar Bhattach...
WSO2Con USA 2017: Driving Insights for Your Digital Business With Analytics
[WSO2Con EU 2017] Deriving Insights for Your Digital Business with Analytics
WSO2Con EU 2015: An Introduction to the WSO2 Data Analytics Platform
WSO2 Workshop Sydney 2016 - Analytics
Partner webinar presentation aws pebble_treasure_data
IT webinar 2016
WSO2Con ASIA 2016: An Introduction to the WSO2 Analytics Platform
Introduction to WSO2 Analytics Platform: 2016 Q2 Update
QWC 2014 - A picture worth 1000 words
Unushs susus susujss. Ssuusussjjsjsit 4.pptx
Analytics what to look for sustaining your growing business-
Big Data Pipelines and Machine Learning at Uber
WSO2Con EU 2016: An Introduction to the WSO2 Analytics Platform
Azure Data Explorer deep dive - review 04.2020
Neo4j GraphTour New York_EY Presentation_Michael Moore
Your Roadmap for An Enterprise Graph Strategy
How to build your own Delve: combining machine learning, big data and SharePoint
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
WSO2 Analytics Platform - The one stop shop for all your data needs
Summit Australia 2019 - Supercharge PowerPlatform with AI - Dipankar Bhattach...

More from Srinath Perera (20)

PDF
Book: Software Architecture and Decision-Making
PDF
Data science Applications in the Enterprise
PDF
An Introduction to APIs
PDF
An Introduction to Blockchain for Finance Professionals
PDF
AI in the Real World: Challenges, and Risks and how to handle them?
PDF
Healthcare + AI: Use cases & Challenges
PDF
How would AI shape Future Integrations?
PDF
The Role of Blockchain in Future Integrations
PDF
Future of Serverless
PDF
Blockchain: Where are we? Where are we going?
PDF
Few thoughts about Future of Blockchain
PDF
A Visual Canvas for Judging New Technologies
PDF
Privacy in Bigdata Era
PDF
Blockchain, Impact, Challenges, and Risks
PPTX
Today's Technology and Emerging Technology Landscape
PDF
An Emerging Technologies Timeline
PDF
Analytics and AI: The Good, the Bad and the Ugly
PDF
Transforming a Business Through Analytics
PPTX
Doing Online Research
PPTX
How IOT & Big Data will shape up Future Economies?
Book: Software Architecture and Decision-Making
Data science Applications in the Enterprise
An Introduction to APIs
An Introduction to Blockchain for Finance Professionals
AI in the Real World: Challenges, and Risks and how to handle them?
Healthcare + AI: Use cases & Challenges
How would AI shape Future Integrations?
The Role of Blockchain in Future Integrations
Future of Serverless
Blockchain: Where are we? Where are we going?
Few thoughts about Future of Blockchain
A Visual Canvas for Judging New Technologies
Privacy in Bigdata Era
Blockchain, Impact, Challenges, and Risks
Today's Technology and Emerging Technology Landscape
An Emerging Technologies Timeline
Analytics and AI: The Good, the Bad and the Ugly
Transforming a Business Through Analytics
Doing Online Research
How IOT & Big Data will shape up Future Economies?

Recently uploaded (20)

PPTX
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
modul_python (1).pptx for professional and student
PDF
[EN] Industrial Machine Downtime Prediction
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
Modelling in Business Intelligence , information system
PPTX
A Complete Guide to Streamlining Business Processes
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPT
Predictive modeling basics in data cleaning process
PPTX
Managing Community Partner Relationships
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
SAP 2 completion done . PRESENTATION.pptx
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
climate analysis of Dhaka ,Banglades.pptx
modul_python (1).pptx for professional and student
[EN] Industrial Machine Downtime Prediction
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Modelling in Business Intelligence , information system
A Complete Guide to Streamlining Business Processes
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Qualitative Qantitative and Mixed Methods.pptx
Predictive modeling basics in data cleaning process
Managing Community Partner Relationships
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
SAP 2 completion done . PRESENTATION.pptx
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
Introduction-to-Cloud-ComputingFinal.pptx

Role of Analytics in Digital Business

  • 1. Driving Insights for Your Digital Business With Analytics Srinath Perera (@srinath_perera) VP – Research, WSO2 Member, Apache Foundation
  • 3. Big Data Washing • Collect anything that is easy to get • Aggregate and Group • Find a complex but pretty chart • Predict something, but does not measure it’s quality • Claim you just got started!
  • 4. Uber • A company worth XX • A taxi company that does not have cars or drivers
  • 5. Picture by Dan Ruscoe (CC) https://www.flickr.com/photos/druscoe/8031488298 Game Changers
  • 7. New Digital inspired Products and Revenue Streams • New way to do business (e.g. Uber, Amazon Go) • Product as a Service (e.g. IoT Jack hammer, Light as a service) • Progressive Insurance Gadget • Sell insights ( Telcos knows where people are, credit card companies know what people buy and their demographics, navigation apps know traffic)
  • 8. Get Close to your Customers • Use analytics to optimize the experience • Predict issues and proactively handle them ( e.g. reschedule automatically when flight has missed) • Predict churn and act • Track the brand and manage it • Target your marketing
  • 9. Optimizations • Reduce Fraud • Logistics, day to day operations • Analytics for hiring and Performance appraisal • Predictive maintenance • Sales analytics, demand prediction • Security and surveillance
  • 11. Conceptual Architecture • APIs play a key role in data collection • Need to respond to events as fast as possible • Incremental Analysis is key
  • 13. Only DAS, which has everything Focus is on CEP ( siddhi), our core differentiator and 80% streaming and 20% batch use cases Integrating with Apache Spark as oppose to bundling it in
  • 14. Data Collection Points 1. APIs 2. Instrumentations built into products being used (e.g. SNMP, JMX) 3. Sensors and custom instrumentations 4. Log analysis 5. Social networks and other feeds
  • 15. Data Collection API  One Sensor API to publish events - REST, Thrift, Java, JMS, Kafka - Java clients, java script clients*  First you define streams (think it as a infinite table in SQL DB)  Then publish events via Sensor API
  • 17. KPIs and their Role • KPIs (Key Performance Indicators) are numbers that can give you an idea about performance of something – E.g. Countries have them ( GDP, Per Capita Income, HDI index etc) • Examples – Company Revenue – Lifetime value of a customer – Revenue per Square foot ( in retail industry) • Idea is to define them and monitor them. But defining them is hard work!! • Often one indicator tells half the story, and you need several that cover different angles
  • 18. insert overwrite table BusSpeed select hour, average(v) as avgV, busID from BusStream group by busID, getHour(ts); Batch Analytics 1. For simple analytics, you can write Spark SQL (SQL-like) 2. They operates on top of data streams we published 3. Run as MapReduce jobs in Apache Spark
  • 19. Picture by Dan Ruscoe (CC) https://www.flickr.com/photos/druscoe/8031488298 Lets go Beyond Batch
  • 20. Incremental Analytics • Most “Digital business” use cases are incremental ( data keeps coming, and results should be updated) • Can do just with batch, but slow and lot of work • DAS includes set of incremental operators, works just with streaming in most cases. • Incremental ML is not included yet
  • 21. Real-time: Value of some Insights degrade Fast! 1. Stock Markets 2. Fraud 3. Surveillance 4. Patient Monitoring 5. Traffic
  • 23. Case Study: People Tracking via BLE • Traffic Monitoring • Smart retail • Airport management Track people through • BLE via triangulation • Higher level logic via CEP
  • 24. Case Study: Realtime Soccer Analysis Videohttps://www.youtube.com/watch? v=nRI6buQ0NOM
  • 25. Machine learning • Given examples build a program that matches those examples • We call that program a “model” • Major improvements in last few years (e.g. deeplearning) Can you “Write a program to drive a Car?” Predictive Analytics
  • 26. Machine Learner Wizard is No More • Machine learner provided wizard to build machine learning models • Technology is changing too fast to keep building such a Wizard • We are dropping that and instead support models built with other machine learning tools
  • 27. Using ML Models • We support models built with following tools • PMML • Spark • We recommend PySpark as default ( works with DAS) • Models can be used them with both WSO2 CEP and ESB • Tensorflow, H20 models are coming ( can do already by writing an extension)
  • 28. Case Study: Predict Wait Time in the Airport • Predicting the time to go through airport using location data • Real-time updates and events to passengers via the App
  • 29. Anomaly Detection • Find the Odd one out • Anomalies by value though “Clustering” • Anomalies through time using Markov Chains • Detect Problems are drill in to find details • Available as a solution White paper: Fraud Detection and Prevention: A Data Analytics ApproachImage "Reading" by Creative Stall (cc), Noun Project
  • 31. What is a Dashboard? • Think a car dashboard • It give you idea about overall system in a glance • It is boring when all is good, and grab attention when something is wrong • Support for drill down and find root cause
  • 32. • Starts with data in tabular format • Map each column to dimension in your plot like X,Y, color, point size, etc • Create a chart with few clicks Powered by VizGrammer lib that uses Vaga undneath (see https://github.com/wso2/Vi zGrammar) Gadget Generation Wizard
  • 33. • When data cross security domains, there are security and management concerns • APIs ( e.g. WSO2 APIM) solve these problems Often data are accessed through the network – Mobile Apps – Query interfaces – Data integration – As a Subscription Expose data through API
  • 34. Alerts • Done through CEP queries • Notifications ( sent via email, SMS, Pager etc.) • Goal is to give you peace of mind ( not having to check all the time) • They should be specific • They should be infrequent • They should have very low false positives • Let users control sensitivity
  • 35. Take the time to Understand!!
  • 37. Photo by Tim Evanson (CC) https://www.flickr.com/photos/timevanson/ 6830726558