SlideShare a Scribd company logo
Event Processing for better
(Big) Data
Vinod Vydier
Middleware Specialist @ Oracle
Agenda
§  Why use event processing?
§  Event Processing Applications
§  Technical Architecture
§  Use of In-Memory data-grid
§  Use cases
Challenges Working with Big Data
• Storing Data has becoming cheap, however the
storage is not infinite and has to be managed to make
use of the data effectively.
• Hadoop has inherent latency for responding to real
time events (which can produce high volume data at
high velocity) and typically involves real responses.
• Event Processing helps in getting clean data with
context and less redundancy into HDFS, so the Hadoop
jobs can be more effective.
• Event Processing helps in responding back in real
time, and storing the data in HDFS for better historical
analysis.
Why use Event Processing Infrastructure
Application has any one or more of the following
conditions:
§  Requires high throughput and low latency
processing.
§  Has continuously streaming data.
§  Real-time correlation between multiple incoming data
sources.
§  Time-sensitive alerts, aggregations and calculations.
§  Needs to look for patterns in the data stream.
§  Data does not need to be stored, if there is nothing
of interest in it.
§  Problem is more easily solved by analyzing before
storing in HDFS.
Filtering, Real-time Intelligence for Big Data
VOLUME VELOCITY VARIETY VALUE
SOCIAL
BLOG
SMART
METER
101100101001
001001101010
101011100101
010100100101
FAST DATA
Event Processing Intelligence
GREATER
Stay ahead of Big Data
Filter out,
correlate
Move time-critical analysis to front of process
• Filter out noise (example: data ticks with no
changes), add context (by correlating multiple
sources), increase relevance.
• Identify certain critical conditions as you insert data
into the warehouse.
Getting ahead of the curve: Fast Data
Big Data
minutesms
Fast Data
Historicaldepth:deep
Historicaldepth:shallow
Example:
analysis of traffic
patterns and
congestion times for
urban planning
Example:
monitoring of traffic
cameras to ensure given
license plates are not in
use on multiple vehicles
Add “depth” to your fast data by
merging output of MapReduce to
stream processing
Adapter
Adapter
Processor
Adapter HDFS
Data Source
Queries
<<Source>>
<<Source>>
<<Sink>>
Service1 Service2
Export Import
Event Processing Network (EPN)
Event Processing Application
Queries
Channel
Channel
Channel
Channel
What is an Event Processing application
Data Source
Event Processing inputs
Ø  Streams
Ø  Continuous input, often in high-
volume
Ø  Time ordered
Ø  Does not end
Ø  Impossible to process / analyze in
real-time with traditional relational
database systems
Example: Raw Sensor Event
streams, GPS, Market Data Feeds
BA BOEING D 77.575 800 20080305 10:03:02:78
DO DUPOD
NT
D 41.575 3000 20080305 10:03:04:12
AA ALCOA INC D 20.125 1000 20080305 10:03:01:55
AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10
BA BOEING D 77.575 800 20080305 10:03:02:78
C CITIGROUP D 34.125 2000 20080305 10:03:03:05
CAT CATERPILLAR D 22.5 600 20080305 10:03:03:46
DO DUPONT D 41.575 3000 20080305 10:03:04:12
AA ALCOA INC D 20.125 1000 20080305 10:03:01:55
AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10
BA BOEING D 77.575 800 20080305 10:03:02:78
C CITIGROUP D 34.125 2000 20080305 10:03:03:05
CAT CATERPILLAR D 22.5 600 20080305 10:03:03:46
DO DUPONT D 41.575 3000 20080305 10:03:04:12
AA ALCOA INC D 20.125 1000 20080305 10:03:01:55
AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10
BA BOEING D 77.575 800 20080305 10:03:02:78
C CITIGROUP D 34.125 2000 20080305 10:03:03:05
CAT CATERPILLAR D 22.5 600 20080305 10:03:03:46
DO DUPONT D 41.575 3000 20080305 10:03:04:12
AA ALCOA INC D 20.125 1000 20080305 10:03:01:55
AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10
BA BOEING D 77.575 800 20080305 10:03:02:78
Event Processing provides a new data
management infrastructure to support and
analyze Streams in real-time
BA BOEING D 77.575 41.575
800
20080305 10:03:02:78
DO DUPONT D 41.575 3000 20080305 10:03:04:12
BA BOEING D 77.575 800 20080305 10:03:02:78
C CITIGROUP D 34.125 2000 20080305 10:03:03:05
BA BOEING D 77.575 800 20080305 10:03:02:78
Filtering
Ø  New stream filtered for specific criteria,
e.g. stock price > $22
Ø  Correlation & Aggregation
Ø  Scrolling, time-based window metrics,
e.g. average # of stock trades in the
last hour
Ø  Pattern Matching
Ø  Notification of detected event patterns,
e.g. price changes A, B and C occurred
within 15 minute window
CAT CATERPILLAR D 22.5 600 20080305 10:03:03:46
DO DUPONT D 41.575 3000 20080305 10:03:04:12
AA ALCOA INC D 20.125 1000 20080305 10:03:01:55
AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10
BA BOEING D 77.575 800 20080305 10:03:02:78
……
• Event Processing done in-Memory (not in Database)
• Logic is defined through Continuous Queries on the data
CAT CATERPILLAR D 22.5 600 20080305 10:03:03:46
DO DUPONT D 41.575 3000 20080305 10:03:04:12
AA ALCOA INC D 20.125 1000 20080305 10:03:01:55
AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10
BA BOEING D 77.575 800 20080305 10:03:02:78
CAT CATERPILLAR D 22.5 600 20080305 10:03:03:46
DO DUPONT D 41.575 3000 20080305 10:03:04:12
AA ALCOA INC D 20.125 1000 20080305 10:03:01:55
AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10
BA BOEING D 77.575 800 20080305 10:03:02:78
BA BOEING D 77.575 41.575
800
20080305 10:03:02:78
DO DUPONT D 41.575 3000 20080305 10:03:04:12
COMPLEX QUERIES
Event Processing outputs
Data crunching for Event Processing done in a
in-memory data grid
•  High throughput for storing data
•  Aggregation and event querying
•  Pattern implementation flexibility combining complementary
technologies
•  Handle and correlate events in real time, including support for
multiple patterns:
•  Pre processing (buffer inputs)
•  In Event Processing (to cache reference data)
•  Post Processing (to expose processed events to consuming
apps)
Data Grid
Event Processing
Consolidat
ed & in-
context
Data
Filtered/
Aggregat
ed Data
HDFS and
traditional storage
In-memory events on the data stream
n  Threshold Management
n  Detecting threshold conditions across multiple
event streams
n  Using cache to:
n  Allow dynamic configuration of thresholds
n  Add (via join) contextual data to support
aggregation
n  Using pattern matching to find sustained
conditions
n  Alert Generation
n  Using relations to represent state and state
transitions
n  Using “missing event” patterns to monitor
expected response(s)
n  Alarm Management
n  Using pattern matching to remove extraneous
alarm events
n  e.g. power off alarm preceded by tamper alarm
within (n) minutes
X
Alarm Filtering Scenario
Discard Power Off Alarm if there was a Tamper
Alarm for the same meter within the previous 5
seconds
Visualizing events on the data stream
JMS
Resource Locations
Matches and Alerts
SQL
Event Processing Application
JMS
Geo-Fencing Definitions
SQL
MapViewer
Manager
JMS Protocol Integration
n Common integration touch point with Service Bus
n Business Activity Monitoring integration
HTTP Publish/Subscribe
n Support pub/sub events between Event server and
web clients.
n Clients don’t need to poll for updates (unlike
traditional HTTP).
n Clients subscribe to and publish to event channels.
n Bayeux protocol
n Light weight and the payload is JSON
Visual/SOA integration with Event Processing
Event Processing High Level Architecture
JSON
Adapter CacheProcessor POJO
EPN (Event Processing Network) Elements
HTTP Pub/S
Query Plan and Real Time Monitoring
Event Driven SOA: Simplify Business
Complexity
•  Real-time business insight
•  Preempt and react instantaneously to Enterprise, Environmental and Global
Business conditions
•  Gain business insight using previously untapped, raw event sources
•  Hot-pluggable integration
•  Transparent SOA infrastructure interoperability
•  Distributed, deployment ready, pre-integrated, in-memory Data Grid,
and Java low latency determinism.
•  Lightweight high performance Java Event Server platform
•  Real-time business friendly analyst oriented
visualization layers
•  Powerful, extensible Event Processing Analysis abstraction
•  Business user dashboards
•  Business user domain specific natural language layers
•  Real-time predictive analytics
Event Processing use cases in different
industries
1.  Customer Experience
2.  Transportation, Logistics & Fleet Management
3.  Utilities: Demand & Response, Smart Meter
4.  Public Sector: Emergency Response,
Intelligence
5.  Telcos: Real Time billing & WiFi offloading,
Mobile billboard
Customer Experience
n  Industry focus on new buzzword: Customer
Experience
n  Desire to harness potential of social networks for
better targeted marketing
Event Processing can help with:
n  Monitoring in real-time customer activity (social
networks, location (e.g. proximity to stores, etc) and
identifying opportunities in real-time
n  Correlating with existing information (customer/
shopping profiles, etc.)
n  Generating real-time alerts
Transportation, Logistics and Fleet Management
n  Constant industry pressure for greater
efficiency
n  Need to differentiate through premium
services and greater reliability and visibility
n  Availability of cheap wireless sensors
(temperature, GPS, etc.) that can be included
in packages/containers/trucks
Event Processing can help with:
n  Real-time monitoring of inflow of data from
sensors
n  Trends detection / prediction (to rise, etc.)
n  Leveraging spatial/geo-location capabilities.
Utilities
n  Adoption of Smart Meters: concerns about bandwidth/ processing
power required to handle the information they generate, desire to offer
value-add services
n  Ever increasing electricity demand
n  Demand for real-time billing & analytics
n  Greater customer expectations re: outage & response times
n  Regulations
Event Processing can help with:
n  Alerting of consumption trends in real-time, enabling “Demand/
Response”
n  Real-time detection of problems (abnormal spikes in consumption
indicative of leaks, etc.)
n  Filtering out redundant or nested (ex: tree fell on the line) outage
errors and problems
n  Tracking of resources and personnel
Telco
n  Overloaded data networks and new strategies to offload traffic:
real-time billing vs. unlimited, offloading to WiFi, degradation of
service from 4G to 3G, etc.
n  GPS-enabled phones offer new location-based marketing
opportunities: “mobile billboards”
How can Event Processing help:
n  Event Processing infrastructure can handle massive amounts
of data generated by mobile devices, filter out, correlate and
aggregate in real-time to only retain valuable information
n  Event Processing can plug into all types of feeds, from devices
to social networks
n  Event Processing can be integrated with spatial and geo-
location technology to send location specific data to the user.
Public Sector
n  Heightened security requirements
n  Ever increasing population in urban areas drives optimization
requirements
n  Increasing number of real-time data: video feeds, GPS data,
traffic data, etc.
n  Applications: Security Intelligence, geo-fencing, “Smart
Cities”, traffic control, gateless tolls
How Event Processing can help:
n  Event Processing can be integrated with spatial and geo-
location technology to track location specific data with a user.
n  Event Processing can plug in any data feed such as video /
face recognition
n  Event Processing meets performance & availability
requirements in this space
Thanks for attending!!

More Related Content

PPTX
Big Data, Analytics and Real Time Event Processing
PDF
WSO2 Big Data Analytics Platform
PDF
TripleLift: Preparing for a New Programmatic Ad-Tech World
PDF
VoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big Data
PPTX
ironSource Atom BigData Berlin
PDF
Building the Next-gen Digital Meter Platform for Fluvius
PDF
ChakraView – A 360° Approach to Data Quality
PPT
Billions of Rows, Millions of Insights, Right Now
Big Data, Analytics and Real Time Event Processing
WSO2 Big Data Analytics Platform
TripleLift: Preparing for a New Programmatic Ad-Tech World
VoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big Data
ironSource Atom BigData Berlin
Building the Next-gen Digital Meter Platform for Fluvius
ChakraView – A 360° Approach to Data Quality
Billions of Rows, Millions of Insights, Right Now

What's hot (20)

PPTX
Five ways database modernization simplifies your data life
PDF
Lambda-B-Gone: In-memory Case Study for Faster, Smarter and Simpler Answers
PPTX
HP Discover: Real Time Insights from Big Data
PPTX
DataStax Enterprise in Practice (Field Notes)
PPTX
Intuit Analytics Cloud 101
PPTX
In-Memory Computing Webcast. Market Predictions 2017
PDF
Transforming Your Business with Fast Data – Five Use Case Examples
PPTX
Our journey with druid - from initial research to full production scale
PDF
Migration and Coexistence between Relational and NoSQL Databases by Manuel H...
PDF
VP of WW Partners by Alan Chhabra
PDF
Big data meetup budapest adding data schemas to snowplow
PDF
Data Strategies for Managing the Cycles in Oil and Gas
PDF
How to Build Fast Data Applications: Evaluating the Top Contenders
PDF
Advanced data science algorithms applied to scalable stream processing by Dav...
PDF
Memory Database Technology is Driving a New Cycle of Business Innovation
PPTX
Netflix Data Engineering @ Uber Engineering Meetup
PDF
Converging Database Transactions and Analytics
PPTX
Building the Foundation for a Latency-Free Life
PDF
Architecting for Real-Time Big Data Analytics
PDF
Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Five ways database modernization simplifies your data life
Lambda-B-Gone: In-memory Case Study for Faster, Smarter and Simpler Answers
HP Discover: Real Time Insights from Big Data
DataStax Enterprise in Practice (Field Notes)
Intuit Analytics Cloud 101
In-Memory Computing Webcast. Market Predictions 2017
Transforming Your Business with Fast Data – Five Use Case Examples
Our journey with druid - from initial research to full production scale
Migration and Coexistence between Relational and NoSQL Databases by Manuel H...
VP of WW Partners by Alan Chhabra
Big data meetup budapest adding data schemas to snowplow
Data Strategies for Managing the Cycles in Oil and Gas
How to Build Fast Data Applications: Evaluating the Top Contenders
Advanced data science algorithms applied to scalable stream processing by Dav...
Memory Database Technology is Driving a New Cycle of Business Innovation
Netflix Data Engineering @ Uber Engineering Meetup
Converging Database Transactions and Analytics
Building the Foundation for a Latency-Free Life
Architecting for Real-Time Big Data Analytics
Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Ad

Viewers also liked (20)

PPTX
Big Data and Data Science: The Technologies Shaping Our Lives
PDF
Learning Rule Based Programming using Games @DecisionCamp 2016
PDF
The Future of Work
PDF
Drools Happenings 7.0 - Devnation 2016
PDF
Real-Time Analytics and Visualization of Streaming Big Data with JReport & Sc...
PDF
Intelligent Business Processes
PDF
Installing Complex Event Processing On Linux
PPTX
Reactconf 2014 - Event Stream Processing
PDF
Comparative Analysis of Personal Firewalls
PPTX
Access control attacks by nor liyana binti azman
PPTX
Session hijacking
PPT
Debs 2011 tutorial on non functional properties of event processing
PDF
Tutorial in DEBS 2008 - Event Processing Patterns
PDF
Ceh v8 labs module 03 scanning networks
PPT
Complex Event Processing with Esper and WSO2 ESB
PPT
Chapter 12
PPSX
CyberLab CCEH Session - 3 Scanning Networks
PDF
Nmap scripting engine
PPT
Debs2009 Event Processing Languages Tutorial
PDF
Analizadores de Protocolos
Big Data and Data Science: The Technologies Shaping Our Lives
Learning Rule Based Programming using Games @DecisionCamp 2016
The Future of Work
Drools Happenings 7.0 - Devnation 2016
Real-Time Analytics and Visualization of Streaming Big Data with JReport & Sc...
Intelligent Business Processes
Installing Complex Event Processing On Linux
Reactconf 2014 - Event Stream Processing
Comparative Analysis of Personal Firewalls
Access control attacks by nor liyana binti azman
Session hijacking
Debs 2011 tutorial on non functional properties of event processing
Tutorial in DEBS 2008 - Event Processing Patterns
Ceh v8 labs module 03 scanning networks
Complex Event Processing with Esper and WSO2 ESB
Chapter 12
CyberLab CCEH Session - 3 Scanning Networks
Nmap scripting engine
Debs2009 Event Processing Languages Tutorial
Analizadores de Protocolos
Ad

Similar to Real Time Event Processing and In-­memory analysis of Big Data - StampedeCon 2013 (20)

PDF
Speeding up big data with event processing
PDF
Soa12c launch 5 event processing shmakov eng cr
PDF
Event Stream Processing SAP
PDF
Introduction to Streaming Analytics
PDF
WSO2 Complex Event Processor - Product Overview
PDF
ACM DEBS 2015: Realtime Streaming Analytics Patterns
PDF
DEBS 2015 Tutorial : Patterns for Realtime Streaming Analytics
PDF
Put Events to Work and Respond in Real Time
PDF
Citi Tech Talk: Event Driven Kafka Microservices
PDF
Introduction to Stream Processing
PDF
Data Ingestion in Big Data and IoT platforms
PDF
Stream Processing and Complex Event Processing together with Kafka, Flink and...
PPTX
WebAction-Sami Abkay
ODP
Batch processing in EDA (Event Driven Architectures)
PPTX
Siddhi: A Second Look at Complex Event Processing Implementations
PDF
Complex event processing platform handling millions of users - Krzysztof Zarz...
PDF
Shared time-series-analysis-using-an-event-streaming-platform -_v2
PPTX
Complex Event Prosessing
PPT
Processing Patterns for Predictive Business
PPT
Open Source Event Processing for Sensor Fusion Applications
Speeding up big data with event processing
Soa12c launch 5 event processing shmakov eng cr
Event Stream Processing SAP
Introduction to Streaming Analytics
WSO2 Complex Event Processor - Product Overview
ACM DEBS 2015: Realtime Streaming Analytics Patterns
DEBS 2015 Tutorial : Patterns for Realtime Streaming Analytics
Put Events to Work and Respond in Real Time
Citi Tech Talk: Event Driven Kafka Microservices
Introduction to Stream Processing
Data Ingestion in Big Data and IoT platforms
Stream Processing and Complex Event Processing together with Kafka, Flink and...
WebAction-Sami Abkay
Batch processing in EDA (Event Driven Architectures)
Siddhi: A Second Look at Complex Event Processing Implementations
Complex event processing platform handling millions of users - Krzysztof Zarz...
Shared time-series-analysis-using-an-event-streaming-platform -_v2
Complex Event Prosessing
Processing Patterns for Predictive Business
Open Source Event Processing for Sensor Fusion Applications

More from StampedeCon (20)

PDF
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
PDF
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
PDF
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
PDF
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
PDF
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
PDF
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
PDF
Foundations of Machine Learning - StampedeCon AI Summit 2017
PDF
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
PDF
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
PDF
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
PDF
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
PDF
A Different Data Science Approach - StampedeCon AI Summit 2017
PDF
Graph in Customer 360 - StampedeCon Big Data Conference 2017
PDF
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
PDF
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
PDF
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
PDF
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
PDF
Innovation in the Data Warehouse - StampedeCon 2016
PPTX
Creating a Data Driven Organization - StampedeCon 2016
PPTX
Using The Internet of Things for Population Health Management - StampedeCon 2016
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Foundations of Machine Learning - StampedeCon AI Summit 2017
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
A Different Data Science Approach - StampedeCon AI Summit 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
Innovation in the Data Warehouse - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016
Using The Internet of Things for Population Health Management - StampedeCon 2016

Recently uploaded (20)

PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Mushroom cultivation and it's methods.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
cloud_computing_Infrastucture_as_cloud_p
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
Machine Learning_overview_presentation.pptx
PPTX
OMC Textile Division Presentation 2021.pptx
PPTX
A Presentation on Artificial Intelligence
PDF
Approach and Philosophy of On baking technology
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
1. Introduction to Computer Programming.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Mushroom cultivation and it's methods.pdf
Empathic Computing: Creating Shared Understanding
Spectral efficient network and resource selection model in 5G networks
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Assigned Numbers - 2025 - Bluetooth® Document
cloud_computing_Infrastucture_as_cloud_p
A comparative study of natural language inference in Swahili using monolingua...
Programs and apps: productivity, graphics, security and other tools
SOPHOS-XG Firewall Administrator PPT.pptx
Unlocking AI with Model Context Protocol (MCP)
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Machine Learning_overview_presentation.pptx
OMC Textile Division Presentation 2021.pptx
A Presentation on Artificial Intelligence
Approach and Philosophy of On baking technology
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
1. Introduction to Computer Programming.pptx

Real Time Event Processing and In-­memory analysis of Big Data - StampedeCon 2013

  • 1. Event Processing for better (Big) Data Vinod Vydier Middleware Specialist @ Oracle
  • 2. Agenda §  Why use event processing? §  Event Processing Applications §  Technical Architecture §  Use of In-Memory data-grid §  Use cases
  • 3. Challenges Working with Big Data • Storing Data has becoming cheap, however the storage is not infinite and has to be managed to make use of the data effectively. • Hadoop has inherent latency for responding to real time events (which can produce high volume data at high velocity) and typically involves real responses. • Event Processing helps in getting clean data with context and less redundancy into HDFS, so the Hadoop jobs can be more effective. • Event Processing helps in responding back in real time, and storing the data in HDFS for better historical analysis.
  • 4. Why use Event Processing Infrastructure Application has any one or more of the following conditions: §  Requires high throughput and low latency processing. §  Has continuously streaming data. §  Real-time correlation between multiple incoming data sources. §  Time-sensitive alerts, aggregations and calculations. §  Needs to look for patterns in the data stream. §  Data does not need to be stored, if there is nothing of interest in it. §  Problem is more easily solved by analyzing before storing in HDFS.
  • 5. Filtering, Real-time Intelligence for Big Data VOLUME VELOCITY VARIETY VALUE SOCIAL BLOG SMART METER 101100101001 001001101010 101011100101 010100100101 FAST DATA Event Processing Intelligence GREATER
  • 6. Stay ahead of Big Data Filter out, correlate Move time-critical analysis to front of process • Filter out noise (example: data ticks with no changes), add context (by correlating multiple sources), increase relevance. • Identify certain critical conditions as you insert data into the warehouse.
  • 7. Getting ahead of the curve: Fast Data Big Data minutesms Fast Data Historicaldepth:deep Historicaldepth:shallow Example: analysis of traffic patterns and congestion times for urban planning Example: monitoring of traffic cameras to ensure given license plates are not in use on multiple vehicles Add “depth” to your fast data by merging output of MapReduce to stream processing
  • 8. Adapter Adapter Processor Adapter HDFS Data Source Queries <<Source>> <<Source>> <<Sink>> Service1 Service2 Export Import Event Processing Network (EPN) Event Processing Application Queries Channel Channel Channel Channel What is an Event Processing application Data Source
  • 9. Event Processing inputs Ø  Streams Ø  Continuous input, often in high- volume Ø  Time ordered Ø  Does not end Ø  Impossible to process / analyze in real-time with traditional relational database systems Example: Raw Sensor Event streams, GPS, Market Data Feeds BA BOEING D 77.575 800 20080305 10:03:02:78 DO DUPOD NT D 41.575 3000 20080305 10:03:04:12 AA ALCOA INC D 20.125 1000 20080305 10:03:01:55 AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10 BA BOEING D 77.575 800 20080305 10:03:02:78 C CITIGROUP D 34.125 2000 20080305 10:03:03:05 CAT CATERPILLAR D 22.5 600 20080305 10:03:03:46 DO DUPONT D 41.575 3000 20080305 10:03:04:12 AA ALCOA INC D 20.125 1000 20080305 10:03:01:55 AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10 BA BOEING D 77.575 800 20080305 10:03:02:78 C CITIGROUP D 34.125 2000 20080305 10:03:03:05 CAT CATERPILLAR D 22.5 600 20080305 10:03:03:46 DO DUPONT D 41.575 3000 20080305 10:03:04:12 AA ALCOA INC D 20.125 1000 20080305 10:03:01:55 AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10 BA BOEING D 77.575 800 20080305 10:03:02:78 C CITIGROUP D 34.125 2000 20080305 10:03:03:05 CAT CATERPILLAR D 22.5 600 20080305 10:03:03:46 DO DUPONT D 41.575 3000 20080305 10:03:04:12 AA ALCOA INC D 20.125 1000 20080305 10:03:01:55 AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10 BA BOEING D 77.575 800 20080305 10:03:02:78 Event Processing provides a new data management infrastructure to support and analyze Streams in real-time BA BOEING D 77.575 41.575 800 20080305 10:03:02:78 DO DUPONT D 41.575 3000 20080305 10:03:04:12 BA BOEING D 77.575 800 20080305 10:03:02:78 C CITIGROUP D 34.125 2000 20080305 10:03:03:05 BA BOEING D 77.575 800 20080305 10:03:02:78
  • 10. Filtering Ø  New stream filtered for specific criteria, e.g. stock price > $22 Ø  Correlation & Aggregation Ø  Scrolling, time-based window metrics, e.g. average # of stock trades in the last hour Ø  Pattern Matching Ø  Notification of detected event patterns, e.g. price changes A, B and C occurred within 15 minute window CAT CATERPILLAR D 22.5 600 20080305 10:03:03:46 DO DUPONT D 41.575 3000 20080305 10:03:04:12 AA ALCOA INC D 20.125 1000 20080305 10:03:01:55 AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10 BA BOEING D 77.575 800 20080305 10:03:02:78 …… • Event Processing done in-Memory (not in Database) • Logic is defined through Continuous Queries on the data CAT CATERPILLAR D 22.5 600 20080305 10:03:03:46 DO DUPONT D 41.575 3000 20080305 10:03:04:12 AA ALCOA INC D 20.125 1000 20080305 10:03:01:55 AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10 BA BOEING D 77.575 800 20080305 10:03:02:78 CAT CATERPILLAR D 22.5 600 20080305 10:03:03:46 DO DUPONT D 41.575 3000 20080305 10:03:04:12 AA ALCOA INC D 20.125 1000 20080305 10:03:01:55 AXP AMER EXPRESS CO D 45.875 500 20080305 10:03:02:10 BA BOEING D 77.575 800 20080305 10:03:02:78 BA BOEING D 77.575 41.575 800 20080305 10:03:02:78 DO DUPONT D 41.575 3000 20080305 10:03:04:12 COMPLEX QUERIES Event Processing outputs
  • 11. Data crunching for Event Processing done in a in-memory data grid •  High throughput for storing data •  Aggregation and event querying •  Pattern implementation flexibility combining complementary technologies •  Handle and correlate events in real time, including support for multiple patterns: •  Pre processing (buffer inputs) •  In Event Processing (to cache reference data) •  Post Processing (to expose processed events to consuming apps) Data Grid Event Processing Consolidat ed & in- context Data Filtered/ Aggregat ed Data HDFS and traditional storage
  • 12. In-memory events on the data stream n  Threshold Management n  Detecting threshold conditions across multiple event streams n  Using cache to: n  Allow dynamic configuration of thresholds n  Add (via join) contextual data to support aggregation n  Using pattern matching to find sustained conditions n  Alert Generation n  Using relations to represent state and state transitions n  Using “missing event” patterns to monitor expected response(s) n  Alarm Management n  Using pattern matching to remove extraneous alarm events n  e.g. power off alarm preceded by tamper alarm within (n) minutes X
  • 13. Alarm Filtering Scenario Discard Power Off Alarm if there was a Tamper Alarm for the same meter within the previous 5 seconds
  • 14. Visualizing events on the data stream JMS Resource Locations Matches and Alerts SQL Event Processing Application JMS Geo-Fencing Definitions SQL MapViewer Manager
  • 15. JMS Protocol Integration n Common integration touch point with Service Bus n Business Activity Monitoring integration HTTP Publish/Subscribe n Support pub/sub events between Event server and web clients. n Clients don’t need to poll for updates (unlike traditional HTTP). n Clients subscribe to and publish to event channels. n Bayeux protocol n Light weight and the payload is JSON Visual/SOA integration with Event Processing
  • 16. Event Processing High Level Architecture JSON Adapter CacheProcessor POJO EPN (Event Processing Network) Elements HTTP Pub/S
  • 17. Query Plan and Real Time Monitoring
  • 18. Event Driven SOA: Simplify Business Complexity •  Real-time business insight •  Preempt and react instantaneously to Enterprise, Environmental and Global Business conditions •  Gain business insight using previously untapped, raw event sources •  Hot-pluggable integration •  Transparent SOA infrastructure interoperability •  Distributed, deployment ready, pre-integrated, in-memory Data Grid, and Java low latency determinism. •  Lightweight high performance Java Event Server platform •  Real-time business friendly analyst oriented visualization layers •  Powerful, extensible Event Processing Analysis abstraction •  Business user dashboards •  Business user domain specific natural language layers •  Real-time predictive analytics
  • 19. Event Processing use cases in different industries 1.  Customer Experience 2.  Transportation, Logistics & Fleet Management 3.  Utilities: Demand & Response, Smart Meter 4.  Public Sector: Emergency Response, Intelligence 5.  Telcos: Real Time billing & WiFi offloading, Mobile billboard
  • 20. Customer Experience n  Industry focus on new buzzword: Customer Experience n  Desire to harness potential of social networks for better targeted marketing Event Processing can help with: n  Monitoring in real-time customer activity (social networks, location (e.g. proximity to stores, etc) and identifying opportunities in real-time n  Correlating with existing information (customer/ shopping profiles, etc.) n  Generating real-time alerts
  • 21. Transportation, Logistics and Fleet Management n  Constant industry pressure for greater efficiency n  Need to differentiate through premium services and greater reliability and visibility n  Availability of cheap wireless sensors (temperature, GPS, etc.) that can be included in packages/containers/trucks Event Processing can help with: n  Real-time monitoring of inflow of data from sensors n  Trends detection / prediction (to rise, etc.) n  Leveraging spatial/geo-location capabilities.
  • 22. Utilities n  Adoption of Smart Meters: concerns about bandwidth/ processing power required to handle the information they generate, desire to offer value-add services n  Ever increasing electricity demand n  Demand for real-time billing & analytics n  Greater customer expectations re: outage & response times n  Regulations Event Processing can help with: n  Alerting of consumption trends in real-time, enabling “Demand/ Response” n  Real-time detection of problems (abnormal spikes in consumption indicative of leaks, etc.) n  Filtering out redundant or nested (ex: tree fell on the line) outage errors and problems n  Tracking of resources and personnel
  • 23. Telco n  Overloaded data networks and new strategies to offload traffic: real-time billing vs. unlimited, offloading to WiFi, degradation of service from 4G to 3G, etc. n  GPS-enabled phones offer new location-based marketing opportunities: “mobile billboards” How can Event Processing help: n  Event Processing infrastructure can handle massive amounts of data generated by mobile devices, filter out, correlate and aggregate in real-time to only retain valuable information n  Event Processing can plug into all types of feeds, from devices to social networks n  Event Processing can be integrated with spatial and geo- location technology to send location specific data to the user.
  • 24. Public Sector n  Heightened security requirements n  Ever increasing population in urban areas drives optimization requirements n  Increasing number of real-time data: video feeds, GPS data, traffic data, etc. n  Applications: Security Intelligence, geo-fencing, “Smart Cities”, traffic control, gateless tolls How Event Processing can help: n  Event Processing can be integrated with spatial and geo- location technology to track location specific data with a user. n  Event Processing can plug in any data feed such as video / face recognition n  Event Processing meets performance & availability requirements in this space