Build your own Real Time Analytics and
Visualization, Enable Complex Event
Processing, Event Patterns and Aggregates




Ramesh / Vishnu
Supply Chain - Platform Team
Tom admiring his
  handiwork!
[Diagram: Database, Application Server]
[Diagram: Elastic Search, Graylog2, Logstash, Database, Application Server]
[Diagram: Elastic Search, StatsD, Graylog2, Logstash, Database, Application Server]
[Diagram: Elastic Search, StatsD, graphite, Graylog2, Logstash, Database, Application Server]
[Diagram: Search, Elastic Search, StatsD, graphite, Graylog2, Logstash, Database, Application Server]
[Diagram: Search, CEP, Elastic Search, StatsD, graphite, Graylog2, Logstash, Database, Application Server]
Complex Event Processing
 ●   ElasticSearch as storage or as an alternate DB
      ○  Faster on lookup queries than an RDBMS
      ○  Can do simple predicate queries
      ○  Does not need multiple indexes (full-text indexing)
      ○  Create fields out of interesting values

 ●   The StatsD layer is a sliding-window counter
      ○  Within a sliding window we can match regex patterns
      ○  Aggregates
      ○  Deviations
      ○  This is a key aspect of the SOA monitoring system (complex
         patterns which need action)

Push the complex pattern back to ES, or use it as a trigger for action
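
A minimal sketch of the sliding-window idea above, in plain Python rather than StatsD itself. The class name, the regex, the 60-second window, and the three-standard-deviation threshold are all assumptions made for illustration; the slides do not say how the window or the deviation check was implemented.

```python
import re
import statistics
from collections import deque


class SlidingWindowCounter:
    """Counts events whose message matches a regex within the last N seconds."""

    def __init__(self, pattern, window_seconds):
        self.pattern = re.compile(pattern)
        self.window_seconds = window_seconds
        self.timestamps = deque()  # timestamps of matching events, oldest first

    def add(self, timestamp, message):
        """Record the event if it matches, then expire events outside the window."""
        if self.pattern.search(message):
            self.timestamps.append(timestamp)
        self._expire(timestamp)

    def _expire(self, now):
        # Keep only events newer than (now - window_seconds).
        while self.timestamps and self.timestamps[0] <= now - self.window_seconds:
            self.timestamps.popleft()

    def count(self, now):
        """Aggregate: number of matching events still inside the window."""
        self._expire(now)
        return len(self.timestamps)


def is_deviation(history, current, threshold=3.0):
    """True when `current` lies more than `threshold` standard deviations
    away from the mean of the earlier window counts in `history`."""
    if len(history) < 2:
        return False
    mean = statistics.mean(history)
    stdev = statistics.stdev(history)
    if stdev == 0:
        return current != mean
    return abs(current - mean) / stdev > threshold


# Hypothetical log lines as (timestamp in seconds, message).
counter = SlidingWindowCounter(r"ERROR|timeout", window_seconds=60)
for ts, line in [
    (0, "GET /order 200"),
    (10, "ERROR payment gateway"),
    (30, "upstream timeout"),
    (65, "ERROR inventory lookup"),
]:
    counter.add(ts, line)

errors_in_window = counter.count(now=65)
needs_action = is_deviation([2, 3, 2, 3, 2], 20)
```

When `needs_action` is true, the pattern could be pushed back to ES or used as a trigger, as the slide suggests.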
Use cases
● Does every PO have a matching SO?

● Has a shelf in the warehouse just gone empty?

● Where is the current pile-up happening?

● Is the SLA being breached?
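
The first use case can be sketched as an event pattern: a PO event that is not followed by an SO event for the same order within some time limit. The event shape (`type`, `order_id`, `ts`) and the timeout are hypothetical; the slides do not define them.

```python
def unmatched_purchase_orders(events, now, timeout_seconds):
    """Return ids of PO events that have had no matching SO event
    for longer than timeout_seconds."""
    po_seen = {}     # order id -> timestamp of its PO event
    so_seen = set()  # order ids for which an SO event arrived
    for event in events:
        if event["type"] == "PO":
            po_seen[event["order_id"]] = event["ts"]
        elif event["type"] == "SO":
            so_seen.add(event["order_id"])
    return sorted(
        order_id
        for order_id, ts in po_seen.items()
        if order_id not in so_seen and now - ts > timeout_seconds
    )


# Hypothetical event stream, timestamps in seconds.
events = [
    {"type": "PO", "order_id": "A", "ts": 0},
    {"type": "SO", "order_id": "A", "ts": 5},
    {"type": "PO", "order_id": "B", "ts": 10},
    {"type": "PO", "order_id": "C", "ts": 100},
]
overdue = unmatched_purchase_orders(events, now=120, timeout_seconds=60)
```

Order C is still inside its time limit, so only B is reported. The same shape of check (an expected event missing within a deadline) also covers the SLA question.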
[Diagram: Search, CEP, Elastic Search, StatsD, graphite, Graylog2, Logstash, Database, Application Server]
Are logs the only source of events?

● No. The database can be used as well.

● Events can be generated by capturing the
  updates, inserts, and deletes made to the
  tables.

● These events can be published to an MQ to
  speed up replication (batch processing) or sent
  to the CEP engine.
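
A rough sketch of what a captured change event might look like, and how it could be published and consumed. Python's in-process `queue.Queue` stands in for the MQ server, and the event fields (`table`, `op`, `before`, `after`) are assumptions, not the format used in the talk.

```python
import json
import queue

VALID_OPERATIONS = {"INSERT", "UPDATE", "DELETE"}


def make_change_event(table, operation, before=None, after=None):
    """Describe one row change as a plain dictionary."""
    if operation not in VALID_OPERATIONS:
        raise ValueError(f"unsupported operation: {operation}")
    return {"table": table, "op": operation, "before": before, "after": after}


mq = queue.Queue()  # stand-in for a real message queue


def publish(event):
    """Serialize the event and put it on the queue."""
    mq.put(json.dumps(event, sort_keys=True))


def drain():
    """Consume everything currently on the queue, in publish order."""
    received = []
    while not mq.empty():
        received.append(json.loads(mq.get()))
    return received


# Hypothetical changes to a warehouse shelf table.
publish(make_change_event("shelf", "INSERT", after={"id": 7, "qty": 12}))
publish(make_change_event("shelf", "UPDATE",
                          before={"id": 7, "qty": 12}, after={"id": 7, "qty": 0}))
publish(make_change_event("shelf", "DELETE", before={"id": 7, "qty": 0}))

received = drain()
# A consumer on the CEP side could spot a shelf that has just gone empty.
emptied = [e["after"]["id"] for e in received
           if e["op"] == "UPDATE" and e["after"]["qty"] == 0]
```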
[Diagram: Search, CEP, Elastic Search, StatsD, graphite, Graylog2, Logstash, Change Data Capture, Database, Application Server]
[Diagram: Distribute General Query Log, Replication Events, MQ, Change Data Capture, log.cc, Search, CEP, Elastic Search, StatsD, graphite, Graylog2, Logstash, Database, Application Server]
Elasticsearch
Time to Sing
Mood of MySQL

Note: image is from http://www.technocation.org
Mood of MySQL

● Music is the best way to express how one feels.

● Well, MySQL has a soul too; it has a mood :)

● MySQL can sing through each query (good or bad) it gets.

● Every query MySQL gets is intercepted in log.cc and
  sent across to an MQ server. Subscribers to the
  queue, on receiving a message, play a musical note
  depending on the query they get.
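
A small sketch of the subscriber side: choosing a note for each query it receives. The statement-to-note mapping below is invented for illustration; the slides do not say which notes were used or how good and bad queries were told apart.

```python
# Hypothetical mapping from the leading SQL keyword to a note name.
NOTE_BY_STATEMENT = {
    "SELECT": "C4",
    "INSERT": "E4",
    "UPDATE": "G4",
    "DELETE": "A4",
}


def note_for_query(sql):
    """Pick a note from the first keyword of the query; rest on anything else."""
    words = sql.strip().split(None, 1)
    keyword = words[0].upper() if words else ""
    return NOTE_BY_STATEMENT.get(keyword, "REST")


melody = [note_for_query(q) for q in [
    "select * from orders",
    "INSERT INTO orders VALUES (1)",
    "  DELETE FROM orders WHERE id = 1",
    "SHOW TABLES",
]]
```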
Use case: Divide & Conquer General query log
● An alternative to enabling the general query log, which grows
  very fast, so disk space becomes a concern on the master
  database.

● The queries are sent to a queue on an MQ server, and an
  army of subscribers listening to the queue log each query
  on receiving a message.

● The general query log can now be distributed (among the
  subscribers).

● More subscribers => smaller logs that are easier to
  rotate.
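
The bullets above describe a competing-consumers setup. Here is a sketch under simplifying assumptions: threads and an in-process `queue.Queue` stand in for separate subscriber machines and the MQ server, and each subscriber appends to an in-memory list rather than a log file.

```python
import queue
import threading


def run_subscribers(queries, subscriber_count):
    """Fan queries out to several subscribers; each query is logged exactly once."""
    mq = queue.Queue()
    logs = [[] for _ in range(subscriber_count)]  # one log per subscriber

    def subscriber(log):
        while True:
            message = mq.get()
            if message is None:  # sentinel: no more queries
                break
            log.append(message)

    workers = [threading.Thread(target=subscriber, args=(log,)) for log in logs]
    for worker in workers:
        worker.start()
    for query in queries:
        mq.put(query)
    for _ in workers:
        mq.put(None)  # one sentinel per subscriber
    for worker in workers:
        worker.join()
    return logs


queries = [f"SELECT * FROM orders WHERE id = {i}" for i in range(100)]
logs = run_subscribers(queries, subscriber_count=4)
total_logged = sum(len(log) for log in logs)
```

How the queries split across subscribers depends on scheduling, but together the logs hold every query once, and adding subscribers shrinks each individual log on average.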
References

http://bazaar.launchpad.net/~mysql/mysql-replication-listener/trunk

https://github.com/etsy/statsd/

https://launchpad.net/graphite

http://www.elasticsearch.org/

http://www.oscon.com/oscon2011/public/schedule/detail/18785

http://technocation.org/
Thank you




 vishnuhr@flipkart.com
rameshpy@flipkart.com
