SlideShare a Scribd company logo
Apache Kafka and Analytics
in a Connected IoT World
Kai Waehner
Technology Evangelist
contact@kai-waehner.de
LinkedIn
@KaiWaehner
www.confluent.io
www.kai-waehner.de
with Apache Kafka
Event
Streaming
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
5
STREAM
PROCESSING
Create and store
materialized views
Filter
Analyze in-flight
Time
C CC
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
TRADITIONAL
DATABASE
EVENT STREAM
PROCESSING
SELECT * FROM
DB_TABLE
CREATE TABLE T
AS SELECT * FROM
EVENT_STREAM
Active Query: Passive Data:
DB Table
Active Data: Passive Query:
Event Stream
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
TABLES STREAMS
USER
JAY
SUE
FRED
CREDIT_SCORE
695
430
710V1
V3
V2
PAYMENTS
42
18
65
...
USER
JAY
SUE
FRED
...
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
PUSH PULL
APP
Jay’s credit score is 670
Jay’s credit score is 710
Jay’s credit score is 695
What is Jay’s credit score now?
695
APP
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
The Log ConnectorsConnectors
Producer Consumer
Streaming Engine
Apache Kafka - The Rise of an Event Streaming Platform
9
=
Messaging
+
Storage
+
Integration
+
Processing
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Apache Kafka at Scale at Tech Giants
> 7 trillion messages / day > 6 Petabytes / day
“You name it”
* Kafka is not just used for big data
** Kafka Is not just used by tech giants
11
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
10 Reasons for Event Streaming with Apache Kafka
Real Time
Scalable
Cost Reduction
24/7 – Zero downtime, zero data loss
Decoupling – Storage, Domain-driven Design
Data (re-)processing and stateful client applications
Integration – Connectivity to IoT, legacy, big data, everything
Hybrid Architecture – On Premises, multi cloud, edge computing
Fully managed cloud
No vendor locking
12
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Device management
Unreliable networks
Connectivity beyond standards
Lightweight edge hardware
…
is not an IoT Platform!
Consumer IoT and Industrial IoT (IIoT)
Use Cases
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Ride-Sharing
More than just Messaging! Data correlation in real-time
for map-matching, ETA, cost calculation, and much more…
https://eng.lyft.com/a-new-real-time-map-matching-algorithm-at-lyft-da593ab7b006
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Connected Car Infrastructure
18
https://www.youtube.com/watch?v=yGLKi3TMJv8
• Real Time Data Analysis
• Swarm Intelligence
• Collaboration with Partners
• Predictive AI
• …
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Tesla
Trillions of messages per day for IoT use cases
https://www.confluent.io/kafka-summit-san-francisco-2019/0-60-teslas-streaming-data-platform/
https://www.confluent.io/blog/stream-processing-iot-data-best-practices-and-techniques/
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Track, manage, and locate
tools and other equipment
anytime and anywhere from
the warehouse to the jobsite https://www.confluent.io/customers/bosch/
https://events.confluent.io/online-talks/bosch-power-toolse-nables-real-time-analytics-on-iot-event-streams
DB Musterfirma | Vorname Name | Abteilung | Datum ("Einfügen > Kopf- und Fußzeile")
22Deutsche Bahn AG | Reisendeninformation
Consistent
real-time information
for travellers
across Germany
RI-Plattform
DB Musterfirma | Vorname Name | Abteilung | Datum ("Einfügen > Kopf- und Fußzeile")
23
Customer timetable
Operational
timetable
Assignments
Railway station
knowledge
Dispositions
Train positions
Matching
Aggregation
Consolidation
Apache
Kafka
Analysis
Railway station
Trains
Mobile Apps
Employees
Deutsche Bahn AG | Reisendeninformation
RI-Plattform
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Food Value Chain
IoT-Based and Data-Driven
Single source of truth
across the food value chain
(in the factories, and across regions)
Business critical
operations
(tracking, calculations, alerts, …)
https://www.confluent.io/blog/creating-iot-based-data-driven-food-value-chain-with-confluent-cloud/
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Postmodern ERP (coined by Gartner)
Replace legacy, monolithic and highly customized ERP suites
by a mixture of loosely coupled, exchangeable cloud-based and on-premises applications.
TMS
Legacy Proprietary
SOAP Web Services
Supplier
Alert
ForecastInventory Customer
Order
Core ERP
CRM
SaaS
Kafka Interface
MES
Proprietary
HTTP Web Services
LMS
Legacy Homegrown
Database + CDC
SRM
Kafka-native
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Real Time Supply Chain and Retailing IoT Platform
@ Mojix
https://www.confluent.io/customers/mojix/
Real-time operational intelligence with complex
event processing
Inventory accuracy increased from 65% to 99%
Omnichannel sales
Built using Confluent Cloud, Kafka, Kafka Connect
and Kafka Streams
Hybrid cloud across the edge – at retail stores and
distribution centers – and the cloud
Variety of sources, including RFID readers, camera
sensors, beacons, mobile devices and routers
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Cross-Company Supply Chain Integration
Streaming Replication and API Management
MirrorMaker 2
Confluent Replicator
Cluster Linking
Tier 2
Supplier
OEM Streaming integration
between companies
API Management
(REST et al) is not
appropriate for
streaming data
Infosec and politics are
your biggest hurdle
Tier 1
Supplier
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Augmented Reality for Smart Assistence
with Apache Kafka, Kafka Connect and ksqlDB
Pre-Processing and Data Correlation
(Kafka Streams / ksqlDB)
Receive
Command
Operator
(REST Proxy)
MES
(Java)
Send
Live
Metrics
Send
Command
Send
Production
StatusRobots
(C++)
Receive
Correlated
Information
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Cybersecurity
The threat is real!
Challenges
Stealing IP
DDoS
Ransomware / wiperware
WannaCry, NotPetya, …
Damage: Billions of dollars
”Supply chain attack”
Industry 4.0
Networking
Communication
Connectivity
Open standards
”Always-on”
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Legacy SIEM needs to evolve
ForwarderNetwork traffic
Firewall logs
RDBMS
Application logs
Adaptors
Beats
Sensor Data
Challenges:
● Proprietary forwarders that can only
send data to single source
● Data locked from being shared
● Difficult to scale with growing data
volumes
● Prohibitively high indexing costs
● Unable to filter out noisy data
● Slow batch processing
HTTP proxy logs
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
AI/ML
Modernized security information and event management (SIEM)
Filter,
transform,
aggregate
APP SIEM Index
Search
Curated streams
Forensic
Archive
HDFS
S3
Big Query
Syslog
CDC
Network traffic
Firewall logs
RDBMS
Application logs
Sensor Data
HTTP proxy logs
QRadar
Arcsight
Splunk
Elastic
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
BMW Group
Industry-ready NLP Service Framework Based on Kafka
https://www.confluent.io/kafka-summit-lon19/industry-ready-nlp-service-framework-kafka/
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Direct streaming ingestion
for model training
with TensorFlow I/O + Kafka Plugin
(no additional data storage
like S3 or HDFS required!)
Time
Model BModel A
Producer
Distributed
Commit Log
Streaming Ingestion and Model Training
with Kafka, Tiered Storage and TensorFlow IO
https://github.com/tensorflow/io
36
Model X
(at a later time)
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Confluent Tiered Storage for Kafka
Object Store
Processing Storage
Transactions,
auth, quota
enforcement,
compaction, ...
Local
Remote
Kafka
Apps
Store Forever
Older data is offloaded to inexpensive object
storage, permitting it to be consumed at any time.
Save $$$
Storage limitations, like capacity and duration,
are effectively uncapped.
Instantaneously scale up and down
Your Kafka clusters will be able to automatically
self-balance load and hence elastically scale
(Only available in Confluent Platform)
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
BI
Tool
AI/ML
Machine Vision for Quality Assurance and Yield Management
Apache Kafka and Applied Machine Learning
Filter, transform
aggregate, orchestrate
APP
Real-time alerting
Sensor Data
SCADA
MES
PLCs
OT
Team
Plant
Manager
Images
from Products
of Assembly Lines
IT
Team
Live
Ops
Machine Vision for
Quality Inspection
Reporting
Backup
Data Science Team
Data Lake
Why Confluent
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
The Rise of Event Streaming
2010
Apache Kafka
created at LinkedIn by
Confluent founders
2014
2020
80%
Fortune 100
Companies
trust and use
Apache Kafka
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
50
I N V E S T M E N T & T I M E
VALUE
3
4
5
1
2
Event Streaming Maturity Model
50
Initial Awareness /
Pilot (1 Kafka Cluster)
Start to Build Pipeline /
Deliver 1 New Outcome
(1 Kafka Cluster)
Mission-Critical
Deployment
(Stretched, Hybrid,
Multi-Region)
Build Contextual Event-
Driven Apps
(Stretched, Hybrid,
Multi-Region)
Central Nervous System
(Global Kafka)
Product, Support, Training, Partners, Technical Account Management...
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
51Confluent Platform
Fully Managed Cloud ServiceSelf Managed Software FREEDOM OF CHOICE
COMMITTER-DRIVEN EXPERTISE PartnersTrainingProfessional
Services
Enterprise
Support
Apache Kafka
EFFICIENT
OPERATIONS AT SCALE
PRODUCTION-
STAGE PREREQUISITES
UNRESTRICTED
DEVELOPER PRODUCTIVITY
SQL-based Stream Processing
KSQL (ksqlDB)
Rich Pre-built Ecosystem
Connectors | Hub | Schema Registry
Multi-language Development
non-Java clients | REST Proxy
GUI-driven Mgmt & Monitoring
Control Center
Flexible DevOps Automation
Operator | Ansible
Dynamic Performance &
Elasticity
Auto Data Balancer | Tiered Storage
Enterprise-grade Security
RBAC | Secrets | Audit logs
Data Compatibility
Schema Registry | Schema
Validation
Global Resilience
Multi-Region Clusters | Replicator
Developer Operator Architect
Open Source | Community licensed
PARTNERSHIP
FOR BUSINESS SUCCESS
Complete Engagement Model
Revenue / Cost / Risk Impact
TCO / ROI
Executive Buyer
IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
Global Event Streaming
Aggregate Small Footprint
Edge Deployments with
Replication (Aggregation)
Simplify Disaster Recovery
Operations with
Multi-Region Clusters
with RPO=0 and RTO=0
Stream Data Globally with
Replication and Cluster Linking
Kai Waehner
Technology Evangelist
contact@kai-waehner.de
@KaiWaehner
www.kai-waehner.de
www.confluent.io
LinkedIn
Questions? Feedback?
Let’s connect!

More Related Content

PDF
Top use cases for 2022 with Data in Motion and Apache Kafka
PDF
Confluent Cloud for Apache Kafka® | Google Cloud Next ’19
PDF
Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
PDF
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
PPTX
Bridge Your Kafka Streams to Azure Webinar
PDF
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
PDF
Building a Secure, Tamper-Proof & Scalable Blockchain on Top of Apache Kafka ...
PDF
Benefits of Stream Processing and Apache Kafka Use Cases
Top use cases for 2022 with Data in Motion and Apache Kafka
Confluent Cloud for Apache Kafka® | Google Cloud Next ’19
Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
Bridge Your Kafka Streams to Azure Webinar
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
Building a Secure, Tamper-Proof & Scalable Blockchain on Top of Apache Kafka ...
Benefits of Stream Processing and Apache Kafka Use Cases

What's hot (20)

PDF
Kafka for Real-Time Replication between Edge and Hybrid Cloud
PDF
Real time data processing and model inferncing platform with Kafka streams (N...
PDF
Building a real-time data processing pipeline using Apache Kafka, Kafka Conne...
PDF
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
PDF
Understanding the TCO and ROI of Apache Kafka & Confluent
PDF
Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...
PDF
Build a Bridge to Cloud with Apache Kafka® for Data Analytics Cloud Services
PDF
How Apache Kafka helps to create Data Culture – How to Cross the Kafka Chasm
PDF
Real-time processing of large amounts of data
PPTX
A guide through the Azure Messaging services - Update Conference
PPTX
Supply Chain Optimization with Apache Kafka
PDF
Bridge to Cloud: Using Apache Kafka to Migrate to AWS
PPTX
Should we manage events like APIs? | Kim Clark, IBM
PDF
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
PDF
Apache Kafka as Event-Driven Open Source Streaming Platform (Prague Meetup)
PDF
Kafka Streams vs. KSQL for Stream Processing on top of Apache Kafka
PDF
Neha Narkhede | Kafka Summit London 2019 Keynote | Event Streaming: Our Cloud...
PDF
Apache Kafka for Smart Grid, Utilities and Energy Production
PDF
Bridge to Cloud: Using Apache Kafka to Migrate to GCP
PDF
Apache Kafka as Event Streaming Platform for Microservice Architectures
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Real time data processing and model inferncing platform with Kafka streams (N...
Building a real-time data processing pipeline using Apache Kafka, Kafka Conne...
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
Understanding the TCO and ROI of Apache Kafka & Confluent
Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...
Build a Bridge to Cloud with Apache Kafka® for Data Analytics Cloud Services
How Apache Kafka helps to create Data Culture – How to Cross the Kafka Chasm
Real-time processing of large amounts of data
A guide through the Azure Messaging services - Update Conference
Supply Chain Optimization with Apache Kafka
Bridge to Cloud: Using Apache Kafka to Migrate to AWS
Should we manage events like APIs? | Kim Clark, IBM
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
Apache Kafka as Event-Driven Open Source Streaming Platform (Prague Meetup)
Kafka Streams vs. KSQL for Stream Processing on top of Apache Kafka
Neha Narkhede | Kafka Summit London 2019 Keynote | Event Streaming: Our Cloud...
Apache Kafka for Smart Grid, Utilities and Energy Production
Bridge to Cloud: Using Apache Kafka to Migrate to GCP
Apache Kafka as Event Streaming Platform for Microservice Architectures
Ad

Similar to Apache Kafka® and Analytics in a Connected IoT World (20)

PPTX
IoT and Event Streaming at Scale with Apache Kafka
PDF
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
PDF
Apache Kafka in the Airline, Aviation and Travel Industry
PDF
Apache Kafka in the Transportation and Logistics
PDF
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
PPTX
Unlock value with Confluent and AWS.pptx
PDF
Best Practices for Streaming IoT Data with MQTT and Apache Kafka
PDF
Viele Autos, noch mehr Daten: IoT-Daten-Streaming mit MQTT & Kafka (Kai Waehn...
PDF
App modernization on AWS with Apache Kafka and Confluent Cloud
PDF
Beyond the brokers - A tour of the Kafka ecosystem
PDF
Beyond the Brokers: A Tour of the Kafka Ecosystem
PDF
Apache Kafka in Financial Services - Use Cases and Architectures
PDF
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
PDF
Connected Vehicles and V2X with Apache Kafka
PDF
Apache kafka event_streaming___kai_waehner
PDF
Event Hub (i.e. Kafka) in Modern Data Architecture
PDF
Best Practices for Streaming IoT Data with MQTT and Apache Kafka®
PDF
dotScale 2017 Keynote: The Rise of Real Time by Neha Narkhede
PPTX
Streaming Data and Stream Processing with Apache Kafka
PPTX
Confluent-Ably-AWS-ID-2023 - GSlide.pptx
IoT and Event Streaming at Scale with Apache Kafka
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
Apache Kafka in the Airline, Aviation and Travel Industry
Apache Kafka in the Transportation and Logistics
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Unlock value with Confluent and AWS.pptx
Best Practices for Streaming IoT Data with MQTT and Apache Kafka
Viele Autos, noch mehr Daten: IoT-Daten-Streaming mit MQTT & Kafka (Kai Waehn...
App modernization on AWS with Apache Kafka and Confluent Cloud
Beyond the brokers - A tour of the Kafka ecosystem
Beyond the Brokers: A Tour of the Kafka Ecosystem
Apache Kafka in Financial Services - Use Cases and Architectures
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Connected Vehicles and V2X with Apache Kafka
Apache kafka event_streaming___kai_waehner
Event Hub (i.e. Kafka) in Modern Data Architecture
Best Practices for Streaming IoT Data with MQTT and Apache Kafka®
dotScale 2017 Keynote: The Rise of Real Time by Neha Narkhede
Streaming Data and Stream Processing with Apache Kafka
Confluent-Ably-AWS-ID-2023 - GSlide.pptx
Ad

More from confluent (20)

PDF
Stream Processing Handson Workshop - Flink SQL Hands-on Workshop (Korean)
PPTX
Webinar Think Right - Shift Left - 19-03-2025.pptx
PDF
Migration, backup and restore made easy using Kannika
PDF
Five Things You Need to Know About Data Streaming in 2025
PDF
Data in Motion Tour Seoul 2024 - Keynote
PDF
Data in Motion Tour Seoul 2024 - Roadmap Demo
PDF
From Stream to Screen: Real-Time Data Streaming to Web Frontends with Conflue...
PDF
Confluent per il settore FSI: Accelerare l'Innovazione con il Data Streaming...
PDF
Data in Motion Tour 2024 Riyadh, Saudi Arabia
PDF
Build a Real-Time Decision Support Application for Financial Market Traders w...
PDF
Strumenti e Strategie di Stream Governance con Confluent Platform
PDF
Compose Gen-AI Apps With Real-Time Data - In Minutes, Not Weeks
PDF
Building Real-Time Gen AI Applications with SingleStore and Confluent
PDF
Unlocking value with event-driven architecture by Confluent
PDF
Il Data Streaming per un’AI real-time di nuova generazione
PDF
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
PDF
Break data silos with real-time connectivity using Confluent Cloud Connectors
PDF
Building API data products on top of your real-time data infrastructure
PDF
Speed Wins: From Kafka to APIs in Minutes
PDF
Evolving Data Governance for the Real-time Streaming and AI Era
Stream Processing Handson Workshop - Flink SQL Hands-on Workshop (Korean)
Webinar Think Right - Shift Left - 19-03-2025.pptx
Migration, backup and restore made easy using Kannika
Five Things You Need to Know About Data Streaming in 2025
Data in Motion Tour Seoul 2024 - Keynote
Data in Motion Tour Seoul 2024 - Roadmap Demo
From Stream to Screen: Real-Time Data Streaming to Web Frontends with Conflue...
Confluent per il settore FSI: Accelerare l'Innovazione con il Data Streaming...
Data in Motion Tour 2024 Riyadh, Saudi Arabia
Build a Real-Time Decision Support Application for Financial Market Traders w...
Strumenti e Strategie di Stream Governance con Confluent Platform
Compose Gen-AI Apps With Real-Time Data - In Minutes, Not Weeks
Building Real-Time Gen AI Applications with SingleStore and Confluent
Unlocking value with event-driven architecture by Confluent
Il Data Streaming per un’AI real-time di nuova generazione
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
Break data silos with real-time connectivity using Confluent Cloud Connectors
Building API data products on top of your real-time data infrastructure
Speed Wins: From Kafka to APIs in Minutes
Evolving Data Governance for the Real-time Streaming and AI Era

Recently uploaded (20)

PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Machine Learning_overview_presentation.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Machine learning based COVID-19 study performance prediction
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Approach and Philosophy of On baking technology
PDF
A comparative analysis of optical character recognition models for extracting...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
Programs and apps: productivity, graphics, security and other tools
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Machine Learning_overview_presentation.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
MYSQL Presentation for SQL database connectivity
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Big Data Technologies - Introduction.pptx
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Machine learning based COVID-19 study performance prediction
Chapter 3 Spatial Domain Image Processing.pdf
Assigned Numbers - 2025 - Bluetooth® Document
The AUB Centre for AI in Media Proposal.docx
Approach and Philosophy of On baking technology
A comparative analysis of optical character recognition models for extracting...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Review of recent advances in non-invasive hemoglobin estimation
Dropbox Q2 2025 Financial Results & Investor Presentation
Mobile App Security Testing_ A Comprehensive Guide.pdf

Apache Kafka® and Analytics in a Connected IoT World

  • 1. Apache Kafka and Analytics in a Connected IoT World Kai Waehner Technology Evangelist [email protected] LinkedIn @KaiWaehner www.confluent.io www.kai-waehner.de
  • 3. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de 5 STREAM PROCESSING Create and store materialized views Filter Analyze in-flight Time C CC
  • 4. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de TRADITIONAL DATABASE EVENT STREAM PROCESSING SELECT * FROM DB_TABLE CREATE TABLE T AS SELECT * FROM EVENT_STREAM Active Query: Passive Data: DB Table Active Data: Passive Query: Event Stream
  • 5. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de TABLES STREAMS USER JAY SUE FRED CREDIT_SCORE 695 430 710V1 V3 V2 PAYMENTS 42 18 65 ... USER JAY SUE FRED ...
  • 6. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de PUSH PULL APP Jay’s credit score is 670 Jay’s credit score is 710 Jay’s credit score is 695 What is Jay’s credit score now? 695 APP
  • 7. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de The Log ConnectorsConnectors Producer Consumer Streaming Engine Apache Kafka - The Rise of an Event Streaming Platform 9 = Messaging + Storage + Integration + Processing
  • 8. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Apache Kafka at Scale at Tech Giants > 7 trillion messages / day > 6 Petabytes / day “You name it” * Kafka is not just used for big data ** Kafka Is not just used by tech giants 11
  • 9. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de 10 Reasons for Event Streaming with Apache Kafka Real Time Scalable Cost Reduction 24/7 – Zero downtime, zero data loss Decoupling – Storage, Domain-driven Design Data (re-)processing and stateful client applications Integration – Connectivity to IoT, legacy, big data, everything Hybrid Architecture – On Premises, multi cloud, edge computing Fully managed cloud No vendor locking 12
  • 10. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Device management Unreliable networks Connectivity beyond standards Lightweight edge hardware … is not an IoT Platform!
  • 11. Consumer IoT and Industrial IoT (IIoT) Use Cases
  • 12. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Ride-Sharing More than just Messaging! Data correlation in real-time for map-matching, ETA, cost calculation, and much more… https://eng.lyft.com/a-new-real-time-map-matching-algorithm-at-lyft-da593ab7b006
  • 13. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Connected Car Infrastructure 18 https://www.youtube.com/watch?v=yGLKi3TMJv8 • Real Time Data Analysis • Swarm Intelligence • Collaboration with Partners • Predictive AI • …
  • 14. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Tesla Trillions of messages per day for IoT use cases https://www.confluent.io/kafka-summit-san-francisco-2019/0-60-teslas-streaming-data-platform/ https://www.confluent.io/blog/stream-processing-iot-data-best-practices-and-techniques/
  • 15. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Track, manage, and locate tools and other equipment anytime and anywhere from the warehouse to the jobsite https://www.confluent.io/customers/bosch/ https://events.confluent.io/online-talks/bosch-power-toolse-nables-real-time-analytics-on-iot-event-streams
  • 16. DB Musterfirma | Vorname Name | Abteilung | Datum ("Einfügen > Kopf- und Fußzeile") 22Deutsche Bahn AG | Reisendeninformation Consistent real-time information for travellers across Germany RI-Plattform
  • 17. DB Musterfirma | Vorname Name | Abteilung | Datum ("Einfügen > Kopf- und Fußzeile") 23 Customer timetable Operational timetable Assignments Railway station knowledge Dispositions Train positions Matching Aggregation Consolidation Apache Kafka Analysis Railway station Trains Mobile Apps Employees Deutsche Bahn AG | Reisendeninformation RI-Plattform
  • 18. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
  • 19. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de
  • 20. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Food Value Chain IoT-Based and Data-Driven Single source of truth across the food value chain (in the factories, and across regions) Business critical operations (tracking, calculations, alerts, …) https://www.confluent.io/blog/creating-iot-based-data-driven-food-value-chain-with-confluent-cloud/
  • 21. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Postmodern ERP (coined by Gartner) Replace legacy, monolithic and highly customized ERP suites by a mixture of loosely coupled, exchangeable cloud-based and on-premises applications. TMS Legacy Proprietary SOAP Web Services Supplier Alert ForecastInventory Customer Order Core ERP CRM SaaS Kafka Interface MES Proprietary HTTP Web Services LMS Legacy Homegrown Database + CDC SRM Kafka-native
  • 22. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Real Time Supply Chain and Retailing IoT Platform @ Mojix https://www.confluent.io/customers/mojix/ Real-time operational intelligence with complex event processing Inventory accuracy increased from 65% to 99% Omnichannel sales Built using Confluent Cloud, Kafka, Kafka Connect and Kafka Streams Hybrid cloud across the edge – at retail stores and distribution centers – and the cloud Variety of sources, including RFID readers, camera sensors, beacons, mobile devices and routers
  • 23. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Cross-Company Supply Chain Integration Streaming Replication and API Management MirrorMaker 2 Confluent Replicator Cluster Linking Tier 2 Supplier OEM Streaming integration between companies API Management (REST et al) is not appropriate for streaming data Infosec and politics are your biggest hurdle Tier 1 Supplier
  • 24. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Augmented Reality for Smart Assistence with Apache Kafka, Kafka Connect and ksqlDB Pre-Processing and Data Correlation (Kafka Streams / ksqlDB) Receive Command Operator (REST Proxy) MES (Java) Send Live Metrics Send Command Send Production StatusRobots (C++) Receive Correlated Information
  • 25. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Cybersecurity The threat is real! Challenges Stealing IP DDoS Ransomware / wiperware WannaCry, NotPetya, … Damage: Billions of dollars ”Supply chain attack” Industry 4.0 Networking Communication Connectivity Open standards ”Always-on”
  • 26. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Legacy SIEM needs to evolve ForwarderNetwork traffic Firewall logs RDBMS Application logs Adaptors Beats Sensor Data Challenges: ● Proprietary forwarders that can only send data to single source ● Data locked from being shared ● Difficult to scale with growing data volumes ● Prohibitively high indexing costs ● Unable to filter out noisy data ● Slow batch processing HTTP proxy logs
  • 27. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de AI/ML Modernized security information and event management (SIEM) Filter, transform, aggregate APP SIEM Index Search Curated streams Forensic Archive HDFS S3 Big Query Syslog CDC Network traffic Firewall logs RDBMS Application logs Sensor Data HTTP proxy logs QRadar Arcsight Splunk Elastic
  • 28. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de BMW Group Industry-ready NLP Service Framework Based on Kafka https://www.confluent.io/kafka-summit-lon19/industry-ready-nlp-service-framework-kafka/
  • 29. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Direct streaming ingestion for model training with TensorFlow I/O + Kafka Plugin (no additional data storage like S3 or HDFS required!) Time Model BModel A Producer Distributed Commit Log Streaming Ingestion and Model Training with Kafka, Tiered Storage and TensorFlow IO https://github.com/tensorflow/io 36 Model X (at a later time)
  • 30. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Confluent Tiered Storage for Kafka Object Store Processing Storage Transactions, auth, quota enforcement, compaction, ... Local Remote Kafka Apps Store Forever Older data is offloaded to inexpensive object storage, permitting it to be consumed at any time. Save $$$ Storage limitations, like capacity and duration, are effectively uncapped. Instantaneously scale up and down Your Kafka clusters will be able to automatically self-balance load and hence elastically scale (Only available in Confluent Platform)
  • 31. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de BI Tool AI/ML Machine Vision for Quality Assurance and Yield Management Apache Kafka and Applied Machine Learning Filter, transform aggregate, orchestrate APP Real-time alerting Sensor Data SCADA MES PLCs OT Team Plant Manager Images from Products of Assembly Lines IT Team Live Ops Machine Vision for Quality Inspection Reporting Backup Data Science Team Data Lake
  • 33. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de The Rise of Event Streaming 2010 Apache Kafka created at LinkedIn by Confluent founders 2014 2020 80% Fortune 100 Companies trust and use Apache Kafka
  • 34. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de 50 I N V E S T M E N T & T I M E VALUE 3 4 5 1 2 Event Streaming Maturity Model 50 Initial Awareness / Pilot (1 Kafka Cluster) Start to Build Pipeline / Deliver 1 New Outcome (1 Kafka Cluster) Mission-Critical Deployment (Stretched, Hybrid, Multi-Region) Build Contextual Event- Driven Apps (Stretched, Hybrid, Multi-Region) Central Nervous System (Global Kafka) Product, Support, Training, Partners, Technical Account Management...
  • 35. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de 51Confluent Platform Fully Managed Cloud ServiceSelf Managed Software FREEDOM OF CHOICE COMMITTER-DRIVEN EXPERTISE PartnersTrainingProfessional Services Enterprise Support Apache Kafka EFFICIENT OPERATIONS AT SCALE PRODUCTION- STAGE PREREQUISITES UNRESTRICTED DEVELOPER PRODUCTIVITY SQL-based Stream Processing KSQL (ksqlDB) Rich Pre-built Ecosystem Connectors | Hub | Schema Registry Multi-language Development non-Java clients | REST Proxy GUI-driven Mgmt & Monitoring Control Center Flexible DevOps Automation Operator | Ansible Dynamic Performance & Elasticity Auto Data Balancer | Tiered Storage Enterprise-grade Security RBAC | Secrets | Audit logs Data Compatibility Schema Registry | Schema Validation Global Resilience Multi-Region Clusters | Replicator Developer Operator Architect Open Source | Community licensed PARTNERSHIP FOR BUSINESS SUCCESS Complete Engagement Model Revenue / Cost / Risk Impact TCO / ROI Executive Buyer
  • 36. IoT and Event Streaming – @KaiWaehner - www.kai-waehner.de Global Event Streaming Aggregate Small Footprint Edge Deployments with Replication (Aggregation) Simplify Disaster Recovery Operations with Multi-Region Clusters with RPO=0 and RTO=0 Stream Data Globally with Replication and Cluster Linking