SlideShare a Scribd company logo
1
Best Practices for Streaming IoT
Data with MQTT and Apache
Kafka®
Kai Waehner, Technology Evangelist, Confluent
Dominik Obermaier, CTO, HiveMQ
2
Speakers
Kai Waehner
Technology Evangelist, Confluent
kai.waehner@confluent.io
@KaiWaehner
Dominik Obermaier
CTO, HiveMQ
dominik.obermaier@hivemq.com
@dobermai
4
Agenda
• Use Case
• Architecture
• Live Demo
• Best Practices
• Next steps
5
Agenda
• Use Case
• Architecture
• Live Demo
• Best Practices
• Next steps
6
Global Automotive Company
Builds Connected Car Infrastructure
Use Cases:
• Connected Car Infrastructure (Cars, Traffic Lights, Cloud Services, etc.)
• Real Time Analytics (Predictive Maintenance, etc.)
• Continuous Services / Sales
• Partner Integration (Car workshop, gas station, food market, etc.)
• …
77
Other Components
Kafka Ecosystem
(3) Read Data
TensorFlow I/O
(5) Deploy
Model
(2) Preprocess
Data
(8a)
Alert Car
Real Time
Kafka App
TensorFlow
Serving
gRPC
Car Sensor
HiveMQ
MQTT Broker
MQTT
Connector
Kafka Connect
or
Confluent Proxy
or
HiveMQ Plugin
Kafka
Cluster
Kafka
Connect
Grafana
Elastic
Search
KSQL
Tensor
Flow
(04)
Train Model
Real Time
Kafka Streams
Application
(Java / Scala)
Tensor Flow
Real Time
Edge
Computing
(C / librdkafka)
Tensor Flow
Lite
(1)
Ingest Data
(8b)
Alert Driver
(e.g. mobile App)
(6a) Consume
Car Data
(6b)
All Data
(7) Potential
Defect
8
Agenda
• Use Case
• Architecture
• Live Demo
• Best Practices
• Next steps
9
Cloud Native Infrastructure
Benefits
• Scalable
• Flexible
• Agile
• Elastic
• Automated
• Etc.
10
• IoT-specific features for bad network / connectivity
• Widely used (mostly IoT, but also web and mobile apps
via MQTT over WebSocket)
• Built on top of TCP/IP for constrained devices and
unreliable networks
• Many (open source) broker implementations
• Many (open source) client libraries
MQTT - Publish / subscribe messaging protocol
11
MQTT Trade-Offs
Pros
• Lightweight
• All programming languages supported
• Built for poor connectivity / high latency scenarios (e.g. mobile networks!)
• High scalability and availability *
• ISO Standard
• Most popular IoT protocol
Cons
• Only pub/sub, not stream processing
• Asynchronous processing (clients can be offline for long time)
• No reprocessing of events
* Depending on Broker Implementation
12
A Streaming Platform is the Underpinning of an Event-driven Architecture
Ubiquitous connectivity
Globally scalable platform for all
event producers and consumers
Immediate data access
Data accessible to all consumers
in real time
Single system of record
Persistent storage to enable
reprocessing of past events
Continuous queries
Stream processing capabilities for
in-line data transformation
Microservices
DBs
SaaS apps
Mobile
Customer 360
Real-time fraud
detection
Data warehouse
Producers
Consumers
Database
change
Microservices
events
SaaS
data
Customer
experiences
Streams of real time events
Stream processing apps
13
Kafka Trade-Offs (from IoT perspective)
Pros
• Stream processing, not just pub/sub
• High throughput
• Large scale
• High availability
• Long term storage and buffering
• Reprocessing of events
• Good integration to rest of the enterprise
Cons
• Not built for tens of thousands connections
• Requires stable network and good infrastructure
• No IoT-specific features like keep alive, last will or testament
14
(De facto) Standards for Processing IoT Data
A Match Made In Heaven
+ =
15
Agenda
• Use Case
• Architecture
• Live Demo
• Best Practices
• Next steps
16
Advanced Demo
17
MVP
18
Live Demo
End-to-End Integration and Data Processing for
100000 Connected Cars
19
https://github.com/kaiwaehner/hivemq-mqtt-tensorflow-kafka-realtime-iot-machine-learning-training-inference
or
http://bit.ly/kafka-mqtt-ml-demo
=> Try it out in 30 minutes!
Demo 100.000 Connected Cars
(Kafka + MQTT + TensorFlow)
20
Agenda
• Use Case
• Architecture
• Live Demo
• Best Practices
• Next steps
21
Typical Journey
Value
Maturity (Investment & time)
2
Enterprise
Streaming Pilot /
Early Production
Pub + Sub Store Process
5
Central Nervous
System
1
Developer
Interest
Pre-Streaming
4
Global
Streaming
3
SLA Ready,
Integrated
Streaming
Projects
Platform
22
Start Small, but prepare for Scalability from Beginning
1. Use cloud native and scalable components
• Confluent Platform is cloud native and built for scale
• HiveMQ is cloud native and built for scale
2. Don’t deep dive too much in the beginning –
but understand options
• HiveMQ Kafka Extension?
• Confluent MQTT connectors?
• Customer Integration?
3. Plan for Enterprise-readiness
• Security
• Monitoring
• Operations tooling
• Bi-directional communication
23
Choose the right tool stack and infrastructure
Understand Trade-Offs and choose the right options for deployments
• Edge
• On Premise
• Cloud
Use the best tools for the job
• Confluent Platform for Event Streaming
• HiveMQ for MQTT messaging and connectivity
24
Separation of concerns
1. Devices
2. Gateway
3. Integration
4. Data Streaming
5. Consumer Apps
Decouple tasks
• Source integration
• Data processing
• Business logic
• Sink integration
• Analytics
• …
25
Different data for different use cases
• Database, Data Lake
• Search
• Real time, Near Real Time, Batch
• Streaming, Request-Response
• CQRS, Event Sourcing
• Machine Learning
There is no single
MASTER DATA EVENT…
26
Agenda
• Use Case
• Architecture
• Live Demo
• Best Practices
• Next steps
27
The HiveMQ Platform
28
The HiveMQ Platform – Open Source and Enterprise-grade
29
Confluent Platform
Operations and Security
Development & Stream Processing
Support,Services,Training&Partners
Apache Kafka
Security plugins | Role-Based Access Control
Control Center | Replicator | Auto Data Balancer | Operator
Connectors
Clients | REST Proxy
MQTT Proxy | Schema Registry
KSQL
Connect Continuous Commit Log Streams
Complete Event
Streaming Platform
Mission-critical
Reliability
Freedom of Choice
Datacenter Public Cloud Confluent Cloud
Self-Managed Software Fully-Managed Service
30
Confluent Cloud
Cloud-Native Confluent Platform Fully-Managed Service
Available on the leading public clouds with mission-critical SLAs and consumption-based pricing.
Serverless Kafka characteristics:
Pay-as-you-go, elastic auto-scaling, abstracting infrastructure (topics not brokers)
Spend your time on your applications!
31
Next steps…
Try out the demo in 30 minutes:
https://github.com/kaiwaehner/hivemq-mqtt-tensorflow-kafka-realtime-iot-machine-learning-training-inference
http://bit.ly/kafka-mqtt-ml-demo
Check out the documentation and blog posts
• HiveMQ and Apache Kafka - Streaming IoT Data and MQTT Messages:
https://www.hivemq.com/blog/streaming-iot-data-and-mqtt-messages-to-apache-kafka/
• Internet of Things (IoT) and Event Streaming at Scale with Apache Kafka and MQTT:
https://www.confluent.io/blog/iot-with-kafka-connect-mqtt-and-rest-proxy
Contact us for questions or any other feedback:
• Website, Email, Slack, Phone, …
• Dominik: dominik@hivemq.com , Kai: kai.waehner@confluent.io
32
Questions? Feedback?
Kai Waehner
Technology Evangelist
kai.waehner@confluent.io
LinkedIn
@KaiWaehner
www.confluent.io
www.kai-waehner.de
Please contact us!
Dominik Obermaier
CTO HiveMQ
dominik.obermaier@hivemq.com
www.linkedin.com/in/dobermai
www.hivemq.com
www.twitter.com/dobermai

More Related Content

What's hot (20)

PPT
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
PDF
Kubernetes 101 - an Introduction to Containers, Kubernetes, and OpenShift
DevOps.com
 
PDF
Building Lakehouses on Delta Lake with SQL Analytics Primer
Databricks
 
PPTX
Introduction to kubernetes
Rishabh Indoria
 
PDF
Modernizing to a Cloud Data Architecture
Databricks
 
PPTX
OpenTelemetry For Architects
Kevin Brockhoff
 
PDF
Apache Kafka in the Airline, Aviation and Travel Industry
Kai Wähner
 
PPTX
Azure data platform overview
James Serra
 
PDF
Everything You wanted to Know About Distributed Tracing
Amuhinda Hungai
 
PDF
Apache Kafka as Event Streaming Platform for Microservice Architectures
Kai Wähner
 
PDF
A Thorough Comparison of Delta Lake, Iceberg and Hudi
Databricks
 
PDF
Building an open data platform with apache iceberg
Alluxio, Inc.
 
PDF
Apache Kafka in Financial Services - Use Cases and Architectures
Kai Wähner
 
PDF
Nifi workshop
Yifeng Jiang
 
PDF
Real time stock processing with apache nifi, apache flink and apache kafka
Timothy Spann
 
PDF
Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021
StreamNative
 
PPTX
Inside open metadata—the deep dive
DataWorks Summit
 
PDF
Presto: Fast SQL-on-Anything (including Delta Lake, Snowflake, Elasticsearch ...
Databricks
 
PPTX
Demystifying Data Warehouse as a Service
Snowflake Computing
 
PDF
Lakehouse in Azure
Sergio Zenatti Filho
 
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
Kubernetes 101 - an Introduction to Containers, Kubernetes, and OpenShift
DevOps.com
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Databricks
 
Introduction to kubernetes
Rishabh Indoria
 
Modernizing to a Cloud Data Architecture
Databricks
 
OpenTelemetry For Architects
Kevin Brockhoff
 
Apache Kafka in the Airline, Aviation and Travel Industry
Kai Wähner
 
Azure data platform overview
James Serra
 
Everything You wanted to Know About Distributed Tracing
Amuhinda Hungai
 
Apache Kafka as Event Streaming Platform for Microservice Architectures
Kai Wähner
 
A Thorough Comparison of Delta Lake, Iceberg and Hudi
Databricks
 
Building an open data platform with apache iceberg
Alluxio, Inc.
 
Apache Kafka in Financial Services - Use Cases and Architectures
Kai Wähner
 
Nifi workshop
Yifeng Jiang
 
Real time stock processing with apache nifi, apache flink and apache kafka
Timothy Spann
 
Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021
StreamNative
 
Inside open metadata—the deep dive
DataWorks Summit
 
Presto: Fast SQL-on-Anything (including Delta Lake, Snowflake, Elasticsearch ...
Databricks
 
Demystifying Data Warehouse as a Service
Snowflake Computing
 
Lakehouse in Azure
Sergio Zenatti Filho
 

Similar to Best Practices for Streaming IoT Data with MQTT and Apache Kafka® (20)

PDF
Best Practices for Streaming IoT Data with MQTT and Apache Kafka
Kai Wähner
 
PDF
Viele Autos, noch mehr Daten: IoT-Daten-Streaming mit MQTT & Kafka (Kai Waehn...
confluent
 
PDF
Best Practices for Streaming Connected Car Data with MQTT & Kafka
HiveMQ
 
PDF
IoT Sensor Analytics with Python, Jupyter, TensorFlow, Keras, Apache Kafka, K...
Kai Wähner
 
PDF
Apache Kafka and MQTT - Overview, Comparison, Use Cases, Architectures
Kai Wähner
 
PDF
Lightweight and scalable IoT Architectures with MQTT
Dominik Obermaier
 
PDF
Processing IoT Data from End to End with MQTT and Apache Kafka
confluent
 
PDF
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
Kai Wähner
 
PPTX
IoT and Event Streaming at Scale with Apache Kafka
confluent
 
PDF
IoT Sensor Analytics with Kafka, ksqlDB and TensorFlow
Kai Wähner
 
PDF
Io t data streaming
ratthaslip ranokphanuwat
 
PDF
Event Hub (i.e. Kafka) in Modern Data Architecture
Guido Schmutz
 
PDF
Apache Kafka® and Analytics in a Connected IoT World
confluent
 
PDF
Unleashing Apache Kafka and TensorFlow in the Cloud

Kai Wähner
 
PDF
Ingesting and Processing IoT Data Using MQTT, Kafka Connect and Kafka Streams...
confluent
 
PDF
Ingesting and Processing IoT Data - using MQTT, Kafka Connect and KSQL
Guido Schmutz
 
PDF
HiveMQ + Kafka: The ideal solution for IoT MQTT data integration
MargarethaErber
 
PDF
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)
Kai Wähner
 
PDF
Modernizing the Manufacturing Industry with Kafka and MQTT
Dominik Obermaier
 
PDF
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
Kai Wähner
 
Best Practices for Streaming IoT Data with MQTT and Apache Kafka
Kai Wähner
 
Viele Autos, noch mehr Daten: IoT-Daten-Streaming mit MQTT & Kafka (Kai Waehn...
confluent
 
Best Practices for Streaming Connected Car Data with MQTT & Kafka
HiveMQ
 
IoT Sensor Analytics with Python, Jupyter, TensorFlow, Keras, Apache Kafka, K...
Kai Wähner
 
Apache Kafka and MQTT - Overview, Comparison, Use Cases, Architectures
Kai Wähner
 
Lightweight and scalable IoT Architectures with MQTT
Dominik Obermaier
 
Processing IoT Data from End to End with MQTT and Apache Kafka
confluent
 
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
Kai Wähner
 
IoT and Event Streaming at Scale with Apache Kafka
confluent
 
IoT Sensor Analytics with Kafka, ksqlDB and TensorFlow
Kai Wähner
 
Io t data streaming
ratthaslip ranokphanuwat
 
Event Hub (i.e. Kafka) in Modern Data Architecture
Guido Schmutz
 
Apache Kafka® and Analytics in a Connected IoT World
confluent
 
Unleashing Apache Kafka and TensorFlow in the Cloud

Kai Wähner
 
Ingesting and Processing IoT Data Using MQTT, Kafka Connect and Kafka Streams...
confluent
 
Ingesting and Processing IoT Data - using MQTT, Kafka Connect and KSQL
Guido Schmutz
 
HiveMQ + Kafka: The ideal solution for IoT MQTT data integration
MargarethaErber
 
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)
Kai Wähner
 
Modernizing the Manufacturing Industry with Kafka and MQTT
Dominik Obermaier
 
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
Kai Wähner
 
Ad

More from confluent (20)

PDF
Stream Processing Handson Workshop - Flink SQL Hands-on Workshop (Korean)
confluent
 
PPTX
Webinar Think Right - Shift Left - 19-03-2025.pptx
confluent
 
PDF
Migration, backup and restore made easy using Kannika
confluent
 
PDF
Five Things You Need to Know About Data Streaming in 2025
confluent
 
PDF
Data in Motion Tour Seoul 2024 - Keynote
confluent
 
PDF
Data in Motion Tour Seoul 2024 - Roadmap Demo
confluent
 
PDF
From Stream to Screen: Real-Time Data Streaming to Web Frontends with Conflue...
confluent
 
PDF
Confluent per il settore FSI: Accelerare l'Innovazione con il Data Streaming...
confluent
 
PDF
Data in Motion Tour 2024 Riyadh, Saudi Arabia
confluent
 
PDF
Build a Real-Time Decision Support Application for Financial Market Traders w...
confluent
 
PDF
Strumenti e Strategie di Stream Governance con Confluent Platform
confluent
 
PDF
Compose Gen-AI Apps With Real-Time Data - In Minutes, Not Weeks
confluent
 
PDF
Building Real-Time Gen AI Applications with SingleStore and Confluent
confluent
 
PDF
Unlocking value with event-driven architecture by Confluent
confluent
 
PDF
Il Data Streaming per un’AI real-time di nuova generazione
confluent
 
PDF
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
confluent
 
PDF
Break data silos with real-time connectivity using Confluent Cloud Connectors
confluent
 
PDF
Building API data products on top of your real-time data infrastructure
confluent
 
PDF
Speed Wins: From Kafka to APIs in Minutes
confluent
 
PDF
Evolving Data Governance for the Real-time Streaming and AI Era
confluent
 
Stream Processing Handson Workshop - Flink SQL Hands-on Workshop (Korean)
confluent
 
Webinar Think Right - Shift Left - 19-03-2025.pptx
confluent
 
Migration, backup and restore made easy using Kannika
confluent
 
Five Things You Need to Know About Data Streaming in 2025
confluent
 
Data in Motion Tour Seoul 2024 - Keynote
confluent
 
Data in Motion Tour Seoul 2024 - Roadmap Demo
confluent
 
From Stream to Screen: Real-Time Data Streaming to Web Frontends with Conflue...
confluent
 
Confluent per il settore FSI: Accelerare l'Innovazione con il Data Streaming...
confluent
 
Data in Motion Tour 2024 Riyadh, Saudi Arabia
confluent
 
Build a Real-Time Decision Support Application for Financial Market Traders w...
confluent
 
Strumenti e Strategie di Stream Governance con Confluent Platform
confluent
 
Compose Gen-AI Apps With Real-Time Data - In Minutes, Not Weeks
confluent
 
Building Real-Time Gen AI Applications with SingleStore and Confluent
confluent
 
Unlocking value with event-driven architecture by Confluent
confluent
 
Il Data Streaming per un’AI real-time di nuova generazione
confluent
 
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
confluent
 
Break data silos with real-time connectivity using Confluent Cloud Connectors
confluent
 
Building API data products on top of your real-time data infrastructure
confluent
 
Speed Wins: From Kafka to APIs in Minutes
confluent
 
Evolving Data Governance for the Real-time Streaming and AI Era
confluent
 
Ad

Recently uploaded (20)

PDF
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
 
PPSX
Usergroup - OutSystems Architecture.ppsx
Kurt Vandevelde
 
DOCX
Daily Lesson Log MATATAG ICT TEchnology 8
LOIDAALMAZAN3
 
PDF
My Journey from CAD to BIM: A True Underdog Story
Safe Software
 
PDF
UiPath Agentic AI ile Akıllı Otomasyonun Yeni Çağı
UiPathCommunity
 
PDF
Cracking the Code - Unveiling Synergies Between Open Source Security and AI.pdf
Priyanka Aash
 
PDF
Optimizing the trajectory of a wheel loader working in short loading cycles
Reno Filla
 
PDF
How to Visualize the ​Spatio-Temporal Data Using CesiumJS​
SANGHEE SHIN
 
PDF
“Scaling i.MX Applications Processors’ Native Edge AI with Discrete AI Accele...
Edge AI and Vision Alliance
 
PDF
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
PPTX
01_Approach Cyber- DORA Incident Management.pptx
FinTech Belgium
 
PDF
Unlocking FME Flow’s Potential: Architecture Design for Modern Enterprises
Safe Software
 
PDF
5 Things to Consider When Deploying AI in Your Enterprise
Safe Software
 
PDF
2025_06_18 - OpenMetadata Community Meeting.pdf
OpenMetadata
 
PDF
Python Conference Singapore - 19 Jun 2025
ninefyi
 
PDF
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
Safe Software
 
PDF
Kubernetes - Architecture & Components.pdf
geethak285
 
PDF
From Chatbot to Destroyer of Endpoints - Can ChatGPT Automate EDR Bypasses (1...
Priyanka Aash
 
PDF
FME as an Orchestration Tool with Principles From Data Gravity
Safe Software
 
PDF
Why aren't you using FME Flow's CPU Time?
Safe Software
 
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
 
Usergroup - OutSystems Architecture.ppsx
Kurt Vandevelde
 
Daily Lesson Log MATATAG ICT TEchnology 8
LOIDAALMAZAN3
 
My Journey from CAD to BIM: A True Underdog Story
Safe Software
 
UiPath Agentic AI ile Akıllı Otomasyonun Yeni Çağı
UiPathCommunity
 
Cracking the Code - Unveiling Synergies Between Open Source Security and AI.pdf
Priyanka Aash
 
Optimizing the trajectory of a wheel loader working in short loading cycles
Reno Filla
 
How to Visualize the ​Spatio-Temporal Data Using CesiumJS​
SANGHEE SHIN
 
“Scaling i.MX Applications Processors’ Native Edge AI with Discrete AI Accele...
Edge AI and Vision Alliance
 
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
01_Approach Cyber- DORA Incident Management.pptx
FinTech Belgium
 
Unlocking FME Flow’s Potential: Architecture Design for Modern Enterprises
Safe Software
 
5 Things to Consider When Deploying AI in Your Enterprise
Safe Software
 
2025_06_18 - OpenMetadata Community Meeting.pdf
OpenMetadata
 
Python Conference Singapore - 19 Jun 2025
ninefyi
 
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
Safe Software
 
Kubernetes - Architecture & Components.pdf
geethak285
 
From Chatbot to Destroyer of Endpoints - Can ChatGPT Automate EDR Bypasses (1...
Priyanka Aash
 
FME as an Orchestration Tool with Principles From Data Gravity
Safe Software
 
Why aren't you using FME Flow's CPU Time?
Safe Software
 

Best Practices for Streaming IoT Data with MQTT and Apache Kafka®

  • 1. 1 Best Practices for Streaming IoT Data with MQTT and Apache Kafka® Kai Waehner, Technology Evangelist, Confluent Dominik Obermaier, CTO, HiveMQ
  • 2. 2 Speakers Kai Waehner Technology Evangelist, Confluent [email protected] @KaiWaehner Dominik Obermaier CTO, HiveMQ [email protected] @dobermai
  • 3. 4 Agenda • Use Case • Architecture • Live Demo • Best Practices • Next steps
  • 4. 5 Agenda • Use Case • Architecture • Live Demo • Best Practices • Next steps
  • 5. 6 Global Automotive Company Builds Connected Car Infrastructure Use Cases: • Connected Car Infrastructure (Cars, Traffic Lights, Cloud Services, etc.) • Real Time Analytics (Predictive Maintenance, etc.) • Continuous Services / Sales • Partner Integration (Car workshop, gas station, food market, etc.) • …
  • 6. 77 Other Components Kafka Ecosystem (3) Read Data TensorFlow I/O (5) Deploy Model (2) Preprocess Data (8a) Alert Car Real Time Kafka App TensorFlow Serving gRPC Car Sensor HiveMQ MQTT Broker MQTT Connector Kafka Connect or Confluent Proxy or HiveMQ Plugin Kafka Cluster Kafka Connect Grafana Elastic Search KSQL Tensor Flow (04) Train Model Real Time Kafka Streams Application (Java / Scala) Tensor Flow Real Time Edge Computing (C / librdkafka) Tensor Flow Lite (1) Ingest Data (8b) Alert Driver (e.g. mobile App) (6a) Consume Car Data (6b) All Data (7) Potential Defect
  • 7. 8 Agenda • Use Case • Architecture • Live Demo • Best Practices • Next steps
  • 8. 9 Cloud Native Infrastructure Benefits • Scalable • Flexible • Agile • Elastic • Automated • Etc.
  • 9. 10 • IoT-specific features for bad network / connectivity • Widely used (mostly IoT, but also web and mobile apps via MQTT over WebSocket) • Built on top of TCP/IP for constrained devices and unreliable networks • Many (open source) broker implementations • Many (open source) client libraries MQTT - Publish / subscribe messaging protocol
  • 10. 11 MQTT Trade-Offs Pros • Lightweight • All programming languages supported • Built for poor connectivity / high latency scenarios (e.g. mobile networks!) • High scalability and availability * • ISO Standard • Most popular IoT protocol Cons • Only pub/sub, not stream processing • Asynchronous processing (clients can be offline for long time) • No reprocessing of events * Depending on Broker Implementation
  • 11. 12 A Streaming Platform is the Underpinning of an Event-driven Architecture Ubiquitous connectivity Globally scalable platform for all event producers and consumers Immediate data access Data accessible to all consumers in real time Single system of record Persistent storage to enable reprocessing of past events Continuous queries Stream processing capabilities for in-line data transformation Microservices DBs SaaS apps Mobile Customer 360 Real-time fraud detection Data warehouse Producers Consumers Database change Microservices events SaaS data Customer experiences Streams of real time events Stream processing apps
  • 12. 13 Kafka Trade-Offs (from IoT perspective) Pros • Stream processing, not just pub/sub • High throughput • Large scale • High availability • Long term storage and buffering • Reprocessing of events • Good integration to rest of the enterprise Cons • Not built for tens of thousands connections • Requires stable network and good infrastructure • No IoT-specific features like keep alive, last will or testament
  • 13. 14 (De facto) Standards for Processing IoT Data A Match Made In Heaven + =
  • 14. 15 Agenda • Use Case • Architecture • Live Demo • Best Practices • Next steps
  • 17. 18 Live Demo End-to-End Integration and Data Processing for 100000 Connected Cars
  • 19. 20 Agenda • Use Case • Architecture • Live Demo • Best Practices • Next steps
  • 20. 21 Typical Journey Value Maturity (Investment & time) 2 Enterprise Streaming Pilot / Early Production Pub + Sub Store Process 5 Central Nervous System 1 Developer Interest Pre-Streaming 4 Global Streaming 3 SLA Ready, Integrated Streaming Projects Platform
  • 21. 22 Start Small, but prepare for Scalability from Beginning 1. Use cloud native and scalable components • Confluent Platform is cloud native and built for scale • HiveMQ is cloud native and built for scale 2. Don’t deep dive too much in the beginning – but understand options • HiveMQ Kafka Extension? • Confluent MQTT connectors? • Customer Integration? 3. Plan for Enterprise-readiness • Security • Monitoring • Operations tooling • Bi-directional communication
  • 22. 23 Choose the right tool stack and infrastructure Understand Trade-Offs and choose the right options for deployments • Edge • On Premise • Cloud Use the best tools for the job • Confluent Platform for Event Streaming • HiveMQ for MQTT messaging and connectivity
  • 23. 24 Separation of concerns 1. Devices 2. Gateway 3. Integration 4. Data Streaming 5. Consumer Apps Decouple tasks • Source integration • Data processing • Business logic • Sink integration • Analytics • …
  • 24. 25 Different data for different use cases • Database, Data Lake • Search • Real time, Near Real Time, Batch • Streaming, Request-Response • CQRS, Event Sourcing • Machine Learning There is no single MASTER DATA EVENT…
  • 25. 26 Agenda • Use Case • Architecture • Live Demo • Best Practices • Next steps
  • 27. 28 The HiveMQ Platform – Open Source and Enterprise-grade
  • 28. 29 Confluent Platform Operations and Security Development & Stream Processing Support,Services,Training&Partners Apache Kafka Security plugins | Role-Based Access Control Control Center | Replicator | Auto Data Balancer | Operator Connectors Clients | REST Proxy MQTT Proxy | Schema Registry KSQL Connect Continuous Commit Log Streams Complete Event Streaming Platform Mission-critical Reliability Freedom of Choice Datacenter Public Cloud Confluent Cloud Self-Managed Software Fully-Managed Service
  • 29. 30 Confluent Cloud Cloud-Native Confluent Platform Fully-Managed Service Available on the leading public clouds with mission-critical SLAs and consumption-based pricing. Serverless Kafka characteristics: Pay-as-you-go, elastic auto-scaling, abstracting infrastructure (topics not brokers) Spend your time on your applications!
  • 30. 31 Next steps… Try out the demo in 30 minutes: https://github.com/kaiwaehner/hivemq-mqtt-tensorflow-kafka-realtime-iot-machine-learning-training-inference http://bit.ly/kafka-mqtt-ml-demo Check out the documentation and blog posts • HiveMQ and Apache Kafka - Streaming IoT Data and MQTT Messages: https://www.hivemq.com/blog/streaming-iot-data-and-mqtt-messages-to-apache-kafka/ • Internet of Things (IoT) and Event Streaming at Scale with Apache Kafka and MQTT: https://www.confluent.io/blog/iot-with-kafka-connect-mqtt-and-rest-proxy Contact us for questions or any other feedback: • Website, Email, Slack, Phone, … • Dominik: [email protected] , Kai: [email protected]
  • 31. 32 Questions? Feedback? Kai Waehner Technology Evangelist [email protected] LinkedIn @KaiWaehner www.confluent.io www.kai-waehner.de Please contact us! Dominik Obermaier CTO HiveMQ [email protected] www.linkedin.com/in/dobermai www.hivemq.com www.twitter.com/dobermai