Stefan Bocutiu
@stepib
London Apache Kafka Meetup, Feb 2018
End-to-end topologies on Apache Kafka
An IoT use case
CONNECT ANALYSE REACT
About Today
Intros
IoT
High Level Architecture
Build the Pipeline
Q&A
Hi! It’s Stefan!

@stepib

https://github.com/stheppi
CONNECT ANALYSE REACT
Streaming Data Processing & Integration

KAFKA - KUBERNETES
EVOLVE SCALE BUILD
IoT initiative challenges
★ 94% face challenges collecting and analysing IoT data
★ 26+ Billion devices by 2020, Gartner
More Data, More Complexity
★ 2,500,000 trillion bytes produced daily
REAL TIME DATA INGESTION + IMMEDIATE QUERIES = ACTIONABLE / TIMELY INSIGHTS
IoT Data Flows
IoT data flow
sensor data → stream & process data → store data
E T L
[Architecture diagram: sensor data → MQTT → Kafka Connect (MQTT Connector) → Kafka Streams (Lenses SQL Processor) → Kafka Connect (InfluxDB Connector) → InfluxDB]
Synthetic Data
With Moquette: 1 broker, generated sensor data
InfluxDB in Docker
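The synthetic data step can be sketched as a small generator. This is an illustration only: the field names (`id`, `temperature`, `humidity`, `timestamp`), the value ranges and the sensor ids are assumptions, not taken from the talk, and publishing the payloads to the Moquette broker is deliberately left out.

```python
import json
import random


def make_reading(sensor_id, ts_ms, rng):
    """Build one hypothetical sensor reading as a plain dict."""
    return {
        "id": sensor_id,
        "temperature": round(rng.uniform(15.0, 30.0), 2),
        "humidity": round(rng.uniform(30.0, 70.0), 2),
        "timestamp": ts_ms,
    }


def generate(sensor_ids, start_ms, count, seed=42):
    """Yield `count` JSON payloads spaced 100 to 2000 ms apart (irregular intervals)."""
    rng = random.Random(seed)
    ts = start_ms
    for _ in range(count):
        ts += rng.randint(100, 2000)
        yield json.dumps(make_reading(rng.choice(sensor_ids), ts, rng))


if __name__ == "__main__":
    # Hypothetical sensor ids; each printed line is one message payload.
    for payload in generate(["SB01", "SB02", "SB03"], 1518000000000, 5):
        print(payload)
```

A fixed seed keeps the output repeatable, which is convenient when the same data set has to be replayed through the pipeline.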
IoT & Apache Kafka®
• High volumes, N devices & irregular intervals: unlimited streams of data, async transfers, producers & consumers operate decoupled
• Real-time analytics & microservices: Kafka Streams API, processing and analysing data in motion
• Multiple sources of data & long-term storage: Kafka Connect API, move data with pluggable, reusable & scalable connectors
An open source streaming framework with messaging semantics where records are key-value pairs.
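The key-value record model and the decoupling of producers from consumers can be illustrated with a toy in-memory log. This is not a Kafka client and uses no Kafka API: `ToyLog`, `produce` and `poll` are hypothetical names for this sketch, which models a single partition where each consumer group tracks its own read position.

```python
from collections import defaultdict


class ToyLog:
    """In-memory stand-in for a single topic partition (not a Kafka client)."""

    def __init__(self):
        self.records = []                # append-only list of (key, value) pairs
        self.offsets = defaultdict(int)  # next offset to read, per consumer group

    def produce(self, key, value):
        """Append a key-value record and return its offset."""
        self.records.append((key, value))
        return len(self.records) - 1

    def poll(self, group, max_records=10):
        """Return unread records for `group` and advance only that group's offset."""
        start = self.offsets[group]
        batch = self.records[start:start + max_records]
        self.offsets[group] = start + len(batch)
        return batch


if __name__ == "__main__":
    log = ToyLog()
    log.produce("SB01", {"temperature": 21.5})
    log.produce("SB02", {"temperature": 19.0})
    print(log.poll("analytics"))  # both records
    log.produce("SB01", {"temperature": 22.0})
    print(log.poll("analytics"))  # only the new record
    print(log.poll("storage"))    # a second group still reads all three
```

Because records stay in the log after being read, a slow or late consumer group does not hold back the producer or any other group.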
LENSES BOX

For Kafka Development
Single docker image with all services
Growing list of examples & synthetic data generators.
Stream Reactor 25+ connectors
Kafka CLI autocomplete.
REST and WebSocket endpoints
Redux JS Client
Testing Libraries
FREE!
All you need to start working with the Apache Kafka ecosystem
MQTT Source connector

• Message Queueing Telemetry Transport messaging protocol
• Optimised for connected devices and M2M communication
• Low bandwidth, high-latency, unreliable networks
• Plain or SSL connections
• Kafka Connect Query Language
INSERT INTO kafka_topic
SELECT * FROM /mqtt_source_topic
[WITHCONVERTER=`myclass`]
MQTT
Distributed, Fault tolerant, scalable
No code required
CLI / UI / Monitor
KCQL Support
Multiple Distributions
• Process data in motion with Lenses SQL
• Create & Deploy Lenses SQL Processors
• Native Kafka & Kubernetes support for scalability
• Visual Topology
• Monitor and manage processors via LENSES
Process & Analyse
Data in motion
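The slides do not show the Lenses SQL statement itself, so the following is only a sketch of the kind of computation such a processor might run: a per-sensor average over fixed (tumbling) time windows, as the topic name `sensor_data_avg` used later suggests. The record keys (`id`, `timestamp`, `temperature`) and the window size are assumptions, and the sketch works on a finished batch rather than on an unbounded stream.

```python
from collections import defaultdict


def tumbling_average(readings, window_ms, field="temperature"):
    """Average `field` per (sensor id, window start) over tumbling windows.

    `readings` is an iterable of dicts with hypothetical keys
    `id`, `timestamp` (epoch milliseconds) and `field`.
    """
    sums = defaultdict(lambda: [0.0, 0])  # (id, window_start) -> [sum, count]
    for r in readings:
        # Align the timestamp down to the start of its window.
        window_start = r["timestamp"] - (r["timestamp"] % window_ms)
        acc = sums[(r["id"], window_start)]
        acc[0] += r[field]
        acc[1] += 1
    return {key: total / count for key, (total, count) in sums.items()}


if __name__ == "__main__":
    data = [
        {"id": "SB01", "timestamp": 0, "temperature": 10.0},
        {"id": "SB01", "timestamp": 500, "temperature": 20.0},
        {"id": "SB01", "timestamp": 1000, "temperature": 30.0},
        {"id": "SB02", "timestamp": 999, "temperature": 5.0},
    ]
    for key, avg in sorted(tumbling_average(data, 1000).items()):
        print(key, avg)
```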
InfluxDB Sink connector

• AVRO & JSON Support
• Error handling (Noop, Retry, Throw)
• Field selection & field alias
• Multiple inserts supported
• Kafka Connect Query Language
• MEASUREMENTS, KCQL: INSERTS
• TAGS, KCQL: WITHTAG
• TIMESTAMPS, KCQL: WITHTIMESTAMP
• DURATION
Distributed, Fault tolerant, scalable
No code required
CLI / UI / Monitor
KCQL Support
Multiple Distributions
INSERT INTO sensorMeasure
SELECT * FROM sensor_data_avg
WITHTAG (id, poweredBy=Lenses)
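To make the effect of the statement above concrete, here is a sketch of how one record could end up as an InfluxDB line protocol point: `sensorMeasure` as the measurement, `id` taken from the record as a tag, and `poweredBy=Lenses` as a constant tag. The function names and the sample record are hypothetical, and dropping `id` from the field set is an assumption of this sketch; the connector's actual behaviour may differ.

```python
def _escape(text, chars):
    """Backslash-escape each character of `chars` that appears in `text`."""
    for ch in chars:
        text = text.replace(ch, "\\" + ch)
    return text


def _format_field(value):
    """Render a field value using line protocol type conventions."""
    if isinstance(value, bool):  # check bool first: bool is a subclass of int
        return "true" if value else "false"
    if isinstance(value, int):
        return f"{value}i"
    if isinstance(value, float):
        return repr(value)
    # Strings are double-quoted; escape backslashes first, then quotes.
    return '"' + _escape(str(value), '\\"') + '"'


def to_line_protocol(measurement, record, tag_fields, constant_tags, ts_ns):
    """Build one point: measurement,tags fields timestamp(ns).

    Assumes at least one non-tag field remains in `record`.
    """
    tags = {name: str(record[name]) for name in tag_fields}
    tags.update(constant_tags)
    fields = {k: v for k, v in record.items() if k not in tag_fields}
    tag_part = "".join(
        f",{_escape(k, ',= ')}={_escape(v, ',= ')}" for k, v in sorted(tags.items())
    )
    field_part = ",".join(
        f"{_escape(k, ',= ')}={_format_field(v)}" for k, v in fields.items()
    )
    return f"{_escape(measurement, ', ')}{tag_part} {field_part} {ts_ns}"


if __name__ == "__main__":
    row = {"id": "SB01", "temperature": 21.5, "humidity": 40}  # hypothetical record
    print(to_line_protocol("sensorMeasure", row, ["id"],
                           {"poweredBy": "Lenses"}, 1518000000000000000))
```

Tags are indexed in InfluxDB while fields are not, which is why identifiers such as the sensor id are usually better placed in tags.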
InfluxDB Sink connector

• Timestamp nanosecond
• KCQL
• No filter support
• No function support
• Throughput involves domain knowledge
• Kafka Connect rebalances
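On the nanosecond timestamp point: Kafka record timestamps are epoch milliseconds, while InfluxDB stores points with nanosecond precision by default, so a conversion is needed somewhere along the way. A minimal sketch, with hypothetical helper names, that stays in integer arithmetic to avoid floating-point rounding:

```python
NANOS_PER_MILLI = 1_000_000


def millis_to_nanos(ts_ms):
    """Convert an epoch timestamp in milliseconds to integer nanoseconds."""
    return ts_ms * NANOS_PER_MILLI


def nanos_to_millis(ts_ns):
    """Convert integer nanoseconds back to whole milliseconds (truncating)."""
    return ts_ns // NANOS_PER_MILLI


if __name__ == "__main__":
    print(millis_to_nanos(1518000000000))
```

Two readings from the same series that map to the same nanosecond overwrite each other in InfluxDB, so millisecond-resolution sources should carry a distinguishing tag when sensors can report within the same millisecond.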
visual data flow
Visual and Interactive Topology for a 360 view of your data
End-To-End Topology Management
Visualise your Kafka-based data flows in real time
Wrap it up!
CONNECT
ANALYSE
REACT
Real-time IoT pipelines in minutes!
MQTT
Contact us
www.landoop.com
info@landoop.com
Twitter @LandoopLtd
Github github.com/landoop
Gitter https://gitter.im/Landoop
Slack https://launchpass.com/landoop-community
Thank you!

MQTT. Kafka. InfluxDB. SQL. IoT Harmony. #tutorial by Stefan Bocutiu
