SlideShare a Scribd company logo
Introduction to Prometheus and Cortex
Bryan Boreham – @bboreham
July 2019
1
● See whether your system is operating as expected
○ Get alerted when it isn’t
● Drill down for troubleshooting and incident investigation
● View trends over time, to drive strategy
● Input into continuous improvement
Why do we Monitor?
2
Any quantity that tells you something about how
your system is operating.
- How much memory it is using
- How long the last operation took
- How many requests served today
What do we mean by “metrics” ?
3
How Prometheus works
4
Your
Apps
Disk
Exporters
Service
Discovery
Prometheus UI
5
Weave Cloud Dashboards
6
Grafana Dashboards
7
Prometheus metrics have a name and some labels:
http_requests_total{service="webserver",region="EU"} 136
● name is http_requests_total
● two labels, service and region, with values "webserver" and "EU"
● value is 136
Since this webserver program started, it has received 136 HTTP
requests.
Understanding metrics
8
Filter:
http_requests_total{service="webserver"}
Aggregate:
sum(http_requests_total)
Differentiate:
rate(http_requests_total[1m])
PromQL - Prometheus Query Language
9https://prometheus.io/docs/
Demo!
10
We don’t want to watch dashboards all day in case something goes
wrong - Prometheus can do that for us!
- alert: LowRequests
expr: rate(http_requests_total{service="webserver"}[1m]) < 0.1
for: 5m
Prometheus will execute this rule regularly.
AlertManager will send a notification via email, webhook, etc.
Automated Alerting
11
Cortex
12
Cortex is a time-series store built on Prometheus
- Horizontally scalable
- Highly Available
- Long-term storage
- Multi-tenant
Cortex is a CNCF Sandbox project
https://github.com/cortexproject/cortex
What is Cortex ?
13
Prometheus: basic operation
14
Your
Apps
Disk
“Scraping”
Exporters
Scaling Prometheus
15
Sharding Prometheus
16
Cortex high-level architecture
17
Sample Distribute Compress Store Query
Request rate
Error rate
Duration
RED is for Services
What to monitor: The RED model
18
Utilisation
Saturation
Errors
USE is for Resources
USE method
19
Questions?
20
● How Google Runs Production Systems
● SREs:
○ Have the skillset necessary to automate tasks
○ Do the same work as an operations team, but with
automation instead of manual labor
● SRE team responsible for latency, performance,
efficiency, change management, monitoring,
emergency response, and capacity planning
Site Reliability Engineering
21

More Related Content

PPTX
Prometheus design and philosophy
PDF
Cloud Monitoring with Prometheus
PDF
Getting Started Monitoring with Prometheus and Grafana
PDF
Prometheus Overview
PDF
Infrastructure & System Monitoring using Prometheus
PPT
Monitoring using Prometheus and Grafana
PPTX
Prometheus - Intro, CNCF, TSDB,PromQL,Grafana
PPTX
Prometheus in Practice: High Availability with Thanos (DevOpsDays Edinburgh 2...
Prometheus design and philosophy
Cloud Monitoring with Prometheus
Getting Started Monitoring with Prometheus and Grafana
Prometheus Overview
Infrastructure & System Monitoring using Prometheus
Monitoring using Prometheus and Grafana
Prometheus - Intro, CNCF, TSDB,PromQL,Grafana
Prometheus in Practice: High Availability with Thanos (DevOpsDays Edinburgh 2...

What's hot (20)

PPTX
An Introduction to Prometheus (GrafanaCon 2016)
PPTX
Monitoring With Prometheus
PDF
Distributed Tracing for Kafka with OpenTelemetry with Daniel Kim | Kafka Summ...
ODP
Monitoring With Prometheus
PDF
Prometheus
PDF
Explore your prometheus data in grafana - Promcon 2018
PDF
Monitoring with prometheus
PPTX
Kafka Tutorial - Introduction to Apache Kafka (Part 1)
PPTX
Grafana.pptx
PPTX
DevOps Tutorial For Beginners | DevOps Tutorial | DevOps Tools | DevOps Train...
PDF
Prometheus and Docker (Docker Galway, November 2015)
PDF
Prometheus - basics
PDF
Systems Monitoring with Prometheus (Devops Ireland April 2015)
PDF
Intro to open source observability with grafana, prometheus, loki, and tempo(...
PPTX
Prometheus for Monitoring Metrics (Fermilab 2018)
PPTX
Grafana Loki (Monitoring Tool) Presentation
PPTX
Prometheus and Grafana
ODP
Stream processing using Kafka
PPTX
Database CI/CD Pipeline
PPTX
Getting started with Docker
An Introduction to Prometheus (GrafanaCon 2016)
Monitoring With Prometheus
Distributed Tracing for Kafka with OpenTelemetry with Daniel Kim | Kafka Summ...
Monitoring With Prometheus
Prometheus
Explore your prometheus data in grafana - Promcon 2018
Monitoring with prometheus
Kafka Tutorial - Introduction to Apache Kafka (Part 1)
Grafana.pptx
DevOps Tutorial For Beginners | DevOps Tutorial | DevOps Tools | DevOps Train...
Prometheus and Docker (Docker Galway, November 2015)
Prometheus - basics
Systems Monitoring with Prometheus (Devops Ireland April 2015)
Intro to open source observability with grafana, prometheus, loki, and tempo(...
Prometheus for Monitoring Metrics (Fermilab 2018)
Grafana Loki (Monitoring Tool) Presentation
Prometheus and Grafana
Stream processing using Kafka
Database CI/CD Pipeline
Getting started with Docker
Ad

Similar to Introduction to Prometheus and Cortex (WOUG) (20)

PDF
Monitoring und Metriken im Wunderland
 
PDF
Prometheus Course from beginners to expert course
PDF
Prometheus course
PDF
Your data is in Prometheus, now what? (CurrencyFair Engineering Meetup, 2016)
PDF
Microservices and Prometheus (Microservices NYC 2016)
PPTX
Evolving Prometheus for the Cloud Native World (FOSDEM 2018)
PPTX
Monitoring What Matters: The Prometheus Approach to Whitebox Monitoring (Berl...
PDF
End to-end monitoring with the prometheus operator - Max Inden
PDF
Monitoring with prometheus at scale
PDF
Monitoring with prometheus at scale
PPTX
Prometheus - Open Source Forum Japan
PDF
Prometheus (Microsoft, 2016)
PDF
Monitoring Kubernetes with Prometheus (Kubernetes Ireland, 2016)
PPTX
Prometheus (Prometheus London, 2016)
PDF
DevOps Braga #15: Agentless monitoring with icinga and prometheus
PPTX
Prometheus (Monitorama 2016)
PPTX
MeetUp Monitoring with Prometheus and Grafana (September 2018)
PDF
An Introduction to Prometheus
PDF
Prometheus: A Next Generation Monitoring System (FOSDEM 2016)
PPTX
Prometheus for Monitoring Metrics (Percona Live Europe 2017)
Monitoring und Metriken im Wunderland
 
Prometheus Course from beginners to expert course
Prometheus course
Your data is in Prometheus, now what? (CurrencyFair Engineering Meetup, 2016)
Microservices and Prometheus (Microservices NYC 2016)
Evolving Prometheus for the Cloud Native World (FOSDEM 2018)
Monitoring What Matters: The Prometheus Approach to Whitebox Monitoring (Berl...
End to-end monitoring with the prometheus operator - Max Inden
Monitoring with prometheus at scale
Monitoring with prometheus at scale
Prometheus - Open Source Forum Japan
Prometheus (Microsoft, 2016)
Monitoring Kubernetes with Prometheus (Kubernetes Ireland, 2016)
Prometheus (Prometheus London, 2016)
DevOps Braga #15: Agentless monitoring with icinga and prometheus
Prometheus (Monitorama 2016)
MeetUp Monitoring with Prometheus and Grafana (September 2018)
An Introduction to Prometheus
Prometheus: A Next Generation Monitoring System (FOSDEM 2016)
Prometheus for Monitoring Metrics (Percona Live Europe 2017)
Ad

More from Weaveworks (20)

PDF
Weave AI Controllers (Weave GitOps Office Hours)
PDF
Flamingo: Expand ArgoCD with Flux (Office Hours)
PDF
Webinar: Capabilities, Confidence and Community – What Flux GA Means for You
PDF
Six Signs You Need Platform Engineering
PDF
SRE and GitOps for Building Robust Kubernetes Platforms.pdf
PDF
Webinar: End to End Security & Operations with Chainguard and Weave GitOps
PDF
Flux Beyond Git Harnessing the Power of OCI
PDF
Automated Provisioning, Management & Cost Control for Kubernetes Clusters
PDF
How to Avoid Kubernetes Multi-tenancy Catastrophes
PDF
Building internal developer platform with EKS and GitOps
PDF
GitOps Testing in Kubernetes with Flux and Testkube.pdf
PDF
Intro to GitOps with Weave GitOps, Flagger and Linkerd
PDF
Implementing Flux for Scale with Soft Multi-tenancy
PDF
Accelerating Hybrid Multistage Delivery with Weave GitOps on EKS
PDF
The Story of Flux Reaching Graduation in the CNCF
PDF
Shift Deployment Security Left with Weave GitOps & Upbound’s Universal Crossp...
PDF
Securing Your App Deployments with Tunnels, OIDC, RBAC, and Progressive Deliv...
PDF
Flux’s Security & Scalability with OCI & Helm Slides.pdf
PDF
Flux Security & Scalability using VS Code GitOps Extension
PDF
Deploying Stateful Applications Securely & Confidently with Ondat & Weave GitOps
Weave AI Controllers (Weave GitOps Office Hours)
Flamingo: Expand ArgoCD with Flux (Office Hours)
Webinar: Capabilities, Confidence and Community – What Flux GA Means for You
Six Signs You Need Platform Engineering
SRE and GitOps for Building Robust Kubernetes Platforms.pdf
Webinar: End to End Security & Operations with Chainguard and Weave GitOps
Flux Beyond Git Harnessing the Power of OCI
Automated Provisioning, Management & Cost Control for Kubernetes Clusters
How to Avoid Kubernetes Multi-tenancy Catastrophes
Building internal developer platform with EKS and GitOps
GitOps Testing in Kubernetes with Flux and Testkube.pdf
Intro to GitOps with Weave GitOps, Flagger and Linkerd
Implementing Flux for Scale with Soft Multi-tenancy
Accelerating Hybrid Multistage Delivery with Weave GitOps on EKS
The Story of Flux Reaching Graduation in the CNCF
Shift Deployment Security Left with Weave GitOps & Upbound’s Universal Crossp...
Securing Your App Deployments with Tunnels, OIDC, RBAC, and Progressive Deliv...
Flux’s Security & Scalability with OCI & Helm Slides.pdf
Flux Security & Scalability using VS Code GitOps Extension
Deploying Stateful Applications Securely & Confidently with Ondat & Weave GitOps

Recently uploaded (20)

PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Electronic commerce courselecture one. Pdf
PPTX
1. Introduction to Computer Programming.pptx
PPTX
Machine Learning_overview_presentation.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
A Presentation on Artificial Intelligence
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Spectroscopy.pptx food analysis technology
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Tartificialntelligence_presentation.pptx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
A comparative analysis of optical character recognition models for extracting...
MIND Revenue Release Quarter 2 2025 Press Release
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Electronic commerce courselecture one. Pdf
1. Introduction to Computer Programming.pptx
Machine Learning_overview_presentation.pptx
Empathic Computing: Creating Shared Understanding
NewMind AI Weekly Chronicles - August'25-Week II
Unlocking AI with Model Context Protocol (MCP)
A Presentation on Artificial Intelligence
Programs and apps: productivity, graphics, security and other tools
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Spectroscopy.pptx food analysis technology
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Encapsulation_ Review paper, used for researhc scholars
Tartificialntelligence_presentation.pptx
20250228 LYD VKU AI Blended-Learning.pptx
Assigned Numbers - 2025 - Bluetooth® Document
Spectral efficient network and resource selection model in 5G networks
A comparative analysis of optical character recognition models for extracting...

Introduction to Prometheus and Cortex (WOUG)