Deploying PostgreSQL
on Kubernetes
Jimmy Angelakos, Platform Architect, SolarWinds MSP
FOSDEM, 03/02/2019
Motivation
●
Service Oriented Architecture (SOA), including
Microservices, exemplified perfectly by Kubernetes
●
Kubernetes is here to stay
●
Fewer phone calls at 4 am?
●
Play around at home for free
●
Or get commercial support
●
Cloud Compute, Storage → Commodity
●
(Industrial-strength) Postgres is hard
●
You want Postgres → Commodity to your users
●
By no means an exhaustive list of solutions or
in-depth analysis but an attempt to demystify
What this is not
I. A demo of me fiddling with terminals and window tiling
techniques on the screen
II. Me typing in Kubernetes commands so you can see how
they are typed in
III. And… press ENTER. Ok, there, it worked. See?
IV. No wait. It didn’t. Let me fiddle some more.
What this is
Contents:
I. Kubernetes basics
II. Small scale
III. Helm Charts
IV. Crunchy Data Operator
V. Observations
I.
Kubernetes (k8s) basics
K8s basics – 1: K8s & Containers
●
Container: Lightweight, standalone, executable package
– Containerized software will run on any environment with no differences
– Resource efficient vs. VMs
– Platform independent vs. “It works on my machine ¯_( ツ )_/¯ ”
●
K8s is a container orchestrator
– Written in Go (Golang)
– Cloud Native Computing Foundation (CNCF)
– Scaling, load balancing, safely rolling out updates
– Abstracting infrastructure via API: Can use any cloud provider (or none)
– Resources: k8s API objects
– “Pets vs Cattle” debate
K8s basics – 2: Terms
●
Cluster
– Master node runs API server (our interface to the Cluster)
– Worker nodes run Kubelet and Pods
– Namespaces: Virtual clusters (resource quotas)
●
Kubelet
– Talks to Master node, monitors Pods
●
Pod
– A container or group of containers sharing the same execution environment
– Container coupling: sharing a volume or IPC
●
Volume
– Storage abstraction, many types
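A minimal sketch of container coupling inside one Pod, assuming two throwaway busybox containers that share an emptyDir Volume. The Pod name, container names, image tag and paths are illustrative assumptions, not part of the talk.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: shared-volume-demo        # hypothetical name
spec:
  volumes:
    - name: shared                # scratch Volume that lives as long as the Pod
      emptyDir: {}
  containers:
    - name: writer                # appends a timestamp every 5 seconds
      image: busybox:1.36
      command: ["sh", "-c", "while true; do date >> /data/out.log; sleep 5; done"]
      volumeMounts:
        - name: shared
          mountPath: /data
    - name: reader                # follows the same file through the shared Volume
      image: busybox:1.36
      command: ["sh", "-c", "touch /data/out.log; tail -f /data/out.log"]
      volumeMounts:
        - name: shared
          mountPath: /data
```

Both containers are scheduled together on the same Node and see the same /data directory, which is the "same execution environment" the Pod definition refers to.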
K8s basics – 3: Moar terms
●
Minikube
– Single-node k8s cluster in a VM – install VirtualBox and you’re good to go.
●
Prometheus
– Monitoring solution for k8s (also by CNCF, so described as “best fit”…)
●
Custom Resource Definitions
– Write them to extend k8s API at will
●
Operator pattern
– Custom domain-specific controllers that work with CRDs
– Configure & manage stateful applications for you
– No need for out-of-band automation
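To make the CRD idea concrete, here is a sketch of a Custom Resource Definition for a hypothetical "Database" resource. The group example.com, the kind and the single replicas field are all assumptions for illustration. Clusters from the time of this talk used apiextensions.k8s.io/v1beta1, while current clusters expect v1 as shown.

```yaml
apiVersion: apiextensions.k8s.io/v1
kind: CustomResourceDefinition
metadata:
  name: databases.example.com     # must be <plural>.<group>
spec:
  group: example.com              # hypothetical API group
  scope: Namespaced
  names:
    plural: databases
    singular: database
    kind: Database
  versions:
    - name: v1
      served: true
      storage: true
      schema:
        openAPIV3Schema:
          type: object
          properties:
            spec:
              type: object
              properties:
                replicas:         # illustrative field an operator could act on
                  type: integer
                  minimum: 1
```

An operator would then watch Database objects and create or adjust the underlying Pods, Services and Volumes to match each object's spec.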
K8s basics – 4: YAML files
●
Definitions
– YAML!
– kind of resource e.g. Pod
– metadata e.g. name, labels
– spec i.e. the desired state for the
resource
●
Kubectl
– CLI tool for interacting with Cluster
kubectl create -f my-pod.yaml
kubectl get pods
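As an illustration of kind, metadata and spec, a my-pod.yaml for the command above might look like the following sketch. The label, container name and image are assumptions.

```yaml
apiVersion: v1
kind: Pod                # kind of resource
metadata:
  name: my-pod           # metadata: name and labels
  labels:
    app: my-app
spec:                    # desired state for the resource
  containers:
    - name: web
      image: nginx:1.25  # illustrative image
      ports:
        - containerPort: 80
```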
K8s basics – 5: Services
●
Service
– Exposes Pods at a stable address (IP and DNS name)
– Entry point for a set of Pods performing the same function
– Targets Pods using a selector for the labels applied to Pods
– Can have Type: ClusterIP, NodePort, LoadBalancer, ExternalName
– Needs a way to route traffic from outside the Cluster
●
NodePort will open the same Port on every Node
●
LoadBalancer will provision an external LB from cloud provider
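A sketch of a NodePort Service, assuming Pods labelled app: my-app that listen on port 80. The Service name and the port numbers are illustrative assumptions; nodePort must fall in the cluster's NodePort range (30000-32767 by default).

```yaml
apiVersion: v1
kind: Service
metadata:
  name: my-service       # hypothetical name
spec:
  type: NodePort
  selector:
    app: my-app          # targets every Pod carrying this label
  ports:
    - port: 80           # port of the Service inside the Cluster
      targetPort: 80     # port the Pods listen on
      nodePort: 30080    # same port opened on every Node
```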
K8s basics – 6: Deployments
●
Deployment
– Automates upgrades of applications with zero downtime
– Enables fast rollbacks to previous state
kubectl rollout undo deployment my-app --to-revision=5
– Defines number of replicated Pods in spec
●
Manages ReplicaSets for you
– Can have Strategy: RollingUpdate, Recreate
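A minimal Deployment sketch with three replicas and a RollingUpdate strategy. The name my-app matches the rollback command above; the image and the surge settings are assumptions.

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: my-app
spec:
  replicas: 3                # number of replicated Pods
  strategy:
    type: RollingUpdate
    rollingUpdate:
      maxUnavailable: 0      # never drop below the desired replica count
      maxSurge: 1            # add at most one extra Pod during an update
  selector:
    matchLabels:
      app: my-app
  template:                  # Pod template managed through ReplicaSets
    metadata:
      labels:
        app: my-app
    spec:
      containers:
        - name: web
          image: nginx:1.25  # illustrative image
```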
K8s basics – 7: State
●
Stateless Applications
– Usually as a Deployment of Pod Replicas accessed via a Service
●
Stateful Applications
– StatefulSets
●
Stable storage
●
Stable network identifiers
●
Ordered deployment & scaling
●
Ordered RollingUpdates
K8s basics – 8: StatefulSets
●
spec
– Defines replicas in unique Pods (with stable network identity & storage)
– Defines storage in PersistentVolumes
●
Headless Service
– No load balancing, no cluster IP: self-registration or discovery possible
– Governs DNS subdomain of Pods: e.g. mypod-1.myservice.mynamespace
●
PersistentVolumes: Provisioned storage as a resource
●
PersistentVolumeClaim: A request for storage, consumes PV resources
●
Deletion
– Does not remove PersistentVolumes (for safety)
– Does not guarantee Pod termination (scale to zero before)
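The pieces above can be sketched as a headless Service plus a StatefulSet. Names follow the mypod/myservice example from the slide; the image tag, storage size, password value and PGDATA path are assumptions, and a real setup would keep the password in a Secret.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: myservice
spec:
  clusterIP: None              # headless: no load balancing, no cluster IP
  selector:
    app: mypod
  ports:
    - port: 5432
---
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: mypod
spec:
  serviceName: myservice       # governs the Pods' DNS subdomain
  replicas: 2                  # creates mypod-0 and mypod-1, in order
  selector:
    matchLabels:
      app: mypod
  template:
    metadata:
      labels:
        app: mypod
    spec:
      containers:
        - name: postgres
          image: postgres:11   # illustrative tag
          env:
            - name: POSTGRES_PASSWORD
              value: mypassword                      # placeholder only
            - name: PGDATA
              value: /var/lib/postgresql/data/pgdata # subdirectory of the mount
          volumeMounts:
            - name: data
              mountPath: /var/lib/postgresql/data
  volumeClaimTemplates:        # one PersistentVolumeClaim per Pod
    - metadata:
        name: data
      spec:
        accessModes: ["ReadWriteOnce"]
        resources:
          requests:
            storage: 1Gi
```

Note that this gives two independent Postgres instances with stable names and storage. Replication between them still has to be configured separately.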
II.
Small scale
Small scale – 1: The image
●
You need a PostgreSQL container image
– Roll your own
– Use an existing image
●
PostgreSQL Docker Community “Official image”
– https://github.com/docker-library/postgres
docker pull postgres
●
Bitnami PostgreSQL Docker image
– https://github.com/bitnami/bitnami-docker-postgresql
●
Crunchy Data containers
– https://github.com/CrunchyData/crunchy-containers
Small scale – 2: Deployment
●
Create a ConfigMap for the
configuration values →
●
Create a PersistentVolume and a
PersistentVolumeClaim
●
Create a Deployment for your
Container image & PV
●
Create a Service to expose the above.
Simple: NodePort
●
Connect to your database via exposed
port or kubectl port forwarding
apiVersion: v1
kind: ConfigMap
metadata:
  name: postgres-config
  labels:
    app: postgres
data:
  POSTGRES_DB: mydatabase
  POSTGRES_USER: myuser
  POSTGRES_PASSWORD: mypassword
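The remaining steps could be sketched as follows, assuming the postgres-config ConfigMap above and a cluster with a default storage provisioner (so the PersistentVolume is created from the claim rather than by hand). The resource names, image tag, storage size and PGDATA path are assumptions.

```yaml
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: postgres-pvc           # hypothetical name
spec:
  accessModes: ["ReadWriteOnce"]
  resources:
    requests:
      storage: 1Gi
---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: postgres
spec:
  replicas: 1
  strategy:
    type: Recreate             # avoid two Pods writing to the same volume
  selector:
    matchLabels:
      app: postgres
  template:
    metadata:
      labels:
        app: postgres
    spec:
      containers:
        - name: postgres
          image: postgres:11   # illustrative tag
          envFrom:
            - configMapRef:
                name: postgres-config   # POSTGRES_DB, _USER, _PASSWORD
          env:
            - name: PGDATA
              value: /var/lib/postgresql/data/pgdata
          ports:
            - containerPort: 5432
          volumeMounts:
            - name: data
              mountPath: /var/lib/postgresql/data
      volumes:
        - name: data
          persistentVolumeClaim:
            claimName: postgres-pvc
---
apiVersion: v1
kind: Service
metadata:
  name: postgres
spec:
  type: NodePort               # the simple option named above
  selector:
    app: postgres
  ports:
    - port: 5432
      targetPort: 5432
```

Outside a demo, the password would normally live in a Secret rather than a ConfigMap.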
III.
Helm Charts
Helm Charts – 1: Introduction
●
Helm
– A “package manager” for k8s. Helm is the client.
– Tiller is the server-side component installed in k8s
●
Charts
– Directories of (you guessed it) YAML files
– Describe a set of related k8s resources
– values.yaml lets you customise options and configuration
●
PostgreSQL use case
– One-stop installation for a set of replicated databases
– It makes sense!
Helm Charts – 2: PostgreSQL Chart
●
Contributed by Bitnami, upstreamed:
– https://github.com/helm/charts/tree/master/stable/postgresql
●
Default Docker image repo is Bitnami
●
Installation is as simple as:
helm install --name my-release -f values.yaml stable/postgresql
– A Release in this context is an installation, a deployment
●
Output will include some magic commands for getting the DB password and
connecting to the running instance
●
postgresql.conf or pg_hba.conf can be provided in files/ folder and will
be mounted as a ConfigMap (special Volume type for abstracting configuration)
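A values.yaml for the install command above might look like this sketch. The key names are assumed from the stable/postgresql chart as it stood in early 2019 and should be checked against the README of the chart version you install; the values themselves are placeholders.

```yaml
# values.yaml (key names are assumptions; verify against the chart README)
postgresqlUsername: myuser
postgresqlPassword: mypassword
postgresqlDatabase: mydatabase
persistence:
  enabled: true
  size: 8Gi          # size requested through the PersistentVolumeClaim
metrics:
  enabled: true      # start the Prometheus exporter sidecar
```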
NAME: my-release
LAST DEPLOYED: Fri Jan 25 15:20:58 2019
NAMESPACE: my-namespace
STATUS: DEPLOYED

RESOURCES:
==> v1/Secret
NAME                   TYPE    DATA  AGE
my-release-postgresql  Opaque  1     3s

==> v1/ConfigMap
NAME                                DATA  AGE
my-release-postgresql-init-scripts  1     3s

==> v1/Service
NAME                            TYPE       CLUSTER-IP    EXTERNAL-IP  PORT(S)   AGE
my-release-postgresql-headless  ClusterIP  None          <none>       5432/TCP  3s
my-release-postgresql           ClusterIP  10.101.211.6  <none>       5432/TCP  3s

==> v1beta2/StatefulSet
NAME                   DESIRED  CURRENT  AGE
my-release-postgresql  1        1        3s

==> v1/Pod(related)
NAME                     READY  STATUS    RESTARTS  AGE
my-release-postgresql-0  0/1    Init:0/1  0         3s
NOTES:
** Please be patient while the chart is being deployed **

PostgreSQL can be accessed via port 5432 on the following DNS name from within your cluster:

    my-release-postgresql.my-namespace.svc.cluster.local

To get the password for "postgres" run:

    export POSTGRESQL_PASSWORD=$(kubectl get secret --namespace my-namespace my-release-postgresql -o jsonpath="{.data.postgresql-password}" | base64 --decode)

To connect to your database run the following command:

    kubectl run my-release-postgresql-client --rm --tty -i --restart='Never' --namespace my-namespace --image bitnami/postgresql --env="PGPASSWORD=$POSTGRESQL_PASSWORD" --command -- psql --host my-release-postgresql -U postgres

To connect to your database from outside the cluster execute the following commands:

    kubectl port-forward --namespace my-namespace svc/my-release-postgresql 5432:5432 &
    psql --host 127.0.0.1 -U postgres
Helm Charts – 3: Internals
●
Defaults create:
– A StatefulSet with 1 Replica (1 Pod) running Postgres from the Docker image
– A Headless Service and a Service
– A PersistentVolumeClaim from the configured storage provisioner
●
Can be configured to:
– Load custom Postgres initialisation scripts as ConfigMaps from files/
– Start a metrics exporter to Prometheus:
●
https://github.com/wrouesnel/postgres_exporter
●
Export e.g. pg_stat_activity, pg_stat_replication or custom metrics
queries
Helm Charts – 4: Patroni Chart
●
For HA you can use the Helm Incubator Patroni Chart:
– https://github.com/helm/charts/tree/master/incubator/patroni
●
This, too, uses StatefulSets
●
Default installation deploys a 5-node Spilo cluster
– Zalando’s Spilo is a bundled Postgres & Patroni image
●
Installation
helm repo add incubator https://kubernetes-charts-incubator.storage.googleapis.com/
helm dependency update
helm install --name my-release incubator/patroni
IV.
Crunchy Operator
Crunchy Operator – 1
●
Crunchy Data PostgreSQL Operator
– https://github.com/CrunchyData/postgres-operator
●
Deploy Postgres with streaming replication & scaling
●
Add pgpool, pgbouncer, and metrics sidecars
●
Administer SQL policies, users, passwords
●
Assign labels to resources
●
Minor version upgrades
●
Perform backups and restores (or schedule them)
Crunchy Operator – 2
Quickstart:
●
git clone the GitHub repo, git checkout <tag>
●
source examples/envs.sh
●
make setupnamespace creates a “demo” namespace
●
conf/postgres-operator/pgo.yaml holds the configuration
●
make installrbac creates RBAC resources and keys
●
make deployoperator
Crunchy Operator – 3: pgo
●
pgo is the CLI to interact with the operator
pgo create cluster my-cluster (--metrics if you want)
pgo show cluster my-cluster
pgo scale my-cluster --replica-count=2
pgo create pgbouncer my-cluster or
pgo create pgpool my-cluster to add
●
Backups
pgo create cluster my-cluster --pgbackrest
pgo backup my-cluster --backup-type=pgbackrest (or pgbasebackup)
pgo restore my-cluster
●
Manual failovers
pgo failover my-cluster --query (to get failover targets)
pgo failover my-cluster --target=my-failover-target-1
V.
Observations
Observations – 1: Deploying by hand
●
Good for rapid development
●
Offers isolation equivalent to VMs
●
Resource saving compared to VMs
●
Doesn’t offer many Cloud Native advantages
●
Production usage?
– Hard to maintain at scale unless you have an army of DBAs
Observations – 2: Helm Charts
●
Good for one-time deployments
●
Very clean and transparent
●
Major version upgrades?
●
Slave replicas – no failover unless you set it up explicitly
●
Flexibility to carry on using your existing solutions
●
Can be used by namespace-admin or plain user with
permissions
Observations – 3: Crunchy Operator
●
All-in-one solution, Postgres as an application
●
Makes many tasks easy via CLI and automates others
●
You need RBAC and cluster-admin permissions for creation of
CRDs
– Kubernetes does not support namespaced CRDs :(
– https://github.com/kubernetes/kubernetes/issues/65551
●
Under heavy development – perhaps not ideal for production?
– But so is Kubernetes :/
Observations – 4
●
Hard problem
– (Plain) Postgres cluster with multiple write nodes
– Multi-master is not always the solution
– Can leverage aforementioned solutions with 2ndQuadrant’s
pglogical for granularity
●
https://www.2ndquadrant.com/en/resources/pglogical/
●
Doesn’t even need a custom image, can be added as post-install hook
Alternatives?
●
DBaaS/PaaS like Heroku ($$$)
●
Managed cloudy DBs like EnterpriseDB’s (AWS) Postgres
●
Evil ;)
– Amazon RDS (/Aurora?) PostgreSQL
– Google Cloud SQL PostgreSQL
– Azure Database for PostgreSQL
●
Define as Services, connect to Endpoints
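One way to define an external database as a Service is a selector-less Service paired with an Endpoints object of the same name. In this sketch the name external-postgres and the address 192.0.2.10 (a documentation-range IP) are placeholders, not values from the talk.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: external-postgres     # hypothetical name used by in-cluster clients
spec:
  ports:
    - port: 5432              # no selector: endpoints are supplied by hand
---
apiVersion: v1
kind: Endpoints
metadata:
  name: external-postgres     # must match the Service name
subsets:
  - addresses:
      - ip: 192.0.2.10        # placeholder address of the managed database
    ports:
      - port: 5432
```

Applications inside the Cluster then connect to external-postgres:5432 as if the database were running in a Pod. For a database reachable only by hostname, a Service of Type ExternalName is the usual alternative.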
Thank you =)
Twitter: @vyruss
Photo: Forth Bridge, Firth of Forth, Edinburgh

More Related Content

What's hot (20)

PostgreSQL High Availability in a Containerized World
PostgreSQL High Availability in a Containerized World
Jignesh Shah
 
Query and audit logging in cassandra
Query and audit logging in cassandra
Vinay Kumar Chella
 
Transparent Data Encryption in PostgreSQL and Integration with Key Management...
Transparent Data Encryption in PostgreSQL and Integration with Key Management...
Masahiko Sawada
 
Monitor Apache Spark 3 on Kubernetes using Metrics and Plugins
Monitor Apache Spark 3 on Kubernetes using Metrics and Plugins
Databricks
 
Kubernetes
Kubernetes
erialc_w
 
Kubernetes - Security Journey
Kubernetes - Security Journey
Jerry Jalava
 
Introduction to kubernetes
Introduction to kubernetes
Gabriel Carro
 
How Kubernetes helps Devops
How Kubernetes helps Devops
Sreenivas Makam
 
Hadoop Backup and Disaster Recovery
Hadoop Backup and Disaster Recovery
Cloudera, Inc.
 
Introducing Dapr.io - the open source personal assistant to microservices and...
Introducing Dapr.io - the open source personal assistant to microservices and...
Lucas Jellema
 
MariaDB MaxScale
MariaDB MaxScale
MariaDB plc
 
Linux Systems Performance 2016
Linux Systems Performance 2016
Brendan Gregg
 
Building IAM for OpenStack
Building IAM for OpenStack
Steve Martinelli
 
Amazon EKS - security best practices - 2022
Amazon EKS - security best practices - 2022
Jean-François LOMBARDO
 
My sql failover test using orchestrator
My sql failover test using orchestrator
YoungHeon (Roy) Kim
 
Kubernetes presentation
Kubernetes presentation
GauranG Bajpai
 
Building large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudi
Bill Liu
 
Gitlab, GitOps & ArgoCD
Gitlab, GitOps & ArgoCD
Haggai Philip Zagury
 
Hands-On Introduction to Kubernetes at LISA17
Hands-On Introduction to Kubernetes at LISA17
Ryan Jarvinen
 
Kubernetes Webinar - Using ConfigMaps & Secrets
Kubernetes Webinar - Using ConfigMaps & Secrets
Janakiram MSV
 
PostgreSQL High Availability in a Containerized World
PostgreSQL High Availability in a Containerized World
Jignesh Shah
 
Query and audit logging in cassandra
Query and audit logging in cassandra
Vinay Kumar Chella
 
Transparent Data Encryption in PostgreSQL and Integration with Key Management...
Transparent Data Encryption in PostgreSQL and Integration with Key Management...
Masahiko Sawada
 
Monitor Apache Spark 3 on Kubernetes using Metrics and Plugins
Monitor Apache Spark 3 on Kubernetes using Metrics and Plugins
Databricks
 
Kubernetes
Kubernetes
erialc_w
 
Kubernetes - Security Journey
Kubernetes - Security Journey
Jerry Jalava
 
Introduction to kubernetes
Introduction to kubernetes
Gabriel Carro
 
How Kubernetes helps Devops
How Kubernetes helps Devops
Sreenivas Makam
 
Hadoop Backup and Disaster Recovery
Hadoop Backup and Disaster Recovery
Cloudera, Inc.
 
Introducing Dapr.io - the open source personal assistant to microservices and...
Introducing Dapr.io - the open source personal assistant to microservices and...
Lucas Jellema
 
MariaDB MaxScale
MariaDB MaxScale
MariaDB plc
 
Linux Systems Performance 2016
Linux Systems Performance 2016
Brendan Gregg
 
Building IAM for OpenStack
Building IAM for OpenStack
Steve Martinelli
 
Amazon EKS - security best practices - 2022
Amazon EKS - security best practices - 2022
Jean-François LOMBARDO
 
My sql failover test using orchestrator
My sql failover test using orchestrator
YoungHeon (Roy) Kim
 
Kubernetes presentation
Kubernetes presentation
GauranG Bajpai
 
Building large scale transactional data lake using apache hudi
Building large scale transactional data lake using apache hudi
Bill Liu
 
Hands-On Introduction to Kubernetes at LISA17
Hands-On Introduction to Kubernetes at LISA17
Ryan Jarvinen
 
Kubernetes Webinar - Using ConfigMaps & Secrets
Kubernetes Webinar - Using ConfigMaps & Secrets
Janakiram MSV
 

Similar to Deploying PostgreSQL on Kubernetes (20)

Kubernetes 101
Kubernetes 101
Stanislav Pogrebnyak
 
Introduction to Kubernetes
Introduction to Kubernetes
Vishal Biyani
 
PGConf APAC 2018 - Patroni: Kubernetes-native PostgreSQL companion
PGConf APAC 2018 - Patroni: Kubernetes-native PostgreSQL companion
PGConf APAC
 
Kubernetes Internals
Kubernetes Internals
Shimi Bandiel
 
Using PostgreSQL With Docker & Kubernetes - July 2018
Using PostgreSQL With Docker & Kubernetes - July 2018
Jonathan Katz
 
Kubernetes: My BFF
Kubernetes: My BFF
Jonathan Yu
 
Kubernetes Intro
Kubernetes Intro
Antonio Ojea Garcia
 
From CoreOS to Kubernetes and Concourse CI
From CoreOS to Kubernetes and Concourse CI
Denis Izmaylov
 
A DevOps guide to Kubernetes
A DevOps guide to Kubernetes
Paul Czarkowski
 
Cluster management with Kubernetes
Cluster management with Kubernetes
Satnam Singh
 
Kubernetes Introduction
Kubernetes Introduction
Miloš Zubal
 
Kubernetes
Kubernetes
Diego Pacheco
 
Kash Kubernetified
Kash Kubernetified
Michael Wojcikiewicz
 
DEVOPS UNIT 4 docker and services commands
DEVOPS UNIT 4 docker and services commands
billuandtanya
 
DevOps in AWS with Kubernetes
DevOps in AWS with Kubernetes
Oleg Chunikhin
 
Kubernetes for Beginners
Kubernetes for Beginners
DigitalOcean
 
Cloud Native PostgreSQL - APJ
Cloud Native PostgreSQL - APJ
EDB
 
Kubernetes for the PHP developer
Kubernetes for the PHP developer
Paul Czarkowski
 
LISA2017 Kubernetes: Hit the Ground Running
LISA2017 Kubernetes: Hit the Ground Running
Chris McEniry
 
PGConf.ASIA 2019 Bali - Building PostgreSQL as a Service with Kubernetes - Ta...
PGConf.ASIA 2019 Bali - Building PostgreSQL as a Service with Kubernetes - Ta...
Equnix Business Solutions
 
Introduction to Kubernetes
Introduction to Kubernetes
Vishal Biyani
 
PGConf APAC 2018 - Patroni: Kubernetes-native PostgreSQL companion
PGConf APAC 2018 - Patroni: Kubernetes-native PostgreSQL companion
PGConf APAC
 
Kubernetes Internals
Kubernetes Internals
Shimi Bandiel
 
Using PostgreSQL With Docker & Kubernetes - July 2018
Using PostgreSQL With Docker & Kubernetes - July 2018
Jonathan Katz
 
Kubernetes: My BFF
Kubernetes: My BFF
Jonathan Yu
 
From CoreOS to Kubernetes and Concourse CI
From CoreOS to Kubernetes and Concourse CI
Denis Izmaylov
 
A DevOps guide to Kubernetes
A DevOps guide to Kubernetes
Paul Czarkowski
 
Cluster management with Kubernetes
Cluster management with Kubernetes
Satnam Singh
 
Kubernetes Introduction
Kubernetes Introduction
Miloš Zubal
 
DEVOPS UNIT 4 docker and services commands
DEVOPS UNIT 4 docker and services commands
billuandtanya
 
DevOps in AWS with Kubernetes
DevOps in AWS with Kubernetes
Oleg Chunikhin
 
Kubernetes for Beginners
Kubernetes for Beginners
DigitalOcean
 
Cloud Native PostgreSQL - APJ
Cloud Native PostgreSQL - APJ
EDB
 
Kubernetes for the PHP developer
Kubernetes for the PHP developer
Paul Czarkowski
 
LISA2017 Kubernetes: Hit the Ground Running
LISA2017 Kubernetes: Hit the Ground Running
Chris McEniry
 
PGConf.ASIA 2019 Bali - Building PostgreSQL as a Service with Kubernetes - Ta...
PGConf.ASIA 2019 Bali - Building PostgreSQL as a Service with Kubernetes - Ta...
Equnix Business Solutions
 
Ad

More from Jimmy Angelakos (9)

Don't Do This [FOSDEM 2023]
Don't Do This [FOSDEM 2023]
Jimmy Angelakos
 
Slow things down to make them go faster [FOSDEM 2022]
Slow things down to make them go faster [FOSDEM 2022]
Jimmy Angelakos
 
Practical Partitioning in Production with Postgres
Practical Partitioning in Production with Postgres
Jimmy Angelakos
 
Changing your huge table's data types in production
Changing your huge table's data types in production
Jimmy Angelakos
 
The State of (Full) Text Search in PostgreSQL 12
The State of (Full) Text Search in PostgreSQL 12
Jimmy Angelakos
 
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Jimmy Angelakos
 
Using PostgreSQL with Bibliographic Data
Using PostgreSQL with Bibliographic Data
Jimmy Angelakos
 
Eισαγωγή στην PostgreSQL - Χρήση σε επιχειρησιακό περιβάλλον
Eισαγωγή στην PostgreSQL - Χρήση σε επιχειρησιακό περιβάλλον
Jimmy Angelakos
 
PostgreSQL: Mέθοδοι για Data Replication
PostgreSQL: Mέθοδοι για Data Replication
Jimmy Angelakos
 
Don't Do This [FOSDEM 2023]
Don't Do This [FOSDEM 2023]
Jimmy Angelakos
 
Slow things down to make them go faster [FOSDEM 2022]
Slow things down to make them go faster [FOSDEM 2022]
Jimmy Angelakos
 
Practical Partitioning in Production with Postgres
Practical Partitioning in Production with Postgres
Jimmy Angelakos
 
Changing your huge table's data types in production
Changing your huge table's data types in production
Jimmy Angelakos
 
The State of (Full) Text Search in PostgreSQL 12
The State of (Full) Text Search in PostgreSQL 12
Jimmy Angelakos
 
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Jimmy Angelakos
 
Using PostgreSQL with Bibliographic Data
Using PostgreSQL with Bibliographic Data
Jimmy Angelakos
 
Eισαγωγή στην PostgreSQL - Χρήση σε επιχειρησιακό περιβάλλον
Eισαγωγή στην PostgreSQL - Χρήση σε επιχειρησιακό περιβάλλον
Jimmy Angelakos
 
PostgreSQL: Mέθοδοι για Data Replication
PostgreSQL: Mέθοδοι για Data Replication
Jimmy Angelakos
 
Ad

Recently uploaded (20)

Wondershare PDFelement Pro 11.4.20.3548 Crack Free Download
Wondershare PDFelement Pro 11.4.20.3548 Crack Free Download
Puppy jhon
 
Software Engineering Process, Notation & Tools Introduction - Part 4
Software Engineering Process, Notation & Tools Introduction - Part 4
Gaurav Sharma
 
How the US Navy Approaches DevSecOps with Raise 2.0
How the US Navy Approaches DevSecOps with Raise 2.0
Anchore
 
Transmission Media. (Computer Networks)
Transmission Media. (Computer Networks)
S Pranav (Deepu)
 
Porting Qt 5 QML Modules to Qt 6 Webinar
Porting Qt 5 QML Modules to Qt 6 Webinar
ICS
 
Smart Financial Solutions: Money Lender Software, Daily Pigmy & Personal Loan...
Smart Financial Solutions: Money Lender Software, Daily Pigmy & Personal Loan...
Intelli grow
 
Microsoft Business-230T01A-ENU-PowerPoint_01.pptx
Microsoft Business-230T01A-ENU-PowerPoint_01.pptx
soulamaabdoulaye128
 
FME as an Orchestration Tool - Peak of Data & AI 2025
FME as an Orchestration Tool - Peak of Data & AI 2025
Safe Software
 
Generative Artificial Intelligence and its Applications
Generative Artificial Intelligence and its Applications
SandeepKS52
 
Migrating to Azure Cosmos DB the Right Way
Migrating to Azure Cosmos DB the Right Way
Alexander (Alex) Komyagin
 
Automated Migration of ESRI Geodatabases Using XML Control Files and FME
Automated Migration of ESRI Geodatabases Using XML Control Files and FME
Safe Software
 
Milwaukee Marketo User Group June 2025 - Optimize and Enhance Efficiency - Sm...
Milwaukee Marketo User Group June 2025 - Optimize and Enhance Efficiency - Sm...
BradBedford3
 
Advanced Token Development - Decentralized Innovation
Advanced Token Development - Decentralized Innovation
arohisinghas720
 
SAP PM Module Level-IV Training Complete.ppt
SAP PM Module Level-IV Training Complete.ppt
MuhammadShaheryar36
 
Artificial Intelligence Applications Across Industries
Artificial Intelligence Applications Across Industries
SandeepKS52
 
UPDASP a project coordination unit ......
UPDASP a project coordination unit ......
withrj1
 
Reimagining Software Development and DevOps with Agentic AI
Reimagining Software Development and DevOps with Agentic AI
Maxim Salnikov
 
Who will create the languages of the future?
Who will create the languages of the future?
Jordi Cabot
 
AI-Powered Compliance Solutions for Global Regulations | Certivo
AI-Powered Compliance Solutions for Global Regulations | Certivo
certivoai
 
IMAGE CLASSIFICATION USING CONVOLUTIONAL NEURAL NETWORK.P.pptx
IMAGE CLASSIFICATION USING CONVOLUTIONAL NEURAL NETWORK.P.pptx
usmanch7829
 
Wondershare PDFelement Pro 11.4.20.3548 Crack Free Download
Wondershare PDFelement Pro 11.4.20.3548 Crack Free Download
Puppy jhon
 
Software Engineering Process, Notation & Tools Introduction - Part 4
Software Engineering Process, Notation & Tools Introduction - Part 4
Gaurav Sharma
 
How the US Navy Approaches DevSecOps with Raise 2.0
How the US Navy Approaches DevSecOps with Raise 2.0
Anchore
 
Transmission Media. (Computer Networks)
Transmission Media. (Computer Networks)
S Pranav (Deepu)
 
Porting Qt 5 QML Modules to Qt 6 Webinar
Porting Qt 5 QML Modules to Qt 6 Webinar
ICS
 
Smart Financial Solutions: Money Lender Software, Daily Pigmy & Personal Loan...
Smart Financial Solutions: Money Lender Software, Daily Pigmy & Personal Loan...
Intelli grow
 
Microsoft Business-230T01A-ENU-PowerPoint_01.pptx
Microsoft Business-230T01A-ENU-PowerPoint_01.pptx
soulamaabdoulaye128
 
FME as an Orchestration Tool - Peak of Data & AI 2025
FME as an Orchestration Tool - Peak of Data & AI 2025
Safe Software
 
Generative Artificial Intelligence and its Applications
Generative Artificial Intelligence and its Applications
SandeepKS52
 
Automated Migration of ESRI Geodatabases Using XML Control Files and FME
Automated Migration of ESRI Geodatabases Using XML Control Files and FME
Safe Software
 
Milwaukee Marketo User Group June 2025 - Optimize and Enhance Efficiency - Sm...
Milwaukee Marketo User Group June 2025 - Optimize and Enhance Efficiency - Sm...
BradBedford3
 
Advanced Token Development - Decentralized Innovation
Advanced Token Development - Decentralized Innovation
arohisinghas720
 
SAP PM Module Level-IV Training Complete.ppt
SAP PM Module Level-IV Training Complete.ppt
MuhammadShaheryar36
 
Artificial Intelligence Applications Across Industries
Artificial Intelligence Applications Across Industries
SandeepKS52
 
UPDASP a project coordination unit ......
UPDASP a project coordination unit ......
withrj1
 
Reimagining Software Development and DevOps with Agentic AI
Reimagining Software Development and DevOps with Agentic AI
Maxim Salnikov
 
Who will create the languages of the future?
Who will create the languages of the future?
Jordi Cabot
 
AI-Powered Compliance Solutions for Global Regulations | Certivo
AI-Powered Compliance Solutions for Global Regulations | Certivo
certivoai
 
IMAGE CLASSIFICATION USING CONVOLUTIONAL NEURAL NETWORK.P.pptx
IMAGE CLASSIFICATION USING CONVOLUTIONAL NEURAL NETWORK.P.pptx
usmanch7829
 

Deploying PostgreSQL on Kubernetes

  • 1. Deploying PostgreSQL on Kubernetes Jimmy Angelakos FOSDEM Platform Architect 03/02/2019 SolarWinds MSP
  • 2. Motivation ● Service Oriented Architecture (SOA), including Micro– , exemplified perfectly by Kubernetes ● Kubernetes is here to stay ● Fewer phonecalls at 4 am? ● Play around at home for free ● Or get commercial support ● Cloud Compute, Storage → Commodity ● (Industrial-strength) Postgres is hard ● You want Postgres → Commodity to your users ● By no means an exhaustive list of solutions or in-depth analysis but an attempt to demystify
  • 3. What this is not I. A demo of me fiddling with terminals and window tiling techniques on the screen II. Me typing in Kubernetes commands so you can see how they are typed in III. And… press ENTER. Ok, there, it worked. See? IV. No wait. It didn’t. Let me fiddle some more.
  • 4. What this is Contents: I. Kubernetes basics II. Small scale III. Helm Charts IV. Crunchy Data Operator V. Observations
  • 6. K8s basics – 1: K8s & Containers ● Container: Lightweight, standalone, executable package – Containerized software will run on any environment with no differences – Resource efficient vs. VMs – Platform independent vs. “It works on my machine ¯_( ツ )_/¯ ” ● K8s is a container orchestrator – Written in Go (Golang) – Cloud Native Computing Foundation (CNCF) – Scaling, load balancing, safely rolling out updates – Abstracting infrastructure via API: Can use any cloud provider (or none) – Resources: k8s API objects – “Pets vs Cattle” debate
  • 7. K8s basics – 2: Terms ● Cluster – Master node runs API server (our interface to the Cluster) – Worker nodes run Kubelet and Pods – Namespaces: Virtual clusters (resource quotas) ● Kubelet – Talks to Master node, monitors Pods ● Pod – A container or group of containers sharing the same execution environment – Container coupling: sharing a volume or IPC ● Volume – Storage abstraction, many types
  • 8. K8s basics – 3: Moar terms ● Minikube – Single-node k8s cluster in a VM – install VirtualBox and you’re good to go. ● Prometheus – Monitoring solution for k8s (also by CNCF, so described as “best fit”…) ● Custom Resource Definitions – Write them to extend k8s API at will ● Operator pattern – Custom domain-specific controllers that work with CRDs – Configure & manage stateful applications for you – No need for out-of-band automation
  • 9. K8s basics – 4: YAML files ● Definitions – YAML! – kind of resource e.g. Pod – metadata e.g. name, labels – spec i.e. the desired state for the resource ● Kubectl – CLI tool for interacting with Cluster kubectl create -f my-pod.yaml kubectl get pods
  • 10. K8s basics – 5: Services ● Service – Exposes Pods externally via URL – Entry point for a set of Pods performing the same function – Targets Pods using a selector for the labels applied to Pods – Can have Type: ClusterIP, NodePort, LoadBalancer, ExternalName – Needs a way to route traffic from outside the Cluster ● NodePort will assign the same Port from each Node ● LoadBalancer will provision an external LB from cloud provider
  • 11. K8s basics – 6: Deployments ● Deployment – Automates upgrades of applications with zero downtime – Enables fast rollbacks to previous state kubectl rollout undo deployment my-app --to-revision=5 – Defines number of replicated Pods in spec ● Manages ReplicaSets for you – Can have Strategy: RollingUpdate, Recreate
  • 12. K8s basics – 7: State ● Stateless Applications – Usually as a Deployment of Pod Replicas accessed via a Service ● Stateful Applications – StatefulSets ● Stable storage ● Stable network identifiers ● Ordered deployment & scaling ● Ordered RollingUpdates
  • 13. K8s basics – 8: StatefulSets ● spec – Defines replicas in unique Pods (with stable network identity & storage) – Defines storage in PersistentVolumes ● Headless Service – No load balancing, no cluster IP: self-registration or discovery possible – Governs DNS subdomain of Pods: e.g. mypod-1.myservice.mynamespace ● PersistentVolumes: Provisioned storage as a resource ● PersistentVolumeClaim: A request for storage, consumes PV resources ● Deletion – Does not remove PersistentVolumes (for safety) – Does not guarantee Pod termination (scale to zero before)
  • 15. Small scale – 1: The image ● You need a PostgreSQL container image – Roll your own – Use an existing image ● PostgreSQL Docker Community “Official image” – https://github.com/docker-library/postgres docker pull postgres ● Bitnami PostgreSQL Docker image – https://github.com/bitnami/bitnami-docker-postgresql ● Crunchy Data containers – https://github.com/CrunchyData/crunchy-containers
  • 16. Small scale – 2: Deployment ● Create a ConfigMap for the configuration values → ● Create a PersistentVolume and a PersistentVolumeClaim ● Create a Deployment for your Container image & PV ● Create a Service to expose the above. Simple: NodePort ● Connect to your database via exposed port or kubectl port forwarding apiVersion: v1 kind: ConfigMap metadata: name: postgres-config labels: app: postgres data: POSTGRES_DB: mydatabase POSTGRES_USER: myuser POSTGRES_PASSWORD: mypassword
  • 18. Helm Charts – 1: Introduction ● Helm – A “package manager” for k8s. Helm is the client. – Tiller is the server-side component installed in k8s ● Charts – Directories of (you guessed it) YAML files – Describe a set of related k8s resources – values.yaml lets you customise options and configuration ● PostgreSQL use case – One-stop installation for a set of replicated databases – It makes sense!
  • 19. Helm Charts – 2: PostgreSQL Chart ● Contributed by Bitnami, upstreamed: – https://github.com/helm/charts/tree/master/stable/postgresql ● Default Docker image repo is Bitnami ● Installation is as simple as: helm install --name my-release -f values.yaml stable/postgresql – A Release in this context is an installation, a deployment ● Output will include some magic commands for getting the DB password and connecting to the running instance ● postgresql.conf or pg_hba.conf can be provided in files/ folder and will be mounted as a ConfigMap (special Volume type for abstracting configuration)
  • 20. Output of helm install:
      NAME: my-release
      LAST DEPLOYED: Fri Jan 25 15:20:58 2019
      NAMESPACE: my-namespace
      STATUS: DEPLOYED

      RESOURCES:
      ==> v1/Secret
      NAME                   TYPE    DATA  AGE
      my-release-postgresql  Opaque  1     3s

      ==> v1/ConfigMap
      NAME                                DATA  AGE
      my-release-postgresql-init-scripts  1     3s

      ==> v1/Service
      NAME                            TYPE       CLUSTER-IP    EXTERNAL-IP  PORT(S)   AGE
      my-release-postgresql-headless  ClusterIP  None          <none>       5432/TCP  3s
      my-release-postgresql           ClusterIP  10.101.211.6  <none>       5432/TCP  3s

      ==> v1beta2/StatefulSet
      NAME                   DESIRED  CURRENT  AGE
      my-release-postgresql  1        1        3s

      ==> v1/Pod(related)
      NAME                     READY  STATUS    RESTARTS  AGE
      my-release-postgresql-0  0/1    Init:0/1  0         3s
  • 21. NOTES:
      ** Please be patient while the chart is being deployed **

      PostgreSQL can be accessed via port 5432 on the following DNS name from within your cluster:
        my-release-postgresql.my-namespace.svc.cluster.local

      To get the password for "postgres" run:
        export POSTGRESQL_PASSWORD=$(kubectl get secret --namespace my-namespace my-release-postgresql -o jsonpath="{.data.postgresql-password}" | base64 --decode)

      To connect to your database run the following command:
        kubectl run my-release-postgresql-client --rm --tty -i --restart='Never' --namespace my-namespace --image bitnami/postgresql --env="PGPASSWORD=$POSTGRESQL_PASSWORD" --command -- psql --host my-release-postgresql -U postgres

      To connect to your database from outside the cluster execute the following commands:
        kubectl port-forward --namespace my-namespace svc/my-release-postgresql 5432:5432 &
        psql --host 127.0.0.1 -U postgres
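The `base64 --decode` step in the password command is there because Kubernetes stores Secret values base64-encoded in the object's `data` field. A small local illustration, using a made-up value rather than anything from a real cluster:

```shell
#!/bin/sh
# Sketch only: "c2VjcmV0" is a hypothetical stored value, not a real password.
set -eu

encoded='c2VjcmV0'                                   # what jsonpath would print
decoded=$(printf '%s' "$encoded" | base64 --decode)  # what psql needs

echo "stored in the Secret: $encoded"
echo "after decoding:       $decoded"
```

Note that base64 is an encoding, not encryption: anyone who can read the Secret can recover the password.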
  • 22. Helm Charts – 3: Internals ● Defaults create: – A StatefulSet with 1 Replica (1 Pod) running Postgres from the Docker image – A Headless Service and a Service – A PersistentVolumeClaim from the configured storage provisioner ● Can be configured to: – Load custom Postgres initialisation scripts as ConfigMaps from files/ – Start a metrics exporter for Prometheus: ● https://github.com/wrouesnel/postgres_exporter ● Export e.g. pg_stat_activity, pg_stat_replication or custom metrics queries
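As a rough idea of what a values.yaml for this chart could contain, the sketch below overrides the database settings, the volume size, replication and the metrics exporter. The key names are given from memory of the stable/postgresql chart of that period and should be treated as assumptions; check them against the chart's own values.yaml before relying on them. All values are placeholders.

```shell
#!/bin/sh
# Sketch only: writes an example values.yaml; nothing is installed.
# Key names are assumptions to verify against the chart's values.yaml.
set -eu

cat > values.yaml <<'EOF'
postgresqlUsername: myuser
postgresqlPassword: mypassword   # placeholder
postgresqlDatabase: mydatabase

persistence:
  size: 8Gi                      # size of the PersistentVolumeClaim

replication:
  enabled: true
  slaveReplicas: 1               # read replicas, no automatic failover

metrics:
  enabled: true                  # starts the Prometheus exporter
EOF

echo "Wrote values.yaml"
echo "Install with: helm install --name my-release -f values.yaml stable/postgresql"
```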
  • 23. Helm Charts – 4: Patroni Chart ● For HA you can use the Helm Incubator Patroni Chart: – https://github.com/helm/charts/tree/master/incubator/patroni ● This, too, uses StatefulSets ● Default installation deploys a 5-node Spilo cluster – Zalando’s Spilo is a bundled Postgres & Patroni image ● Installation
      helm repo add incubator https://kubernetes-charts-incubator.storage.googleapis.com/
      helm dependency update
      helm install --name my-release incubator/patroni
  • 25. Crunchy Operator – 1 ● Crunchy Data PostgreSQL Operator – https://github.com/CrunchyData/postgres-operator ● Deploy Postgres with streaming replication & scaling ● Add pgpool, pgbouncer, and metrics sidecars ● Administer SQL policies, users, passwords ● Assign labels to resources ● Minor version upgrades ● Perform backups and restores (or schedule them)
  • 26. Crunchy Operator – 2 Quickstart: ● git clone the GitHub repo, git checkout <tag> ● source examples/envs.sh ● make setupnamespace creates a “demo” namespace ● conf/postgres-operator/pgo.yaml holds the configuration ● make installrbac creates RBAC resources and keys ● make deployoperator deploys the operator
  • 27. Crunchy Operator – 3: pgo ● pgo is the CLI to interact with the operator
      pgo create cluster my-cluster (add --metrics if you want metrics)
      pgo show cluster my-cluster
      pgo scale my-cluster --replica-count=2
      pgo create pgbouncer my-cluster or pgo create pgpool my-cluster (to add a pgbouncer or pgpool sidecar)
    ● Backups
      pgo create cluster my-cluster --pgbackrest
      pgo backup my-cluster --backup-type=pgbackrest (or pgbasebackup)
      pgo restore my-cluster
    ● Manual failovers
      pgo failover my-cluster --query (to get failover targets)
      pgo failover my-cluster --target=my-failover-target-1
  • 29. Observations – 1: Deploying by hand ● Good for rapid development ● Offers isolation equivalent to VMs ● Saves resources compared to VMs ● Doesn’t offer many Cloud Native advantages ● Production usage? – Hard to maintain at scale unless you have an army of DBAs
  • 30. Observations – 2: Helm Charts ● Good for one-time deployments ● Very clean and transparent ● Major version upgrades? ● Slave replicas – no failover unless you set it up explicitly ● Flexibility to carry on using your existing solutions ● Can be used by namespace-admin or plain user with permissions
  • 31. Observations – 3: Crunchy Operator ● All-in-one solution, Postgres as an application ● Makes many tasks easy via CLI and automates others ● You need RBAC and cluster-admin permissions for creation of CRDs – Kubernetes does not support namespaced CRDs :( – https://github.com/kubernetes/kubernetes/issues/65551 ● Under heavy development – perhaps not ideal for production? – But so is Kubernetes :/
  • 32. Observations – 4 ● Hard problem – (Plain) Postgres cluster with multiple write nodes – Multi-master is not always the solution – Can leverage aforementioned solutions with 2ndQuadrant’s pglogical for granularity ● https://www.2ndquadrant.com/en/resources/pglogical/ ● Doesn’t even need a custom image, can be added as post-install hook
  • 33. Alternatives? ● DBaaS/PaaS like Heroku ($$$) ● Managed cloudy DBs like EnterpriseDB’s (AWS) Postgres ● Evil ;) – Amazon RDS (/Aurora?) PostgreSQL – Google Cloud SQL PostgreSQL – Azure Database for PostgreSQL ● Define as Services, connect to Endpoints
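"Define as Services, connect to Endpoints" refers to a standard Kubernetes pattern: a Service without a selector, paired with a manually created Endpoints object of the same name that points at the external database. A minimal sketch, assuming a hypothetical Service name `external-postgres` and a placeholder IP address from the documentation range:

```shell
#!/bin/sh
# Sketch only: writes the manifest to a file; nothing is applied to a cluster.
# The name external-postgres and the IP 192.0.2.10 are placeholders.
set -eu

cat > external-postgres.yaml <<'EOF'
apiVersion: v1
kind: Service
metadata:
  name: external-postgres
spec:
  ports:
    - port: 5432             # no selector: endpoints are managed by hand
---
apiVersion: v1
kind: Endpoints
metadata:
  name: external-postgres    # must match the Service name
subsets:
  - addresses:
      - ip: 192.0.2.10       # address of the managed database
    ports:
      - port: 5432
EOF

echo "Wrote external-postgres.yaml"
echo "Apply with: kubectl apply -f external-postgres.yaml"
```

Applications in the cluster would then connect to `external-postgres:5432` as if the database ran in a Pod. When the provider hands out a hostname rather than a fixed IP, a Service of type ExternalName is the usual alternative.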
  • 34. Thank you =) Twitter: @vyruss Photo: Forth Bridge, Firth of Forth, Edinburgh