Running MariaDB Across Data Centers
Tim Tadeo
Senior Technical Sales Engineer (MariaDB Corporation)
“It’s been quite the Journey”
Agenda
 Background on the need for Multiple Datacenter DBMS Architectures
 High Availability, Active-Passive, Location/Application Affinity, Continuous Operation
 Topology Choices for determined use cases
 Traditional Disaster Recovery/Secondary Site – HA/FO
 Geo-Synchronous Distributed – Multi-Master/Active-Active
 How the Topologies work
 MariaDB Master/Slave, Galera Cluster
Answering a few simple Questions!
 What are we trying to solve?
 Why do we need to solve it?
 Where and When can we deploy it? (On-Prem /Cloud /Hybrid)
 How do we choose the correct design?
 Complex and Challenging, the need to simplify and manage!
Answering the Simple Questions
Trade-Offs!
Scalability – Reliability – Performance
Considerations: growth, hardware failures, reconciliation, parallelism, load distribution, closer to users, business continuation, consolidation, agility, data integrity, outage protection, resilience, network partition
Running MariaDB in multiple data centers
Multiple Data Center Architectures
Master/Slave Replication with Multiple Data Centers
Diagram: Data Center DC1 (Active) and Data Center DC2 (Passive), each fronted by a MariaDB MaxScale proxy. Node 1, Node 2 and Node 3 form a multi-master cluster with synchronous replication; each node is annotated with per-proxy priorities (P1: priority=3, P2: priority=1).
Master/Slave Replication with Multiple Data Centers – Semi-Synchronous
Diagram: Data Center DC1 (Active) and Data Center DC2 (Passive), each fronted by a MariaDB MaxScale proxy; each data center has a master and two slaves, with the two masters replicating to each other.
Master/Slave Replication with Read Scaling
Diagram: Writes arrive on port 3307 through a MariaDB MaxScale proxy and go to the Master in Cluster 1 (DC1), which feeds Slave M1, Slave M2 and a Binlog Server; reads arrive on port 3308 and are spread across Slave R1, Slave R2 … Slave R100 in Cluster 2 (DC2), which replicate from the Binlog Server.
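The binlog server shown here is MaxScale's binlogrouter. Below is a minimal, illustrative maxscale.cnf sketch; the section names, port and credentials are placeholders, and exact binlogrouter parameters vary between MaxScale versions, so treat this as a sketch rather than a drop-in configuration.

  # maxscale.cnf (sketch) – MaxScale acting as a binlog server / replication proxy
  [Replication-Proxy]
  type=service
  router=binlogrouter
  user=maxscale
  password=maxscale_pw
  server_id=9999                # server id MaxScale presents when replicating from the master
  binlogdir=/var/lib/maxscale   # local cache of the relayed binlog files

  [Replication-Listener]
  type=listener
  service=Replication-Proxy
  protocol=MariaDBClient
  port=8808                     # Slave R1..R100 point their CHANGE MASTER TO at this port

The write service on port 3307 and the read service on port 3308 shown in the diagram are ordinary routing services (for example readwritesplit or readconnroute) configured alongside this one.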
Master/Slave Replication with a Dedicated Backup
Diagram: A MariaDB MaxScale proxy routes to the Master, with Slave 2 and Slave 3 serving reads and a dedicated backup Slave running MariaDB Backup; the servers are spread across DC 1 and DC 2.
Architecture Technologies
Replication Types
Asynchronous Replication – The master does not wait for the slave; it writes events to its binary log and slaves request them when they are ready.
Semi-Synchronous Replication – The master does not confirm a transaction to the client application until at least one slave has copied the change to its relay log and flushed it to disk.
Synchronous Replication – All nodes are masters and applications can read from and write to any node.
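To make the asynchronous/semi-synchronous distinction concrete, the sketch below shows the my.cnf settings that enable MariaDB's semi-synchronous replication (built into MariaDB 10.3+; older versions load it as a plugin). The timeout value is only an example.

  # master my.cnf (sketch)
  [mariadb]
  server_id=1
  log_bin=mariadb-bin
  rpl_semi_sync_master_enabled=ON
  rpl_semi_sync_master_timeout=10000   # ms; revert to asynchronous if no slave acknowledges in time

  # slave my.cnf (sketch)
  [mariadb]
  server_id=2
  rpl_semi_sync_slave_enabled=ON

Leaving the rpl_semi_sync_* settings off gives plain asynchronous replication; fully synchronous replication is provided by Galera clustering rather than by classic replication settings.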
Replication and Clustering
Diagram: three topologies side by side – asynchronous master/slave replication (one master with slaves), semi-synchronous master/slave replication (one master with slaves), and synchronous multi-master clustering (every node is a master).
Master/Slave Replication
Diagram: the master's binary log and the slave's relay log each contain TX (GTID 1), TX (GTID 2), TX (GTID 3).
1. The slave requests the next transaction, sending its current position (GTID = 2).
2. The master reads the next transaction (GTID = 3) from its binary log.
3. The master replies with the next transaction (GTID = 3).
4. The slave writes the next transaction (GTID = 3) to its relay log.
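A minimal server-side sketch of the settings behind this flow, assuming two MariaDB 10.x servers; the slave is then pointed at the master with CHANGE MASTER TO ... MASTER_USE_GTID=slave_pos and START SLAVE.

  # master my.cnf (sketch)
  [mariadb]
  server_id=1
  log_bin=mariadb-bin       # binary log holding TX (GTID 1..3)
  gtid_domain_id=1
  binlog_format=ROW

  # slave my.cnf (sketch)
  [mariadb]
  server_id=2
  gtid_domain_id=1
  relay_log=mariadb-relay   # relay log the slave writes received transactions to
  log_slave_updates=ON      # keep a binary log so this slave can later be promoted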
Fundamentals: HA with MariaDB TX
Asynchronous with Automatic Failover
Diagram: behind MariaDB MaxScale proxies, a Master (GTID = 3) replicates to Slave 1 (GTID = 2) and Slave 2 (GTID = 1). When the Master fails, the most up-to-date slave is promoted to Master* (GTID = 2) and Slave 2 (GTID = 1) is reconfigured to replicate from it; the transaction with GTID = 3 was never replicated and is lost.
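The automatic failover in this picture is handled by MaxScale's MariaDB Monitor (mariadbmon). A minimal monitor sketch, assuming servers named master1, slave1 and slave2 are defined elsewhere in maxscale.cnf:

  [Replication-Monitor]
  type=monitor
  module=mariadbmon
  servers=master1,slave1,slave2
  user=maxscale
  password=maxscale_pw
  auto_failover=true       # promote the most up-to-date slave (highest GTID) if the master fails
  auto_rejoin=true         # reconfigure a recovered master as a slave
  monitor_interval=2000    # milliseconds in MaxScale 2.3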
Semi-Synchronous with Automatic Failover
Diagram: the same failover sequence with semi-synchronous replication – a Master (GTID = 3) with Slave 1 (GTID = 2) and Slave 2 (GTID = 1) behind MariaDB MaxScale proxies; when the Master fails, the slave with the highest GTID is promoted to Master* and the remaining slave replicates from it.
Topologies: HA with MariaDB TX
Master/Slave Replication with Multiple Data Centers
Diagram: Data Center DC1 (Active) and Data Center DC2 (Passive or Active), each fronted by a MariaDB MaxScale proxy. Node 1, Node 2 and Node 3 form a multi-master cluster with synchronous replication; each node is annotated with per-proxy priorities (P1: priority=3, P2: priority=1).
MariaDB TX Galera 3-Node Cluster with MaxScale
 The master Galera cluster consists of 3 nodes (1 in DC1, 1 in DC2, 1 as arbitrator). If the Galera node in DC1 or DC2 fails, the other one remains active.
 The arbitrator is used only for quorum, in order to avoid split-brain.
 The Galera arbitrator participates as a full node but does not store any data. All transactions are still sent to it, so its network must be fast enough to support this.
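The per-data-center master election described above uses the Galera monitor's use_priority setting together with a priority value on each server definition. A minimal sketch of the DC1 proxy configuration, with hypothetical server names and addresses:

  [node1]
  type=server
  address=10.0.1.11
  port=3306
  priority=1          # lowest value wins: preferred master for this proxy

  [node2]
  type=server
  address=10.0.1.12
  port=3306
  priority=2

  [node3]
  type=server
  address=10.0.2.11   # node in DC2
  port=3306
  priority=3

  [Galera-Monitor]
  type=monitor
  module=galeramon
  servers=node1,node2,node3
  user=maxscale
  password=maxscale_pw
  use_priority=true

The DC2 proxy uses the same server list with the priority values reversed, so that the DC2 node becomes its preferred master.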
Multi-Master Clustering
Diagram: 1. Get writes – the originating node collects the rows modified by the transaction (Row 1, Row 2, Row 3). 2. Send writes – the write set is sent to every node in the cluster. 3. Certify and apply writes – each node certifies the write set and applies it.
What is Galera?
 Replicates the InnoDB storage engine
 All accumulated knowledge about InnoDB still applies: query tuning, server parameters, buffer sizes
 Synchronous replication
What is Galera?
 A MariaDB Galera cluster requires a minimum of 3 nodes
 However, one of the members of the cluster can be an arbitrator (2 nodes + 1 arbitrator)
 Despite not participating in data replication, the arbitrator still needs to be on a 3rd physical node
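A minimal per-node my.cnf sketch for a three-node MariaDB Galera cluster; the provider path and addresses are placeholders for your environment.

  [mariadb]
  binlog_format=ROW
  default_storage_engine=InnoDB
  innodb_autoinc_lock_mode=2

  wsrep_on=ON
  wsrep_provider=/usr/lib/galera/libgalera_smm.so     # path varies by distribution
  wsrep_cluster_name=mariadb_cluster
  wsrep_cluster_address=gcomm://10.0.1.11,10.0.1.12,10.0.2.11
  wsrep_node_address=10.0.1.11                        # this node's own address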
Administration and Monitoring
MaxScale 2.3 – MariaDB
 A database proxy that forwards database statements to one or more database servers. MariaDB MaxScale is designed to provide, transparently to applications, load balancing and high availability functionality.
ClusterControl – Severalnines
 A robust, all-inclusive open source database management system that allows users to easily monitor, deploy, manage, and scale highly available databases (MariaDB), either in the cloud or on premises.
SqlDM – Idera
 Provides an unprecedented level of diagnostic information on the health, performance, and status of MariaDB instances across your environment. You can view, diagnose, and report on critical performance statistics from a central point of control.
Pros and Cons of Multiple Data Center Architectures
Multi-Master or Not
 If your application can handle deadlock errors, multi-master is a good fit
 However, if the database has hot spots, i.e. multi-master conflicts happen frequently, write performance will suffer
 Read scalability is always guaranteed
 Use master/slave if deadlock errors are a problem or the conflict rate hurts your performance
Galera Cluster
 Good Performance
 Optimistic concurrency control
 Virtually synchronous replication
 Parallel replication
 Optimized group communication
 99.99% transparent
 InnoDB look & feel, automatic node joining
 Works in LAN / WAN / Cloud
THANK YOU!
Back-Up Slides Follow (need them “MariaDB standardized” by Marketing)
Galera Replication
Multi-Master Replication
Diagram (progressive build, several MariaDB nodes):
– There can be several nodes
– A client can connect to any node
– Read & write access on any node
– Replication is synchronous
Multi-Master Replication
A MariaDB multi-master cluster looks like one big database with multiple entry points (read & write on every node).
Galera Replication
Synchronous Replication
Diagram sequence (three MariaDB nodes, each with a slave queue):
1. A transaction is processed locally on one node up to commit time (read & write).
2. At commit, the write set is placed in the slave queues of the other nodes.
3. The client gets an OK status.
4. The transaction is applied on the slave nodes.
Galera Replication
Multi-Master Replication
Diagram: a load balancer spreads traffic across the MariaDB nodes.
Galera Replication
Quorum
Diagram: load balancing across MariaDB nodes, with one node failed.
Galera uses quorum-based failure handling:
 When cluster partitioning is detected, the majority partition "has quorum" and can continue
 A minority partition cannot commit transactions, but will attempt to reconnect to the primary partition
 Note: 50% is not a majority! => Minimum 3 nodes recommended
The load balancer will notice errors and remove the failed node from the pool.
MariaDB TX Galera 3-Node Cluster with MaxScale
 The master Galera cluster consists of 3 nodes (1 in DC1, 1 in DC2, 1 as arbitrator). If the Galera node in DC1 or DC2 fails, the other one remains active.
 The arbitrator is used only for quorum, in order to avoid split-brain.
 The Galera arbitrator participates as a full node but does not store any data. All transactions are still sent to it, so its network must be fast enough to support this.
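The arbitrator itself is the separate garbd daemon rather than a full MariaDB server. On Debian/Ubuntu-style packages it is typically configured through a defaults file read by the service script; a sketch, assuming that script reads GALERA_NODES and GALERA_GROUP and reusing the cluster name from the node configuration above:

  # /etc/default/garb (sketch) – Galera Arbitrator daemon
  GALERA_NODES="10.0.1.11:4567 10.0.1.12:4567"   # data-bearing nodes in DC1 and DC2
  GALERA_GROUP="mariadb_cluster"                 # must match wsrep_cluster_name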
Master/Slave Cluster
Connection-based routing
Low overhead
Balances a set of connections over a set of servers
 Uses monitoring feedback to identify the master and slaves
 Connection weighting if configured
 Load-balances queries round robin across configured servers
Each application has a read connection and a write connection
Uses the readconnroute router to provide two services; the MariaDB monitor watches the replication cluster
Automatic failover: election and promotion of a slave
Diagram: MaxScale exposes a Write Connection Service routing writes to the MASTER and a Read Connection Service routing reads to the SLAVES.
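A minimal sketch of the two readconnroute services and their listeners; server names, ports and credentials are placeholders.

  [Write-Service]
  type=service
  router=readconnroute
  router_options=master          # connections go to the current master only
  servers=master1,slave1,slave2
  user=maxscale
  password=maxscale_pw

  [Read-Service]
  type=service
  router=readconnroute
  router_options=slave           # connections are balanced across the slaves
  servers=master1,slave1,slave2
  user=maxscale
  password=maxscale_pw

  [Write-Listener]
  type=listener
  service=Write-Service
  protocol=MariaDBClient
  port=3307

  [Read-Listener]
  type=listener
  service=Read-Service
  protocol=MariaDBClient
  port=3308

For the Galera variant on the next slide, the same pair of services is combined with the galeramon monitor shown earlier; router_options=master/slave then follows the monitor's master election.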
MariaDB Cluster
Connection-based routing
Low overhead
Balances a set of connections over a set of servers
 Uses monitoring feedback to elect the write master node
 Connection weighting if configured
 Load-balances queries round robin across configured servers
Each application has a read connection and a write connection
Uses the readconnroute router to provide two services
GaleraMon monitors the cluster and elects the master
No external failover required
Diagram: MaxScale exposes a Write Connection Service routing writes to the WRITE MASTER node and a Read Connection Service routing reads to the READ nodes.
Load Segregation Within an Application
Galera cluster or master/slave environment
Connection based or statement based
Routes queries to specific servers based on a regular expression match
One service for all workloads, configured to:
 Route queries that match “ *from *users” to server3
 Send all other queries through the default routing configured (connection based or statement based)
 Monitor the cluster and elect the master
Diagram: the client application connects to MaxScale; queries matching / *from *users/ go to SERVER3, all other queries go to SERVER1 and SERVER2.
Editor's Notes
  • #2: Specific slide for use in presentations at OpenWorks 2019. Note: the “DB” of MariaDB is not bolded.
  • #5: Let’s start our discussion today by trying to answer a few simple questions about why an enterprise would consider a DB architecture that supports multi-data-center operations. As we’ll see today, the reasons can vary, the architectures can vary and, most importantly, the complexity and the ability to manage it will drive the choices one makes. Discuss briefly – spend about 60 seconds on just a few of the sub-topics in general; we’ll get to greater detail later.
Topology choices for determined use cases:
– Traditional Disaster Recovery/Secondary Site – HA/FO: Asynchronous/Semi-Synchronous (warm site, read/write split, read scale, dispersed backup node); explain Master/Slave basics
– Geo-Synchronous Distributed – Multi-Master/Active-Active: Synchronous (hot site, no switch-over); Application/Location Affinity (application and DB partitioning); explain Galera basics
Technical architectures – how they work, how to manage:
– MDB standard Master/Slave: HA/FO/read scale/read-write splitting; binlog and relay log
– MDB Cluster – Galera Cluster: Multi-Master/Active-Active; WSREP, certify, apply flow; Application/Location Affinity; split brain (what it is and how to mitigate), Galera Arbitrator
– MaxScale – simplicity and management (should we include ClusterControl?): query routing, load balancing, automated FO, MaxScale redundancy/SPOF (keepalived)
– Monitoring: SqlDM or some network tools, ClusterControl?, command line
  • #6: Let’s start our discussion today by trying to answer a few simple questions about why an enterprise would consider a DB architecture that supports multi-data-center operations. Today, companies are undergoing a digital transformation: offline operations are becoming online operations, enterprise applications are becoming customer-facing applications, and engagement is happening anywhere and everywhere via web, mobile and Internet of Things (IoT) applications – and when it comes to customer experience, availability is not a preference, it is a requirement. As we’ll see today, the reasons can vary, the architectures can vary and, most importantly, the complexity and the ability to manage it will drive the choices one makes.
  • #7: There are many reasons why we would run across multiple data centers! However, for any of the four high-level reasons, the trade-offs between performance, durability and consistency have to be considered. There are times when durability and consistency are more important than performance, and times when performance is more important. The right trade-off depends on the business needs, the use case and the technical requirements.
  • #8: So what we’ll look at is how I can take MDB as illustrated above, assemble the necessary components and deploy across multiple DCs. Basically – what are the capabilities of MDB that can be used in a multi-DC architecture, what can I use? MariaDB TX uses local storage and replication (with or without clustering) to provide high availability via multiple database servers. There is no single point of failure (SPOF). In fact, when MariaDB TX is configured for high availability, downtime due to an unplanned infrastructure failure is all but removed. MariaDB TX, with a history of proven enterprise reliability and community-led innovation, is a complete database solution for any and every enterprise. MariaDB TX, when deployed, is comprised of MariaDB connectors (e.g., JDBC/ODBC), MariaDB MaxScale (a database proxy and firewall), MariaDB Server, MariaDB Cluster (multi-master replication), MariaDB tools and access to MariaDB services – and is available via an enterprise open source subscription.
  • #11: Clustering. In the example above, a three-node cluster is deployed across two data centers with two nodes in the active data center (DC1) and one node in the passive data center (DC2). The configuration for the database proxy in DC1, Proxy 1, assigns a priority value of 1 to Node 1, 2 to Node 2 and 3 to Node 3. It assigns the role of master to Node 1 because Node 1 has the lowest priority value. Proxy 1 uses a basic router to route all reads and writes to any node assigned the master role, Node 1 by default. If Node 1 fails, Proxy 1 will assign the role of master to Node 2 – in the same data center, DC1. If DC1 fails, applications can connect to the database proxy in DC2, Proxy 2. The configuration for Proxy 2 assigns a priority value of 1 to Node 3, 2 to Node 2 and 3 to Node 1. It assigns the role of master to Node 3 because it has the lowest priority value. Proxy 2 uses a basic router to route all reads and writes to any node assigned the master role, Node 3 by default. A cluster can be deployed across multiple data centers if a) there is enough network bandwidth between them to minimize latency and b) the write sets are small (i.e., transactions do not change a lot of rows). Partitioning the data.
  • #12: Multiple DCs – master/slave replication. In the example below, circular replication (e.g., bidirectional master/slave replication) can be used to synchronize data between an active data center (DC1) and a passive data center (DC2). The master in DC1 is configured as a slave to the master in DC2. The master in DC2 is configured as a slave to the master in DC1. If DC1 fails, applications can connect to the database proxy, MariaDB MaxScale, in DC2.
  • #13: Read scalability – master/slave replication. In the example above, a second database proxy is configured and deployed as a binlog server to relay transactions from the master to many slaves for read scaling. The binlog server reduces the replication overhead on the master – instead of many slaves replicating from the master, a single binlog server replicates from it. The master is configured for semi-synchronous replication, for high availability and durability, with Slave M1 and Slave M2, and for asynchronous replication, for read scalability, with the binlog server. The database proxy is configured with two routers, one for each cluster, with each router having a different port. The first will route all writes to the master in Cluster 1. The second will route all reads to the slaves in Cluster 2 (Slave R2 to Slave R100).
The Binlog Server – MaxScale as a replication proxy, a.k.a. the binlog server. In database setups with a large number of users reading data, the binlog server can be used to offload traffic from the master, make master failover easier to handle and in general simplify replication. In a traditional MariaDB/MySQL replication setup, a single master server is created and a set of MariaDB/MySQL slave servers are configured to pull the binlog files from the master, putting a lot of load on the master. Introducing a layer between the master server and the slave servers can reduce the load on the master by serving only MaxScale’s binlog server instead of all the slaves. The slaves only need to be aware of the binlog server and not the real master server. Removing the requirement for the slaves to have knowledge of the master also simplifies the process of replacing a failed master within a replication environment. MaxScale, with the binlog server, can act as a slave to the real master and as a master to the slaves in the same way as an intermediate MySQL master does; however, it does not implement any re-execution of the statements within the binary log. The latency that is introduced is mostly added network latency associated with the extra network hop. There is no appreciable processing performed at the MaxScale level, other than managing the local cache of the binlog files. In addition, every MaxScale that is acting as a proxy of the master will have exactly the same binlog events as the master itself. This means that a slave can be moved between any of the MaxScale servers or to the real master without the need for any special processing. The result is much simpler behavior for failure recovery and the ability to have a very simple and redundant proxy layer for the slave servers.
The binlog server’s main features are: it requests and receives binlog records from the master server autonomously of any slave activity; the stored binlogs are identical to the ones stored on the master server; binlog records received from the master are relayed to the slaves that are able to accept them; and the slave servers can request historical binlog records without sending any additional traffic to the master server.
  • #18: Master/slave replication. The master assigns a transaction a global transaction ID (GTID) and writes the transaction to its binary log. A slave requests the next transaction from the master (sending its current GTID), writes the transaction to its relay log and executes it.
  • #19: Automatic failover. The database proxy, MariaDB MaxScale, has built-in automatic failover. If it is enabled, and the master fails, it will promote the most up-to-date slave (based on GTID) to master and reconfigure the remaining slaves (if any) to replicate from it. In addition, if automatic rejoin is enabled and the failed master is recovered, it will be reconfigured as a slave.
Asynchronous replication. With asynchronous replication, transactions are replicated after being committed. The master does not wait for any of the slaves to acknowledge the transaction before committing it, so write performance is not affected. However, if the master fails and automatic failover is enabled, there will be data loss if one or more transactions have not been replicated to a slave. In the example above, the database proxy would promote Slave 1 to master because it is the slave with the highest GTID. However, the most recent transaction (GTID = 3) had not been replicated before the master failed, so there would be data loss. Asynchronous replication is recommended for read-intensive workloads or mixed/write-intensive workloads where the highest write performance is required.
  • #20: Semi-synchronous replication With semi-synchronous replication, a transaction is not committed until it has been replicated to a slave. It affects write performance, but the effect is minimized by waiting for transactions to replicate to one slave rather than every slave. However, if the master fails and automatic failover is enabled, there will be no data loss because every transaction has been replicated to a slave. In the example below, the database proxy would promote Slave 2 to master because it is the slave with the highest GTID. With semi-synchronous replication, there would be no data loss because at least one of the slaves will have every transaction written to its relay log. NOTE: The master will wait for up to 10 seconds (default) for a transaction to be replicated to a slave before it reverts to asynchronous replication. If it does, and one of the slaves catches up, the master will restore semi-synchronous replication. If all of the slaves are slow, the timeout can be reduced to maintain write performance (but with less durability), or increased to maintain durability (but with less write performance). Semi-synchronous replication is recommended for mixed/write-intensive workloads where high write performance and strong durability are required.
  • #22: Clustering. In the example above, a three-node cluster is deployed across two data centers with two nodes in the active data center (DC1) and one node in the passive data center (DC2). The configuration for the database proxy in DC1, Proxy 1, assigns a priority value of 1 to Node 1, 2 to Node 2 and 3 to Node 3. It assigns the role of master to Node 1 because Node 1 has the lowest priority value. Proxy 1 uses a basic router to route all reads and writes to any node assigned the master role, Node 1 by default. If Node 1 fails, Proxy 1 will assign the role of master to Node 2 – in the same data center, DC1. If DC1 fails, applications can connect to the database proxy in DC2, Proxy 2. The configuration for Proxy 2 assigns a priority value of 1 to Node 3, 2 to Node 2 and 3 to Node 1. It assigns the role of master to Node 3 because it has the lowest priority value. Proxy 2 uses a basic router to route all reads and writes to any node assigned the master role, Node 3 by default.
Multi-master clustering. MariaDB TX supports multi-master clustering via MariaDB Cluster (i.e., Galera Cluster). The originating node assigns a transaction a GTID and, during the commit phase, sends all of the rows modified by it (i.e., writes) to every node within the cluster, including itself. If the writes are accepted by every node within the cluster, the originating node applies the writes and commits the transaction. The other nodes apply the writes and commit the transaction asynchronously.
Automatic failover. If there is a node failure, the cluster will automatically remove it and the database proxy, MariaDB MaxScale, will stop routing queries to it. If the database proxy was routing reads and writes to the failed node, then, because every node can accept reads and writes, the database proxy will select a different node and begin routing reads and writes to it.
Synchronous replication. With synchronous replication, a transaction is not committed until its changes (i.e., modified rows) have been replicated to every node within the cluster. Write performance is limited by the slowest node within the cluster. However, if the node a write was routed to fails, there will be no data loss because the changes for every transaction will have been replicated to every node within the cluster. In the example below, the database proxy would route reads and writes to Node 2 because it has the lowest priority value of the remaining nodes. There would be no data loss because, with synchronous replication, every node has the changes of every transaction. The database proxy can be configured so automatic failover is deterministic (e.g., based on priority value) by setting the use_priority parameter to “true” in the Galera Cluster monitor configuration and the priority parameter in the database server configurations.
  • #23: In a Galera cluster, an uneven number of nodes, e.g. 5 or 7, is strongly advised in order to avoid split-brain situations. MaxScale can be used for load balancing.
  • #31: Specific slide for use in presentations at OpenWorks 2019.
  • #46: In a Galera cluster, an uneven number of nodes, e.g. 5 or 7, is strongly advised in order to avoid split-brain situations. MaxScale can be used for load balancing.
  • #49: One service with one listener port (readwritesplit), or two services with a read and a write listener port (readconnroute), with traffic separated by queries. Each service is configured with a NamedServerFilter that pattern-matches the queries to be sent to a specific server:
[NamedServerFilter]
type=filter
module=namedserverfilter
match= *from *users
options=ignorecase
server=server3

[MyService]
type=service
router=readwritesplit
servers=server1,server2,server3
user=myuser
password=mypassword
filters=NamedServerFilter