#4:What kind of disaster are you planning for?
A small-scale event where you simply need to restore from a backup?
A larger-scale event where multiple resources are impacted?
A colossal-scale event where multiple people and resources will be impacted?
Disaster recovery (DR) is about preparing for and recovering from a disaster. Any event that has a negative impact on a company’s business continuity or finances could be termed a disaster. This includes hardware or software failure, a network outage, a power outage, physical damage to a building like fire or flooding, human error, or some other significant event.
To minimize the impact of a disaster, companies invest time and resources to plan and prepare, to train employees, and to document and update processes. The amount of investment for DR planning for a particular system can vary dramatically depending on the cost of a potential outage.
Companies that have traditional physical environments typically must duplicate their infrastructure to ensure the availability of spare capacity in the event of a disaster. The infrastructure needs to be procured, installed, and maintained so that it is ready to support the anticipated capacity requirements. During normal operations, the infrastructure typically is under-utilized or over-provisioned.
With AWS, your company can scale up its infrastructure on an as-needed, pay-as-you-go basis. You get access to the same highly secure, reliable, and fast infrastructure that Amazon uses to run its own global network of websites. AWS also gives you the flexibility to quickly change and optimize resources during a DR event, which can result in significant cost savings.
#5:Production systems typically come with defined or implicit objectives in terms of uptime. A system is highly available when it can withstand the failure of an individual component or multiple components (e.g., hard disks, servers, and network links).
High availability provides redundancy and fault tolerance. Its goal is to ensure that the service remains available even in the event of a failure.
Backup is critical to protect data and to ensure business continuity. At the same time, it can be a challenge to implement well. The pace at which data is generated is growing exponentially, while the density and durability of local disk are not growing at the same rate. Enterprise backup has become its own industry.
Data is generated on an arbitrarily large number of endpoints: laptops, desktops, servers, virtual machines, and now mobile devices. That is, the problem is distributed in nature. Current backup software is highly centralized: the general model is to collect data from many devices and store it in a single place. Sometimes a copy of that stored data is also sent to tape. The centralized approach has the potential to overwhelm the backup target during recovery from a disaster and result in broken recovery SLAs.
Enterprise backup scenarios used to look like this: If you wanted high-performance data access, it had to live on disk. If you wanted cost-effective archival storage, it had to live on tape. If you wanted to archive off-site, you had to physically deliver your archival tapes to another location. Recovery from local disk was fine, unless you needed something from a tape, and a restore could take a while if that tape wasn't on site.
The cloud has changed things. Backup software can write to the cloud without any changes to the backup software itself. (This will be discussed later.)
#6:Recovery point objective (RPO) is the acceptable amount of data loss measured in time. For example, if a disaster occurs at 12:00 PM (noon) and the RPO is one hour, the system should recover all data that was in the system before 11:00 AM. Data loss will span only one hour, between 11:00 AM and 12:00 PM (noon).
#7:Recovery time objective (RTO) is the time it takes after a disruption to restore a business process to its service level, as defined by the operational level agreement (OLA). For example, if a disaster occurs at 12:00 PM (noon) and the RTO is eight hours, the DR process should restore the business process to the acceptable service level by 8:00 PM.
A company typically decides on an acceptable RPO and RTO based on the financial impact to the business when systems are unavailable. The company determines financial impact by considering many factors, such as the loss of business and damage to its reputation due to downtime and the lack of systems availability.
IT organizations then plan solutions to provide cost-effective system recovery based on the RPO within the timeline and the service level established by the RTO.
#8:AWS is available in multiple regions around the globe, so you can choose the most appropriate location for your DR site, in addition to the site where your system is fully deployed.
It is highly unlikely for an entire region to be unavailable, but a very large-scale event that impacts a region (for instance, a meteor strike) is within the realm of possibility.
AWS maintains a page that inventories current services offered by region (products and services by region). AWS maintains a strict region isolation policy so that any large-scale event in one region will not impact any other region. We encourage our customers to take a similar approach to their multi-region strategy. Each region should be able to be taken offline with no impact to any other region.
If you have an AWS Direct Connect (DX) circuit to any AWS Region in the United States, it will provide you with access to all regions in the US, including AWS GovCloud (US), without that traffic going through the public internet.
Also consider how applications are deployed. If you deploy to each region separately, you can isolate that region in case of disaster, and transfer all your traffic to another region.
If you are deploying new applications and infrastructure rapidly, you may want to run active-active across regions. Let's say you deploy something that causes a region's applications to become unavailable or misbehave. You can remove the region from the active record set in Route 53, identify the root cause, and roll back the change before re-enabling the region.
#9:Before discussing the various approaches to disaster recovery, it is important to review the AWS services and features that are the most relevant to it. This section provides a summary.
When planning for DR, it is important to consider the use of services and features that support data migration and durable storage, because they enable you to restore backed-up, critical data to AWS when disaster strikes. For some of the scenarios that involve either a scaled-down or a fully scaled deployment of your system in AWS, compute resources will be required as well.
During a disaster, you need to either spin up new resources or failover to existing pre-configured resources. These resources not only include code and content, but other pieces such as DNS entries, network firewall rules, and virtual machines/instances.
#10:AWS offers many different ways of storing your data. Each service has different capabilities, so that you can match the right service with the right need for each of your systems.
Amazon S3 provides a highly durable storage infrastructure designed for mission-critical and primary data storage. Objects are redundantly stored on multiple devices across multiple facilities within a region, and the service is designed to provide 99.999999999% (11 9s) durability. AWS provides further protection for data retention and archiving through versioning in Amazon S3, AWS MFA, bucket policies, and AWS IAM. Cross-region replication is a bucket-level configuration that enables automatic, asynchronous copying of objects across buckets in different AWS Regions. These buckets are called the source bucket and the destination bucket, and they can be owned by different AWS accounts.
To activate this feature, you add a replication configuration to your source bucket to direct Amazon S3 to replicate objects according to the configuration.
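As an illustration, here is a minimal sketch (using the Python SDK, boto3) of how a replication configuration might be added to a source bucket; the bucket names, IAM role ARN, and rule ID are hypothetical examples, and versioning must be enabled on both buckets before replication can take effect.

# Minimal sketch: enable cross-region replication on a source bucket (boto3).
# Bucket names, the IAM role ARN, and the rule ID are hypothetical examples.
import boto3

s3 = boto3.client("s3")

# Versioning must be enabled on both the source and destination buckets.
s3.put_bucket_versioning(
    Bucket="example-source-bucket",
    VersioningConfiguration={"Status": "Enabled"},
)

# Direct Amazon S3 to replicate all new objects to the destination bucket.
s3.put_bucket_replication(
    Bucket="example-source-bucket",
    ReplicationConfiguration={
        "Role": "arn:aws:iam::111122223333:role/example-replication-role",
        "Rules": [
            {
                "ID": "dr-replication-rule",
                "Status": "Enabled",
                "Prefix": "",  # replicate every object in the bucket
                "Destination": {"Bucket": "arn:aws:s3:::example-destination-bucket"},
            }
        ],
    },
)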
#11:Amazon S3 Glacier provides extremely low-cost storage for data archiving and backup. Objects (or archives, as they are known in Amazon S3 Glacier) are optimized for infrequent access, for which retrieval times of several hours are adequate. Amazon S3 Glacier is designed for the same durability as Amazon S3. Although you need to maintain your own index of the data you upload to Amazon S3 Glacier, an inventory of all archives in each of your vaults is maintained for disaster recovery or occasional reconciliation purposes. The vault inventory is updated approximately once a day. You can request a vault inventory as either a JSON or CSV file; it contains details about the archives within your vault, including the size, creation date, and archive description (if you provided one during upload). The inventory represents the state of the vault at the time of the most recent inventory update.
Similar to Amazon S3, Amazon S3 Glacier allows for cross-region replication.
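As an illustration, a vault inventory can be requested with an inventory-retrieval job; the following minimal boto3 sketch uses a hypothetical vault name, and the job completes asynchronously (typically within hours), after which the JSON inventory can be downloaded.

# Minimal sketch: request a vault inventory from Amazon S3 Glacier (boto3).
# The vault name is a hypothetical example.
import boto3

glacier = boto3.client("glacier")

job = glacier.initiate_job(
    accountId="-",  # "-" means the account that owns the credentials
    vaultName="example-dr-vault",
    jobParameters={"Type": "inventory-retrieval", "Format": "JSON"},
)
print("Inventory job started:", job["jobId"])

# Later, once the job has completed:
# output = glacier.get_job_output(accountId="-", vaultName="example-dr-vault", jobId=job["jobId"])
# inventory = output["body"].read()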
#12:Amazon EBS gives you the ability to create point-in-time snapshots of data volumes. You can use the snapshots as the starting point for new Amazon EBS volumes, and you can protect your data for long-term durability because snapshots are stored within Amazon S3. After a volume is created, you can attach it to a running Amazon EC2 instance. Amazon EBS volumes provide off-instance storage that persists independently from the life of an instance and is replicated across multiple servers in an Availability Zone to prevent the loss of data from the failure of any single component. After you've created a snapshot and it has finished copying to Amazon S3 (when the snapshot status is completed), you can copy it from one AWS region to another, or within the same region. Amazon S3 server-side encryption (256-bit AES) protects a snapshot's data in-transit during a copy operation. The snapshot copy receives an ID that is different than the ID of the original snapshot.
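As an illustration, the following minimal boto3 sketch copies a completed snapshot into a DR region; the snapshot ID and regions are hypothetical examples.

# Minimal sketch: copy an EBS snapshot from one region to another for DR (boto3).
# The snapshot ID and regions are hypothetical examples.
import boto3

# The copy is requested in the destination region.
ec2_dest = boto3.client("ec2", region_name="us-west-2")

copy = ec2_dest.copy_snapshot(
    SourceRegion="us-east-1",
    SourceSnapshotId="snap-0123456789abcdef0",
    Description="DR copy of web-tier data volume",
    Encrypted=True,  # encrypt the copy in the destination region
)
print("New snapshot ID in us-west-2:", copy["SnapshotId"])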
#13:AWS Snowball is a data transport solution that accelerates moving terabytes to petabytes of data into and out of AWS using storage devices designed to be secure for physical transport. Using Snowball helps to eliminate challenges that can be encountered with large-scale data transfers, including high network costs, long transfer times, and security concerns. In the event that you need to quickly retrieve a large quantity of data stored in Amazon S3, Snowball devices can help retrieve the data much more quickly than a high-speed internet connection.
#14:Use AWS DataSync to efficiently and securely sync files from on-premises or in-cloud file systems to Amazon Elastic File System (Amazon EFS) at speeds of up to 10x faster than open-source tools. AWS DataSync securely and efficiently copies files over the internet or a DX connection.
For more information, see: https://aws.amazon.com/datasync/
#15:In the context of DR, it’s critical to be able to rapidly create virtual machines that you control. By launching instances in separate Availability Zones, you can protect your applications from the failure of a single location.
You can arrange for automatic recovery of an EC2 instance when a system status check of the underlying hardware fails. The instance will be rebooted (on new hardware if necessary) but will retain its instance ID, IP address, Elastic IP addresses, EBS volume attachments, and other configuration details. For the recovery to be complete, you'll need to make sure that the instance automatically starts any services or applications as part of its initialization process.
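Automatic recovery is typically configured with a CloudWatch alarm on the system status check; the following minimal boto3 sketch uses a hypothetical instance ID, region, and alarm name.

# Minimal sketch: configure automatic recovery for an EC2 instance (boto3).
import boto3

cloudwatch = boto3.client("cloudwatch", region_name="us-east-1")

cloudwatch.put_metric_alarm(
    AlarmName="recover-web-server-1",
    Namespace="AWS/EC2",
    MetricName="StatusCheckFailed_System",
    Dimensions=[{"Name": "InstanceId", "Value": "i-0123456789abcdef0"}],
    Statistic="Maximum",
    Period=60,
    EvaluationPeriods=2,
    Threshold=1,
    ComparisonOperator="GreaterThanOrEqualToThreshold",
    # The EC2 recover action reboots the instance on new hardware if needed,
    # preserving its instance ID, IP addresses, and EBS volume attachments.
    AlarmActions=["arn:aws:automate:us-east-1:ec2:recover"],
)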
Amazon Machine Images (AMIs) are preconfigured with operating systems, and some preconfigured AMIs might also include application stacks. You can also configure your own AMIs. In the context of DR, AWS strongly recommends that you configure and identify your own AMIs so that they can launch as part of your recovery procedure. Such AMIs should be preconfigured with your operating system of choice plus appropriate pieces of the application stack.
#16:When you are dealing with a disaster, it’s very likely that you will have to modify network settings as your system is failing over to another site. AWS offers several services and features that enable you to manage and modify network settings, such as Amazon Route 53, ELB, Amazon VPC, and DX.
Amazon Route 53 includes a number of global load-balancing capabilities that can be effective when you are dealing with DR scenarios, such as DNS endpoint health checks and the ability to fail over between multiple endpoints, including static websites hosted in Amazon S3.
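As an illustration, the following minimal boto3 sketch configures DNS failover between a primary and a secondary endpoint; the hosted zone ID, domain name, IP addresses, and health check ID are hypothetical examples.

# Minimal sketch: Route 53 DNS failover records with a health-checked primary (boto3).
import boto3

route53 = boto3.client("route53")

route53.change_resource_record_sets(
    HostedZoneId="Z0EXAMPLE12345",
    ChangeBatch={
        "Changes": [
            {
                "Action": "UPSERT",
                "ResourceRecordSet": {
                    "Name": "app.example.com",
                    "Type": "A",
                    "SetIdentifier": "primary-site",
                    "Failover": "PRIMARY",
                    "TTL": 60,
                    "ResourceRecords": [{"Value": "203.0.113.10"}],
                    "HealthCheckId": "11111111-2222-3333-4444-555555555555",
                },
            },
            {
                "Action": "UPSERT",
                "ResourceRecordSet": {
                    "Name": "app.example.com",
                    "Type": "A",
                    "SetIdentifier": "dr-site",
                    "Failover": "SECONDARY",
                    "TTL": 60,
                    "ResourceRecords": [{"Value": "198.51.100.20"}],
                },
            },
        ]
    },
)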
#17:ELB automatically distributes incoming application traffic across multiple Amazon EC2 instances. It enables you to achieve even greater fault tolerance in your applications by seamlessly providing the load-balancing capacity that is needed in response to incoming application traffic. Just as you can pre-allocate Elastic IP addresses, you can pre-allocate your load balancer so that its DNS name is already known, which can simplify the execution of your DR plan.
#18:In the context of DR, you can use Amazon VPC to extend your existing network topology to the cloud. This can be especially appropriate when recovering enterprise applications that are typically on the internal network.
#19:AWS Direct Connect (DX) makes it easy to set up a dedicated network connection from your premises to AWS. In many cases, this can reduce your network costs, increase bandwidth throughput, and provide a more consistent network experience than internet-based connections.
For information on using AWS Direct Connect for high resiliency for critical workloads, see https://aws.amazon.com/directconnect/resiliency-recommendation/
#20:For your database needs, consider using these AWS services: Amazon RDS, Amazon DynamoDB, and Amazon Redshift.
You can use Amazon RDS either in the preparation phase for DR to hold your critical data in a database that is already running, or in the recovery phase to run your production database. If you are using multiple regions, Amazon RDS gives you the ability to copy snapshots from one region to another and to run a read replica in another region. Using Amazon RDS, you can share a manual DB snapshot or DB cluster snapshot with up to 20 other AWS accounts. You can also share an unencrypted manual snapshot as public, which makes the snapshot available to all AWS accounts. Take care when sharing a snapshot as public so that none of your private information is included in any of your public snapshots.
Amazon RDS Read Replicas for MySQL and MariaDB now support Multi-AZ deployments. Combining Read Replicas with Multi-AZ enables you to build a resilient disaster recovery strategy and simplify your database engine upgrade process.
Amazon RDS Read Replicas enable you to create one or more read-only copies of your database instance within the same AWS Region or in a different AWS Region. Updates made to the source database are then asynchronously copied to your Read Replicas. In addition to providing scalability for read-heavy workloads, Read Replicas can be promoted to become a standalone database instance when needed.
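As an illustration, the following minimal boto3 sketch creates a cross-region read replica and shares a manual snapshot with another account; the instance identifiers, ARNs, regions, and account IDs are hypothetical examples.

# Minimal sketch: cross-region read replica and snapshot sharing with Amazon RDS (boto3).
import boto3

# The replica is created in the DR region and references the source by ARN.
rds_dr = boto3.client("rds", region_name="us-west-2")
rds_dr.create_db_instance_read_replica(
    DBInstanceIdentifier="orders-db-replica",
    SourceDBInstanceIdentifier="arn:aws:rds:us-east-1:111122223333:db:orders-db",
    DBInstanceClass="db.r5.large",
    SourceRegion="us-east-1",  # lets boto3 handle the cross-region pre-signed URL
)

# A manual snapshot can also be shared with another AWS account (up to 20 accounts).
rds_src = boto3.client("rds", region_name="us-east-1")
rds_src.modify_db_snapshot_attribute(
    DBSnapshotIdentifier="orders-db-snapshot-2024-01-01",
    AttributeName="restore",
    ValuesToAdd=["444455556666"],  # hypothetical target account ID
)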
#21:You can use Amazon DynamoDB in the preparation phase to copy data to DynamoDB in another region or to Amazon S3. During the recovery phase of DR, you can scale up seamlessly in a matter of minutes with a single click or API call.
Global Tables builds on the DynamoDB global footprint to provide you with a fully managed, multi-region, multi-master database that delivers fast, local read and write performance for massively scaled, global applications. Global Tables replicates your Amazon DynamoDB tables automatically across your choice of AWS regions.
Global Tables eliminates the difficult work of replicating data between regions and resolving update conflicts, enabling you to focus on your application’s business logic. In addition, Global Tables enables your applications to stay highly available even in the unlikely event of isolation or degradation of an entire region.
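As an illustration, the following minimal boto3 sketch creates a global table across two regions; the table name and regions are hypothetical, and a table with the same name, empty and with streams enabled, must already exist in each region.

# Minimal sketch: create a DynamoDB global table across two regions (boto3).
import boto3

dynamodb = boto3.client("dynamodb", region_name="us-east-1")

dynamodb.create_global_table(
    GlobalTableName="session-state",
    ReplicationGroup=[
        {"RegionName": "us-east-1"},
        {"RegionName": "eu-west-1"},
    ],
)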
#22:AWS CloudFormation allows you to model your entire infrastructure in a text file. This template becomes the single source of truth for your infrastructure. This helps you to standardize infrastructure components used across your organization, enabling configuration compliance and faster troubleshooting.
AWS CloudFormation provisions your resources in a safe, repeatable manner, allowing you to build and rebuild your infrastructure and applications, without having to perform manual actions or write custom scripts. AWS CloudFormation takes care of determining the right operations to perform when managing your stack, and rolls back changes automatically if errors are detected.
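As an illustration, the following minimal boto3 sketch provisions a recovery stack from a stored template; the stack name, template URL, and parameter values are hypothetical examples.

# Minimal sketch: launch a DR stack from a stored CloudFormation template (boto3).
import boto3

cfn = boto3.client("cloudformation", region_name="us-west-2")

cfn.create_stack(
    StackName="dr-web-tier",
    TemplateURL="https://s3.amazonaws.com/example-dr-templates/web-tier.yaml",
    Parameters=[
        {"ParameterKey": "Environment", "ParameterValue": "dr"},
    ],
    Capabilities=["CAPABILITY_NAMED_IAM"],  # needed if the template creates named IAM resources
    OnFailure="ROLLBACK",  # roll back automatically if provisioning fails
)

# Wait until the stack (and the recovery environment it describes) is ready.
cfn.get_waiter("stack_create_complete").wait(StackName="dr-web-tier")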
#23:You can use AWS Elastic Beanstalk to upload an updated source bundle and deploy it to your AWS Elastic Beanstalk environment, or to redeploy a previously uploaded version.
You can deploy a previously uploaded version of your application to any of its environments.
#24:AWS OpsWorks is an application management service that makes it easy to deploy and operate applications of all types and sizes. You can define your environment as a series of layers, and configure each layer as a tier of your application. AWS OpsWorks has automatic host replacement, so in the event of an instance failure it will be automatically replaced. You can use AWS OpsWorks in the preparation phase to template your environment, and you can combine it with AWS CloudFormation in the recovery phase. You can quickly provision a new stack from the stored configuration that supports the defined RTO.
#26:In most traditional environments, data is backed up to tape and sent offsite regularly. If you use this method, it can take a long time to restore your system in the event of a disruption or disaster.
Amazon S3 is an ideal destination for backup data that might be needed quickly to perform a restore. Transferring data to and from Amazon S3 is typically done through the network and is therefore accessible from any location. There are many commercial and open-source backup solutions that integrate with Amazon S3. For example:
You can use AWS Snowball to transfer very large data sets by shipping storage devices directly to AWS.
For longer-term data storage where retrieval times of several hours are adequate, there is Amazon Glacier, which has the same durability model as Amazon S3. Amazon Glacier and Amazon S3 can be used in conjunction to produce a tiered backup solution.
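As a simple illustration of writing backup data directly to Amazon S3, the following minimal boto3 sketch uploads and later restores a backup archive; the bucket name, object key, and file paths are hypothetical examples.

# Minimal sketch: write a backup archive to Amazon S3 and restore it later (boto3).
import boto3

s3 = boto3.client("s3")

# Back up: upload the archive over the network to a durable S3 bucket.
s3.upload_file(
    Filename="/backups/db-backup-2024-01-01.tar.gz",
    Bucket="example-backup-bucket",
    Key="db/db-backup-2024-01-01.tar.gz",
)

# Restore: download the same object when the data is needed again.
s3.download_file(
    Bucket="example-backup-bucket",
    Key="db/db-backup-2024-01-01.tar.gz",
    Filename="/restore/db-backup-2024-01-01.tar.gz",
)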
#27:AWS Storage Gateway connects an on-premises software appliance with cloud-based storage to provide seamless and highly secure integration between your on-premises IT environment and the AWS storage infrastructure. The service allows you to securely store data in the AWS cloud for scalable and cost-effective storage. Storage Gateway supports industry-standard storage protocols that work with your existing applications while securely storing all of your data encrypted in Amazon S3 or Amazon Glacier.
With AWS Storage Gateway, you get an extension of AWS management services locally; the service is also integrated with Amazon CloudWatch, AWS CloudTrail, AWS KMS, and AWS IAM.
AWS Storage Gateway supports three storage interfaces: file, volume, and tape. Each gateway you have can provide one type of interface.
The file gateway enables you to store and retrieve objects in Amazon S3 using the NFS and SMB file protocols. Objects written through file gateway can be directly accessed in S3.
The volume gateway provides block storage to your applications using the iSCSI protocol. Data on the volumes is stored in Amazon S3. To access your volume data in AWS, you can take Amazon EBS snapshots of the volumes, which can then be used to create EBS volumes.
The tape gateway provides your backup application with an iSCSI virtual tape library (VTL) interface, consisting of a virtual media changer, virtual tape drives, and virtual tapes. Virtual tape data is stored in Amazon S3 or can be archived to Amazon Glacier.
To back up your on-premises data to the AWS cloud, you can choose between two common approaches:
Writing backup data directly to Amazon S3 by making API calls to the AWS service. Backup data is written and retrieved through secure HTTP PUT and GET requests directly across the internet; the endpoint itself makes a direct connection with Amazon S3 to write and retrieve data.
Writing backup data through an AWS Storage Gateway, using one of the gateway configurations described below.
Gateway-Virtual Tape Library (VTL)
You can have a limitless collection of virtual tapes. Each virtual tape can be stored in a virtual tape library backed by Amazon S3 or a virtual tape shelf backed by Amazon Glacier.
Gateway-Cached Volumes
You can store your primary data in Amazon S3 and retain your frequently accessed data locally. Gateway-cached volumes provide substantial cost savings on primary storage, minimize the need to scale your storage on-premises, and retain low-latency access to your frequently accessed data.
Gateway-Stored Volumes
If you need low-latency access to your entire data set, you can configure your on-premises data gateway to store your primary data locally and asynchronously back up point-in-time snapshots of this data to Amazon S3.
AWS Storage Gateway Hardware Appliance
The AWS Storage Gateway Hardware Appliance is a physical appliance with the AWS Storage Gateway software preinstalled on a third-party server that you install on-premises. The appliance can be managed from the Hardware page on the AWS Management Console.
https://docs.aws.amazon.com/storagegateway/latest/userguide/HardwareAppliance.html
#28:In addition to the NFS v3 and v4.1 protocols, the AWS Storage Gateway service added the Server Message Block (SMB) protocol to File Gateway, enabling file-based applications developed for Microsoft Windows to easily store and access objects in Amazon Simple Storage Service (Amazon S3). For more information, see: https://aws.amazon.com/about-aws/whats-new/2018/06/aws-storage-gateway-adds-smb-support-to-store-objects-in-amazon-s3/
#29:After you've installed the AWS Storage Gateway software appliance—the virtual machine (VM)—on a host in your data center and activated it, you can create gateway storage volumes and map them to on-premises direct-attached storage (DAS) or storage area network (SAN) disks. You can start with either new disks or disks already holding data. You can then mount these storage volumes to your on-premises application servers as iSCSI devices. As your on-premises applications write data to and read data from a gateway's storage volume, this data is stored and retrieved from the volume's assigned disk.
To prepare data for upload to Amazon S3, your gateway also stores incoming data in a staging area, referred to as an upload buffer. You can use on-premises DAS or SAN disks for working storage. Your gateway uploads data from the upload buffer over an encrypted Secure Sockets Layer (SSL) connection to the AWS Storage Gateway service running in the AWS cloud. The service then stores the data encrypted in Amazon S3.
You can take incremental backups, called snapshots, of your storage volumes. The gateway stores these snapshots in Amazon S3 as Amazon EBS snapshots. When you take a new snapshot, only the data that has changed since your last snapshot is stored. You can initiate snapshots on a scheduled or one-time basis. When you delete a snapshot, only the data not needed for any other snapshot is removed.
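As an illustration, the following minimal boto3 sketch takes a one-time snapshot of a gateway storage volume; the volume ARN and description are hypothetical examples.

# Minimal sketch: take a one-time snapshot of a gateway storage volume (boto3).
# The snapshot is stored in Amazon S3 as an Amazon EBS snapshot.
import boto3

sgw = boto3.client("storagegateway", region_name="us-east-1")

response = sgw.create_snapshot(
    VolumeARN=(
        "arn:aws:storagegateway:us-east-1:111122223333:"
        "gateway/sgw-12A3456B/volume/vol-0123456789abcdef0"
    ),
    SnapshotDescription="Nightly backup of file server volume",
)
print("EBS snapshot ID:", response["SnapshotId"])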
You can restore an Amazon EBS snapshot to an on-premises gateway storage volume if you need to recover a backup of your data. You can also use the snapshot as a starting point for a new Amazon EBS volume, which you can then attach to an Amazon Elastic Compute Cloud (Amazon EC2) instance.
#30:For gateway-stored volumes, your volume data is stored on-premises. In this case, snapshots provide durable, off-site backups in Amazon S3. For example, if a local disk allocated as a storage volume crashes, you can provision a new local disk and restore a snapshot to it during the volume creation process. (For more information on this approach, see Adding a Storage Volume at http://docs.aws.amazon.com/storagegateway/latest/userguide/ApplicationStorageVolumesStored-Adding.html).
After you initiate a snapshot restore to a gateway-stored volume, snapshot data is downloaded in the background. This functionality means that after you create a volume from a snapshot, there is no need to wait for all of the data to transfer from Amazon S3 to your volume before your application can start accessing the volume and all of its data. If your application accesses a piece of data that has not yet been loaded, the gateway immediately downloads the requested data from Amazon S3. The gateway then continues loading the rest of the volume's data in the background.
#33:This pattern is relatively inexpensive to implement. In the preparation phase of DR, it is important to consider the use of services and features that support data migration and durable storage, because they enable you to restore backed-up, critical data to AWS when disaster strikes. For some of the scenarios that involve either a scaled-down or a fully scaled deployment of your system in AWS, compute resources will be required as well.
When reacting to a disaster, it is essential to either quickly commission compute resources to run your system in AWS or to orchestrate the failover to already running resources in AWS. The essential infrastructure pieces include DNS, networking features, and various Amazon EC2 features.
In the preparation phase, you need to have your regularly changing data replicated to the pilot light, the small core around which the full environment will be started in the recovery phase. Your less frequently updated data, such as operating systems and applications, can be periodically updated and stored as AMIs.
#37:Low-capacity standby is the next level beyond pilot light. The term warm standby is used to describe a DR scenario in which a scaled-down version of a fully functional environment is always running in the cloud. A warm standby solution extends the pilot light elements and preparation. It further decreases the recovery time because some services are always running. By identifying your business-critical systems, you can fully duplicate these systems on AWS and have them always on.
These servers can be running on a minimum-sized fleet of Amazon EC2 instances using the smallest instance types possible. This solution is not scaled to take a full production load, but it is fully functional. It can be used for non-production work, such as testing, quality assurance, and internal use.
In a disaster, the system is scaled up quickly to handle the production load. In AWS, this can be done by adding more instances to the load balancer and by resizing the small capacity servers to run on larger Amazon EC2 instance types. As stated in the preceding section, horizontal scaling is preferred over vertical scaling.
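As an illustration, if the standby fleet is managed by an Auto Scaling group, scaling up during failover might look like the following minimal boto3 sketch; the group name and capacities are hypothetical examples.

# Minimal sketch: scale a warm-standby fleet up to production capacity (boto3).
import boto3

autoscaling = boto3.client("autoscaling", region_name="us-west-2")

# Raise the group limits so the standby fleet can grow to production size.
autoscaling.update_auto_scaling_group(
    AutoScalingGroupName="warm-standby-web",
    MinSize=4,
    MaxSize=20,
    DesiredCapacity=10,
)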
In the diagram above there are two systems running: the main system and a low-capacity system running on AWS. Use Amazon Route 53 to distribute requests between the main system and the cloud system.
#38:If the primary environment is unavailable, Amazon Route 53 switches over to the secondary system, which is designed to automatically scale its capacity up in the event of a failover from the primary system.
#39:This pattern is more expensive because active systems are running.
#41:The next level of disaster recovery is to have a fully functional system running in AWS at the same time as the on-premises systems.
A multi-site solution runs in AWS as well as on your existing on-site infrastructure, in an active-active configuration. The data replication method that you employ will be determined by the recovery point that you choose.
You can use a DNS service that supports weighted routing, such as Amazon Route 53, to route production traffic to different sites that deliver the same application or service. A proportion of traffic will go to your infrastructure in AWS, and the remainder will go to your on-site infrastructure.
In an on-site disaster situation, you can adjust the DNS weighting and send all traffic to the AWS servers. The capacity of the AWS service can be rapidly increased to handle the full production load. You can use Amazon EC2 Auto Scaling to automate this process. You might need some application logic to detect the failure of the primary database services and cut over to the parallel database services running in AWS.
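As an illustration, the following minimal boto3 sketch shifts all traffic to AWS by adjusting the weights on Route 53 weighted records; the hosted zone ID, domain name, and targets are hypothetical examples.

# Minimal sketch: shift traffic to the AWS site by changing Route 53 weights (boto3).
import boto3

route53 = boto3.client("route53")

def set_weight(set_identifier, target, weight):
    # Update one weighted record in the weighted record set for app.example.com.
    route53.change_resource_record_sets(
        HostedZoneId="Z0EXAMPLE12345",
        ChangeBatch={
            "Changes": [
                {
                    "Action": "UPSERT",
                    "ResourceRecordSet": {
                        "Name": "app.example.com",
                        "Type": "CNAME",
                        "SetIdentifier": set_identifier,
                        "Weight": weight,
                        "TTL": 60,
                        "ResourceRecords": [{"Value": target}],
                    },
                }
            ]
        },
    )

# Normal operation might be 80% on-site, 20% AWS. During failover, drop the
# on-site weight to 0 so that all requests are routed to AWS.
set_weight("onsite", "app.onprem.example.com", 0)
set_weight("aws", "elb-example-123.us-west-2.elb.amazonaws.com", 100)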
The cost of this scenario is determined by how much production traffic is handled by AWS during normal operation. In the recovery phase, you pay only for what you use for the duration that the DR environment is required at full scale. You can further reduce cost by purchasing Amazon EC2 Reserved Instances for your “always on” AWS servers.
#42:This pattern potentially has the least downtime of all. It does have more costs associated with it, because more systems are running.
#43:Applications can be placed on a spectrum of complexity. Business continuity ensures that critical business functions continue to operate or recover quickly despite serious disasters.
The next slides outline four DR scenarios that highlight the use of AWS and compare AWS with traditional DR methods (sorted from highest to lowest RTO/RPO), as follows:
Backup and Restore
Pilot Light
Fully Working Low-Capacity Standby
Multi-Site Active-Active
The figure above shows a spectrum for the four scenarios, arranged by how quickly a system can be available to users after a DR event.
AWS enables you to cost-effectively operate each of these DR strategies. It’s important to note that these are just examples of possible approaches, and variations and combinations of these are possible. If your application is already running on AWS, then multiple regions can be employed and the same DR strategies will still apply.
#44:Start simple and work your way up.
Backups in AWS are a first step.
Incrementally improve RTO/RPO as a continuous effort.
Check for any software licensing issues.
Exercise your DR solution.
Practice "Game Day" exercises. These exercises simulate critical systems, or even entire regions, going offline. What if an entire fleet were to crash?
Ensure that backups, snapshots, AMIs, etc. are working.
Monitor your monitoring system.