Disaster Recovery — AWS Solutions Architect Associate Complete Course.

Chapter 24: AWS Disaster Recovery Fundamentals

5 min readFeb 8, 2022

We have reached one of the last chapters of the AWS Solutions Architect Associate Course! Today we are going to look at Disaster Recovery in AWS. Let’s imagine that, for whatever reason, an earthquake or tsunami occurs in a region where we host our servers, and our application stops working. In the case of an MVP or a minor application, it would not be so serious, but let’s imagine it is a business-critical application that affects our business considerably if it goes down. Let’s see the different Disaster Recovery strategies using Amazon Web Services!

Disaster Recovery for the AWS Solutions Architect Associate Certification.
  1. RPO and RTO
  2. Disaster Recovery Strategy
  3. Typical Disaster Recovery Questions in the AWS SAA Exam

Remember that you can find all the chapters from the course at the following link:

RPO AND RTO

When we talk about disaster recovery, we have to understand two fundamental concepts before defining the strategies:

  • RPO (Recovery Point Objective) → Maximum amount of data — as measured by time — that can be lost after a recovery from a disaster
  • RTO (Recovery Time Objective) → Metric that defines the time to recover your IT infrastructure and services following a disaster to ensure business continuity
RPO & RTO
RPO & RTO

DISASTER RECOVERY STRATEGIES

Disaster recovery strategies can be categorized into four approaches, ranging from the low cost and low complexity of making backups to more complex procedures.

Disaster Recovery Strategies (via docs.aws.amazon.com).
Disaster Recovery Strategies (via docs.aws.amazon.com).

BACKUP AND RESTORE

Backup & Restore is the most straightforward strategy to implement. On the one hand, it has a high RPO/RTO, but on the other hand, it’s the cheapest one. Using this strategy, you only need to frequently back up your data and restore it in the case of a disaster. In addition to data, you must redeploy the infrastructure, configuration, and application code in the recovery Region.

Backup and Restore Strategy on AWS.
Backup and Restore Strategy on AWS.

PILOT LIGHT

With the pilot light approach, you replicate your data from one Region to another and provide a copy of your core workload infrastructure (just the critical parts). Resources required to support data replication and back-ups, such as databases and object storage, are always on. Other elements, such as application servers, are switched off and are only used during disaster recovery failover, always having the option to quickly provision a full-scale production environment by switching on and scaling out your application servers.

AWS Pilot Light Disaster Recovery Strategy.
AWS Pilot Light Disaster Recovery Strategy.

WARM STANDBY

The warm standby approach ensures a scaled-down but functional copy of your production environment in another Region. So basically, your whole system will be active in a different region but smaller. In case of a disaster, we scale out.

This approach extends the pilot light concept and decreases the time to recovery because your workload is always-on in another Region.

AWS Warm Standby Disaster Recovery Strategy.
AWS Warm Standby Disaster Recovery Strategy.

MULTI-SITE ACTIVE/ACTIVE

Multi-Site active/active has the lowest RTO (reducing recovery time to near zero) but is way more expensive than the other options. For that reason, it’s recommendable for business-critical applications. In this case, you run your workload simultaneously in multiple Regions as part of a multi-site active/active strategy, allowing users to access your workload in any of the Regions in which it is deployed. In a disaster, we point to a different region using Route 53.

AWS Multi-Site active/active Disaster Recovery Strategy.
AWS Multi-Site active/active Disaster Recovery Strategy.

TYPICAL EXAM QUESTIONS

Honestly, in the AWS Solutions Architect Associate exam, there will not be many questions about Disaster Recovery; but in the case of appearing, they would be very similar to these:

A company plans to replicate a limited set of core services to the Disaster Recovery site, ready to take over seamlessly during a disaster. The company will switch off all other services. Which Disaster Recovery should the company use?

  1. Backup and restore
  2. Pilot light
  3. Warm standby
  4. Multi-site

Solution: 2. With the pilot light approach, you replicate your data from one region to another and provide a copy of your core workload infrastructure (just the critical parts). Resources required to support data replication and back-ups, such as databases and object storage, are always on. Other elements, such as application servers, are switched off and are only used during disaster recovery failover, always having the option to quickly provision a full-scale production environment by switching on and scaling out your application servers.

A company plans to keep its core business-critical systems running in the Disaster Recovery cloud. Other services can be replicated but switched off. Which Disaster Recovery strategy should the company use?

  1. Backup and restore
  2. Pilot light
  3. Warm standby
  4. Multi-site

Solution: 3. In a warm standby scenario, a scaled-down version of a fully functional environment is always running in the cloud. When a disaster happens, the system quickly scales up to handle the production load. It extends the pilot light solution. You can read more about the Disaster Recovery Strategies at the following link.

A company needs to ensure they can seamlessly failover between AWS Regions in the event of a disaster with minimal downtime and data loss. The applications will run in an active-active configuration. Which Disaster Recovery strategy should the company use?

  1. Backup and restore
  2. Pilot light
  3. Warm standby
  4. Multi-site

Solution: 4. In a multi-site strategy, the system runs in more than one location, each capable of handling the full production workload. This model, often known as an active-active configuration, is the most expensive but provides the highest level of availability, with zero downtime and near zero loss.

More Questions?

  • Do you want more than 500 AWS practice questions?
  • Access to a real exam simulator to thoroughly prepare for the exam.
  • You can download all of the AWS questions on PDF.

All of this and more at FullCertified!

Thanks for Reading!

If you like my work and you want to support me…

  1. You can follow me on Medium here.
  2. Feel free to clap if this post is helpful for you! :)

--

--

Gonzalo Fernandez Plaza
Gonzalo Fernandez Plaza

Written by Gonzalo Fernandez Plaza

Computer Science Engineer & Tech Lead 🖥️. Publishing AWS & Snowflake ❄️ courses & exams. https://www.fullcertified.com

No responses yet