AWS Resilience Hub

[Pages:152]AWS Resilience Hub

User Guide

AWS Resilience Hub User Guide

AWS Resilience Hub: User Guide

Copyright ? 2023 Amazon Web Services, Inc. and/or its affiliates. All rights reserved. Amazon's trademarks and trade dress may not be used in connection with any product or service that is not Amazon's, in any manner that is likely to cause confusion among customers, or in any manner that disparages or discredits Amazon. All other trademarks not owned by Amazon are the property of their respective owners, who may or may not be affiliated with, connected to, or sponsored by Amazon.

AWS Resilience Hub User Guide

Table of Contents

What is AWS Resilience Hub? ............................................................................................................... 1 AWS Resilience Hub ? Resilience management ............................................................................... 1 How AWS Resilience Hub works ........................................................................................... 2 AWS Resilience Hub ? Resilience testing ........................................................................................ 4 AWS Resilience Hub concepts ...................................................................................................... 5 Resiliency .......................................................................................................................... 5 Recovery point objective (RPO) ............................................................................................ 5 Recovery time objective (RTO) ............................................................................................. 5 Estimated workload recovery time objective .......................................................................... 5 Estimated workload recovery point objective ......................................................................... 5 Application ........................................................................................................................ 6 Application component ....................................................................................................... 6 Application compliance status .............................................................................................. 6 Resiliency drift ................................................................................................................... 6 Resiliency assessment ......................................................................................................... 6 Resiliency score .................................................................................................................. 7 Disruption type .................................................................................................................. 7 Fault injection experiments ................................................................................................. 7 SOP .................................................................................................................................. 7 Supported AWS Resilience Hub resources ...................................................................................... 8

Getting started ................................................................................................................................ 10 Prerequisites ............................................................................................................................ 10 Add an application ................................................................................................................... 10 Step 1: Get started by adding an application ....................................................................... 11 Step 2: Manage your application resources .......................................................................... 11 Step 3: Add resources to your AWS Resilience Hub application ............................................... 12 Step 4: Set RTO and RPO .................................................................................................. 15 Step 5: Setup resiliency drift detection ............................................................................... 16 Step 6: Setup permissions ................................................................................................. 17 Step 7: Configure the application configuration parameters ................................................... 17 Step 8: Add tags to your application ................................................................................... 18 Step 9: Review and publish ............................................................................................... 18 Step 10: Run an assessment .............................................................................................. 18

Using AWS Resilience Hub ................................................................................................................. 20 Applications ............................................................................................................................. 20 Viewing application summary ............................................................................................ 22 Editing application resources ............................................................................................. 23 Grouping resources in an AppComponent ............................................................................ 28 Publish a new application version ....................................................................................... 30 Viewing application versions .............................................................................................. 31 Viewing resources of your application ................................................................................. 31 Deleting an application ..................................................................................................... 32 Application configuration parameters ................................................................................. 32 Managing resiliency policies ....................................................................................................... 33 Creating resiliency policies ................................................................................................. 34 Accessing resiliency policy details ....................................................................................... 36 Resiliency assessments .............................................................................................................. 37 Running resiliency assessments .......................................................................................... 37 Reviewing assessments reports .......................................................................................... 38 Deleting resiliency assessments .......................................................................................... 43 Managing alarms ...................................................................................................................... 43 Creating alarms from the assessment report ........................................................................ 43 Viewing alarms ................................................................................................................ 45 Standard operating procedures .................................................................................................. 47

iii

AWS Resilience Hub User Guide

Building an SOP based on AWS Resilience Hub recommendations ........................................... 48 Creating a custom SSM document ...................................................................................... 49 Using a custom SSM document instead of the default ........................................................... 50 Testing SOPs ................................................................................................................... 50 Viewing standard operating procedures ............................................................................... 50 Amazon Fault Injection Service experiments ................................................................................ 51 Creating AWS FIS experiments from the assessment report .................................................... 52 Running an AWS FIS experiment ........................................................................................ 53 Viewing fault injection experiments .................................................................................... 53 Amazon Fault Injection Service experiment failures/status check ............................................ 55 Understanding resiliency scores .................................................................................................. 57 Accessing the Resiliency score of your applications ............................................................... 57 Calculating resiliency scores ............................................................................................... 59 Integrating recommendations into applications ............................................................................ 67 Modifying the AWS CloudFormation template ...................................................................... 68 Using AWS Resilience Hub APIs to describe and manage application ....................................................... 71 Preparing the application .......................................................................................................... 71 Create an application ........................................................................................................ 71 Create resiliency policy ..................................................................................................... 72 Import application resource and monitor import status ......................................................... 72 Publish your application and assign resiliency policy ............................................................. 74 Running and analyzing the application ....................................................................................... 75 Run and monitor a resiliency assessment ............................................................................ 76 Create resiliency policy ..................................................................................................... 78 Modify your application ............................................................................................................ 88 Manually add resources ..................................................................................................... 88 Grouping resources into a single Application Component ....................................................... 89 Excluding a resource from an AppComponent ...................................................................... 90 Security ........................................................................................................................................... 91 Data protection ........................................................................................................................ 91 Encryption at rest ............................................................................................................. 92 Encryption in transit ......................................................................................................... 92 Identity and access management ............................................................................................... 92 Audience ......................................................................................................................... 92 Authenticating with identities ............................................................................................ 93 Managing access using policies .......................................................................................... 95 How AWS Resilience Hub works with IAM ............................................................................ 96 Infrastructure security ............................................................................................................. 130 Working with other services ............................................................................................................. 131 AWS CloudFormation .............................................................................................................. 131 AWS Resilience Hub and AWS CloudFormation templates .................................................... 131 Learn more about AWS CloudFormation ............................................................................ 131 AWS CloudTrail ...................................................................................................................... 132 AWS Systems Manager ............................................................................................................ 132 AWS Trusted Advisor ............................................................................................................... 132 Document history ........................................................................................................................... 134 AWS Glossary ................................................................................................................................. 148

iv

AWS Resilience Hub User Guide AWS Resilience Hub ? Resilience management

What is AWS Resilience Hub?

AWS Resilience Hub is a central location for you to manage and improve the resilience posture of your applications on AWS. AWS Resilience Hub enables you to define your resilience goals, assess your resilience posture against those goals, and implement recommendations for improvement based on the AWS Well-Architected Framework. Within AWS Resilience Hub, you can also create and run Amazon Fault Injection Service experiments, which mimic real-life disruptions to your application to help you better understand dependencies and uncover potential weaknesses. AWS Resilience Hub provides a central place with all the AWS services and tools that you need to continuously strengthen your resilience posture. AWS Resilience Hub works with other services to provide recommendations and help you to manage your application resources. For more information, see Working with other services (p. 131).

The following table provides the documentation links of all the related resiliency services.

Related AWS resiliency services

AWS resiliency service

AWS Elastic Disaster Recovery

AWS Backup

Amazon Route 53 Application Recovery Controller (Route 53 ARC)

Documentation link What is Elastic Disaster Recovery

What is AWS Backup

What is Amazon Route 53 Application Recovery Controller

Topics ? AWS Resilience Hub ? Resilience management (p. 1) ? AWS Resilience Hub ? Resilience testing (p. 4) ? AWS Resilience Hub concepts (p. 5) ? AWS Resilience Hub supported resources (p. 8)

AWS Resilience Hub ? Resilience management

AWS Resilience Hub gives you a central place to define, validate, and track the resiliency of your AWS application. AWS Resilience Hub helps you to protect your applications from disruptions, and reduce recovery costs to optimize business continuity to help meet compliance and regulatory requirements. You can use AWS Resilience Hub to do the following:

? Analyze your infrastructure and get recommendations to improve the resiliency of your applications. In addition to architectural guidance for improving your application resiliency, the recommendations provide code for meeting your resiliency policy, implementing tests, alarms, and standard operating procedures (SOPs) that you can deploy and run with your application in your integration and delivery (CI/CD) pipeline.

? Evaluate recovery time objective (RTO) and recovery point objective (RPO) targets under different conditions.

1

AWS Resilience Hub User Guide How AWS Resilience Hub works ? Optimize business continuity while reducing recovery costs. ? Identify and resolve issues before they occur in production. After you deploy an application into production, you can add AWS Resilience Hub to your CI/CD pipeline to validate every build before it is released into production.

How AWS Resilience Hub works

The following diagram provides a high-level outline of how AWS Resilience Hub works.

2

AWS Resilience Hub User Guide How AWS Resilience Hub works

3

AWS Resilience Hub User Guide AWS Resilience Hub ? Resilience testing

Describe

Describe your application by importing resources from AWS CloudFormation stacks, Terraform state files, AWS Resource Groups, Amazon Elastic Kubernetes Service clusters, or you can choose from applications that are already defined in AWS Service Catalog AppRegistry. Define

Define the resilience policies for your applications. These policies include RTO and RPO targets for applications, infrastructure, Availability Zone, and Region disruptions. These targets are used to estimate whether the application meets the resiliency policy. Assess

After you describe your application and attach a resiliency policy to it, run a resiliency assessment. The AWS Resilience Hub assessment uses best practices from the AWS Well-Architected Framework to analyze the components of an application and uncover potential resilience weaknesses. These weaknesses can be caused by incomplete infrastructure setup, misconfiguration, or situations where additional configuration improvements are needed. To improve resiliency, update your application and resiliency policy according to the recommendations from the assessment report. Recommendations include configurations of components, alarms, tests, and recovery SOPs. Then, you can run another assessment and compare the results with the previous report to see how much resiliency improves. Reiterate this process until your estimated workload RTO and estimated workload RPO meets your RTO and RPO targets. Validate

Run tests to measure the resiliency of your AWS resources and the amount of time it takes to recover from application, infrastructure, Availability Zone, and AWS Region incidents. To measure resiliency, these tests simulate outages of your AWS resources. Examples of outages include network unavailable errors, failovers, stopped processes, Amazon RDS boot recovery, and problems with your Availability Zone. View and track

After you deploy an AWS application into production, you can use AWS Resilience Hub to continue tracking the resiliency posture of the application. If an outage occurs, the operator can view the outage in AWS Resilience Hub and launch the associated recovery process.

AWS Resilience Hub ? Resilience testing

AWS Resilience Hub allows you to perform Amazon Fault Injection Service (AWS FIS) tests and experiments on your AWS workloads and maintain optimal resilience. These tests stress an application by creating disruptive events so that you can observe how your application responds. AWS FIS provides multiple pre-built scenarios and large selection of actions that generate disruptions. In addition, it also includes controls and guardrails that you need to run the experiments in production. The controls and guardrails include options to perform automatic roll back or stop the experiment if specific conditions are met. To get started using the AWS FIS to run experiments from AWS Resilience Hub console, complete the prerequisites that are defined in the section called "Prerequisites" (p. 10) section.

The following table lists all the available AWS FIS options from the navigation pane and the links to the associated AWS FIS documentation that contains the procedures to start using AWS FIS tests from AWS Resilience Hub console.

AWS FIS navigation menu options and references

AWS FIS navigation menu option

AWS FIS documentation

Resilience testing

Create an experiment template

4

................
................

In order to avoid copyright disputes, this page is only a partial summary.

Google Online Preview   Download

To fulfill the demand for quickly locating and searching documents.

It is intelligent file search solution for home and business.

Literature Lottery

Related searches