Data Warehousing on AWS

January 2021

Notices

Customers are responsible for making their own independent assessment of the information in this document. This document: (a) is for informational purposes only, (b) represents current AWS product offerings and practices, which are subject to change without notice, and (c) does not create any commitments or assurances from AWS and its affiliates, suppliers or licensors. AWS products or services are provided "as is" without warranties, representations, or conditions of any kind, whether express or implied. The responsibilities and liabilities of AWS to its customers are controlled by AWS agreements, and this document is not part of, nor does it modify, any agreement between AWS and its customers.

© 2020 Amazon Web Services, Inc. or its affiliates. All rights reserved.

Contents

Introduction
Introducing Amazon Redshift
Modern Analytics and Data Warehousing Architecture
AWS Analytics Services
Analytics Architecture
Data Warehouse Technology Options
Row-Oriented Databases
Column-Oriented Databases
Massively Parallel Processing (MPP) Architectures
Amazon Redshift Deep Dive
Integration with Data Lake
Performance
Durability and Availability
Elasticity and Scalability
Operations
Redshift Advisor
Interfaces
Security
Cost Model
Ideal Usage Patterns
Anti-Patterns
Migrating to Amazon Redshift
One-Step Migration
Two-Step Migration
Wave-based Migration
Tools and Additional Help for Database Migration
Designing Data Warehousing Workflows
Conclusion
Contributors
Further Reading
Document Revisions

Abstract

Enterprises across the globe want to migrate data warehousing to the cloud to improve performance and lower costs. This whitepaper discusses a modern approach to analytics and data warehousing architecture. It outlines services available on Amazon Web Services (AWS) to implement this architecture, and provides common design patterns to build data warehousing solutions using these services.

This whitepaper is aimed at data engineers, data analysts, business analysts, and developers.

