AWS Batch Processing Solution Diagram (using AWS Glue)

2022-01-11 awsaws-batch-processingdata-engineeringsolution-diagram

This diagram shows a typical batch processing solution on AWS with Amazon S3, AWS Lambda, Amazon Glue and Amazon Redshift:

  • Amazon S3 is used to store staging data extracted from source systems on-premises or on-cloud.
  • AWS Lambda is used to register data arrival in S3 buckets into ETL frameworks and trigger batch process process.
  • Amazon Glueis then used to integrate data like merging, sorting, filtering, aggregations, transformations and load the data.
  • Amazon Redshift is then used to store the transformed data.

This diagram is forked from AWS Batch Processing Solution Diagram

Data Lake
[Not supported by viewer]
Data Warehouse
[Not supported by viewer]
Data Transformation
(ETL)
[Not supported by viewer]
Amazon S3
[Not supported by viewer]
AWS Lambda
[Not supported by viewer]
Amazon Redshift
[Not supported by viewer]
Data Sources
[Not supported by viewer]
Data Staging 
(Raw Data Store)
[Not supported by viewer]
Data Submissions
[Not supported by viewer]
Amazon Glue
[Not supported by viewer]
Amazon S3
[Not supported by viewer]
AWS Glue
Data Catalog
[Not supported by viewer]
Metadata Store
[Not supported by viewer]
MsPortalFx.base.images-23 public:true sdk: MsPortalFx.Base.Images.Polychromatic.Files() category: General image/svg+xml MsPortalFx.base.images-23