Amazon S3 Glacier | AWS Big Data Blog

Amazon EMR streamlines big data processing with simplified Amazon S3 Glacier access

In this post, we demonstrate how to set up and use Amazon EMR on EC2 with S3 Glacier for cost-effective data processing.

Working with timestamp with time zone in your Amazon S3-based data lake

With a data lake built on Amazon Simple Storage Service (Amazon S3), you can use the purpose-built analytics services for a range of use cases, from analyzing petabyte-scale datasets to querying the metadata of a single object. AWS analytics services support open file formats such as Parquet, ORC, JSON, Avro, CSV, and more, so it’s […]

Keeping your data lake clean and compliant with Amazon Athena

June 2025: This post has been reviewed for accuracy and the following updates have been made: added new function to retrieve SQL query in the Lambda code; upgraded Python’s run time and version of sqlparse in the Lambda deployment package; added and removed actions in the Lambda policy; updated the CloudFormation template to reflect policy […]

AWS Big Data Blog

Category: Amazon S3 Glacier

Amazon EMR streamlines big data processing with simplified Amazon S3 Glacier access

Working with timestamp with time zone in your Amazon S3-based data lake

Keeping your data lake clean and compliant with Amazon Athena

Learn

Resources

Developers

Help