I have a data lake in AWS S3. The data format is Parquet, and the daily workload is ~70 GB. I want to build some ad-hoc analytics on top of that data. To do that I see 2 options:
- Use AWS Athena to query the data in place with SQL (HiveQL-compatible DDL), using the AWS Glue Data Catalog for table definitions.
- Move the data from S3 into Redshift as a data warehouse and query Redshift to perform the ad-hoc analysis.
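For context, option 1 would look roughly like this. The table, bucket, column, and partition names below are placeholders for illustration, not my actual schema:

```sql
-- Hypothetical external table over the S3 data lake, registered in the Glue Data Catalog
CREATE EXTERNAL TABLE events (
  event_id   string,
  event_time timestamp,
  payload    string
)
PARTITIONED BY (dt string)   -- daily partitions keep scans close to the ~70 GB/day slice
STORED AS PARQUET
LOCATION 's3://my-data-lake/events/';

-- Example ad-hoc query; the partition predicate limits how much S3 data Athena scans
SELECT count(*)
FROM events
WHERE dt = '2019-01-01';
```

Since Athena bills per byte scanned, partitioning by day plus the columnar Parquet format should keep most ad-hoc queries from touching the full data set.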
What is the best way to do ad-hoc analysis in my case? Is there a more efficient way? And what are the pros and cons of the options mentioned?
PS: After 6 months I'm going to move the data from S3 to Amazon Glacier, so the maximum data volume to query in S3/Redshift would be ~13 TB.