Load Parquet Files from ADLS Gen2 using ADF

1.1k Views Asked by sp_analytics At 28 July 2025 at 11:18

I would like to setup ADF pipeline in such a way that I need to load all the Parquet files hosted for 2+ years on ADLS Gen2 with a hierarchy of Year -> Month -> Day -> Hour - > Min. Over the period, we did have some file structure changes with a variance of 2-3 columns. I would like to pull all the common columns and load entire data in a sql table. Can someone please point me to the resources which could help with my requirement.

Thank you!

Original Q&A

There are 1 best solutions below

NiharikaMoola On 01 July 2022 at 11:16

In the Azure data factory pipeline,

Use the Get Metadata activity to get the list of parquet files.
Pass the child items to the ForEach activity to loop each current item.
Add the If condition activity inside ForEach activity to check if the date from the file is greater than the current time minus 2.
Add a copy data activity in True activities to copy data from source to sink.

You can refer to this document to copy data to the SQL table.

Load Parquet Files from ADLS Gen2 using ADF

There are 1 best solutions below

Related Questions in AZURE-PIPELINES

Related Questions in AZURE-DATA-FACTORY

Related Questions in AZURE-SYNAPSE

Related Questions in PARQUET-DATASET

Trending Questions

Popular # Hahtags

Popular Questions