How to add a validation in an Azure Data Factory pipeline to check file size?

I have multiple data sources, and I want to add a validation step in Azure Data Factory before loading into tables: it should check the file size so that empty files are not loaded. If the file size is more than 10 KB (i.e. the file is not empty), the load should start; if it is empty, the load should not start. I looked at the Validation activity in Azure Data Factory, but it does not show the size for multiple files in a folder. Any suggestions are appreciated; adding a Python notebook for this validation would also work.
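Since the question mentions that a Python notebook would also do, here is a minimal sketch of such a check in an Azure Databricks notebook. The folder path and mount point below are placeholders, not values from the question; the 10 KB threshold comes from the question itself.

```python
# Minimal sketch of the size check in an Azure Databricks notebook.
# dbutils is available implicitly in Databricks notebooks; the folder path
# below is a placeholder for wherever the source files land.
MIN_SIZE_BYTES = 10 * 1024                   # 10 KB threshold from the question
source_folder = "dbfs:/mnt/raw/incoming/"    # hypothetical mounted folder

# dbutils.fs.ls returns FileInfo entries with a .size attribute in bytes;
# directory entries have names ending in "/" and are skipped here.
files = [f for f in dbutils.fs.ls(source_folder) if not f.name.endswith("/")]

too_small = [f.path for f in files if f.size < MIN_SIZE_BYTES]

if not files or too_small:
    # Failing the notebook fails the ADF Notebook activity, so downstream
    # copy/load activities wired to its success path will not run.
    raise ValueError(f"Load skipped; empty or undersized files: {too_small or source_folder}")

print(f"All {len(files)} files are at least {MIN_SIZE_BYTES} bytes; proceed with load.")
```

Calling this notebook from an ADF Notebook activity placed before the copy activity gives the same kind of gate as the Get Metadata / If Condition approach described in the answer below.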

Use a Get Metadata activity (under General activities), then send the result to an If Condition. You will need to get the file size from the dataset, so select the Size field in the Get Metadata field list.

If you are working with a directory, list the files with Get Metadata and loop over them with a ForEach activity. Inside the ForEach, @item().name is the name of the file you want to get the size of, and the source dataset will need a FileName parameter so each iteration can point at the current file.
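As a rough sketch of the expressions involved (the activity name Get Metadata1, the 10240-byte threshold, and the FileName parameter are assumptions rather than values from the original answer), the If Condition could gate the load on the Size field returned by Get Metadata:

```
@greater(activity('Get Metadata1').output.size, 10240)
```

For the directory case, the ForEach Items setting would be @activity('Get Metadata1').output.childItems (which requires the Child items field on Get Metadata), and each iteration would pass @item().name into the dataset's FileName parameter before running an inner Get Metadata and the same size comparison.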