I have 4000 files each averaging 30Kb in size landing in a folder on our on premise file system each day. I want to apply conditional logic (several and/or conditions) against details in their file names to only move files matching the conditions into another folder. I have tried linking a meta data activity which gets all files in the source folder with a filter activity which applies the conditional logic with a for each activity with an embedded copy activity. This works but it is taking hours to process the files. When running the pipeline in debug the output window appears to list each file copied as a line item. I’ve increased the batch count setting in the for each to 50 but it hasn’t improved things. Is there a way to link the filter activity directly to the copy activity without using for each activity? Ie pass the collection from the filter straight into copy’s source. Alternatively, some of our other pipelines just use the copy activity pointing to a source folder and we configure its filefilter setting with a simple regex using a combination of * and ?, which is extremely fast. However, in this particular scenario, my conditional logic is more complex and I need to compare attributes in each file’s name with values to decide if the file should be moved. The filefilter setting allows dynamic content so I could remove the filter activity completely, point the copy to the source folder and put the conditional logic in the filefilter’s dynamic content area but how would I get a reference to the file name to do the conditional checks?
How can I pass output from a filter activity directly to a copy activity in ADF?
2.6k Views Asked by Roy At
1
There are 1 best solutions below
Related Questions in COPY
- What is correct way to copy struct instance with fields in Go?
- On paste StartFragment - EndFragment Postman issue
- Copy paste a single value from within a cell to another cell
- Char array, char pointer, works in one case but not the other?
- Macro to copy, rename and organize a template worksheet
- Stored procedure with copy statement keeps running when called
- How can creation times be preserved while copying on Mac?
- Automate buildozer with windows bat file and ubuntu script
- If Cell contains specific text in col A copy cell from Col C into Col F stacked
- Why do I have a 403 error when trying to save a website
- SSRS Report - Graph when exported is compressed into a graphic
- Need to copy multiple cells in the same row of data from one sheet to another when a Checkbox is checked
- Is there a way to set an open workbook as a source without having the file name?
- How do you deep copy a variant of pointers in C++?
- Python Copy dictionary
Related Questions in AZURE-DATA-FACTORY
- Difficulty creating a data pipeline with Fabric Datafactory using REST
- Accessing REST API Status Codes using Azure Data Factory Copy Activity (or similar)?
- Use an activity output as the step name to get additional info in ADF
- Exit loop condition when running the synpase notebooks based on metadata dependencies
- Azure Data Factory Copy Activity Only Importing First Row of XML file
- ADF Copy Activity from Source Azure Synapse Analytics Target ADLSGen2 Storage account
- Parmeter values not resolving in ADF
- How to copy XML files in a folder F1 based on whether its content is present on folder F2 (disregarding file names)
- Can I move an Azure Data Factory Pipeline to Azure DevOps?
- tsql functions like REPLACE() failing in azure data factory pipeline connected to salesforce
- Get the URL from C# script used in ssis
- Reading Unstructured Text from the entire file in Azure Data Factory
- Unable to PUT JSON using ADF Dataflow, the error is "the JSON value could not be converted to System.Collections.Generic.List"
- Manipulating Json in Azure Data Factory activities
- Couchbase Connector in ADF
Related Questions in FILEFILTER
- WinMerge file filter to only include specific folders?
- NestJs Multer upload file with fileFilter cause infinite pending
- Java FileFilter Doesn't Show Any Output With Filtering .mp3 Extension
- Open file filter dialog without showing file filter
- InitLoader must be called on the main thread doInBackground
- Android 11 (30 API) File.listFiles() doesn't return all files
- How to get WHQL certification for a file driver for multiple Windows versions?
- How can I pass output from a filter activity directly to a copy activity in ADF?
- JFileChooser file filtering doesn't actually filter (at least as I intend it to)
- Android file opener for json files only (Java)
- How to make JFileChooser show every type of files except .huff files
- fileFilter on multer still allowing all file types
- How to use Method reference for FileFilter with getName().endsWith()
- java.lang.NoClassDeffoundError when attempting to use FileFilter
- Simple JFileChooser FileFilter not working
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Here is one solution:
Copy Data Source:
Copy Data Sink:
Copy Activity Source (to get it from JSON to .txt):
Sink will be .txt file in your Blob.
This should copy over all the files that you identified in your Filter Activity.
I realize this is a work around, but really is the only solution for what you are asking. Otherwise there is no way to link a filter activity straight to a copy activity.