Right now, our MERGE/COPY commands point to an s3 folder. Anytime there's more than a single csv file in the S3 folder, Snowflake throws a "duplicate rows" error. I manually move s3 files each morning so that there's only ever one file in the s3 folder. How can I tell snowflake to only MERGE/COPY the newest csv file in the folder? (NOTE: date/time is part of our naming convention for these csv files)
Snowflake: In a MERGE or COPY command (from external stage) can I specify that only the newest csv file should be merged/copied?
2.1k Views Asked by Evan Jennings At
1
There are 1 best solutions below
Related Questions in AMAZON-S3
- Convert JSON.gz to JSON in node js
- Downloading objects from S3 with presigned URL
- "Access Denied" - User's Permissions to S3 Bucket
- jQuery file upload to S3 (and rails) with CORS headers
- copying file from local machine to Ubuntu 12.04 returning permission denied
- AWS Flow Framework: Can we run activity worker and activity task on different EC2 instances
- Unable to access files from public s3 bucket with boto
- s3cmd not working as cron-task when echos/dates are added
- AWS S3 object listing
- React-native upload image to amazons s3
- S3 restrictions on quantity of object downloads
- How to upload a photo in Meteor to S3 and have it sync to database item?
- Limit upload size to S3 with presigned URL
- dragonfly-s3 with S3 IAM user causing a forbidden 403 response from Amazon
- Split S3 files into multiple output files
Related Questions in SNOWFLAKE-CLOUD-DATA-PLATFORM
- Snowflake subquery
- Error in granting ownership in snowflake tables
- Snowflake - Performance when column size is not specified
- snowflake json lateral subquery
- Looking to either Explode or unnest into an array in Snowflake SQL
- I am getting a Pipe Notifications bind failure
- How does run queue work in Snowflake? Is there a concept timeslice at all?
- TO_CHAR and SSSS (hours past midnight)
- Snowflake warehouse cache
- Power bi snowflake Default_role setting
- Error when installing `snowflake-connector-python` to GCP Cloud Composer
- How to Restart & Run All code if there is a Key Error during a ! pip install in Google Colab?
- Count number of records based on last updated date + null
- Task without Virtual Warehouse: Is the query failing or the task not starting?
- Need to include the offset value as expr in LAG functions
Related Questions in BOOMI
- Boomi error: Index: 0, Size: 0
- Test execution of prospect Tracking completed with errors.Embedded Message: Input value 'Prospect' is not a number;java.lang.NumberFormatException
- csv file data is not getting written properly into excel - Dell Boomi
- Using Boomi to cut variable length string at first white space
- Boomi integration - Dynamically inject mapping information
- WebApp in .Net C# 4.5.1 is not working for TLS 1.2
- Error integrating Salesforce to Boomi
- How to split XML into 2 separate docs in dell boomi?
- How can I get Boomi to return valid JSON
- Cannot Initialize Vendor Bill with the Vendor Payment Object in Boomi
- String index out of range: -1 in dell boomi
- How to Check node existence in dell Boomi.
- SEGMENT_UNKNOWN error while acessing SAP backend via JCO-connector
- How is a boomi process stored?
- Snowflake: In a MERGE or COPY command (from external stage) can I specify that only the newest csv file should be merged/copied?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Assuming you are using Dell Boomi to execute your COPY INTO command, are multiple files coming into your S3 bucket in the same load or are they incrementally loading?
If they are incrementally loading I would set PURGE = TRUE on your COPY INTO statement so that once the file is correctly copied it is deleted from your S3 bucket and when the next file comes in there won't be a conflict copying to your stage table. PURGE = TRUE requires you to make sure permissions are setup correctly to allow Snowflake to delete from your S3.
https://docs.snowflake.com/en/sql-reference/sql/copy-into-table.html#purging-files-after-loading
You can also query try doing something like the following if you want to try and get really clever: