AWS Glue - Tracking Processed Data on DocumentDB

Asked by Amit Zigelman on 26 November 2020 at 10:19

I have a DocumentDB as the data source.

I am running an AWS Glue job that pulls all the data from a certain table and then inserts it into a Redshift cluster.

Is it possible to avoid inserting duplicate data when the job runs again?

I have seen that AWS Glue supports job bookmarks, but they do not seem to work with DocumentDB as the data source.
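To make the question concrete, here is a minimal sketch of the kind of manual tracking that a bookmark would otherwise do: keep a high-water mark from the previous run and only pass on documents newer than it. This is plain Python, not Glue or DocumentDB API code. The function name `select_new_documents` and the `updated_at` field are hypothetical, and the sketch assumes every document carries a timestamp that only ever increases.

```python
from datetime import datetime, timezone
from typing import Any, Dict, Iterable, List, Optional, Tuple

Document = Dict[str, Any]


def select_new_documents(
    documents: Iterable[Document],
    last_watermark: Optional[datetime],
    key: str = "updated_at",  # hypothetical field name, assumed to exist on every document
) -> Tuple[List[Document], Optional[datetime]]:
    """Return the documents newer than last_watermark, plus the new watermark."""
    # On the first run there is no watermark yet, so every document is new.
    new_docs = [
        doc for doc in documents
        if last_watermark is None or doc[key] > last_watermark
    ]
    # If nothing new arrived, keep the previous watermark unchanged.
    new_watermark = max((doc[key] for doc in new_docs), default=last_watermark)
    return new_docs, new_watermark


if __name__ == "__main__":
    docs = [
        {"_id": 1, "updated_at": datetime(2020, 11, 25, tzinfo=timezone.utc)},
        {"_id": 2, "updated_at": datetime(2020, 11, 26, tzinfo=timezone.utc)},
    ]
    first_batch, watermark = select_new_documents(docs, None)
    second_batch, watermark = select_new_documents(docs, watermark)
    print(len(first_batch), len(second_batch))
```

In a real job the watermark would have to be stored somewhere between runs, and the filter would ideally be applied when reading from the source rather than after loading everything. Documents that share exactly the watermark timestamp but arrive later would be skipped by the strict comparison, so this only works if the timestamp is reliable.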

Thanks.
