AWS Glue - Tracking Processed Data on DocumentDB


I have a DocumentDB as the data source.

I am running an AWS Glue job that pulls all the data from a certain collection and then inserts it into a Redshift cluster.

Is it possible to avoid inserting duplicate data when the job runs again?

I have seen that AWS Glue supports job bookmarks, but this does not seem to work with DocumentDB as the data source.

Thanks.
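
To make the kind of tracking being asked about concrete, here is a minimal sketch of a manual "high-water mark" filter in plain Python. It is not a Glue or DocumentDB API. The function name `select_unprocessed` and the `updated_at` field are hypothetical, and the sketch assumes every document carries a value that only increases over time. Where the watermark is stored between job runs is left out.

```python
from typing import Any, Dict, Iterable, List, Optional, Tuple

Record = Dict[str, Any]


def select_unprocessed(
    records: Iterable[Record],
    last_watermark: Optional[int],
    key: str = "updated_at",  # hypothetical field; assumed to only ever increase
) -> Tuple[List[Record], Optional[int]]:
    """Return the records newer than the watermark, plus the new watermark."""
    # Keep only records strictly newer than what a previous run already loaded.
    fresh = [
        r for r in records
        if last_watermark is None or r[key] > last_watermark
    ]
    # Advance the watermark; keep the old one if nothing new arrived.
    new_watermark = max((r[key] for r in fresh), default=last_watermark)
    return fresh, new_watermark


# First run: nothing processed yet, so everything is selected.
docs = [
    {"_id": 1, "updated_at": 10},
    {"_id": 2, "updated_at": 20},
    {"_id": 3, "updated_at": 30},
]
first_batch, watermark = select_unprocessed(docs, None)

# Second run: one new document has arrived, and only that one is selected.
docs.append({"_id": 4, "updated_at": 40})
second_batch, watermark = select_unprocessed(docs, watermark)
```

Because the comparison is strict, a document written later with exactly the same `updated_at` value as the stored watermark would be skipped, so this only works if that value is unique or if the load into Redshift also deduplicates on a key.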
