I have to pull some large amount of data from shopify through api2cart API. I need to pull data for 2/3 years back, which takes hours to pull as we can only pull 250 items per request. With each response, it provides a key to next 250 data pull and so on.
In my backend, I'm pulling data and saving it in a csv file through fs and then I call the api again for next 250 items. it works well on my local machine. the process continues untill all the data are fetched. I can pull years of data and it takes about 2-3 hours to pull about 100k/150k data.
Then I set up a NestJS Microservice and deployed on a digital ocean server. But when I make an API request for long time, after a few time, server gives me a 504 Gateway timeout Error.
Can't use the setTimeout as there' no limit to this process. is there any way to keep pulling data for hours or days?
What to do to pull data for hours without any 504 Gateway timeout Error?
Queues and Bull work well with NestJS, and I would recommend them for these long calls. It is really helpful as you can also retry calls, and add a timeout for failed jobs (to help with 417 errors).