Can we crawl and index Google Drive documents using nutch and solr?

2.6k Views Asked by At

I have tried indexing public url of a google drive document, but it seems that it does not work . Is there any way to crawl google drive documents via nutch and make their index using solr?

1

There are 1 best solutions below

0
On

Use Google Drive API to read/manage files

https://developers.google.com/drive/web/about-sdk

Drive Public URL's page won't have direct links to subdirectories, so you will get nothing if you crawl those pages.