While crawling a webpage I get responses of various types (image, text, HTML, JSON, CSS, JS, etc.). I only need the .json files, not the others. How can I filter out the other response types using HtmlUnit?
The problem is that the data I need is stored in one specific .json file, and that file doesn't have a unique URL. So I am planning to filter out the other response types and download the content of all the JSON files. I will clean the data later.
Please help. Just an idea will be enough.
You can intercept and modify the requests and responses, as hinted here.
Check whether the URL contains the string ".json" and, if it does, save the response.
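A minimal sketch of that check follows. `JsonResponseFilter` and `isJson` are hypothetical names made up for illustration, not part of HtmlUnit. The helper is slightly stricter than a plain "contains" test: it looks at whether the URL path ends in ".json", and it also accepts a JSON Content-Type header, which is an added assumption meant to cover JSON served from URLs without that extension. The commented part at the bottom shows one way it might be wired into HtmlUnit's `WebConnectionWrapper`; the package name of that class depends on your HtmlUnit version, so treat the wiring as an outline rather than tested code.

```java
import java.util.Locale;

// Hypothetical helper (not an HtmlUnit class): decides whether a response
// looks like JSON from its URL and its Content-Type header.
final class JsonResponseFilter {
    private JsonResponseFilter() {}

    static boolean isJson(String url, String contentType) {
        // First check the Content-Type, ignoring parameters such as "; charset=UTF-8".
        if (contentType != null) {
            String type = contentType.toLowerCase(Locale.ROOT);
            int semi = type.indexOf(';');
            if (semi >= 0) {
                type = type.substring(0, semi);
            }
            type = type.trim();
            if (type.equals("application/json") || type.endsWith("+json")) {
                return true;
            }
        }
        // Otherwise fall back to the URL: strip query string and fragment,
        // then test for a ".json" suffix.
        if (url == null) {
            return false;
        }
        String path = url.toLowerCase(Locale.ROOT);
        int cut = path.indexOf('?');
        if (cut >= 0) {
            path = path.substring(0, cut);
        }
        cut = path.indexOf('#');
        if (cut >= 0) {
            path = path.substring(0, cut);
        }
        return path.endsWith(".json");
    }
}

// Possible wiring into HtmlUnit (sketch only, needs the HtmlUnit imports):
//
// new WebConnectionWrapper(webClient) {
//     @Override
//     public WebResponse getResponse(WebRequest request) throws IOException {
//         WebResponse response = super.getResponse(request);
//         if (JsonResponseFilter.isJson(request.getUrl().toString(),
//                                       response.getContentType())) {
//             String json = response.getContentAsString();
//             // save json to a file or collection here
//         }
//         return response;
//     }
// };
```

Constructing the wrapper with the `WebClient` should be enough to install it, so every response fetched while loading the page passes through `getResponse` and only the JSON ones get saved.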