How can I find the sitemap for each domain and subdomain using Python? Some examples:
abcd.com/sitemap.xml
abcd.com/sitemap.html
sub.abcd.com/sitemap.xml
And so on.
What are the most probable sitemap names, locations, and extensions?
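You should try the standard library's urllib.robotparser module.
Its RobotFileParser.site_maps() method (Python 3.8+) returns every sitemap URL declared on a "Sitemap:" line in the site's robots.txt, or None if there are none.
Most sites list their sitemaps there, so you usually don't need to guess the file name or extension.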
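
Here is a minimal sketch of that approach. The helper names sitemaps_from_robots_txt and fetch_sitemaps are made up for illustration, and it assumes robots.txt lives at the root of each host (each subdomain has its own, so call it once per host). It needs Python 3.8 or newer for site_maps().

```python
from urllib.robotparser import RobotFileParser


def sitemaps_from_robots_txt(robots_txt):
    """Return the sitemap URLs declared in the text of a robots.txt file."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when no "Sitemap:" lines are present.
    return parser.site_maps() or []


def fetch_sitemaps(base_url):
    """Download <base_url>/robots.txt and return the sitemaps it lists.

    Hypothetical helper: performs a network request, so URL errors
    (DNS failure, refused connection) propagate to the caller.
    """
    parser = RobotFileParser()
    parser.set_url(base_url.rstrip("/") + "/robots.txt")
    parser.read()
    return parser.site_maps() or []
```

For example, fetch_sitemaps("https://abcd.com") and fetch_sitemaps("https://sub.abcd.com") would check the domain and the subdomain separately. If a site's robots.txt declares no sitemaps, both helpers return an empty list, and you would have to fall back to probing guessed paths such as /sitemap.xml yourself.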