How to get all XML revisions for a Wikipedia article?

210 Views Asked by At

I am trying to get all the details of revisions made to a given Wikipedia article in the form of XML files. I am, however, not able to get it completely. For example this link provides an XML containing only 1000 edits.

The details to get all the revisions are given here.

I used curl to get it, however, I could not get what I desire. Has anybody used this method? Or is there any other method to get an XML or a set of XMLs containing all the revisions of an article?

1

There are 1 best solutions below

3
On

You can get all Wikipedia page's revision history by using MediaWiki API:

https://en.wikipedia.org/w/api.php?action=query&format=xml&prop=revisions&rvprop=parsetree&titles=Tiger%20snake&rvlimit=500

rvlimit is 50 for normal users, 500 for registered and 5000 for bots. To get all revisions you need to create a loop by using rvcontinue value from the previous response.