I am trying to scrape the website in this link:
https://www.whoscored.com/Players/11119/Show/Lionel-Messi
I want to get the data from the Summary, Defensive, Offensive tabs and i tried this:
import requests
# This is the json data link i wish to get from the defensive tab (from developer tools)
url = "https://www.whoscored.com/StatisticsFeed/1/GetPlayerStatistics?category=summary&subcategory=defensive&statsAccumulationType=0&isCurrent=true&playerId=11119&teamIds=&matchId=&stageId=&tournamentOptions=&sortBy=Rating&sortAscending=&age=&ageComparisonType=&appearances=&appearancesComparisonType=&field=Overall&nationality=&positionOptions=&timeOfTheGameEnd=&timeOfTheGameStart=&isMinApp=false&page=&includeZeroValues=true&numberOfPlayersToPick="
response = requests.get(url)
print(response.json())
I can't manage to print the json data and scrape the data I need from the tab, am I doing anything wrong? Thanks for the help.
When executing your code as provided, the response comes back with a 403 "Forbidden" status.
Probably this provider doesn't want their data scraped and checks the request for signs of non-browser access, and returns this error.
Be sure to check your
response.status_code
before proceeding.