I am trying to get table data from wikipedia but I keep getting the error
AttributeError: 'NoneType' object has no attribute 'findAll'
Here is my code.
from bs4 import BeautifulSoup
import urllib
import urllib.request
wiki = "https://en.wikipedia.org/wiki/List_of_current_United_States_Senators"
page = urllib.request.urlopen(wiki)
soup = BeautifulSoup(page, "lxml")
name = ""
party = ""
state = ""
picture = ""
link = ""
district = ""
table = soup.find("table", { "class" : "wikitable sortable" })
f = open('output.csv', 'w')
for row in table.findAll("tr"):
cells = row.findAll("td")
state = cells[0].find(text=True)
picture = cells[2].findAll(text=True)
name = cells[3].find(text=True)
party = cells[4].find(text=True)
write_to_file = name + "," + state + "," + party + "," + link + "," + picture + "," + district + "\n"
print (write_to_file)
f.write(write_to_file)
f.close()
Any help, even another way to do it (thought about using the wiki api but I'm rather lost on what to use), would be appreciated.
The main problem you are facing is that
soup.find("table", { "class" : "wikitable sortable" })
returnsNone
. There is an element of classsortable wikitable sortable
, though, and maybe you want that element.I fixed that and added an
if
and a fewprint
s. It still doesn't work, but the problem is easier to fix, I guess. Now it's your turn :)