For some reason urllib is corrupting documents that i download from a website. The url is definitely correct. The documents download with their correct names but they appear corrupted once opened. I found that the download is encoded with deflate so i attempted to decode it but i keep getting either these two errors when i modify the "zlib" part.
errors:
error-2 while preparing to decompress data: inconsistent stream state
or
error -3 while decompressing data: incorrect header check
here is a code snippet of where the problem is:
def download_file(url, name):
try:
f = urllib.urlopen(url)
fh = open(name, 'wb')
if f.info().get('content-encoding') == 'deflate':
fh = zlib.decompress(f.read(),16 +zlib.MAX_WBITS)
#fh.write(f.read())
fh.close()
print " File Downloaded : " , name
except Exception:
raise