I have the following code that I am using to try to chunk response from a http.client.HTTPSConnection get request to an API (please note that the response is gzip encoded:
connection = http.client.HTTPSConnection(api, context = ssl._create_unverified_context())
connection.request('GET', api_url, headers = auth)
response = connection.getresponse()
while chunk := response.read(20):
data = gzip.decompress(chunk)
data = json.loads(chunk)
print(data)
This always gives out an error that it is not a gzipped file (b'\xe5\x9d')
. Not sure how I can chunk data and still achieve what I am trying to do here. Basically, I am chunking so that I don't have to load the entire response in memory.
Please note I can't use any other libraries like requests, urllib etc.
The most probable reason for that is, the response your received is indeed not a gzipped file.
I notice that in your code, you pass a variable called
auth
. Typically, a server won't send you a compressed response if you don't specify in the request headers that you can accept it. If there is only auth-related keys in your headers like your variable name suggests, you won't receive a gzipped response. First, make sure you have'Accept-Encoding': 'gzip'
in your headers.Going forward, you will face another problem:
gzip.decompress
will expect a complete file, so you would need to reconstruct it and load it entirely in memory before doing that, which would undermine the whole point of chunking the response. Trying to decompress a part of a gzip withgzip.decompress
will most likely give you anEOFError
saying something likeCompressed file ended before the end-of-stream marker was reached
.I don't know if you can manage that directly with the gzip library, but I know how to do it with
zlib
. You will also need to convert yourchunk
to a file-like object, you can do that withio.BytesIO
. I see you have very strong constraints on libraries, butzlib
andio
are part of the python default, so hopefully you have them available. Here is a rework of your code that should help you going on: