Python convert mix from UTF-16 and UTF-8(?) to regular string

469 Views Asked by At

I have bytes (from requests.get) like this:

<th class=\"app_result_head\">\u0414\u043e\u043b\u0436\u043d\u0438\u043a<\/th>

How do I convert this to proper python string like this?:

<th class="app_result_head">Должник</th>
1

There are 1 best solutions below

4
On BEST ANSWER

my_bytes - 'bytes' in question. As it turns out answer is rather simple.

out = my_bytes.decode('raw_unicode_escape')
out = out.replace('\"', '"')
out = out.replace('\/', "/")

From docs for raw_unicode_escape:

Latin-1 encoding with \uXXXX and \UXXXXXXXX for other code points.

This is exactly what I've needed