Replacing one unicode character in a string

39 Views Asked by Pekka Henttonen At 05 February 2024 at 11:01

I have a problem with an utf-8 encoded XML text. In Notepad++ it looks like this. Besides normal spaces there are some U+2002 ("ENSP") characters.

How can I get rid of them:

str.replace("u'2002'", " ") does nothing.

str.encode and str.decode are not available, because of the version of Python.

Regex and unicodedata.normalize loose Scandinavian characters and make the text generally unusable.

There are 0 best solutions below