I have several files obtained by un-archiving some tar archive using gnu tar
under macOS. These files have names like %8A%AE%AD%E1⠭⨭ - %84%87 %FCML1.ipynb
due to using Cyrillic letters. It seems that %8A
and so on are cp866-codes, but there are also some unicode characters presented (like ⨭
) that appear to be unicode representation of some byte sequences that accidently are valid utf-8 codes. I want to decode everything to unicode/UTF-8 to be able to rename my files. How can I do it?
Decoding cp866-encoded filenames with Cyrillic letters after tar unarchiving
124 Views Asked by Ilya V. Schurov At
1
This Python function can help: