utf8_decode for objective-c

1.3k Views Asked by At

Possible Duplicate:
unicode escapes in objective-c

I have a LATIN1 string.

Artîsté

When I json_encode it, it escapes some chars and converts it to single byte UTF8.

Art\u00eest\u00e9

If I just json_decode it, I believe it is decoding in UTF8

Artîsté

In order to get my original string back, I have to call utf8_decode

Artîsté

Is there a way to handle this conversion in objective-c?

2

There are 2 best solutions below

1
On BEST ANSWER

You might be looking for this:

NSString *string = (some string with non-ASCII characters in it);
char const *string_as_latin1 = [string cStringUsingEncoding:NSISOLatin1StringEncoding];

or possibly this:

NSData *data_latin1 = [string dataUsingEncoding:NSISOLatin1StringEncoding allowLossyConversion:YES];
1
On

I have a LATIN1 string.

I don't think you do. Assuming you are talking about PHP, json_encode() only accepts UTF-8 strings, and bails out if it hits a non-UTF-8 high-byte sequence:

json_encode("Art\xeest\xe9")
"Art"
json_encode("Art\xc3\xaest\xc3\xa9")
"Art\u00eest\u00e9"

I think you had a proper UTF-8 string to start with, then you encoded and decoded it to get the exact same UTF-8 string back. But then you're displaying it or processing it in another step you haven't shown us, that treats your string as if it were Latin-1.