A little confusion about the java native2ascii tool. Definition for the tool in Java 6:
Converts a file with native-encoded characters (characters which are non-Latin 1 and non-Unicode) to one with Unicode-encoded characters.
Then why does it also transform characters belonging to Latin 1 table (such as é) to unicode encoded representation (\u00e9) ???
Latin 1 (iso 8859-1) table is available here for instance http://en.wikipedia.org/wiki/ISO/IEC_8859-1#Codepage_layout
That implies that i cannot directly work with properties files for some european languages such as french.
To clarify my question:
native2ascii shouldnt convert latin1 characters (as per its description). é is a valid latin1 character. Therefore why is it converted ?
You can work with properties files with french and other characters. Properties accepts
\uxxxx
sequences. You can work with national characters directly since Properties has load(Reader reader) method. Then the file can be in any encoding, you will provide the reader that decodes the file correctly, egnew InputStreamReader(new FileInputStream(1.properities), Charset.forName("ISO-8859-1"));
I also agree that native2ascii should not convert
é
because it's a legal latin-1 char and docs says latin-1 chars are not converted.