Windows use uniscribe library to substitute arabic and indi typed characters based on their location. The new glyph is still have the original unicode of the typed character althogh it has its dedicated representation in Unicode How to get the Unicode of what is actually displayed not what is typed.
How to get glyph unicode representation of Unicode character
1.8k Views Asked by esaesa At
2
There are 2 best solutions below
1
Mihai Nita
On
Your interpretation of what is happening in Uniscribe is not correct. Once you have glyphs the original information is gone there is no reliable way to go back to Unicode.
Even without going to Arabic, there is no way to distinguish if the glyph for the fi ligature (for example) comes from 'f' and 'i' (U+0066 U+0069) or from 'fi' (U+FB01). (http://www.fileformat.info/info/unicode/char/fb01/index.htm)
Also, some of the resulting glyphs do not have a Unicode value associated with them, so there is no "Unicode of what is actually displayed"
Related Questions in WINDOWS
- Get Maximum Log Size
- Debugging Windows Services while starting
- Possible consequences of duplicate ProgId for different classes
- How to chain BCryptEncrypt and BCryptDecrypt calls using AES in GCM mode?
- mingw-64 conflicting declarations when cross-compiling
- I run an EXE program from a Windows Service but I can't see form C#?
- Why is PowerShell "not recognized" when installing Chocolatey?
- How to check if Windows device is phone or tablet/pc?
- How to add directories to Cygwin gcc default search path
- Can't install anything with pip2 on Windows 7 due to UnicodeDecodeError
- Active directory and linux nslcd binding without extending the AD schema
- How To Prevent Over Scrolling in Scroll Viewer Windows Phone 8.1
- Unicode error from pip install
- Where is the 'EnablePinning' property in the ribbon framework's recent items?
- How can I implement the same models and data across ASP.NET and Windows Apps
Related Questions in UNICODE
- Why is executing Java code in comments with certain Unicode characters allowed?
- LXML to write in unicode?
- erlang os:cmd() command with UTF8 binary
- How to encode bytes as a printable unicode string (like base64 for ascii)
- Unicode error from pip install
- How to express the full range of values of a char in F#?
- Change lowercase and uppercase of characters in java
- Need code for removing all unicode characters in vb6
- Error passing Unicode string through JSONObject
- How to combine Unicode characters
- FreeType2 and OpenGL : Use unicode
- Unicode Japanese prolonged sound mark excluded from Kana script?
- Parsing string containing Unicode character names
- How can I add an icon to select box choices?
- Displaying unicode characters in Python 3
Related Questions in FONTS
- How do I find the fonts that are not loading in a CORS situation ( MoovWeb )?
- css font-family doesnt work on internet explorer
- How get to know what font family does anchor element use
- Firefox Windows / Linux inconstancies
- Using GLYPHICONS's free pngs with classes
- Java Swing: JLabel´s Text / Font shown incompletely
- Xib taking long time (>1s) to load. UIFont cache seems to blame
- HTML validation generates error
- Failed to decode downloaded font on expressJS/NodeJS
- fpdf custom font works but not on any other computer
- Is there a font with 3px max width
- Design Issue - Making a Font Global (C++, Marmalade)
- How to understand this CSS inheritance (or override, selection)
- Adobe Flash Pro: Locate asset with missing font
- Use font awesome star rating define by width
Related Questions in UNISCRIBE
- Calculating the positions of glyphs in Windows
- Converting LTR to RTL?
- What exactly happens when Complex Script Support is enabled?
- Uniscribe and Text Outlines
- Facing challenges in ScriptShape(USP10.dll) Windows API in Windows 10
- How to adjust the width of a font using the Uniscribe API
- Arabic letter noon ghunna incorrectly displayed with a dot
- Trying to export a report with unicode text to pdf using SynPDF, results in mixed-up text
- How to tell if a surrogate pair Unicode character is supported by the font?
- Using ScriptGetProperties with Delphi
- Why doesn't FONTSIGNATURE reflect lfCharSet?
- How to draw rotated text with ScriptTextOut?
- complex text layout of Indic scripts
- How to implement Uniscribe for Linux
- Uniscribe ScriptShape character not found in Simplified Arabic Windows Svr 2012 or W10
Related Questions in GLYPH-SUBSTITUTION
- pdfbox additionalGlyphs file can be used to add custom glyphs?
- How to show alternative CJK glyphs in WIndows CE?
- Want to use glyph getting error in a C# WinUI 3 Desktop program
- How to use anchored pairs with fonts on Android
- fonttools reading cmap data
- How to correctly print Indic (Telugu) text in QB64 on Windows 64-bit?
- How do I make a font similar to the Segoe UI Emoji font?
- Is there an easy way to create fonts that use alternate Unicode glyphs
- PDF Tamil writing using PDFBox
- Debugging font substitution in windows UNIDRV
- Using font character substitutions in CSS or JS
- Browser support for UTF-8 symbols
- How to get glyph unicode representation of Unicode character
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
There are lots of tools for this like ICU, Charmap and the rest. I myself recommand http://unicode.codeplex.com, it uses Unicode Character Database to represent characters.
Note that unicode is just some information about characters and never spoke about representation. They just suggest to implement a word just like their example. so that to view each code you need
Standard Unicode Fontlike MS Arial Unicode whichis the largest and the best choise in windows platform.Most of the characters are implemented in this font but for new characters you need an update for it (if there are such an update) or you can use the font which you know that it implemented your desire characters