Is there any limit to the number of distinct graphemes that can be represented with a Unicode encoding such as UTF-8? Does, for example, the Unicode standard restrict the number of consecutive combining characters?
Is the set of distinct graphemes infinite?
176 Views Asked by Anthony Faull At
1
There are 1 best solutions below
Related Questions in UNICODE
- Why is executing Java code in comments with certain Unicode characters allowed?
- LXML to write in unicode?
- erlang os:cmd() command with UTF8 binary
- How to encode bytes as a printable unicode string (like base64 for ascii)
- Unicode error from pip install
- How to express the full range of values of a char in F#?
- Change lowercase and uppercase of characters in java
- Need code for removing all unicode characters in vb6
- Error passing Unicode string through JSONObject
- How to combine Unicode characters
- FreeType2 and OpenGL : Use unicode
- Unicode Japanese prolonged sound mark excluded from Kana script?
- Parsing string containing Unicode character names
- How can I add an icon to select box choices?
- Displaying unicode characters in Python 3
Related Questions in DIACRITICS
- Slidify no longer renders accent marks
- How to import Romanian diacritics from csv to mysql database
- How to remove diacritics (accents) from a string?
- Why does this code to replace accented chars with html codes fail to work?
- Php preg_match for french characters
- How to display accented Latin letters
- FTP Error 553 - Filename not allowed using umlauts
- python write umlauts into file
- SQLITE custom Accent collation function and LIKE queries
- Marklogic Diacritic Sensitive search not working for unfiltered searches
- charset issue with XSS api in CQ5 , à being displayed as �
- Weird issue with encoding diacritics
- make url from umlaut words in python
- SQL cuts polish diacritics
- Python: filenames with german umlaut
Related Questions in GRAPHEME
- Is the set of distinct graphemes infinite?
- Unicode GraphemeBreakProperty spec including extra characters?
- How to iterate over grapheme clusters in Crystal?
- Split Unicode entities by graphemes
- C#'s StringInfo and TextElementEnumerator can't recognize graphemes properly
- How to split Devanagari bi-tri and tetra conjunct consonants as a whole from a string?
- Is there a way to map english letter(s) (or graphemes) in word from correspondent phoneme(s) in Python?
- get unicode graphemes as unsplitted item with python2.7
- convert similar sound word parts
- How does the SQL length function handle unicode graphemes?
- C++ Unicode: Bytes, Code Points and Graphemes
- What is the difference between ‘combining characters’ and ‘grapheme extenders’ in Unicode?
- Get grapheme character count in javascript strings?
- How do I identify which letter of the alphabet a word starts with in Objective-C?
- regular expression to match name initials - PCRE
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
The set of possible combinations of a character and combining marks after it is infinite (though only countably infinite ☺). The Unicode Standard says explicitly in clause 2.1 (in chapter 2): “All combining characters can be applied to any base character and can, in principle, be used with any script.” A combination of a letter and a diacritic can be used as a base character for another diacritic, and so on.
At a higher protocol level, as in a data format specification, you can of course impose limit e.g. on the number of consecutive combining marks. The Unicode Standard, however, does not set such restrictions.