I have a list of Unicode code points, something along these lines (not an actual set, for illustration only):
uni050B
uni050C
uni050D
uni050E
uni050F
uni0510
uni0511
uni0512
uni0513
uni1E00
uni1E01
uni1E3E
uni1E3F
uni1E80
uni1E81
uni1E82
uni1E83
uni1E84
uni1E85
uni1EA0
and so forth…
I need to find the unicode-range for these. Some parts of this set are continuous, with some points missing, so the range is not simply U+050B-1EA0.
Is there a sensible way of extracting those continuous "sub-ranges"?
I don't know of anything "off-the-shelf", but it is easy enough to calculate: find the runs of consecutive code points and build a unicode-range from them, for example in Python.
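A minimal sketch of that approach, not necessarily the original answer's code. It assumes every name is the prefix uni followed by hex digits, as in the question's sample; the function name unicode_range is only illustrative.

```python
def unicode_range(names):
    """Collapse names like 'uni050B' into a CSS unicode-range value."""
    # Assumption: each name is 'uni' followed by hex digits.
    # Using a set drops duplicates; sorting makes neighbours adjacent.
    points = sorted({int(name[3:], 16) for name in names})
    if not points:
        return ""

    ranges = []
    start = prev = points[0]
    for cp in points[1:]:
        if cp != prev + 1:
            # Gap found: close the current run and start a new one.
            ranges.append((start, prev))
            start = cp
        prev = cp
    ranges.append((start, prev))  # close the final run

    # A single point is written U+XXXX, a run is written U+XXXX-YYYY.
    return ", ".join(
        f"U+{a:04X}" if a == b else f"U+{a:04X}-{b:04X}"
        for a, b in ranges
    )


names = """
uni050B uni050C uni050D uni050E uni050F uni0510 uni0511 uni0512 uni0513
uni1E00 uni1E01 uni1E3E uni1E3F uni1E80 uni1E81 uni1E82 uni1E83 uni1E84
uni1E85 uni1EA0
""".split()

print(unicode_range(names))
```

For the sample list in the question this prints:

```
U+050B-0513, U+1E00-1E01, U+1E3E-1E3F, U+1E80-1E85, U+1EA0
```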