Regular expression for capturing all skin-tone variations of an emoji

1.4k Views Asked by Cai At 31 March 2016 at 11:03

I'm trying to use a regex to capture tweets containing the substring at least twice, so I'm using an unsophisticated ^.+ .+ .+$. However this doesn't match strings which instead contain, for example, .

Is there a smart way I can capture an emoji with any or none skin-tone variation, without just putting each one in a row (like [])?

Original Q&A

There are 1 best solutions below

Cai On 31 March 2016 at 11:29 BEST ANSWER

Thanks to comments above, I've found that emojis I've encountered on twitter are unicode, and skin-tone variations are combining characters in the range 1f3fb–1f3ff.

http://unicode.org/reports/tr51/#Emoji_Modifiers_Table

So for me what I wanted was [\x{1f3fb}-\x{1f3ff}]?, with [\x{1f3fb}-\x{1f3ff}]? being something I can then drop next to any unmodified emoji to include skin-tone variations.

Regular expression for capturing all skin-tone variations of an emoji

There are 1 best solutions below

Related Questions in REGEX

Related Questions in EMOJI

Related Questions in ONIGURUMA

Related Questions in EMOJI-TONES

Trending Questions

Popular # Hahtags

Popular Questions