How to count the correct length of a string with emojis in javascript?

Question

I&#8217;ve a little problem. I&#8217;m using NodeJS as backend. Now, an user has a field &#8220;biography&#8221;, where the user can write something about himself. Suppose that this field has 220 maxlength, and suppose this as input: As you can see there aren&#8217;t 220 emojis (there are 37 emojis), but if I…

Accepted Answer

str.length gives the count of UTF-16 units.Unicode-proof way to get string length in codepoints (in characters) is [...str].length as iterable protocol splits the string to codepoints.If we need the length in graphemes (grapheme clusters), we have these native ways:a. Unicode property escapes in RegExp. See for example: Unicode-aware version of w or Matching emoji.b. Intl.Segmenter — coming soon, probably in ES2021. Can be tested with a flag in the last V8 versions (realization was synced with the last spec in V8 86). Unflagged (shipped) in V8 87.See also:The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)What every JavaScript developer should know about UnicodeJavaScript has a Unicode problemUnicode-aware regular expressions in ES2015ES6 Strings (and Unicode, ❤) in DepthJavaScript for impatient programmers. Unicode – a brief introduction

Advertisement

Answer