How can I allow only alphanumeric characters, including Chinese, Japanese, and other non-Latin scripts?

Question

I'm trying to strip unwanted characters from a string so that only alphanumeric ones remain, but I also need to keep Chinese, Japanese, and other non-Latin scripts. After some hours of reading about regular expressions, I'm more confused than informed. Currently I have: Without the {Han} everything works well, but Chinese characters are not matched. Any ideas? I want to

Accepted Answer

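I suggest replacing every run of characters other than letters and digits with a space, then trimming the result. Note that the \p{...} property escapes only work when the regex has the u flag:

    let string = 'Test=😕查看         ';
    string = string.replace(/[^\p{L}\p{N}]+/ug, ' ').trim();
    console.log(string); // Test 查看

If you need to allow diacritics, add \p{M} to the character class:

    string.replace(/[^\p{L}\p{N}\p{M}]+/ug, ' ').trim();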
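If you want to reuse this, or to reject bad input instead of cleaning it, the same character class can be wrapped in two small helpers. This is only a sketch: the names sanitize and isAlphanumeric are made up for illustration and are not part of any library, and it assumes a runtime with Unicode property escape support (ES2018 or later).

```javascript
// Hypothetical helper names, chosen for illustration only.
// \p{L} = letters, \p{N} = numbers, \p{M} = combining marks (diacritics).
// The u flag is required for \p{...} to be recognized.
const NON_ALNUM = /[^\p{L}\p{N}\p{M}]+/gu;

// No g flag here, so test() keeps no state between calls.
const ONLY_ALNUM = /^[\p{L}\p{N}\p{M}]+$/u;

// Replace each run of disallowed characters with one space, then trim.
function sanitize(input) {
  return input.replace(NON_ALNUM, ' ').trim();
}

// Return true only if every character is a letter, number, or mark.
function isAlphanumeric(input) {
  return ONLY_ALNUM.test(input);
}

console.log(sanitize('Test=😕查看         ')); // Test 查看
console.log(isAlphanumeric('日本語123'));       // true
console.log(isAlphanumeric('abc!'));            // false
```

Keep in mind that \p{N} covers more than 0-9 (for example fractions and Roman numeral characters). If you only want decimal digits from any script, \p{Nd} is the narrower category.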


Answer