Easton Man's Channel
23:54 · May 15, 2023 · Mon
Daniel Lemire's blog
Computing the UTF-8 size of a Latin 1 string quickly (ARM NEON edition)
Telegraph
|
source
Telegraph
Computing the UTF-8 size of a Latin 1 string quickly (ARM NE…
While most of our software relies on Unicode strings, we often still encounter legacy encodings such as Latin 1. Before we convert Latin 1 strings to Unicode (e.g., UTF-8), we must compute the size of the UTF-8 string. It is fairly easy: all ASCII characters…
Home
Powered by
BroadcastChannel
&
Sepia