codepoints
Break text into Unicode scalars with names and UTF-8/16/32 byte views.
Instructions
Break text into its code points with names and UTF-8/16/32 byte views.
Returns one entry per Unicode scalar (astral characters stay whole): the
char, its codepoint as 'U+XXXX', the Unicode name (or a
placeholder for unnamed control/format/private-use scalars), and big-endian
utf8/utf16/utf32 byte views as hex. count is the code-point length,
which differs from len() only for surrogate-pair-bearing input.
Example: codepoints("é") -> count 1, char "é" at codepoint "U+00E9"
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| text | Yes | Text to break into its constituent Unicode scalars. |