From Newsgroup: comp.lang.fortran
On Mon, 25 Nov 2024 08:35:48 -0300, Wolfgang Agnes wrote:
It's a bit difficult to understand ``surrogates''.
The Unicode folks just decided that the ranges 0xD800-0xDBFF (1024 codes
of “high surrogates”) and 0xDC00-0xDFFF (1024 codes of “low surrogates”)
would be used in pairs to represent codes above 0xFFFF in UTF-16 encoding. This gives an additional 1024×1024 = 1048576 different codes, which should
be enough to cover the entire (current) Unicode range, which officially
goes up to 0x10FFFF. At least, that’s what they’re saying right now.
In the full UCS-4 encoding, those ranges are considered invalid.
--- Synchronet 3.20a-Linux NewsLink 1.114