History log of /dragonfly/test/debug/utf8bin.c (Results 1 – 1 of 1)
Revision (<<< Hide revision tags) (Show revision tags >>>) Date Author Comments
Revision tags: v6.2.1, v6.2.0, v6.3.0, v6.0.1, v6.0.0, v6.0.0rc1, v6.1.0, v5.8.3, v5.8.2, v5.8.1, v5.8.0, v5.9.0, v5.8.0rc1, v5.6.3, v5.6.2, v5.6.1, v5.6.0, v5.6.0rc1, v5.7.0, v5.4.3, v5.4.2, v5.4.1, v5.4.0, v5.5.0, v5.4.0rc1, v5.2.2, v5.2.1, v5.2.0, v5.3.0, v5.2.0rc, v5.0.2, v5.0.1, v5.0.0, v5.0.0rc2, v5.1.0, v5.0.0rc1, v4.8.1, v4.8.0, v4.6.2, v4.9.0, v4.8.0rc, v4.6.1, v4.6.0, v4.6.0rc2, v4.6.0rc, v4.7.0, v4.4.3, v4.4.2, v4.4.1, v4.4.0, v4.5.0, v4.4.0rc
# 8a84c799 16-Aug-2015 Matthew Dillon <dillon@apollo.backplane.com>

locale - Add two new API functions

* Add wcrtombin() and mbintowcr(), powerful buffer-to-buffer functions
to translate UTF8, which can also perform escaping and de-escaping,
allow legacy parsing

locale - Add two new API functions

* Add wcrtombin() and mbintowcr(), powerful buffer-to-buffer functions
to translate UTF8, which can also perform escaping and de-escaping,
allow legacy parsing, and do checks with varying levels of strictness.

NOTE: If no locale or "C" locale, these functions are 1:1 (do not do
UTF8 processing). A UTF8 locale must be used to access these
functions.

* Add wcrtoutf8() and utf8towcr(). These work exactly the same as
the above functions but are locale-agnostic. They always operate in
UTF8 and locale initialization is not necessary.

* Escaping and de-escaping for 8-bit-clean round-trips with two of the
modes.

* Can construct UTF8B from unescaped 8-bit-clean data, can deconstruct
UTF8B (restore the original clean data), and can also be flagged
generically to take already-escaped UTF8B or wchars as input.

* Can be used to perform strict checks.

* Very powerful incremental buffer validation and processing capabilities.
Far more visibility to the caller rather than hiding state in a tracking
state structure. No tracking state structure.

* Does not do idiotic nul buffer termination. Caller passes explicitly
sized data, can opt to include or not include a terminator (depending
on what is desired), and the byte buffers can contain any 8-bit code
including 0x00.

show more ...