W3C home > Mailing lists > Public > ietf-http-wg@w3.org > January to March 2013

Re: Delta Compression and UTF-8 Header Values

From: Frédéric Kayser <f.kayser@free.fr>
Date: Sun, 10 Feb 2013 07:44:19 +0100
Message-Id: <A4C04DB9-2524-49EC-8774-AF2EBF3EA350@free.fr>
To: ietf-http-wg@w3.org
Comparing Unicode strings without prior normalisation can lead to surprising results: "Frédéric" and "Frédéric" will probably look the same in your email client, but try to paste them in the search field and you'll probably see that they don't match since the first one uses precomposed diacritics and the second one combining ones.

Le 9 févr. 2013 à 15:12, Martin Nilsson a écrit :

> You don't need to know the length in characters to compare strings. Just comparing byte on byte works fine. Null is encoded the same, and byte zero only appear as null in UTF-8, so strlen works fine. So far strings are hollerith encoded in HTTP/2, so it should be a moot point anyway.
Received on Sunday, 10 February 2013 06:44:48 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Sunday, 10 February 2013 06:44:54 GMT