- From: Poul-Henning Kamp <phk@phk.freebsd.dk>
- Date: Fri, 03 Aug 2012 05:39:14 +0000
- To: "Martin Nilsson" <nilsson@opera.com>
- cc: ietf-http-wg@w3.org
In message <op.wifsn6poiw9drz@manganese.bredbandsbolaget.se>, "Martin Nilsson" writes: >On Thu, 02 Aug 2012 10:27:35 +0200, Poul-Henning Kamp <phk@phk.freebsd.dk> >wrote: > >> [Not thrilled about UTF8 due to performance] > >I would like to know more on what operations you need. I imagine that most >relevant operations (splitting, joining, comparing, strlen) can be >performed directly on the encoded UTF8 string as efficient as on ASCII. >Normalization and upper/lowercasing is trickier, but mostly because of all >the Unicode rules, not UTF8 itself(though it doesn't help). It is indeed the complex Unicode tables that worries me, but as I said, there may also be advantages to UTF8, so I'm not decided either way. -- Poul-Henning Kamp | UNIX since Zilog Zeus 3.20 phk@FreeBSD.ORG | TCP/IP since RFC 956 FreeBSD committer | BSD since 4.3-tahoe Never attribute to malice what can adequately be explained by incompetence.
Received on Friday, 3 August 2012 05:39:37 UTC