Re: FYI... Binary Optimized Header Encoding for SPDY

In message <op.wifsn6poiw9drz@manganese.bredbandsbolaget.se>, "Martin Nilsson" 
writes:
>On Thu, 02 Aug 2012 10:27:35 +0200, Poul-Henning Kamp <phk@phk.freebsd.dk>  
>wrote:
>
>> [Not thrilled about UTF8 due to performance]
>
>I would like to know more on what operations you need. I imagine that most  
>relevant operations (splitting, joining, comparing, strlen) can be  
>performed directly on the encoded UTF8 string as efficient as on ASCII.  
>Normalization and upper/lowercasing is trickier, but mostly because of all  
>the Unicode rules, not UTF8 itself(though it doesn't help).

It is indeed the complex Unicode tables that worries me, but as I said,
there may also be advantages to UTF8, so I'm not decided either way.

-- 
Poul-Henning Kamp       | UNIX since Zilog Zeus 3.20
phk@FreeBSD.ORG         | TCP/IP since RFC 956
FreeBSD committer       | BSD since 4.3-tahoe    
Never attribute to malice what can adequately be explained by incompetence.

Received on Friday, 3 August 2012 05:39:37 UTC