Re: Chinese typography and U+FF5E ~ FULLWIDTH TILDE

So, for Japanese layout this could possibly change the behavior of U+FF5E from being essentially Western to being the same as U+301C?

Generally when there are two codepoints available for a given glyph, in Japanese this allows for two different behaviors or use cases for these similar glyphs. The SJIS to Unicode conversion path of the wavy dash (U+301C) informs that it should follow Japanese rules (and prohibit line start); but if you want “Western” treatment of the full-width tilde (U+FF5E) (i.e. be allowed to start the line), your proposal #3 would eliminate that “feature”?

--Nat

On 2/28/17, 11:07 AM, "Eric Muller" <eric.muller@efele.net> wrote:

    CLREQ currently says that U+FF5E ~ FULLWIDTH TILDE is prohibited at line 
    start, not prohibited at line end (Appendix A). Its Unicode lb property 
    is ID, which allows this character to be a line start in most cases, and 
    therefore does not satisfy JLREQ. There is no mention of U+301C 〜 WAVE 
    DASH.
    
    JLREQ lists U+301C 〜 WAVE DASH in cl-03 hyphens, prohibits it at line 
    start, and not at line end (just like CLREQ does for U+FF5E). Its 
    Unicode lb property is NS, which satisfies JLREQ. There is no mention of 
    U+FF5E (JLREQ ignores all fullwidth characters). U+007F TILDE is listed 
    as a western character, proportional.
    
    I can think of three solutions:
    - use U+301C 〜 WAVE DASH in CLREQ
    - tailor lb for Chinese to make U+FF5E have lb = NS
    - just make U+FF5E hae lb = NS
    
    In a corpus of ~30K Chinese books, I find 681,803 occurrences of U+FF5E 
    ~ FULLWIDTH TILDE, but only 3,258 occurrences of U+301C 〜 WAVE DASH. It 
    seems to me that Chinese users have voted on U+FF5E, and that the first 
    solution is not viable.
    
    I don't see a downside to the third solution, so it is my current best 
    proposal.
    
    Other solutions? suggestions?
    
    Thanks,
    Eric.
    
    
    
    

Received on Tuesday, 28 February 2017 22:44:59 UTC