W3C home > Mailing lists > Public > www-style@w3.org > December 2013

[css-text-module]Line Breaking for Simple Chinese

From: Xiaoqian Cindy Wu <xiaoqian@w3.org>
Date: Tue, 03 Dec 2013 17:11:39 +0800
Message-ID: <529DA04B.4030202@w3.org>
To: www-style <www-style@w3.org>
CC: (wrong string) d <bobbytung@wanderer.tw>, rniwa@apple.com, kojiishi@gluesoft.co.jp
Hello :)

For the CSS Text Module Level
3[http://www.w3.org/TR/css3-text/#line-breaking], its really thoughtful
to introduce [ZHMARK <http://www.w3.org/TR/css3-text/#ZHMARK>]
Punctuation Mark Usage(1995) as a reference, but Im afraid this
reference is a bit out of date now.

In 2011, another 2 national authorized standards(GB/T
158342011Punctuation Mark Usage[1], GB/T 158352011 Numbers Usage) were
published for the text layout of Simple Chinese and replaced the 1995
one. In these new version requirements, more detail about the
line-breaking and position of the Punctuation Mark and Numbers are offered.

I have translated the related paragraphs into English. Since the spec is
already in LC, I'm not sure whether there will be enough time to change
anything, still I hope this helps.

--------------------------------------------------------------------------

*GB/T 158342011
*5. Positioning of Punctuation Mark.
5.1 Positioning of Punctuation Mark in Horizontal Writing Mode
5.1.1 Full stops(U+3002), commas(U+FF0C), ideographic commas(U+3001),
semicolons(U+FF1B), colons(U+FF1A) should all be after the corresponding
text, and carry one em space. They should be placed in the lower left
corner, and could not be the start of a line.
/(for 'loose', breaks before semicolons(U+FF1B), colons(U+FF1A),
Question marks(U+FF1F, U=003F), exclamation marks(U+FF01, U+0021) are
allowed, which might not be //appropriate //for Simple Chinese//)/

5.1.2 Question marks(U+FF1F, U=003F), exclamation marks(U+FF01, U+0021)
should all be after the corresponding text, and carry one em space. They
should be placed in the left side, and could not be the start of a line.
When two question marks(or exclamation marks) are used together, they
should only carry one em space; when three question marks(or exclamation
marks) are used together, they should only carry two em space; when one
question mark and exclamation mark are used together, they should only
carry one em space.

5.1.3 Left quotation marks(U+2018, U+201C), left brackets(U+FF08,
U+3014, U+3010), left double angle brackets(U+300A), and left angle
brackets(U+3008) should be placed on the left side of the relative
characters and could not be the end of a line, while right quotation
marks(U+2019, U+201D), right brackets(U+FF09, U+3015, U+3011), right
double angle brackets(U+300B), and left angle brackets(U+3009) should be
placed on the right side of the relative characters and could not be the
start of a line. Each of these marks should carry one em space.

5.1.4 A double dash(U+2014) is between the two corresponding words, and
carry two em space. It should be aligned to the vertical center of the
corresponding base character, could not be separated into 2 parts nor to
be the start and the end of a line at the same time.

5.1.5 A double ellipsis(U+2026) should carry two em space. When 2 double
ellipsis are used together, they should carry 4 em space and make a
independent line. A double ellipsis could not be separated into 2 parts
nor to be the start and the end of a line at the same time.

5.1.6 The en dash of hyphens(U+2013) is a little shorter than the
Chinese character "one" and should carry half an em space; the dash of
the hyphens(U+2010) is a little longer than the Chinese character "one"
and should carry one em space; the wave dash(U+301C) of the hyphens
should carry one em space. All of the hyphens should be aligned to the
vertical center of the corresponding base character and should not be
the start of a line.
/(for 'loose', breaks before //en dash of hyphens(U+2013)//, //dash of
the hyphens(U+2010)//, //wave dash(U+301C) of the hyphens//are allowed,
which might not be appropriate for Simple Chinese//)/

5.1.7 Interpuncts(U+00B7) are between the two corresponding words and
carry half an em space. They should be aligned to the vertical center of
the corresponding base character and should not be the start of a line.

5.1.8 Emphasis dots and proper marks(underline) should be underneath the
characters.

5.1.9 Slash marks(U+002F) carry half an em space and could not be the
start not the end of a line.

5.1.10 When a punctuation mark is at the end of the line, to beautify
the whole composition, even if it's a full-width character, it should
carry the same em space of a half-width character.

5.1.11 In the practice of composition, for a better composition or
reading experience, or to avoid the line-breaking of the last character
of a bottom paragraph or a new page cause by the last character(which
will result in a wasteful and ugly layout), we could reasonably reduce
the space of the punctuation mark.
*
*5.2 Positioning of Punctuation Mark in Vertical Writing Mode
5.2.1 Full stops(U+FE12), commas(U+FE10), question marks(U+FE16),
exclamation marks(U+FE15), ideographic commas(U+FE11),
semicolons(U+FE14), colons(U+FE13) should all be placed in right corner
under the corresponding text.

5.2.2 Double dashs(U+2014), double ellipsis(U+2026),
interpuncts(U+00B7), slash marks(U+002F) and hyphens should be placed in
the middle under the corresponding text, in a vertical writing mode;

5.2.3 Quotation marks(U+FE41, U+FE42, U+FE43, U+FE44) and
brackets(U+FE35, U+FE36, U+FE37, U+FE38, U+FE39, U+3A) should be up or
down the corresponding text.

5.2.4 Presentation form for vertical wavy low lines(U+FE34) should be on
the left side of the the corresponding text.

5.2.5 Sesame dots(U+FE51) should be on the right side while the
presentation form for vertical low lines(U+FE33) should be on the left
side of the corresponding text.

5.2.6 The rules about whether a certain punctuation mark could be the
start or the end of a line in Horizontal Writing Mode must be honored in
Vertical Writing Mode.

*GB/T 158352011*
5.1.7 Line-breaking
An Arabic number in Chinese should stay in one line and never be broken.

/(All unicode in this translation were found in [3][4][5], plz tell me
if there's any mistakes)/

[1]GB/T 158342011
https://docs.google.com/document/d/1CAS2MZ5ET9CIxFskr8YcXqcaxcrBNyyoOonSs8E8Fds/edit
[2]GB/T 158352011
[3]Unicode for CJK Punctuation Mark
http://www.unicode.org/charts/PDF/U3000.pdf
[4]Unicode for Punctuation Mark in Vertical Writing Mode
http://www.unicode.org/charts/PDF/UFE10.pdf
[5]Unicode for Other CJK Punctuation Mark
http://www.unicode.org/charts/PDF/UFE30.pdf


-- 
Best Regards,
Xiaoqian(Cindy) Wu
Received on Tuesday, 3 December 2013 09:11:52 UTC

This archive was generated by hypermail 2.3.1 : Monday, 2 May 2016 14:39:17 UTC