Revised proposed liaison to Unicode re: "Supplemental Subtitle/Caption Character Data"

Hi Nigel,

Per ACTION-409, please find below a revised proposed submission to
Unicode re: "Supplemental Subtitle/Caption Character Data".

I have also updated IMSC1 per related ISSUE-397.

Best,

-- Pierre

""""
The W3C Timed Text Working Group (TTWG) [1] develops specifications
for subtitle and caption delivery applications. It has, in the
process, collected sets of characters (for selected locales) that have
been proven useful in the latter. These sets, documented at [2], are
derived in part from the analysis of home video content. The TTWG
notes that Unicode CLDR does not include characters specifically
intended for subtitling/captioning application, e.g. the QUARTER NOTE
(U+2669) character.

The TTWG therefore suggests that Unicode consider adding the following
"Supplemental Subtitle/Caption Character Data" to the CLDR
supplemental data [3].

https://dvcs.w3.org/hg/ttml/raw-file/bc0f3b1a9104/ttml-ww-profiles/cldr-supplemental-data/cldr-sub-cap-supplemental-data.xml

This data is organized in localized sets of characters
(<localizedSet>) that can include, by reference, common sets of
characters (<commonSet>) that are reused across multiple localized
sets.

The localized sets would be maintained according to the following
(non-mutually-exclusive) rules:

- each localized set should include all characters that can be found
in subtitle/captions intended for presentation in the locale
- each localized set should include the 'base' common set
- when adding exemplar characters to the main, punctuation or number
sets associated with a locale, the same characters should be added to
the corresponding <localizedSet> unless inappropriate for
subtitle/caption applications

The objective of this request is to encourage the creation of a common
set of characters for subtitling and captioning applications that can
be referenced by W3C and other organizations, enhancing the chances
that subtitles/captions are presented consistently across systems.

TTWG is available to provide additional information and looks forward
to hearing from, and working with, the Unicode consortium.

[1] http://www.w3.org/AudioVideo/TT/
[2] https://dvcs.w3.org/hg/ttml/raw-file/tip/ttml-ww-profiles/ttml-ww-profiles.html#recommended-unicode-code-points-per-language
[3] http://www.unicode.org/reports/tr35/tr35-39/tr35-info.html
"""

Received on Monday, 3 August 2015 20:09:40 UTC