W3C home > Mailing lists > Public > public-html-bugzilla@w3.org > July 2011

[Bug 13292] New: Needs Clarification: What is an end-of-file marker in the context of Unicode? My guess is you could use one of the Byte Order Mark as noted in [http://msdn.microsoft.com/en-us/site/dd374101]? In particular, "The Unicode value U+FFFF is illegal in plain te

From: <bugzilla@jessica.w3.org>
Date: Tue, 19 Jul 2011 00:44:16 +0000
To: public-html-bugzilla@w3.org
Message-ID: <bug-13292-2486@http.www.w3.org/Bugs/Public/>
http://www.w3.org/Bugs/Public/show_bug.cgi?id=13292

           Summary: Needs Clarification: What is an end-of-file marker in
                    the context of Unicode? My guess is you could use one
                    of the Byte Order Mark as noted in
                    [http://msdn.microsoft.com/en-us/site/dd374101]? In
                    particular, "The Unicode value U+FFFF is illegal in
                    plain te
           Product: HTML WG
           Version: unspecified
          Platform: Other
               URL: http://www.whatwg.org/specs/web-apps/current-work/#web
                    vtt-cue-text-parsing-rules
        OS/Version: other
            Status: NEW
          Severity: normal
          Priority: P3
         Component: other Hixie drafts (editor: Ian Hickson)
        AssignedTo: ian@hixie.ch
        ReportedBy: contributor@whatwg.org
         QAContact: public-html-bugzilla@w3.org
                CC: mike@w3.org


Specification:
http://www.whatwg.org/specs/web-apps/current-work/multipage/the-video-element.html
Multipage: http://www.whatwg.org/C#webvtt-cue-text-parsing-rules
Complete: http://www.whatwg.org/c#webvtt-cue-text-parsing-rules

Comment:
Needs Clarification: What is an end-of-file marker in the context of Unicode?
My guess is you could use one of the Byte Order Mark as noted in
[http://msdn.microsoft.com/en-us/site/dd374101]? In particular, "The Unicode
value U+FFFF is illegal in plain text files and cannot be passed between
applications. It is reserved for the private use of an application." I have
two concerns about this. First, since this codepoint is not actually a legal
value, some languages might check (not that I know of any) and outright refuse
to assign it to a variable of character type. Second, there is a chance the
input already contains this value (0xef 0xbf 0xbf).

Posted from: 78.224.146.118
User agent: Opera/9.80 (Windows NT 6.1; U; fr) Presto/2.9.168 Version/11.50

-- 
Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
Received on Tuesday, 19 July 2011 00:44:19 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 7 January 2015 16:31:13 UTC