& really? from Jon Diamond on 2008-07-24 (public-qa-dev@w3.org from July 2008)

From: Jon Diamond <strategictech@gmail.com>
Date: Thu, 24 Jul 2008 04:34:32 -0400
To: public-qa-dev@w3.org
Message-ID: <c31c0e670807240134u57cadb94p60363612446143bc@mail.gmail.com>

Please don't mistake this as a suggestion for the markup validation tool.
Although I would love to be able to feed multiple pages for evaluation...
but for the http://www.w3.org/2003/12/semantic-extractor.html tool... why
can't you just disregard invalid character entities?

I mean
http://validator.w3.org/check?uri=http%3A%2F%2Fwww.imageworksstudio.com%2F&charset=%28detect+automatically%29&doctype=Inline&group=0#result

I think that we can all safely assume that with modern programming it is not
significantly more taxing on a system to ignore a & standing alone compared
to a set of characters... containing a &...; the pattern has the exact same
number of parameters... they could almost be called identical from a
mathematical standpoint.  in either case you are looking for an & and two
other characters... be it ![;] or [;]

Needless to say I was sad that I could not see my sites data semantically
extracted due to a branding decision.   I will further  research how I can
work around this issue as I have time, however; I believe that it is an
unnecessary "feature" of xhtml.

Regards,
JD

Received on Thursday, 24 July 2008 11:37:27 UTC