- From: Jon Diamond <strategictech@gmail.com>
- Date: Thu, 24 Jul 2008 04:34:32 -0400
- To: public-qa-dev@w3.org
- Message-ID: <c31c0e670807240134u57cadb94p60363612446143bc@mail.gmail.com>
Please don't mistake this as a suggestion for the markup validation tool. Although I would love to be able to feed multiple pages for evaluation... but for the http://www.w3.org/2003/12/semantic-extractor.html tool... why can't you just disregard invalid character entities? I mean http://validator.w3.org/check?uri=http%3A%2F%2Fwww.imageworksstudio.com%2F&charset=%28detect+automatically%29&doctype=Inline&group=0#result I think that we can all safely assume that with modern programming it is not significantly more taxing on a system to ignore a & standing alone compared to a set of characters... containing a &...; the pattern has the exact same number of parameters... they could almost be called identical from a mathematical standpoint. in either case you are looking for an & and two other characters... be it ![;] or [;] Needless to say I was sad that I could not see my sites data semantically extracted due to a branding decision. I will further research how I can work around this issue as I have time, however; I believe that it is an unnecessary "feature" of xhtml. Regards, JD
Received on Thursday, 24 July 2008 11:37:27 UTC