- From: Kingsley Idehen <kidehen@openlinksw.com>
- Date: Thu, 17 Sep 2009 11:54:52 -0400
- To: Shane McCarron <shane@aptest.com>
- CC: Ivan Herman <ivan@w3.org>, W3C RDFa task force <public-rdf-in-xhtml-tf@w3.org>, public-rdfa@w3.org
Shane McCarron wrote: > Happy to! > > Basically, what you need to remember is that XHTML+RDFa is an XML > dialect, so the document first and foremost needs to be xml well formed. This I know, the page is generated :-) > In particular this means that elements with EMPTY content models > (like img) need to be terminated, attribute values need to have values > and those values need to be quoted, ampersands that are literal > ampersands need to be expressed as an entity. So... for example, > embedded URLs need to have ampersands changed to &. img elements > need to have '/>' at their end. > You can see the complete set of errors for that page at > http://validator.w3.org/check?uri=http%3A%2F%2Fdbpedia.org%2Fpage%2FAlbert_Einstein&charset=(detect+automatically)&doctype=Inline&group=0 > > > Some of the errors are not really a big deal - parsers don't care > about validation per se. But they do care about well-formedness so it > is important the document be well formed. > > Does this help? Or do you need some more specific suggestions? Just need technical description of problems, I am very familiar with all the technology and technical details. Basically, just need rapid-fire identification of technical issue. Since we have our own DBMS hosted XML parser, I just need to check how come we didn't add validation to the generation process :-) Kingsley > > Kingsley Idehen wrote: >> Shane McCarron wrote: >>> I agree that this is great news. However, when I went and LOOKED at >>> the page we use in our examples, >>> http://dbpedia.org/page/Albert_Einstein , I see that it is invalid >>> in a number of respects. It is unfortunate that dbpedia is at the >>> mercy of the resources they troll. The markup ends up being hard to >>> manage, I suspect. >>> I was forced to put my parser into "html" mode to get it to swallow >>> that page: >>> >>> http://htmlwg.mn.aptest.com/rdfa/extract_rdfa.pl?format=n3&type=html&uri=http://dbpedia.org/page/Albert_Einstein >>> >> Shane, >> >> Just provide feedback and we'll have it fixed. I am aware of some the >> issue above. >> >> We are in a continuum, nothing is static etc.. >> >> Just provide feedback :-) >> >> Kingsley >>> >>> Ivan Herman wrote: >>>> FYI >>>> >>>> -------- Original Message -------- >>>> Subject: DBpedia Pages now include RDFa >>>> Resent-Date: Wed, 16 Sep 2009 15:32:01 +0000 >>>> Resent-From: public-lod@w3.org >>>> Date: Wed, 16 Sep 2009 11:31:19 -0400 >>>> From: Kingsley Idehen <kidehen@openlinksw.com> >>>> To: public-lod@w3.org >>>> CC: public-lod <public-lod@w3.org>, >>>> "dbpedia-discussion@lists.sourceforge.net" >>>> <dbpedia-discussion@lists.sourceforge.net> >>>> References: >>>> <f914914c0909160749s4adaa3b5q517b418a1b0a9619@mail.gmail.com> >>>> <4AB1024E.4000309@openlinksw.com> >>>> >>>> All, >>>> >>>> Quick FYI. >>>> >>>> All DBpedia pages now include RDFa. Thus, re. the DBpedia Linked Data >>>> Space, you now have HTML+RDFa as a structured metadata representation >>>> alternative to N3, Turtle, RDF/XML, and RDF/JSON (*new*). >>>> >>>> >>> >> >> > -- Regards, Kingsley Idehen Weblog: http://www.openlinksw.com/blog/~kidehen President & CEO OpenLink Software Web: http://www.openlinksw.com
Received on Thursday, 17 September 2009 15:55:29 UTC