W3C home > Mailing lists > Public > semantic-web@w3.org > December 2010

Re: More information about XMLified, HTML

From: Antoine Zimmermann <antoine.zimmermann@insa-lyon.fr>
Date: Mon, 13 Dec 2010 11:06:51 +0100
Message-ID: <4D05F03B.70306@insa-lyon.fr>
To: adasal <adam.saltiel@gmail.com>
CC: nathan@webr3.org, Kamler Hammez <hammezkam@gmail.com>, semantic-web@w3.org
Le 12/12/2010 16:35, adasal a écrit :
> Antoine,
> When you say '...and turning HTML pages into valid XML is not something
> addressed by the semantic web' I am not sure what you mean. What is the
> 'semantic web' here?

By "semantic web" I don't mean something very specific but it definitely 
is about RDF, not XML and HTML, although you can have RDF in XML or 
embed RDF in HTML via RDFa.

> I think it is addressed by a python tool here
> http://code.google.com/p/semantic-fire/

This does not convert HTML pages into well-formed XML. This addresses 
the issue that is raised on Semantic Overflow as pointed out by Kammler 
Hammez but is not relevant for the subject of this thread.

> and there must be other tools, too, but I don't have references to hand.
> Generally the problem is referred to as Lifting, although this is restricted
> to XML -->  RDF and RDF -->  some expressive representation of the semantics
> contained. So XML -->  RDFa would be an example.
> This leaves the problem of HTML -->  XML. I'm not sure that HTMLtidy will get
> all the way here though.

HTML Tidy turns a HTML web page into a well-formed XML document. Again, 
there's nothing related to RDF and it has nothing semantic, it's pure 
syntactical transformation.
But if you have XML documents instead of messy HTML pages (as most 
webpages online are), you can use XML-specific tools to, for instance, 
generate RDF or something else. But this does not make the 
"XMLification" a semantic web issue.

> I haven't used semantic fire but is should be capable of the task.

Semantic Fire generates RDF, which is not the subject of this thread but 
thanks for mentioning it, may be useful for other tasks.


> Best,
> Adam
> On 11 December 2010 06:27, Antoine Zimmermann<
> antoine.zimmermann@insa-lyon.fr>  wrote:
>> XMLified is not an existing word. I used it as I could have said RDFising
>> data or SPARQLising queries or whatever. The meaning is exactly what Nathan
>> said, that is, something that is turned into valid XML. For HTML pages, it
>> can be done by using HTML Tidy. It is not a semantic web term and turning
>> HTML pages into valid XML is not something addressed by the semantic web.
>> AZ.
>> Le 11/12/2010 00:20, Nathan a écrit :
>>   Kamler Hammez wrote:
>>>> Hi,
>>>> I'am a bit lost and finding few documents, but what could be a simple
>>>> definition of XMLified with HTML pages? And what are relationships
>>>> that may exisit between both?
>>> Could you give some background on why you're asking the question?
>>> XMLified isn't a usual term to hear around the semantic web space
>>> (perhaps I'm wrong) - my immediate thought is:
>>> XMLified = something thing that has been turned in to valid XML
>>> XMLified HTML = XHTML
>>> However, I'm unsure what that has to do with the semantic web or why
>>> you're asking?
>>> Hoping to help more,
>>> Best,
>>> Nathan

Antoine Zimmermann
Researcher at:
Laboratoire d'InfoRmatique en Image et Systèmes d'information
Database Group
7 Avenue Jean Capelle
69621 Villeurbanne Cedex
Lecturer at:
Institut National des Sciences Appliquées de Lyon
20 Avenue Albert Einstein
69621 Villeurbanne Cedex
Received on Monday, 13 December 2010 10:07:26 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 19:48:19 UTC