- From: Fabien Gandon <Fabien.Gandon@sophia.inria.fr>
- Date: Tue, 29 Aug 2006 17:25:53 +0200
- To: public-grddl-wg <public-grddl-wg@w3.org>
(Including Harry's comments)
Use case #7 - Digital libraries and focused indexes: W3C wants to
automate the publication of Technical Reports.
Many organizations produce and publish documents on the web (e.g. white
papers, standards, user guides, administrative forms, etc.) and thus
many organizations grow digital libraries as part of their web site.
As an example, the most visible part of W3C work is its Technical
Reports published by the working groups. These reports are published
following a well-defined process. TR Automation is a project based on
the use of Semantic Web technologies to allow W3C to streamline the
publication paper trail of W3C Technical Reports, to maintain an
RDF-formalized index of these specifications and to create a number of
tools using these newly available data. This scenario can be generalized
to any organization interested in maintaining a portal to a digital
library with customized indexes, dedicated search forms, navigation
widgets, etc.
The W3C TR Automation project aims at generating a formalized and
authoritative list of W3C Technical Reports in RDF, updated as soon as a
new Technical Report is officially published. This RDF list is then used
to generate a number of web pages and tools including:
* customized indexes of the Technical Reports list (sorted by
status, by date, by editor, by title, or by activity/group);
* statistics and reporting tools on the publication records of
Technical Reports, which rely on SPARQL queries run against the
metadata extracted from the reports;
* bibliographic tools.
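As a rough illustration of the customized indexes listed above, the
sketch below groups a handful of report records by one field and sorts
each group by date. The records, their field names and the index_by
helper are all hypothetical; in the real project the data would come
from the RDF list rather than from Python dictionaries.

```python
from collections import defaultdict

# Hypothetical records standing in for the RDF list of Technical Reports.
# Titles, dates, statuses and editor names are invented for illustration.
reports = [
    {"title": "Example Spec A", "status": "REC", "date": "2004-02-10",
     "editors": ["Editor One"]},
    {"title": "Example Spec B", "status": "WD", "date": "2006-08-29",
     "editors": ["Editor Two", "Editor One"]},
    {"title": "Example Spec C", "status": "REC", "date": "2005-06-01",
     "editors": ["Editor Two"]},
]


def index_by(records, key):
    """Group records under each value of `key`, newest first in each group."""
    groups = defaultdict(list)
    for rec in records:
        value = rec[key]
        # A multi-valued field (e.g. editors) files the record under each value.
        values = value if isinstance(value, list) else [value]
        for v in values:
            groups[v].append(rec)
    for recs in groups.values():
        # ISO 8601 dates sort correctly as plain strings.
        recs.sort(key=lambda r: r["date"], reverse=True)
    # Return the groups in alphabetical order of their key value.
    return dict(sorted(groups.items()))


by_status = index_by(reports, "status")
by_editor = index_by(reports, "editors")
```

The same helper covers the "by status" and "by editor" views; a "by
date" or "by title" index would simply be a sort over the full list.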
These deliverables all rely on a shared XSLT stylesheet to extract
metadata about Technical Reports in RDF (GRDDL). By crawling the
published reports and applying this transformation to them, a complete
and up-to-date RDF index is built from resources distributed over the
organization and then used to generate integrated views.
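The crawl-and-transform step described above might look roughly like the
following sketch. It is not the project's stylesheet: plain Python
stands in for the XSLT transformation, and the template convention it
assumes (a title element plus meta elements named "DC.*"), the sample
document and the example.org URLs are all assumptions made for
illustration.

```python
import xml.etree.ElementTree as ET

XHTML = "{http://www.w3.org/1999/xhtml}"
DC = "http://purl.org/dc/elements/1.1/"

# Hypothetical report following an assumed template; not a real W3C document.
SAMPLE = """<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <title>Example Report</title>
    <meta name="DC.date" content="2006-08-29"/>
    <meta name="DC.creator" content="Example Editor"/>
  </head>
  <body><p>Report body.</p></body>
</html>"""


def extract_triples(url, xhtml_text):
    """Return (subject, predicate, object) triples for one templated report."""
    root = ET.fromstring(xhtml_text)
    triples = []
    title = root.find(f"{XHTML}head/{XHTML}title")
    if title is not None and title.text:
        triples.append((url, DC + "title", title.text.strip()))
    for meta in root.iter(f"{XHTML}meta"):
        name = meta.get("name", "")
        content = meta.get("content")
        # Assumed convention: <meta name="DC.xxx"> maps to the Dublin Core term xxx.
        if name.startswith("DC.") and content:
            triples.append((url, DC + name[len("DC."):], content))
    return triples


def build_index(documents):
    """Apply the same extraction to every crawled document and merge the results."""
    index = []
    for url, text in documents.items():
        index.extend(extract_triples(url, text))
    return index


index = build_index({
    "http://example.org/TR/report-a": SAMPLE,
    "http://example.org/TR/report-b": SAMPLE,
})
```

Because every report follows the same template, one extraction routine
is enough for the whole collection, which is the property the use case
relies on.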
Because the XHTML documents put online follow an official template, a
single GRDDL stylesheet can be defined to extract the corresponding RDF
annotations, which can then be used to generate portals and support
workflows.
Received on Tuesday, 29 August 2006 15:27:41 UTC