Re: DeSeRe’24 Workshop on Decentralised search and Recommendations

Sarven – I have no connection to this conference, but the general answer to the question is that PDF is a semantically rich format which includes full support for structural semantics, content semantics as well as RDFa compatible “markup”.   So given a properly constructed PDF, retrieval of such information is well defined.  In fact, there is an industry standard for deterministically deriving equivalent HTML(+RDF, if present) - https://pdfa.org/resource/deriving-html-from-pdf/.

Leonard

From: Sarven Capadisli <info@csarven.ca>
Date: Tuesday, March 19, 2024 at 7:31 AM
To: public-solid@w3.org <public-solid@w3.org>
Subject: Re: DeSeRe’24 Workshop on Decentralised search and Recommendations
EXTERNAL: Use caution when clicking on links or opening attachments.


On 2024-03-19 10:55, Mohamed Ragab wrote:
> The workshop keynotes, papers and panel discussion mainly focus on Solid
> Search, indexing, distributed querying, and many more.


Great!

I have some obligatory questions:

Noting that contributions to the workshop requires PDFs (
https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww2024.thewebconf.org%2Fcalls%2Fshort-papers%2F&data=05%7C02%7Clrosenth%40adobe.com%7Cd34f0be19f1245e06acd08dc48082c0d%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C638464447143941875%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C0%7C%7C%7C&sdata=F2CZeUbeSK693jLpSrUB8UU2vYWvEU60IXK2pjNc14I%3D&reserved=0<https://www2024.thewebconf.org/calls/short-papers/> ), how do you see the
role of the workshop with respect to making scholarly knowledge (graphs)
about Solid available?

Bonus: How would one perform "Solid Search, indexing, distributed
querying" on the information within those PDF contributions? Will you
consider accepting contributions alternative to the Paper User Interface
(PUI)?

Recycling an obligatory quote by *the* Web developer:

> From: timbl@info .cern.ch (Tim Berners-Lee)
> Newsgroups: alt.hypertext
> Subject: WorldWideWeb: Summary
> Date: 6 Aug 91 16:00:12 GMT
>
> The WWW project merges the techniques of information retrieval and hypertext to
> make an easy but powerful global information system.
>
> The project started with the philosophy that much academic information should
> be freely available to anyone. It aims to allow information sharing within
> internationally dispersed teams, and the dissemination of information by
> support groups.

[..]

> The WWW model gets over the frustrating incompatibilities of data format
> between suppliers and reader by allowing negotiation of format between a smart
> browser and a smart server.

https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.w3.org%2FPeople%2FBerners-Lee%2F1991%2F08%2Fart-6487.txt&data=05%7C02%7Clrosenth%40adobe.com%7Cd34f0be19f1245e06acd08dc48082c0d%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C638464447143950536%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C0%7C%7C%7C&sdata=f%2FPwEdgzJqM%2BDSh94GkcIr9LXYb7F6sk8gctKe3P3a8%3D&reserved=0<https://www.w3.org/People/Berners-Lee/1991/08/art-6487.txt>


-Sarven
https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcsarven.ca%2F%23i&data=05%7C02%7Clrosenth%40adobe.com%7Cd34f0be19f1245e06acd08dc48082c0d%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C638464447143956747%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C0%7C%7C%7C&sdata=0TEChX3aJShXi1gLA8R7rwxx4wuHXnzWZIYaqWt%2Bppw%3D&reserved=0<https://csarven.ca/#i>

Received on Tuesday, 19 March 2024 11:36:41 UTC