RE: A Less Ephemeral Web


Thank you. I think that you both understand correctly. I’m talking about downloading HTML files or, more interestingly, the semantically- or schema-annotated objects in them, to either end users’ local storage or to their cloud-based drives (e.g. Microsoft OneDrive, Google Drive, Apple iCloud). Also topical is a set of interesting and worthwhile services that software developers, including third-party developers, could provide for end users should they choose to save, note or scrapbook webpages or the objects in them to their cloud-based drives. In these regards, I indicated some example services for collections of saved, noted, or scrapbooked news articles.

William, thank you for the information about previous works in the form of “semantic bookmarks”.

Aymeric, yes, in addition to semantically- or schema-annotated objects in HTML files, Web Components, too, could be utilized to represent various things (e.g. recipes or news articles) in HTML pages.

Interesting discussion about the potentials of semantic- and schema-based annotations as well as of Web Components. In addition to enhancing Web search, these technologies could be of use for saving, noting or scrapbooking (e.g. recipes or news articles). With cloud-based storage, software developers, including third-party developers, could provide end users with a set of interesting and worthwhile services. As Aymeric and William have pointed out, others (e.g. W3C participants) may have already thought of similar ideas over the years.

Best regards,

From: Aymeric Vitte<>
Sent: Friday, August 28, 2020 10:15 AM
To: William Van Woensel<>; Adam Sobieski<>;<>; Mailing List<>;<>;<>
Subject: Re: A Less Ephemeral Web

If I understand correctly, the concept is to store web pages parts without using full bookmarks and/or full wayback machine indexing and/or icloud web pages storage

Maybe take a look here:

This site was targeted for other purposes but clearly one of the target was to extract part of web pages and store them as "widgets", with real time retrieval or just stored as independent gadgets "forever"

There was a patent about this that did conflict with Web Components and therefore was sold to Google & co, W3C was probably part of the deal

So to reach my point (again if I am not misunderstanding what the discussion is about), maybe Web Components can do it or will
Le 28/08/2020 à 14:33, William Van Woensel a écrit :
Hi Adam,

If I understood correctly, your idea would involve semantically annotated bookmarks – i.e., allowing external third-party annotation but also leveraging pre-existing “data islands” within the bookmarked pages. If this is accurate, using that particular search term (i.e., semantic bookmarking) already points towards some useful prior work.

From a technical point of view, I’ve dabbled a bit in extracting structured data from annotated websites: checkout Any23<>, java-RDFa<> and JS RDFa<> (others could have emerged since).

Kind regards,


From: Adam Sobieski <><>
Sent: August-27-20 3:27 PM
To:<>; Mailing List <><>;<>;<>
Subject: A Less Ephemeral Web

CAUTION: The Sender of this email is not from within Dalhousie.
Web users could press or click on a Web browser button to save, note or scrapbook webpages and/or Web-schema-annotated objects in them, storing them for later use. This storage could be organized into folders and Web schema could be of use for sorting webpages and/or objects into these folders. For example, a NewsArticle schema could indicate that, when saved, noted or scrapbooked, Web content could be organized into a folder called “News Articles”.

Webpages and/or objects in them could be stored on users’ local devices or stored on cloud-based storage such as Microsoft OneDrive or Google Drive. This could be configurable for end users.

Cloud-based storage and extensible architectures could facilitate the development of apps, plugins and services to analyze the contents of collections of stored webpages to provide features for end users. Some specific examples of analytical services in the news space include: (1) notifying users of the distribution of the sources of their collected articles, (2) notifying users whether their collected articles contain misinformation or disinformation, (3) notifying users whether their collected articles are “left-leaning” or “right-leaning”, (4) indicating to users the distribution of topics in their collected articles, (5) indicating to users sentiment analysis upon their collected articles, (6) indicating to users the comprehensiveness of their news search and gathering processes for a given topic, and (7) providing other features made possible by other AI and natural language processing tools.

Multiple means of navigating collections of stored webpages and/or objects can be envisioned. Users could utilize a calendar-based widget to navigate collections of webpages or objects. Users could also navigate via a folders-based user experience.

Multiple means of searching collections of stored webpages and/or objects can be envisioned, for example searching for content by text strings or keywords.

Thank you. I hope that these ideas are of some interest to you.

Best regards,
Adam Sobieski


Sophia-Antipolis, France


Move your coins by yourself (browser version):

Bitcoin transactions made simple:

Zcash wallets made simple:

Bitcoin wallets made simple:

Get the torrent dynamic blocklist:

Check the 10 M passwords list:

Anti-spies and private torrents, dynamic blocklist:

Peersm :


node-Tor :

GitHub :

Received on Friday, 28 August 2020 18:06:12 UTC