Re: CBOR Tutorial from Leonard Rosenthol on 2018-02-02 (public-publ-wg@w3.org from February 2018)

From: Leonard Rosenthol <lrosenth@adobe.com>
Date: Fri, 2 Feb 2018 22:28:58 +0000
To: Brady Duga <duga@google.com>, Romain <rdeltour@gmail.com>
CC: Laurent Le Meur <laurent.lemeur@edrlab.org>, Baldur Bjarnason <baldur@rebus.foundation>, Ivan Herman <ivan@w3.org>, Schindler Wolfgang Dr. <w.schindler@pons.de>, "Davis, Greg" <greg.davis@pearson.com>, Ric Wright <rkwright@geofx.com>, W3C Publishing Working Group <public-publ-wg@w3.org>
Message-ID: <7BCE25F4-802F-4641-9005-BA8FE2FEDFA0@adobe.com>

> CBOR itself (ignoring WebPack, etc) doesn't seem like a poor choice for "off-the-web" use, though it does require a little more processing than a pre-indexed format would.
>
Guess it depends on how those files are being used.

If you are thinking that such PWPs are of a “manageable size” and are loaded into a “library” (where they can be scanned/indexed on ingestion), as is the case in the EPUB world today, then you are probably right: the overhead to index isn’t huge.

However, in my world of multi-megabyte (or even gigabyte!!) documents that need to be opened immediately (such that there isn’t even time for an indexing step), it’s a non-starter. I need a format that has already been indexed OR that can be processed entirely as a stream.

Leonard

From: Brady Duga <duga@google.com>
Date: Friday, February 2, 2018 at 12:52 PM
To: Romain <rdeltour@gmail.com>
Cc: Leonard Rosenthol <lrosenth@adobe.com>, Laurent Le Meur <laurent.lemeur@edrlab.org>, Baldur Bjarnason <baldur@rebus.foundation>, Ivan Herman <ivan@w3.org>, "Schindler Wolfgang Dr." <w.schindler@pons.de>, "Davis, Greg" <greg.davis@pearson.com>, Ric Wright <rkwright@geofx.com>, W3C Publishing Working Group <public-publ-wg@w3.org>
Subject: Re: CBOR Tutorial

To some extent. However, my point is even without a data model that provides an index, you can still enable random access in a CBOR blob. Obviously, you need to know what data has been encoded, but then you can (if you want) generate an index for the data you are interested in. CBOR itself (ignoring WebPack, etc) doesn't seem like a poor choice for "off-the-web" use, though it does require a little more processing than a pre-indexed format would.
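As a rough illustration of the single-pass indexing idea described above, here is a minimal Python sketch. It is not taken from any PWP or WebPack draft. It assumes the blob is one top-level CBOR map with text-string keys and only definite-length items, and the names `read_head`, `skip_item` and `build_index` are hypothetical helpers invented for this example. It walks the item heads once and records the byte offset and length of each top-level value, so a reader can later jump straight to the entry it wants.

```python
def read_head(buf, pos):
    """Decode one CBOR item head; return (major_type, argument, next_pos)."""
    initial = buf[pos]
    major, info = initial >> 5, initial & 0x1F
    pos += 1
    if info < 24:                      # argument stored in the initial byte
        return major, info, pos
    if info in (24, 25, 26, 27):       # argument in the next 1, 2, 4 or 8 bytes
        size = 1 << (info - 24)
        return major, int.from_bytes(buf[pos:pos + size], "big"), pos + size
    # Assumption: indefinite-length items (info 31) are out of scope here.
    raise ValueError("indefinite-length or reserved head not handled")


def skip_item(buf, pos):
    """Return the position just past the complete item starting at pos."""
    major, arg, pos = read_head(buf, pos)
    if major in (0, 1, 7):             # integers, simple values, floats
        return pos
    if major in (2, 3):                # byte string / text string payload
        return pos + arg
    if major == 4:                     # array of arg items
        for _ in range(arg):
            pos = skip_item(buf, pos)
        return pos
    if major == 5:                     # map of arg key/value pairs
        for _ in range(2 * arg):
            pos = skip_item(buf, pos)
        return pos
    return skip_item(buf, pos)         # major 6: tag wrapping one item


def build_index(buf):
    """Single pass over a top-level map: key -> (value offset, value length)."""
    major, count, pos = read_head(buf, 0)
    if major != 5:
        raise ValueError("expected a top-level map")
    index = {}
    for _ in range(count):
        key_major, key_len, key_pos = read_head(buf, pos)
        if key_major != 3:
            raise ValueError("expected text-string keys")
        key = bytes(buf[key_pos:key_pos + key_len]).decode("utf-8")
        start = key_pos + key_len
        end = skip_item(buf, start)    # payloads are skipped, not decoded
        index[key] = (start, end - start)
        pos = end
    return index


# Hand-encoded CBOR for the map {"a": 1, "b": [2, 3]}.
blob = bytes([0xA2, 0x61, 0x61, 0x01, 0x61, 0x62, 0x82, 0x02, 0x03])
index = build_index(blob)
offset, length = index["b"]
value_bytes = blob[offset:offset + length]   # raw CBOR for the array [2, 3]
```

Note that the pass still has to read every item head, including those nested inside values, which is the extra processing relative to a pre-indexed format. String payloads themselves are skipped rather than decoded.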

On Fri, Feb 2, 2018 at 9:04 AM, Romain <rdeltour@gmail.com> wrote:

On 2 Feb 2018, at 17:11, Brady Duga <duga@google.com> wrote:

 can random access be enabled with a single pass over the data to generate an index

Yes, isn’t it the conclusion we reached in [1] and [2]?

Best,
Romain.

[1] https://lists.w3.org/Archives/Public/public-publ-wg/2018Feb/0003.html
[2] https://lists.w3.org/Archives/Public/public-publ-wg/2018Feb/0004.html

Received on Friday, 2 February 2018 22:29:23 UTC