Re: WebPlatform Browser Support phased approach? from Doug Schepers on 2013-11-01 (public-webplatform-tests@w3.org from October to December 2013)

From: Doug Schepers <schepers@w3.org>
Date: Fri, 01 Nov 2013 14:10:52 -0400
To: Ronald Mansveld <ronald@ronaldmansveld.nl>
CC: public-webplatform-tests@w3.org, Janet Swisher <jswisher@mozilla.com>
Message-ID: <5273EEAC.5090000@w3.org>

Hi, Ronald–

On 11/1/13 1:47 PM, Ronald Mansveld wrote:
> It's OK. I've ran into Jean-Yves here at the London Office, and he
> brought me into contact with some of his american collegues. A bug has
> been filed to have the data be available as JSON, but it seems like
> their raw data are indeed the HTML-tables, so either way it would mean
> parsing that data. Either on their side, or on our side. There have been
> talks about extracting that data to machine-readable, but for now that's
> likely to be in the future.
>
> As for the MediaWiki extension: can you send me an example or spec of
> the precise JSON-formatting it expects?

Let me get back to you on that, but for now, here's the extension code 
itself:

 
https://github.com/webplatform/mediawiki/tree/master/extensions/CompaTables


> What might be a solution for now:
>
> - Use MediaWiki and JSON (and all benefits) when CIU/H5T data is available
> - Bypass MediaWiki and show the MDN table if no CIU/H5T data is available
>
> That way a lot of properties would still have the MW extension, just the
> entries that we don't have CIU/H5T data for would have to resort to the
> MDN-fallback.

I don't think that saves us any work. With this approach, we'd still 
need to output the raw table at some point, so whether it's MDN or 
CIU/H5T data doesn't matter.


> By simply looking at the analytics-data for the pages, we can always
> decide to manually provide MDN-JSON for pages with high request-rates,
> until a good parser has been written.
>
>
> I've come a long way parsing the MDN-data to JSON, the main problem is
> that some of the key-data is lumped together in 1 table-cell. So it's
> hard to extract that data in a correct way. I am trying however, just
> not sure about the right way to do so. (Part of the current solution is
> replacing a <br> with a textnode with a specific string, so I have a
> textual marker in the nodeValue where I can split the text on. Parsing
> this data really does feel like clutching at straws to get somewhere at
> times...

Very ugly :( I'm sorry you had to do that.

On the bright side, once this project is viable, nobody else will have 
to go through that pain. :)


> Let me know if the fallback-option would be feasible (I'm not too
> familiar with the current set-up of the servers etc, so I can't really
> make a call on that one), or that I should continue parsing the table to
> JSON.

Yeah, I still don't know, because I'm not that familiar with the 
technical details of how we'd do this alternate approach (inserting the 
table data into the pages directly).


Regards-
-Doug

Received on Friday, 1 November 2013 18:11:07 UTC