Minutes, TPAC F2F WG meeting from Chris Lilley on 2012-11-01 (public-webfonts-wg@w3.org from November 2012)

From: Chris Lilley <chris@w3.org>
Date: Thu, 1 Nov 2012 16:15:04 +0100
To: "Levantovsky, Vladimir" <Vladimir.Levantovsky@MonotypeImaging.com>
CC: "public-webfonts-wg@w3.org" <public-webfonts-wg@w3.org>
Message-ID: <501165131.20121101161504@w3.org>
On Thursday, November 1, 2012, 3:54:43 PM, Vladimir wrote:

LV> Chris and Tab - could you please retrieve and
LV> sent out to the WG the IRC notes from today's morning session?

http://www.w3.org/2012/11/01-webfonts-minutes.html    

and below as text


                        WebFonts TPAC f2f, Lyon

01 Nov 2012

   [2]Agenda

      [2] http://lists.w3.org/Archives/Public/public-webfonts-wg/2012Oct/0006.html

   See also: [3]IRC log

      [3] http://www.w3.org/2012/11/01-webfonts-irc

Attendees

   Present
          Vlad, Jonathan, Sylvain, Tab, aaaa, Taichi, Chris

   Regrets
          Raph, JDaggett

   Chair
          Vlad

   Scribe
          TabAtkins

Contents

     * [4]Topics
         1. [5]MIME types
     * [6]Summary of Action Items
     __________________________________________________________

   <TabAtkins_> ScribeNick: TabAtkins

   <TabAtkins_> chrisl: The evaluation report will be joint-edited
   by Chris Lilley and Tab Atkins

   <TabAtkins_> Vlad: Given that we have at least an initial
   breakdown of benefits per feature, plus we can get this extra
   data from Raph I assume.

   <TabAtkins_> ChrisL: Presumably ask other members to run the
   WOFF2 compressor over their own font collections and report
   back data.

   <TabAtkins_> ChrisL: I don't just want a number, but also a
   breakdown of features, like unicode-range, use of opentype
   features, toolchain used to produce,e tc.

   <TabAtkins_> jfkthame: Not sure if toolchain is necessarily
   something we can easily quantify, but getting a lot of data
   seems important.

   <TabAtkins_> ChrisL: Would it be possible to get SIL to run
   this over a lot of their fonts?

   <TabAtkins_> jfkthame: Probably.

   <TabAtkins_> ChrisL: Okay. That would be useful, as the fonts
   are certainly different in their use of OpenType tables.

   <TabAtkins_> ChrisL: And MS has a lot of fonts.

   <TabAtkins_> sylvain: We do.

   <TabAtkins_> ChrisL: So it would be good if someone could
   delegate someone to run the processor over them.

   <TabAtkins_> ChrisL: Is there still someone from the typography
   group?

   <TabAtkins_> sylvain: Yeah, we can ask them.

   <TabAtkins_> [round of introductions]

   <TabAtkins_> ChrisL: Does NTT have a reasonable range of fonts
   that you'd be willing to run things over?

   <TabAtkins_> taichi: My font and standardization work is
   somewhat independent from my work in NTT.

   <TabAtkins_> taichi: I'm working on a project called GlyphWiki,
   which 20k glyphs collected now.

   <TabAtkins_> [7]http://en.glyphwiki.org/wiki/GlyphWiki:MainPage

      [7] http://en.glyphwiki.org/wiki/GlyphWiki:MainPage

   <TabAtkins_> [discussion of glyphwiki]

   <ChrisL> "GlyphWiki" is a shareable kanji (hanzi, hanja) glyph
   database system on the Internet for a solution of using
   unencoded kanji characters in digitizing text resources in
   kanji. "

   <TabAtkins_> taichi: I presented some of the infrastructure of
   glyphwiki at atypi last year.

   <TabAtkins_> Vlad: In preliminary discussions, LZMA was flagged
   as a potential issue. Anything going on there?

   <TabAtkins_> TabAtkins_: I recall Raph talking about that.
   Implementing LZMA properly requires reading a lot of source
   code, and possibly some papers in Russian.

   <TabAtkins_> TabAtkins_: Dunno what his current thinking is,
   but I know that discussion of re-speccing definitely came up
   before.

   <TabAtkins_> jfkthame: Does anyone know about the patent
   situation around LZMA?

   <TabAtkins_> TabAtkins_: I know we're apparently not very
   concerned about it. ^_^

   <TabAtkins_> [discussion of European patent law]

   <TabAtkins_> sylvain: Looks like the code was released to the
   public domain in 2008.

   <TabAtkins_> ChrisL: Being public domain isn't my huge concern
   - it's that the source code appears to be the spec.

   <TabAtkins_> ChrisL: There are a lot of forum posts where
   people discuss weird things in the source code that need to be
   a specific thing, and they're not sure why.

   <TabAtkins_> ChrisL: Which is fine, until you need to port it
   to another architecture and find it was relying on interesting
   platform behavior.

   <TabAtkins_> jfkthame: As well, beyond being public domain,
   there can still be patents in the area.

   <TabAtkins_> ChrisL: And why is LZMA specifically being used
   here? is it just "better than gzip", or does it have unique
   qualities that make it worthwhile?

   <TabAtkins_> taichi: As an entropy encoder, it's a fairly
   general compression technique, which is very good.

   <TabAtkins_> taichi: One of the most common parts of recent
   good compression technique is extracting common parts.

   <TabAtkins_> taichi: Hangul, for example, is particularly good
   for this kind of part-extraction compression, because of its
   unique structure.

   <TabAtkins_> ChrisL: My point is that it seems that, if LZMA
   turns out to be unusable, it's not the end of the road for us -
   we can switch to another entropy coder.

   <TabAtkins_> sylvain: I wonder how much of a loss there would
   be by just reusing gzip as the actual encoder?

   <TabAtkins_> TabAtkins_: I know that Raph has investigated
   that. I don't know the numbers, though.

   <TabAtkins_> ChrisL: Given that the preprocessor step is
   effectively removing redundancy, I would imagine that gzip
   doesn't have as much to work on.

   <TabAtkins_> Vlad: Not necessarily. The preprocessor removes
   redundancy, yes, but in ways that are probably invisible to an
   entropy coder.

   <TabAtkins_> Vlad: For example, I may have a bounding box in
   one table, and a min/max X/Y extents in another table. These
   are equivalent pieces of data.

   <TabAtkins_> Vlad: But they won't compress together if both
   left in.

   <TabAtkins_> ChrisL: If LZMA is generally better than gzip, I
   wonder if http-bis folk have looked into it as a better way to
   pursue general compression.

   <TabAtkins_> Vlad: jdaggett brought that up earlier, when we
   first brought up discussion of compression in webfonts.

   <TabAtkins_> Vlad: But yes, lzma can pick up redundancy in your
   bitstream which would be very different from what your
   preprocessor would extract.

   <TabAtkins_> ChrisL: Speaking of that, I suppose that's
   something else the Evaluation Report should look at - are there
   incremental improvements in compression we can make that would
   greatly increase the computational load? If so, mobile might
   not be happy about that.

   <TabAtkins_> TabAtkins_: I know that Raph has identified
   several specific places where he can tweak the algorithm for
   exactly that kind of incremental gain. He's run numbers, and
   they weren't impressive enough for him to bundle them into his
   default proposal, but the WG can discuss and decide what might
   be worth adding in.

   <TabAtkins_> Vlad: Speaking of computational load, it's really
   the decompress we care about.

   <TabAtkins_> Vlad: With the original MTX compression, the
   difference in compress/decompress load is 10:1.

   <TabAtkins_> Vlad: What, if anything, can we discuss about the
   future plans for the WOFF2 spec?

   <TabAtkins_> Vlad: We have a file format proposal from Raph.

   <TabAtkins_> Vlad: I dont' think we have more to talk about
   beyond that, and dont' know how much more to talk about before
   we get the Evaluation Report to give us confidence we're going
   the right way.

   <TabAtkins_> ChrisL: to some extent, the details of the file
   format are idnependent of the compression choices, so it's
   reasonabl eto talk about it before we have compression data.

   <TabAtkins_> ChrisL: I don't recall seeing anything
   particularly bad in the file format part, but I haven't read it
   overly recently or with my standardization hat on.

   <TabAtkins_> jfkthame: What we have now is more or less a rough
   draft, not yet ready for speccing.

   <TabAtkins_> TabAtkins_: I'm willing to do an editorial pass
   over what we ahve and get it prepped into the right
   form/language for a real spec. That kind of work is fun and
   brainless.

   <TabAtkins_> TabAtkins_: Since Raph isn't familiar with that
   kind of structure we like.

   <TabAtkins_> Vlad: Okay, so of key importance is to confirm
   that some folks can give us help with the LZMA spec.

   <TabAtkins_> Vlad: sylvain, I know you have compression experts
   and a good security team, so it would be good to have
   confirmation that this is something we can use.

   <TabAtkins_> ChrisL: Yeah, because as it stands, this'll be
   mandatory in WOFF2.

   <TabAtkins_> ChrisL: If it's legal, that makes it an Essential
   Requirement, which turns the legal knobs in a particular way.

   <TabAtkins_> Vlad: From past experience, folks like Moz are
   more willing to use open-source code...

   <TabAtkins_> jfkthame: Yeah.

   <TabAtkins_> jfkthame: One thing that nags at me is that if
   LZMA isn't very well specced, how confident are we in its
   security, then, and how confident are we in being able to fix
   security issues that arise when we expose it to data coming
   over the wire?

   <TabAtkins_> jfkthame: WOFF1 is reasoanbly simple enough that
   if a security problem arises, we're confident that someone can
   go in and fix it. LZMA sounds like it might not be in that
   realm.

   <TabAtkins_> ChrisL: Right. That's a reason we used gzip in
   WOFF1 - it was quicker, and it had wide security review already
   and was in wide use.

   <TabAtkins_> ACTION Tab to see what sort of security review has
   been done on LZMA, and to see if any "real" spec work has been
   done internally on it.

   <trackbot> Created ACTION-111 - See what sort of security
   review has been done on LZMA, and to see if any "real" spec
   work has been done internally on it. [on Tab Atkins Jr. - due
   2012-11-08].

   <TabAtkins_> ACTION sylvain to see what sort of security review
   has been done on LZMA, and to see if any "real" spec work has
   been done internally on it.

   <trackbot> Created ACTION-112 - See what sort of security
   review has been done on LZMA, and to see if any "real" spec
   work has been done internally on it. [on Sylvain Galineau - due
   2012-11-08].

   <TabAtkins_> ChrisL: Do any other WG members whose companies
   are large enough to be worth suing want to do a similar review?

   <TabAtkins_> ChrisL: Would Monotype be happy using this and
   deploying it?

   <TabAtkins_> Vlad: Yeah. In that case we'd be using it as the
   controlled input. Generating, not consuming.

   <TabAtkins_> jfkthame: I don't know if the security people in
   Moz have looked at LZMA. I'll see if anyone's done fuzzing on
   it, etc.

   <TabAtkins_> TabAtkins_: I feel like we've done some fuzzing as
   well.

   <TabAtkins_> ACTION Chris to ask http-bis folks if they're
   looked at LZMA, and if so, if they've rejected it as something
   they don't want to touch, or what.

   <trackbot> Created ACTION-113 - Ask http-bis folks if they're
   looked at LZMA, and if so, if they've rejected it as something
   they don't want to touch, or what. [on Chris Lilley - due
   2012-11-08].

   <TabAtkins_> <br dur=15min>

   <TabAtkins_> ChrisL: Talking of chartering, people should
   remember to rejoin the group.

MIME types

   <TabAtkins_> Vlad: When are media types used?

   <TabAtkins_> ChrisL: Sent by the server.

   <TabAtkins_> ChrisL: One place you'd expect to see them is
   @font-face, but not really. Instead, we have a custom registry
   of strings for font-types.

   <TabAtkins_> TabAtkins_: Though we could add the real media
   types later.

   <TabAtkins_> Vlad: Yes, I'm just looking for places where the
   media types would be used but not currently.

   <TabAtkins_> ChrisL: Another one is that the UA can send a list
   of media types it accepts (conneg) and the server can then
   decide which to send.

   <TabAtkins_> Vlad: During the coffee break, we agreed to go
   ahead with the existing plan (register application/font-woff),
   and separately try to register top-level 'font/'.

   <TabAtkins_> ChrisL: At the same time we're doing this,
   opentype/truetype are gearing up to register font types as
   well.

   <TabAtkins_> ChrisL: Regardless, we definitely need to register
   the WOFF1 media type.

   <TabAtkins_> Vlad: Is it as simple as wrapping up the text and
   sending it to ietf?

   <TabAtkins_> ChrisL: Yes, it goes to a different group to
   establish it's been reviewed, etc. plh does that liaisoning.

   <TabAtkins_> Vlad: Since I have to do the same thing in ISO, I
   was just wondering who to contact.

   <TabAtkins_> ACTION chris to finish registration of WOFF1 mime
   type.

   <trackbot> Created ACTION-114 - Finish registration of WOFF1
   mime type. [on Chris Lilley - due 2012-11-08].

   <TabAtkins_> ChrisL: I noticed that you wrote "WOFF1". I was
   under the impression that WOFF2 would use the same string.

   <TabAtkins_> Vlad: I thought the same.

   <TabAtkins_> TabAtkins_, jfkthame: Disagree.

   <TabAtkins_> jfkthame: You lose the ability to do conneg, to
   distinguish in font-face, etc. Defeats the point of having a
   mime type at all, really.

   <TabAtkins_> ChrisL: Okay. That's fine. In the future we'll
   register a "woff2", then. No need to be specific in the
   mimetype right now as "woff1".

   <TabAtkins_> ChrisL: Related to that, as WOFF1 is a PR, and the
   deadline is Nov 8, make sure you've responded favorably.

   <TabAtkins_> ChrisL: MS hasn't responded yet.

   <TabAtkins_> ChrisL: It makes my job presenting this to W3M
   much easier if we don't have to second-guess whether
   non-responses are because they hate it or just forgot it.

   <TabAtkins_> Vlad: Speaking of top-level types, about a year
   ago there was a discussion in ietf about font-type
   registration.

   <TabAtkins_> Vlad: Some people asked why it's not yet
   registered, others complained that prior attempts failed and so
   discouraging new attempts...

   <TabAtkins_> Vlad: I responded that I heard a rumor that it
   takes a lifetime to get it in.

   <TabAtkins_> Vlad: and someone responded with "who said that?
   Just submit it!"

   <TabAtkins_> Vlad: So Chris, do you have an insider info on
   what kind of effort this requires?

   <TabAtkins_> ChrisL: It varies. There is relatively recent
   evidence of adding new types (model).

   <TabAtkins_> ChrisL: A while back there was significant
   pushback from people not wanting to extend any new top-level
   types, because they were laser-focused on their use in email
   clients.

   <TabAtkins_> ChrisL: But now there are other opinions, and they
   recognize that they are useful outside of email.

   <TabAtkins_> ChrisL: but I still estimate it'll take the better
   part of a year.

   <TabAtkins_> Vlad: There's been a new attempt to make a
   proposal.

   <TabAtkins_> Vlad: From the process pov, how much more
   complicated will it be compared to regular subtype
   registratino?

   <TabAtkins_> ChrisL: Fairly more complex. You have to make a
   draft which becomes an RFC, etc.

   <TabAtkins_> Vlad: Would it be outrageous plagiarism to reuse
   some of the text from previous attempts?

   <TabAtkins_> ChrisL: Not at all.

   <TabAtkins_> Vlad: Okay, because Dave Singer tried this last
   time.

   <TabAtkins_> ACTION Chris and Vlad to put together the draft
   for top-level font registration, possibly based on text from
   previous attempts.

   <trackbot> Created ACTION-115 - And Vlad to put together the
   draft for top-level font registration, possibly based on text
   from previous attempts. [on Chris Lilley - due 2012-11-08].

   <TabAtkins_> Vlad: Looking through the mailing list, several
   people have offered to help with this process, such as Martin
   Durst, Thom Phinney, etc.

Summary of Action Items

   [End of minutes]
     ________________

-- 
 Chris Lilley   Technical Director, Interaction Domain                 
 W3C Graphics Activity Lead, Fonts Activity Lead
 Co-Chair, W3C Hypertext CG
 Member, CSS, WebFonts, SVG Working Groups
Received on Thursday, 1 November 2012 15:15:52 UTC