- From: Chris Lilley <chris@w3.org>
- Date: Thu, 1 Nov 2012 16:15:04 +0100
- To: "Levantovsky, Vladimir" <Vladimir.Levantovsky@MonotypeImaging.com>
- CC: "public-webfonts-wg@w3.org" <public-webfonts-wg@w3.org>
On Thursday, November 1, 2012, 3:54:43 PM, Vladimir wrote:
LV> Chris and Tab - could you please retrieve and
LV> sent out to the WG the IRC notes from today's morning session?
http://www.w3.org/2012/11/01-webfonts-minutes.html
and below as text
WebFonts TPAC f2f, Lyon
01 Nov 2012
[2]Agenda
[2] http://lists.w3.org/Archives/Public/public-webfonts-wg/2012Oct/0006.html
See also: [3]IRC log
[3] http://www.w3.org/2012/11/01-webfonts-irc
Attendees
Present
Vlad, Jonathan, Sylvain, Tab, aaaa, Taichi, Chris
Regrets
Raph, JDaggett
Chair
Vlad
Scribe
TabAtkins
Contents
* [4]Topics
1. [5]MIME types
* [6]Summary of Action Items
__________________________________________________________
<TabAtkins_> ScribeNick: TabAtkins
<TabAtkins_> chrisl: The evaluation report will be joint-edited
by Chris Lilley and Tab Atkins
<TabAtkins_> Vlad: Given that we have at least an initial
breakdown of benefits per feature, plus we can get this extra
data from Raph I assume.
<TabAtkins_> ChrisL: Presumably ask other members to run the
WOFF2 compressor over their own font collections and report
back data.
<TabAtkins_> ChrisL: I don't just want a number, but also a
breakdown of features, like unicode-range, use of opentype
features, toolchain used to produce,e tc.
<TabAtkins_> jfkthame: Not sure if toolchain is necessarily
something we can easily quantify, but getting a lot of data
seems important.
<TabAtkins_> ChrisL: Would it be possible to get SIL to run
this over a lot of their fonts?
<TabAtkins_> jfkthame: Probably.
<TabAtkins_> ChrisL: Okay. That would be useful, as the fonts
are certainly different in their use of OpenType tables.
<TabAtkins_> ChrisL: And MS has a lot of fonts.
<TabAtkins_> sylvain: We do.
<TabAtkins_> ChrisL: So it would be good if someone could
delegate someone to run the processor over them.
<TabAtkins_> ChrisL: Is there still someone from the typography
group?
<TabAtkins_> sylvain: Yeah, we can ask them.
<TabAtkins_> [round of introductions]
<TabAtkins_> ChrisL: Does NTT have a reasonable range of fonts
that you'd be willing to run things over?
<TabAtkins_> taichi: My font and standardization work is
somewhat independent from my work in NTT.
<TabAtkins_> taichi: I'm working on a project called GlyphWiki,
which 20k glyphs collected now.
<TabAtkins_> [7]http://en.glyphwiki.org/wiki/GlyphWiki:MainPage
[7] http://en.glyphwiki.org/wiki/GlyphWiki:MainPage
<TabAtkins_> [discussion of glyphwiki]
<ChrisL> "GlyphWiki" is a shareable kanji (hanzi, hanja) glyph
database system on the Internet for a solution of using
unencoded kanji characters in digitizing text resources in
kanji. "
<TabAtkins_> taichi: I presented some of the infrastructure of
glyphwiki at atypi last year.
<TabAtkins_> Vlad: In preliminary discussions, LZMA was flagged
as a potential issue. Anything going on there?
<TabAtkins_> TabAtkins_: I recall Raph talking about that.
Implementing LZMA properly requires reading a lot of source
code, and possibly some papers in Russian.
<TabAtkins_> TabAtkins_: Dunno what his current thinking is,
but I know that discussion of re-speccing definitely came up
before.
<TabAtkins_> jfkthame: Does anyone know about the patent
situation around LZMA?
<TabAtkins_> TabAtkins_: I know we're apparently not very
concerned about it. ^_^
<TabAtkins_> [discussion of European patent law]
<TabAtkins_> sylvain: Looks like the code was released to the
public domain in 2008.
<TabAtkins_> ChrisL: Being public domain isn't my huge concern
- it's that the source code appears to be the spec.
<TabAtkins_> ChrisL: There are a lot of forum posts where
people discuss weird things in the source code that need to be
a specific thing, and they're not sure why.
<TabAtkins_> ChrisL: Which is fine, until you need to port it
to another architecture and find it was relying on interesting
platform behavior.
<TabAtkins_> jfkthame: As well, beyond being public domain,
there can still be patents in the area.
<TabAtkins_> ChrisL: And why is LZMA specifically being used
here? is it just "better than gzip", or does it have unique
qualities that make it worthwhile?
<TabAtkins_> taichi: As an entropy encoder, it's a fairly
general compression technique, which is very good.
<TabAtkins_> taichi: One of the most common parts of recent
good compression technique is extracting common parts.
<TabAtkins_> taichi: Hangul, for example, is particularly good
for this kind of part-extraction compression, because of its
unique structure.
<TabAtkins_> ChrisL: My point is that it seems that, if LZMA
turns out to be unusable, it's not the end of the road for us -
we can switch to another entropy coder.
<TabAtkins_> sylvain: I wonder how much of a loss there would
be by just reusing gzip as the actual encoder?
<TabAtkins_> TabAtkins_: I know that Raph has investigated
that. I don't know the numbers, though.
<TabAtkins_> ChrisL: Given that the preprocessor step is
effectively removing redundancy, I would imagine that gzip
doesn't have as much to work on.
<TabAtkins_> Vlad: Not necessarily. The preprocessor removes
redundancy, yes, but in ways that are probably invisible to an
entropy coder.
<TabAtkins_> Vlad: For example, I may have a bounding box in
one table, and a min/max X/Y extents in another table. These
are equivalent pieces of data.
<TabAtkins_> Vlad: But they won't compress together if both
left in.
<TabAtkins_> ChrisL: If LZMA is generally better than gzip, I
wonder if http-bis folk have looked into it as a better way to
pursue general compression.
<TabAtkins_> Vlad: jdaggett brought that up earlier, when we
first brought up discussion of compression in webfonts.
<TabAtkins_> Vlad: But yes, lzma can pick up redundancy in your
bitstream which would be very different from what your
preprocessor would extract.
<TabAtkins_> ChrisL: Speaking of that, I suppose that's
something else the Evaluation Report should look at - are there
incremental improvements in compression we can make that would
greatly increase the computational load? If so, mobile might
not be happy about that.
<TabAtkins_> TabAtkins_: I know that Raph has identified
several specific places where he can tweak the algorithm for
exactly that kind of incremental gain. He's run numbers, and
they weren't impressive enough for him to bundle them into his
default proposal, but the WG can discuss and decide what might
be worth adding in.
<TabAtkins_> Vlad: Speaking of computational load, it's really
the decompress we care about.
<TabAtkins_> Vlad: With the original MTX compression, the
difference in compress/decompress load is 10:1.
<TabAtkins_> Vlad: What, if anything, can we discuss about the
future plans for the WOFF2 spec?
<TabAtkins_> Vlad: We have a file format proposal from Raph.
<TabAtkins_> Vlad: I dont' think we have more to talk about
beyond that, and dont' know how much more to talk about before
we get the Evaluation Report to give us confidence we're going
the right way.
<TabAtkins_> ChrisL: to some extent, the details of the file
format are idnependent of the compression choices, so it's
reasonabl eto talk about it before we have compression data.
<TabAtkins_> ChrisL: I don't recall seeing anything
particularly bad in the file format part, but I haven't read it
overly recently or with my standardization hat on.
<TabAtkins_> jfkthame: What we have now is more or less a rough
draft, not yet ready for speccing.
<TabAtkins_> TabAtkins_: I'm willing to do an editorial pass
over what we ahve and get it prepped into the right
form/language for a real spec. That kind of work is fun and
brainless.
<TabAtkins_> TabAtkins_: Since Raph isn't familiar with that
kind of structure we like.
<TabAtkins_> Vlad: Okay, so of key importance is to confirm
that some folks can give us help with the LZMA spec.
<TabAtkins_> Vlad: sylvain, I know you have compression experts
and a good security team, so it would be good to have
confirmation that this is something we can use.
<TabAtkins_> ChrisL: Yeah, because as it stands, this'll be
mandatory in WOFF2.
<TabAtkins_> ChrisL: If it's legal, that makes it an Essential
Requirement, which turns the legal knobs in a particular way.
<TabAtkins_> Vlad: From past experience, folks like Moz are
more willing to use open-source code...
<TabAtkins_> jfkthame: Yeah.
<TabAtkins_> jfkthame: One thing that nags at me is that if
LZMA isn't very well specced, how confident are we in its
security, then, and how confident are we in being able to fix
security issues that arise when we expose it to data coming
over the wire?
<TabAtkins_> jfkthame: WOFF1 is reasoanbly simple enough that
if a security problem arises, we're confident that someone can
go in and fix it. LZMA sounds like it might not be in that
realm.
<TabAtkins_> ChrisL: Right. That's a reason we used gzip in
WOFF1 - it was quicker, and it had wide security review already
and was in wide use.
<TabAtkins_> ACTION Tab to see what sort of security review has
been done on LZMA, and to see if any "real" spec work has been
done internally on it.
<trackbot> Created ACTION-111 - See what sort of security
review has been done on LZMA, and to see if any "real" spec
work has been done internally on it. [on Tab Atkins Jr. - due
2012-11-08].
<TabAtkins_> ACTION sylvain to see what sort of security review
has been done on LZMA, and to see if any "real" spec work has
been done internally on it.
<trackbot> Created ACTION-112 - See what sort of security
review has been done on LZMA, and to see if any "real" spec
work has been done internally on it. [on Sylvain Galineau - due
2012-11-08].
<TabAtkins_> ChrisL: Do any other WG members whose companies
are large enough to be worth suing want to do a similar review?
<TabAtkins_> ChrisL: Would Monotype be happy using this and
deploying it?
<TabAtkins_> Vlad: Yeah. In that case we'd be using it as the
controlled input. Generating, not consuming.
<TabAtkins_> jfkthame: I don't know if the security people in
Moz have looked at LZMA. I'll see if anyone's done fuzzing on
it, etc.
<TabAtkins_> TabAtkins_: I feel like we've done some fuzzing as
well.
<TabAtkins_> ACTION Chris to ask http-bis folks if they're
looked at LZMA, and if so, if they've rejected it as something
they don't want to touch, or what.
<trackbot> Created ACTION-113 - Ask http-bis folks if they're
looked at LZMA, and if so, if they've rejected it as something
they don't want to touch, or what. [on Chris Lilley - due
2012-11-08].
<TabAtkins_> <br dur=15min>
<TabAtkins_> ChrisL: Talking of chartering, people should
remember to rejoin the group.
MIME types
<TabAtkins_> Vlad: When are media types used?
<TabAtkins_> ChrisL: Sent by the server.
<TabAtkins_> ChrisL: One place you'd expect to see them is
@font-face, but not really. Instead, we have a custom registry
of strings for font-types.
<TabAtkins_> TabAtkins_: Though we could add the real media
types later.
<TabAtkins_> Vlad: Yes, I'm just looking for places where the
media types would be used but not currently.
<TabAtkins_> ChrisL: Another one is that the UA can send a list
of media types it accepts (conneg) and the server can then
decide which to send.
<TabAtkins_> Vlad: During the coffee break, we agreed to go
ahead with the existing plan (register application/font-woff),
and separately try to register top-level 'font/'.
<TabAtkins_> ChrisL: At the same time we're doing this,
opentype/truetype are gearing up to register font types as
well.
<TabAtkins_> ChrisL: Regardless, we definitely need to register
the WOFF1 media type.
<TabAtkins_> Vlad: Is it as simple as wrapping up the text and
sending it to ietf?
<TabAtkins_> ChrisL: Yes, it goes to a different group to
establish it's been reviewed, etc. plh does that liaisoning.
<TabAtkins_> Vlad: Since I have to do the same thing in ISO, I
was just wondering who to contact.
<TabAtkins_> ACTION chris to finish registration of WOFF1 mime
type.
<trackbot> Created ACTION-114 - Finish registration of WOFF1
mime type. [on Chris Lilley - due 2012-11-08].
<TabAtkins_> ChrisL: I noticed that you wrote "WOFF1". I was
under the impression that WOFF2 would use the same string.
<TabAtkins_> Vlad: I thought the same.
<TabAtkins_> TabAtkins_, jfkthame: Disagree.
<TabAtkins_> jfkthame: You lose the ability to do conneg, to
distinguish in font-face, etc. Defeats the point of having a
mime type at all, really.
<TabAtkins_> ChrisL: Okay. That's fine. In the future we'll
register a "woff2", then. No need to be specific in the
mimetype right now as "woff1".
<TabAtkins_> ChrisL: Related to that, as WOFF1 is a PR, and the
deadline is Nov 8, make sure you've responded favorably.
<TabAtkins_> ChrisL: MS hasn't responded yet.
<TabAtkins_> ChrisL: It makes my job presenting this to W3M
much easier if we don't have to second-guess whether
non-responses are because they hate it or just forgot it.
<TabAtkins_> Vlad: Speaking of top-level types, about a year
ago there was a discussion in ietf about font-type
registration.
<TabAtkins_> Vlad: Some people asked why it's not yet
registered, others complained that prior attempts failed and so
discouraging new attempts...
<TabAtkins_> Vlad: I responded that I heard a rumor that it
takes a lifetime to get it in.
<TabAtkins_> Vlad: and someone responded with "who said that?
Just submit it!"
<TabAtkins_> Vlad: So Chris, do you have an insider info on
what kind of effort this requires?
<TabAtkins_> ChrisL: It varies. There is relatively recent
evidence of adding new types (model).
<TabAtkins_> ChrisL: A while back there was significant
pushback from people not wanting to extend any new top-level
types, because they were laser-focused on their use in email
clients.
<TabAtkins_> ChrisL: But now there are other opinions, and they
recognize that they are useful outside of email.
<TabAtkins_> ChrisL: but I still estimate it'll take the better
part of a year.
<TabAtkins_> Vlad: There's been a new attempt to make a
proposal.
<TabAtkins_> Vlad: From the process pov, how much more
complicated will it be compared to regular subtype
registratino?
<TabAtkins_> ChrisL: Fairly more complex. You have to make a
draft which becomes an RFC, etc.
<TabAtkins_> Vlad: Would it be outrageous plagiarism to reuse
some of the text from previous attempts?
<TabAtkins_> ChrisL: Not at all.
<TabAtkins_> Vlad: Okay, because Dave Singer tried this last
time.
<TabAtkins_> ACTION Chris and Vlad to put together the draft
for top-level font registration, possibly based on text from
previous attempts.
<trackbot> Created ACTION-115 - And Vlad to put together the
draft for top-level font registration, possibly based on text
from previous attempts. [on Chris Lilley - due 2012-11-08].
<TabAtkins_> Vlad: Looking through the mailing list, several
people have offered to help with this process, such as Martin
Durst, Thom Phinney, etc.
Summary of Action Items
[End of minutes]
________________
--
Chris Lilley Technical Director, Interaction Domain
W3C Graphics Activity Lead, Fonts Activity Lead
Co-Chair, W3C Hypertext CG
Member, CSS, WebFonts, SVG Working Groups
Received on Thursday, 1 November 2012 15:15:52 UTC