Roll call: Bjoern, Nick[niq] (half here), Ville[scop], Yan, Karl, Yves, Olivier[yod], Dom, Terje[xover] (arrived later)

last meeting: http://www.w3.org/mid/C798D705-8925-11D8-AEFA-000393A63FC8@w3.org

** Agenda 1 - checklink and robots **

[00:45:48:] I understood OT has been testing 3.9.3-dev a bit, what about others?
[00:46:49:] * bjoern_ fwiw, did not test checklink...
[00:47:22:] * yod happy with the new feature, with the reservation that I wonder whether it should ignore the robots protocol for non-recursive mode
[00:47:58:] yod: I have a feeling that could be a bit hairy
[00:48:07:] (to implement, that is)
[00:48:34:] scop: because of different UA/RobotUA classes?
[00:48:58:] * yod would like to know others' gut feeling about that too, beyond the implementation issue
[00:49:10:] yod: yep, might be possible to work around that though by directly accessing RobotRules, dunno
[00:49:45:] * bjoern_ thinks that link checkers should ignore at least Disallow: *...
[00:50:29:] nope
[00:50:36:] it's after all just a HEAD, and following robots.txt makes link checkers less useful
[00:50:59:] niq?
[00:51:15:] should display "forbidden by robot rules" with a link to a howto on changing that to allow the link checker
[00:51:57:] that'll work with the default implementation of RobotRules
[00:52:03:] I think in non-recursive mode, the link checker is hardly a robot
[00:52:09:] it's merely a browser
[00:52:27:] indeed
[00:52:29:] it is. And it falls straight into ban-me traps
[00:52:50:] and it subjects webservers to rapid-fire
[00:53:01:] how so?
[00:53:13:] hmm... actually, what I mean is a bit more precise: the link checker should not fail when the primary URI is excluded by robots rules
[00:53:28:] that too
[00:53:30:] ... only when checked URIs inside the page fall under these rules
[00:53:53:] * niq thinks it should
[00:54:12:] I would respect robots.txt... if someone put up a robots.txt with Disallow, it's because they have reasons for it, and this same person in charge will also have the possibility to tweak a configuration to let the link checker through if needed. The UserAgent string, for example
[00:54:17:] otherwise it's open to various attacks, like pointing it at a bad-crawler-trap page directly
[00:54:31:] well, as an author, if I want my links checked and the link checker says I should test manually, I would open the link in my browser, which results in much more traffic than the link checker causes (HEAD vs GET, style sheets, images, ...)
[00:54:34:] karlcow++
[00:55:08:] niq/karlcow++
[00:55:27:] bjoern_: as an author is one thing, but an online robot can be pointed at a third-party webserver, including in a malicious attack
[00:56:13:] http://qa-dev.w3.org/wlc/checklink?uri=http%3A%2F%2Fkoti.welho.com%2Fvskytta%2Ft.html
[00:56:47:] what's missing is the link to a howto describing how to allow the link checker to access the site
[00:57:05:] yep
[00:57:12:] <__Yves> well, people usually edit robots.txt once and for all and use User-Agent *
[00:57:30:] I want to validate external links (internal ones never break) and I cannot change the robots.txt of a foreign server.
[00:57:34:] scop++
[00:57:45:] I would use my own link checker that does not honor robots.txt instead
[00:58:21:] fine. so that can fall straight into a ban-me tarpit and start generating 403s on every page
[00:58:23:] yes bjoern_: but you can't force people if they don't want to; it really is their choice
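[A rough sketch (not checklink's actual code) of the workaround scop mentions above: consult WWW::RobotRules directly instead of going through LWP::RobotUA, so the checker can report "forbidden by robot rules" rather than silently fail. The "W3C-checklink" agent token and the howto pointer are assumptions.]

    #!/usr/bin/perl
    # Sketch: check robots rules by hand so the result can be *reported*
    # rather than turned into a hard failure.
    use strict;
    use warnings;
    use LWP::UserAgent;
    use WWW::RobotRules;
    use URI;

    my $agent = 'W3C-checklink';    # assumed UA token
    my $ua    = LWP::UserAgent->new(agent => $agent);
    my $rules = WWW::RobotRules->new($agent);

    # Returns true if $url may be visited according to the site's
    # /robots.txt (no readable robots.txt means everything is allowed).
    sub allowed_by_robots {
        my ($url) = @_;
        my $u = URI->new($url);
        my $robots_url = $u->scheme . '://' . $u->host_port . '/robots.txt';
        my $res = $ua->get($robots_url);
        return 1 unless $res->is_success;
        $rules->parse($robots_url, $res->decoded_content);
        return $rules->allowed($url);
    }

    my $link = shift @ARGV or die "usage: $0 <uri>\n";
    if (allowed_by_robots($link)) {
        my $res = $ua->head($link);    # just a HEAD, as noted above
        print "$link: ", $res->status_line, "\n";
    } else {
        # Report instead of failing, with a pointer to a (hypothetical) howto.
        print "$link: forbidden by robot rules, see the howto to allow the checker\n";
    }

[Holding the rules separately like this would also make dom's proposal at [00:53:13:] easy: fetch the primary URI unconditionally and apply the check only to links found inside the page.]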
[00:58:24:] * yod agrees at least with Dom's point about not stopping when the (checked) page is disallowed
[00:58:31:] And typically you use robots.txt for things you don't want to show up on search engines...
[00:59:23:] * yod would like to make a distinction between recursive and non-recursive mode
[00:59:36:] I don't think there is any disagreement about recursive mode, is there?
[00:59:45:] (bjoern?)
[00:59:50:] We could limit the number of pages/host to prevent malicious use
[01:00:42:] even one page could get the checker banned automatically from a site
[01:01:41:] # of pages/host is not too different from "full" robots.txt "compliance", it also produces unsatisfactory results from the author's POV
[01:02:05:] http://www.robotstxt.org/wc/exclusion.html#robotstxt
[01:02:43:] btw, fwiw, LWP does not support the "revised internet-draft" version of the spec
[01:03:10:] ok, the "spec" is clear
[01:03:23:] it's for all robots
[01:03:32:] it's for retrieved documents
[01:03:41:] no mention of indexing.
[01:03:46:] HEAD is not retrieval
[01:04:08:] [[ Robots are often used for maintenance and indexing purposes, by ]]
[01:04:25:] yep
[01:04:33:] maintenance ;) for example
[01:04:42:] <__Yves> HEAD retrieves meta-information, so it is partly retrieval
[01:04:49:] the "spec" talks about "visiting"
[01:04:59:] it says "Note that these instructions apply to any HTTP method on a URL."
[01:05:00:] <__Yves> GET retrieves data and metadata, so not only the content
[01:05:10:] * yod waiting in a corner for the spec bashing to start
[01:05:19:] ahaha
[01:05:33:] It's only a draft...
[01:05:34:] :)
[01:06:01:] <__Yves> if ever crawlers would be willing to start using OPTIONS * :)
[01:06:25:] yeah...
[01:06:40:] and means in web servers to configure OPTIONS...
[01:06:49:] * yod thinks... that we need to find a way to make checklink behave, and that robots.txt is the mechanism
[01:06:59:] * yod would be happy with:
[01:07:47:] 1 - inviting people to be nicer to checklink in their robots.txt
[01:08:15:] 2 - an option (not available in recursive mode?) to ignore the protocol
[01:08:31:] 2--
[01:08:43:] with the default being to follow it, and a note on responsibility + other "behave" mechanisms (timer?)
[01:09:12:] the trouble is, any such option is an open invitation to the malicious
[01:09:24:] there's already the 1 sec delay, not bound to robots.txt as such
[01:09:59:] yeah, I was thinking of increasing the delay when not following robots.txt
[01:10:07:] tell that to a slashdotted site
[01:11:58:] is recursive mode limited to the host of the original page URI?
[01:12:07:] As for the malicious ones... Checklink is an open source Perl program... a truly malicious geek will reactivate whatever he wants anyway. So I think the options can be minimal.
[01:12:22:] bjoern_: host + base URI
[01:12:44:] so it follows only "internal" links?
[01:12:58:] <__Yves> malicious ones don't need that to do a DoS
[01:13:02:] no, the restriction is for *documents*, not links
[01:13:28:] <__Yves> bet more on a user fumbling with a config than on someone wanting to do evil things
[01:13:32:] I mean, if I have a link on x.org to www.w3.org, would it follow links on www.w3.org?
[01:14:04:] depends on the definition of "follow", but yes, it would do the "link checking" on them, i.e. HEAD
[01:14:49:] why?
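[For contrast, a minimal sketch of the polite default, using LWP's stock robot class: LWP::RobotUA honors /robots.txt on its own and enforces a per-host delay. The 1-second delay mentioned above maps to 1/60 of a minute, since delay() counts minutes; the agent token and contact address are placeholders.]

    #!/usr/bin/perl
    # Default "behave" mechanisms: LWP::RobotUA fetches and honors
    # /robots.txt by itself and rate-limits requests per host.
    use strict;
    use warnings;
    use LWP::RobotUA;

    my $ua = LWP::RobotUA->new(
        agent => 'W3C-checklink',            # assumed UA token
        from  => 'link-checker@example.org', # placeholder contact address
    );
    $ua->delay(1/60);    # delay() is in minutes; 1/60 min = the 1 sec delay

    my $link = shift @ARGV or die "usage: $0 <uri>\n";
    my $res  = $ua->head($link);
    # For a URI disallowed by robots.txt, LWP::RobotUA answers with a
    # synthetic 403 ("Forbidden by robots.txt") without contacting the
    # server at all.
    print $res->status_line, "\n";

[An "ignore robots.txt" option as in yod's point 2 would essentially amount to swapping this class back for a plain LWP::UserAgent.]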
[01:14:55:] * dom__ wonders how a bot is supposed to react when robots.txt itself is forbidden from being visited by the robots.txt file
[01:15:09:] * dom__ knows he's looking for trouble :)
[01:15:41:] dom__: http://www.robotstxt.org/wc/norobots-rfc.html section 3.1
[01:15:44:] The bot would be ashamed and hide in the corner of the server...
[01:16:18:] * xover arrives...
[01:16:28:] <__Yves> dom: and you can make it forget this using a Cache-Control: no-cache, no-store
[01:16:28:] what do we do with ?
[01:16:45:] bjoern_: why what? /me lost...
[01:17:09:] Why it would check links on foreign sites in recursive mode
[01:17:49:] well, it is a link checker? note, that is not the same as recursing offsite
[01:17:59:] unhandled ATM
[01:18:42:] oops, misread, it does not check links *on* foreign sites. it does check links *to* foreign sites
[01:20:19:] ok
[01:22:19:] well, we don't seem to have an agreement on that
[01:22:45:] what do we do then?
[01:22:54:] launch 3.9.3 beta
[01:22:59:] get feedback
[01:23:03:] decide what to do
[01:23:13:] yod++
[01:23:18:] (I think the current behavior is fine, although I'd prefer the way I proposed)
[01:24:08:] I think this discussion had interesting points, will re-use that to steer feedback when we go to beta
[01:24:30:] speaking of which, please play with the instance on qa-dev, as well as the markup validator there
[01:24:43:] they have the latest LWP, which we need to try
[01:24:59:] [closing this item]

(later)
[02:22:29:] btw, first cut at documenting the /robots.txt stuff for checklink up @ http://qa-dev.w3.org/wlc/checklink?uri=http%3A%2F%2Fkoti.welho.com%2Fvskytta%2Ft.html
[02:22:36:] wording improvements welcome

** Agenda 2 - CSS validator - progress and priorities **

[01:25:58:] dodji updated libcroco CVS, I am going to have a look at that
[01:26:07:] no progress on the CSS schema
[01:26:28:] <__Yves> ok, so I recently closed some issues, partly by fixing the grammar (which is really thin) and by upgrading JavaCC
[01:27:06:] <__Yves> would be nice to have a test suite (that can act as a regression TS as well)
[01:27:24:] __Yves, I can look at the bugs and prioritize them to some extent
[01:27:29:] <__Yves> also a list of "needs to be fixed in priority" would be nice :)
[01:27:52:] <__Yves> bjoern_: well, what may have a high priority for me might not have the same for others
[01:28:18:] <__Yves> so guidance from users and people interacting with users is welcome :)
[01:29:02:] Well, I would probably give highest priority to those which most users complained about...
[01:29:16:] btw http://www.w3.org/Bugs/Public/buglist.cgi?product=CSSValidator
[01:29:40:] <__Yves> yep, saw this, I found the .not bug there (and fixed it)
[01:29:59:] P2 is quite crowded
[01:30:50:] * scop notes that P2 is the default in Bugzilla
[01:31:07:] <__Yves> yeah, and P1 is for a URI that has moved...
[01:31:24:] http://www.w3.org/Bugs/Public/show_bug.cgi?id=337
[01:31:40:] It should probably be closed as invalid
[01:32:12:] <__Yves> yes
[01:32:23:] <__Yves> so only P2 bugs remain
[01:33:04:] <__Yves> (if the MIME type is good, there is no reason it wouldn't work, regardless of the URI)
[01:33:05:] there is a P5, http://www.w3.org/Bugs/Public/show_bug.cgi?id=399, which should probably have higher priority
[01:33:08:] so ACTION: bjoern to modify priorities in CSSValidator's Bugzilla
[01:33:23:] and ACTION: Yves to fix bugs
[01:33:24:] :)
[01:33:32:] <__Yves> yeah :)
[01:33:34:] what do we do re test suite?
[01:33:42:] <__Yves> ACTION yod to start a test suite :)
[01:33:47:] !!!
[01:34:06:] I have not touched test suites for a while, my bad
[01:34:16:] <__Yves> bjoern: I have a set of files used to test some bugs, they can be used for regression tests, but not more
[01:34:29:] Yves: send that list to me
[01:34:29:] <__Yves> and we perhaps need more than that (from regular stuff to corner cases)
[01:34:37:] <__Yves> and this works also for the markup validator
[01:34:51:] I'll try to work on that within the next 2 weeks
[01:34:51:] <__Yves> (including weird encoding corner cases)
[01:35:07:] I also have a number of test pages/style sheets, a number of them linked from Bugzilla...
[01:35:07:] <__Yves> yod: remind me so that I won't forget
[01:35:18:] I will...
[01:35:35:] ACTION: Yves and Bjoern send olivier a list of "test" case URIs for the CSS validator
[01:35:48:] now I know I will remind you
[01:36:12:] on a related (to the CSS validator) note, the Spanish Office is motivated to handle translation of interfaces and errors
[01:36:38:] I will (tomorrow, I think) work on a plan for translations and maintenance thereof
[01:37:16:] anything else on the CSS validator?
[01:37:30:] should be straightforward for the CSS validator
[01:37:42:] bjoern_: I think so
[01:37:45:] I would like information from sijtsche/plh/whoever on how much CSS3 is supposed to be implemented
[01:37:49:] <__Yves> that should be it (note that with the new JavaCC, performance improved)
[01:38:02:] <__Yves> yeah, so do I, and information on support for other profiles
[01:38:29:] bjoern_: would you like to start a mail thread about it on qa-dev?
[01:38:36:] There are lots of things I am not sure about, whether they are unimplemented or broken...
[01:38:42:] or w-v-c if you prefer
[01:39:18:] I would prefer if you sent them a mail summarizing what's implemented / what they implemented / something like that
[01:39:30:] cc'ing w-v-c/qa-dev
[01:39:31:] fine
[01:39:33:] I will
[01:40:07:] oh, and probably w3c-css-wg
[01:40:10:] ACTION: olivier contact PLH/Sijtsche and ask them what is implemented / to what extent (esp. CSS3)
[01:40:58:] (btw, Bert has an ongoing action item to make sure CSS 2.1 is supported in the CSS validator...)
[01:40:41:] [closing item]

** Agenda 3 - Markup Validator **

[01:42:22:] Markup Validator: not much feedback on 0.6.5b2, beyond style issues
[01:42:41:] bjoern animating interesting discussions
[01:42:58:] without much luck, as I expected...
[01:43:15:] there were answers... from the usual suspects
[01:43:45:] There wasn't much feedback on previous betas either (not counting my comments); it seems we have a general feedback issue
[01:44:08:] Well, this beta was pretty much low-profile
[01:44:19:] compared to others, which were announced much more broadly
[01:44:47:] which did not yield much feedback either
[01:44:55:] add a [...] with a [...]
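[On the test-suite actions above, a minimal sketch of what a regression harness for Yves' bug-test files might look like. Everything here is hypothetical: the check_css wrapper command, the tests/ directory layout, and the .expected files; the same scheme would cover the markup validator (including the weird-encoding corner cases) by swapping the command.]

    #!/usr/bin/perl
    # Hypothetical regression harness: run each .css test file through a
    # validator wrapper and diff its output against a stored expected result.
    use strict;
    use warnings;

    my $validator = 'check_css';    # hypothetical CLI wrapper around the validator
    my @failures;

    for my $css (glob 'tests/*.css') {
        (my $expected = $css) =~ s/\.css$/.expected/;
        unless (-e $expected) {
            warn "no expected output for $css, skipping\n";
            next;
        }
        my $got = `$validator $css 2>&1`;
        open my $fh, '<', $expected or die "cannot read $expected: $!\n";
        my $want = do { local $/; <$fh> };
        close $fh;
        push @failures, $css if $got ne $want;
    }

    if (@failures) {
        print "FAIL: $_\n" for @failures;
        exit 1;
    }
    print "all regression tests passed\n";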