Re: grounding wordnet in the web

Hi again,

> Hmm... it seems fairly reasonable; but I don't recommend computing
> the CBD on-demand.  I expect "baking" will be more manageable
> than "frying".

Ok, thanks for the suggestions. We don't know yet whether we'll have
big or small iron, but the document can stay agnostic on that issue, at
least for the moment. Changes can be made later if we decide not to
return CBDs after all (baked or not), right?

> Rather than putting big iron behind this service, I suggest you
> throttle it. Aggressively advocate that tools that know that
> they will rely on access to wordnet data in advance cache the
> data they need, and if anybody is making, say, more than 100
> requests per hour, start returning "401 unauthorized; get a cache"
> and if the server gets busy, just return "5xx I'm too busy; you
> might try the _bittorrent bulk download_".

Do you think this is appropriate to put in the document [1], or should
something like this be stated somewhere else? This already presumes
which iron we're going to use. I would like to keep decisions we can't
make yet out of the document so that we can move forward with it; I
hope that's an OK way of going about this.
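
To make the throttling suggestion quoted above concrete, here is a
minimal sketch of what such a policy might look like. It is only an
illustration under stated assumptions, not a decision about the
implementation: the class and method names are hypothetical, 503 is
just one possible choice for the "5xx" in the suggestion, and the
per-client window is tracked in memory.

```python
import time
from collections import defaultdict, deque

# Figures taken from the quoted suggestion: 100 requests per hour.
LIMIT = 100
WINDOW = 3600.0  # seconds


class Throttle:
    """Hypothetical per-client request throttle (illustration only)."""

    def __init__(self, limit=LIMIT, window=WINDOW, clock=time.monotonic):
        self.limit = limit
        self.window = window
        self.clock = clock
        # client id -> timestamps of accepted requests inside the window
        self.hits = defaultdict(deque)

    def status_for(self, client, busy=False):
        """Return the HTTP status code to answer this request with."""
        if busy:
            # Assumed choice for "5xx I'm too busy": point at the bulk download.
            return 503
        now = self.clock()
        recent = self.hits[client]
        # Forget requests that have fallen out of the window.
        while recent and now - recent[0] >= self.window:
            recent.popleft()
        if len(recent) >= self.limit:
            # "401 unauthorized; get a cache", as in the suggestion.
            return 401
        recent.append(now)
        return 200
```

Rejected requests are not counted against the client here, so a tool
that backs off regains access once its older requests age out of the
window.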

> If you're only going to document usage that involves downloading,
> please say that it's due to some sort of limitation, a la:
> 
>   Ordinary lookup[webarch 3.1] of wordnet terms in the Web is in
>   progress but not yet available; for now, we suggest you download
>   the data in bulk ...

I'd like to write the document with both usages in mind (as if they 
were already in place) so that we can agree on how we're going to 
actually implement this. I will try to rephrase the section to make
it clear that both usages are possible.

Thanks,
Mark.

--------
[1] http://www.w3.org/2001/sw/BestPractices/WNET/wn-conversion.html

-- 
  Mark F.J. van Assem - Vrije Universiteit Amsterdam
        markREMOVE@cs.vu.nl - http://www.cs.vu.nl/~mark

Received on Thursday, 23 February 2006 16:31:07 UTC