- From: Digitome Ltd. <digitome@iol.ie>
- Date: Sat, 19 Apr 1997 15:06:55 +0100
- To: w3c-sgml-wg@w3.org
[Peter Murray-Rust] > >But people are already starting to think of robotic applications of XML. Here >we can see distributed servers with different components. If a user agent >(= robot) gets a document (=data to be processed) and this is not WF, then >it cannot reliably do anything further with that information. Does that not depend on the robot? I mean, if I want a robot to trawl the net for documents older than 1 year with < 1000 bytes containing the phrase "love poem" the robot does not need to concern itself with WF. > Guessing >what the author really meant is a recipe for disaster - we've all seen that with >HTML. I dunno. People seem to want software that guesses. See my previous post re graphing spreadsheet data! > Remember that we are already designing into XML the ability to carry >large collections of links which have to be precise. These will (I assume) be >routinely processed by machines, not humans. Machines that do this sort of stuff should reject non WF docs. This is fine. The problem I see is if this is mandated in the XML spec. I cannot see how this can be enforced anyway. No-one can stop my Perl scripts from forging on despite WF errors! I can see a lot of sense in mandating that assumptions made by user agents should be available to users however. >I can see the value of trying to recover document content *for human >consumption*, but not for machine consumption. Again I think it depends on the task the machine is performing. > >[I hope it didn't come across that I have anything other than total admiration >for James Clark's software, as I have stated before. Indeed I couldn't be >using SGML if it didn't exist.] Not at all. Like you I probably would not have got my head around SGML if it were not for James' tools. I was simply making the point that the likes of nsgmls foo.sgm | grep -c "^(BAR$" can be a useful thing to do even if foo.sgm markup contains errors. Sean
Received on Saturday, 19 April 1997 10:30:26 UTC