HTML parsing...

viktor@irisz.hu writes:
 > > How about using avh HTML parser (let me know if you need the
 > > URL). It's pretty flexible, and quite fast. 
 > 
 > YES, sure I'm interested! I spent lot of times these days with hunting
 > for tools and docs in this topic, even to discover what the hell
 > SGML is, but didn't found one which addresses this task,
 > and it's written in java. Specialy I want to implement some kind
 > of SSI which fits to my needs. Jesterday finaly made a parser
 > with jax, which can react when founding
 > <jigsaw [name[=value]]+> style tags...

I can't find the URL again, can you try looking with altavista
(htmlpars was included in the name of the file - you might want to try
archie too). Sorry.

 > > Note that parsing the file
 > > should be done in FileResource.updateFileAttributes, so that it gets
 > > done only once, when the file (ie the content) of the resource
 > > changes
 > 
 > I've put it there (in updateFileAttributes), but as I look now, I see
 > that I've override FileResource.initialize:
 > 
 > 	public void initialize(Object values[]) {
 > 		super.initialize(values);
 > 		updateAttributes();
 > 	 }
 > 
 > I hope I had some reasons to do it, but didn't made a note...
 > Sure I found that the title didn't calculated when the resource
 > is created, only when it's updated.

This bug has been fixed, updateFileAttributes is called when created

Anselm.

Received on Friday, 6 September 1996 16:17:25 UTC