W3C home > Mailing lists > Public > www-amaya@w3.org > January to March 2012

RE: How to filter a corrupted html file (corrupted for Amaya) ?

From: Boris Kadir <boris.kadir@lcpcomtech.com>
Date: Fri, 17 Feb 2012 09:45:02 +0100
To: <www-amaya@w3.org>
Message-ID: <6BC0C2F2D95342D2B4B46B33703B039A@ordifixe>
Thank you Scott, Keith and Laurent for your answers and your advices.

Before sending my question, I actually validated the html file against
different version of Html and Xhtml DTD. (I validated the file first in
Dreamweaver, then, following your instructions, in the w3c validator). 

I also opened the file in WordPad and NoteBook and several times saved it in
text format.

After dozens of tries, Amaya progressively accepted the file without
crashing. (At a moment the bug moved at another place and Amaya crashed when
I reached this new point.)

Now I can scan the whole document down and up, what was impossible.
The bug seems to have gone away...

Great thanks and best regards

Boris



-----Message d'origine-----
De : Scott Ferguson [mailto:prettyfly.productions@gmail.com] 
Envoyé : jeudi 16 février 2012 10:01
À : www-amaya@w3.org
Objet : Re: How to filter a corrupted html file (corrupted for Amaya) ?

On 16/02/12 18:52, Boris Kadir wrote:
> Hello,
> 
> I have a problem with a html file: Amaya crashes every time I reach a
> certain place.
> 
> The file seems to be corrupted at this point. (Although I can open and
read
> it without problem in Notepad++, Wordpad, Dreamweaver...)
> 
> How can I filter the file in order to recover a "clean" (for Amaya) html
> file?

You could 'try' pasting it into the w3.org[*1] validate page, and select
the "Clean up Markup with HTML-Tidy"[*2] option - then work with the
"tidied" output.

[*1]
http://validator.w3.org/#validate_by_input+with_options

> How can I detect the source of the problem ?
>

The validator usually makes that easy


<snipped>

> 
> Thank you by advance
> Boris Kadir 
> 
> 
> 



[*2]
I believe their is a MS Windows port, but I've never used it:-
http://tidy.sourceforge.net/



Kind regards
Received on Friday, 17 February 2012 08:45:35 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 7 January 2015 14:53:43 UTC