- From: Josef Eines <josef@medienor.no>
- Date: Sun, 22 May 2022 21:07:23 +0200
- To: freshnorge@gmail.com
- Cc: html-tidy@w3.org
I have been trying to tidy up the HTML on one of my websites by running it through tidy on Linux. For some reason I can't seem to make it work. I installed it with:

    sudo apt-get install tidy

To "tidy" a page I run:

    curl localhost address | tidy -iq

(Please note that all articles are stored as XHTML files.) From my understanding, -q is quiet mode (it suppresses nonessential output) while -i indents the output, and that fixes the main issue.

I'm trying to tidy up all the HTML on these pages: https://www.fornye.no and https://www.alarmsystem.no

The problem I'm running into is that the UTF-8 text gets converted to US-ASCII and I can no longer read the text file. I must be doing something wrong.

It looks like 'tidy -iq -utf8' should work:

    $ man tidy
    Character encodings
      -utf8
        use UTF-8 for both input and output

but it didn't for me with LANG=C.

Just out of curiosity, what output does 'locale' give you?

This does work for me:

    export LANG=nb_NO.utf8    (or en_US.utf8 or even C.utf8)
    tidy -iq test.html

Regards,
Josef
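For anyone who wants to reproduce this, here is a rough sketch of the two variants discussed above, run against a small test file. The file names sample.html, tidied.html and tidied_locale.html and the sample text are made up for illustration. Whether the locale variant helps may depend on the tidy version and on which locales are installed, so treat it as something to try rather than a guaranteed fix.

```shell
#!/bin/sh
# Hypothetical test file with Norwegian characters (name and content are illustrative only).
printf '%s\n' \
  '<!DOCTYPE html>' \
  '<html><head><meta charset="utf-8"><title>Prøve</title></head>' \
  '<body><p>Blåbærsyltetøy på brød</p></body></html>' > sample.html

# Show the current locale settings, as asked in the message.
if command -v locale >/dev/null 2>&1; then
    locale
fi

if command -v tidy >/dev/null 2>&1; then
    # Variant 1: ask tidy for UTF-8 on both input and output.
    # -i indents, -q suppresses nonessential output.
    tidy -iq -utf8 sample.html > tidied.html

    # Variant 2: set a UTF-8 locale for this one command, as in the message.
    # C.utf8 is assumed to exist on the system; nb_NO.utf8 or en_US.utf8 may be needed instead.
    LANG=C.utf8 tidy -iq sample.html > tidied_locale.html
else
    echo "tidy is not installed; try: sudo apt-get install tidy" >&2
fi
```

Comparing tidied.html and tidied_locale.html against sample.html in a UTF-8 terminal should show whether the Norwegian characters survived in each case.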
Received on Monday, 23 May 2022 06:56:35 UTC