Re: [EXT] Re: Misidentified language of an html document

Andreas Prilop 🇮🇱 wrote:

Ian Fieggen wrote:



Warning: This document appears to be written in Afrikaans



This seems to be some kind of bizarre overflow.
The document below contains 128 times the word “English” and is identified as “Shona”.
When you delete one instance of “English”, you get no warning
despite “lang=ru”.


Although fixing the language-detection algorithm would be the better solution, a short-term solution would be to re-write the diagnostic along the lines of the following —

Information: the Nu Html Checker has inferred that the document may be written in Afrikaans — if this is indeed the case, please replace lang="en" by lang="af" (or variant) in the opening <HTML> element.

--
Philip Taylor

This email, its contents and any attachments are intended solely for the addressee and may contain confidential information. In certain circumstances, it may also be subject to legal privilege. Any unauthorised use, disclosure, or copying is not permitted. If you have received this email in error, please notify us and immediately and permanently delete it. Any views or opinions expressed in personal emails are solely those of the author and do not necessarily represent those of Royal Holloway, University of London. It is your responsibility to ensure that this email and any attachments are virus free.

Received on Wednesday, 12 June 2024 09:03:35 UTC