- From: Pete Cordell <petejson@codalogic.com>
- Date: Mon, 18 Nov 2013 18:19:46 -0000
- To: "John Cowan" <cowan@mercury.ccil.org>
- Cc: "Tim Bray" <tbray@textuality.com>, Martin J. Dürst <duerst@it.aoyama.ac.jp>, "Henry S. Thompson" <ht@inf.ed.ac.uk>, "IETF Discussion" <ietf@ietf.org>, "JSON WG" <json@ietf.org>, "Anne van Kesteren" <annevk@annevk.nl>, <www-tag@w3.org>
----- Original Message From: "John Cowan" > Pete Cordell scripsit: > >> Not useless if you're trying to tell the difference between a hand >> editted Windows cp-1252 (or whatever it's called) encoded text file >> and a UTF-8 encoded text file. > > In principle you cannot tell, but in practice it's possible to > discriminate > betwen the two with extremely high reliability. John, Do you mean that the presence of a UTF-8 BOF sequence doesn't prove that it's not Windows cp-1252 or do you mean you can tell apart a UTF-8 and cp-1252 file without BOMs? If the latter, do the relevant tools take the time to distinguish the 2 without BOMs? Thanks, Pete Cordell Codalogic Ltd C++ tools for C++ programmers, http://codalogic.com Read & write XML in C++, http://www.xml2cpp.com
Received on Monday, 18 November 2013 18:26:16 UTC