John, In message "Xpath transform changes and questions" on 00/03/17, "John Boyer" <jboyer@PureEdge.com> writes: > i) Serialization of the root node requires that we output the byte order > mark and xmldecl read by parse() on input. If parse() is not under our > control, we cannot specify that it retains this information. This would > seem to suggest that root node serialization should result in the empty > string, which in turn suggests that serialize should output in UTF-8 > regardless of the input encoding. That would be OK with me. I prefer serializeing in UTF-8 regardless of the input encoding. It is not impossible but painful to get the BOM and the XML declaration with existing XML processors. If the serialization had to encode in non-UTF encoding, it had to check whether each character could be encoded in that encoding. This check is hard to implement in Java. You have to specify also output encoding of string type results of XPath. A string in XPath is character sequence, not octet sequence. -- TAMURA Kent @ Tokyo Research Laboratory, IBMReceived on Wednesday, 22 March 2000 01:12:37 GMT
This archive was generated by hypermail 2.2.0 + w3c-0.29 : Thursday, 13 January 2005 12:10:09 GMT