# Re: xslt transformation and encoding

From: David Carlisle <davidc@nag.co.uk>
Date: Fri, 28 May 2004 14:17:50 +0100
Message-Id: <200405281317.OAA07361@penguin.nag.co.uk>

```

This is a general xml question rather than a mathml one.

if i use <xsl:output encoding="UTF-8"/>, and \usepackage[utf8]
{inputenc},
latex doesn't recognize anything (i use last miktex distribution under
windows);

There are several utf8 packages available for latex (see ctan archives)
if you have the current "december 2003" version of latex the core
inputenc package has utf8 support, so you should be able to go
\usepackage[utf8]{inputenc}

Other alternatives: before processing the file modify the dtd
using a global replace so that for example

<!ENTITY rarr             "&#x02192;" >

is

<!ENTITY rarr             "\rarr " >

Then the character data will come out as TeX macro syntax, and you just
need to define those macros.

A third possibility would be to use saxon7 and XSLT2 character map
functionality which would allow you to specify how you want these
characters to be output (whether or not they were inout as entities)

A final (for now) possibility, and one I have used in the past is to use
the xml output method rather than text and an encoding of US-ASCII
then the characters will be output as "& # 2 1 9 2 ;" rather than being
an error. You just then edit with sed or some other such editor and
replace
& # [0-9]* ;
by
{\U\1}

then your file ends up with lots of macro names such as \U2192 which you
just need to define in TeX to do the right thing...

David

--
The LaTeX Companion
http://www.awprofessional.com/bookstore/product.asp?isbn=0201362996
http://www.amazon.co.uk/exec/obidos/tg/detail/-/0201362996/202-7257897-0619804

________________________________________________________________________
This e-mail has been scanned for all viruses by Star Internet. The