No subject


Sat Oct 14 20:46:50 EDT 2006


that this is the numeric (hexadecimal) entity reference to the 'right
single quotation mark' character, equally represented by the (easier
to identify) character entity reference '&rsquo;'.

http://www.w3.org/TR/REC-html40/charset.html#h-5.3 gives a short (and
good) explanation of the meaning and relationship of character and
numeric entity references.
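
As a quick illustration of that relationship, here is a minimal Python
sketch (Python and its standard html module are my choice for the
example, nothing in this thread uses them). It decodes the hexadecimal
numeric reference, the decimal numeric reference and the named
reference, and checks that all three give the same character.

```python
import html

# Three spellings of the same character in HTML source.
numeric_hex = html.unescape("&#x2019;")   # hexadecimal numeric reference
numeric_dec = html.unescape("&#8217;")    # decimal numeric reference
named = html.unescape("&rsquo;")          # character entity reference

# All three decode to the single code point U+2019 (decimal 8217).
same_character = numeric_hex == numeric_dec == named
code_point = ord(named)

print(same_character, hex(code_point))
```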

>
> When the form data is read the quote becomes \222 . Where do I find
> how this is being encoded so I can put back the quote. or other special
> characters, I found one  reference which said that this was due to UTF-8
> encoding .
>

I'm not sure how or why this would be transcoded to \222, especially
since this chart
http://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references#List_of_HTML_character_entity_references
specifies that the Unicode position is U+2019 (8217).
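
One possible explanation, offered as an assumption rather than
something confirmed for your setup: \222 is octal for the single byte
0x92, and in the Windows-1252 code page that byte is the right single
quotation mark. The Python sketch below (the variable names are made up
for the example) shows how that one byte reads under Windows-1252 and
under ISO-8859-1, and what U+2019 looks like when encoded as UTF-8.

```python
# Assumption: the stored value is the single byte written as \222 (octal).
raw = b"\222"
byte_value = raw[0]                  # 146 decimal, 0x92 hexadecimal

# Read as Windows-1252, the byte is the right single quotation mark.
as_cp1252 = raw.decode("cp1252")

# Read as ISO-8859-1, the same byte is a C1 control character instead.
as_latin1 = raw.decode("latin-1")

# For comparison, UTF-8 spends three bytes on U+2019.
as_utf8_bytes = "\u2019".encode("utf-8")

print(hex(byte_value), as_cp1252 == "\u2019", as_utf8_bytes)
```

If that is what is happening, decoding the stored bytes as Windows-1252
and re-encoding them as UTF-8 before they reach the XML query would be
one way to get the quote back, but check the real encoding first.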

It is most likely happening in your server or application rather than
the user agent, but may involve all three.  What is your application
and server setup?  E.g. Apache 2.0 on Debian running a PHP script
through the PHP module version 5.

What headers are used in the ultimate document(s) being served, and
are any important ones missing?  E.g. <meta http-equiv="Content-Type"
content="text/html; charset=ISO-8859-1">

A good reference on character encoding that doesn't necessarily
involve reading all the standards and RFCs is this Wikipedia article:
http://en.wikipedia.org/wiki/Character_encoding

hth,

Greg

> This is a problem for me since the text containing the quote is stored
> in a data base. An xml database query breaks when it thinks the record
> contains binary data.
>
> Don Leslie
