html special characters and web forms
lan4pal at newcomb.mv.com
Mon Oct 10 14:51:01 EDT 2005
I thought '\222' was base-8 (octal) for "Right Single Quotation Mark,"
which is really 'rsquo' (8217, x2019) in the "newer" standards.
MS's Windows-1252 character codes used most of the positions from
\200 (=128=x80) to \237 (=159=x9F), and \222 is 146 (x92). (A quick
check of Greg's link reminded me about MS's character codes; if curious,
see http://en.wikipedia.org/wiki/Windows-1252.) These still show
up often on the web, I assume from non-updated pages. Most
browsers still display these items (e.g., the trademark symbol
is 8482, but 153 displays it as well in Opera and Mozilla/Firefox
[and I assume still in MSIE]).
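For anyone who wants to check the arithmetic, here is a small Python sketch (Python is my choice for illustration, not something from Don's setup). It assumes the \222 really is a single Windows-1252 byte, which is my guess rather than something confirmed in this thread.

```python
# Octal \222 is the single byte 0x92 (decimal 146).
raw = b"\222"
assert raw == bytes([0x92]) == bytes([146])

# Read as Windows-1252, that byte is the right single quotation mark.
quote = raw.decode("cp1252")
print(hex(ord(quote)), ord(quote))

# Read as strict ISO-8859-1, the same byte is a C1 control character,
# which is likely why downstream tools treat it as binary data.
control = raw.decode("latin-1")
print(hex(ord(control)))

# The trademark case mentioned above: byte 153 versus code point 8482.
tm = bytes([153]).decode("cp1252")
print(ord(tm))
```

The point is only that the byte value and the Unicode code point are different numbers for the same character, depending on which encoding is used to read the byte.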
Not that this solves the problem, just a bit of web history,
Lee A. N.
-------- Original Message --------
Subject: Re: html special characters and web forms
Date: Mon, 10 Oct 2005 09:08:36 -0400
From: Greg Rundlett <greg.rundlett at gmail.com>
To: Donald Leslie {74279} <dml at swlmail.msd.ray.com>
CC: gnhlug-discuss at mail.gnhlug.org
On 10/9/05, Donald Leslie {74279} <dml at swlmail.msd.ray.com> wrote:
.
.
.
> When the form data is read the quote becomes \222. Where do I find
> how this is being encoded so I can put back the quote or other special
> characters? I found one reference which said that this was due to UTF-8
> encoding.
>
I'm not sure how or why this would be transcoded to \222, especially
since this chart
http://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references#List_of_HTML_character_entity_references
specifies that the Unicode position is U+2019 (8217).
It is most likely happening in your server or application rather than
in the user agent, but it may involve all three. What is your application
and server setup? E.g. Apache 2.0 on Debian running a PHP script
through the PHP module version 5.
What headers are used in the ultimate document(s) being served, and
are important ones missing? E.g. <meta http-equiv="Content-Type"
content="text/html; charset=ISO-8859-1">
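To make the charset question concrete, here is a rough Python sketch of one way to repair the submitted text before it reaches the database. It assumes the form bytes are either UTF-8 or Windows-1252, which has not been confirmed for your setup. Browsers generally treat a page labelled ISO-8859-1 as Windows-1252, so a curly quote can arrive as the single byte \222. The function names decode_form_bytes and to_ascii_xml are made up for this example.

```python
from xml.sax.saxutils import escape


def decode_form_bytes(raw: bytes) -> str:
    """Decode submitted bytes: try UTF-8 first, then fall back to Windows-1252."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # A few bytes (e.g. 0x81) are undefined in cp1252; replace them
        # with U+FFFD instead of raising a second error.
        return raw.decode("cp1252", errors="replace")


def to_ascii_xml(text: str) -> str:
    """Escape &, <, > and turn every non-ASCII character into a numeric reference."""
    return escape(text).encode("ascii", errors="xmlcharrefreplace").decode("ascii")


submitted = b"Don\222t"          # what the script sees for a curly apostrophe
text = decode_form_bytes(submitted)
print(to_ascii_xml(text))
```

Storing the pure-ASCII result avoids the "binary data" complaint from the XML query. Storing real UTF-8 and declaring it consistently everywhere would be the other option.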
A good reference on character encoding that doesn't necessarily
involve reading all the standards and RFCs is this Wikipedia article:
http://en.wikipedia.org/wiki/Character_encoding
hth,
Greg
> This is a problem for me since the text containing the quote is stored
> in a data base. An xml database query breaks when it thinks the record
> contains binary data.
>
> Don Leslie