On Sat, 8 Jan 2005, Realbot wrote:
> I'm having some problems with a web application of mine.
Forms submission including characters outside of us-ascii is
non-trivial, and isn't in itself a Perl problem.
OT: commentary of mine at
http://ppewww.ph.gla.ac.uk/~flavell/...form-i18n.html
Until one can get that part sorted out to one's satisfaction, any
fiddling around that one might do in one's Perl script would be a bit
pointless, IMHO. And discussion of the web part would be more at home
on comp.infosystems.
www.authoring.cgi (beware the automoderation bot).
> <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
If we assume that the page itself is really coded in utf-8 (note that
in the event of a dispute, the server's actual HTTP Content-type
header wins over anything that you might secrete in a meta
http-equiv), then you can expect current browsers to submit
utf-8-encoded form data. But not-quite-so-new browsers - even some
which support utf-8 display - get utf-8 forms submission sadly wrong.
> <form name="formutfget" method="GET">
In -theory- the method GET supports nothing better than the us-ascii
character coding. But see my commentary for further discussion.
> The strange thing is that running this form under these environments
[...]
> So, it seems that in the former I get escaped unicode character and
> in the latter UTF-8 ones.
It looks as if somebody is trying to ape the misbegotten behaviour of
MSIE.
In a practical sense there isn't one right answer - there are several
compromises, depending on which browsers support what. But none of
the details here are features of the Perl programming language,
AFAICS.
good luck