Bill wrote:
>>>>
>>> He's using HTML::Entities to decode unicode for a bullet, and it does
>>> not seem to work well. Perl support for Unicode over Ā is still
>>> in the works for some modules.
>>>
>>
>> Thanks Bill. Is there another more appropriate choice?
>
>
> Why should you have to decode this anyway? On my system, even decoded,
> it will not display correctly outside of the browser. Why not leave it
> as is?
>
Without going into overmuch detail, for my purposes (not display, but
rather analysis of content) the undecoded characters get royally in the
way. I assume something along the lines of s/&#([0..9]){1,4}/ / will
turn them all into ' ' but had hoped for something a bit better as there
is some useful semantics amongst the rubble.
R
|