decode_entities possible bug?

 
 
Richard Bell
Guest
Posts: n/a
 
      05-29-2004
decode_entities does not appear to decode this text

<span class="linksep1">&#8226;</span>

The sequence &#8226 is left untouched.

Is this correct/expected behavior, a bug, or what?

Thanks for any help.

R
 
Richard Bell
Guest
Posts: n/a
 
      05-31-2004
Bob Walton wrote:
> Richard Bell wrote:
>
>> decode_entities does not appear to decode this text
>>
>> <span class="linksep1">&#8226;</span>
>>
>> The sequence &#8226 is left untouched.
>>
>> Is this correct/expected behavior, a bug, or what?
>>
>> Thanks for any help.
>>
>> R

>
>
> Perhaps you could clarify what "decode_entities" is? Is it some sub or
> module you wrote, or part of a CPAN module? If the latter, which one of
> the 6000+ modules is it a method of? Thanks.
>


Apologies, I should have been clearer. HTML::Entities.

R
 
 
 
Richard Bell
Guest
Posts: n/a
 
      05-31-2004
Bill wrote:
> Bob Walton wrote:
>
>> Richard Bell wrote:
>>
>>> decode_entities does not appear to decode this text
>>>
>>> <span class="linksep1">&#8226;</span>
>>>
>>> The sequence &#8226 is left untouched.
>>>
>>> Is this correct/expected behavior, a bug, or what?
>>>
>>> Thanks for any help.
>>>
>>> R

>>
>>
>>
>> Perhaps you could clarify what "decode_entities" is? Is it some sub
>> or module you wrote, or part of a CPAN module? If the latter, which
>> one of the 6000+ modules is it a method of? Thanks.
>>

> He's using HTML::Entities to decode unicode for a bullet, and it does
> not seem to work well. Perl support for Unicode over &#256; is still in
> the works for some modules.
>


Thanks Bill. Is there another more appropriate choice?

R
 
 
Bill
Guest
Posts: n/a
 
      05-31-2004
>>>
>> He's using HTML::Entities to decode unicode for a bullet, and it does
>> not seem to work well. Perl support for Unicode over &#256; is still
>> in the works for some modules.
>>

>
> Thanks Bill. Is there another more appropriate choice?


Why should you have to decode this anyway? On my system, even decoded,
it will not display correctly outside of the browser. Why not leave it
as is?

 
 
Richard Bell
Guest
Posts: n/a
 
      05-31-2004
Bill wrote:
>>>>
>>> He's using HTML::Entities to decode unicode for a bullet, and it does
>>> not seem to work well. Perl support for Unicode over &#256; is still
>>> in the works for some modules.
>>>

>>
>> Thanks Bill. Is there another more appropriate choice?

>
>
> Why should you have to decode this anyway? On my system, even decoded,
> it will not display correctly outside of the browser. Why not leave it
> as is?
>

Without going into too much detail, for my purposes (not display, but
rather analysis of content) the undecoded characters get royally in the
way. I assume something along the lines of s/&#[0-9]{1,4};?/ /g will
turn them all into ' ', but I had hoped for something a bit better, as
there are some useful semantics amongst the rubble.

R
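
As an alternative to blanking the references out, here is a minimal sketch that decodes them with core Perl only, so the bullet survives as a real character for later analysis. It assumes Perl 5.8 or later (for Unicode strings). The helper name decode_numeric_entities is hypothetical and is not part of HTML::Entities. Named entities such as &amp; are deliberately left alone. Recent HTML::Entities releases on a Unicode-aware Perl are generally expected to decode &#8226; themselves, so this is only a fallback.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of HTML::Entities): decode decimal and
# hexadecimal numeric character references; leave everything else as-is.
sub decode_numeric_entities {
    my ($text) = @_;

    # Decimal form, e.g. &#8226; The trailing semicolon is optional because
    # real-world markup often omits it. Values beyond U+10FFFF are kept
    # verbatim rather than turned into invalid characters.
    $text =~ s/(&#([0-9]{1,7});?)/$2 <= 0x10FFFF ? chr($2) : $1/ge;

    # Hexadecimal form, e.g. &#x2022;
    $text =~ s/(&#[xX]([0-9A-Fa-f]{1,6});?)/hex($2) <= 0x10FFFF ? chr(hex($2)) : $1/ge;

    return $text;
}

my $html    = '<span class="linksep1">&#8226;</span>';
my $decoded = decode_numeric_entities($html);

# Write UTF-8 so printing U+2022 does not raise a "Wide character" warning.
binmode(STDOUT, ':encoding(UTF-8)');
print "$decoded\n";
```

Whether the result displays correctly still depends on the terminal or file encoding, which may be what Bill ran into. For content analysis the decoded string can be matched directly, for example against \x{2022}.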
 