Velocity Reviews - Computer Hardware Reviews

Velocity Reviews > Newsgroups > Programming > Perl > Where to look for source of HTML::Parser

Reply
Thread Tools

Where to look for source of HTML::Parser

 
 
Himanshu Garg
Guest
Posts: n/a
 
      01-31-2004
Hello,

I am trying to pinpoint an apparent bug in HTML:arser. The
encoding of the text seems to change incorrectly if the locale isn't
set properly.

However Parser.pm in the directory
(/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/HTML/)
doesn't seem to contain the "real" parsing statements.

Could you tell me where to look for the "real" code.

Thank You
Himanshu.
 
Reply With Quote
 
 
 
 
Gunnar Hjalmarsson
Guest
Posts: n/a
 
      01-31-2004
Himanshu Garg wrote:
> I am trying to pinpoint an apparent bug in HTML:arser. The
> encoding of the text seems to change incorrectly if the locale
> isn't set properly.
>
> However Parser.pm in the directory
> (/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/HTML/)
> doesn't seem to contain the "real" parsing statements.
>
> Could you tell me where to look for the "real" code.


HTML:arser requires HTML::Entities, and it's probably the latter you
want to study.

--
Gunnar Hjalmarsson
Email: http://www.gunnar.cc/cgi-bin/contact.pl

 
Reply With Quote
 
 
 
 
Himanshu Garg
Guest
Posts: n/a
 
      01-31-2004
Gunnar Hjalmarsson <(E-Mail Removed)> wrote in message news:<FwFSb.47096$(E-Mail Removed)>...
> Himanshu Garg wrote:
> > I am trying to pinpoint an apparent bug in HTML:arser. The
> > encoding of the text seems to change incorrectly if the locale
> > isn't set properly.
> >
> > However Parser.pm in the directory
> > (/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/HTML/)
> > doesn't seem to contain the "real" parsing statements.
> >
> > Could you tell me where to look for the "real" code.

>
> HTML:arser requires HTML::Entities, and it's probably the latter you
> want to study.

No it is not the one I am looking for. HTML/Entities.pm merely encodes
and decodes entities.

I am looking for where other things like tags, their attributes etc
are handled and I do not find them in either of these.

Nevertheless, I thank you very much for your reply.

Thank You
++imanshu.
 
Reply With Quote
 
Joe Smith
Guest
Posts: n/a
 
      01-31-2004
Himanshu Garg wrote:

> Hello,
>
> I am trying to pinpoint an apparent bug in HTML:arser. The
> encoding of the text seems to change incorrectly if the locale isn't
> set properly.
>
> However Parser.pm in the directory
> (/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/HTML/)
> doesn't seem to contain the "real" parsing statements.
>
> Could you tell me where to look for the "real" code.


Take a look at the beginning of the perl module.
/usr/local/lib/perl5/site_perl/5.8.0/sun4-solaris/HTML/Parser.pm
require DynaLoader;
@ISA=qw(DynaLoader);
HTML:arser->bootstrap($VERSION);
That means Parser.pm uses Parser.so (and Parser.bs).

/usr/local/lib/perl5/site_perl/5.8.0/sun4-solaris/auto/HTML/Parser/.packlist
/usr/local/lib/perl5/site_perl/5.8.0/sun4-solaris/auto/HTML/Parser/Parser.bs
/usr/local/lib/perl5/site_perl/5.8.0/sun4-solaris/auto/HTML/Parser/Parser.so

The *.so files are built from *.h and *.c files.
..cpan/build/HTML-Parser-3.34/hctype.h
..cpan/build/HTML-Parser-3.34/hparser.c
..cpan/build/HTML-Parser-3.34/hparser.h
..cpan/build/HTML-Parser-3.34/Parser.bs
..cpan/build/HTML-Parser-3.34/Parser.c
..cpan/build/HTML-Parser-3.34/Parser.o
..cpan/build/HTML-Parser-3.34/Parser.pm
..cpan/build/HTML-Parser-3.34/Parser.xs
..cpan/build/HTML-Parser-3.34/pfunc.h

--
I love my TiVo - http://www.inwap.com/u/joe/tivo/
 
Reply With Quote
 
 
 
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Can I use a look-ahead and a look-behind at the same time? dan.j.weber@gmail.com Perl Misc 4 04-09-2008 10:25 PM
A Look Inside Microsoft's Open Source Software Lab Waylon Kenning NZ Computing 3 04-10-2006 09:03 AM
Why do look-ahead and look-behind have to be fixed-width patterns? inhahe Python 3 01-28-2005 12:50 PM
<tr> with a 1x1 image as a filler on a table with padding of 2 look thicker in netscape but they look ok in IE. Serial # 19781010 HTML 1 08-10-2003 09:05 PM



Advertisments