Velocity Reviews - Computer Hardware Reviews

Velocity Reviews > Newsgroups > Programming > Perl > Perl Misc > Is there a better way to search CPAN than search.cpan.org?

Reply
Thread Tools

Is there a better way to search CPAN than search.cpan.org?

 
 
usenet@DavidFilmer.com
Guest
Posts: n/a
 
      10-11-2005
CPAN is great, but it has become quite large (and somewhat haphazardly
organized). The "search.cpan.org" interface has not changed
significantly (except for the temporary orange color) since I first
played around with Perl five years ago. This interface is rather
crude, apparently supporting only simple (but somewhat ordered) OR
searches, without support for notations often found in other search
engines, such as phrase quoting or "+".

The CPAN FAQs mention three other search methods. One is a broken link,
and the other two offer no advantages that I can see. I've played
around with Google's "Advanced Search" against this domain, but the
signal-to-noise ratio is often quite low.

There is often a great Perl module out there that does whatever I want,
but finding it can be really dicey unless I get lucky guessing the
keywords.

Is there a better way to search CPAN??? It seems like a vast treasure
without a map!

 
Reply With Quote
 
 
 
 
Gunnar Hjalmarsson
Guest
Posts: n/a
 
      10-11-2005
wrote:
> Is there a better way to search CPAN???


One way:

http://www.google.com/search?q=site:...n.org+whatever

And don't forget that you optionally can _browse_ per category.

--
Gunnar Hjalmarsson
Email: http://www.gunnar.cc/cgi-bin/contact.pl
 
Reply With Quote
 
 
 
 
usenet@DavidFilmer.com
Guest
Posts: n/a
 
      10-11-2005
Gunnar Hjalmarsson wrote:
> One way:
> http://www.google.com/search?q=site:...n.org+whatever


Yeah, I've played around with that (as I mentioned). But the results
have not been very good for me - I can't see how to restrict the search
to package descriptions (which is usually what I would want to do).
Google searches EVERYTHING, including source code, bug reports, POD,
etc - even archvied versions of every package. When the search is run
against such a wide information base, keywords tend to occur and repeat
all over the place, and I can get dozens of (often irrelevant) hits on
search terms (because the search terms are usually very Perl-ish and
occur widely). If there is a way to restrict Google to search only
current module descriptions, I don't know what it is.

>
> And don't forget that you optionally can _browse_ per category.
>


True, and that is helpful sometimes, but I find that the categories
seem to be generally old and poorly maintained (and not always
intuitively arranged). For example, the category "File Handle
Input/Output" has only sixteen modules (plus two sub-categories, IO and
Log). For some reason, this category includes "Expect" (which I would
have thought would be under "Control Flow Utilities"), but does not
include MANY other packages, such as IO::All (the Swiss Army Knife of
filehandle I/O).

 
Reply With Quote
 
A. Sinan Unur
Guest
Posts: n/a
 
      10-11-2005
wrote in
news: oups.com:

> Gunnar Hjalmarsson wrote:
>> One way:
>> http://www.google.com/search?q=site:...n.org+whatever

>
> Yeah, I've played around with that (as I mentioned). But the results
> have not been very good for me - I can't see how to restrict the
> search to package descriptions (which is usually what I would want to
> do). Google searches EVERYTHING, including source code, bug reports,
> POD, etc - even archvied versions of every package. When the search
> is run against such a wide information base, keywords tend to occur
> and repeat all over the place, and I can get dozens of (often
> irrelevant) hits on search terms (because the search terms are usually
> very Perl-ish and occur widely). If there is a way to restrict Google
> to search only current module descriptions, I don't know what it is.


You can use "Advanced Search":

Occurrences Return results where my terms occur

Here, restrict matches to URL.

http://www.google.com/search?hl=en&l...rl%3A+io+site%
3Asearch.cpan.org&btnG=Search

or http://tinyurl.com/8qgqz

http://www.google.com/search?hl=en&lr=&as_qdr=all&am
p;q=allinurl%3A+www+site%3Asearch.cpan.org&btnG=Se arch

or http://tinyurl.com/c7zvq

Sinan

--
A. Sinan Unur <>
(reverse each component and remove .invalid for email address)

comp.lang.perl.misc guidelines on the WWW:
http://mail.augustmail.com/~tadmc/cl...uidelines.html
 
Reply With Quote
 
Dr.Ruud
Guest
Posts: n/a
 
      10-12-2005
schreef:

> If there is a way to
> restrict Google to search only current module descriptions, I don't
> know what it is.


Include in your search texts that are exclusively (absent) on such
pages.
Example:

allintext: Module.version Annotate.this.POD -Latest.release uri fetch
site:search.cpan.org

where 'uri fetch' is of course the dynamic part.
Leave the -Latest.release out to also find older versions.


Or look for a URL with /README in it:

allinurl: /README io all site:search.cpan.org

where 'io all' is the dynamic part. But that will also find older
versions.


--
Affijn, Ruud <http://www.pandora.com/?sc=sh770781&cmd=tunermini>

"Gewoon is een tijger."

 
Reply With Quote
 
Randy Kobes
Guest
Posts: n/a
 
      10-12-2005
wrote:
> Gunnar Hjalmarsson wrote:

[ ... ]
>
>>And don't forget that you optionally can _browse_ per category.
>>

> True, and that is helpful sometimes, but I find that the categories
> seem to be generally old and poorly maintained (and not always
> intuitively arranged). For example, the category "File Handle
> Input/Output" has only sixteen modules (plus two sub-categories, IO and
> Log). For some reason, this category includes "Expect" (which I would
> have thought would be under "Control Flow Utilities"), but does not
> include MANY other packages, such as IO::All (the Swiss Army Knife of
> filehandle I/O).


Although not as extensive nor as full-featured as search.cpan.org,
one thing I've implemented in our CPAN search engine at
http://cpan.uwinnipeg.ca/htdocs/faqs/cpan-search.html is
an automatic categorization, using AI::Categorizer (to within
some confidence level), of modules that don't have categories
supplied by PAUSE. For example, the "File Handle Input/Output"
categeory:
http://cpan.uwinnipeg.ca/chapter/Fil...e_Input_Output
contains the IO subcategory
http://cpan.uwinnipeg.ca/chapter/Fil...nput_Output/IO
which does include IO-All.

The way these categories are set up is perhaps worth explaining.
When an author uploads a package to PAUSE, there's an option
available to associate registered modules with one of the
major categories. In the above example, IO::All has been associated
with the "File_Handle_Input_Output" category, and so the IO-All
package appears in the File_Handle_Input_Output/IO subcategory.
Similarly, HTTP::Request of libwww-perl is associated with the
"World_Wide_Web_HTML_HTTP_CGI" category, so that libwww-perl
appears in the "World_Wide_Web_HTML_HTTP_CGI/HTTP" subcategory.

There has been discussions of adding a key-words field in the
META.yml file that recent distributions carry, in order to
provide better search results, but the details of this haven't
been finalized.

--
best regards,
randy kobes
 
Reply With Quote
 
 
 
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
GL2 better than the XLs? Consumer grade HDs better than pro-sumer Mini DVs? dh@. DVD Video 1 08-28-2008 07:20 PM
rt.cpan.org, search.cpan.org: why so unuseable? Ben Bullock Perl Misc 12 07-08-2008 12:51 PM
Is splint really better than lint? Is there a better tool than splint? Peter Bencsik C Programming 2 09-21-2006 10:02 PM
Upgrading to CPAN.pm v1.76 install Bundle::CPAN fails carl d. Perl Misc 1 05-10-2005 09:35 PM
search within a search within a search - looking for better way...my script times out Abby Lee ASP General 5 08-02-2004 04:01 PM



Advertisments
 



1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57