Velocity Reviews - Computer Hardware Reviews


trying to collect links on my site, how to crawl it?

 
 
Danny
Guest
Posts: n/a
 
      05-27-2004
I am trying to crawl my site to get a list of links. I am using regular
expressions to extract the href attributes from each page, and fetching the
pages with the XMLHTTP component.

Is there an efficient way to loop through the links? If you start with the
home page, how do you do it? How do you keep track of the pages you have
already visited and which ones to do next?

I am not sure how to go about this.

I tried a loop, but it seemed to take too long to crawl my site.

Thanks again


 
 
 
 
 
Aaron Bertrand [MVP]
Guest
Posts: n/a
 
      05-27-2004
For all the .asp and .html files etc., you could build the list of files to
hit using FileSystemObject, store the list in a database or text file, and
loop through it.
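
A minimal sketch of that idea, written in Python rather than classic ASP so
it can be run and tested on its own. It walks the site's folder on disk,
keeps the files whose extension looks like a page, and writes the list to a
text file. The function names (list_site_files, save_list), the extension
list, and the choice to store site-relative paths are assumptions for
illustration, not something stated in the thread.

```python
import os


def list_site_files(root, extensions=(".asp", ".html", ".htm")):
    """Return sorted site-relative paths of page files found under root."""
    found = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            # str.endswith accepts a tuple, so one call checks all extensions.
            if name.lower().endswith(extensions):
                full = os.path.join(dirpath, name)
                rel = os.path.relpath(full, root).replace(os.sep, "/")
                found.append("/" + rel)
    return sorted(found)


def save_list(paths, out_file):
    """Store the list in a text file, one path per line."""
    with open(out_file, "w", encoding="utf-8") as fh:
        fh.write("\n".join(paths))
```

This only finds pages that exist as files on disk. Pages reachable only
through query strings or generated URLs would not appear in the list.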

--
Aaron Bertrand
SQL Server MVP
http://www.aspfaq.com/
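
If the links do have to be followed over HTTP, as in the original question,
the usual way to keep track of where you have been is a queue of pages still
to fetch plus a set of URLs already seen, so that each page is requested only
once. The following is a sketch under stated assumptions, in Python rather
than ASP: crawl, HREF_RE and the fetch callback are hypothetical names, and
fetch stands in for whatever HTTP component is used (it should return the
page HTML, or None on failure). The regular expression is deliberately
simple and will miss unquoted href values.

```python
import re
from collections import deque
from urllib.parse import urldefrag, urljoin, urlparse

# Simplified pattern: only matches href values wrapped in quotes.
HREF_RE = re.compile(r'href\s*=\s*["\']([^"\']+)["\']', re.IGNORECASE)


def crawl(start_url, fetch):
    """Breadth-first crawl of one site; returns the sorted URLs found."""
    site = urlparse(start_url).netloc
    seen = {start_url}           # every URL ever queued, visited or not
    queue = deque([start_url])   # pages still waiting to be fetched
    while queue:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        for href in HREF_RE.findall(html):
            # Resolve relative links and drop "#fragment" parts.
            link, _fragment = urldefrag(urljoin(url, href))
            if urlparse(link).netloc != site:
                continue         # skip other sites, mailto:, javascript:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return sorted(seen)
```

Because a URL is added to seen at the moment it is queued, pages that link
to each other cannot cause a loop, and the number of requests equals the
number of distinct pages found.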


"Danny" <> wrote in message
news:Wlctc.55736$. net...
> I am trying to crawl my site to get a list of links. I am using regular
> expressions to extract the href attributes from each page, and fetching
> the pages with the XMLHTTP component.
>
> Is there an efficient way to loop through the links? If you start with
> the home page, how do you do it? How do you keep track of the pages you
> have already visited and which ones to do next?
>
> I am not sure how to go about this.
>
> I tried a loop, but it seemed to take too long to crawl my site.
>
> Thanks again
>
>



 