Velocity Reviews - Computer Hardware Reviews

Website cleanup

 
 
RICHARD BROMBERG
 
      06-07-2006


A couple of years ago I was tasked with maintaining a web site that had been
built by someone else. I am using FrontPage 2000.

Their work habits were no better than mine and now there are dozens of
images that were uploaded to the Host and likewise dozens of old HTML files
that are completely orphaned, i.e. nothing references them.

Does anyone have a suggestion or know of a utility that will identify these
unused files so I can clean up the site?


 
 
 
 
 
Alan J. Flavell
 
      06-07-2006
On Wed, 7 Jun 2006, RICHARD BROMBERG wrote:

> Their work habits were no better than mine and now there are dozens
> of images that were uploaded to the Host and likewise dozens of old
> HTML files that are completely orphaned, i.e. nothing references
> them.
>
> Does anyone have a suggestion or know of a utility that will
> identify these unused files so I can clean up the site.


Recent versions of Xenu link checker can do that
http://home.snafu.de/tilman/xenulink.html

It is a fast and generally recommended link checker, but it has some known
shortcomings. It uses Windows' own routines for accessing URLs, and
they perform some silent fixups (e.g. correcting "\" to "/") before the
link checker gets to see them, which means that kind of broken link
cannot be found.
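
Since the checker never sees the backslashes, one way to catch them is to search the page source directly. The following is only a rough sketch, not a feature of Xenu: the function name backslash_links is made up for illustration, and the regular expression assumes quoted href/src attribute values.

```python
import re

# Matches href="..." or src="..." (single or double quotes) and captures the value.
ATTR_RE = re.compile(r'''(?:href|src)\s*=\s*(["'])(.*?)\1''',
                     re.IGNORECASE | re.DOTALL)


def backslash_links(html_text):
    """Return the href/src values that contain a backslash, in document order."""
    return [m.group(2) for m in ATTR_RE.finditer(html_text)
            if "\\" in m.group(2)]
```

Running it over each HTML file's text gives a list of targets to correct by hand; unquoted attribute values are not handled.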

See http://members.chello.nl/f.visser3/x...ned-files.html for
third-party documentation.

However, a comment I found elsewhere says that the orphan check doesn't
recognise images that are only referenced from a stylesheet.
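
If you have a local copy of the site, a small script can do a similar check and also look inside stylesheets. What follows is a minimal sketch under several assumptions, not how Xenu works: the names find_orphans, local_target and RefCollector are invented here, the entry pages are assumed to be index.html or index.htm, and a file counts as "referenced" if any HTML or CSS file mentions it, even if that file is itself an orphan. Links to a directory (e.g. "docs/") and anything generated by scripts are not resolved.

```python
import os
import re
from html.parser import HTMLParser
from urllib.parse import urlsplit, unquote

# url(...) references as used in stylesheets and style attributes.
CSS_URL_RE = re.compile(r'url\(\s*["\']?([^"\')]+)["\']?\s*\)', re.IGNORECASE)


class RefCollector(HTMLParser):
    """Collects href/src/background values and url(...) in inline CSS."""

    def __init__(self):
        super().__init__()
        self.refs = []
        self._in_style = False

    def handle_starttag(self, tag, attrs):
        if tag == "style":
            self._in_style = True
        for name, value in attrs:
            if not value:
                continue
            if name in ("href", "src", "background"):
                self.refs.append(value)
            elif name == "style":
                self.refs.extend(CSS_URL_RE.findall(value))

    def handle_endtag(self, tag):
        if tag == "style":
            self._in_style = False

    def handle_data(self, data):
        if self._in_style:
            self.refs.extend(CSS_URL_RE.findall(data))


def local_target(ref, source_dir, root):
    """Map a reference to a path relative to root, or None if it is external."""
    parts = urlsplit(ref)
    if parts.scheme or parts.netloc or not parts.path:
        return None
    path = unquote(parts.path)
    base = root if path.startswith("/") else source_dir
    full = os.path.normpath(os.path.join(base, path.lstrip("/")))
    return os.path.relpath(full, root)


def find_orphans(root, entry_pages=("index.html", "index.htm")):
    """Return files under root that no HTML or CSS file under root refers to."""
    all_files = set()
    referenced = set(entry_pages)
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            full = os.path.join(dirpath, name)
            all_files.add(os.path.relpath(full, root))
            ext = os.path.splitext(name)[1].lower()
            if ext not in (".html", ".htm", ".css"):
                continue
            with open(full, encoding="utf-8", errors="replace") as fh:
                text = fh.read()
            if ext == ".css":
                refs = CSS_URL_RE.findall(text)
            else:
                collector = RefCollector()
                collector.feed(text)
                refs = collector.refs
            for ref in refs:
                target = local_target(ref.strip(), dirpath, root)
                if target is not None:
                    referenced.add(target)
    return sorted(all_files - referenced)
```

Treat the result as a list of candidates only; the log-watching step described below is still worth doing before anything is deleted.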


Personally, when getting rid of believed-to-be-orphaned files, I
set their filemode on the server so that the HTTPD cannot read them,
and then watch out for any corresponding errors in the logs for a
while, before finally deleting them. That way, it's easy to reinstate
any that weren't really orphaned.
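
For anyone who wants to script this, here is a minimal sketch of the same idea. It assumes a POSIX host where you can change file modes, a server that answers 403 when it cannot read a file, and an access log in the common log format. The names quarantine, reinstate and denied_requests, and the JSON manifest, are my own illustration rather than part of any tool.

```python
import json
import os
import re
import stat

# Request line and status code in a common-log-format entry.
LOG_RE = re.compile(r'"(?:GET|HEAD|POST) (\S+)[^"]*" (\d{3})')


def quarantine(paths, manifest_path):
    """Clear all permission bits on each file; save the old modes as JSON."""
    saved = {}
    for path in paths:
        saved[path] = stat.S_IMODE(os.stat(path).st_mode)
        os.chmod(path, 0)
    with open(manifest_path, "w", encoding="utf-8") as fh:
        json.dump(saved, fh, indent=2)
    return saved


def reinstate(manifest_path, only=None):
    """Restore saved modes for all files, or just those listed in `only`."""
    with open(manifest_path, encoding="utf-8") as fh:
        saved = json.load(fh)
    restored = []
    for path, mode in saved.items():
        if only is None or path in only:
            os.chmod(path, mode)
            restored.append(path)
    return restored


def denied_requests(log_lines, url_paths):
    """Return requested URL paths from url_paths that were answered with 403."""
    hits = []
    for line in log_lines:
        m = LOG_RE.search(line)
        if not m or m.group(2) != "403":
            continue
        path = m.group(1).split("?")[0]
        if path in url_paths:
            hits.append(path)
    return hits
```

Any path that denied_requests reports was evidently still in use, so pass it to reinstate; whatever stays quiet for long enough can then be deleted.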

h t h
 
 
 
 
 
RICHARD BROMBERG
 
      06-07-2006
Alan

McAfee SiteAdvisor warns that http://home.snafu.de/tilman/xenulink.html
will download unwanted spyware and other junk, so I don't think I will use
it.

McAfee SiteAdvisor is a free download that rates websites for safety.

Check it out.

 
 
RICHARD BROMBERG
 
      06-07-2006
Thanks

I like that idea better than permanently deleting stuff.


"Stan McCann" <> wrote in message
news:Xns97DB563E6A4BBstanmccann@216.234.192.142...
> I do something similar. Rather than messing with file attributes, I
> rename the files placing a common extension on all files. That way, I
> can delete the whole lot in one go by deleting everything in the web
> space with that file extension. I usually use a file extension like
> 6.7.6 so somepage.html becomes somepage.html.6.7.6 and somegif.gif
> becomes somegif.gif.6.7.6; same for jpg, css, php or whatever. Maybe
> not better, but a bit different.
>
> --
> Stan McCann, "Uncle Pirate" http://stanmccann.us/
> Webmaster, NMSU Alamogordo http://alamo.nmsu.edu/
> Implementing http://blinkynet.net/comp/uip5.html



 
 
Stan McCann
 
      06-07-2006
"Alan J. Flavell" <> wrote in
news la.ac.uk:

> On Wed, 7 Jun 2006, RICHARD BROMBERG wrote:
>
>> Their work habits were no better than mine and now there are dozens
>> of images that were uploaded to the Host and likewise dozens of old
>> HTML files that are completely orphaned, i.e. nothing references
>> them.
>>
>> Does anyone have a suggestion or know of a utility that will
>> identify these unused files so I can clean up the site.


> Personally, when getting rid of believed-to-be-orphaned files, I
> set their filemode on the server so that the HTTPD cannot read them,
> and then watch out for any corresponding errors in the logs for a
> while, before finally deleting them. That way, it's easy to
> reinstate any that weren't really orphaned.


I do something similar. Rather than messing with file attributes, I
rename the files, appending a common extension to all of them. That way, I
can delete the whole lot in one go by deleting everything in the web
space with that file extension. I usually use a file extension like
6.7.6, so somepage.html becomes somepage.html.6.7.6 and somegif.gif
becomes somegif.gif.6.7.6; same for jpg, css, php or whatever. Maybe
not better, but a bit different.
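
In script form, the renaming could look something like the sketch below. The function names park, unpark and purge are invented for this example, and the default suffix is just the 6.7.6 extension mentioned above; it assumes you run it against a copy of the web space you are allowed to modify.

```python
import os


def park(paths, suffix=".6.7.6"):
    """Rename each file by appending suffix; return the new paths."""
    renamed = []
    for path in paths:
        new_path = path + suffix
        if os.path.exists(new_path):
            # Refuse to overwrite a file parked earlier under the same name.
            raise FileExistsError(new_path)
        os.rename(path, new_path)
        renamed.append(new_path)
    return renamed


def unpark(path, suffix=".6.7.6"):
    """Undo park() for one file that turned out to be in use."""
    if not path.endswith(suffix):
        raise ValueError("not a parked file: " + path)
    original = path[:-len(suffix)]
    os.rename(path, original)
    return original


def purge(root, suffix=".6.7.6"):
    """Delete every file under root whose name ends with suffix."""
    removed = []
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            if name.endswith(suffix):
                full = os.path.join(dirpath, name)
                os.remove(full)
                removed.append(full)
    return sorted(removed)
```

Requests for a parked file should show up as 404s in the server log, which tells you what to unpark before calling purge.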

--
Stan McCann, "Uncle Pirate" http://stanmccann.us/
Webmaster, NMSU Alamogordo http://alamo.nmsu.edu/
Implementing http://blinkynet.net/comp/uip5.html
 
 
 
 