[thelist] site auditing

Jeroen Sangers evolt at jeroensangers.com
Fri Feb 7 12:40:01 CST 2003


j.d. welch <so.there at showtunepink.com> wrote:
> does anyone have a recommendation of a programmatic approach to
> auditing a site for
>
> a) unreferenced documents (pages with nothing linking to them)
> b) unreferenced images (the image isn't called by any of the pages)
>
> ...not necessarily at the same time.  the site in question is a major
> redesign/reorganization, and a lot of old, now-unused cruft has remained.
> it's really too large to do this manually; is there a quick and clever
> way to determine which bits can be thrown out?
>
> tia,
>
> jd

Most site checkers I know have a function to check for orphan files. Have a
look at Linkbot or Xenu's Link Sleuth.
Watch out for files that are only referenced from JavaScript, SSI includes,
etc. A link checker generally cannot see those references, so it will report
the files as orphans even though they are still in use!
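
If you would rather script it yourself, the idea behind an orphan check can
be sketched roughly as follows. This is only a minimal illustration in
Python, not how Linkbot or Xenu actually work: the names find_orphans,
docroot and entry_points are made up for the example, and it assumes you
have the site as static files on disk with index.html as the starting page.
It follows href/src attributes with a simple regular expression, so it has
the limitation mentioned above: anything referenced only from JavaScript or
an SSI include will show up as an orphan.

```python
import os
import re
from urllib.parse import unquote, urljoin, urlparse

# Quoted href="..." / src="..." values, cut off at any "#" or "?".
LINK_RE = re.compile(r'''(?:href|src)\s*=\s*["']([^"'#?]+)''', re.IGNORECASE)
HTML_EXTS = {".html", ".htm", ".shtml"}


def find_orphans(docroot, entry_points=("index.html",)):
    """Return site-relative paths of files not reachable from entry_points."""
    # 1. Inventory every file under the document root.
    all_files = set()
    for dirpath, _dirs, files in os.walk(docroot):
        for name in files:
            rel = os.path.relpath(os.path.join(dirpath, name), docroot)
            all_files.add(rel.replace(os.sep, "/"))

    # 2. Follow links from the entry points, collecting everything reachable.
    reachable = set()
    queue = [p for p in entry_points if p in all_files]
    while queue:
        rel = queue.pop()
        if rel in reachable:
            continue
        reachable.add(rel)
        if os.path.splitext(rel)[1].lower() not in HTML_EXTS:
            continue  # images etc. are counted but not parsed
        with open(os.path.join(docroot, rel),
                  encoding="utf-8", errors="replace") as fh:
            text = fh.read()
        for target in LINK_RE.findall(text):
            # "http://site/" is a dummy base used only to resolve relative URLs.
            parsed = urlparse(urljoin("http://site/" + rel, target))
            if parsed.scheme != "http" or parsed.netloc != "site":
                continue  # external link, mailto:, javascript:, ...
            path = unquote(parsed.path).lstrip("/")
            if path == "" or path.endswith("/"):
                path += "index.html"  # assumed directory index name
            if path in all_files:
                queue.append(path)

    # 3. Whatever was never reached is a candidate for removal.
    return sorted(all_files - reachable)
```

Calling find_orphans("/path/to/site") gives you a list of candidates, both
pages and images. Treat it as a list to review by hand, not a list to delete.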


Kind regards,

Jeroen Sangers

www.jeroensangers.com
www.fimcap.org




