Posts Tagged ‘broken links’

Quick and dirty way to find broken links on your website

Thursday, July 23rd, 2009

These days I had to find if there was any broken link (error 404) in a group of sites.

I found this to be very useful:

wget –recursive –spider http://levycarneiro.com

This command will download everything from the URL, and generate a report like this:

Found 13 broken links.

http://levycarneiro.com/levy@levycarneiro.com referred by:
http://levycarneiro.com/
http://levycarneiro.com/images/posts/Multiple_models_one_form_NewProject.jpg referred by:
http://levycarneiro.com/category/ruby-on-rails/
http://levycarneiro.com/tag/twitter/levy@levycarneiro.com referred by:
http://levycarneiro.com/tag/twitter/
http://levycarneiro.com/tag/rails/levy@levycarneiro.com referred by:
http://levycarneiro.com/tag/rails/
http://levycarneiro.com/tag/portfolio/levy@levycarneiro.com referred by:
http://levycarneiro.com/tag/portfolio/
http://levycarneiro.com/2009/04/transito-nao-an-experiment-with-twitter-traffic-jams-and-ruby-on-rails/levy@levycarneiro.com referred by:
http://levycarneiro.com/2009/04/transito-nao-an-experiment-with-twitter-traffic-jams-and-ruby-on-rails/
http://levycarneiro.com/tag/traffic/levy@levycarneiro.com referred by:
http://levycarneiro.com/tag/traffic/
http://levycarneiro.com/tag/projects/levy@levycarneiro.com referred by:
http://levycarneiro.com/tag/projects/
http://levycarneiro.com/category/projects/levy@levycarneiro.com referred by:
http://levycarneiro.com/category/projects/
http://levycarneiro.com/tag/ruby/levy@levycarneiro.com referred by:
http://levycarneiro.com/tag/ruby/
http://levycarneiro.com/category/ruby-on-rails/levy@levycarneiro.com referred by:
http://levycarneiro.com/category/ruby-on-rails/
http://levycarneiro.com/category/twitter/levy@levycarneiro.com referred by:
http://levycarneiro.com/category/twitter/
http://levycarneiro.com/tag/ruby-on-rails/levy@levycarneiro.com referred by:
http://levycarneiro.com/tag/ruby-on-rails/

I’ve got some work to do then :)