Learn how Google Sees Your Site - Spider it Yourself
If your are just getting into SEO, or are wondering why your site isn’t showing up in Google results the way you want it, start by trying to understand how Googlebot sees your site. An excellent way to do this is by using your own robot, or a link-checker, to spider your site. I’ve always used Xenu’s Link Sleuth to spider my site. Not only can you find broken or dead links, but you can get some insight into how search engines crawl your site.
This is especially important for dynamic sites. Sites that append session IDs or other session or user-specific data to URLs, or depend on cookies or (ack) javascript for navigation will either not spider at all, or cause duplicate page entries for the same content. Recently, Google has gotten better at handling PHPSESSID, but it’s a good idea to try to remove any extraneous session information until a session is necessary.
Spidering your site will give you a lot of information about how search engines see your site. It’s up to you to interpret those results and work to make sure that what they see reflects the true architecture of your site.
Recently Steamed:
- 07/23/2008: Mochi Media Site Relaunch
- 07/22/2008: Amazon S3 Downtime Visualized
- 07/21/2008: NPR Launches Open API, offers Widgets



You can also use Google’s webmaster tools. Once you sign up and prove you own the site (you have to upload a placeholder file that Google then looks for) you have access to google’s indexing data about your site. It’s free.
http://www.google.com/webmasters/
Rich
Comment by Rich — 8/30/2006 @ 5:53 pm