Growing Toolset for Managing SEO Indexing, Crawling and PageRank Flow
Thursday, March 12th, 2009
With the recent introduction of the canonical link tag, search engines are starting to give us a fairly comprehensive set of tools for managing how a website is crawled and indexed. These tools have developed over time; they are a bit ad hoc and overlap in confusing ways, but they now solve some traditionally thorny SEO problems.
I thought it would be good to sit back and take inventory of these tools, and how we can use them.
First of all, here are some of the issues we’re trying to solve:
- Keeping search engines from indexing pages we don’t want them to index.
- Keeping search engines from crawling pages we don’t want them to crawl.
- Keeping search engines from giving PageRank to certain pages (whether on our site or on another site).
- For pages whose URLs vary because of parameters, capitalization, different pathways, etc., getting search engines to index just one version of the URL and to consolidate onto it all the PageRank the other URL formats receive.
- Removing pages we no longer want from the index.
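To make the URL-variation issue concrete, here is a minimal Python sketch of how several variants of a URL might be collapsed into one preferred form and exposed through a canonical link tag. The `IGNORED_PARAMS` list, the function names, and the decision to lowercase the path are all assumptions made for illustration; every site has to decide for itself which variations really point to the same page.

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical list of parameters that do not change what the page shows.
IGNORED_PARAMS = {"utm_source", "utm_medium", "utm_campaign", "sessionid"}


def canonical_url(url: str) -> str:
    """Collapse common URL variations into one preferred form."""
    parts = urlsplit(url)
    # Scheme and host are case-insensitive, so lowercasing them is safe.
    scheme = parts.scheme.lower()
    netloc = parts.netloc.lower()
    # Assumption: this site serves the same page regardless of path case.
    path = parts.path.lower() or "/"
    # Drop ignored parameters and sort the rest so that parameter order
    # does not create extra variants.
    kept = sorted(
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in IGNORED_PARAMS
    )
    # The fragment is dropped because it is never sent to the server.
    return urlunsplit((scheme, netloc, path, urlencode(kept), ""))


def canonical_link_tag(url: str) -> str:
    """Build the canonical link tag to place in the page's <head>."""
    href = escape(canonical_url(url), quote=True)
    return '<link rel="canonical" href="%s" />' % href


print(canonical_link_tag("http://Example.com/Widgets?utm_source=x&b=2&a=1"))
```

Every variant that maps to the same output here would carry the same tag, which is the hint search engines use to pick one version to index.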
To manage these issues, we now have some good tools:


