New Canonical Link Tag & Duplicate Content

Posted on

  • email
  • twitter
  • facebook
  • share this

share this



Duplicate content (and its associated penalties/issues) may be a thing of the past now that the three major search engines in Yahoo!, Google, and Microsoft now support a new HTML tag, the <link> tag, which reduces URL duplicates by documenting the preferred URL form to access each page no matter how it is retrieved. In layman's terms, you can now tell the search engines which URL it should have for the current page. If you're concerned that search engines aren't indexing all of your content, using the <link> tag may benefit your SEO efforts greatly.

As it stands, search engines do a fine job of determing the correct URL, but usually only when the page redirects correctly. The redirects can be messy for dynamic sites that pass multiple parameters, for example, session IDs and other forms of variance. When you use the <link> tag, you can indicate the canonical URL form for crawlers to use for each page of content, no matter how it was retrieved. This puts the preferred URL form with the content, so that it is always available to the crawler.

Say for example that you notice the following URLs have been indexed:

http://www.example.com/products?trackingid=feed
http://www.example.com/products?sessionid=hgjkeor2
http://www.example.com/products?printable=yes&trackingid=footer

Specifying a link tag in the <head> section of your page content (as seen below) will indicate to the crawler that the URL is present and that it should be represented as the preferred canonical URL designated:

<link rel=”canonical” href=”http://www.example.com/products” />

The <link> consortium has provided a few technical details which are important to note:

- URL paths can be absolute or relative (though absolute paths are reccommended).
- A <link> tag can only point to a canonical URL form within the same domain and not across domains.
- The <link> tag will be treated similarly to a 301 redirect, meaning that link references will be transferred.
- Canonical links will be considered erroneous and deferred, if tags are not used as intended.

It's anyone's guess whether SEOs and Web professionals will latch onto the canonical link tag. Search engines will most likely adhere to it a majority of the time, but they do reserve the right to do what they want. Joost de Valk has written a post on canonical URL links and provided some plugins for Wordpress, Magento and Drupal.


Some previous posts and articles on duplicate content:

 

:: Create a local presence wherever you want to do business. Get a local phone number. ::


Login To Comment


Become a Member

Not already a part of our community? Sign up to participate in the discussion. It's free and quick.

Sign Up

4 comments

NancyG 02-13-2009 2:33 PM

Is this canonical link tagging available for enterprise search engines that sit behind the firewall?

Peter A. Prestipino 02-13-2009 2:37 PM

A good question. Simple answer no - it's exclusive to the search engines mentioned. Although, if you're using Google or Yahoo! search to power the engine, perhaps.

John AlanR 02-13-2009 3:04 PM

I noticed the phrase "Canonical links will be considered erroneous and deferred, if tags are not used as intended." That got me thinking.

Unfortunately, mistakes happen. Also, some people choose to defy web design standards. It might be pessimistic to say it, but isn't it inevitable that someone will accidentally or purposely misuse the <link> tag?

04-19-2009 8:16 PM

Just came across a post on the Google Webmaster Central blog about their support of the new canonical link tag. This tag is also recognized by Yahoo! and Microsoft....

Add to the discussion!

999 E Touhy Ave
Des Plaines, IL 60018

Toll Free: 1.800.817.1518
International: 1.773.628.2779
Fax: 1.773.272.0920
Email: info@websitemagazine.com

Facebook


Twitter