PDF Content Optimization in Google Results

Posted on

  • email
  • twitter
  • facebook
  • share this

share this



Google has been indexing PDF files - a stalwart of the myriad content forms accessible on the web - since 2001, and by our very rough estimate there are over 500 million PDFs currently indexed by the search engine. 

There remains quite a bit of confusion surrounding the use of PDFs in the relation to ranking on search engines, but Google has showed a quick glance at its cards, clearing up at least some confusion today in a post about the format in search results. If you're responsible for website promotion and driving traffic to its pages, pay close attention: 

- Google can index textual content from PDF files provided they are not password protected, but by using OCR (Optical Character Recognition) can extract text from images. To be safe, disconnect the actual text from design elements.

- Links within PDFs are treated similarly to HTML links, passing PageRank and other indexing signals. The links can not have the "nofollow" tag. Since PDF content is crawled, it is important therefore to follow the best practice guidance on the use of keyword-rich anchor text. 

- Google recommends a single copy of content (e.g. HTML, PDF, etc) but has provided a way for content owners/publishers to indicate the preferred URL by specifying the canonical version in the HTML or the HTTP haeders of the PDF. 

- There are two ways to influence the title of the PDF which appears in the search results - the metadata within the PDF and the anchor text of the links pointing to the PDF across the Web. 

Filed under: , , ,

:: Create a local presence wherever you want to do business. Get a local phone number. ::


Login To Comment


Become a Member

Not already a part of our community? Sign up to participate in the discussion. It's free and quick.

Sign Up

1 comment

RichardL 09-02-2011 6:18 PM

I still feel pdf's are still a professional way of doing business; so I am for pdf style. Using pdf style is still a very good way to provide important information even through search engines.

Add to the discussion!

999 E Touhy Ave
Des Plaines, IL 60018

Toll Free: 1.800.817.1518
International: 1.773.628.2779
Fax: 1.773.272.0920
Email: info@websitemagazine.com

Facebook


Twitter