
NOARCHIVE prevents content from being read by the non-paying audience via cached page copies, but without a NOPREVIEW directive, PDFs and other non-HTML resources are accessible right from Google’s SERPs via the "View as HTML" link.

Is the lack of a NOPREVIEW directive just an oversight, or will it be part of another announcement?
8 Comments

Administrator
from dannysullivan 3760 Days ago #
Votes: 2

Well, they should have included a NOARCHIVE option which would have done the job. But yes, they should provide some option.

from fantomaster 3760 Days ago #
Votes: 0

Incommodating the Web community piecemeal again - nothing new about that, unfortunately. Nice catch, anyway.

from algoholic 3759 Days ago #
Votes: 0

Just treat those files as regular web pages: why not block them with a login/cookie or deny crawlability? This coin has two sides - the value of links...
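
To illustrate the "deny crawlability" option, here is a minimal sketch that checks a path against robots.txt rules using Python's standard `urllib.robotparser`. The `/members/pdf/` directory and the `is_crawlable` helper are hypothetical, not something from this thread. Note that this blocks crawling altogether, which is coarser than a per-document directive.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt rules: keep every crawler out of the directory
# that holds the paid PDFs. The path is an assumption for illustration.
ROBOTS_TXT = """\
User-agent: *
Disallow: /members/pdf/
"""


def is_crawlable(url_path: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the rules above allow user_agent to fetch url_path."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url_path)


if __name__ == "__main__":
    print(is_crawlable("/members/pdf/report.pdf"))
    print(is_crawlable("/public/index.html"))
```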

Moderator
from Sebastian 3759 Days ago #
Votes: 0

Say you’ve a script delivering PDF contents to various user agents coming from various IPs ... perhaps you need this granularity one day :)

from HamletBatista 3759 Days ago #
Votes: 0

Good points. I’d prefer they remove the "View as ..." link if you tell them not to cache the document via ’noarchive’. If each search engine keeps adding non-standard directives and stuff, it will become a mess very soon.

Moderator
from Sebastian 3759 Days ago #
Votes: 0

Well, then my choice would be NOSNIPPET because both the snippet and the HTML version are previews. That would remove the SERP snippet too, probably unwanted. NOARCHIVE applied to a PDF would exactly mean that the PDF is not cached (useless because Google doesn’t provide "cached" links for PDFs on the SERPs) but it tells nothing about the HTML version, that’s how Google handles it ATM. Cached web formats and transformed non-Web formats are different, so I’d rather live with a little more complexity in favour of a precise crawler directive.
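
As a rough sketch of the trade-off described above: assuming the directives are delivered for non-HTML files through an `X-Robots-Tag` HTTP response header (the thread itself does not name the mechanism), a server-side script could pick the header per file like this. The `robots_headers` helper is hypothetical. NOPREVIEW is left out on purpose, since the whole point of the thread is that no such directive exists.

```python
from typing import Dict, Iterable

# Directives mentioned in this thread that can be sent for a PDF.
# NOPREVIEW is deliberately absent: it is only a wished-for directive.
KNOWN_DIRECTIVES = ("noarchive", "nosnippet")


def robots_headers(path: str, directives: Iterable[str]) -> Dict[str, str]:
    """Return extra HTTP response headers for a PDF, or {} if none apply."""
    if not path.lower().endswith(".pdf"):
        return {}
    # Normalize case and silently drop anything that is not a known directive.
    chosen = [d.lower() for d in directives if d.lower() in KNOWN_DIRECTIVES]
    if not chosen:
        return {}
    return {"X-Robots-Tag": ", ".join(chosen)}


if __name__ == "__main__":
    # NOSNIPPET is the workaround discussed above; it also drops the SERP snippet.
    print(robots_headers("/paid/whitepaper.pdf", ["NOARCHIVE", "NOSNIPPET"]))
```

Whether NOSNIPPET actually suppresses the HTML version is the guess made in the comment above, not something this sketch can confirm.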

from paisley 3758 Days ago #
Votes: 0

fyi.. you want your content to be archived.. i.e. indexed.. hint hint.. and if you want your PDFs out of the SERPs, put links to them only on one page and nofollow tag it.. also, you won’t get any kind of benefit from links within the PDF either. hmm.. question (until i go figure it out), is there a way to insert meta data in a PDF...?? hmm.. (wanders off to find some answers)

from paisley 3758 Days ago #
Votes: 0

A-PDF INFO Changer seems to work, now for the experiment.
