Published: Jan 22, 2009 - 04:29 pm
Story Found By: mvandemar 1581 Days ago
Category: Vertical Search
15 Comments
15 Comments
Search Engine Land produces SMX, the Search Marketing Expo conference series. SMX events deliver the most comprehensive educational and networking experiences - whether you're just starting in search marketing or you're a seasoned expert.
Join us at an upcoming SMX event:


Learn more about search marketing with our free online webcasts and webinars from our sister site, Digital Marketing Depot. Upcoming online events include:
Comments
Great find! Im e-mailing all of my clients right now to make sure they allow crawling of their search results pages.
Im disappointed that this change to the robots.txt seems to be a pointless political gesture. I thought Obama was trying to keep it real and avoid the spin. Are they hoping the uneducated masses see this robots.txt change as supporting Obamas new open information policy? Apparently yes.
Actually, Nick, this wasnt political in the slightest. They wiped the old site and replaced it with a completely new one, which makes perfect sense. All of the printer friendly pages that composed the vast majority of the old robots.txt dont exist anymore, and in fact none of the new pages even have print versions.What other people, including news sources such as the BBC, think about it is not the Whitehouses fault at all.
@MvandemarYep you maybe right, too many years of New Labour spin always has me assuming the worse. Poor reporting/understanding on the side of the BBC could be at fault.
Lol it looks like Matt got caught up in Obama mania being out there in DC for the inauguration.Good catch Michael, Im now off to try and get thousands of search pages indexed!
Take it easy, folks :-) Matt Cutts hasnt Endorsed indexing of search results In Google, at all.Matt was talking about "Crawling" not about "Indexing". Read for yourself what Matt said:"Youve probably seen this, but I like the new whitehouse.gov robots.txt: http://bit.ly/x4G7 Much more crawling allowed."And as you know there is a difference between "Crawling" and "Indexing" :-)
Harith, true, but if the pages are all search pages and duplicate content printer-friendly pages, why would Google want to crawl them unless they intend to index them.Also, how would you keep search pages out of the index asside from using the robots.txt file? Thats pretty much the most efficient method that I can think of.
Skitzzo,I understand and appreciate :-)However, not all craweled materials would be indexed in Google. I.e. "Much more crawling allowed" doesnt necessary mean to index the whole crawled content. ;-)
Harith, Matt on multiple occasions has suggested using robots.txt to block this type of auto-generated content, and suggests that if you dont do so you are in violation of Google Webmaster Guidelines.The purpose of crawling is to find content to index, and the purpose of blocking it with robots.txt as he suggested is to keep it out of the index.I know he doesnt actually want those in the serps, and so does everyone else. What Matt did was he said something that sounded good at the time, instead of something that actually meant anything. The post was making fun of that. :)
mvandemar"The post was making fun of that. :)"However, I wouldnt be surprised to see Matt chiming in:mvandemar, I dont endorse indexing of search results on Google :-)
In other breaking news: matt cutts fartsAbout somebody will write a blog post about that
I was only commenting on the fact that the new robots.txt allowed much more crawling. Which is why I used the word "crawling." Googles position on indexing search results has not changed.
Hold on, hold on... last time I checked my payslip had my company name at the top... Matt were you originally saying was that WE the seos and marketing people of the world should filter our own content because Google wasnt up to the job with duplication?...but it is now so we dont need to???If the engine cant remove duplicates or has an issue with it then improve the engine... but for every in the know SEO there are a thousand legitimate sites which (for example) hold duplicate copy for print purposes or in PDFs). You cant be expecting these people to do the engines filtration work?
I was only commenting on the fact that the new robots.txt allowed much more crawling. Which is why I used the word "crawling." Googles position on indexing search results has not changed.Actually, Matt, if you read he post I discussed where you highlighted the fact that the Google Webmaster Guidelines were changed to instruct people to block that kind content using robots.txt:http://www.mattcutts.com/blog/search-results-in-search-results/You did in fact advise people to block that content from getting crawled, and to do so to keep it out of the index.
Misinterpreting one 140 character tweet by Matt Cutts doesnt reverse Google policy of many years, I would have to say this is false information