Published: Jul 02, 2009 - 04:55 am
Story Found By: lorenbaker 1050 Days ago
Category: SEO
8 Comments
8 Comments
Search Engine Land produces SMX, the Search Marketing Expo conference series. SMX events deliver the most comprehensive educational and networking experiences - whether you're just starting in search marketing or you're a seasoned expert.
Join us at an upcoming SMX event:
Learn more about search marketing with our free online webcasts and webinars from our sister site, Search Marketing Now. Upcoming online events include:
Comments
Well, Im not sure how accurate this test is. Wordpress by default pings Ping-o-Matic which sends information about new post/page on Wordpress to over 20 pinging services. If pinging feature hasnt been disabled then the real reason of indexing may be pinging - not the XML sitemap.Unfortunately Micheal didnt metion if he has disabled pinging or not in his post.
Pretty lame exercise. Who really cares about getting a site indexed just to say it can be done?
@sebastian yes pinging was turned off@stevec828 sometimes when you work on large sites with over 100,000 pages youll want to speed up the natural crawling of really deep pages. And yes there are legitimate sites with over 100,000 pages.
Michael, thanks for updated on this. I will run few test on larger sites to see if XML sitemaps will help to get them indexed.
I would be inclined to agree.Creating a sitemap and submitting in Google is a fundamental way of getting indexed.Your site will have no authority, but it will also have no distrust/manipulation penalities.
"sometimes when you work on large sites with over 100,000 pages youll want to speed up the natural crawling of really deep pages."yeah ok...What does getting the home page indexed have to do with getting 100,000 pages indexed? More pages demand more link juice - and while XML sitemaps can alert Google to the existence of new pages, getting them indexed quickly is a whole nother ball game.If you just put up a 100,000 page site with sitemap XML sure you may get a dozen pages indexed but you really arent gonna get very far.Sitemap XML may speed up discovery - if you are working with a big site that updates often but with a limited number of pages crawled/day (e.g. 1000/day on a 1000,000 page site) with enough total domain PageRank then you want to use a sitemap to prod Google into spending that 1000/day crawl budget on new pages instead of old pages it already has in its index - but dont assume installing a sitemap XML will automatically speed up indexing.
Its curious that some people dont see that value in knowing how to get Google to do certain things. Sure, the benefit of indexing a site purely with an XML sitemap may be of limited value on its own, but testing whats possible allows you to be prepared when there is a need to react quickly. Why knock an experiment just because you dont know what to do with the information?
hmmmm... been laying off this tread, but why not put a few pennies in eh? Marios, I think part of the problem is that its a rudimentary understanding here... Halfdeck is entirely correct in that it is more of the discovery aspect and that indexation levels are reliant on other signals (links/authority etc...). Yes, I am a fan of testing and to that end, the post has value, but any SEO that has been around the block a few times should already know how to get a site (or important target pages) indexed...to that end, it is a bit of a yawner.Now, what this test doesnt do, is substantiate the claim that, youll want to speed up the natural crawling of really deep pages. This part Id be interested in a test that has;1. Control set... no sitemaps and only pinging enabled2. Only an HTML site map3. Only XML site map4. Both XML and HTML site mapsThis kind of run (of equally related sites) would tell us that. Id suspect that a site without an XML site map, but a solid internal linking and HTML map would be indexed just as deeply as anything. As we know, at the end of the day, the inbound links/site authority would be the main dictating factor..... as HD mentioned, discovery is different than indexation criteria....Ya know?