g1smd
I looked last week, but didn't use ARIN.
The information is less clear when you do a WHOIS search elsewhere.
That's 3 weeks after their last crawl, about the sort of schedule they hinted at.
Err, the "once per week" reference wasn't for you to do anything at all.
It was in reference to bots maybe checking it once per week.
This info seems to be posted widely, except in this thread...
OrgName: seomoz.org ===== =====
OrgID: SEOMO
Address: dotnetdotcom.org ===== =====
Address: 93 S. Jackson Street 10070
City: Seattle
StateProv: WA
PostalCode: 98104-2818
Country: US
RegDate: 2008-07-07
Updated: 2008-07-07
AdminHandle: NGE11-ARIN
AdminName: Gerner, Nick ===== =====
AdminPhone: +1-206-299-9628
AdminEmail: admin @ dot net dot com .org
TechHandle: NGE11-ARIN
TechName: Gerner, Nick ===== =====
TechPhone: +1-206-299-9628
TechEmail: admin @ dot net dot com .org
From: http://ws.arin.net/whois/?queryinput=seomoz.org [2008-10-21]
93 S Jackson Street is a PO Box service called Earth Class Mail.
I am waiting for the screams of anguish when one of the popular free image-hosting sites vanishes, taking with it hundreds of millions of images from tens of millions of users ... many of whom have no backup copies.
I finally get what Rand is saying: "Linkscape's spiders reach billions of pages" as in "Linkscape's data comes from a system that indexes billions of pages". So "Linkscape's spiders", not "SEOmoz's spiders". That doesn't say who runs the spider/crawler, it just says what type of system the data comes from.
Yes, it doesn't say "SEOmoz has a crawler that...". However, that was the assumption everyone had made. That's a clever usage of words; and Rand blames the listener for their misunderstanding. I think when you say it out load, and change the intonation slightly, suddenly it becomes clear.
Whether it was a calculated effort to be "fuzzy", I have no idea. I asked fairly early on for the UA, as I wanted to make sure I was not blocking it, and i wanted to see how many pages of a site had been analysed before seeing what data had been extrapolated from that.
*** Linkscapes bot ***
Yes, "Linkscape's bot" - the bot that collected the data to build the Linkscape product.
The Linkscape product is the data and the analysis.
"A" bot must have collected it, it didn't just appear out of thin air.
Note that "Linkscape's bot" is not interchangeable with "SEOmoz's bot".
"SEOmoz's bot" would imply a computer sat in the MOZplex, connected to the web, and pulling pages from sites all over the world to analyse.
"Linkscapes's bot" only implies "the bot that collected the data" without saying who owns or where that bot is physically located. It could be owned by a third party and be "hired" by the Moz to collect specific data. In that case it is "Linkscape's bot" the bot what made Linkscape, but it still isn't "SEOmoz's bot".
Yeah, it's all semantics.
Linkscape is a product. SEOmoz is a company. The possessive takes a subtle and different meaning.
So, a list of great designers - but how many build in SEO factors as a part of their designs?
De-dupe can take much more than a month... but the data in WMT often lags "real life" by several days to more than a week anyway.
Story: The Decline of Digg
Story: The Decline of Digg
So, how do you know that the Alexa page view count wasn't simply being inflated by the script users accessing hundreds or thousands of pages per day each, and now they are gone, the page view count is a more true reflection of what it always was?
Microsofties need this stuff more than anything else... yet most IIS sites FAIL with all types of canonicalisation. It just isn't on the radar of the designer, or developer.
Story: Sphinn Japan Launches
What does it run on?
Is it souped-up Pligg, or are have you moved over to something else?
What changes did you have to make, in order to accommodate changes in culture?
I see you use "tag clouds" over there? Is that something that is big in Japan, or a hint as to what is to come here in the future?
What is a "Special User"? Are those the moderators?
How long has it been running? It already has almost 400 users.
Story: Sphinn Japan Launches
I used a browser that is not logged in to here, and I could sphinn stuff over there too.
Story: Sphinn Japan Launches
@suzuzik Heh. Thanks for answering all those questions and making it clear how everything works.
Story already hot at: Announcing SEOmoz's Index of the Web and the Launch of our Linkscape Tool
I'll bet they now have some "interesting" insight into some very scummy tactics that other sites are using.
Who'll be first to build a tool that hooks their API to batch load URLs to report to Google for spamming?
No crawling found in June or July... and I didn't look any earlier than that.
I didn't find anything from 209.xxx.xxx.xxx in September, and it is too early to be thinking about October.
It is possible that not all of those are SEOmoz; some are unidentified.
I have found them in the logs.
In July and August, almost simultaneous requests from two different IP addresses, and then (in August only) one further request a few minutes later:
2008-July-xx - Almost Simultaneous Requests:
209.40.100.248 - 209.40.100.248 - HopOne Internet Corporation
209.160.24.62 - seomoz.org - HopOne Internet Corporation
2008-August-xx - Almost Simultaneous Requests:
209.103.165.202 - client.covesoft.net - HopOne Internet Corporation
209.160.24.62 - seomoz.org - HopOne Internet Corporation
2008-August-xx - A few minutes after the previous request:
209.40.112.202 - 209.40.112.202 - HopOne Internet Corporation
These logs don't record the UA that was presented.
Hmm. On further thought, it may be that the IPs I found in the logs are from some other tool use at SEOmoz, rather than used for this new link data.
I have no way of verifying anything much more, especially with the limited data that I can currently access. Treat my initial IP list, as speculation.
Google follows links out from NOINDEX (meta tag) pages; they treat NOINDEX purely as an instruction to not show the content of that page in the public index.
They still keep a copy of that page in their internal databases, assign PR to the page, and follow links in and out.
One reason that the Mozzers have been quiet online is that most of their staff are attending the SMX East Conference - on the opposite side of the USA (and four time zones different) to where their offices are.
I suspect they have zero time to be online and answering blog posts, forum posts, private messages, and emails - especially when they have at least several panels to speak on, and an expo booth to staff.
I am surprised that all this stuff wasn't a bit better thought through though, before launch - but now they are aware of the concerns, give them a week or two to now get on with whatever needs to be done.



Story: Proof SEOmoz Owns the Dotbot?