Sorry this site requires JavaScript to be enabled in your browser. See the following guide on How to enable JavaScript in Internet Explorer, Netscape, Firefox and Safari. Alternatively you may be blocking JavaScript with an advert-related or developer plugin. Please check your browser plugins.

neyne

 
from neyne 538 Days ago#
Votes: -2

yeah it is nice to get reminded that not everything is allowed when link building. one should have some lines that shouldn't be crossed.


from neyne 606 Days ago#
Votes: 3

Rand not everything that you find in Google Scholar is considered to be a scientific paper. For example, one of the results in that SERP that you cited is authored by one Aaron Mathew Wall and its title is "Search Engine Optimization Book". Sounds familiar?

But let’s take a look at some of the research papers that are found in that GScholar SERP:

For example, this one, dubbed: "Web spam detection via commercial intent analysis". If you get to the PDF (either by having an academic institution access or by paying for a subscription), you will find clear references to the dataset that was used (Webspam-UK2006, referrence #5) and a citation of the paper that describes the methodology used in the analysis (Castillo et al., reference #6), both of which you haven't done (at least not initially).

As for the peer review process, the paper was presented at 3rd International Workshop on Adversarial Information Retrieval on the Web, held in Banff, Canada in 2007. These workshops are organized by AirWeb as a part of their WWW events. AirWeb has a board of directors and an advisory board that reviews all the papers submitted for their conferences. For example, if you go to the Call for Papers page of the WWW2009 conference that was held in Madrid, you will find the following quote:

All papers will be peer-reviewed by at least three reviewers from an International Program Committee. Accepted papers will appear in the conference online proceedings published by the ACM Digital Library and the conference's web site. Authors of accepted papers will retain proprietary rights to their work, but will be required to sign a copyright release form (pdf file) to IW3C2. The Program Committee will select a small number of excellent papers for fast-track journal publication in the ACM Transactions on the Web (ACM TWEB).

Another paper that I found in that same SERP, "Towards agents participating in realistic multi-unit sealed-bid auctions" , was presented at the conference for which the board of directors of IFAAMAS performs the review. So you are wrong when claiming that scientific work in our (and related) fields is not peer reviewed or that it doesn’t provide references to raw data and methods.

Now to the point of the way you presented your data and methods. The only proof you have provided that your data is statistically significant is the mention that your stderr is very small and your claim that you are "rather confident in the robustness of our data" (paraphrasing). It took almost two months from the time I first objected to your correlation studies till you (under mounting pressure) provided a glimpse into the methods of your analysis, by revealing that your stderr is a standard error of a mean and not a standard error of a correlation coefficient.

The moment you provided that fraction of a description of your analysis method, your study was taken apart by an actual PhD in IR, dr. E. Garcia, someone you have personally referred as "one of the world's foremost authorities on the subjects of the IR (Information Retrieval) field & search engine technology" (your words from an interview at SEOMoz). He disputed your method of analysis, your calculations and your conclusions. Hell, he disputed it to such extent that he demanded you to publicly retract your publications using these methods and data. Retraction of an article is not something that is easily demanded in the scientific circles.

Rand, it was you that insisted on calling your research "scientific". I am sure that in the marketing world, giving your product an appealing title works perfectly. However, in the scientific research world, you must provide a bit more "meat" than just sweetened rhetoric. I suggested Ben to submit this research to a scientific paper, not because I think your results are worthy of publishing, but because that would mean that it would undergo a real peer review process. After reading  dr. Garcia’s review, I am not sure it would be approved for publication. And there is the real difference between the world of scientific publishing and web marketing – peer review process prevents bad science from seeing the light of the world, while in web marketing (or any other writing published online) anything can be published and only then (when authors deem it worthy of their time and effort) is the data presented for peer review.

I don’t have a personal problem with you or with your company. Hell, I pay you good money to be able to use your tools. You are obviously a very good marketer and a successful businessman. The fact that you have a large community of people behind you, ready to support you even when you are wrong, is admirable. But you are not a scientist and Ben is not a statistician. He seems like a very nice and smart guy and I am sure he is a great programmer. Hell I am certain he is a much, much better statistician and mathematician than I am. However, this is not about him or you or me or Dr. Garcia. Extraordinary claims require extraordinary evidence. Your claims were extraordinary not only by their nature but also by the way they were presented. Your evidence turned out to be somewhat less spectacular.



from neyne 607 Days ago#
Votes: 2

Rand. as Michael said, we would need the raw data. Not the table of data represented in the graph.

However, from the attached table, it seems like you calculated a mean of correlation coefficients for each SERP and then calculated a stderr of that mean relatively to the population of correlation coefficients you have measured. I will further investigate whether this is the correct way of assessing the significance of correlation


from neyne 608 Days ago#
Votes: 5

I have been all over this discussion on every possible blog/stream/conference. I just want to add a few points, questions:

  • Ben, when you say stderr, what does that mean? That you averaged out all the correlation factors of each parameter for each SERP and checked what the mean standard error is? Or did you perform a significance test for each of the correlations (11,000 of them for each parameter)? It is not clear what you guys did there.
  • Why not publish the data? Raw data, not descriptions of the data, descriptions of the method? That is what is done in science. When Distilled published their analysis of local factors (albeit on a much, much smaller scale), they added a link to the excel file (or Google doc, don't remember what it was). When scientists publish papers, they add supplemental data, which includes all the raw data and all the calculations/experimental methods.
  • There are numerous scientific publications out there, many of which would be most interested in the study of this scale. Why not publish your findings there? This would, of course require submitting all the data and methods to peer review but that is an integral part of a scientific process and if this (and previous) study is indeed scientific; there should be no problem in doing this.
  • An acceptable way of publishing scientific results, that are a product of a study that entails financial benefit to the researcher, is by providing the double blind access to the raw data to an independent party to perform all the analysis. When you publish a "scientific" study that shows how MozRank correlates to rankings better than any other metric, and all the data gathering, methods of analysis and interpretation was done by you, that puts your study outside of the scientific realm. Just ask any researcher whose study was funded by a pharma company that would benefit directly from a certain outcome of that study. If you guys want to call your study "science" (which you did in two separate articles), you have to submit it to a rather rigorous set of rules and limitations. Otherwise you seem like attributing scientific qualities to your work, only for the sake of self-promotion. John Andrews has very eloquently pointed out the distinction between science and marketing at SMX and those two shouldn’t be mixed.
  • As to more general comments that I have read recently all over the place regarding the SEO not being scientific, I have one word for ya - duh! Of course SEO is not science. No one ever claimed that SEO is science. It is a marketing branch. Just like architecture is not (only) science. It, however, includes some science. IR is science. Statistics is science. Psychology is science. SEO can make use of some of the scientific tools to improve their results and their data acquirement methods. Hairdressers are advised to know the chemical activity of the peroxide solutions they use for hair-bleaching, or they could burn the customer's scalp. Does that acquaintance with chemistry make hairdressing a scientific field? No, but they should by all means make use of valid scientific branches and research to improve their craft. The same way that science does not make new buildings beautiful, but architects still have to know scientific principles of engineering to be able to plan beautiful, but safe, buildings.

    Another claim that I hear a lot is how we cannot apply scientific methods to SEO, since Google is a "black box with hundreds of unknown parameters" so we don't really know what is happening.

    You think Google is a black box? Try E. coli. This bacterium has 4400 genes, function of many still not known to us. These genes replicate, jump within and outside the genome, mutate, produce proteins which themself combine and recombine, activate and inactivate under different environmental conditions, produce synergetic and reductive effects with millions of combinatory interactions. Actually, compared to E.coli, Google is a very simple machine. Yet that level of complexity has not stopped us from investigating, learning, deciphering and understanding complex molecular processes that go on within the membranes of those bacteria. And the humanity has benefited greatly from such studies, advance which would have not happened if the complexity of the model organism was taken into the account.

    I am not comparing SEO to molecular biology here, but the principle holds true. Our nature is to poke and investigate that which is unknown. Thousands of years of human history have proven that the scientific method is the best suited and most successful in performing those investigations. If we give up the curiosity and the proven method to satisfy it, there is not much left preventing us from becoming a mindless mass of sheep governed by false claims, popularity contests and principles of form over function...

    Over and out.


    from neyne 748 Days ago#
    Votes: 0

    Hmmm

    The fact that you see elements of your site (or the data contained within them) being scanned and showing in Linkscape in a direct correlation with DotBot's crawls means that on that particular crawl, Linkscape got your pages through the DotBot crawl. On the next round it may get it from some other crawler.

    Did blocking Dotbot completely remove your site as a link source from Linkscape ?


    from neyne 748 Days ago#
    Votes: 1

    Hmmm

    The fact that you see elements of your site (or the data contained within them) being scanned and showing in Linkscape in a direct correlation with DotBot's crawls means that on that particular crawl, Linkscape got your pages through the DotBot crawl. On the next round it may get it from some other crawler.

    Did blocking Dotbot completely remove your site as a link source from Linkscape ?


    from neyne 751 Days ago#
    Votes: 1

    ah now that you put it that way :)))))


    they just seem to keep spilling that feces bucket into the fan opening....


    from neyne 752 Days ago#
    Votes: 0

    Annie , this is exactly what was discussed when Linkscape came out. They didn't want to say what the useragent of the data gathering robot was because it would reveal that the data was partially bought and that the robots were not theirs. Sh*t hit the fan and Rand came up with the metatag NODMOZ (or whatever it was). Heh, that is easy, they parse the head anyways so they could just exclude the data with the tag from the analysis and not from indexing.

    In any case, I don't think the way they are pushing it is worse in its inaccuracy than a lot of other products that are being pushed on us. How many seconds does it take for Win7 to load?


    A lot has been said about the herd mentality of the people supporting Rand and SEOMoz, it would be funny if the Rand criticizing crowd started behaving that way too... Let's not go there.


    from neyne 752 Days ago#
    Votes: 0

    Annie , this is exactly what was discussed when Linkscape came out. They didn't want to say what the useragent of the data gathering robot was because it would reveal that the data was partially bought and that the robots were not theirs. Shit hit the fan and Rand came up with the metatag NODMOZ (or whatever it was). Heh, that is easy, they parse the head anyways so they could just exclude the data with the tag from the analysis and not from indexing.

    In any case, I don't think the way they are pushing it is worse in its inaccuracy than a lot of other products that are being pushed on us. How many seconds does it take for Win7 to load?


    A lot has been said about the herd mentality of the people supporting Rand and SEOMoz, it would be funny if the Rand criticizing crowd started behaving that way too... Let's not go there.


    from neyne 945 Days ago#
    Votes: 0
    Testing, and besides that am sick of twitter sphinns

    from neyne 945 Days ago#
    Votes: 0
    just looking for attention by barking at the big dogs. If it was based on something real, fine but when it’s based on a lack of sense of humor...

    from neyne 945 Days ago#
    Votes: 0
    Harith, this is getting pathetic... in so many ways. dude, you are weird...

    from neyne 945 Days ago#
    Votes: 0
    if i try to sell my car, can i also post it on sphinn ......................

    from neyne 992 Days ago#
    Votes: 0
    perceptive fella mr michael gray

    from neyne 1028 Days ago#
    Votes: 1
    there are quite a bit of shades of gray between the live domain and expired domain.what if i forgot to renew and renew it after it expired ? It still has all my content and everything is as it was, do all the links go away ? does it matter if the whois information is exactly the same as prior to the purchase ? What if I have a new contact information that I remember to update only when I renew the expired domain? If Google can tell when domains expire, how come we can see expired domains with the old cache still in SERPs ?I know for a fact that not all expired domains lose their link equity automatically, even when not falling into the categories that Danny described.In any case, like with everything else about Google, questions like "Does Google do X in a situation Y?" are best answered with "Sometimes it does and sometimes it doesn’t".

    from neyne 1050 Days ago#
    Votes: 0
    if you check out the page that was cached by google, you will see that the page was full of spammy linky content even before. So what most likely happened is that a student brought a page to some good locations for the relevant keywords by using the domain authority and links, posted an ad on the page offering it to the highest bidder and voila....

    from neyne 1050 Days ago#
    Votes: 2
    this is not exactly new information. We have ranked completely new websites on the first page of Yahoo for very competitive (adult) terms only by using keywords in the domain name as far back as 2002. A more interesting spin on this is how having keywords in the domain name could allow you to utilize those keywords more agressively in your on-page optimization, without tripping all kinds of penalties and filters...

    from neyne 1053 Days ago#
    Votes: 1
    kudos for the effort. Thank you so much!

    from neyne 1054 Days ago#
    Votes: 0
    I don’t think you can really say that it does one thing and not the other. You can maybe say that it doesn’t do only one thing.

    from neyne 1054 Days ago#
    Votes: 0
    "I assumed "related" to imply co-citation: if site C links to site A and site B, A and B are related."yeah that is the common knowledge but i have seen far too many examples where this was not the case. I think that there is more to just linking. Even if this is the case, the premise behind it would be based in an assumption according to which sites link only to sites on the same topic and we all know that never happens. So if it is according to link patterns, then the data in TouchGraph is meaningless and if it is according to some other parameter(s) then it is uncomprehensible. It definitely is interesting but very soon it starts being frustrating seeing all those graphs without being able to say something empyrical about the connections...

    from neyne 1054 Days ago#
    Votes: 0
    weeeeellll of course you can uncover backlinks of one node. However, having a whole map of who is linking to whom is a whole different thing. Think about 3-4 money sites being supported by links from a network of sattelite sites. Think fingerprinting linking patterns for heavily promoted sites in competitive niches. I can think of at least 4-5 occasions in the last year where i would pay good money to have that information presented to me in a graphical way. touchgraph is a great tool, don’t get me wrong, I have been using it for quite a while and am learning new things from it all the time, but the problem with it is the fuzzyness of the way Google lists websites as related. What does it mean ? what significance does Google’s definition of "related" bear on value that related sites could potentially transfer through links ? are related sites related onry through linking patterns or is there a semantic factor in play as well ? Lots of questions, not many answers and a situation where i find myself playing with pretty graphs until my whole screen is filled with bobbing spheres and lines and i cannot find my feet in it anymore...

    from neyne 1054 Days ago#
    Votes: 0
    if there was only such tool for backlinks :)

    from neyne 1057 Days ago#
    Votes: 0
    great article. left a comment on the site

    from neyne 1064 Days ago#
    Votes: 0
    great coverage. Thanks for doing it guys!

    from neyne 1076 Days ago#
    Votes: 0
    Excellent story.



    Upcoming Conferences

    Search Marketing ExpoSearch Engine Land produces SMX, the Search Marketing Expo conference series. SMX events deliver the most comprehensive educational and networking experiences - whether you're just starting in search marketing or you're a seasoned expert.



    Join us at an upcoming SMX event:

    Upcoming Webcasts

    Search Marketing Now Learn more about search marketing with our free online webcasts and webinars from our sister site, Search Marketing Now. Upcoming online events include: