- 43
- Sphinn It!
Posted By: aimClear 49 days ago
Topic Type: News Story (Jump to http://www.news.com)
Category: Web Analytics
3 Comments
3 Comments
Save the date for:
SMX Madrid (in Spanish, May 20-21)
SMX Advanced - Seattle, WA (June 3-4) Register today!
SMX Local & Mobile - San Francisco, CA (July 24-25) See the agenda, and register now!
SMX East - NYC - (Oct. 6-8)
SMX London - November 4 & 5, 2008
Comments
A great example that goes to show that your internet company is only as strong as your programming department. Now, not to be to harsh, it sounds to me like they are doing something wrong or using the wrong set of tools for the job. Web crawlers have been around for ages, and have been bypassing login pages for just as long.
A very wise man once told me over wine and cream cheese wontons: "If it's on the web you can scrap it".
The real question with these sort of tools, is the level of noise reduction that takes place. Spam can kill the usefulness of tools such as this very quickly.
@scott8723 I'm pretty sure it's not the crawlers that have the ability to bypass login pages to get at protected data, but rather the site owner detectes the crawler and let's it in without authentication.
@BrianChappell that's an excellent point, especially given how much pure marketing goes on inside the social networks.