TweetmemeBot's Invalid User-Agent String

This post is an attempt to officially document the Tweetmeme bot's invalid User-Agent string.  Maybe someone from Tweetmeme will see this and configure their bot to use a more appropriate User-Agent string.  I've already alerted Tweetmeme to the concern via Twitter, but no one has responded.

Here's the problem.  The Tweetmeme bot claims it's Firefox 3.0.6 as shown here:

Mozilla/5.0 (Windows; U; Windows NT 5.1; en-GB; rv:1.9.0.6) \
Gecko/2009011913 Firefox/3.0.6 TweetmemeBot

Unless the TweetmemeBot is actually a Firefox web browser (which I doubt), this is an inappropriate and invalid User-Agent string.
Misrepresenting yourself, or your bot, in the User-Agent string is frowned upon for a number of reasons:

  • It skews web statistics and other reports that webmasters and bloggers run against their server logs.  Since the TweetmemeBot says it's Firefox, most log analysis tools (e.g., webalizer) will count the TweetmemeBot as a hit from a user running Firefox.  But it's not a real hit from an actual user; it's just the TweetmemeBot.

  • A fake User-Agent string makes it harder for webmasters and administrators to deny access to bots they don't like, or bots that don't play fair.
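To make both points concrete, here is a minimal Python sketch, not a description of how webalizer or any other specific tool works.  It assumes a simple analyzer that only looks for the browser token, and compares that with a check that looks for bot tokens first.  The names `KNOWN_BOT_TOKENS`, `naive_browser`, `is_known_bot`, and `count_hits` are hypothetical and exist only for this illustration.

```python
from collections import Counter

# The User-Agent string quoted in this post, joined onto a single line.
TWEETMEME_UA = (
    "Mozilla/5.0 (Windows; U; Windows NT 5.1; en-GB; rv:1.9.0.6) "
    "Gecko/2009011913 Firefox/3.0.6 TweetmemeBot"
)

# Hypothetical list of bot tokens (an assumption for this sketch);
# a real site would maintain its own list.
KNOWN_BOT_TOKENS = ("tweetmemebot",)


def naive_browser(user_agent: str) -> str:
    """Classify by browser token only, as a simple log analyzer might."""
    return "Firefox" if "Firefox/" in user_agent else "Other"


def is_known_bot(user_agent: str) -> bool:
    """Return True if the string contains any known bot token."""
    ua = user_agent.lower()
    return any(token in ua for token in KNOWN_BOT_TOKENS)


def count_hits(user_agents):
    """Tally hits per category, checking bot tokens before browser tokens."""
    counts = Counter()
    for ua in user_agents:
        category = "Bot" if is_known_bot(ua) else naive_browser(ua)
        counts[category] += 1
    return counts
```

With the string above, `naive_browser(TWEETMEME_UA)` reports a Firefox user, while `is_known_bot(TWEETMEME_UA)` flags it only because the token happens to be on the list.  A webmaster who wants accurate counts, or who wants to deny access, has to know about that trailing token ahead of time.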

As a side note, I should also add that the TweetmemeBot does not obey the restrictions and limitations I configured in my root robots.txt file.
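For reference, this is roughly what a well-behaved crawler is expected to do before fetching a page.  The sketch uses Python's standard `urllib.robotparser` module.  The rules in `EXAMPLE_ROBOTS_TXT` are hypothetical and are not the contents of my actual robots.txt file; `example.com` is a placeholder host.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules for illustration only (an assumption, not my real file).
EXAMPLE_ROBOTS_TXT = """\
User-agent: TweetmemeBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text into a RobotFileParser without any network access."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def may_fetch(parser: RobotFileParser, agent_token: str, url: str) -> bool:
    """Ask whether a crawler identifying as agent_token may fetch url."""
    return parser.can_fetch(agent_token, url)
```

Under these example rules, `may_fetch(build_parser(EXAMPLE_ROBOTS_TXT), "TweetmemeBot", "http://example.com/index.html")` is `False`, so a compliant bot would skip the page entirely.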

Of course, like any good online citizen, I won't end the conversation without making a positive suggestion.  To resolve this concern, the TweetmemeBot should change its User-Agent string to something more distinctive and appropriate:

Mozilla/5.0 (compatible; TweetmemeBot/1.0; +http://www.tweetmeme.com/)
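One practical benefit of this format is that it is easy to parse.  The following Python sketch pulls the bot name, version, and contact URL out of a string in the suggested `(compatible; Name/Version; +URL)` style.  The pattern and the function name `parse_bot_user_agent` are my own assumptions for illustration, not part of any standard library or of Tweetmeme's software.

```python
import re

# Matches the "(compatible; Name/Version; +URL)" comment used in the
# suggested string. This pattern is an assumption made for this sketch.
BOT_UA_PATTERN = re.compile(
    r"\(compatible;\s*(?P<name>[^/;\s]+)/(?P<version>[\d.]+);\s*\+(?P<url>[^)\s]+)\)"
)


def parse_bot_user_agent(user_agent: str):
    """Return a dict with name, version, and url, or None if the format differs."""
    match = BOT_UA_PATTERN.search(user_agent)
    return match.groupdict() if match else None
```

Given the suggested string, the function returns the name `TweetmemeBot`, the version `1.0`, and the URL `http://www.tweetmeme.com/`.  Given the current Firefox-style string, it returns `None`, because nothing in that string follows this self-identifying format.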

If I'm misinformed and I'm completely wrong about the TweetmemeBot, I would encourage the Tweetmeme team to contact me or find me on Twitter.


About Mark

A Silicon Valley native, Mark Kolich is a full-time Software Engineer, a casual entrepreneur, and a consultant for hire. A web technologies expert, his current focus is on building powerful and robust cloud-driven web-applications using Java, PHP, Perl, AJAX, DHTML, CSS, and JavaScript. His favorite programming languages are PHP, Java and JavaScript. He uses Linux, enjoys biking to work, loves building great software, and always writes elegant, readable, and maintainable code.


About this Entry

This page contains a single entry by Mark Kolich published on April 12, 2009 1:32 PM.
