Google Analytics, the cloud and missing numbers #fail
2010-01-26 08:19:38
We ran the backend for a sweepstakes project for ABC/Lost this month -- it was one of those projects you simultaneously love and dread because a company like ABC/Disney can push a LOT of people to the microsite all at once. So, we set up the site on Rackspace's cloud (you can read the details about setting up the project here) and it all worked great.
As a bit of an experiment, we hooked up Google Analytics to the site to track visitors. Now, keep in mind, server stats are always a little tricky, what with bots and other automatic processes hitting your site all the time, but for something as tightly controlled as this sweepstakes site, this was a pretty good test of the technology.
And Google failed.
This conclusion wasn't from an arduous comparison of server logs and weighting what might be a real visit and what might be a bot -- that would be one of those vague, subjective conversations. The fact is that Google reported fewer page views than registrations -- which is impossible as a visitor would have to look at two pages minimum to register. One page to answer the questions, and the thank you page saying 'You got the right!'
Google Analytics is run on the client browser with JavaScript; I know visitors had to have JavaScript turned on to register for the sweepstakes because they couldn't have gotten into the site from ABC without JavaScript turned on. So, either Google is overwhelmed and not picking up the visits as we flood the server, or people are blocking Google's scripts.
No matter what the reason, that kind of discrepancy can't be ignored. We usually create special tracking scripts (on the server side) to filter out all the noise in the server logs, which is kind of the thought behind Google Analytics. But our tests so far show that a client side, JavaScript driven tracker, even by the geniuses at Google, just doesn't work…