What is Google Trends data — and what does it mean?
A little more than a year ago, we made Google Trends data available in real time; and increasingly, it’s helping people around the world explore the global reaction to major events.
The vast amount of searches — trillions take place every year — make Google Trends one of the world’s largest real time datasets. Examining what people search for provides a unique perspective on what they are currently interested in and curious about.
So when a big news story happens, how can you best interpret this data?
What is Trends data?
Trends data is an unbiased sample of our Google search data. It’s anonymized (no one is personally identified), categorized (determining the topic for a search query) and aggregated (grouped together). This allows us to measure interest in a particular topic across search, from around the globe, right down to city-level geography.
You can do it, too — the free data explorer on Google Trends allows you to search for a particular topic on Google or a specific set of search terms. Use the tool and you can see search interest in a topic or search term over time, where it’s most-searched, or what else people search for in connection with it.
There are two ways to filter the Trends data: real time and non-real time. Real time is a random sample of searches from the last seven days, while non-real time is another random sample of the full Google dataset that can go back anywhere from 2004 to ~36 hours ago. The charts will show you either one or the other, but not both together, because these are two separate random samples. We take a sample of the trillions of Google searches, because it would otherwise be too large to process quickly. By sampling our data, we can look at a dataset representative of all Google searches, while finding insights that can be processed within minutes of an event happening in the real world.
It’s a unique and powerful dataset, which can complement others, like demographic data from the census, as shown here in the Washington Post. As a sample, it gives us a way to analyse what people are searching for in real time as events unfold. But combining data can be tricky — for instance, it doesn’t make sense to compare Google Trends to other Google datasets, which are measured in different ways. For example, AdWords is meant for insights into monthly and average search volumes, specifically for advertisers, while Google Trends is designed to dig further into more granular data in real time.
What do the numbers mean?
Google Trends is a powerful tool for storytelling because it can allow us to explore the magnitude of different moments and how people react to those moments. We can look back and compare different terms against each other, like how different sports have ranked since 2004. We also can take the total searches for an event to help understand its sheer magnitude. When we released our 2015 Year in Search, we found there were astoundingly over 439 million searches on Google when Adele came back with ‘Hello’.
What’s most useful for storytelling is our normalized Trends data. This means that when we look at search interest over time for a topic, we’re looking at that interest as a proportion of all searches on all topics on Google at that time and location. When we look at regional search interest for a topic, we’re looking at the search interest for that topic in a given region as a proportion of all searches on all topics on Google in that same place and time.
For instance, if we look at the Trends around Bernie Sanders, we can see that Vermont has the highest search interest in the current senator. This is because of all states, Vermont has the highest percentage of searches for Sanders out of all searches in that state. If we had looked at raw data rather than normalized values, we would’ve seen larger states with higher populations rise to the top of the ranks.