{"id":50,"date":"2017-04-26T05:06:15","date_gmt":"2017-04-26T05:06:15","guid":{"rendered":"http:\/\/www.analyticsbook.org\/?page_id=50"},"modified":"2021-02-19T01:28:16","modified_gmt":"2021-02-19T01:28:16","slug":"google-analytics-sampled-data","status":"publish","type":"page","link":"https:\/\/www.analyticsbook.org\/google-analytics-sampled-data\/","title":{"rendered":"Google Analytics Sampled Data"},"content":{"rendered":"
\t\t\t\t\t\t\t
\n\t\t\t\t\t\t\t\t\t\t\t\t\t\tGordon Choi
\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\tAuthor<\/p>\n
<\/p>\n
Google Analytics data sampling can be a major issue. Google Analytics shows sampled data in the reports when appropriate criteria are met.<\/p>\n
If a Google Analytics property is collecting an amount of data which has exceeded a single property’s data size limit, then this Google Analytics property starts showing sampled data in the reports.<\/p>\n
Data sampling happens when:<\/p>\n
When data sampling happens, your reports start losing accuracy in detailed data and Google Analytics may display a message telling you the report is based on sampled data, such as:<\/p>\n
\u201cThis report is based on 100,000 sessions (10.00% of sessions).\u201d<\/p>\n
For example, there were 1,000,000 sessions in your selected date range, Google Analytics took 100,000 sessions (10.00% of sessions) to calculate your report metrics and then multiply by 10 to achieve the totals.<\/p>\n
Assume Google Analytics has recorded 10,000 sessions for a particular landing page URL<\/abbr> from a total of 1,000,000 sessions. This translates to 1% of all sessions for this particular landing page. With 10% sampling, Google Analytics may randomly select any 100,000 sessions from all the 1,000,000 sessions. Within the 100,000 selected sessions, only 8,000 sessions belong to this particular landing page and this is how Google Analytics reports sessions for the particular landing page.<\/p>\n <\/ins><\/p>\n To improve reporting accuracy, it means it’s to increase the amount of data (or the amount of non sampled data). But how do you increase the amount of data in a sampled Google Analytics report?<\/p>\n In the Google Analytics reports, you have the option to either increase the sample size for improved accuracy, or decrease the sample size for improved report processing speed. All you have to do is to toggle a slider switch.<\/p>\n If you increased the sampled size, your report will be calculated from a larger sample size of sessions. For example:<\/p>\n \u201cThis report is based on 200,000 sessions (20.00% of sessions).\u201d<\/p>\n With the free version of Google Analytics, sampled data cannot be fully avoided, but can be only minimized.<\/p>\n By toggling the slider switch to increase the sampled data size, your reports may improve accuracy. However, Google Analytics has placed limitation on how much sampled data you will get in your reports. i.e. You cannot fully get rid of sampled data in your reports.<\/p>\n <\/ins><\/p>\n You want to limit the amount of sampled data showing in your Google Analytics reports.<\/p>\n One of the ways is to reduce the number of unique URLs by:<\/p>\n Consider the two URLs below.<\/p>\n m.example.com\/Hotel\/List\/Shanghai-Hotels\/ One of the URLs has some capital letters, but the other URL<\/abbr> has letters all in lowercase. The web browser returns identical page whether you enter the first URL<\/abbr> or the second URL<\/abbr>.<\/p>\n However, Google Analytics thinks they are two separate URLs. In Google Analytics, they will show up as two separate rows in reports.<\/p>\n This takes up an additional row in your reports which is really unnecessary. When you have many unnecessary rows in your report, your Google Analytics property will quickly reach the daily upper limit of 50,000 unique rows of pre-aggregated data. Therefore, sampled data will quickly come up in your reports.<\/p>\n You can make them becoming one single row in Google Analytics reports by using filters.<\/p>\n The filter converts any capital letters in all URLs into lowercase.<\/p>\n Going forward in your Google Analytics reports, you will only see one version of the above URL<\/abbr> and it will all be in lowercase.<\/p>\n Consider a hotel booking website which may have a page with URL<\/abbr> in which it displays a list of 15 hotels in Shanghai.<\/p>\n Typically for a hotel booking business, you display the hotels that are available for a particular check-in-date and a particular check-out-date. To achieve this, many hotel booking websites would have appended parameters and values to the URLs.<\/p>\n The two pages with different check-in and check-out date ranges may have slightly different hotels on each of them, but they are essentially the same page. Having dates as values in your URLs, you can easily end up with an indefinite number of URLs. In many cases, it makes it easier to consider all three URLs as the same page and have them reported as one single URL<\/abbr>:<\/p>\n Under your Google Analytics property, go to:<\/p>\n In the above case, merging the URLs will reduce the number of unique URLs that are going to appear in your Google Analytics reports. This will result in reducing the data size and give your data more rooms before running into the data sampling issue.<\/p>\n Consider this case if your website is using multiple URLs for your home page.<\/p>\n Practically, you should not be using multiple URLs for a single home page.<\/p>\n To resolve this, under your Google Analytics property, go to:<\/p>\n Going forward in your Google Analytics reports, you will end up seeing only one version of URL<\/abbr>:<\/p>\n Another case with a hotel booking website involves using multiple URL<\/abbr> versions of the same page. Consider the list page with 15 hotels in Shanghai. They are two typical ways to represent the same page.<\/p>\n In your Google Analytics reports, you only need one of them to appear, and the better option is the first URL<\/abbr> (i.e. the Static URL<\/abbr>).<\/p>\n Under your Google Analytics property, go to:<\/p>\n Going forward in your Google Analytics reports, you will end up seeing only one version of URL<\/abbr>:<\/p>\n If data sampling is a long term problem for your website’s data collection, data reporting and data analysis, then consider upgrading to the paid version Google Analytics Premium.<\/p>\n If you make the poor choices to consolidate URLs that should not have been consolidated, then you are going to lose data granularity. Examples of bad choices:<\/p>\n All the URLs represent different locations, and it makes no sense to consolidate them into one single URL<\/abbr>.<\/p>\n <\/ins><\/p>\n <\/p>\n Content on Gordon Choi’s Analytics Book is licensed under the CC Attribution-Noncommercial 4.0 International license.<\/em><\/p>\n Gordon Choi’s Other Books: <\/p>\n <\/p>\n [elementor-template id=”764″]\t\t<\/p>\n Copyright 2016-2021 www.AnalyticsBook.org<\/p>\n","protected":false},"excerpt":{"rendered":" How To Increase The Amount Of Data In A Sampled Google Analytics Report? Gordon Choi Author Chapter 14 – Google Analytics Sampled Data How does data sampling happen? How can data sampling cause issues? How can data accuracy be improved on sampled data? Is the data sampling issue finally resolved? What can be done to […]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"elementor_canvas","meta":{"footnotes":""},"yoast_head":"\nHow can data accuracy be improved on sampled data?<\/h2>\n
Is the data sampling issue finally resolved?<\/h2>\n
What can be done to reduced sampled data?<\/h2>\n
\n
Consolidate URLs by converting them into all lowercase<\/h3>\n
m.example.com\/hotel\/list\/shanghai-hotels\/<\/p>\nView -> Filters -> Add Filter -> Create New Filter<\/pre>\n
\n
m.example.com\/hotel\/list\/shanghai-hotels\/<\/pre>\n
Consolidate URLs that actually are very similar pages<\/h3>\n
m.example.com\/hotel\/list\/shanghai<\/pre>\n
m.example.com\/hotel\/list\/shanghai?check-in-date=2015-11-01&check-out-date=2015-11-03\nm.example.com\/hotel\/list\/shanghai?check-in-date=2015-11-05&check-out-date=2015-11-06<\/pre>\n
m.example.com\/hotel\/list\/shanghai<\/pre>\n
View -> View Settings<\/pre>\n
\n
Consolidate URLs that actually mean the same page<\/h3>\n
m.example.com\/\nm.example.com\/index.aspx\nm.example.com\/default.html<\/pre>\n
\n
View -> Filters -> Add Filter -> Create New Filter<\/pre>\n
\n
m.example.com\/<\/pre>\n
Static URL: http:\/\/m.example.com\/hotel\/list\/shanghai\nDynamic URL: http:\/\/m.example.com\/hotel\/list?city=shanghai<\/pre>\n
View -> Filters -> Add Filter -> Create New Filter<\/pre>\n
\n
m.example.com\/hotel\/list\/shanghai<\/pre>\n
Bad URL consolidation examples<\/h3>\n
m.example.com\/hotel\/list\/shanghai\nm.example.com\/hotel\/list\/shanghai?district=xuhui&brand=hanting\nm.example.com\/hotel\/list\/shanghai?district=changning&brand=jinjiang\nm.example.com\/hotel\/list\/shanghai?district=baoshan&brand=hanting<\/pre>\n
\nPrevious Chapters<\/h4>\n
\n
Next Chapters<\/h4>\n
\n
\n
\n
The China Mobile SEO Book<\/a>
Mobile Website Book<\/a>
<\/em><\/p>\n