I have followed the thread here about splitting out similar URLs, and follwed the example given but I have a bit of a problem.
Some of the URLs in my analytics page are being sorted correctly, then some of them don't seem to get sorted at all. In the 2nd image you can see the problem I'm having. The 4th and 5th links on that image, are meant to be grouped under the 1st link of /blog/
I've included a screenshot of my filter as well, as I might be missing something in the filter, to group all the /blog/ links together.
The way the links are in the website is like this:
great-blogs.co.uk/blog/123456789012345
Any idea on how to go about combining these rouge links into the main /blog/ link would be great.
(.*)/\d+(.*)
That works perfectly with the problem I had with the IDs, buried in the middle of the urls.
If you want the pages grouped only into /blog/, /blog/edit, or /blog/new then you would need field A of your filter to be
(/blog/)[0-9]+$|/blog/.*/(edit|new)
This assumes that the string that could show up at the end is numbers only, eg. /blog/1478744474. You could also modify this so that if you know it's just going to be numbers before /edit or /new, then the regex could be
(/blog/)[0-9]+$|/blog/[0-9]*/(edit|new)
The output field would be $A1$A2. Notice how I included the slashes in the field A groupings to make the output look a little cleaner.
Test this though, because I'm on a mobile right now and can't access a good online regex tester!!!
EDIT:
Two changes: 1) note how the * is now a + so that the number is actually required (eg. /blog/1234). 2) I had to modify the regex because I finally got to testing it and found that the brackets might impact the matching, and also because the advanced filters use the same brackets to piece together the output, I had to redo the regex and break it up into two matching groups (that probably doesn't make sense but I hope it works!).
Related
I've been reading a lot about grouping page on Google Analytics but haven't figured out a clear answer to a problem.
My issue is this one :
Same pages but one with / and other without /
Basically, when I read my analytics I have two different entry for the same page, because some external links send people to one entry without the trailing slash (lets call it Page1), on others send people to url with the trailing slash (Page2).
It's a bit anoying when reading the stats, because you have to add up these two pages to have a clear view about what's going on.
I tried one option: add filters that remove the trailing slash. With this, I was able to get all the statistics on Page 1. It was a simple filter (Search and replace filter) that was grouping the two pages.
However, looking back at this option, It created another problem: this filter is not retroactive, which means when I will look at Page1, I will have stats from the day I applied this filter, whereas Page2 will score 0 from this exact same date. A small picture to make that clear:
Statistics on Page2 with Slash
Statistics on Page1 filter
Clearly here there is a discontinuity on my stats. To check long term datas I have to select another page, and to check new data I have to check the page without the trailing slash.
I removed this filter because it's very difficult to read data right now, and I'm looking for a solution to groupe these two pages so my data will be readable...
Thank you very much for you help,
Michael
Edit: I'm on Wordpress, maybe a way there?
There is nothing you can do within Google Analytics. I suggest you create your reports in Google Data Studio, which is free, and which allows you to aggregate Urls by using regular expressions to find matching parts (example e.g. in this question).
What does the following patterm means in the URL, these pattern seen on variety of url like Linkedin
Enpv_301654428_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1
is this sparse Matrix representation ?
For example
https://www.linkedin.com/groups?gid=5010991&goback=%2Enpv_301654428_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1_*1&trk=prof-groups-membership-name
I know it's the value passed to the parameter goback, but what's the point to use this kind of strange looking values?
You may want to check this
this link!
It's basically a method to allow the system remember steps to go forward\backward in the navigation.
EDIT:
When you perform a search on LinkedIn, there are a number of facets on the left hand side of the search results page that you can use to refine your search, like company and location. When you click on a search result, it turns all of those facets into the *1_s and *2_s that you see in the URL. Those are used to construct the "Go back to Search Results" link, which will restore the facets you had selected.
This stackoverflow previous question may also help you.
For about a week, Google Analytics was erroneously reporting page views for a few request URIs, severely skewing my data. I have read that there is no way to remove data once it is reported. If this is the case, is there a way to simply hide this data from the view?
I have tried a number of things (such as creating global filters, view filters, etc.) to no avail. Using segments also doesn't work, because apparently you can only filter out visits/users (whereas my goal is to filter out page views associated with a specific page). At this point, I feel like I must be going about it the totally wrong way...
Below is a screenshot of the Behavior > Overview section. The page views I want to move are #1, #2, and #5.
Alex, unfortunately, there is nothing you can do about the historical data.
However, you can use simple filter to exclude pages you don't want to see (the filter field above the report table, not filters related to account/profiles) -- see the attached screen below.
Make sure you select exclude and then pick Page dimension. The easiest way would be to use regular expressions, like:
(a|b|c)
This one would remove any pages that contain either "a", or "b" or "c".
The expression would be probably a bit more complicated in your case and I suggest using tools like RegEx Hero (free, online). I am not sure if there is anything common for the pages you would like to remove from the reports, but regular expression can do quite a lot :).
One last thing -- be aware there is a slight difference in segments and (table) filters. If you use segments for page dimension, you would end up with ALL the pages that were seen during a visit, which includes the page you set in the segment. Might be a bit confusing, but see this article for detailed explanation.
I'm struggling with GA and how to combine two mediums.
More specifically, I have traffic from two versions of the same newsletter appearing in GA:
10122013_This-Is-My-Newsletter-Title. / newsletter_ubivox
10122013_This-Is-My-Newsletter-Title. / (newsletter_ubivox)
Obviously, this is not desirable. How can i setup a filter (maybe with regex) to combine these two versions into one?
Hopefully, this makes sense!
Past data is locked down so you can't touch that.
Instead of going into the regex world, could you not edit the tags so the campaign name and medium match? As soon as they match they will start to show as combined in GA.
Easier than filters and new code.
To further expand my answer, you can use an advanced Search and Replace filter to do what you want - https://support.google.com/analytics/answer/1034834?hl=en
I'd set up a test profile first though to make sure that this works as expected.
I have a url similar to this
/widgets/view/13031800
In the google analytics settings you can have it exclude url paramaters, but I can't figure out how to have it look at all
/widgets/view/ pages as one page. I don't need each one to be unique.
Any ideas?
Unfortunately there is not an easy answer to this. You can...
change what is passed in your _trackPageview calls on your page(s) to exclude the number
create a filter in the interface to strip off the number
Neither one of these solutions will retroactively apply (will not change data you already got).
Only way to see it for current data is to export your data to excel or whatever and do some manual work on it, like strip off the numbers and sum up the page views and remove duplicates sort of thing.