Hi @anton
Thank you very much for your answer!
I am aware of the connection of geo-attributes with the IP address and the reason why I use geo_country is because I found it more accurate comparing to user_ipaddress, which can jump even in the same country/city etc. The reason why I got confused with the data is because I found a large proportion of users, who seem to change countries and have page views in our platform in less than an hour. Since this percentage is around 30% in different samples and given that I believe our users are not that tech-savvy to use VPN technologies, I couldn’t explain it that way. However, your suggestion about the mobile users proved to be correct, since for those users this percentage decreased significantly.
About the bots, I excluded them from the beginning and concerning the anti-virus/adult-filter software I am not sure how can I check that. Maybe you can give me some hints? Also, there are user cases where the subnet is the same or not, and the same happens with the user agents.
Finally and most importantly, I think that the problem I have is a bit broader. Precisely, some of the issues are:
- the domain_sessionidx does not change every 30 mins, or two domain_sessionidx overlap at the same time
- there are sessions with page pings and no page views
- users cases where the the dvce screen width and screen height changes in the same session (and this is one of my doubts on whether the domain_userid is unique. How can the device change with the same cookie?)
An example is the user below:
[Moderator note: screenshot removed because it contained PII (IP addresses)]
This user had 167 page views in our platform yesterday in a time range of 21 hour and all of them in the same domain_sessionidx, in 4 different geo_counties, using 4 different user agents, 80 different ip addresses (some of them from the same subnet some not) and their device screen width and screen height changes over time. And it is not just an exception…
Overall, and excuse for going out of the main topic of this discussion (maybe I should post a new one), I would like to understand If this is something common generally, or maybe something went wrong from my side when setting up everything. Is there a way to check that? Some key point that I should look for? I am new to Snowplow, so what do you suggest in order to start unfolding the problems?
Thank you in advance and I am looking forward for your suggestions!