Hi,
when trying to model my time spent, I realized there are sometimes page_pings missing. Below you see an example of one page id:
pageview_id | derived_tstamp | event |
---|---|---|
000c6430-229c-4ebd-8eea-d5cd446ce4fa | 2019-09-04 05:43:46.593 UTC | page_ping |
000c6430-229c-4ebd-8eea-d5cd446ce4fa | 2019-09-04 05:43:36.588 UTC | page_ping |
Missing row | — | — |
000c6430-229c-4ebd-8eea-d5cd446ce4fa | 2019-09-04 05:43:16.592 UTC | page_ping |
000c6430-229c-4ebd-8eea-d5cd446ce4fa | 2019-09-04 05:43:06.601 UTC | page_ping |
000c6430-229c-4ebd-8eea-d5cd446ce4fa | 2019-09-04 05:42:56.592 UTC | page_ping |
000c6430-229c-4ebd-8eea-d5cd446ce4fa | 2019-09-04 05:42:46.601 UTC | page_ping |
000c6430-229c-4ebd-8eea-d5cd446ce4fa | 2019-09-04 05:42:16.900 UTC | page_view |
Example query:
SELECT
*
FROM (
SELECT
web_page.id AS pageview_id,
page_urlpath,
event
FROM
`xxx.pageviews`,
UNNEST(contexts_com_snowplowanalytics_snowplow_web_page_1_0_0) AS web_page
WHERE
DATE(derived_tstamp) = "2019-09-04"
ORDER BY
pageview_id DESC
LIMIT
30) AS t
WHERE
pageview_id = "000c6430-229c-4ebd-8eea-d5cd446ce4fa"
My page activity tracking is set to 30 seconds / 10 seconds.
Do you have any clue, what’s happening here? Is there a technical reason?
Cheers
Andreas