Snowplow R100 Epidaurus released with PII pseudonymization support

knservis · March 13, 2018, 3:52pm

@jrpeck1989 As of r100 the value is just hashed. It is not randomised, meaning it is not substituted with a random value. Each value is then replaced with it’s hash. The original value is not kept in the enrichment, but could possibly be retrieved from raw logs if those logs are not discarded.

In a later release, there will be the option (which will need to be enabled) to keep the mapping of the original value to its hash, but that would be kept separate from the rest of the data as good practice would advise that this information which constitutes PII of the data subject, should only be used with due justification and when consent is given by the data subject. That feature will be in an upcoming release.

Additionally, in a later release we will add the capability to easily scrub data from preexisting data on S3 (Removing PII form Redshift can currently be done as shown in this tutorial: GDPR: Deleting customer data from Redshift [tutorial])

Topic		Replies	Views
Snowplow R100 Epidaurus released with PII pseudonymization support – Snowplow GDPR	0	1242	March 9, 2018
Controlling the order enrichments are run Enrichment	6	2723	September 25, 2017
Snowplow Mini 0.5.0 released New releases	1	969	June 1, 2018
GDPR - PII configuration in the batch pipeline Enrichment	3	1240	January 31, 2019
Snowplow R99 released with support for Google Analytics New releases	3	2353	February 28, 2018

Snowplow R100 Epidaurus released with PII pseudonymization support

Related topics