RDB Loader 4.2.2 released

We’re pleased to announce we’ve released RDB Loader version 4.2.2

This release brings stability improvements and better error messaging on loaders.

Making load_tstamp column optional

In previous versions, loaders were always adding a load_tstamp column if it wasn’t already in the events table. It wasn’t possible to disable this. However, this approach was causing some problems in certain scenarios. You can now disable this feature by adding this to your config file:

"featureFlags": {
  "addLoadTstampColumn": false
}

Removing default catalog name in Databricks Loader

In the previous versions, Databricks Loader was using hive_metastore as the default catalog name. However, it is reported that this way isn’t working with the Databricks clusters configured with AWS Glue. Therefore, no catalog name is prepended starting from this version. We would like to thank Marcin Juraszek for reporting the issue and creating the PR.

Upgrading to 4.2.2

If you are already using a recent version of RDB Loader (3.0.0 or higher) then upgrading to 4.2.2 is as simple as pulling the newest docker images. There are no changes needed to your configuration files.

docker pull snowplow/transformer-kinesis:4.2.2
docker pull snowplow/rdb-loader-redshift:4.2.2
docker pull snowplow/rdb-loader-snowflake:4.2.2
docker pull snowplow/rdb-loader-databricks:4.2.2

The Snowplow docs site has a full guide to running the RDB Loader.

3 Likes