Snowplow Normalize dbt package can't find users_table macro

Hello,

I am working through a new Snowplow setup and have used the snowplow-normalize dbt package to generate models for our self-describing events. I added the package to packages.yml, installed it with dbt deps, and successfully generated the models using the snowplow_normalize_model_gen.py script. However, whenever I try to run a dbt command I get the following error:

```
Compilation Error
  In dispatch: No macro named 'users_table' found within namespace: 'snowplow_normalize'
      Searched for: 'my_project.redshift__users_table', 'my_project.postgres__users_table', 'my_project.default__users_table', 'snowplow_normalize.redshift__users_table', 'snowplow_normalize.postgres__users_table', 'snowplow_normalize.default__users_table'
```
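For reference, my packages.yml entry looks roughly like this (the version range here is approximate, so it may differ from what you'd pin to):

```yaml
# packages.yml -- version range is approximate; check dbt Hub for the release you actually want
packages:
  - package: snowplow/snowplow_normalize
    version: [">=0.3.0", "<0.4.0"]
```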

I can see a users_table macro in the snowplow_normalize package's macros directory, and it is listed in the macros schema.yml. I didn't configure a users table in the normalize config when I generated the models.

I'm a new dbt user, so I might be missing something obvious. Any help is appreciated. Thanks!

OK, I should have read the docs more closely before getting too deep. snowplow-normalize doesn't support Redshift. :frowning:

That's correct @Alex_Kudlick. Because Redshift already has flat shredded tables, we haven't seen a need to add Redshift support to the normalize package.
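The compile error itself is a side effect of dbt's adapter dispatch. The package's macro is declared along these lines (a simplified sketch, not the exact package source):

```sql
{# Simplified sketch of an adapter-dispatched macro. dbt resolves the call to
   <adapter>__users_table, falling back to default__users_table if that doesn't exist. #}
{% macro users_table() %}
    {{ return(adapter.dispatch('users_table', 'snowplow_normalize')()) }}
{% endmacro %}

{# The package only ships implementations for the warehouses it supports, e.g. #}
{% macro snowflake__users_table() %}
    {# Snowflake-specific logic lives here #}
{% endmacro %}
```

Since the package ships neither a redshift__users_table nor a default__users_table, dispatch has nothing to resolve to on Redshift, which is exactly what the compilation error is telling you.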

What were you hoping to do with the package? You may be able to build your models on top of our base macros and still take advantage of our incremental approach and other features.
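As a rough illustration of the shape such a model could take over a Redshift shredded table, here is a plain dbt incremental sketch (deliberately not using our base macros, and with a hypothetical table and column name; the base macros add the manifest-driven incremental handling and other features on top of something like this):

```sql
-- models/my_custom_event_normalized.sql
-- Plain dbt incremental sketch over a hypothetical shredded table; not the Snowplow base macros.
{{ config(materialized='incremental', unique_key='root_id') }}

select
    root_id,
    root_tstamp,
    my_custom_property  -- hypothetical column from your self-describing event
from atomic.com_mycompany_my_event_1

{% if is_incremental() %}
  -- only pick up rows newer than what the model already contains (simple lookback)
  where root_tstamp > (select max(root_tstamp) from {{ this }})
{% endif %}
```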

Hi @Ryan, thanks for the response.

> Because Redshift already has flat shredded tables, we haven't seen a need to add Redshift support to the normalize package.

Is this part of a quickstart setup? What generates those shredded tables? I didn't exactly follow the quickstart; what I have is our collector + enricher writing to Kinesis, dumping to S3, then an incremental load into Redshift with Airflow. I was hoping to automatically splat out our custom events via Airflow jobs as well. Right now I am just writing SQL to pull out each event type, which is not a huge deal since we don't have that many custom event types.

The loader for Redshift (and Postgres) generates separate tables for the values in any self-describing event or entity, rather than loading them as columns in the main atomic events table the way the other warehouses do. Those shredded tables won't have the other atomic fields. You can read more in the docs here: https://docs.snowplow.io/docs/pipeline-components-and-applications/loaders-storage-targets/snowplow-rdb-loader/transforming-enriched-data/#shredded-data
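If you need the atomic fields alongside your event properties, the shredded tables carry root_id and root_tstamp so you can join back to atomic.events. Something along these lines (the shredded table and column names here are made up for illustration):

```sql
-- Hypothetical shredded table/column names; swap in your own event's table.
-- root_id / root_tstamp are the keys shredded tables carry for joining back to atomic.events.
select
    ev.event_id,
    ev.collector_tstamp,
    ev.domain_userid,
    sde.my_custom_property  -- value from the self-describing event
from atomic.events ev
join atomic.com_mycompany_my_event_1 sde
    on sde.root_id = ev.event_id
   and sde.root_tstamp = ev.collector_tstamp
```

Joining on both root_id and root_tstamp (rather than root_id alone) follows the pattern in the docs and helps Redshift make use of the sort keys.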