We’ve had Snowplow set up for a while collecting events; it was set up following the AWS open source tutorial. Recently it started writing a huge amount of data into the database (RDS PostgreSQL). We increased the storage capacity and it used that up as well. Where is the best place to figure out what is happening? You can see below that this started around March 14th.
Hi @Ryan_Jansen, have you started doing any data modeling of the data in Postgres, or have you started tracking substantially more data into the pipeline? Did you introduce new tracking or make any other changes around that date that could be contributing?
I would start by tracking down which schemas / tables in the database are consuming the bulk of the space in your RDS and working out what is filling it up - if you are consuming this much disk space, either your traffic volume has increased substantially or you might have some large output tables from data modeling processes.
Once you know exactly what is filling up the RDS you can start to figure out what the issue might be and go from there.
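For finding the biggest consumers, a query along these lines (standard Postgres catalog functions, nothing Snowplow-specific) will list the largest relations:

```sql
-- Largest relations by total on-disk size (heap + indexes + TOAST).
SELECT n.nspname AS schema_name,
       c.relname AS table_name,
       pg_size_pretty(pg_total_relation_size(c.oid)) AS total_size
FROM pg_class c
JOIN pg_namespace n ON n.oid = c.relnamespace
WHERE c.relkind = 'r'
  AND n.nspname NOT IN ('pg_catalog', 'information_schema')
ORDER BY pg_total_relation_size(c.oid) DESC
LIMIT 20;
```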
Thanks for getting back @josh! I was able to find the issue. We had logical replication enabled, and the WAL was filling up because the intermediate SSH tunnel had changed its internal IP and the replication consumer could not connect to the database.
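For anyone hitting the same thing: when a logical replication consumer goes away, its replication slot keeps pinning WAL indefinitely. A quick check (assuming Postgres 10+, using the standard pg_replication_slots view) is something like:

```sql
-- Show each replication slot and how much WAL it is forcing the server to retain.
-- An inactive slot with a large retained_wal value is the usual culprit.
SELECT slot_name,
       active,
       pg_size_pretty(
         pg_wal_lsn_diff(pg_current_wal_lsn(), restart_lsn)
       ) AS retained_wal
FROM pg_replication_slots;
```

If a slot is truly abandoned, dropping it with pg_drop_replication_slot('slot_name') lets Postgres recycle the retained WAL.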
I appreciate you helping me debug!
Now I need to look into how to backfill missing data.