Discussion on Handling 'stl_insert' Permission Denied Error in Redshift Serverless

user-3 · March 4, 2024, 4:46pm

Hi everyone! :hihi: I tried to find some solution in this channel, however it seems that on a newer version redshift-legacy module has been deprecated… Any way to workaround the permission denied error regarding stl_insert for Redshift Serverless please?

datahub_team · March 4, 2024, 4:46pm

Hey there! Make sure your message includes the following information if relevant, so we can help more effectively!

Which DataHub version are you using? (e.g. 0.12.0)
Please post any relevant error logs on the thread!

user-3 · March 4, 2024, 4:46pm

I am creating a POC right now, and I am using version 0.12.0.1 where in redshift-legacy is still available… but we would like to use a newer version…

user-1 · March 4, 2024, 4:46pm

I think I’m facing same issue right now.
Some of our data source owners migrated from provisioned Redshift to Redshift Serverless and we are having issues with the redshift connector complaining about stl_insert.
AFAIK the problem is not about permissions but Redshift Serverless missing or having a different name for some of the tables being queried by the crawler (deprecated and non deprecated one), such as stl_insert, among others.
<@UV14447EU> do you know if there is any ongoing plan to support Redshift Serverless? (asking you as one of the contributors to the redshift connector)

user-2 · March 4, 2024, 4:46pm

I need to dig a bit deeper in this to understand how we can get audit history from Redshift spectrum. Unfortunately some of the information schema tables which we used don’t work with Serverless

user-2 · March 4, 2024, 4:46pm

Definitely we would like to support Serverless

user-2 · March 4, 2024, 4:46pm

It seems we should use SYS tables which can work on both. The only issue I know with sys tables is you can’t get the original query from it because they remove line breaks and a line break can confuse our query parser.
I hope they will fix it as they state that they keep the line breaks but we saw it differently

user-2 · March 4, 2024, 4:46pm

I will create a ticket about this

user-1 · March 4, 2024, 4:46pm

https://docs.aws.amazon.com/redshift/latest/dg/sys_view_migration.html

When you migrate your Amazon Redshift provisioned cluster to Amazon Redshift Serverless, your monitoring or diagnostic queries might reference system views that are only available on provisioned clusters. You can update your queries to use the SYS monitoring views.
yes, that’s Redshift suggestion also

Please, could you share ticket here so we can keep an eye on it?

user-3 · March 4, 2024, 4:46pm

Thank you for the feedback! It would be easier for us to use the latest redshift connecter than using the legacy module.

user-1 · March 4, 2024, 4:46pm

Hey <@UV14447EU>, quick question - is adding Redshift Serverless support something your team is working on already? If not, my team could contribute with it, since we really need that capability soon. Just wanna check so we don’t step on each other’s toes!

user-2 · March 4, 2024, 4:46pm

Not working on it right now.

user-2 · March 4, 2024, 4:46pm

If you can work on it I’m happy to help and it can speed up to release it

user-1 · March 4, 2024, 4:46pm

That’s great! We’ll let you know once we start working on it! Thanks!

user-2 · March 4, 2024, 4:46pm

Sure, I’m happy to meet with you if you need help. A month ago I was working on another Redshift feature (improving the column/table level lineage), and there I was about to migrate to the new views, but then I gave up because:

You can’t join an old information schema table with a new one with querying because even though the name is the same, the id is different in the new and the old one.
The main issue was not being able to get the original SQL query with line breaks with the new information schema table -> even though STL_QUERYTEXT states that it stores it, in my experience, it didn’t contain it.

user-1 · March 4, 2024, 4:46pm

So new views does not fully replace old system tables in the case of provisioned Redshift? both schemas need to coexist?

user-2 · March 4, 2024, 4:46pm

yes, that was my main problem

user-2 · March 4, 2024, 4:46pm

And of course, they doesn’t replace exactly the old tables, but that can be fixed by tweaking the queries

user-1 · March 4, 2024, 4:46pm

that means we need to keep sort of two code paths (and/or full of “ifs”): one for provisioned one for serverless
what about having a separate connector? would that make future maintenance easier?

user-2 · March 4, 2024, 4:46pm

What I would do is to double-check if STL_QUERYTEXT can return the original query with linebreaks and if we can get that then we can use the new views

Topic		Replies	Views
Title: "Permission Denied for Relation stl_insert while Ingesting Metadata from Redshift" ingestion	16	49	March 4, 2024
Support for Column-Level Lineage in DataHub Redshift Ingestor for Serverless Clusters ingestion	4	9	March 31, 2025
Unsuccessful Lineage Parsing of Redshift CREATE TABLE Statements in Datahub 0.12.1.5 with Mixed Approach and `redshift` Connector getting-started	3	72	March 4, 2024
Troubleshooting Redshift Ingestion Stuck at "Pending" Status troubleshoot	1	55	April 1, 2024
Troubleshooting Redshift Ingestion Error: Invalid Expression / Unexpected Token ingestion	7	29	August 12, 2024

Discussion on Handling 'stl_insert' Permission Denied Error in Redshift Serverless

Related topics