<@U06TM7M40RK> I’m using the DataHub Redshift ingestor (version 0.14.0) for both provisioned and serverless clusters. Everything works fine for provisioned clusters, but lineage extraction, particularly column-level lineage, doesn’t seem to be working for serverless. Does DataHub currently support column-level lineage ingestion for Redshift Serverless?
DataHub does support Redshift Serverless, but there are some limitations regarding lineage extraction, particularly column-level lineage. According to the <https://datahubproject.io/docs/0.14.0/generated/ingestion/sources/redshift#config-details|DataHub Redshift documentation>, column-level lineage is supported but requires specific configurations and may not work seamlessly with Redshift Serverless due to the limitations of accessing certain system tables.
The <https://github.com/datahub-project/datahub/pull/9998|Redshift Serverless support> was introduced to accommodate the limitations of Amazon Redshift Serverless, such as restricted access to STL, STV, SVCS, and SVL system tables. This support includes a configuration flag, is_serverless, that adjusts the ingestion pipeline to work with these limitations.
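For reference, here is a minimal sketch of a Redshift recipe with is_serverless enabled, built as a Python dict and run through DataHub's programmatic Pipeline API. Only is_serverless comes from this thread. The other config keys (host_port, database, username, password, include_table_lineage), the environment variable names, the helper names, and the sink address are assumptions for illustration, so check them against the config details for your version.

```python
import os


def build_redshift_recipe(host_port: str, database: str, is_serverless: bool) -> dict:
    """Build an ingestion recipe dict for the Redshift source (illustrative helper)."""
    return {
        "source": {
            "type": "redshift",
            "config": {
                "host_port": host_port,
                "database": database,
                # Assumed env var names; supply credentials however you normally do.
                "username": os.environ.get("REDSHIFT_USER", ""),
                "password": os.environ.get("REDSHIFT_PASSWORD", ""),
                # Flag discussed in this thread: switches the source to
                # serverless-compatible system tables.
                "is_serverless": is_serverless,
                # Assumed key: table-level lineage toggle.
                "include_table_lineage": True,
            },
        },
        # Assumed sink: a DataHub instance reachable at localhost.
        "sink": {"type": "datahub-rest", "config": {"server": "http://localhost:8080"}},
    }


def run_recipe(recipe: dict) -> None:
    """Run the recipe; requires the acryl-datahub package with the redshift plugin."""
    from datahub.ingestion.run.pipeline import Pipeline

    pipeline = Pipeline.create(recipe)
    pipeline.run()
    pipeline.raise_from_status()
```

To try it, call run_recipe(build_redshift_recipe("<workgroup-endpoint>:5439", "dev", True)) with your own endpoint and database. Keeping the recipe builder separate from the runner makes it easy to diff the provisioned and serverless configs before ingesting.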
However, as per the <Slack discussion>, column-level lineage is not fully supported for Redshift Serverless, and only table-level lineage is currently available. This is due to the absence of certain information schema tables in Redshift Serverless that are necessary for extracting detailed lineage information.
Would you like more detailed information on how to configure the Redshift ingestor for serverless environments or any other specific aspect?
Sources:
- <https://datahubproject.io/docs/0.14.0/generated/ingestion/sources/redshift#config-details|DataHub Redshift documentation>
- <https://github.com/datahub-project/datahub/pull/9998|Redshift Serverless support>
- <Slack discussion>
Hint: Mention @runllm in the thread for followups.
Have you enabled serverless mode?
Hi Tamás. Yes, I did set is_serverless=true
Hi! Did this interaction resolve your issue, or do you still require assistance?
Please tag @ RunLLM if you still need further assistance!
Or tag @Datahub Community Support if you require a person to take a look.
Hope this helps!