Troubleshooting slow Vertica UI ingestion on Kubernetes with DataHub 0.12.0

user-1 · March 4, 2024, 3:57pm

Hi, I have a problem with the vertica UI ingestion running on kubernetes deployment with datahub version 0.12.0. It seems to be quite slow and stuck on a query now. The last log message seems to be this

SELECT column_name, data_type, column_default, is_nullable
FROM v_catalog.columns
WHERE lower(table_name) = 'removed'
AND lower(table_schema) = 'schema'
UNION ALL
SELECT column_name, data_type, '' as column_default, true as is_nullable
FROM v_catalog.view_columns
WHERE lower(table_name) = 'removed'
AND lower(table_schema) = 'schema'
UNION ALL
SELECT projection_column_name,data_type,'' as column_default, true as is_nullable
FROM PROJECTION_COLUMNS
WHERE lower(projection_name) = 'removed'
AND lower(table_schema) = 'schema'

2024-01-12 00:20:05,494 INFO sqlalchemy.engine.Engine [dialect vertica+vertica_python does not support caching 0.00027s] {}
[2024-01-12 00:20:05,494] INFO     {sqlalchemy.engine.Engine:1868} - [dialect vertica+vertica_python does not support caching 0.00027s] {}```
I had this problem with vertica before where it failed at a random table so I added max_threads=1 as apparently that would help but I still get the same error. The ingestion is also still apparently running according to the UI and last time I cancelled the ingestion I had to delete the MySQL PVC and make a new one (Then I added max_threads=1). I can't see any obvious errors in the gms or actions logs either.

datahub_team · March 4, 2024, 3:57pm

Hey there! Make sure your message includes the following information if relevant, so we can help more effectively!

Are you using UI or CLI for ingestion?
Which DataHub version are you using? (e.g. 0.12.0)
What data source(s) are you integrating with DataHub? (e.g. BigQuery)

Topic		Replies	Views
Troubleshooting Vertica Data Ingestion Error into DataHub getting-started	1	44	March 4, 2024
Issues with Ingestion on Datahub 0.10.4 and Strange UI Behavior ui	2	78	March 4, 2024
Troubleshooting Oracle Ingestion with DataHub Data Dictionary Mode ingestion	11	37	July 29, 2024
Datahub Ingestion Issues with Impala Hive Connector and SQLAlchemy Recipes ingestion	12	82	March 4, 2024
Schema Issue Causing Airbyte Pipeline Crashes ingestion	9	52	May 20, 2024

Troubleshooting slow Vertica UI ingestion on Kubernetes with DataHub 0.12.0

Related topics