Troubleshooting datahub errors and schema registry configuration in Kubernetes deployment using Helm charts

user-5 · March 4, 2024, 5:18pm

Hello,

I have been tasked to debug errors messages appearing on our datahub deployment on kubernetes using the helm chart provided by acryldata bumped chart version from 0.2.181 to 0.2.182
We are ingesting metadata from Databricks, Sagemaker, dbt. When displaying the logs of these 3 jobs, I get recurring similar errors being :
• datahub-gms return status 500 (see picture)
• Error registering Avro Schema
Even after trying the solution propsed in <Issues · acryldata/datahub-helm · GitHub issue> the problem keeps recurring.

Thank you for the help! attachment

datahub_team · March 4, 2024, 5:18pm

<@U04UKA5L5LK> might be able to speak to this!

user-1 · March 4, 2024, 5:18pm

Hi, what schema registry are you using?

user-5 · March 4, 2024, 5:18pm

I think it is the provided by the helm charts version 0.2.182

datahub_team · March 4, 2024, 5:18pm

There is a known issue with internal and glue schema registries in the last two versions of datahub, which could cause this issue, can you make sure you are using the confluent schema registry?

user-5 · March 4, 2024, 5:18pm

so I need to change the schema registry image in the values.yaml file? https://github.com/acryldata/datahub-helm/blob/master/charts/datahub/values.yaml

user-5 · March 4, 2024, 5:18pm

lines 487 to 498

user-1 · March 4, 2024, 5:18pm

Yes! That’s right. Sorry, I think we had a different chart version that changed the default back to what works, but it looks like it’s not this one.

user-5 · March 4, 2024, 5:18pm

so there need to be an update of the prerquisites helm charts?

user-1 · March 4, 2024, 5:18pm

Yeah, you may need to install the schema registry which will be used.

user-5 · March 4, 2024, 5:18pm

user-5 · March 4, 2024, 5:18pm

this is what I currently have in my values.yaml

user-1 · March 4, 2024, 5:18pm

This looks good to me

user-5 · March 4, 2024, 5:18pm

It is with this config that I get the error describes

user-1 · March 4, 2024, 5:18pm

Did you make the change in values.yaml as well?

user-5 · March 4, 2024, 5:18pm

My bad, I meant this is the current content of my values.yaml file

user-5 · March 4, 2024, 5:18pm

So it is currently failing with these values, so what do you suggest to replace them with?

datahub_team · March 4, 2024, 5:18pm

I see I see. Actually, if GMS is coming up, then I think you have that part configured correctly. <@U04N9PYJBEW> can you weigh in on the ingestion failures?

user-5 · March 4, 2024, 5:18pm

user-4 · March 4, 2024, 5:18pm

Are you running ingestion from the UI? And are you sure datahub-datahub-gms:8080 is the correct url from inside your cluster? You can try curl datahub-datahub-gms:8080/health to check

Topic		Replies	Views
Fixing Serialization Error in DataHub Helm Chart: Enabling Schema Registry & Upcoming Chart Version troubleshoot	9	60	March 4, 2024
Troubleshooting setup issues with datahub deployment on EKS cluster all-things-deployment	1	47	March 4, 2024
Do I Need the Kafka Schema Registry Component for DataHub Ingestion on Kubernetes with Strimzi Operator? all-things-deployment	15	58	March 4, 2024
Repopulating Kafka Data and Schema in DataHub After VPC Changes troubleshoot	3	54	March 4, 2024
Troubleshooting Datahub Deployment with Kubernetes in Azure AKS all-things-deployment	3	103	March 4, 2024

Troubleshooting datahub errors and schema registry configuration in Kubernetes deployment using Helm charts

Related topics