Hi
I am deploying my DataHub test environment and I have Clickhouse clusters, kafka topics and developers’ apps that put data into kafka.
It is not a problem to ingest metadata from Clickhouse or from Kafka, but I wonder if I can create simple custom source that would describe some app? May be there is a way to create some container with metadata written and updated manually?
Hey there! Make sure your message includes the following information if relevant, so we can help more effectively!
Are you using UI or CLI for ingestion?
Which DataHub version are you using? (e.g. 0.12.0)
What data source(s) are you integrating with DataHub? (e.g. BigQuery)
<@U06TM7M40RK> Can you help with the questions in this thread?
We’re experimenting with a new AI assistant for questions in the <#CUMUWQU66> channel to help you track down helpful resources more quickly. Keep in mind that responses may not always be spot-on; feel free to ask it follow-up questions and don’t forget to provide its responses a or so other folks in the Community know if you’re still looking for help on the topic!
To create a custom source for describing an app and manually updating its metadata, you can indeed use DataHub’s extensible ingestion framework. Here’s a step-by-step guide to help you achieve this:
Steps to Create a Custom Source
Build a Custom Source:
You need to create a custom ingestion source. This involves writing a Python class that defines how to fetch and process metadata for your app.