Associating Business Glossary Terms with Columns in DataHub through API and CLI

Original Slack Thread

<@U06TM7M40RK> How can I ingest/associate business glossary terms to the columns (metadata) I have ingested in datahub

To associate business glossary terms with columns in DataHub, you can use the DataHub API or the DataHub CLI. Here’s a summary of the process:

  1. Using GraphQL API: You can use the addTerm mutation to associate a glossary term with a specific column in a dataset. This involves specifying the URN of the dataset column and the URN of the glossary term.

  2. Using DataHub CLI: You can prepare a CSV file with the necessary glossary terms and their associations, and then use the DataHub CLI to ingest these associations. The CSV file should include columns for the resource (e.g., column URN), glossary terms, and other relevant metadata.

  3. Using YAML Configuration: If you prefer using YAML, you can define your glossary terms and their associations in a YAML file and ingest it using the DataHub CLI.

For detailed steps and examples, you can refer to the <Terms | DataHub API documentation> and the <https://datahubproject.io/docs/generated/ingestion/sources/csv/#module-csv-enricher|CSV Enricher Documentation>.

Would you like more detailed instructions or help with a specific part of the process?

Sources:

Hi <@U08AB3962ER>, how did my answer work out for you? Need any more assistance? You can also tag <@U07FV3UHE4D> to let your fellow community members know you’re still looking for guidance.

What would be the URN of the local file ingested into datahub to be used for addTerms using graphiql

Hi :slight_smile: ! Did this interaction resolve your issue or do you still require assistance ?

Please tag @ RunLLM if you still need further assistance !

Or tag @Datahub Community Support if you require a person to take a look :slight_smile:

Hope this helps ! ><