Automating the Mapping of Business Glossary Terms to DataHub Entities through CLI and Python Code

Original Slack Thread

Hi Team,I have ingested business glossary .yml file through CLI .How can I automate the process of mapping those terms and termgroups to the DataHub Entities ?

Hey there! :wave: Make sure your message includes the following information if relevant, so we can help more effectively!

  1. Are you using UI or CLI for ingestion?
  2. Which DataHub version are you using? (e.g. 0.12.0)
  3. What data source(s) are you integrating with DataHub? (e.g. BigQuery)

That’s fully documented in DataHub’s documentation. That’s how I successfully did it.

Thank you Ignacio.Did you try with python code?Can you please share the link wit me.

<@U06SC4W3FAR>, can you share the documentation URL? I have tried with transformers, but it is not adding terms to any of my datasets. For me Graphql script is working though.

<@U06JS7VND0C>,Can you please share documentation for Graphql script.I didn’t get any clear understanding from DataHUb documentation.I am referring this linkhttps://datahubproject.io/docs/graphql/mutations/#addterm

Reference URL: https://datahubproject.io/docs/next/api/tutorials/terms#remove-terms.
Replace batchRemoveTerms with batchAddTerms I prefer to do it as batch (multiple entities). You can do same with addTerms for single entity. Hope this helps.