Troubleshooting YAML Business Glossary Ingestion Issue with No Assets Getting Ingested

Original Slack Thread

Hello, I am attempting business glossary ingestion using UI with a yaml and no assets are getting ingested despite the ingestion completing successfully as per the logs. Can anyone help troubleshoot? Attaching the logs -

I tried both UI and CLI. I just updated to the latest Chart datahub-0.4.4 and app version 0.13.0.attachment

Hey there! :wave: Make sure your message includes the following information if relevant, so we can help more effectively!

  1. Are you using UI or CLI for ingestion?
  2. Which DataHub version are you using? (e.g. 0.12.0)
  3. What data source(s) are you integrating with DataHub? (e.g. BigQuery)

I am having same issue. Please let me know if you find solution.

Hi <@U052SA31CTD> and <@U06LQC5DJMA>, Looks like I am also facing the same issue. But, I am not sure if I am missing something in this ingestion process. On my function, I have configured source, sink and transformer . I could see my term nodes and terms are getting created, but based on the transformers, I am expecting these terms to be added to my all snowflake dataset. But, the missing part is I don’t know where to pass dataset information. Here is my function for ingestion. Please share with me as well if you find any solution.

    """Create business glossary for snowflake entities """
    from datahub.configuration.config_loader import load_config_file
    from datahub.ingestion.run.pipeline import Pipeline
    <http://logger.info|logger.info>("Creating business glossary")
    pipeline = Pipeline.create(
        # This configuration is analogous to a recipe configuration.
        {
            "source": {
                "type": "datahub-business-glossary",
                "config": {
                    "file":    "business_glossary_recipe.yml",
                    "enable_auto_id" : True
                },
            },
            "sink": {
                "type": "datahub-rest",
                "config": {
                    "server": f"{cf.server}",
                    "token": f"{cf.token}"
                },
            },
            "transformers": [
                {
                "type": "simple_add_dataset_terms",
                "config": {
                    "semantics": "PATCH",
                    "term_urns": ["urn:li:glossaryTerm:address"],
                    },
                }
            ],
        }
    )
    pipeline.run()
    pipeline.pretty_print_summary()
    pipeline.raise_from_status()```