Resolving DataHub Tag Deletion Issue with GraphQL API and Metadata Sync

Original Slack Thread

<@U06TM7M40RK> <@U07FV3UHE4D> help me to fix the below issue in datahub. Go to any dataset, in the right side column, select “Tag” and apply to the dataset. Now click on the tag, a menu shall pop up. Click on the three dots on the top right corner. Select “Delete” option. This should ideally delete the tag from the dataset. When you open the same dataset again, the tag is still applied and is not deleted.

To address the issue of tags not being deleted from a dataset in DataHub, you can follow these steps:

  1. Remove the Tag Programmatically: Use the DataHub GraphQL API to remove the tag from the dataset. This can be done using a GraphQL mutation. Here’s an example:

    mutation removeTag {
      removeTag(
        input: {
          tagUrn: "urn:li:tag:your_tag",
          resourceUrn: "urn:li:dataset:(urn:li:dataPlatform:your_platform,your_dataset,PROD)"
        }
      )
    }
    

    You can execute this mutation using a GraphQL client or via a curl command:

    curl --location --request POST 'http://localhost:8080/api/graphql' \
    --header 'Authorization: Bearer &lt;your-access-token&gt;' \
    --header 'Content-Type: application/json' \
    --data-raw '{ "query": "mutation removeTag { removeTag(input: { tagUrn: \"urn:li:tag:your_tag\", resourceUrn: \"urn:li:dataset:(urn:li:dataPlatform:your_platform,your_dataset,PROD)\" }) }", "variables":{}}'
    
  2. Ensure Metadata Sync: After removing the tag programmatically, ensure that the metadata is fully synchronized and the UI is refreshed to reflect the changes. This may involve:

    • Checking Elasticsearch Indices: Ensure that the Elasticsearch indices are updated to reflect the deletion.
    • Restarting DataHub Services: Restart the DataHub services, especially the GMS (Graph Metadata Service), to clear any cached data.
    • Manual Reindexing: Perform a manual reindex to ensure that the search indices are in sync with the database.
  3. Verify the Deletion: After performing the above steps, verify that the tag has been removed by checking the dataset in the DataHub UI.

If you need more detailed instructions on any of these steps, please let me know.

Sources:

Hi <@U08EZCKTJN6>, how did my answer work out for you? Need any more assistance? You can also tag <@U07FV3UHE4D> to let your fellow community members know you’re still looking for guidance.