how to delete wrong dataset ingestion
Hey there! 
<@U06TM7M40RK> Can you help with the questions in this thread?
Please continue to tag RunLLM (e.g <@U06TM7M40RK>) to have it continue generating responses!
To delete a wrongly ingested dataset in DataHub, you can use the DataHub CLI to perform either a soft delete or a hard delete. Here are the steps you can follow:
-
Soft Delete: This will mark the dataset as removed, but it will still exist in the system and can be restored if needed.
datahub delete --urn "urn:li:dataset:(urn:li:dataPlatform:your_platform,your_dataset,PROD)"
-
Hard Delete: This will permanently remove the dataset from the system.
datahub delete --urn "urn:li:dataset:(urn:li:dataPlatform:your_platform,your_dataset,PROD)" --hard
Make sure to replace your_platform
and your_dataset
with the appropriate values for your dataset. It’s important to surround the URN with quotes to avoid any terminal misinterpretation.
According to the <https://datahubproject.io/docs/0.14.0/how/delete-metadata#delete-cli-examples|DataHub documentation>, you can also use the --dry-run
option to see what would be deleted without actually performing the deletion. This can be useful to ensure that you are targeting the correct dataset.
Would you like more detailed steps or further assistance on this?
Sources:
- <https://datahubproject.io/docs/0.14.0/how/delete-metadata#delete-cli-examples|DataHub Delete CLI Examples>
0 button
0 button
Hint: Mention @runllm in the thread for followups.
Hi, how did my answer work out for you? Need any more assistance? You can also tag <@U07FV3UHE4D> to let your fellow community members know you’re still looking for guidance.
Hi ! Did this interaction resolve your issue or do you still require assistance ?
Please tag @ RunLLM if you still need further assistance !
Or tag @Datahub Community Support if you require a person to take a look
Hope this helps ! ><