Hi everyone. I am using Datahub version 0.11.0. I got the initial datahub ingest to work. I am specifically ingesting AWS Athena tables.
However, I noticed that subsequent ingests of the metadata of the same tables do not update the “env”. By default all the “env” values are “PROD”. I attempted to put in an “env” value of “TEST” based on certain Athena schema names, and I saw that the table ingested. But the URN still has “PROD” in it.
My goal is to be able to separate out different environments such as “PROD”, “TEST”, etc
re-ingesting “env”: “TEST” won’t overwrite alreday ingested metadata with PROD. Ingest evn ‘TEST’ should have created new set entities with ‘TEST’ in URNs. You may have to cleanup ‘PROD’ entities with by delete them https://datahubproject.io/docs/how/delete-metadata/
<@U0445MUD81W> if I want to re-ingest everything for a platform (a database), can I delete the entire platform in datahub and re-ingest metadata, and the metadata will show up?
or
just try to hard delete few URNs with PROD and re-ingest with "env": "TEST”, see deleted entries will show up datahub delete --urn "<urn>" --hard