Hello Team, I have this reciepe to ingest data from postgres which ingest fine but when I the datahub exists --urn “urn:li:dataset:(urn:li:dataPlatform:postgres,cdr.public.customer, PROD)”
it return false. Any thought on why that is so.
Hey there! Make sure your message includes the following information if relevant, so we can help more effectively!
Are you using UI or CLI for ingestion?
Which DataHub version are you using? (e.g. 0.12.0)
What data source(s) are you integrating with DataHub? (e.g. BigQuery)
<@U06TM7M40RK> Can you help with the questions in this thread?
We’re experimenting with a new AI assistant for questions in the <#CUMUWQU66> channel to help you track down helpful resources more quickly. Keep in mind that responses may not always be spot-on; feel free to ask it follow-up questions and don’t forget to provide its responses a or so other folks in the Community know if you’re still looking for help on the topic!
To help you more effectively, could you please provide the following information?
Are you using the UI or CLI for ingestion?
Which DataHub version are you using? (e.g., 0.12.0)
What data source(s) are you integrating with DataHub? (e.g., BigQuery)
Based on the information provided, it seems like you are using the CLI for ingestion and integrating with a PostgreSQL data source. The issue you are facing might be related to the URN format or the ingestion process itself.
Possible Issues and Solutions
URN Format: Ensure that the URN format is correct. The URN should follow the pattern urn:li:dataset:(urn:li:dataPlatform:<platform>,<name>,<env>). In your case, it should be:
Check Entity Existence: You can use the datahub exists command to check if the entity exists in DataHub. Ensure that you are using the correct URN format:
GraphQL Query: If the CLI command does not work, you can use a GraphQL query to check if the dataset exists. Here is an example of how to do it using Python: