Use DataHub version 0.13.3 CLI for Snowflake, Oracle, and PostgreSQL. Specify the sink as a JSON file. Explain two methods for analyzing data lineage on the column level:
Hey there! Make sure your message includes the following information if relevant, so we can help more effectively!
Are you using UI or CLI for ingestion?
Which DataHub version are you using? (e.g. 0.12.0)
What data source(s) are you integrating with DataHub? (e.g. BigQuery)
<@U06TM7M40RK> Can you help with the questions in this thread?
We’re experimenting with a new AI assistant for questions in the <#CUMUWQU66> channel to help you track down helpful resources more quickly. Keep in mind that responses may not always be spot-on; feel free to ask it follow-up questions, and don’t forget to give its responses a :+1: or :-1: so other folks in the Community know if you’re still looking for help on the topic!
I’m not able to find any data sources to answer your question. Could you provide more details about the specific issue or question you have regarding DataHub? If this doesn’t help, please reach out to the DataHub support team for further assistance.
Great, thanks for the details! Given that you’re using DataHub version 0.13.3 and the CLI for ingestion with Snowflake, Oracle, and PostgreSQL, I can provide guidance on setting up ingestion and lineage for these data sources.
### Steps to Set Up Ingestion
1. **Install Required Packages**:
Ensure you have the necessary packages installed for Snowflake, Oracle, and PostgreSQL ingestion.
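As a rough sketch of this step, the snippet below notes the install command in a comment and generates one recipe per source, each pointing at a `file` sink so the output lands in a JSON file as requested. The connection values, output file names, and the helper names `build_recipe` and `write_recipes` are placeholders made up for illustration; the source config keys should be checked against the source docs for your version. The recipes are written as JSON inside `.yml` files on the assumption that the CLI's YAML loader accepts JSON syntax.

```python
import json
from pathlib import Path
from typing import Dict, List

# Install step (run in a shell, not in Python). The extras are the
# source plugin names shipped with the acryl-datahub package:
#   pip install 'acryl-datahub[snowflake,oracle,postgres]==0.13.3'

# Placeholder connection settings (hypothetical values). The ${...}
# references are meant to be expanded from environment variables
# when the CLI loads the recipe.
SOURCES: Dict[str, dict] = {
    "snowflake": {
        "account_id": "my_account",
        "username": "${SNOWFLAKE_USER}",
        "password": "${SNOWFLAKE_PASSWORD}",
    },
    "oracle": {
        "host_port": "localhost:1521",
        "service_name": "my_service",
        "username": "${ORACLE_USER}",
        "password": "${ORACLE_PASSWORD}",
    },
    "postgres": {
        "host_port": "localhost:5432",
        "database": "my_db",
        "username": "${POSTGRES_USER}",
        "password": "${POSTGRES_PASSWORD}",
    },
}


def build_recipe(source_type: str, source_config: dict, output_file: str) -> dict:
    """Return a recipe dict that sends one source's metadata to a JSON file."""
    return {
        "source": {"type": source_type, "config": source_config},
        "sink": {"type": "file", "config": {"filename": output_file}},
    }


def write_recipes(out_dir: str = "recipes") -> List[Path]:
    """Write one recipe file per source and return the paths created."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths = []
    for source_type, config in SOURCES.items():
        recipe = build_recipe(source_type, config, f"./{source_type}_metadata.json")
        path = out / f"{source_type}_to_file.yml"
        # JSON text in a .yml file: an assumption that the YAML loader reads it.
        path.write_text(json.dumps(recipe, indent=2))
        paths.append(path)
    return paths
```

After calling `write_recipes()`, each recipe can be run with `datahub ingest -c recipes/snowflake_to_file.yml` (and likewise for the other two files).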
### Visualizing Lineage in DataHub UI
After running the ingestion and lineage setup, you can visualize the lineage in the DataHub UI:
1. **Navigate to the Dataset Page**: Go to the dataset page in the DataHub UI.
2. **View Lineage Tab**: Click on the "Lineage" tab to see the upstream and downstream dependencies.
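Note that a `file` sink writes metadata to a local JSON file rather than to a DataHub server, so nothing shows up in the UI until that file is ingested into a running instance. In the meantime, column-level lineage can be checked directly in the file. The sketch below assumes that column-level lineage is recorded under a `fineGrainedLineages` key with `upstreams` and `downstreams` lists of field URNs; it searches the whole JSON tree instead of relying on an exact record layout. The function names are hypothetical.

```python
import json
from typing import Any, Dict, Iterator, List, Tuple


def iter_fine_grained(node: Any) -> Iterator[Dict[str, Any]]:
    """Yield every entry stored under a 'fineGrainedLineages' key, at any depth."""
    if isinstance(node, dict):
        for key, value in node.items():
            if key == "fineGrainedLineages" and isinstance(value, list):
                for entry in value:
                    if isinstance(entry, dict):
                        yield entry
            else:
                yield from iter_fine_grained(value)
    elif isinstance(node, list):
        for item in node:
            yield from iter_fine_grained(item)
    elif isinstance(node, str) and "fineGrainedLineages" in node:
        # Some records may carry the aspect as a JSON-encoded string.
        try:
            yield from iter_fine_grained(json.loads(node))
        except ValueError:
            pass


def column_edges(records: Any) -> List[Tuple[str, str]]:
    """Return (upstream_field, downstream_field) pairs found in the records."""
    edges = []
    for entry in iter_fine_grained(records):
        for downstream in entry.get("downstreams", []):
            for upstream in entry.get("upstreams", []):
                edges.append((upstream, downstream))
    return edges


def load_column_edges(path: str) -> List[Tuple[str, str]]:
    """Read a file-sink output file and extract its column-level edges."""
    with open(path, encoding="utf-8") as handle:
        return column_edges(json.load(handle))
```

For example, `load_column_edges("./snowflake_metadata.json")` returns a list of upstream/downstream field pairs, or an empty list if the run emitted no column-level lineage.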
Would you like more detailed instructions or examples for any specific part of this process?
### Sources:
- [Oracle Ingestion Configuration](<https://datahubproject.io/docs/0.13.1/generated/ingestion/sources/oracle#config-details>)
- [SchemaResolver Initialization](<https://github.com/datahub-project/datahub/blob/master/metadata-ingestion/src/datahub/sql_parsing/schema_resolver.py>)
- [SQL Parsing Example](<https://github.com/datahub-project/datahub/blob/master/metadata-ingestion/src/datahub/sql_parsing/sqlglot_lineage.py>)
- [File-based Lineage Documentation](<https://datahubproject.io/docs/generated/ingestion/sources/file-based-lineage/>)
- [DataHub CLI Documentation](<https://github.com/datahub-project/datahub/blob/master/docs/cli.md>)