Using SQLAlchemy with DataHub for Metadata Ingestion

Original Slack Thread

how to use SQLAIchemy ?where i can see it?

Hey there! :wave:

<@U06TM7M40RK> Can you help with the questions in this thread?

:information_source: Please continue to tag RunLLM (e.g <@U06TM7M40RK>) to have it continue generating responses!

To use SQLAlchemy with DataHub for metadata ingestion, you can follow these steps:

  1. Install the SQLAlchemy Plugin: You need to install the DataHub SQLAlchemy plugin. This can be done using pip:

    pip install 'acryl-datahub[sqlalchemy]'
    

    This step is crucial as it ensures that the necessary dependencies for SQLAlchemy are available.

  2. Create a Recipe: You need to create a YAML recipe file to configure the ingestion process. Here is a basic example:

    source:
      type: sqlalchemy
      config:
        connect_uri: "dialect+driver://username:password@host:port/database"
    sink:
      type: "datahub-rest"
      config:
        server: "http://localhost:8080"  # Change this to your DataHub server URL
    

    The connect_uri should be replaced with the appropriate connection string for your database.

  3. Run the Ingestion: Execute the ingestion using the DataHub CLI:

    datahub ingest -c your_recipe.yaml
    

    This command will start the ingestion process based on the configuration specified in your recipe file.

For more detailed configuration options and capabilities, you can refer to the <https://datahubproject.io/docs/0.13.1/generated/ingestion/sources/sqlalchemy|DataHub SQLAlchemy documentation>.

If you encounter issues such as the SQLAlchemy plugin not being enabled, ensure that the module is correctly installed and that there are no missing dependencies. You can check the installation by running the command pip list to see if acryl-datahub[sqlalchemy] is listed.

Would you like more detailed guidance on any of these steps?

Sources:

Hi, how did my answer work out for you? Need any more assistance? You can also tag <@U07FV3UHE4D> to let your fellow community members know you’re still looking for guidance.

Hi :slight_smile: ! Did this interaction resolve your issue or do you still require assistance ?

Please tag @ RunLLM if you still need further assistance !

Or tag @Datahub Community Support if you require a person to take a look :slight_smile:

Hope this helps ! ><