Resolving Kafka Source Disabled Error in DataHub

Original Slack Thread

<@U06TM7M40RK> when i am trying to create ingestion source for confluent kafka. We are getting below error

datahub.ingestion.run.pipeline.PipelineInitError: Failed to find a registered source for type kafka: kafka is disabled; try running: pip install 'acryl-datahub[kafka]'```

The error message you’re encountering indicates that the Kafka source is disabled due to a missing dependency. Specifically, it suggests that the acryl-datahub[kafka] package is not installed, which is required for Kafka ingestion.

To resolve this issue, you should ensure that the acryl-datahub[kafka] package is installed in your environment. You can do this by running the following command:

pip install 'acryl-datahub[kafka]'

This command will install the necessary dependencies for Kafka ingestion in DataHub. After installing the package, try running your ingestion pipeline again.

If you continue to experience issues, ensure that your environment is correctly set up and that there are no network connectivity issues preventing the installation of the package.

Would you like more detailed guidance on setting up Kafka ingestion in DataHub?

Sources:

Hi <@U087HP4P21X>, how did my answer work out for you? Need any more assistance? You can also tag <@U07FV3UHE4D> to let your fellow community members know you’re still looking for guidance.

Hi :slight_smile: ! Did this interaction resolve your issue or do you still require assistance ?

Please tag @ RunLLM if you still need further assistance !

Or tag @Datahub Community Support if you require a person to take a look :slight_smile:

Hope this helps ! ><