Adjusting DataHub Ingestion Limit for BigQuery Tables

Original Slack Thread

<@U06TM7M40RK> WARNING {datahub.ingestion.source.bigquery_v2.bigquery_schema:528} -table contains more than 300 columns, only processing 300 columns

This warning means the DataHub BigQuery ingestion source is configured to process at most 300 columns per table. When a table has more than 300 columns, only the first 300 are processed and the rest are ignored. The limit is configurable, not a hard cap.

According to the DataHub BigQuery configuration documentation, you can adjust this limit with the column_limit property. It defaults to 300, and you can raise it to suit your environment.
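
As a rough sketch, the setting would sit under the BigQuery source config in an ingestion recipe, something like the fragment below. The project ID, the value 500, and the sink server URL are placeholders assumed for illustration, not values from this thread; check the placement against the BigQuery source documentation for your DataHub version.

```yaml
# Minimal recipe sketch (placeholder values, adjust for your environment)
source:
  type: bigquery
  config:
    project_ids:
      - my-gcp-project        # hypothetical project ID
    column_limit: 500         # default is 300; raise to cover your widest table

sink:
  type: datahub-rest
  config:
    server: http://localhost:8080   # assumed local DataHub endpoint
```

After re-running ingestion with the higher value, the warning should only appear for tables that exceed the new limit.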

If you would like more detailed guidance on how to adjust this setting, please let me know!


