Troubleshooting ingestion issue for dataset with tables and sharded tables in Datahub v0.13

user-2 · May 27, 2024, 12:04am

Hello everyone (Datahub v0.13). I’m ingesting a series of datasets from BigQuery; their name is in the form published_something. I’m in trouble beacuse one of my published_something datasets is not able to be ingested for some reason, in the sense that the ingestion starts but then it remains running like forever. I’m copying the recipe I’m using to ingest every single published dataset, and I want to specify that it works fine for every other dataset:

source:
type: bigquery
config:
env: TEST
include_table_lineage: true
include_usage_statistics: true
include_tables: true
include_views: true
include_schema_metadata: true
profiling:
enabled: true
profile_table_level_only: true
stateful_ingestion:
enabled: true
credential:
project_id: extractor_project
private_key: ‘-----BEGIN PRIVATE KEY-----\n<covered>\n-----END PRIVATE KEY-----\n’
private_key_id: <covered>
client_email: <covered>
client_id: <covered>
project_id: test-project
dataset_pattern:
allow:
- test-project.published_dataset
deny:
ignoreCase: true
table_pattern:
allow:
- ‘.*’
deny:
ignoreCase: true

I want to specify that the dataset I’m not able to ingest contains tables and sharded tables, whereas all of the other dataset only contains tables and views. Any idea of the problem or any suggestion to ingest tables and sharded tables belonging to this dataset? I’ve tried to ingest one single table at a time, but it does not work.

datahub_team · May 27, 2024, 12:04am

Hey there! Make sure your message includes the following information if relevant, so we can help more effectively!

Are you using UI or CLI for ingestion?
Which DataHub version are you using? (e.g. 0.12.0)
What data source(s) are you integrating with DataHub? (e.g. BigQuery)

Topic		Replies	Views
.title {"How to Ingest Only the Most Recent Partition of Sharded Tables in Datahub UI"} ingestion	3	30	May 20, 2024
Ingesting Multiple Tables with the Same Name from Different Datasets in UI BigQuery ingestion	7	64	March 4, 2024
Profiling a Specific Table in a Dataset with Data Ingestion Recipe ingestion	2	64	March 4, 2024
Troubleshooting Dataset Ingestion Issue with Superset Configuration ingestion	15	98	August 12, 2024
Troubleshooting Dataset Existence Issue in DataHub with PostgreSQL Data Source ingestion	3	16	July 22, 2024

Troubleshooting ingestion issue for dataset with tables and sharded tables in Datahub v0.13

Related topics