• Log ingest for “Entities” schema as below:
[2024-04-02 07:52:38,066] INFO {datahub.cli.ingest_cli:147} - DataHub CLI version: 0.13.0
[2024-04-02 07:52:38,147] INFO {datahub.ingestion.run.pipeline:238} - Sink configured successfully. DataHubRestEmitter: configured to talk to <http://datahub-gms:8080>
[2024-04-02 07:52:38,995] INFO {datahub.ingestion.run.pipeline:255} - Source configured successfully.
[2024-04-02 07:52:38,996] INFO {datahub.cli.ingest_cli:128} - Starting metadata ingestion
2024-04-02 07:52:39,553 INFO sqlalchemy.engine.Engine SELECT SchemaName FROM sys_schemas
[2024-04-02 07:52:39,553] INFO {sqlalchemy.engine.base:1853} - SELECT SchemaName FROM sys_schemas
/2024-04-02 07:52:39,553 INFO sqlalchemy.engine.Engine [raw sql] ()
[2024-04-02 07:52:39,553] INFO {sqlalchemy.engine.base:1858} - [raw sql] ()
2024-04-02 07:52:39,673 INFO sqlalchemy.engine.Engine COMMIT
[2024-04-02 07:52:39,673] INFO {sqlalchemy.engine.base:1087} - COMMIT
2024-04-02 07:52:39,695 INFO sqlalchemy.engine.Engine SELECT TableName FROM SYS_TABLES WHERE TableType='TABLE' AND SchemaName=?
[2024-04-02 07:52:39,695] INFO {sqlalchemy.engine.base:1853} - SELECT TableName FROM SYS_TABLES WHERE TableType='TABLE' AND SchemaName=?
2024-04-02 07:52:39,695 INFO sqlalchemy.engine.Engine [raw sql] ('Entities',)
[2024-04-02 07:52:39,695] INFO {sqlalchemy.engine.base:1858} - [raw sql] ('Entities',)
2024-04-02 07:52:44,653 INFO sqlalchemy.engine.Engine COMMIT
|[2024-04-02 07:52:44,653] INFO {sqlalchemy.engine.base:1087} - COMMIT
2024-04-02 07:52:44,662 INFO sqlalchemy.engine.Engine SELECT TableName FROM SYS_TABLES WHERE TableType='VIEW' AND SchemaName=?
[2024-04-02 07:52:44,662] INFO {sqlalchemy.engine.base:1853} - SELECT TableName FROM SYS_TABLES WHERE TableType='VIEW' AND SchemaName=?
2024-04-02 07:52:44,662 INFO sqlalchemy.engine.Engine [raw sql] ('Entities',)
[2024-04-02 07:52:44,662] INFO {sqlalchemy.engine.base:1858} - [raw sql] ('Entities',)
2024-04-02 07:52:44,669 INFO sqlalchemy.engine.Engine COMMIT
[2024-04-02 07:52:44,669] INFO {sqlalchemy.engine.base:1087} - COMMIT
2024-04-02 07:52:44,671 INFO sqlalchemy.engine.Engine SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
[2024-04-02 07:52:44,671] INFO {sqlalchemy.engine.base:1853} - SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
2024-04-02 07:52:44,671 INFO sqlalchemy.engine.Engine [raw sql] ('MultiSelectPickListAttributeMetaData', 'Entities')
[2024-04-02 07:52:44,671] INFO {sqlalchemy.engine.base:1858} - [raw sql] ('MultiSelectPickListAttributeMetaData', 'Entities')
2024-04-02 07:52:44,741 INFO sqlalchemy.engine.Engine COMMIT
[2024-04-02 07:52:44,741] INFO {sqlalchemy.engine.base:1087} - COMMIT
2024-04-02 07:52:44,749 INFO sqlalchemy.engine.Engine SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
[2024-04-02 07:52:44,749] INFO {sqlalchemy.engine.base:1853} - SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
2024-04-02 07:52:44,749 INFO sqlalchemy.engine.Engine [raw sql] ('MultiSelectPickListOptions', 'Entities')
[2024-04-02 07:52:44,749] INFO {sqlalchemy.engine.base:1858} - [raw sql] ('MultiSelectPickListOptions', 'Entities')
2024-04-02 07:52:44,763 INFO sqlalchemy.engine.Engine COMMIT
[2024-04-02 07:52:44,763] INFO {sqlalchemy.engine.base:1087} - COMMIT
2024-04-02 07:52:44,771 INFO sqlalchemy.engine.Engine SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
[2024-04-02 07:52:44,771] INFO {sqlalchemy.engine.base:1853} - SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
2024-04-02 07:52:44,771 INFO sqlalchemy.engine.Engine [raw sql] ('PickListAttributeMetaData', 'Entities')
[2024-04-02 07:52:44,771] INFO {sqlalchemy.engine.base:1858} - [raw sql] ('PickListAttributeMetaData', 'Entities')
2024-04-02 07:52:44,786 INFO sqlalchemy.engine.Engine COMMIT
[2024-04-02 07:52:44,786] INFO {sqlalchemy.engine.base:1087} - COMMIT
2024-04-02 07:52:44,793 INFO sqlalchemy.engine.Engine SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
[2024-04-02 07:52:44,793] INFO {sqlalchemy.engine.base:1853} - SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
2024-04-02 07:52:44,793 INFO sqlalchemy.engine.Engine [raw sql] ('PickListOptions', 'Entities')
[2024-04-02 07:52:44,793] INFO {sqlalchemy.engine.base:1858} - [raw sql] ('PickListOptions', 'Entities')
2024-04-02 07:52:44,805 INFO sqlalchemy.engine.Engine COMMIT
[2024-04-02 07:52:44,805] INFO {sqlalchemy.engine.base:1087} - COMMIT
2024-04-02 07:52:44,811 INFO sqlalchemy.engine.Engine SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
[2024-04-02 07:52:44,811] INFO {sqlalchemy.engine.base:1853} - SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
2024-04-02 07:52:44,811 INFO sqlalchemy.engine.Engine [raw sql] ('StateAttributeMetadata', 'Entities')
[2024-04-02 07:52:44,811] INFO {sqlalchemy.engine.base:1858} - [raw sql] ('StateAttributeMetadata', 'Entities')
2024-04-02 07:52:44,824 INFO sqlalchemy.engine.Engine COMMIT
[2024-04-02 07:52:44,824] INFO {sqlalchemy.engine.base:1087} - COMMIT
2024-04-02 07:52:44,831 INFO sqlalchemy.engine.Engine SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
[2024-04-02 07:52:44,831] INFO {sqlalchemy.engine.base:1853} - SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
2024-04-02 07:52:44,831 INFO sqlalchemy.engine.Engine [raw sql] ('StateAttributeOptions', 'Entities')
[2024-04-02 07:52:44,831] INFO {sqlalchemy.engine.base:1858} - [raw sql] ('StateAttributeOptions', 'Entities')
2024-04-02 07:52:44,842 INFO sqlalchemy.engine.Engine COMMIT
[2024-04-02 07:52:44,842] INFO {sqlalchemy.engine.base:1087} - COMMIT
2024-04-02 07:52:44,850 INFO sqlalchemy.engine.Engine SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
[2024-04-02 07:52:44,850] INFO {sqlalchemy.engine.base:1853} - SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
2024-04-02 07:52:44,850 INFO sqlalchemy.engine.Engine [raw sql] ('StatusAttributeMetadata', 'Entities')
[2024-04-02 07:52:44,850] INFO {sqlalchemy.engine.base:1858} - [raw sql] ('StatusAttributeMetadata', 'Entities')
2024-04-02 07:52:44,864 INFO sqlalchemy.engine.Engine COMMIT
[2024-04-02 07:52:44,864] INFO {sqlalchemy.engine.base:1087} - COMMIT
2024-04-02 07:52:44,870 INFO sqlalchemy.engine.Engine SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
[2024-04-02 07:52:44,870] INFO {sqlalchemy.engine.base:1853} - SELECT CatalogName,SchemaName,TableName,ColumnName,DataType,NumericScale,IsNullable,Ordinal,IsAutoIncrement,IsKey,NumericPrecision FROM sys_tablecolumns WHERE TableName=? AND SchemaName=?
2024-04-02 07:52:44,870 INFO sqlalchemy.engine.Engine [raw sql] ('StatusAttributeOptions', 'Entities')
[2024-04-02 07:52:44,870] INFO {sqlalchemy.engine.base:1858} - [raw sql] ('StatusAttributeOptions', 'Entities')
2024-04-02 07:52:44,881 INFO sqlalchemy.engine.Engine COMMIT
[2024-04-02 07:52:44,881] INFO {sqlalchemy.engine.base:1087} - COMMIT
\[2024-04-02 07:52:44,933] INFO {datahub.cli.ingest_cli:141} - Finished metadata ingestion
Cli report:
{'cli_version': '0.13.0',
'cli_entry_location': '/usr/local/lib/python3.10/site-packages/datahub/__init__.py',
'py_version': '3.10.11 (main, May 23 2023, 13:58:30) [GCC 10.2.1 20210110]',
'py_exec_path': '/usr/local/bin/python',
'os_details': 'Linux-4.18.0-348.7.1.el8_5.x86_64-x86_64-with-glibc2.31',
'mem_info': '272.49 MB',
'peak_memory_usage': '272.49 MB',
'disk_info': {'total': '527.37 GB', 'used': '88.36 GB', 'used_initally': '88.36 GB', 'free': '412.15 GB'},
'peak_disk_usage': '88.36 GB',
'thread_count': 2,
'peak_thread_count': 2}
Source (sqlalchemy) report:
{'events_produced': 61,
'events_produced_per_sec': 9,
'entities': {'container': ['urn:li:container:7dff0b80aea6abeca2e6ae6ffbccf54c', 'urn:li:container:a2cbc87138bc4d5824c9bb7693e2b238'],
'dataset': ['urn:li:dataset:(urn:li:dataPlatform:dataverse,Entities.MultiSelectPickListAttributeMetaData,PROD)',
'urn:li:dataset:(urn:li:dataPlatform:dataverse,Entities.MultiSelectPickListOptions,PROD)',
'urn:li:dataset:(urn:li:dataPlatform:dataverse,Entities.PickListAttributeMetaData,PROD)',
'urn:li:dataset:(urn:li:dataPlatform:dataverse,Entities.PickListOptions,PROD)',
'urn:li:dataset:(urn:li:dataPlatform:dataverse,Entities.StateAttributeMetadata,PROD)',
'urn:li:dataset:(urn:li:dataPlatform:dataverse,Entities.StateAttributeOptions,PROD)',
'urn:li:dataset:(urn:li:dataPlatform:dataverse,Entities.StatusAttributeMetadata,PROD)',
'urn:li:dataset:(urn:li:dataPlatform:dataverse,Entities.StatusAttributeOptions,PROD)']},
'aspects': {'container': {'containerProperties': 2, 'status': 2, 'dataPlatformInstance': 2, 'subTypes': 2, 'browsePathsV2': 4, 'container': 1},
'dataset': {'container': 8,
'status': 8,
'datasetProperties': 8,
'schemaMetadata': 8,
'subTypes': 8,
'viewProperties': 8,
'browsePathsV2': 16}},
{--- omitted --- }
Pipeline finished successfully; produced 61 events in 6.39 seconds.```