Investigating Pipeline Failures and Invalid URN Format

Original Slack Thread

[2023-10-18, 10:19:33 MSK] {logging_mixin.py:149} INFO - Pipeline finished with at least 44 failures; produced 2581 events in 5 minutes and 10.79 seconds.
datahub.configuration.common.PipelineExecutionError: (‘Sink reported errors’, DataHubRestSinkReport(total_records_written=2537, records_written_per_second=8, warnings=, failures=[{‘error’: ‘Unable to emit metadata to DataHub GMS: Invalid format for aspect: {customProperties={Dimensions=quarter, Filters=, Metrics=AHT}, inputs=[{string=urn:li:dataset:(urn:li:dataPlatform:clickhouse,m_clickhouse.autodwh_dash_m.AHT, Ma,PROD)}], description=, lastModified={created={actor=urn:li:corpuser:unknown, time=1696843600540}, lastModified={actor=urn:li:corpuser:unknown, time=1696843600540}}, type=BAR, title=AHT Q2Q, сек, chartUrl=https://superset-integration.team/superset/explore/?form_data={“slice_id”%3A%202507}} for entity: urn:li:chart:(superset,2507)\n Cause: ERROR :: /inputs/0/string :: “Provided urn urn:li:dataset:(urn:li:dataPlatform:clickhouse,m_clickhouse.autodwh_dash_m.AHT, Ma,PROD)” is invalid: Failed to convert urn to entity key: urns parts and key fields do not have same length for urn:li:dataset:(urn:li:dataPlatform:clickhouse,m_clickhouse.autodwh_dash_m.AHT, Ma,PROD)\n’, ‘info’: {‘exceptionClass’: ‘com.linkedin.restli.server.RestLiServiceException’, ‘message’: ‘Invalid format for aspect: {customProperties={Dimensions=quarter, Filters=, Metrics=AHT}, inputs=[{string=urn:li:dataset:(urn:li:dataPlatform:clickhouse,m_clickhouse.autodwh_dash_m.AHT, Ma,PROD)}], description=, lastModified={created={actor=urn:li:corpuser:unknown, time=1696843600540}, lastModified={actor=urn:li:corpuser:unknown, time=1696843600540}}, type=BAR, title=AHT Q2Q, сек, chartUrl=https://superset-integration.team/superset/explore/?form_data={“slice_id”%3A%202507}} for entity: urn:li:chart:(superset,2507)\n Cause: ERROR :: /inputs/0/string :: “Provided urn urn:li:dataset:(urn:li:dataPlatform:clickhouse,m_clickhouse.autodwh_dash_mail.AHT, Ma,PROD)” is invalid: Failed to convert urn to entity key: urns parts and key fields do not have same length for 
urn:li:dataset:(urn:li:dataPlatform:clickhouse,m_clickhouse.autodwh_dash_m.AHT, Ma,PROD)\n’, ‘status’: 400, ‘id’: ‘urn:li:chart:(superset,2507)’}}.

Could you please help find the cause of these errors?

A follow-up: the problem appears to be in the URN. The error reports urn:li:dataset:(urn:li:dataPlatform:clickhouse,m_clickhouse.autodwh_dash_m.AHT, Ma,PROD), when the expected format is urn:li:dataset:(DATAPLATFORMURN,SOME_ID,ENVIRONMENT). Why would the connecting library generate 4 parameters when there should be 3? clickhouse is the platform, m_clickhouse.autodwh_dash_m.AHT is the id, and PROD is the environment. Where does Ma come from? Could the comma in the value be the problem? Can it be fixed?
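
To illustrate the "4 parameters instead of 3" observation, here is a minimal Python sketch that splits the tuple portion of the URN from the error message on its top-level commas. The helper split_urn_parts is hypothetical and written only for this illustration; it is not DataHub's actual URN parser, which may apply different rules.

```python
def split_urn_parts(urn: str) -> list[str]:
    """Split the parenthesized part of a URN on top-level commas.

    Hypothetical helper for illustration only, not part of DataHub.
    """
    inner = urn[urn.index("(") + 1 : urn.rindex(")")]
    parts, current, depth = [], [], 0
    for ch in inner:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
        if ch == "," and depth == 0:
            # A comma outside nested parentheses ends the current part.
            parts.append("".join(current))
            current = []
        else:
            current.append(ch)
    parts.append("".join(current))
    return parts


# URN copied from the error message in this thread.
bad_urn = (
    "urn:li:dataset:(urn:li:dataPlatform:clickhouse,"
    "m_clickhouse.autodwh_dash_m.AHT, Ma,PROD)"
)
parts = split_urn_parts(bad_urn)
print(len(parts))
print(parts)
```

Under this naive split, the comma inside the name "AHT, Ma" produces an extra part, which is consistent with the "urns parts and key fields do not have same length" message.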

Do you have any assets (e.g. a table or view) named "AHT, Ma"?