Hi Guys -
We have seen an issue with datahub release v0.13.0 in the redshift connector while using the use_lineage_v2 enabled flag. The use case we were trying to solve was to show the upstream lineage when there is temporary table involved. I believe this issue was fixed with this latest release. https://github.com/datahub-project/datahub/issues/4895
https://github.com/datahub-project/datahub/pull/9704
We were hoping to see the table A as upstream for table C when A–>temp B–>C. But this is showing some weird results.
We see 2 issues:
- For my main table f_order_event, the previous upstream lineage count showed 7 tables and they also showed in the lineage graph. However we got to know that there are some temporary tables also involved into creating f_order_event. We enabled the use_lineage_v2 flag on release v13 and now the count says 9 upstream but those extra 2 tables dont show in the lineage graph. Which is very confusing. Those extra tables are also not present in the upstreamLineage aspect in metadata storage. I confirmed that all permanent tables involved in the lineage for f_order_event are present in our datahub instance.
- THe 2nd issue is when we expand the up/down lineage to 2nd or 3rd levels we see the spinner loading icon permanently like its trying to fetch the information, but the 2nd/3rd level lineage already comes up on the screen. THis is not an issue with other connectors with large lineage like snowflake or vertica.
<@U01GZEETMEZ> <@UV14447EU> any suggestions here? Our customers want to see the upstream tables responsible for creating the temp tables in the lineage, fine with not able to see temp tables.