<@U06TM7M40RK> A wrong count is displayed under the platform name in the UI (under “Platforms”). It shows a number greater than the actual number of entities. Do you know why that is?
The discrepancy in the number of entities displayed under the platform name in the UI can be attributed to a few potential reasons:
- Inclusion of Containers: According to a GitHub issue in datahub-project/datahub, the platform total includes both datasets (tables) and the containers that hold them, such as databases and schemas. The count under Platforms can therefore be higher than the number of datasets, because it aggregates all entity types on that platform.
- Data Inconsistency: Elasticsearch and the database (e.g., MySQL) might be out of sync. As mentioned in another GitHub issue in datahub-project/datahub, Elasticsearch can hold excess documents that are not present in the database, which leads to a mismatch in counts.
- UI Changes: As noted in a Slack conversation, recent UI changes might affect how counts are displayed, such as rounding large numbers, which could make the numbers appear to disagree.
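To make the first two points concrete, here is a minimal Python sketch of how you might reconcile the numbers once you have exported two URN lists yourself. It does not call any DataHub API: the sample URNs, the `index_urns` / `db_urns` lists, and the helper names are all hypothetical, and it only assumes the usual `urn:li:<entityType>:<key>` URN layout.

```python
from collections import Counter


def entity_type(urn: str) -> str:
    # Assumes the URN layout "urn:li:<entityType>:<key>"; the third
    # colon-separated field is taken as the entity type.
    return urn.split(":", 3)[2]


def count_by_type(urns):
    # Break a flat list of URNs down into per-entity-type counts.
    return Counter(entity_type(u) for u in urns)


def index_only(index_urns, db_urns):
    # URNs present in the search index export but missing from the database export.
    return sorted(set(index_urns) - set(db_urns))


# Hypothetical sample data standing in for exports from the search index
# and from the database for one platform.
index_urns = [
    "urn:li:container:db1",
    "urn:li:container:schema1",
    "urn:li:dataset:(urn:li:dataPlatform:mysql,db1.schema1.orders,PROD)",
    "urn:li:dataset:(urn:li:dataPlatform:mysql,db1.schema1.users,PROD)",
    "urn:li:dataset:(urn:li:dataPlatform:mysql,db1.schema1.old_table,PROD)",
]
db_urns = index_urns[:4]  # the last dataset exists only in the index

by_type = count_by_type(index_urns)
platform_total = sum(by_type.values())  # containers and datasets together
stale = index_only(index_urns, db_urns)

print(dict(by_type))
print("platform total:", platform_total)
print("only in index:", stale)
```

If the per-type breakdown shows containers making up the difference, the first explanation applies; if `stale` is non-empty, the second one is the more likely cause.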
Would you like more detailed information on any of these points or guidance on how to address the issue?
Sources:
- GitHub issue (datahub-project/datahub): "... of entities by Entity Type and by Platform do not match"
- GitHub issue (datahub-project/datahub): "... value does not match the actual number of objects in GraphQL query result"
- Slack conversation on dataset count per platform