Difference Between RestoreIndex Job and SystemUpdate Job with Reindex Set to True

Original Slack Thread

Hi team, we have been testing the DataHub restoreindex and systemupdate jobs. We observe that we have to run both the restoreindex and systemupdate jobs for DataHub to work correctly. However, the systemupdate job also has a config option for Elasticsearch reindex. So, in theory, we have run the reindex twice. The question is: what is the difference between the RestoreIndex job and the SystemUpdate job with reindex set to true?

<@U04UKA5L5LK> might be able to speak to this!

Hey <@U05467759ME>! So, the system update may change the schema of the Elasticsearch indices, which would trigger a reindex. However, when the schema has not changed there is no reindex, so that step is not always necessary. Restore indices, on the other hand, fixes any inconsistencies in the index but does not change the schema. So it is possible that both of them are needed. I hope that helps!
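
To make the distinction above concrete, here is a minimal toy sketch in Python. It is not DataHub code: `SearchIndex`, `system_update`, `restore_indices`, and the sample data are all hypothetical names made up for illustration, and it assumes that a schema-driven reindex copies documents from the old index while a restore rewrites documents from a separate source of truth.

```python
# Toy model only: none of these names are DataHub or Elasticsearch APIs.
from dataclasses import dataclass, field


@dataclass
class SearchIndex:
    mapping: dict                               # field name -> field type (the "schema")
    docs: dict = field(default_factory=dict)    # doc id -> document


def system_update(index, desired_mapping):
    """Rebuild the index only if its mapping differs from the desired one.

    Assumption: documents are copied from the old index, so anything
    missing there is still missing afterwards.
    """
    if index.mapping == desired_mapping:
        return index, False                     # no schema change, no reindex
    rebuilt = SearchIndex(mapping=dict(desired_mapping), docs=dict(index.docs))
    return rebuilt, True


def restore_indices(index, source_of_truth):
    """Rewrite every document from the source of truth; the mapping is untouched."""
    index.docs.update({doc_id: dict(doc) for doc_id, doc in source_of_truth.items()})
    return index


# Hypothetical data: the index is missing "b" and has an outdated mapping.
source = {"a": {"name": "alpha"}, "b": {"name": "beta"}}
index = SearchIndex(mapping={"name": "text"}, docs={"a": {"name": "alpha"}})

index, reindexed = system_update(index, {"name": "keyword"})
missing_after_update = sorted(set(source) - set(index.docs))

index = restore_indices(index, source)
missing_after_restore = sorted(set(source) - set(index.docs))
```

In this toy model the update step fixes the mapping but leaves document "b" missing, and the restore step fills in "b" without touching the mapping, which is why both steps can be needed.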