I am looking for incremental ingestion from a single cluster.
For example, I have 1+ million datasets in Hive. After doing the initial loading is there any way of ingesting updated/mew datasets in incremental way instead of running the ingestion on the whole cluster?
Currently it takes days to do the ingestion on the complete cluster.