Completely disabling backoff #1651

Open

itiulkanov opened this issue Sep 21, 2022 · 0 comments
Please use the following questions as a guideline to help me answer
your issue/question without further inquiry. Thank you.

Which version of Elastic are you using?

[x] elastic.v7 (for Elasticsearch 7.x)

Please describe the expected behavior

Once backoff is disabled, we expect all data sent to the disabled (unavailable) cluster to be discarded, so that we only process the cluster's response in the after callback and store all failed values in the DLQ. Instead, the data sent while ES is down is kept and submitted once the connection to the ES cluster is restored.

Please describe the actual behavior

We have a constant flow of values that are stored in several ES clusters. We added code that is supposed to handle the failure of one of the ES clusters by sending the affected values to a dead letter queue (DLQ) and reading them back once the cluster is restored. The issue we are having now is duplicate records. Even without a backoff policy defined (or with something like StopBackoff{}), once the ES cluster is restored, BulkProcessor sends all the values it received from the source into ES at the same time as we start processing the DLQ. This means that all the records ingested by the app while the ES cluster is off simply pile up on top of each other instead of being discarded after the "after" callback has been called.
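For reference, our after callback looks roughly like the sketch below (simplified; sendToDLQ stands in for our real DLQ writer, and the signature matches elastic.BulkAfterFunc): on a whole-batch error it queues every request of the commit, otherwise it queues only the items ES rejected.

// Simplified sketch of our callback; sendToDLQ is a stand-in for our
// real DLQ writer.
func after(executionID int64, requests []elastic.BulkableRequest, response *elastic.BulkResponse, err error) {
	if err != nil {
		// The whole commit failed (e.g. the cluster is unreachable),
		// so every request of this batch goes to the DLQ for replay.
		for _, req := range requests {
			sendToDLQ(req.String()) // BulkableRequest embeds fmt.Stringer
		}
		return
	}
	if response == nil {
		return
	}
	// The commit went through; re-queue only the rejected items.
	for _, item := range response.Failed() {
		sendToDLQ(item.Id)
	}
}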
We've tried to overcome the issue by setting up StopBackoff like this:

processor, err = client.BulkProcessor().
	Workers(o.Workers).                              // concurrent workers
	BulkActions(o.BatchSize).                        // commit after this many requests
	BulkSize(o.BatchBytes).                          // commit after this many bytes
	FlushInterval(o.FlushInterval).                  // commit at least this often
	RetryItemStatusCodes(o.RetryItemStatusCodes...). // item-level retry codes
	Backoff(elastic.StopBackoff{}).                  // never retry a failed commit
	Stats(o.WantStats).
	After(after). // call "after" after every commit
	Do(o.Ctx)

But that didn't help: it seems to only generate one error after another without discarding anything.
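Looking at the library source, StopBackoff only tells the retry loop to give up on the current commit; it says nothing about the buffered requests. If I read it correctly, it boils down to this:

// From the library's backoff.go, as far as I can tell: StopBackoff
// reports "no wait, no retry" on every call, so the commit fails fast.
func (b StopBackoff) Next(retry int) (time.Duration, bool) {
	return 0, false
}

Since a worker only resets its buffer after a successful commit, the batch survives the failed commit and is re-sent on the next flush, which would explain the duplicates we see.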

This behavior would be fine if we were processing a small volume of messages, but when a lot of messages are held in memory, the app eventually crashes and all the stored messages are lost.

Any steps to reproduce the behavior?

  1. Set up the BulkProcessor with StopBackoff as the backoff policy.
  2. Disable the ES cluster while the BulkProcessor app keeps running and ingesting messages from the source (Kafka).
  3. Send some values to the app's ingestion channel.
  4. Enable the ES cluster.
  5. All the values sent to the BulkProcessor while the ES cluster was disabled can be seen in the ES cluster.

Suggested solutions

I see that records are cleared in only one place, where s.Reset() is called, and that part of the code never seems to be reached while the cluster is off. Is there a way to clear out the records once the after callback has finished? Or could a setting be added to allow this? A sketch of what I have in mind follows.
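Something like the following would cover our use case (the DiscardOnCommitFailure name is purely hypothetical and not in the library today):

// Hypothetical API sketch: an option that makes each worker drop its
// buffered requests once "after" has run, regardless of whether the
// commit succeeded.
processor, err = client.BulkProcessor().
	Workers(o.Workers).
	Backoff(elastic.StopBackoff{}).
	DiscardOnCommitFailure(true). // hypothetical setting
	After(after).
	Do(o.Ctx)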
