SPIKE: Make indexer 2.14 with local ledger more resilient in case of restarts. #1200

urtho · 2022-08-23T08:37:49Z

Problem

We've deployed 2.14.0rc3 on testnet on 25+ nodes.
After two days of testing (that includes random restarts) we've observed ledger cache ahead of postgres ledger which requires manual intervention. Happened to 3 different nodes.

2.14 is our first indexer with local ledger. We've skipped 2.12 and 2.13

{"error":"MakeProcessorWithLedgerInit() err: InitializeLedger() simple catchup err: RunMigration() err: MakeProcessor() err: the ledger cache is ahead of the required round and must be re-initialized","level":"error","msg":"blockprocessor.MakeProcessor() err MakeProcessorWithLedgerInit() err: InitializeLedger() simple catchup err: RunMigration() err: MakeProcessor() err: the ledger cache is ahead of the required round and must be re-initialized","time":"2022-08-23T08:26:37Z"}

Probably not generally fixable with current approach but maybe a "one block off" situation could be addressed.

Urgency

Not very urgent but all shutdowns were "clean" ones so statistically this is going to hurt.

Acceptance Criteria

Use the MaxAccountLookback in the ledger to fetch recent StateDelta objects.
If the local ledger is ahead of postgres, use the historic StateDelta instead of computing a new one.

The text was updated successfully, but these errors were encountered:

fabrice102 · 2023-05-31T13:24:13Z

@urtho How did you manually fix the issue without a full reset of the indexer?

urtho · 2023-05-31T15:45:43Z

I just do fast catchup from a matching catchup from this list https://algorand-catchpoints.s3.us-east-2.amazonaws.com/consolidated/mainnet_catchpoints.txt

So downtime is only 40 minutes.

fabrice102 · 2023-05-31T17:34:52Z

Thanks! Unfortunately, in my case, this did not work out: the indexer started indexing from start again... I'm not completely sure why.

urtho added the new-feature-request Feature request that needs triage label Aug 23, 2022

winder added the Team Lamprey label Sep 8, 2022

chaihoang assigned shiqizng and unassigned shiqizng Sep 8, 2022

winder changed the title ~~Make indexer 2.14 with local ledger more resilient in case of restarts.~~ SPIKE: Make indexer 2.14 with local ledger more resilient in case of restarts. Sep 22, 2022

shiqizng self-assigned this Oct 3, 2022

winder unassigned shiqizng Oct 25, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SPIKE: Make indexer 2.14 with local ledger more resilient in case of restarts. #1200

SPIKE: Make indexer 2.14 with local ledger more resilient in case of restarts. #1200

urtho commented Aug 23, 2022 •

edited by winder

fabrice102 commented May 31, 2023

urtho commented May 31, 2023

fabrice102 commented May 31, 2023

SPIKE: Make indexer 2.14 with local ledger more resilient in case of restarts. #1200

SPIKE: Make indexer 2.14 with local ledger more resilient in case of restarts. #1200

Comments

urtho commented Aug 23, 2022 • edited by winder

Problem

Urgency

Acceptance Criteria

fabrice102 commented May 31, 2023

urtho commented May 31, 2023

fabrice102 commented May 31, 2023

urtho commented Aug 23, 2022 •

edited by winder