
[WIP] feat: Prune stale staking data hard fork + onwards pt. 2 #4505

Draft
wants to merge 28 commits into dev
Conversation

@adsorptionenthalpy (Contributor) commented on Sep 18, 2023

This pull request is a follow-up to #4068

Stale staking data is defined as a delegation to a validator which has
no rewards, no amount, and no undelegations. This definition excludes
the (inactive) validator's self-delegation, since it is expected to be
at index 0 and a validator cannot currently be deleted. The process
works as follows:

  • At the last block of every epoch, including the hard fork epoch,
    obtain the list of validators. For each validator, load the
    ValidatorWrapper and iterate through its Delegations to find stale
    delegations. Drop such delegations and build a map with delegators as
    keys and the validator address(es) they were pruned from as values
    (see the sketch after this list).
  • Using the map obtained in the previous step, clear the delegations by
    delegator in off-chain data. The map is therefore cached in the
    ProcessorResult for ease of access (between state_processor and
    node_newblock).
  • Since the objective of this change is to save space following the
    hard fork, small undelegations are no longer allowed: an undelegation
    must leave the remaining delegated amount at either zero or at least
    100 ONE.
  • Lastly, since the removal results in different Delegations lengths
    between validator snapshots and validator wrappers, the AddReward
    algorithm now looks delegations up by DelegatorAddress rather than by
    loop index. See core/state/statedb.go.
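
For illustration only, here is a minimal Go sketch of the pruning pass and the by-address reward lookup. The types below are simplified stand-ins (addresses as plain strings, undelegation details elided), not the actual staking/types definitions, and the function names are invented for this sketch rather than taken from the PR:

```go
package main

import (
	"fmt"
	"math/big"
)

// Simplified, hypothetical stand-ins for the real staking types
// (see staking/types); kept self-contained for the sketch.
type Delegation struct {
	DelegatorAddress string
	Amount           *big.Int
	Reward           *big.Int
	Undelegations    []struct{} // details elided
}

type ValidatorWrapper struct {
	ValidatorAddress string
	Delegations      []Delegation
}

// isStale mirrors the definition above: no amount, no reward, no undelegations.
func isStale(d Delegation) bool {
	return d.Amount.Sign() == 0 && d.Reward.Sign() == 0 && len(d.Undelegations) == 0
}

// pruneStaleDelegations drops stale delegations (never index 0, the validator's
// self-delegation) and returns a map of delegator -> validators pruned from,
// which the off-chain cleanup step can then consume.
func pruneStaleDelegations(wrappers []*ValidatorWrapper) map[string][]string {
	pruned := make(map[string][]string)
	for _, w := range wrappers {
		kept := w.Delegations[:1] // always keep the self-delegation at index 0
		for _, d := range w.Delegations[1:] {
			if isStale(d) {
				pruned[d.DelegatorAddress] = append(pruned[d.DelegatorAddress], w.ValidatorAddress)
				continue
			}
			kept = append(kept, d)
		}
		w.Delegations = kept
	}
	return pruned
}

// addReward looks the delegation up by DelegatorAddress instead of by loop
// index, since pruning means snapshot and wrapper indices no longer line up.
func addReward(w *ValidatorWrapper, delegator string, reward *big.Int) bool {
	for i := range w.Delegations {
		if w.Delegations[i].DelegatorAddress == delegator {
			w.Delegations[i].Reward.Add(w.Delegations[i].Reward, reward)
			return true
		}
	}
	return false // delegation not found (e.g. it was pruned)
}

func main() {
	w := &ValidatorWrapper{
		ValidatorAddress: "one1validator",
		Delegations: []Delegation{
			{DelegatorAddress: "one1validator", Amount: big.NewInt(1), Reward: big.NewInt(0)},
			{DelegatorAddress: "one1delegator", Amount: big.NewInt(0), Reward: big.NewInt(0)},
		},
	}
	fmt.Println(pruneStaleDelegations([]*ValidatorWrapper{w})) // map[one1delegator:[one1validator]]
	fmt.Println(addReward(w, "one1validator", big.NewInt(5)))  // true
}
```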

The impact of this change on block processing speed is small. I ran a
node with this change at epoch 881 on mainnet using the rclone database
and saw that pruneStaleStakingData takes 18 ms to complete (9 ms if each
prune is not logged individually).

The undelegation changes have been integration-tested locally, and unit
tests have been added for pruning stale data, the remaining-undelegation
rule, and AddReward. goimports formatting is included.
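
The third bullet above is the rule those remaining-undelegation tests exercise. Here is a minimal, hypothetical sketch of that check; the helper name and the 100-ONE threshold expressed in atto-ONE are illustrative, not the PR's actual code:

```go
package main

import (
	"fmt"
	"math/big"
)

// Hypothetical threshold: 100 ONE in atto-ONE (1 ONE = 1e18 atto).
var minRemainingDelegation = new(big.Int).Mul(big.NewInt(100), big.NewInt(1e18))

// undelegationAllowed illustrates the rule: after the undelegation, the
// remaining delegated amount must be either zero or at least 100 ONE.
func undelegationAllowed(delegated, toUndelegate *big.Int) bool {
	if toUndelegate.Cmp(delegated) > 0 {
		return false // cannot undelegate more than is delegated
	}
	remaining := new(big.Int).Sub(delegated, toUndelegate)
	return remaining.Sign() == 0 || remaining.Cmp(minRemainingDelegation) >= 0
}

func main() {
	oneONE := big.NewInt(1e18)
	delegated := new(big.Int).Mul(big.NewInt(150), oneONE)
	fmt.Println(undelegationAllowed(delegated, new(big.Int).Mul(big.NewInt(60), oneONE))) // false: 90 ONE would remain
	fmt.Println(undelegationAllowed(delegated, new(big.Int).Mul(big.NewInt(50), oneONE))) // true: 100 ONE remains
	fmt.Println(undelegationAllowed(delegated, delegated))                                // true: zero remains
}
```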

Issue

Described above.

Test

Unit Test Coverage

for file in $(git diff --name-status main..clear-stale-staking-data | sed 's/^..//'); do go test -cover ./$(dirname $file)/; done | sort -f | uniq -i

Before:

?   	github.com/harmony-one/harmony/consensus/engine	[no test files]
?   	github.com/harmony-one/harmony/hmy	[no test files]
?   	github.com/harmony-one/harmony/internal/chain	[no test files]
?   	github.com/harmony-one/harmony/internal/params	[no test files]
?   	github.com/harmony-one/harmony/test/chain/reward	[no test files]
ok  	github.com/harmony-one/harmony/core	(cached)	coverage: 31.2% of statements
ok  	github.com/harmony-one/harmony/core/state	(cached)	coverage: 71.9% of statements
ok  	github.com/harmony-one/harmony/core/vm	(cached)	coverage: 42.4% of statements
ok  	github.com/harmony-one/harmony/hmy/downloader	(cached)	coverage: 75.9% of statements
ok  	github.com/harmony-one/harmony/node/worker	(cached)	coverage: 27.1% of statements
ok  	github.com/harmony-one/harmony/staking	(cached)	coverage: 89.1% of statements
ok  	github.com/harmony-one/harmony/staking/types	(cached)	coverage: 62.9% of statements

After:

?   	github.com/harmony-one/harmony/consensus/engine	[no test files]
?   	github.com/harmony-one/harmony/hmy	[no test files]
?   	github.com/harmony-one/harmony/internal/params	[no test files]
?   	github.com/harmony-one/harmony/test/chain/reward	[no test files]
ok  	github.com/harmony-one/harmony/core	(cached)	coverage: 31.1% of statements
ok  	github.com/harmony-one/harmony/core/state	(cached)	coverage: 75.6% of statements
ok  	github.com/harmony-one/harmony/core/vm	(cached)	coverage: 42.4% of statements
ok  	github.com/harmony-one/harmony/hmy/downloader	(cached)	coverage: 76.0% of statements
ok  	github.com/harmony-one/harmony/internal/chain	(cached)	coverage: 2.8% of statements
ok  	github.com/harmony-one/harmony/node/worker	(cached)	coverage: 27.0% of statements
ok  	github.com/harmony-one/harmony/staking	(cached)	coverage: 89.1% of statements
ok  	github.com/harmony-one/harmony/staking/types	(cached)	coverage: 63.2% of statements

Per-line test coverage for files modified by this PR is available here

Test/Run Logs

{'blockNum': 23232511,
 'caller': '/home/user/go/src/github.com/harmony-one/harmony/internal/chain/engine.go:364',
 'elapsed time': 18,
 'epoch': 881,
 'level': 'info',
 'message': 'pruneStaleStakingData',
 'time': '2022-02-23T08:43:54.259281751Z'}

Full logs show that stale data was removed from 472 of the 676 validators overall.

Operational Checklist

  1. Does this PR introduce backward-incompatible changes to the on-disk data structure and/or the over-the-wire protocol? (If no, skip to question 8.)
    No.

  2. Describe the migration plan. For each flag epoch, describe what changes take place at the flag epoch, the anticipated interactions between upgraded/non-upgraded nodes, and any special operational considerations for the migration.

  3. Describe how the plan was tested.

  4. How much minimum baking period after the last flag epoch should we allow on Pangaea before promotion onto mainnet?

  5. What are the planned flag epoch numbers and their ETAs on Pangaea?

  6. What are the planned flag epoch numbers and their ETAs on mainnet?

    Note that this must be enough to cover baking period on Pangaea.

  7. What should node operators know about this planned change?

  8. Does this PR introduce backward-incompatible changes NOT related to on-disk data structure and/or over-the-wire protocol? (If no, continue to question 11.)
    No.

  9. Does the existing node.sh continue to work with this change?

  10. What should node operators know about this change?

  11. Does this PR introduce significant changes to the operational requirements of the node software, such as >20% increase in CPU, memory, and/or disk usage?
    No. The bulk pruning adds only about 18 ms at the hard fork epoch, and continuous pruning at the end of each subsequent epoch is not expected to take longer than that.

@adsorptionenthalpy self-assigned this on Sep 18, 2023
@adsorptionenthalpy changed the title from "feat: Prune stale staking data hard fork + onwards pt. 2" to "[WIP] feat: Prune stale staking data hard fork + onwards pt. 2" on Sep 18, 2023
@adsorptionenthalpy added the "WIP Work in progress don't merge yet!" label on Sep 18, 2023