
Use delayed power table to validate or drop messages from future instances #151

Open
1 of 5 tasks
Tracked by #246 ...
anorth opened this issue Apr 3, 2024 · 7 comments
Assignees
Labels
gossipbft Relates to core GossipPBFT protocol

Comments

@anorth
Member

anorth commented Apr 3, 2024

We have agreed that the voting weights for an instance should come from the power table of an instance some distance further back (10 instances / 5 minutes? perhaps longer), rather than from the immediately previous instance. This allows us to do proper message validation and avoid retransmitting bad messages.

Since the power table is provided by the node, there may be little or no code change required in GPBFT, but we should (1) confirm this, (2) ensure that multi-instance tests exercise this properly, and (3) use #125 to help us gain confidence in the lookback distance.

@anorth anorth added the gossipbft Relates to core GossipPBFT protocol label Apr 3, 2024
@anorth
Member Author

anorth commented Apr 4, 2024

The point of this is to be able to validate messages from near-future instances (from a node's point of view). So there will be work here to implement that validation and a queue of validated messages. This may involve changes to the host API so the participant can reliably keep track of N power tables and have the right one on hand for any message.
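A minimal sketch of that validation-and-queue idea, using hypothetical simplified types (`PowerTable`, `Host`, `futureQueue` and its bounds are all illustrative, not the real go-f3 API): a message for a near-future instance is checked against the power table for that instance and held until the participant catches up.

```go
package main

import "fmt"

// PowerTable maps participant IDs to voting power (hypothetical simplified type).
type PowerTable map[string]int64

// Host is a sketch of the host API extension discussed above: the participant
// asks the host for the power table to use for a given instance.
type Host interface {
	GetPowerTable(instance uint64) (PowerTable, error)
}

// staticHost serves one fixed table for every instance (demo helper).
type staticHost struct{ pt PowerTable }

func (h staticHost) GetPowerTable(uint64) (PowerTable, error) { return h.pt, nil }

// futureQueue validates messages for near-future instances against the
// appropriate power table and queues them until the participant reaches them.
type futureQueue struct {
	host     Host
	current  uint64 // instance the participant is currently running
	maxAhead uint64 // how far into the future messages are accepted
	queued   map[uint64][]string
}

func (q *futureQueue) Receive(instance uint64, sender, msg string) error {
	if instance < q.current || instance > q.current+q.maxAhead {
		return fmt.Errorf("instance %d outside [%d, %d]", instance, q.current, q.current+q.maxAhead)
	}
	pt, err := q.host.GetPowerTable(instance)
	if err != nil {
		return err
	}
	if pt[sender] <= 0 {
		return fmt.Errorf("sender %q has no power in instance %d", sender, instance)
	}
	q.queued[instance] = append(q.queued[instance], msg)
	return nil
}
```

Real validation would also check signatures and message contents; the point here is only that each queued message must be checked against the table for its own instance, not the current one.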

@Kubuxu Kubuxu added this to the F3 Alpha milestone Apr 22, 2024
@anorth anorth self-assigned this May 14, 2024
@anorth anorth changed the title Delayed power table Use delayed power table to validate or drop messages from future instances May 16, 2024
@anorth
Member Author

anorth commented May 16, 2024

I'm closing #130 and expanding the scope here a little with a task list including maintaining the delayed power tables and using them to validate messages.

@anorth
Member Author

anorth commented May 20, 2024

The host (Lotus) must end up with a store of (instance, finalised tipset) records somewhere in order to be able to bootstrap the protocol when a node starts up. F3 couldn't otherwise know what instance it's up to or what power table to use. So, the API will assume that F3 can ask the host for the power table corresponding(*) to an instance. The host can map instance -> Tipset -> Epoch and then go build the power table that F3 needs. F3 can cache the results.
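The mapping described above can be sketched as follows. All names here (`hostStore`, `cachingClient`, the map-backed stores) are hypothetical stand-ins for the real host state: the host resolves instance -> finalised tipset -> epoch -> power table, and F3 caches the answers per instance.

```go
package main

// Tipset and PowerTable are simplified illustrative types.
type Tipset struct{ Epoch int64 }
type PowerTable map[string]int64

// hostStore stands in for the host's (instance, finalised tipset) records
// plus its chain state.
type hostStore struct {
	finalised map[uint64]Tipset    // instance -> finalised tipset
	power     map[int64]PowerTable // epoch -> power table (from chain state)
}

// GetPowerTable resolves instance -> tipset -> epoch -> power table.
func (h *hostStore) GetPowerTable(instance uint64) (PowerTable, bool) {
	ts, ok := h.finalised[instance]
	if !ok {
		return nil, false
	}
	pt, ok := h.power[ts.Epoch]
	return pt, ok
}

// cachingClient shows F3 caching the host's results per instance.
type cachingClient struct {
	host  *hostStore
	cache map[uint64]PowerTable
}

func (c *cachingClient) PowerTable(instance uint64) (PowerTable, bool) {
	if pt, ok := c.cache[instance]; ok {
		return pt, true
	}
	pt, ok := c.host.GetPowerTable(instance)
	if ok {
		c.cache[instance] = pt
	}
	return pt, ok
}
```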

A significant design question is whether the lookback parameter should be internal to F3 or encapsulated by the host. That is, does (*) corresponding mean the power table finalised by an instance, or the power table to be used for an instance? It initially seems natural to make the parameter internal to F3, but a few things push back against that:

  • All the sim testing infrastructure is set up to associate each power table with the instance it is to be used for. That also makes tests much easier to write and independent of the parameter value. The sim would have to compute the reverse offset to feed F3 the power table from an instance (which would initially be a bunch of genesis tips), and the associations in the test code would differ from those in the production code, which would be confusing.
  • If the API means finalised by, then F3 also needs a way to ask for the genesis power table. With instance numbers as uint64 there's a very high risk of underflow when computing current - offset to find the right instance. We could add an explicit API for fetching genesis, but then we'd need underflow checks and branches everywhere that calls these methods. We could alternatively make instance numbers a signed int64 and infer genesis from any negative instance. (I would choose this option, using int64 throughout.)

Thus, I am first going to encapsulate the offset parameter in the host. F3 will ask for the power table it should use for an instance. This means the offset configuration lives in the host, and subtraction of the offset happens on the host side.

It's not perfect, but I think we'd introduce a bunch of unnecessary complexity to try to keep the parameter in F3: reworking the simulation testing setup to account for offsets, adjusting tests that use power table fluctuations, converting instance to int64 everywhere. We can always come back to do this later if we don't like it.

@Stebalien
Member

IMO, we should use int64 for instances regardless (I've seen too many issues with MaxUint64 overflowing int64).

That aside, I don't think having the lookback inside go-f3 will actually be all that difficult:

  1. We can still initialize instances with the desired power table (no lookback).
  2. A GetPowerTableFromInstance method can simply assert that the passed instance has the correct lookback for the instance being simulated, then return the power table.
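A sketch of that suggestion, with illustrative names rather than the real sim API: the sim keeps associating each power table with the instance it is used for (no lookback in the test setup), and the accessor asserts that the caller applied the expected lookback.

```go
package main

import "fmt"

// PowerTable is a hypothetical simplified type.
type PowerTable map[string]int64

type sim struct {
	lookback uint64
	current  uint64 // instance currently being simulated
	tables   map[uint64]PowerTable
}

// GetPowerTableFromInstance checks that the requested instance is exactly
// `lookback` behind the instance being simulated, then returns the table
// associated with the current instance.
func (s *sim) GetPowerTableFromInstance(instance uint64) (PowerTable, error) {
	if instance+s.lookback != s.current {
		return nil, fmt.Errorf("wrong lookback: got instance %d, simulating %d with lookback %d",
			instance, s.current, s.lookback)
	}
	return s.tables[s.current], nil
}
```

This keeps the test setup independent of the lookback value while still exercising the lookback arithmetic in F3 itself.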

@Kubuxu
Collaborator

Kubuxu commented May 21, 2024

Regarding Lotus having to store the power table, we are storing finality certificates, which should contain the power tables that are being finalized.

@Stebalien
Member

Well, the finality certificates only store power table diffs. But looking up the power table associated with an instance isn't difficult (instance - lookback_distance -> head ts -> power table).
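A sketch of rebuilding a table from diffs, under an assumed simplified diff format (participant -> new power, with 0 meaning removal; the real certificate diff format in go-f3 differs). It only illustrates the idea that any instance's table is recoverable by replaying diffs from a known base.

```go
package main

type PowerTable map[string]int64
type PowerDiff map[string]int64 // assumed format: participant -> new power, 0 = removed

// applyDiff returns a new table with the diff applied on top of base.
func applyDiff(base PowerTable, diff PowerDiff) PowerTable {
	out := make(PowerTable, len(base))
	for k, v := range base {
		out[k] = v
	}
	for k, v := range diff {
		if v == 0 {
			delete(out, k) // zero power: participant removed
		} else {
			out[k] = v
		}
	}
	return out
}

// tableForInstance replays diffs [0..instance] on top of the genesis table.
func tableForInstance(genesis PowerTable, diffs []PowerDiff, instance int) PowerTable {
	pt := genesis
	for i := 0; i <= instance && i < len(diffs); i++ {
		pt = applyDiff(pt, diffs[i])
	}
	return pt
}
```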

@Stebalien
Member

See https://github.com/filecoin-project/go-f3/pull/273/files#r1610685278 for an example of the power tables I'll need if we implement #257.

Projects
Status: In progress

3 participants