chain/neutrino: improve rescan locking #695

cfromknecht · 2020-04-10T19:46:24Z

Fixes 5 or so issues around locking the rescan-related member variables, see commit messages for more details. The issue fixed by the first commit was triggered by an lnd itest, the others are just observed from looking through the code.

halseth

LGTM 👍

guggero · 2020-11-07T16:57:06Z

I tried running our itests against this PR (rebased onto current btcwallet master) and it looks like something's deadlocking now. The tests that do rescan don't even start properly, not sure what's going on.

This commit fixes a potential nil pointer dereference in the neutrino chain client. Currently the Rescan method blindly overwrites the rescan member with nil _after_ reacquiring the lock. However, the field may have been populated with a new rescan while the mutex wasn't held, causing a panic later on where we assume the rescan is non-nil. To remedy, we add a simple check that asserts we are nilling the same rescan we were shutting down while the mutex was unlocked. If we find a differing value, we will continue to this process until we arrive at matching rescan pointers, at which point we know we can safely proceed in creating a new rescan.

The scanning variable is only ever set to true, and it happens in the same paths that the rescanQuit is initialized. Since rescanQuit is also only ever set to a non-nil value, we instead use this to determine whether we are "scanning".

The rescanQuit and rescan objects are set under different mutex acquisitions, hence it's possible for rescanQuit to be non-nil while rescan is nil. This commit ensures that we only try to wait for shutdown of non-nil rescans.

This also fixes an unprotected access of s.rescan when calling Update without the lock held.

Currently NotifyBlock releases the clientMtx before calling a public version of NotifyReceived that reacquires clientMtx. This can have unexpected behavior because the value of isScanning() could change between lock acquisitions. We switch to using the internal notifyReceived so that our read on isScanning() is consistent for the duration of the call.

wpaulino approved these changes Apr 14, 2020

View reviewed changes

halseth approved these changes Nov 6, 2020

View reviewed changes

guggero mentioned this pull request Nov 10, 2020

itest: clean harness state before each icase and better naming for icase logs lightningnetwork/lnd#4737

Merged

cfromknecht force-pushed the rescan-nil-ptr branch from 0c4fca6 to a2a90dd Compare November 12, 2020 17:13

cfromknecht added a commit to cfromknecht/lnd that referenced this pull request Nov 12, 2020

mod: pull in btcsuite/btcwallet#695

9ff5039

cfromknecht mentioned this pull request Nov 12, 2020

mod: pull in btcsuite/btcwallet#695 lightningnetwork/lnd#4767

Open

cfromknecht force-pushed the rescan-nil-ptr branch from a2a90dd to 1eb58d9 Compare November 18, 2020 22:48

cfromknecht added a commit to cfromknecht/lnd that referenced this pull request Nov 18, 2020

mod: pull in btcsuite/btcwallet#695

2524ac9

cfromknecht force-pushed the rescan-nil-ptr branch from 1eb58d9 to 8d41d76 Compare November 19, 2020 02:13

cfromknecht added a commit to cfromknecht/lnd that referenced this pull request Nov 19, 2020

mod: pull in btcsuite/btcwallet#695

1e9552d

cfromknecht added a commit to cfromknecht/lnd that referenced this pull request Nov 24, 2020

mod: pull in btcsuite/btcwallet#695

17ec653

cfromknecht added 6 commits December 8, 2020 13:55

chain/neutrino: fix potential panic due to nil rescan

676f942

The rescanQuit and rescan objects are set under different mutex acquisitions, hence it's possible for rescanQuit to be non-nil while rescan is nil. This commit ensures that we only try to wait for shutdown of non-nil rescans.

chain/neutrino: make private notifyReceived

5bc2264

This also fixes an unprotected access of s.rescan when calling Update without the lock held.

chain/neutrino: only acquire mutex once in Rescan happy flow

faf925f

cfromknecht force-pushed the rescan-nil-ptr branch from 8d41d76 to faf925f Compare December 8, 2020 21:55

cfromknecht added a commit to cfromknecht/lnd that referenced this pull request Dec 8, 2020

mod: pull in btcsuite/btcwallet#695

f9b577a

cfromknecht added a commit to cfromknecht/lnd that referenced this pull request Dec 27, 2020

mod: pull in btcsuite/btcwallet#695

38862ba

MStreet3 mentioned this pull request Oct 14, 2022

NeutrinoClient data race #819

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chain/neutrino: improve rescan locking #695

chain/neutrino: improve rescan locking #695

cfromknecht commented Apr 10, 2020

halseth left a comment

guggero commented Nov 7, 2020

chain/neutrino: improve rescan locking #695

Are you sure you want to change the base?

chain/neutrino: improve rescan locking #695

Conversation

cfromknecht commented Apr 10, 2020

halseth left a comment

Choose a reason for hiding this comment

guggero commented Nov 7, 2020