Extract sequence tracking from the Broadcaster #12353

amit-momin · 2024-03-08T00:01:07Z

Created the new SequenceTracker interface, which will be used to manage the sequences used in the Broadcaster
Created a new nonceTracker component in the EVM code as the EVM-specific implementation of SequenceTracker
Extracted the sequence management logic from the Broadcaster into the nonceTracker
Removed the dependency on the FindLatestSequence TxStore method for loading the sequences on startup. Switched to using the more generic GetAllTransactions method.

github-actions · 2024-03-08T00:01:27Z

I see you updated files related to core. Please run pnpm changeset to add a changeset.

github-actions · 2024-03-08T00:01:27Z

I see that you haven't updated any README files. Would it make sense to do so?

dimriou · 2024-03-12T12:55:19Z

core/chains/evm/txmgr/builder.go

Appreciation comment for cleaning this up from TXM 🙌

Actually, does it make sense to move NonceTracker inside Broadcaster completely? It seems like Broadcaster already has all the necessary parameters and no one else calls NonceTracker for now.

Good point! Moved the nonce tracker initialization into NewEvmBroadcaster in the latest commit. Had to adjust how we initialize the Broadcaster in tests though to still have an exposed nonce tracker for tests.

dimriou · 2024-03-12T13:42:31Z

core/chains/evm/txmgr/nonce_tracker.go

+	return seq, err
+}
+
+func (s *nonceTracker) getSequenceFromStore(ctx context.Context, address common.Address) (seq evmtypes.Nonce, err error) {


Why unpack FindLatestSequence's logic here and not keep it as it is? GetAllTransactions seems like a heavier query with no added benefits.

@prashantkumar1982 mentioned we're trying to move to a generic tx store so moving forward we'd only want to rely on generic methods. So this was just to break our reliance on an EVM specific method while I was already making these changes.

I don't think it's a good practice to load every tx from the DB and handle the logic in memory. This is significantly heavier than what we do now. If we decide moving to a generic tx store makes sense then we can make the change, but for now I would suggest to keep it as it is.

I agree with Dimitris here. Let's keep it the same. When we refactor the txstore, we can deal with this problem then. For now, let's just focus on the nonce and not change any behavior related to the txstore.
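
To make the cost concern in this thread concrete, here is a rough sketch of what "load every tx and handle the logic in memory" amounts to. The `Tx` type and the `latestNonceInMemory` helper are hypothetical and exist only for illustration; they are not the PR's code and do not reflect the real TxStore schema.

```go
package main

import "fmt"

// Tx is a simplified, hypothetical transaction row.
type Tx struct {
	From  string
	Nonce *uint64 // nil when no nonce has been assigned yet
}

// latestNonceInMemory mimics the generic approach: every row is loaded first
// and the highest nonce for one address is then found by scanning all of them.
// A dedicated store method can instead ask the database for just that one value.
func latestNonceInMemory(all []Tx, from string) (uint64, bool) {
	var highest uint64
	found := false
	for _, tx := range all {
		if tx.From != from || tx.Nonce == nil {
			continue
		}
		if !found || *tx.Nonce > highest {
			highest = *tx.Nonce
			found = true
		}
	}
	return highest, found
}

// u64 is a small helper for building pointer values in the example data.
func u64(v uint64) *uint64 { return &v }

func main() {
	txs := []Tx{
		{"0xabc", u64(3)},
		{"0xdef", u64(9)},
		{"0xabc", nil},
		{"0xabc", u64(5)},
	}
	n, ok := latestNonceInMemory(txs, "0xabc")
	fmt.Println(n, ok)
}
```

The scan itself is cheap; the reviewers' objection is to transferring and decoding every row when only a single value per address is needed.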

patrick-dowell

Overall looks good. I appreciate that you removed more lines than you added :)

Just a few notes left in the comments - we will need to address these before merging.

patrick-dowell · 2024-03-14T22:37:32Z

core/chains/evm/txmgr/nonce_tracker.go

+	return seq, err
+}
+
+func (s *nonceTracker) getSequenceFromStore(ctx context.Context, address common.Address) (seq evmtypes.Nonce, err error) {


I agree with Dimitris here. Let's keep it the same. When we refactor the txstore, we can deal with this problem then. For now, let's just focus on the nonce and not change any behavior related to the txstore.

patrick-dowell · 2024-03-14T22:39:16Z

core/chains/evm/txmgr/evm_tx_store.go

@@ -1013,16 +1029,6 @@ func (o *evmTxStore) UpdateTxCallbackCompleted(ctx context.Context, pipelineTask
 	return nil
 }

-func (o *evmTxStore) FindLatestSequence(ctx context.Context, fromAddress common.Address, chainId *big.Int) (nonce evmtypes.Nonce, err error) {


Let's keep using this method and UpdateKeyNextSequence until we refactor the TxStore.

Just wanted to note UpdateKeyNextSequence is unused. I removed it just as a clean up. It's actually updating a table that we don't own.

patrick-dowell · 2024-03-14T22:48:41Z

core/chains/evm/txmgr/nonce_tracker.go

+		s.lggr.Infow("Fast-forward sequence", "address", addr, "newNextSequence", nonce, "oldNextSequence", localSequence)
+	}
+
+	s.sequenceLock.Lock()


It's a bit dangerous to lock after doing a check that can affect whether we want to proceed with the write. In this case, what happens if the value of nonce here changes after you do the check on line 152 but before you lock here?

The check on line 152 is actually only used to determine whether we should log some statements; otherwise we proceed with the write regardless. Also, the nonce here is just the one we retrieved from on-chain. I guess it could change, but the sequence map lock wouldn't prevent any desync with on-chain data.

Ok cool this makes sense. So it's fine as is then!
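
The pattern agreed on in this thread can be sketched as below: the comparison only decides whether to log, and the write happens unconditionally under the lock. The `syncTracker` type and `syncOnChain` method are hypothetical names, and the comparison direction is an assumption; the real method in nonce_tracker.go may differ in both.

```go
package main

import (
	"fmt"
	"sync"
)

// syncTracker is a hypothetical, stripped-down tracker used for illustration.
type syncTracker struct {
	mu      sync.RWMutex
	nextSeq map[string]uint64
}

// syncOnChain overwrites the local next sequence with the on-chain value.
// The comparison only controls logging; the write is not conditional on it,
// so a stale read of the local value cannot cause a skipped or wrong write.
func (t *syncTracker) syncOnChain(addr string, onChain uint64) (logged bool) {
	t.mu.RLock()
	local := t.nextSeq[addr]
	t.mu.RUnlock()

	if onChain > local {
		fmt.Printf("Fast-forward sequence address=%s newNextSequence=%d oldNextSequence=%d\n",
			addr, onChain, local)
		logged = true
	}

	t.mu.Lock()
	t.nextSeq[addr] = onChain
	t.mu.Unlock()
	return logged
}

func main() {
	t := &syncTracker{nextSeq: map[string]uint64{"0xabc": 4}}
	t.syncOnChain("0xabc", 10)
	fmt.Println(t.nextSeq["0xabc"])
}
```

Because the write is unconditional, the worst outcome of a race between the check and the lock is a missing or misleading log line, not an incorrect map entry.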

patrick-dowell · 2024-03-14T22:55:53Z

core/chains/evm/txmgr/nonce_tracker.go

+	client           NonceTrackerClient
+	enabledAddresses []common.Address
+
+	sequenceLock sync.RWMutex


Out of curiosity, does the nonceTracker need a lock? If it's always called from the Broadcaster, which has its own lock, then it might be protected already and adding the extra lock just slows down the code. But I'd need to look more carefully to confirm this. It might be safer just to have it unless it's crystal clear that the calling code is protected by a Broadcaster lock.

This lock was actually moved over from the Broadcaster so we aren't double locking here.

Theoretically, the Broadcaster can process multiple addresses at the same time, each handled by a different goroutine (monitorTxs), so there is a scenario where SequenceTracker requires locking. Good observation though, I like that we're pushing to clean up unnecessary things 🙌 .
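
A small sketch of why the lock is needed even when each address has its own goroutine: all goroutines share one Go map, and unsynchronized concurrent map writes are a data race regardless of which key is touched. The `sharedTracker` type and the `run` helper are hypothetical and only illustrate the scenario described above.

```go
package main

import (
	"fmt"
	"sync"
)

// sharedTracker holds one map that every per-address goroutine writes to.
type sharedTracker struct {
	mu      sync.RWMutex
	nextSeq map[string]uint64
}

func (t *sharedTracker) increment(addr string) {
	t.mu.Lock()
	defer t.mu.Unlock()
	t.nextSeq[addr]++
}

func (t *sharedTracker) get(addr string) uint64 {
	t.mu.RLock()
	defer t.mu.RUnlock()
	return t.nextSeq[addr]
}

// run starts one goroutine per address, loosely mirroring one worker per key.
func run(addrs []string, perAddr int) *sharedTracker {
	t := &sharedTracker{nextSeq: make(map[string]uint64)}
	var wg sync.WaitGroup
	for _, a := range addrs {
		wg.Add(1)
		go func(addr string) {
			defer wg.Done()
			for i := 0; i < perAddr; i++ {
				t.increment(addr)
			}
		}(a)
	}
	wg.Wait()
	return t
}

func main() {
	t := run([]string{"0xaaa", "0xbbb", "0xccc"}, 1000)
	fmt.Println(t.get("0xaaa"), t.get("0xbbb"), t.get("0xccc"))
}
```

Removing the mutex from `increment` in this sketch would make the program racy even though no two goroutines ever touch the same address.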

dimriou · 2024-03-15T12:50:59Z

core/chains/evm/txmgr/builder.go

 	logger logger.Logger,
 	checkerFactory TransmitCheckerFactory,
 	autoSyncNonce bool,
 ) *Broadcaster {
-	return txmgr.NewBroadcaster(txStore, client, chainConfig, feeConfig, txConfig, listenerConfig, keystore, txAttemptBuilder, nonceSyncer, logger, checkerFactory, autoSyncNonce, evmtypes.GenerateNextNonce)
+	nonceTracker := NewNonceTracker(logger, txStore, client)


I was thinking of creating the SequenceTracker inside the Broadcaster altogether but I guess the plan is to use it in other components as well so this is totally fine.

patrick-dowell

LGTM!

prashantkumar1982 · 2024-03-15T20:43:00Z

core/chains/evm/txmgr/nonce_tracker.go

+	// Try to retrieve next sequence from tx table or on-chain to load the map
+	// A scenario could exist where loading the map during startup failed (e.g. All configured RPC's are unreachable at start)
+	// The expectation is that the node does not fail startup so sequences need to be loaded during runtime
+	foundSeq, err := s.getSequenceForAddr(ctx, address)


This makes an RPC call onchain, while holding the lock. So we are locking this for potentially many seconds.
Could we acquire the lock only when we are reading/writing from the nextSequenceMap?

Couldn't we run into an issue where multiple calls are made to GetNextSequence where none find the value locally so they all search on-chain? Then if we only lock the actual write to the map, the callers could get different values from on-chain depending which RPC their request was sent to. They would then write conflicting values to the map and return different nonces? I think this wouldn't be a problem in the current setup where the Broadcaster only uses the nonce tracker but if other components start using this we could maybe run into this problem. My worries could also be completely unfounded though so not against making this change if this isn't a worry.

Yes, I am assuming the Broadcaster is the only caller, and for a given FromAddress, it only calls NonceTracker from a single thread.
But let's keep the logic as it is for now.

prashantkumar1982 · 2024-03-15T23:15:50Z

.changeset/silver-months-glow.md

+"chainlink": patch
+---
+
+Fixed nonce gap bug if in-progress tx sent but TXM shutdown before marked as broadcasted


nit: Fixed a race condition bug around EVM nonce management, which could cause the Node to skip a nonce and get stuck.

cl-sonarqube-production · 2024-03-19T00:21:27Z

Quality Gate passed

Issues
0 New issues
0 Fixed issues
0 Accepted issues

Measures
0 Security Hotspots
99.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube

amit-momin marked this pull request as ready for review March 11, 2024 19:41

amit-momin requested review from a team as code owners March 11, 2024 19:41

amit-momin changed the title ~~Extract sequence tracking from the Broadcaster into a separate component~~ Extract sequence tracking from the Broadcaster Mar 11, 2024

dimriou reviewed Mar 12, 2024

patrick-dowell requested changes Mar 14, 2024

amit-momin added 5 commits March 14, 2024 20:16

Extracted sequence tracking from the Broadcaster into a separate component

f73a3ec

Fixed test and linting

3fb3173

Updated NonceTracker to use TXM client

8a68572

Fixed tests

ec5abea

Moved NonceTracker initialization into the EVM Broadcaster builder

0b22f2b

amit-momin force-pushed the txm-sequence-refactor branch from 2c68d46 to 0b22f2b Compare March 15, 2024 01:34

dimriou reviewed Mar 15, 2024

dimriou previously approved these changes Mar 15, 2024

patrick-dowell previously approved these changes Mar 15, 2024

prashantkumar1982 reviewed Mar 15, 2024

Fixed issue with in-progress tx during startup

a9d75e6

amit-momin dismissed stale reviews from patrick-dowell and dimriou via a9d75e6 March 15, 2024 21:52

Fixed linting

af8e231

prashantkumar1982 previously approved these changes Mar 15, 2024

Merge branch 'develop' into txm-sequence-refactor

de51eaa

Added changeset

fb70f96

amit-momin dismissed prashantkumar1982’s stale review via fb70f96 March 15, 2024 22:19

prashantkumar1982 reviewed Mar 15, 2024

Updated changeset message

f62b7ee

prashantkumar1982 approved these changes Mar 19, 2024

Merge branch 'develop' into txm-sequence-refactor

4beeb76

prashantkumar1982 added this pull request to the merge queue Mar 19, 2024

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 19, 2024

prashantkumar1982 added this pull request to the merge queue Mar 19, 2024

Merged via the queue into develop with commit 07c9f6c Mar 19, 2024
105 checks passed

prashantkumar1982 deleted the txm-sequence-refactor branch March 19, 2024 01:21
