-
Notifications
You must be signed in to change notification settings - Fork 56
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Dynamically adjust proposal block ranges #1139
base: master
Are you sure you want to change the base?
Changes from 2 commits
02e2363
cb9186a
b3d31e9
d029d53
ee01c7e
314073f
fb12aa0
f0cb419
ccb5df0
c716ee8
a990ee7
51df1ca
70f65e9
4697b59
093d80b
7a17005
3a5b2c1
677c8b9
c8673dc
7559948
6c19e4b
5c88581
56bef8c
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,9 +1,11 @@ | ||
import assert from "assert"; | ||
import { utils as ethersUtils } from "ethers"; | ||
import { | ||
winston, | ||
EMPTY_MERKLE_ROOT, | ||
sortEventsDescending, | ||
BigNumber, | ||
getNetworkName, | ||
getRefund, | ||
MerkleTree, | ||
sortEventsAscending, | ||
|
@@ -18,6 +20,7 @@ import { | |
DepositWithBlock, | ||
FillsToRefund, | ||
FillWithBlock, | ||
InvalidFill, | ||
isUbaOutflow, | ||
outflowIsFill, | ||
ProposedRootBundle, | ||
|
@@ -29,6 +32,7 @@ import { | |
RunningBalances, | ||
PoolRebalanceLeaf, | ||
RelayerRefundLeaf, | ||
RelayData, | ||
} from "../interfaces"; | ||
import { DataworkerClients } from "./DataworkerClientHelper"; | ||
import { SpokePoolClient, UBAClient, BalanceAllocator } from "../clients"; | ||
|
@@ -453,15 +457,167 @@ export class Dataworker { | |
} | ||
} | ||
|
||
async narrowProposalBlockRanges( | ||
blockRanges: number[][], | ||
spokePoolClients: SpokePoolClientsByChain, | ||
isUBA = false, | ||
logData = false | ||
): Promise<number[][]> { | ||
const chainIds = this.chainIdListForBundleEvaluationBlockNumbers; | ||
const updatedBlockRanges = Object.fromEntries(chainIds.map((chainId, idx) => [chainId, [...blockRanges[idx]]])); | ||
|
||
const { deposits, allValidFills, allInvalidFills } = await this.clients.bundleDataClient._loadData( | ||
blockRanges, | ||
spokePoolClients, | ||
isUBA, | ||
logData | ||
); | ||
|
||
// If invalid fills were detected and they appear to be due to gaps in FundsDeposited events: | ||
// - Narrow the origin block range to exclude the missing deposit, AND | ||
// - Narrow the destination block range to exclude the invalid fill. | ||
allInvalidFills | ||
.filter(({ code }) => code === InvalidFill.DepositIdNotFound) | ||
.forEach(({ fill: { depositId, originChainId, destinationChainId, blockNumber } }) => { | ||
const [originChain, destinationChain] = [getNetworkName(originChainId), getNetworkName(destinationChainId)]; | ||
|
||
// Exclude the missing deposit on the origin chain. | ||
const originSpokePoolClient = spokePoolClients[originChainId]; | ||
let [startBlock, endBlock] = updatedBlockRanges[originChainId]; | ||
|
||
// Find the previous known deposit. This may not be immediastely preceding depositId. | ||
pxrl marked this conversation as resolved.
Show resolved
Hide resolved
|
||
const previousDeposit = originSpokePoolClient | ||
.getDepositsForDestinationChain(destinationChainId) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I don't think these are guaranteed to be sorted, which might mean this deposit would be too early, right? |
||
.filter((deposit: DepositWithBlock) => deposit.blockNumber < blockNumber) | ||
.at(-1); | ||
Comment on lines
+520
to
+523
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. What if the deposit is from a gap between spoke pool deployments or something? So there actually is no deposit for this id? Will this always return undefined in that case, which would mean no change? If there are edge cases like that that are handled in a subtle way here, we may want to add a brief comment explaining them. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The SpokePoolClient only has the ability to handle a single SpokePool deployment, so in the case that we don't find any preceding deposit then I'll update the comment on line 509 to clarify this. There's arguably a scenario where the first deposit on a new chain goes missing. This would implicitly result in proposing over the range [0,0] for that chain because the HubPoolClient supplies previousEndBlock 0 in that scenario. I think this is an extremely remote probability, and activating a new chain requires multiple responsible adults to test the deployment and monitor the proposal, so I'm probably not inclined to handle it explicitly in the code. wdyt? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. what would a bundle block range look like for a new spoke pool address? whether it be a new chain or an updated spoke pool address for an existing chain? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The bundlEndBlock is sourced from HubPoolClient.getLatestBundleEndBlockForChain(), so we inherit its behaviour. For a new deployment on an existing chain, we're bound to continue from the previous bundleEndBlock. This should work as expected. For a new chain, the chainId index isn't found in the previous bundle, so we default to proposing from 0. This is also as was the case for the activation of zkSync and Base - so we inherit the existing behaviour. The only change that I can foresee here is that if that initial proposal contains invalid fills due to missing deposits on the new chain, or fills for missing deposits on another chain, then we'd revert to proposing over [0,0]. I'm not sure whether that's a problem in itself, but it's a pretty extreme edge case because the number of deposits and fills for the new chain in the initial proposal are likely to be very low. and we'd detect it immediately because activating a new chain implies manual review of the proposal block ranges. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think the 0,0 case is remote and doesn't cause an obvious problem, so I wouldn't worry about handling it explicitly. This all makes sense. I think a comment that says something like:
Would be really helpful to the reader. |
||
|
||
if (previousDeposit?.blockNumber ?? startBlock > startBlock) { | ||
pxrl marked this conversation as resolved.
Show resolved
Hide resolved
|
||
updatedBlockRanges[originChainId] = [startBlock, previousDeposit.blockNumber]; | ||
this.logger.debug({ | ||
at: "Dataworker::narrowBlockRanges", | ||
message: `Narrowed proposal block range on ${originChain} due to missing deposit.`, | ||
depositId, | ||
previousBlockRange: [startBlock, endBlock], | ||
newBlockRange: updatedBlockRanges[originChainId], | ||
}); | ||
} | ||
|
||
// Update the endBlock on the destination chain to exclude the invalid fill. | ||
const destSpokePoolClient = spokePoolClients[destinationChainId]; | ||
[startBlock, endBlock] = updatedBlockRanges[destinationChainId]; | ||
|
||
if (blockNumber <= endBlock) { | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. nit: wouldn't There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Not necessarily, because we source There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Clarified here: 7559948 |
||
// Find the fill immediately preceding this invalid fill. | ||
const previousFill = destSpokePoolClient | ||
.getFillsWithBlockInRange(startBlock, Math.max(blockNumber - 1, startBlock)) | ||
nicholaspai marked this conversation as resolved.
Show resolved
Hide resolved
|
||
.at(-1); | ||
|
||
// Wind back to the bundle end block number to that of the previous fill. | ||
const newEndBlock = previousFill?.blockNumber ?? startBlock; | ||
updatedBlockRanges[destinationChainId] = | ||
newEndBlock > startBlock ? [startBlock, newEndBlock] : [startBlock - 1, startBlock - 1]; // @fix: Must use previous end block! | ||
pxrl marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
this.logger.debug({ | ||
at: "Dataworker::narrowBlockRanges", | ||
message: `Narrowed proposal block range on ${destinationChain} due to missing deposit on ${originChain}.`, | ||
depositId, | ||
previousBlockRange: [startBlock, endBlock], | ||
newBlockRange: updatedBlockRanges[destinationChainId], | ||
}); | ||
} | ||
}); | ||
|
||
// For each deposit within the origin chain block range for which no or only partial fill events are found, | ||
// verify whether the fill has been completed on the destination chain. If the fill has been made to completion | ||
// then this is evidence of potential dropped/suppressed events, so narrow the block range to exclude those blocks. | ||
const newUnfilledDeposits = deposits.filter(({ originChainId, depositId, amount, blockNumber }) => { | ||
pxrl marked this conversation as resolved.
Show resolved
Hide resolved
|
||
if (blockNumber > updatedBlockRanges[originChainId][1]) { | ||
return false; // Fill event is already out of scope due to previous narrowing. | ||
} | ||
return allValidFills.find( | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Can we use a spoke pool function or map lookup to get the fill for a deposit rather than looping through all fills on each deposit? We store by a special hash in the client to avoid this n^2 loop (which has caused speed issues in the past). There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I have backed that entire section out (for now) per this thread with Nick: #1139 (comment). Here's the change: 093d80b I did consider accessing these via a map, but tentatively decided against it because I don't think this will actually use much CPU time in practice. Arriving here is conditional on at least one deposit is detected as filled, despite no known FilledRelay event. Then, we iterate over the àllValidFills` array once per instance of "filled-but-missing" event. So in the normal case it costs nothing, but it does cost CPU in the bad case where a chain/RPC is misbehaving. However, because we've already filtered on the missing deposits and have narrowed the block range accordingly, we should skip over most of those missing fill events anyway. So in practice I think the impact of this would be fairly limited, even though it is technically inefficient to search the There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. It does, it sounds like you're saying this loop will be run rarely so optimization isn't a concern. I wasn't aware of that when I initially read the code, but will take another pass. |
||
(fill) => | ||
fill.depositId === depositId && fill.originChainId === originChainId && fill.totalFilledAmount.eq(amount) | ||
) | ||
? false | ||
: true; | ||
pxrl marked this conversation as resolved.
Show resolved
Hide resolved
|
||
}); | ||
|
||
// For each unfilled or partially filled deposit, verify whether it was actually filled by the proposal end block. | ||
const fillBlocks = await sdk.utils.mapAsync(newUnfilledDeposits, async (deposit: DepositWithBlock) => { | ||
const { spokePool, deploymentBlock } = spokePoolClients[deposit.destinationChainId]; | ||
const blockRange = updatedBlockRanges[deposit.destinationChainId]; | ||
const endBlock = blockRange[1]; | ||
|
||
// @todo: Some tests rely on this; fix! | ||
const startBlock = blockRange[0] > deploymentBlock ? blockRange[0] : deploymentBlock; | ||
|
||
// @todo: Beware the scenario where the fill is completed before the deposit, yet the deposit hash matches 100%. | ||
// This corner case must be ruled out or mitigated before merge, because it will cause the proposer to throw. | ||
return await sdk.utils.findFillBlock(spokePool, deposit as RelayData, startBlock, endBlock); | ||
pxrl marked this conversation as resolved.
Show resolved
Hide resolved
|
||
}); | ||
|
||
// Exclude each un- or partially-filled deposit that resolved to a complete fill. | ||
fillBlocks.forEach((fillBlock, idx) => { | ||
if (isDefined(fillBlock)) { | ||
const deposit = newUnfilledDeposits[idx]; | ||
const { destinationChainId } = deposit; | ||
const chain = getNetworkName(deposit.destinationChainId); | ||
this.logger.warn({ | ||
at: "Dataworker::narrowBlockRange", | ||
message: `Identified probable missing filledRelay event on ${chain} at block ${fillBlock}.`, | ||
deposit, | ||
}); | ||
|
||
const [startBlock, endBlock] = updatedBlockRanges[destinationChainId]; | ||
const newEndBlock = | ||
allValidFills | ||
.filter((fill) => fill.destinationChainId === destinationChainId && fill.blockNumber < fillBlock) | ||
.at(-1)?.blockNumber ?? startBlock; | ||
|
||
if (newEndBlock < endBlock) { | ||
updatedBlockRanges[destinationChainId] = | ||
newEndBlock > startBlock ? [startBlock, newEndBlock] : [startBlock - 1, startBlock - 1]; | ||
|
||
this.logger.debug({ | ||
at: "Dataworker::narrowBlockRanges", | ||
message: `Narrowed proposal block range on ${chain} due to missing fill event at block ${fillBlock}.`, | ||
deposit, | ||
previousBlockRange: [startBlock, endBlock], | ||
newBlockRange: updatedBlockRanges[destinationChainId], | ||
}); | ||
} | ||
} | ||
}); | ||
|
||
// Quick sanity check - make sure that the block ranges are coherent. A chain may be soft-paused if it has ongoing | ||
// RPC issues (block ranges are frozen at the previous proposal endBlock), so ensure that this is also handled. | ||
const finalBlockRanges = chainIds.map((chainId) => updatedBlockRanges[chainId]); | ||
nicholaspai marked this conversation as resolved.
Show resolved
Hide resolved
|
||
const coherentBlockRanges = finalBlockRanges.every(([startBlock, endBlock], idx) => { | ||
const [originalStartBlock] = blockRanges[idx]; | ||
return ( | ||
(endBlock > startBlock && startBlock === originalStartBlock) || | ||
(startBlock === endBlock && startBlock === originalStartBlock - 1) // soft-pause | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Why would the new start block be one behind the original start block in this case? And is it okay to leave it that way? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
To be more correct, rather than assuming that There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Understood. That makes sense. |
||
); | ||
}); | ||
assert(coherentBlockRanges, "Updated proposal block ranges are incoherent"); | ||
|
||
return chainIds.map((chainId) => updatedBlockRanges[chainId]); | ||
} | ||
|
||
async Legacy_proposeRootBundle( | ||
blockRangesForProposal: number[][], | ||
spokePoolClients: SpokePoolClientsByChain, | ||
latestMainnetBundleEndBlock: number, | ||
logData = false | ||
): Promise<ProposeRootBundleReturnType> { | ||
const timerStart = Date.now(); | ||
|
||
pxrl marked this conversation as resolved.
Show resolved
Hide resolved
|
||
// Dry-run the proposal over the input block ranges to identify any data inconsistencies. | ||
const blockRanges = await this.narrowProposalBlockRanges(blockRangesForProposal, spokePoolClients); | ||
pxrl marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
const { fillsToRefund, deposits, allValidFills, unfilledDeposits, earlyDeposits } = | ||
await this.clients.bundleDataClient._loadData(blockRangesForProposal, spokePoolClients, false, logData); | ||
await this.clients.bundleDataClient._loadData(blockRanges, spokePoolClients, false, logData); | ||
|
||
const allValidFillsInRange = getFillsInRange( | ||
allValidFills, | ||
blockRangesForProposal, | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,5 +1,15 @@ | ||
import { interfaces } from "@across-protocol/sdk-v2"; | ||
import { SpokePoolClient } from "../clients"; | ||
import * as utils from "../utils/SDKUtils"; | ||
|
||
export interface SpokePoolClientsByChain { | ||
[chainId: number]: SpokePoolClient; | ||
} | ||
|
||
export const { InvalidFill } = utils; | ||
|
||
export type InvalidFill = { | ||
fill: interfaces.FillWithBlock; | ||
code: utils.InvalidFillEnum; | ||
reason: string; | ||
}; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we sort these by block number by chain to allow us to exit the following map earlier?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not totally confident on this, but I'm not sure that we can actually simply sort and exit earlier. This is because we need to avoid making assumptions about the ordering of fills vs. deposits.
For example, the first invalid fill on a destination chain might correspond to the last missing deposit on the origin chain. Then, some later invalid fill on the destination chain might actually fill an earlier missing deposit on the origin chain. This also seems more likely to occur in the case that the RPCs are serving inconsistent data.
If we group/sort by destinationChainId and destination block number, we might not narrow the originChainId block range correctly. If we group/sort originChainId and origin block number, we might not narrow the destinationChainid block range correctly.
In general, even when it's really bad, the number of missing deposits we typically see is about 20 - 30. In order to identify both the earliest block for both origin and destination chains we might end up looping multiple times over all of the invalid fills. In the case where we only have tens missing events, it seems like it might be cheaper and simpler to just process them one by one.
Full disclosure: I might have overlooked something really obvious here. wdyt?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is a good point, that the latest deposit might match with the earliest fill and vice versra