Skip to content

Commit

Permalink
This commit introduces some new functionality and some refactoring:
Browse files Browse the repository at this point in the history
- new: log trigger events and log trigger upkeeps
- new: simulation progress tracking to the console
- change: RunBook changed to SimulationPlan
- change: SimulationPlan json altered for cleaner configurations
- change: loaders moved to loader package due to import cycles
- change: refactored main process for cleaner simulation exit

To run a new simulation:

```
make simulator && ./bin/simulator --simulate -f ./tools/simulator/plans/simplan_fast_check.json
```

Verification of expected performs is still not fully tested, but the updates above create the baseline for log trigger upkeeps.
  • Loading branch information
EasterTheBunny committed Nov 7, 2023
1 parent 822db98 commit cb0d08c
Show file tree
Hide file tree
Showing 85 changed files with 1,839 additions and 1,180 deletions.
4 changes: 2 additions & 2 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -29,5 +29,5 @@ bin/

# default simulation output folders
reports/
runbook_logs/
private_runbooks/
simulation_plan_logs/
simulation_plans/
281 changes: 221 additions & 60 deletions SIMULATOR.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,11 +14,13 @@ as p2p network latency, block production, upkeep schedules, and more.

## Profiling

Start the service in one terminal window and run the pprof tool in another. For more information on pprof, view some docs [here](https://github.com/google/pprof/blob/main/doc/README.md) to get started.
Start the service in one terminal window and run the pprof tool in another. For
more information on pprof, view some docs
[here](https://github.com/google/pprof/blob/main/doc/README.md) to get started.

```
# terminal 1
$ ./bin/simulator --pprof --simulate -f ./simulation_runbooks/runbook_eth_goerli_mild.json
$ ./bin/simulator --pprof --simulate -f ./tools/simulator/plans/simplan_fast_check.json
# terminal 2
$ go tool pprof -top http://localhost:6060/debug/pprof/heap
Expand All @@ -42,126 +44,285 @@ p2p and RPC. The charts are provided by an HTTP endpoint on localhost.

*Example*
```
$ ./bin/simulator --simulate -f ./simulation_runbooks/runbook_eth_goerli_mild.json
$ ./bin/simulator --simulate -f ./tools/simulator/plans/simplan_fast_check.json
```

*Options*
- `--simulation-file | -f [string]`: default ./runbook.json, path to JSON file defining simulation parameters
- `--output-directory | -o [string]`: default ./runbook_logs, path to output directory where run logs are written
- `--simulation-file | -f [string]`: default ./simulation_plan.json, path to JSON file defining simulation parameters
- `--output-directory | -o [string]`: default ./simulation_logs, path to output directory where run logs are written
- `--simulate [bool]`: default false, run simulation and output results
- `--charts [bool]`: default false, start and run charts service to display results
- `--pprof [bool]`: default false, run pprof server on simulation startup
- `--pprof-port [int]`: default 6060, port to serve pprof profiler on

### Runbook Options
## Simulation Plan

A runbook is a set of configurations for the simulator defined in a JSON file.
Each property is described below.
A simulation plan is a set of configurations for the simulator defined in a JSON file and is designed to produce a
consistent simulation between runs. Be aware that there is some variance in simulation outcomes from the same simulation
plan due to randomness in the `rpc`, `blocks`, or `p2pNetwork` configurations. Events are precise relative to block
production.

*nodes*
### Node

The instantiation of a node involves creating simulated dependencies and providing them to the delegate constructor for
the plugin.

*node*
[object]

Configuration values that apply to setting up nodes in the network.

*node.totalNodeCount*
[int]

Total number of nodes to run in the simulation. Each node is connected via a
simulated p2p network and provided an isolated contract/RPC simulation.
Total number of nodes to run in the simulation. Each node is connected via a simulated p2p network and provided an
isolated contract/RPC simulation.

*maxNodeServiceWorkers*
*node.maxNodeServiceWorkers*
[int]

Service workers are used to parallelize RPC calls to be able to process more in
a short time. This number is to set the upper limit on the number of service
workers per simulated node.
Service workers are used to parallelize RPC calls to be able to process more in a short time. This number is to set the
upper limit on the number of service workers per simulated node.

*maxNodeServiceQueueSize*
*node.maxNodeServiceQueueSize*
[int]

Max queue size for sending work to service workers. This should be deprecated
soon.
Max queue size for sending work to service workers. This should be deprecated soon.

### P2PNetwork

The p2p network simulation does not include any tcp/udp networking layers and only serves the perpose of inter-node
communication. The simulated network can be configured to simulate nodes operating on the same hardware or in close
proximity or configured to simulate nodes spread across a large physical distance. A `maxLatency` of `300ms` might
simulate the physical distance of nodes operating in Paris and Singapore, for example.

*p2pNetwork*
[object]

Configure the simulated p2p network.

*avgNetworkLatency*
*p2pNetwork.maxLatency*
[int]

The total amount of time a message should take to be sent in the simulated p2p
network. This is an average and is calculated by taking a random number between
0 and the defined latency.
The maximum amount of time a message should take to be sent in the simulated p2p network. This is calculated by taking
a random number between 0 and the provided latency.

### RPC

A simulated RPC is the connection layer between the block producer and the node. In a real-world environment, the block
production can be imagined as a singular source and RPCs independently read the state of block production. In this way,
each RPC can have a different 'view' of the singular block source.

*rpcDetail*
Each node gets an isolated instance of a simulated RPC. The role the RPC plays is to surface changes made to the
singular block source such as new upkeeps being created, ocr configs being committed, or logs being emitted.

*rpc*
[object]

This object is a container for RPC related configurations. There is currently a
limit of a single RPC simulation configuration and applies to all instances.
Configure the behavior of the simulated RPC. There is currently a limit of a single RPC simulation configuration and
applies to all instances.

*rpcDetail.maxBlockDelay*
*rpc.maxBlockDelay*
[int]

The maximum delay in in milliseconds that an RPC would deliver a new block.

*rpcDetail.averageLatency*
*rpc.averageLatency*
[int]

The average response latency of a simulated RPC call. All latency calculations
have a baseline of 50 milliseconds with an added latency calculated as a
binomial distribution of the configuration where `N = conf * 2` and `P = 0.4`.
The average response latency of a simulated RPC call. All latency calculations have a baseline of 50 milliseconds with
an added latency calculated as a binomial distribution of the configuration where `N = conf * 2` and `P = 0.4`.

*rpcDetail.errorRate*
*rpc.errorRate*
[float]

The probability that an RPC call will return an error. `0.02` is `2%`
The probability that an RPC call will return an error. `0.02` is `2%`. RPC providers are essentially cloud services that
have potential failures. Use this configuration to simulate a flaky RPC provider.

*rpcDetail.rateLimitThreshold*
*rpc.rateLimitThreshold*
[int]

Total number of calls per second before returning a rate limit response from the
simulated RPC provider.
Total number of calls per second before returning a rate limit response from the simulated RPC provider.

### Blocks

*blockDetail*
Simulated blocks in the context of the simulator are only containers of events that apply to the network of nodes and
that are provided to the network on a defined cadence. The concept of signatures, and hashes doesn't apply. Where the
term `hash` is used, the value is likely either a randomly generated value or a value derived from some block data.

*blocks*
[object]

Configuration object for simulated chain. The chain is a coordinated block
producer that feeds each simulated RPC by a dedicated channel.
Configuration object for simulated chain. The chain is a coordinated block producer that feeds each simulated RPC by a
dedicated channel. Each simulated RPC can receive blocks at different times.

*blockDetail.genesisBlock*
*blocks.genesisBlock*
[int]

The block number for the first simulated block. Formatted as time.
The block number for the first simulated block.

*blockDetail.blockCadence*
*blocks.blockCadence*
[string]

The rate at which new blocks are created. Formatted as time.

*blockDetail.blockCadenceJitter*
*blocks.blockCadenceJitter*
[string]

Some chains produce blocks on a well defined cadence. Most do not. This
parameter allows some jitter to be applied to the block cadence.
Block cadenece jitter is applied to block production such that each block is not produced exactly on the cadence.

*blockDetail.durationInBlocks*
*blocks.durationInBlocks*
[int]

A simulation only runs for this defined number of blocks. The configured upkeeps
are applied within this range.
A simulation only runs for this defined number of blocks. The configured upkeeps are applied within this range.

*blockDetail.endPadding*
*blocks.endPadding*
[int]

The simulated chain continues to broadcast blocks for the end padding duration
to allow all performs to have time to be completed. The configured upkeeps do
not apply to this block set.
The simulated chain continues to broadcast blocks for the end padding duration to allow all performs to have time to be
completed. The configured upkeeps do not apply to this block set.

### Events

All events are applied to blocks as they are produced. Each event contains at least a `type` that describes how to
process the event and a `eventBlockNumber` which defines the block in which to apply the event. Many events are
singular. Some events are generative in that multiple events are generated from a single configuration.

Generative events, such as `generateUpkeeps`, allows a more collapsed JSON config. The specific case of generating
upkeeps will create `count` upkeep create events for the same `eventBlockNumber`.

*configEvents*
*events*
[array[object]]

Config events change the state of the network and at least 1 is required to
start the network configuration. Each event is broadcast by the simulated chain
at the block defined.
Config events change the state of the network and at least 1 is required to start the network configuration. Each event
is broadcast by the simulated chain at the block defined.

Every event has some common properties:

*type*
[string:required]

Determines the event type. Options include `ocr3config`, `generateUpkeeps`, `logTrigger`

*eventBlockNumber*
[int:required]

Block number to commit this event to block history.

*comment*
[string:optional]

Optional reference value. Not output on logs.


#### OCR 3 Config

- type: `ocr3config`

An event with the type `ocr3config` indicates that a new network configuration was committed to the block history. In a
real-world scenario, this would be an on-chain transaction that emits a log. In the simulation, the event is a specific
type and is recognized by the simulated RPC.

TODO: describe OCR config values and how they are provided

*encodedOffchainConfig*
[string]

The encoded config does not currently enforce an encoding type. An OCR off-chain config is an array of bytes allowing
the encoding to be anything. This configuration property should be the value already encoded as the plugin expects. In
the case of JSON, include character escaping such as `{\"version\":\"v3\"}`.

#### Generate Upkeeps

- type: `generateUpkeeps`

*configEvents.triggerBlockNumber*
Multiple upkeep events can be generated by using this event type. An upkeep event simulates an upkeep being added to a
registry and made active. The only states relevant to the simulator regarding registered upkeeps are: is it active, and
is it eligible?

An upkeep becomes active on the `eventBlockNumber` where it is committed to the block history. From that point,
eligibility begins to apply, which is defined by the `eligibilityFunc`. No other events will apply to an upkeep until
after the upkeep becomes active in the block history, which includes `logTrigger` type events.

The `eligibilityFunc` allows a basic linear function to be supplied indicating when, relative to the trigger block, an
upkeep should be eligible.

Example:

func: `2x + 1`
active at block: `100`
final block: `500`

```
let start = 100;
let end = 500;
let i = 0;
let next = 0;
let eligible = [];
while next < end {
if next > start {
eligible.push(next);
}
let y = (2 * i) + 1;
next = start + Math.round(y);
i++;
}
// the eligibility function makes the upkeep eligible every 2 blocks with a relative
// offset of 1 to the start block
// eligible: [101, 103, 105, 107, ...]
```

The `offsetFunc` advances the `start` point for each generated upkeep to ensure eligibility doesn't overlap for each
generated upkeep.

Special options such as `always` and `never` are also available.

*count*
[int]

Total number of upkeeps to generate for the event configuration.

*startID*
[int]

The block to broadcast the event on. This block should be after the genesis
block and before the final simulation block.
ID to reference the upkeep in the config. The UpkeepID output will be different.

*eligibilityFunc*
[string]

Simple linear equation for generating eligibility. Also allowed are `always` and `never`.

*offsetFunc*
[string]

Simple linear equation for eligibility start offset. Allowed to be empty when `eligibilityFunc` is `always` or `never`.

*upkeepType*
[string]

Options are `conditional` and `logTrigger`.

*logTriggeredBy*
[string]

This value applies to a log trigger type upkeep and is the reference point for a `logTrigger` event. If this value
matches the `triggerValue` of a `logTrigger` event, this upkeep is 'triggered' by the log trigger event.

#### Log Events

- type: `logTrigger`

A simulated log event does only simulates the existence of a real-world log and the value for matching to an upkeep
trigger. Once a log event is active in the block history, it can 'trigger' any active and eligible `logTrigger` type
upkeeps with a matching `triggerValue`.

*configEvents.offchainConfigJSON*
*triggerValue*
[string]

Stringified JSON for off-chain configuration.
Value used to 'trigger' upkeeps.
Loading

0 comments on commit cb0d08c

Please sign in to comment.