More metrics for RF, RS and ROB #632

tilk · 2024-03-27T15:52:56Z

This PR adds the following metrics:

Histogram of the number of used rows in RF, RS and ROB in each cycle.
Histogram of latency for instructions in RS.
Histogram of times registers are valid in RF before being freed.

Conclusions from reading the metrics:

ROB is currently very underused. Average ROB slots used is around 3-4, maximum is around 11-16. This will probably change with the introduction of jump prediction.
RS latency is low, typically 1-2 cycles.

The PR also adds two modules:

AsyncMemoryBank which basically wraps an Amaranth memory with asynchronous reads,
IndexedLatencyMeasurer which allows to measure latency for things that are not processed in FIFO fashion, but have unique indexes (like RF and RS entries).

TODO:

Documentation.
Tests.

coreblocks/core_structs/rf.py

coreblocks/func_blocks/fu/common/rs.py

lekcyjna123 · 2024-03-31T12:15:19Z

test/transactron/test_metrics.py

+
+        time = 0
+
+        def ticker():


You can use yield Now() instead of creating separate process.

The test is written similar to to other metrics tests. I'm not going to do the change here without changing all the others. This is best left for a refactoring PR.

lekcyjna123 · 2024-03-31T12:18:06Z

test/transactron/test_metrics.py

+            for _ in range(200):
+                if not free_slots:
+                    yield
+                    continue


This cause that we can have less than 200 iterations. Was this intentional?

Not really. I guess changing that if to while should be enough.

lekcyjna123 · 2024-03-31T12:20:03Z

test/transactron/test_metrics.py

+            self.assertEqual(min(latencies), (yield m._dut.histogram.min.value))
+            self.assertEqual(max(latencies), (yield m._dut.histogram.max.value))
+            self.assertEqual(sum(latencies), (yield m._dut.histogram.sum.value))
+            self.assertEqual(len(latencies), (yield m._dut.histogram.count.value))
+
+            for i in range(m._dut.histogram.bucket_count):
+                bucket_start = 0 if i == 0 else 2 ** (i - 1)
+                bucket_end = 1e10 if i == m._dut.histogram.bucket_count - 1 else 2**i
+
+                count = sum(1 for x in latencies if bucket_start <= x < bucket_end)
+                self.assertEqual(count, (yield m._dut.histogram.buckets[i].value))


Could we do that common with FIFOLatencyMeasurer test?

Possibly, maybe that's a refactor worth doing.

lekcyjna123 · 2024-03-31T12:27:12Z

coreblocks/func_blocks/fu/common/rs_func_block.py

@@ -41,10 +43,13 @@ def __init__(self, gen_params: GenParams, func_units: Iterable[tuple[FuncUnit, s
            Functional units to be used by this module.
        rs_entries: int
            Number of entries in RS.
+        rs_number: int
+            The number of this RS block. Used for debugging.


If used for debugging, maybe there should be a default value? So that if anyone doesn't care about debug feature, then it doesn't have to pass that value?

I thought about it, and I don't think this is a good thing. If rs_number is not set in a given CoreConfiguration then the metric will be hard to read, so the person doing the metrics will need to change the configuration.

Probably it would be better to auto-generate these numbers.

tilk · 2024-03-31T15:47:04Z

It seems like the test might still have a race. Those Settles are really a nightmare.

…ruct-usage

tilk added 4 commits March 27, 2024 12:05

Add metrics for structs

2084d88

Various fixes

0afb16d

Towards indexed latency measurer

9f5ecf8

Fix errors

bbfa39c

tilk added the enhancement New feature or request label Mar 27, 2024

tilk added 3 commits March 28, 2024 11:11

Documentation

7408471

Test for IndexedLatencyMeasurer

8363f21

Test for AsyncMemoryBank

18119ab

tilk marked this pull request as ready for review March 28, 2024 11:02

xThaid approved these changes Mar 28, 2024

View reviewed changes

coreblocks/core_structs/rf.py Outdated Show resolved Hide resolved

coreblocks/func_blocks/fu/common/rs.py Outdated Show resolved Hide resolved

Address review comments

472adfb

lekcyjna123 reviewed Mar 31, 2024

View reviewed changes

tilk added 4 commits March 31, 2024 16:00

Automatic generation of RS numbers

15cf0fe

Use Now(), increase number of tests, fixes

f931351

Use Now() in another test

2971e30

LatencyMeasurer test refactor

463f29f

lekcyjna123 approved these changes Apr 1, 2024

View reviewed changes

tilk added 3 commits April 1, 2024 14:30

Add assertions to IndexedLatencyMeasurer

110d596

Change Indexed to Tagged

a318903

Merge remote-tracking branch 'origin/master' into tilk/metrics-for-st…

5cfcdcc

…ruct-usage

tilk merged commit f8add3c into master Apr 1, 2024
8 checks passed

tilk deleted the tilk/metrics-for-struct-usage branch April 1, 2024 14:49

github-actions bot pushed a commit that referenced this pull request Apr 1, 2024

More metrics for RF, RS and ROB (#632)

dcc9692

tilk added a commit to kuznia-rdzeni/transactron that referenced this pull request Nov 25, 2024

More metrics for RF, RS and ROB (kuznia-rdzeni/coreblocks#632)

dbcd9e2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More metrics for RF, RS and ROB #632

More metrics for RF, RS and ROB #632

tilk commented Mar 27, 2024 •

edited

Loading

lekcyjna123 Mar 31, 2024

tilk Mar 31, 2024

lekcyjna123 Mar 31, 2024

tilk Mar 31, 2024

lekcyjna123 Mar 31, 2024

tilk Mar 31, 2024

lekcyjna123 Mar 31, 2024

tilk Mar 31, 2024

tilk commented Mar 31, 2024

More metrics for RF, RS and ROB #632

More metrics for RF, RS and ROB #632

Conversation

tilk commented Mar 27, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tilk commented Mar 31, 2024

tilk commented Mar 27, 2024 •

edited

Loading