
Avoid overhead for synthesized nodes lookup #13424

Merged: 1 commit into Qiskit:main on Nov 12, 2024

Conversation

mtreinish (Member)

Summary

After #12550, a hash implementation was added to DAGOpNode so that identical instances of DAG nodes can be used in a set or as dict keys. This was needed because #12550 changed DAGCircuit so that DAGOpNode instances are just a Python view of the data contained in the nodes of a DAG, whereas prior to #12550 the actual DAGOpNode objects were returned by reference from DAG methods. However, this hash implementation has additional overhead compared to the object-identity-based hashing used before, which caused a regression in some cases for high-level synthesis when it checks for nodes it has already synthesized. This commit addresses that by changing the dict key to be the node id instead of the node object; hashing an integer is significantly faster than hashing the node object.
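For illustration only, here is a minimal sketch of the keying change under stated assumptions: synthesize(node) is a hypothetical stand-in for the pass's per-node synthesis routine, while synthesized_nodes and the private node._node_id attribute come from the diff shown further down.

# Hedged sketch, not the actual HighLevelSynthesis pass code.
# synthesize(node) is hypothetical; it returns (new_operation, context) or None.
synthesized_nodes = {}

for node in dag.topological_op_nodes():  # dag is assumed to be a DAGCircuit
    result = synthesize(node)
    if result is not None:
        synthesized, synthesized_context = result
        # Old key: the DAGOpNode view itself, whose content-based __hash__
        # runs on every insertion and lookup:
        #   synthesized_nodes[node] = (synthesized, synthesized_context)
        # New key: the node's integer index within this DAG; hashing a small
        # int is significantly cheaper.
        synthesized_nodes[node._node_id] = (synthesized, synthesized_context)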

@mtreinish mtreinish added the performance and Changelog: None (do not include in changelog) labels Nov 12, 2024
@mtreinish mtreinish added this to the 1.3.0 milestone Nov 12, 2024
@qiskit-bot (Collaborator)

One or more of the following people are relevant to this code:

  • @Qiskit/terra-core

@raynelfss (Contributor)

Thank you for this addition! This makes more sense. Is this improvement noticeable when benchmarking runtime?

@mtreinish (Member, Author)

I did a quick asv run and it didn't flag anything as an improvement or a regression. But there is a noticeable improvement in some Benchpress benchmarks: for example, qiskit_gym/abstract_transpile/test_hamiltonians.py::TestWorkoutAbstractHamiltonians::test_hamiltonians[ham_ham_JW-18-all-to-all] goes from 5.5106774509768 sec with 1.3.0rc1 to 3.2585 sec with this PR.
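As a rough standalone illustration of why integer keys are cheaper (this is not the asv or Benchpress setup mentioned above; FakeOpNode is a made-up stand-in for a node view with a content-based hash):

import timeit

class FakeOpNode:
    """Made-up stand-in for a DAG node view whose hash derives from its contents."""

    def __init__(self, node_id, name, qubits):
        self._node_id = node_id
        self.name = name
        self.qubits = qubits

    def __hash__(self):
        # Content-based hash, analogous to the overhead described in the summary.
        return hash((self.name, self.qubits))

    def __eq__(self, other):
        return (self.name, self.qubits) == (other.name, other.qubits)

nodes = [FakeOpNode(i, "cx", (i, i + 1)) for i in range(1000)]
by_node = {n: i for i, n in enumerate(nodes)}         # keyed by node objects
by_id = {n._node_id: i for i, n in enumerate(nodes)}  # keyed by integer node ids

print("object keys:", timeit.timeit(lambda: [by_node[n] for n in nodes], number=1000))
print("int keys:   ", timeit.timeit(lambda: [by_id[n._node_id] for n in nodes], number=1000))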

@mtreinish mtreinish added the stable backport potential label (the bug might be minimal and/or important enough to be ported to stable) Nov 12, 2024
@coveralls

Pull Request Test Coverage Report for Build 11800658258

Details

  • 3 of 3 (100.0%) changed or added relevant lines in 1 file are covered.
  • 15 unchanged lines in 3 files lost coverage.
  • Overall coverage decreased (-0.01%) to 88.922%

Files with Coverage Reduction | New Missed Lines | %
crates/qasm2/src/expr.rs      | 1                | 94.02%
crates/qasm2/src/parse.rs     | 6                | 97.62%
crates/qasm2/src/lex.rs       | 8                | 91.48%
Totals Coverage Status
Change from base Build 11784569909: -0.01%
Covered Lines: 79053
Relevant Lines: 88902

💛 - Coveralls

@kevinhartman kevinhartman (Contributor) left a comment


LGTM, thanks!

@@ -382,7 +382,7 @@ def _run(

             # If the synthesis changed the operation (i.e. it is not None), store the result.
             if synthesized is not None:
-                synthesized_nodes[node] = (synthesized, synthesized_context)
+                synthesized_nodes[node._node_id] = (synthesized, synthesized_context)
Contributor


This looks fine in this case since synthesized_nodes is only used with one DAG (and not e.g. shared by the recursive control flow block handling).

But this optimization is something we should be careful about when applying in other places, since the semantics of key uniqueness shift from global to local within a specific DAG (which means that keys will clobber each other if node indices from more than one DAG are used in the same map). This was responsible for a bug we had in the visualization code after the initial DAG port to Rust.
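A hedged toy example of the clobbering risk described above; it uses only public QuantumCircuit/converter APIs plus the private _node_id attribute that appears in the diff, and the (id(dag), node._node_id) composite key is just one possible way to disambiguate, not something this PR does:

from qiskit.circuit import QuantumCircuit
from qiskit.converters import circuit_to_dag

qc_a = QuantumCircuit(1)
qc_a.h(0)
qc_b = QuantumCircuit(1)
qc_b.x(0)

dag_a = circuit_to_dag(qc_a)
dag_b = circuit_to_dag(qc_b)

# Node indices are only unique within a single DAG, so the first op node of
# each DAG likely gets the same index and the entries clobber each other.
shared = {}
for dag in (dag_a, dag_b):
    for node in dag.op_nodes():
        shared[node._node_id] = node.op.name
print(shared)  # likely a single entry, the second DAG having overwritten the first

# One possible way to keep a map that spans DAGs unambiguous: include the DAG.
safe = {}
for dag in (dag_a, dag_b):
    for node in dag.op_nodes():
        safe[(id(dag), node._node_id)] = node.op.name
print(safe)  # two entries, one per (dag, node) pair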

@Cryoris Cryoris added this pull request to the merge queue Nov 12, 2024
Merged via the queue into Qiskit:main with commit 8c6ad02 Nov 12, 2024
19 checks passed
mergify bot pushed a commit that referenced this pull request Nov 12, 2024
(cherry picked from commit 8c6ad02)
github-merge-queue bot pushed a commit that referenced this pull request Nov 12, 2024
(cherry picked from commit 8c6ad02)

Co-authored-by: Matthew Treinish <[email protected]>