
Remove The Hashmap from Shortest Path for Centrality Computation #1307

Open
wants to merge 13 commits into main

Conversation

@Paulo-21 (Contributor) commented Nov 3, 2024

Hello,
I removed the hashmap from the shortest path computation used for centrality.

This may improve performance.

Tell me what you think about it :)

@coveralls commented Nov 3, 2024

Pull Request Test Coverage Report for Build 11846697683

Details

  • 20 of 20 (100.0%) changed or added relevant lines in 1 file are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage remained the same at 95.809%

Totals
Change from base Build 11840985523: 0.0%
Covered Lines: 18013
Relevant Lines: 18801

💛 - Coveralls

@IvanIsCoding (Collaborator) left a comment


I think minimizing the number of hashing operations that happen during the shortest path is a good idea in general. But I am not convinced this works; we need to benchmark this more carefully.

I have a feeling this method will be slower for directed graphs, where some nodes are not able to reach all other nodes in the graph. In those cases, having a small hashmap with the nodes that can be reached is much faster than having a large vector with mostly non-visited entries.
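To make that trade-off concrete, here is a minimal sketch (not part of the PR; the graph shape and sizes are made up) comparing the two bookkeeping strategies on a directed graph where the source reaches almost nothing:

```rust
use std::collections::HashMap;

use petgraph::graph::DiGraph;
use petgraph::visit::{Bfs, NodeIndexable};

fn main() {
    // 1_000 nodes, but node 0 only reaches node 1: every other edge points *into* node 0.
    let mut graph = DiGraph::<(), ()>::new();
    let nodes: Vec<_> = (0..1_000).map(|_| graph.add_node(())).collect();
    graph.add_edge(nodes[0], nodes[1], ());
    for w in 2..1_000 {
        graph.add_edge(nodes[w], nodes[0], ());
    }

    // Sparse bookkeeping: only the nodes actually reached get an entry.
    let mut visited_map: HashMap<_, usize> = HashMap::new();
    // Dense bookkeeping: one slot per possible node index, reached or not.
    let mut visited_vec: Vec<i64> = vec![-1; graph.node_bound()];

    let mut bfs = Bfs::new(&graph, nodes[0]);
    while let Some(n) = bfs.next(&graph) {
        visited_map.insert(n, 0);
        visited_vec[graph.to_index(n)] = 0;
    }
    // Prints "map entries: 2, vec slots: 1000".
    println!("map entries: {}, vec slots: {}", visited_map.len(), visited_vec.len());
}
```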

let mut verts_sorted_by_distance: Vec<G::NodeId> = Vec::with_capacity(c); // a stack
let mut predecessors: Vec<Vec<usize>> = vec![Vec::new(); max_index];
let mut sigma: Vec<f64> = vec![0.; max_index];
let mut distance: Vec<i64> = vec![-1; max_index];
Collaborator

A more appropriate type here is Option<i64> if you want to represent missing paths. I'd even say Option<usize>
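For illustration only, a minimal sketch (hypothetical adjacency-list input, not the PR's code) of what the distance bookkeeping looks like with Option<usize>, where None plays the role of the -1 sentinel:

```rust
use std::collections::VecDeque;

/// BFS distances with `None` marking nodes that were never reached.
fn bfs_distances(adj: &[Vec<usize>], source: usize) -> Vec<Option<usize>> {
    let mut distance: Vec<Option<usize>> = vec![None; adj.len()];
    distance[source] = Some(0);
    let mut queue = VecDeque::new();
    queue.push_back(source);
    while let Some(v) = queue.pop_front() {
        let d = distance[v].expect("queued nodes always have a distance");
        for &w in &adj[v] {
            if distance[w].is_none() {
                distance[w] = Some(d + 1);
                queue.push_back(w);
            }
        }
    }
    distance // distance[w] == None  <=>  w is unreachable from `source`
}
```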

let coeff = (1.0 + delta[iw]) / path_calc.sigma[iw];
let p_w = path_calc.predecessors.get(iw).unwrap();
for iv in p_w {
//let iv = graph.to_index(*v);
Collaborator

Remove this comment

Comment on lines -361 to -366

for node in graph.node_identifiers() {
predecessors.insert(node, Vec::new());
sigma.insert(node, 0.0);
distance.insert(node, -1);
}
@Paulo-21 (Contributor, Author)

See how the hashmap was filled with a value for every node of the graph, so replacing it with a vec sized to the node bound will not be a problem for cache efficiency, because the hashmap was already full when the algorithm started.
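A minimal sketch of that point (assuming the petgraph visit traits that rustworkx-core builds on; the function names are hypothetical, not from the PR): both representations end up holding one entry per node before the traversal begins.

```rust
use std::collections::HashMap;
use std::hash::Hash;

use petgraph::visit::{IntoNodeIdentifiers, NodeIndexable};

// Before: a hashmap keyed by NodeId, populated for every node up front
// (this mirrors the removed loop shown in the hunk above).
fn init_distance_map<G>(graph: G) -> HashMap<G::NodeId, i64>
where
    G: IntoNodeIdentifiers,
    G::NodeId: Hash + Eq,
{
    let mut distance = HashMap::new();
    for node in graph.node_identifiers() {
        distance.insert(node, -1);
    }
    distance
}

// After: one slot per possible node index; still an "entry for every node",
// just laid out contiguously and addressed via `graph.to_index(node)`.
fn init_distance_vec<G: NodeIndexable>(graph: G) -> Vec<i64> {
    vec![-1; graph.node_bound()]
}
```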

@Paulo-21 (Contributor, Author) commented Nov 8, 2024

> I think minimizing the number of hashing operations that happen during the shortest path is a good idea in general. But I am not convinced this works; we need to benchmark this more carefully.
>
> I have a feeling this method will be slower for directed graphs, where some nodes are not able to reach all other nodes in the graph. In those cases, having a small hashmap with the nodes that can be reached is much faster than having a large vector with mostly non-visited entries.

I hear your argument, and I agree that we should benchmark to make sure it is still faster when the hashmap is not fully filled.
But in this particular case the hashmap was fully filled before the algorithm started, so I don't think it will make any difference.
I think we can replace every hashmap that is indexed by the NodeId type and fully filled before the algorithm starts.
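As a hypothetical illustration of that general pattern (the names are made up; this is not code from the PR), a lookup against such a pre-filled map becomes a positional index through NodeIndexable:

```rust
use petgraph::visit::NodeIndexable;

// Hypothetical helper: where the old code did `sigma[&node]` on a pre-filled
// HashMap<G::NodeId, f64>, the vector version indexes by node position instead.
fn sigma_of<G: NodeIndexable>(graph: G, sigma: &[f64], node: G::NodeId) -> f64 {
    sigma[graph.to_index(node)]
}
```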


3 participants