tsort refactoring proposal #5968

anastygnome · 2024-02-11T07:59:43Z

This pull request is an attempt to rework the current tsort implementation in order to allow for better performance.
I finally had some time to give it a shot after I mentionned it in #4931 with @tertsdiepraam

It also tries to mimic gnu tsort output.

Please share any suggestions.

sylvestre · 2024-02-11T08:57:22Z

I think we should keep the gnu ordering. I value compatibility over speed

anastygnome · 2024-02-11T09:00:47Z

I think we should keep the gnu ordering. I value compatibility over speed

The thing is topological ordering isn't unique when it exists in most cases. If you rely on the order of a specific implementation, even an update to gnu can break your workflow.

Keeping the same order forces us to implement a solution that's 4x slower.
The order in the provided solution is still mathematically correct.

If you switch to rust coreutils and you rely on implementation related details, you're in trouble anyway
So I don't believe it's a compatibility issue in and of itself

tertsdiepraam · 2024-02-11T09:09:47Z

The thing is topological ordering isn't unique when it exists in most cases. If you rely on the order of a specific implementation, even an update to gnu can break your workflow.

This depends on what that order is. For example, if it attempts to retain the order in which the nodes appear in the source, that would make sense. Could you give some examples of which cases differ from GNU?

Also, I'd love to see some benchmarks on this using hyperfine. Counting memory copies is a nice approximation, but actual measurements are even more helpful :)

Edit: I've done a measurement:

Benchmark 1: ./coreutils-main tsort in.txt
  Time (mean ± σ):      2.428 s ±  0.100 s    [User: 2.355 s, System: 0.066 s]
  Range (min … max):    2.334 s …  2.684 s    10 runs
 
Benchmark 2: ./coreutils-new tsort in.txt
  Time (mean ± σ):      2.658 s ±  0.092 s    [User: 2.589 s, System: 0.063 s]
  Range (min … max):    2.562 s …  2.824 s    10 runs
 
Benchmark 3: tsort in.txt
  Time (mean ± σ):     517.4 ms ±   5.6 ms    [User: 501.2 ms, System: 12.6 ms]
  Range (min … max):   511.5 ms … 528.9 ms    10 runs
 
Summary
  tsort in.txt ran
    4.69 ± 0.20 times faster than ./coreutils-main tsort in.txt
    5.14 ± 0.19 times faster than ./coreutils-new tsort in.txt

Looks like this is not yet yielding the results you were hoping for and you actually made it a bit slower.

anastygnome · 2024-02-11T09:27:28Z

The thing is topological ordering isn't unique when it exists in most cases. If you rely on the order of a specific implementation, even an update to gnu can break your workflow.

This depends on what that order is. For example, if it attempts to retain the order in which the nodes appear in the source, that would make sense. Could you give some examples of which cases differ from GNU?

well, the test case with the dependencies is such an example. My implementation sorts the nodes alphabetically before running tsort.

 Benchmark 1: ./coreutils-main tsort in.txt
   Time (mean ± σ):      2.428 s ±  0.100 s    [User: 2.355 s, System: 0.066 s]
 Range (min … max):    2.334 s …  2.684 s    10 runs 
 Benchmark 2: ./coreutils-new tsort in.txt
  Time (mean ± σ):      2.658 s ±  0.092 s    [User: 2.589 s, System: 0.063 s]
   Range (min … max):    2.562 s …  2.824 s    10 runs

 Benchmark 3: tsort in.txt
   Time (mean ± σ):     517.4 ms ±   5.6 ms    [User: 501.2 ms, System: 12.6 ms]
   Range (min … max):   511.5 ms … 528.9 ms    10 runs
  
 Summary
   tsort in.txt ran
     4.69 ± 0.20 times faster than ./coreutils-main tsort in.txt
     5.14 ± 0.19 times faster than ./coreutils-new tsort in.txt

Looks like this is not yet yielding the results you were hoping for and you actually made it a bit slower.

this is still a WIP. I'm trying to get better performance indeed.
Can you share the in.txt file so that we have a common bench?

tertsdiepraam · 2024-02-11T09:30:12Z

Sure, I've generated the input with this:

import random

N = 10000
for i in range(100*N):
    a = random.randint(0,N)
    b = random.randint(0,N)
    if a != b:
        print(f"{min(a, b)} {max(a, b)}")

anastygnome · 2024-06-23T09:35:24Z

@tertsdiepraam
I hope you're both doing well. I've been quite busy lately but have managed to make significant progress on the refactor I've been working on.

The code still passes all tests and shows a substantial performance improvement, being up to 3 times faster than the current implementation in almost all cases I have tested. I didn't have the time to document it sadly, put simply, I've used another algorithm that leverages the tree nature of the graph in a better way, ensuring less copies and sorting.

I believe we can potentially achieve even greater speed improvements by switching from a BTree to a Red-Black tree.

I am currently unable to continue this PR, so I hope someone can reuse my work :)

tertsdiepraam · 2024-06-23T09:54:10Z

Remind me on Tuesday! I'd love to take a look then

github-actions · 2024-06-24T21:06:41Z

GNU testsuite comparison:

Skip an intermittent issue tests/tail/inotify-dir-recreate (fails in this run but passes in the 'main' branch)

anastygnome · 2024-06-25T03:47:06Z

Remind me on Tuesday! I'd love to take a look then
@tertsdiepraam
There you go.

For reference, the proposed code in the first commit is approximately 1.5 times slower than GNU tsort.
(still a 2x improvement)

Benchmark 1: target/release/coreutils tsort terts_10000_seed0
  Time (mean ± σ):     763.7 ms ±  12.9 ms    [User: 736.1 ms, System: 22.8 ms]
  Range (min … max):   742.4 ms … 781.6 ms    10 runs
 
Benchmark 2: tsort terts_10000_seed0
  Time (mean ± σ):     509.4 ms ±   4.2 ms    [User: 485.3 ms, System: 21.1 ms]
  Range (min … max):   503.0 ms … 514.9 ms    10 runs
 
Summary
  tsort terts_10000_seed0 ran
    1.50 ± 0.03 times faster than target/release/coreutils tsort terts_10000_seed0

The last commit is a work in progress and serves as a reference to illustrate that the performance cost is mainly driven by the data structure used for nodes. Switching to a hashmap and some manual sort results in a topological order very close to GNU tsort (though it still fails in some cases).

Benchmark 1: target/release/coreutils tsort terts_10000_seed0
  Time (mean ± σ):     222.5 ms ±   4.5 ms    [User: 202.3 ms, System: 19.0 ms]
  Range (min … max):   215.8 ms … 229.7 ms    13 runs
 
Benchmark 2: tsort terts_10000_seed0
  Time (mean ± σ):     511.4 ms ±   6.5 ms    [User: 490.1 ms, System: 19.3 ms]
  Range (min … max):   500.5 ms … 519.5 ms    10 runs
 
Summary
  target/release/coreutils tsort terts_10000_seed0 ran
    2.30 ± 0.05 times faster than tsort terts_10000_seed0

The proposed implementation uses Knuth's Algorithm T (TAOCP vol. 1) for topological sorting.

This algorithm works by first listing all nodes and counting how many nodes each one depends on (its predecessors). nodes with no predecessors are added to a queue. We then take the node at the front of the queue, remove it from the graph and append it to the result. Any nodes depending on it have their predecessor count reduced.

If any of those nodes now have no predecessors, they are added at the back of the queue. This continues until the whole queue is processed. If any nodes remain with predecessors at the end, it means there's a cycle, which we find with a DFS on the remaining subgraph.

tertsdiepraam

Looks great! I think you actually settled on something very close to the current version on main, but with a much better implementation. Most of my comments are easy to fix I think. The compat is interesting. I bet we can fix this to be more compatible, but we'd have to investigate more.

src/uu/tsort/src/tsort.rs

tertsdiepraam · 2024-06-27T09:25:46Z

src/uu/tsort/src/tsort.rs

- self.init_node(to);
+ self.add_node(from);
+ if from != to {
+ self.add_node(to);


This is not something for this PR, but in this function, the each node is already hashed twice. It would be interesting to see if we can speed this up by assigning each string an id or something.

This is why I didn't remove the comment regarding the use of int.

src/uu/tsort/src/tsort.rs

github-actions · 2024-06-29T13:51:11Z

GNU testsuite comparison:

Skipping an intermittent issue tests/tail/inotify-dir-recreate (passes in this run but fails in the 'main' branch)

github-actions · 2024-06-29T17:31:28Z

GNU testsuite comparison:

Skipping an intermittent issue tests/tail/inotify-dir-recreate (passes in this run but fails in the 'main' branch)

github-actions · 2024-06-30T14:39:35Z

GNU testsuite comparison:

Skip an intermittent issue tests/tail/inotify-dir-recreate (fails in this run but passes in the 'main' branch)

github-actions · 2024-07-05T07:06:59Z

GNU testsuite comparison:

GNU test failed: tests/timeout/timeout. tests/timeout/timeout is passing on 'main'. Maybe you have to rebase?
Skipping an intermittent issue tests/tail/inotify-dir-recreate (passes in this run but fails in the 'main' branch)

anastygnome · 2024-07-23T11:13:57Z

@tertsdiepraam ding dong!

tertsdiepraam · 2024-07-23T11:30:31Z

Sorry I don't have access to a computer this week. Ill try to look at it soonish

github-actions · 2024-09-20T10:40:06Z

GNU testsuite comparison:

GNU test failed: tests/timeout/timeout. tests/timeout/timeout is passing on 'main'. Maybe you have to rebase?

anastygnome · 2024-09-24T18:28:47Z

@tertsdiepraam Hey there, I've corrected typos & clarified the comments. I'm also working on testing replacing strings with IDs, but that will be for another PR.

github-actions · 2024-09-26T14:32:19Z

GNU testsuite comparison:

GNU test failed: tests/timeout/timeout. tests/timeout/timeout is passing on 'main'. Maybe you have to rebase?

anastygnome · 2024-10-05T14:44:15Z

@sylvestre Hey there, I would also like your input on this one, given the benchmarks,
I fear the output of tsort are very structure dependent and it would require a rewrite of the tree as is.
I believe @tertsdiepraam is correct and it should be merged as is.

There is still work to be done on tsort, but I believe this one makes it at least good enough for now.

This commit is a complete rewrite of the tsort util, resulting in a x10 performance improvement BREAKING CHANGE: the order is no longer the same as gnu tsort in some fringe cases.

github-actions · 2024-10-05T15:11:34Z

GNU testsuite comparison:

GNU test failed: tests/timeout/timeout. tests/timeout/timeout is passing on 'main'. Maybe you have to rebase?

sylvestre · 2024-10-05T15:17:11Z

If Terts is happy, I am happy. Could you please add your benchmark in a BENCHMARKING.md file?
See other for example

anastygnome force-pushed the tsort branch from 1320e6d to 8d3bc12 Compare February 11, 2024 08:01

anastygnome force-pushed the tsort branch from 8d3bc12 to e4d1ed6 Compare June 23, 2024 09:01

anastygnome force-pushed the tsort branch 7 times, most recently from 8c9306a to d8b784b Compare June 23, 2024 09:53

anastygnome force-pushed the tsort branch 4 times, most recently from 2bcf91b to 76abc9d Compare June 24, 2024 20:40

tertsdiepraam reviewed Jun 27, 2024

View reviewed changes

anastygnome force-pushed the tsort branch from 76abc9d to 02ecff3 Compare June 28, 2024 17:39

anastygnome commented Jun 28, 2024

View reviewed changes

src/uu/tsort/src/tsort.rs Show resolved Hide resolved

anastygnome requested a review from tertsdiepraam June 28, 2024 17:42

anastygnome marked this pull request as ready for review June 28, 2024 17:42

anastygnome force-pushed the tsort branch 3 times, most recently from 516ef13 to 42f102e Compare June 29, 2024 06:56

anastygnome force-pushed the tsort branch from 2bbf37b to 9160074 Compare June 29, 2024 17:04

anastygnome force-pushed the tsort branch from 9160074 to a7f0527 Compare June 30, 2024 13:52

anastygnome force-pushed the tsort branch 3 times, most recently from 67136d7 to 4b5c191 Compare July 5, 2024 06:40

anastygnome force-pushed the tsort branch 2 times, most recently from a587261 to f6ddfde Compare July 9, 2024 03:16

anastygnome force-pushed the tsort branch 2 times, most recently from 3cb012d to 2355df0 Compare July 21, 2024 09:23

anastygnome force-pushed the tsort branch from 2355df0 to c8cfeaf Compare August 7, 2024 15:15

anastygnome force-pushed the tsort branch 2 times, most recently from 51bd45c to 9d227cd Compare August 31, 2024 14:27

anastygnome force-pushed the tsort branch from 9d227cd to 97f6bb5 Compare September 20, 2024 10:13

anastygnome force-pushed the tsort branch 2 times, most recently from ffbdf45 to f2f15d7 Compare September 24, 2024 18:28

anastygnome force-pushed the tsort branch from f2f15d7 to 8762d84 Compare September 26, 2024 14:06

refactor(tsort): Switch to Knuth algorithm T

82f169b

This commit is a complete rewrite of the tsort util, resulting in a x10 performance improvement BREAKING CHANGE: the order is no longer the same as gnu tsort in some fringe cases.

anastygnome force-pushed the tsort branch from 8762d84 to 82f169b Compare October 5, 2024 14:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tsort refactoring proposal #5968

tsort refactoring proposal #5968

anastygnome commented Feb 11, 2024 •

edited

Loading

sylvestre commented Feb 11, 2024

anastygnome commented Feb 11, 2024 •

edited

Loading

tertsdiepraam commented Feb 11, 2024 •

edited

Loading

anastygnome commented Feb 11, 2024 •

edited

Loading

tertsdiepraam commented Feb 11, 2024 •

edited

Loading

anastygnome commented Jun 23, 2024

tertsdiepraam commented Jun 23, 2024

github-actions bot commented Jun 24, 2024

anastygnome commented Jun 25, 2024 •

edited

Loading

tertsdiepraam left a comment

tertsdiepraam Jun 27, 2024

anastygnome Jun 28, 2024

github-actions bot commented Jun 29, 2024

github-actions bot commented Jun 29, 2024

github-actions bot commented Jun 30, 2024

github-actions bot commented Jul 5, 2024

anastygnome commented Jul 23, 2024

tertsdiepraam commented Jul 23, 2024

github-actions bot commented Sep 20, 2024

anastygnome commented Sep 24, 2024 •

edited

Loading

github-actions bot commented Sep 26, 2024

anastygnome commented Oct 5, 2024

github-actions bot commented Oct 5, 2024

sylvestre commented Oct 5, 2024

tsort refactoring proposal #5968

Are you sure you want to change the base?

tsort refactoring proposal #5968

Conversation

anastygnome commented Feb 11, 2024 • edited Loading

sylvestre commented Feb 11, 2024

anastygnome commented Feb 11, 2024 • edited Loading

tertsdiepraam commented Feb 11, 2024 • edited Loading

anastygnome commented Feb 11, 2024 • edited Loading

tertsdiepraam commented Feb 11, 2024 • edited Loading

anastygnome commented Jun 23, 2024

tertsdiepraam commented Jun 23, 2024

github-actions bot commented Jun 24, 2024

anastygnome commented Jun 25, 2024 • edited Loading

tertsdiepraam left a comment

Choose a reason for hiding this comment

tertsdiepraam Jun 27, 2024

Choose a reason for hiding this comment

anastygnome Jun 28, 2024

Choose a reason for hiding this comment

github-actions bot commented Jun 29, 2024

github-actions bot commented Jun 29, 2024

github-actions bot commented Jun 30, 2024

github-actions bot commented Jul 5, 2024

anastygnome commented Jul 23, 2024

tertsdiepraam commented Jul 23, 2024

github-actions bot commented Sep 20, 2024

anastygnome commented Sep 24, 2024 • edited Loading

github-actions bot commented Sep 26, 2024

anastygnome commented Oct 5, 2024

github-actions bot commented Oct 5, 2024

sylvestre commented Oct 5, 2024

anastygnome commented Feb 11, 2024 •

edited

Loading

anastygnome commented Feb 11, 2024 •

edited

Loading

tertsdiepraam commented Feb 11, 2024 •

edited

Loading

anastygnome commented Feb 11, 2024 •

edited

Loading

tertsdiepraam commented Feb 11, 2024 •

edited

Loading

anastygnome commented Jun 25, 2024 •

edited

Loading

anastygnome commented Sep 24, 2024 •

edited

Loading