Some new performance features #14

ericeil · 2024-05-16T15:33:49Z

While doing some Prover performance work, I found the following to be useful:

- Add mapReduce and parallelMapReduce methods on TreapSet, TreapMap, and TreapList. Each maps every element to a result and combines those results with a reduce function; the parallel variant may fork the work across subtrees.
- Allow TreapMap.updateValues (and parallelUpdateValues) to change the type of the values. This makes a straightforward mapping of values to different types an O(N) operation instead of O(N log N).

I also simplified the TreapMap.updateEntry signature a bit (it had some extraneous nullability annotations).
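For readers unfamiliar with the shape of a tree map-reduce, here is a minimal sketch on a plain binary tree. The `Node` class and `sampleTree` are hypothetical stand-ins, not the library's treap types, and the sketch is sequential only. It assumes `reduce` is associative, which is what would let a parallel variant compute the left, middle, and right results independently.

```kotlin
// Hypothetical stand-in for a treap node; not the library's real node type.
class Node<E>(val value: E, val left: Node<E>? = null, val right: Node<E>? = null)

// Maps every element to an R and combines the results in left-to-right order.
// R is bounded by Any so that null can stand for "this subtree is absent".
fun <E, R : Any> Node<E>.mapReduce(map: (E) -> R, reduce: (R, R) -> R): R {
    val l = left?.mapReduce(map, reduce)   // null when there is no left subtree
    val m = map(value)                     // result for this node's own element
    val r = right?.mapReduce(map, reduce)  // null when there is no right subtree
    val leftAndMiddle = if (l != null) reduce(l, m) else m
    return if (r != null) reduce(leftAndMiddle, r) else leftAndMiddle
}

// A three-element tree holding 1, 2, 3 in order.
fun sampleTree(): Node<Int> = Node(2, Node(1), Node(3))
```

With this sketch, `sampleTree().mapReduce({ it * 10 }, { a, b -> a + b })` sums the mapped values, and mapping to strings with concatenation as the reducer preserves element order.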

jtoman

Nits and requests for comment

jtoman · 2024-05-17T17:16:21Z

collect/src/main/kotlin/com/certora/collect/AbstractTreapMap.kt

+    private fun <R : Any> mapReduceImpl(map: (K, V) -> R, reduce: (R, R) -> R): R {
+        val (left, middle, right) = fork(
+            self,
+            { left?.mapReduceImpl(map, reduce) },
+            { shallowMapReduce(map, reduce) },
+            { right?.mapReduceImpl(map, reduce) }
+        )
+        val leftAndMiddle = left?.let { reduce(it, middle) } ?: middle
+        return right?.let { reduce(leftAndMiddle, it) } ?: leftAndMiddle
+    }


Is there a compelling reason not to make this an erased-types implementation too? It seems like we always go back and do this for performance anyway, so...

Static typing is good. :)

jtoman · 2024-05-17T17:18:21Z

collect/src/main/kotlin/com/certora/collect/AbstractTreapSet.kt

+    override fun <R : Any> mapReduce(map: (E) -> R, reduce: (R, R) -> R): R =
+        notForking(self) { mapReduceImpl(map, reduce) }
+
+    override fun <R : Any> parallelMapReduce(map: (E) -> R, reduce: (R, R) -> R, parallelThresholdLog2: Int): R =
+        maybeForking(self, threshold = { it.isApproximatelySmallerThanLog2(parallelThresholdLog2) }) {
+            mapReduceImpl(map, reduce)
+        }
+
+    context(ThresholdForker<S>)
+    private fun <R : Any> mapReduceImpl(map: (E) -> R, reduce: (R, R) -> R): R {
+        val (left, middle, right) = fork(
+            self,
+            { left?.mapReduceImpl(map, reduce) },
+            { shallowMapReduce(map, reduce) },
+            { right?.mapReduceImpl(map, reduce) }
+        )
+        val leftAndMiddle = left?.let { reduce(it, middle) } ?: middle
+        return right?.let { reduce(leftAndMiddle, it) } ?: leftAndMiddle
+    }


Just so I'm not crazy: this is the same implementation as the map case, right?

Yes, I think it's just the map lambda type that's different.

jtoman · 2024-05-17T17:19:58Z

collect/src/main/kotlin/com/certora/collect/EmptyTreapMap.kt

+    override fun <R : Any> updateValues(
+        transform: (K, V) -> R?
+    ): TreapMap<K, R> = treapMapOf()

-    override fun <U> updateEntry(key: K, value: U?, merger: (V?, U?) -> V?): TreapMap<K, V> = 
+    override fun <R : Any> parallelUpdateValues(
+        parallelThresholdLog2: Int,
+        transform: (K, V) -> R?
+    ): TreapMap<K, R> = treapMapOf()


Can't you return this and just unsafe-cast it? That's ultimately what treapMapOf is doing, right? I'm not suggesting we do this, just making sure I understand what the code is doing.

Yes, this is just a nicer-looking way of doing that.
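To illustrate the pattern under discussion, here is a small sketch of a shared empty singleton that is handed out at any element type through one unchecked cast. `Bag`, `EmptyBag`, and `emptyBag` are hypothetical names invented for this example; the library's actual treapMapOf implementation may differ. The cast is safe here only because an empty container never produces a value of its element type.

```kotlin
// Hypothetical container interface, invented for this sketch.
interface Bag<T> {
    val size: Int
    fun <R : Any> mapValues(transform: (T) -> R?): Bag<R>
}

// One shared empty instance; it never holds an element of any type.
private object EmptyBag : Bag<Nothing> {
    override val size: Int get() = 0
    // Mapping an empty bag yields an empty bag of the new element type.
    override fun <R : Any> mapValues(transform: (Nothing) -> R?): Bag<R> = emptyBag()
}

// Factory that hides the unchecked cast behind a typed signature.
@Suppress("UNCHECKED_CAST")
fun <T> emptyBag(): Bag<T> = EmptyBag as Bag<T>
```

Calling the factory from `mapValues` and returning `this` with a cast produce the same object; the factory call just keeps the cast in one place.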

jtoman · 2024-05-17T17:23:52Z

collect/src/main/kotlin/com/certora/collect/HashTreapMap.kt

+    override fun <R : Any> shallowMapReduce(map: (K, V) -> R, reduce: (R, R) -> R): R {
+        var result: R? = null
+        forEachPair {
+            val mapped = map(it.key, it.value)
+            result = result?.let { result -> reduce(result, mapped) } ?: mapped
+        }
+        return result!!
+    }


Is the type bound on R here just so we have nullability of result? I'm trying to think of a way to relax this, but I think you can only use, e.g., lateinit on non-null types.

Probably not worth it.

We depend on this constraint in mapReduceImpl, where it's a bigger deal. It avoids needing to allocate a result holder for each step of the traversal.
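A minimal sketch of why the non-null bound helps, using an ordinary Iterable instead of a treap. `mapReduceOrNull` is a hypothetical helper, not part of the library. Because R cannot itself be null, a null accumulator unambiguously means "nothing reduced yet", so no wrapper object is needed to track that state.

```kotlin
// Hypothetical helper: reduces mapped items, returning null for empty input.
// With R : Any, the type R? gives a free "no result yet" marker.
fun <T, R : Any> mapReduceOrNull(
    items: Iterable<T>,
    map: (T) -> R,
    reduce: (R, R) -> R
): R? {
    var result: R? = null
    for (item in items) {
        val mapped = map(item)
        val acc = result
        // First element seeds the accumulator; later ones are folded in.
        result = if (acc == null) mapped else reduce(acc, mapped)
    }
    return result
}
```

If R were allowed to be a nullable type, a legitimate null result could not be told apart from "no elements seen", and the accumulator would need a separate flag or holder.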

jtoman · 2024-05-17T17:27:08Z

collect/src/main/kotlin/com/certora/collect/TreapListNode.kt

+
+        internal tailrec fun <E> TreapListNode<E>?.isApproximatelySmallerThanLog2(sizeLog2: Int): Boolean = when {
+            this == null -> true
+            sizeLog2 <= 0 -> false


When could this ever be less than zero? Seems like we should throw in that case?

Good point!

jtoman · 2024-05-17T17:29:30Z

collect/src/main/kotlin/com/certora/collect/TreapMap.kt

+    public fun <R : Any> updateValues(
+        transform: (K, V) -> R?
+    ): TreapMap<K, R>


So this is a change in the public API, right? If V was a nullable type, you could use this function to update the values of the map. But now you can't: updateValues is unusable for nullable types. Maybe that was a bug, as I suspect that we removed entries for which transform returned null, but still, it means part of the API is unusable depending on your type parameter.

I think I'm in favor of this change, as it makes explicit that null values are filtered out, whereas previously you could write:

treapMapOf("foo" to 3, "bar" to null).updateValues { _, v -> v }

and have this return a map without "bar" (and no null values), but this wasn't reflected in the return type of updateValues.

Can we document this behavior a bit more explicitly though?

You can definitely write this, before and after this change:

var map = treapMapOf("foo" to 3, "bar" to null).updateValues { _, v -> v }

In both cases the result is a map without the entries that had null values. With this change, the result will be typed as TreapMap<String, Int> instead of TreapMap<String, Int?>.

I will add some comments.
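The filtering behavior described above can be sketched against the standard library's Map. `updateValuesSketch` is a hypothetical extension written for illustration; it is not the TreapMap implementation and does not share its performance characteristics. It only shows how a `transform` returning `R?` with `R : Any` both drops null results and narrows the value type.

```kotlin
// Hypothetical illustration on kotlin.collections.Map, not TreapMap itself.
// Entries whose transform result is null are omitted from the output.
fun <K, V, R : Any> Map<K, V>.updateValuesSketch(transform: (K, V) -> R?): Map<K, R> {
    val out = LinkedHashMap<K, R>()
    for ((k, v) in this) {
        val mapped = transform(k, v)
        if (mapped != null) out[k] = mapped
    }
    return out
}

// Input has a nullable value type; output type is Map<String, Int>.
fun exampleIdentity(): Map<String, Int> =
    mapOf("foo" to 3, "bar" to null).updateValuesSketch { _, v -> v }

// The value type can also change, here from Int? to String.
fun exampleRetyped(): Map<String, String> =
    mapOf("foo" to 3, "bar" to null).updateValuesSketch { _, v -> v?.toString() }
```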

Add mapReduce/parallelMapReduce, and some other tweaks

f2d3ada

ericeil force-pushed the mapReduce branch from 7655cbd to f2d3ada Compare May 16, 2024 21:51

ericeil changed the title ~~A couple of new features and a bugfix~~ Some new performance features May 16, 2024

ericeil requested a review from jtoman May 16, 2024 22:19

jtoman reviewed May 17, 2024

John's feedback

42dc2e6

ericeil requested a review from jtoman May 17, 2024 20:21

One more tweak

005a429

jtoman approved these changes May 21, 2024

ericeil merged commit 840d5eb into Certora:main May 21, 2024
1 check passed
