Optimize CollectionSpec checker #436

nilern · 2022-02-05T21:33:37Z

Replace a doseq with reduce on JVM
Accumulate into a JS array on cljs instead of a persistent vector in an atom

The latter could improve #356 substantially; in the Malli seqex impl I had a similar situation with a persistent set in an atom on cljs and it was just unacceptably slow.

w01fe · 2022-02-05T21:38:18Z

src/cljc/schema/spec/collection.cljc

@@ -23,19 +23,18 @@
      (let [_ (macros/assert! (= 2 (count e)) "remaining can have only one schema.")
            c (spec/sub-checker (second e) params)]
        #?(:clj (fn [^java.util.List res x]
-                  (doseq [i x]
-                    (.add res (c i)))
+                  (reduce (fn [res i] (doto res (.add (c i)))) res x)


Why is this expected to be faster? The List can't be used safely concurrently so I would be surprised if you could do better than the doseq. Have you profiled and seen a speedup?

reduce (and run!) is faster and more memory efficient because as an internal iterator it can be fully specialized to the collection and usually does not need to allocate at all. I have profiled it sometime but by now it is just a habit like using (into [] (map ...) x) instead of (into [] (map ... x)). Since reduce works on all seqs really the only reason to ever use doseq is if :when/:while/:let is super useful.

The weird thing about reduce here is that it's threading through an extra argument that you don't care about because you're mutating the argument anyway. I think it makes the code slightly less clear and I would be a bit surprised if there was a performance benefit, but I'd be happy to believe it if there were accompanying measurements.

The weird thing about reduce here is that it's threading through an extra argument that you don't care about because you're mutating the argument anyway.

Might be worth comparing to (reduce #(.add res (c %2))) nil x).

Oh, it would probably beat the current code handily since I don't see a type hint on the inner res.

EDIT: I've added a reflection check to the master build.
EDIT2: Of course I missed the doto propagating the type hint, nevermind.

w01fe · 2022-02-05T21:41:58Z

src/cljc/schema/spec/collection.cljc

                  (then res nil))
           :cljs (fn [res x]
-                   (swap! res into (map c x))
+                   (reduce (fn [res i] (doto res (.push (c i)))) res x)


I believe that something mutable could be faster here, but I'm again unsure about the use of reduce. Can you share profiling results?

into uses reduce too! Here it is more important that the data structure is mutable. I'll do some profiling later.

w01fe · 2022-02-05T21:42:39Z

src/cljc/schema/spec/collection.cljc

@@ -58,7 +57,7 @@

 :cljs
 (defn- has-error? [l]
-  (some utils/error? l)))
+  (.some l utils/error?)))


I'm not super familiar with cljs, can you explain the change here please?

Now that l is a JS array it just uses Array#some.

Thanks. Is that expected to be appreciably faster?

w01fe · 2022-02-05T21:44:23Z

Thanks for the PR! Certainly happy to accept optimizations but always prefer if they come with measurements. And would like to learn more about the choice of reduce over doseq.

nilern · 2022-02-05T22:08:59Z

I want to do some benchmarks but for now I am out of time.

w01fe · 2022-02-06T21:38:32Z

Thanks! I just want to make sure we are striking the right balance between performance and clarity, i.e. only making optimizations that make a measurable difference. My intuition is admittedly stale here, so I'm happy to take these with measurements, or happy if another maintainer wants to take them without.

nilern · 2022-02-12T17:41:41Z

I don't think this adds much complexity; it is just using JS arrays just like it was already using ArrayList on the JVM.

While I am fairly confident it will also be faster it would be silly to optimize without benchmarking.

Optimize CollectionSpec checker

e640557

w01fe reviewed Feb 5, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize CollectionSpec checker #436

Optimize CollectionSpec checker #436

nilern commented Feb 5, 2022

w01fe Feb 5, 2022

nilern Feb 5, 2022

w01fe Feb 6, 2022

frenchy64 Mar 18, 2022

frenchy64 Mar 18, 2022 •

edited

Loading

w01fe Feb 5, 2022

nilern Feb 5, 2022

w01fe Feb 5, 2022

nilern Feb 5, 2022

w01fe Feb 6, 2022

w01fe commented Feb 5, 2022

nilern commented Feb 5, 2022

w01fe commented Feb 6, 2022

nilern commented Feb 12, 2022

Optimize CollectionSpec checker #436

Are you sure you want to change the base?

Optimize CollectionSpec checker #436

Conversation

nilern commented Feb 5, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

frenchy64 Mar 18, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

w01fe commented Feb 5, 2022

nilern commented Feb 5, 2022

w01fe commented Feb 6, 2022

nilern commented Feb 12, 2022

frenchy64 Mar 18, 2022 •

edited

Loading