
Feature: Basic matrix operations until QR and SVD algorithms #172

Merged
merged 10 commits into janet-lang:master from hyiltiz:matrix on Feb 16, 2024

Conversation

hyiltiz
Contributor

@hyiltiz hyiltiz commented Jan 21, 2024

  • add basic matrix operations
  • add stable QR decomposition algorithm
  • add simple SVD
  • finalize tests

@hyiltiz hyiltiz marked this pull request as draft January 21, 2024 08:12
@sogaiu
Contributor

sogaiu commented Jan 21, 2024

I was going to mention using ; (splice) vs apply, for example, instead of:

(defn dot [v1 v2]
  (apply + (map * v1 v2)))

expressing as:

(defn dot [v1 v2]
  (+ ;(map * v1 v2)))

but when I got to comparing the speed difference, I got the sense that apply was slightly faster (added some newlines for readability):

$ janet
Janet 1.33.0-23b0fe9f linux/x64/gcc - '(doc)' for help

repl:1:> (do (use spork/test) :done)
:done

repl:2:> (defn dot1 [v1 v2] (apply + (map * v1 v2)))
<function dot1>

repl:3:> (defn dot2 [v1 v2] (+ ;(map * v1 v2)))
<function dot2>

repl:4:> (timeit-loop [:timeout 10] "dot1" (dot1 [2 3 8 9] [1 0 -2 7]))
dot1 10.000s, 0.9481µs/body
nil

repl:5:> (timeit-loop [:timeout 10] "dot2" (dot2 [2 3 8 9] [1 0 -2 7]))
dot2 10.000s, 1.011µs/body
nil

That got me thinking though... maybe using loop with an accumulator could be faster:

repl:6:> (defn dot3 [v1 v2] (var t 0) (loop [i :in v1 j :in v2] (+= t (* i j))) t)
<function dot3>

repl:7:> (timeit-loop [:timeout 10] "dot3" (dot3 [2 3 8 9] [1 0 -2 7]))
dot3 10.000s, 0.4623µs/body
nil

I did something similar for subtract using seq:

$ janet
Janet 1.33.0-23b0fe9f linux/x64/gcc - '(doc)' for help
repl:1:> (do (use spork/test) :done)
:done

repl:2:> (defn subtract1 [v1 v2] (map - v1 v2))
<function subtract1>

repl:3:> (defn subtract2 [v1 v2] (seq [i :in v1 j :in v2] (- i j)))
<function subtract2>

repl:4:> (timeit-loop [:timeout 10] (subtract1 [2 3 8 9] [1 0 -2 7]))
Elapsed time: 10.000s, 0.8114µs/body
nil

repl:5:> (timeit-loop [:timeout 10] (subtract2 [2 3 8 9] [1 0 -2 7]))
Elapsed time: 10.000s, 0.7926µs/body
nil

Not that different in this case I guess.

But using array/push:

repl:8:> (defn subtract3 [v1 v2] (def res @[]) (for i 0 (length v1) (array/push res (- (get v1 i) (get v2 i)))) res)
<function subtract3>

repl:9:> (timeit-loop [:timeout 10] (subtract3 [2 3 8 9] [1 0 -2 7]))
Elapsed time: 10.000s, 0.2809µs/body
nil

Hmm, maybe using for instead of loop for dot would be faster... looks like it:

repl:8:> (defn dot4 [v1 v2] (var t 0) (for i 0 (length v1) (+= t (* (get v1 i) (get v2 i)))) t)
<function dot4>

repl:9:> (timeit-loop [:timeout 10] "dot4" (dot4 [2 3 8 9] [1 0 -2 7]))
dot4 10.000s, 0.1531µs/body
nil

Don't know if this kind of thing is worth it, but FWIW.

@sogaiu
Contributor

sogaiu commented Jan 21, 2024

Not sure how I feel about:

(defn sign [x]
  (if (>= x 0) 1 -1))

If we're going for this, maybe we want 0 to be returned when x is 0?

@sogaiu
Contributor

sogaiu commented Jan 21, 2024

I noticed there are now 2 functions named dot.

@bakpakin
Member

Thanks @hyiltiz , looks interesting!

@sogaiu I wouldn't focus too much on small optimizations - generally, explicitly using loop instead of map will be faster but either way is fine.

@hyiltiz
Contributor Author

hyiltiz commented Jan 21, 2024

Thank you so much for all the comments! I worked on this to introduce basic matrix utilities to Janet, hence I started adding anything that was needed until QR and SVD were possible. There are surely a lot of optimizations that are possible, as @sogaiu has identified. I am more than happy to adopt those changes.

A big question is that spork/math adopted a row-first convention for matrices (an array is understood as a row vector, rather than a column vector, so a linear equation is expressed as b = xA, whereas the usual convention is b = Ax, assuming an array represents a column vector). If there are not many external dependencies, it might make sense to deprecate the old convention; I'll have to adjust this PR as well.

On a minor note, the earlier dot seemed too complex for what dot does; unless there are significant benefits in efficiency, I think it is better to rely on the compiler to provide optimizations.
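The row-first convention means a vector multiplies a matrix from the left, b = xA. A minimal sketch of that multiplication (the helper name is hypothetical, not part of spork/math):

```janet
# Row-first convention: x is a row vector, so b = xA.
# Hypothetical helper, not part of spork/math.
(defn rowvec-mul [x A]
  (def n (length (get A 0)))
  (seq [j :range [0 n]]
    (var t 0)
    (for i 0 (length x)
      (+= t (* (get x i) (get-in A [i j]))))
    t))

# With the identity matrix the vector is unchanged:
# (rowvec-mul [1 2] @[@[1 0] @[0 1]]) -> @[1 2]
```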

@sogaiu
Contributor

sogaiu commented Jan 22, 2024

In general I'm not a fan of trying to do the kinds of optimizations I experimented with above because:

  • some of these things could be obsoleted by changes to janet or other code
  • they often obscure what's going on, potentially making maintenance / comprehension by future parties more time-consuming

To explain some of the motivation for earlier optimization comments...

  • in another channel hyiltiz and I had been discussing efficiency matters so this was a bit of a continuation
  • ulterior motive to see if anyone (hi @primo-ppcg!) might know better and would leak some of their insight (^^;
  • just recording for future reference some current performance differences and implementation alternatives

@primo-ppcg
Contributor

Since I've been asked to comment:

  • A few of these functions are already defined, most notably dot and trans, although the proposed implementations are cleaner, and could/should replace the existing. I've actually mentioned this simpler implementation of transpose previously.
  • sum should probably be preferred over apply + (e.g. in the implementation of dot). sum compiles to a loop, whereas apply + compiles to push and call instructions. The same is true for product and apply *.
  • sign should probably be defined as (cmp x 0), if at all.
  • I don't think that trans-v needs to exist, or at very least without an underlying type for "row vector" and "column vector", its current implementation is not very meaningful.
> (defn trans-v [xs] (map array xs))
<function trans-v>
> (def v (range 5))
@[0 1 2 3 4]
> (trans-v v)
@[@[0] @[1] @[2] @[3] @[4]]
> (trans-v (trans-v v))
@[@[@[0]] @[@[1]] @[@[2]] @[@[3]] @[@[4]]]

Perhaps explicitly transposing 1xN or Nx1 matrices would be more clear?

  • A transpose, op, transpose composition should be unnecessary in most cases. For example, fliplr could be defined as just (map reverse m).
  • There's a little bit of code duplication (e.g. matmul redefines transpose in place).
  • I'm not familiar enough with the QR or SVD algorithms used to be able to comment on the implementations. Is O(n^3) the fastest algorithm known?
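For reference, the (cmp x 0) suggestion above yields a sign function that returns 0 at 0, addressing the earlier comment about the x = 0 case:

```janet
# Janet's built-in cmp returns -1, 0, or 1, which matches the
# mathematical sign function (including sign(0) = 0).
(defn sign [x] (cmp x 0))

# (sign 5)  -> 1
# (sign 0)  -> 0
# (sign -3) -> -1
```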

@hyiltiz
Contributor Author

hyiltiz commented Jan 24, 2024

The QR algorithm used is (one of) the best known; the SVD is (one of) the simplest given QR. There are a lot of other algorithms designed for efficiency, but those are usually complex enough that they are not re-implemented and are instead wrapped/called from the LAPACK/BLAS/Eigen libraries. I think this is a simple and direct first step toward a purely Janet-based implementation.

Thank you so much all for the informative and detailed feedback! Given the valuable feedback above, I'll revise the draft and see if we can get it closer to a PR that we can consider for a merge.

spork/math.janet Outdated
(let [m3 @[@[1 2 3] @[4 5 6] @[7 8 9]]]
(assert (m-approx= (matmul m3 (ident (rows m3)))
m3)
"matmul identity left: this test succeeds here but fails in suite-math.janet"))
Contributor Author

Any idea why?

Contributor Author

Fixed. Caused by stateful computation due to join-rows not being idempotent (array/concat mutates first argument).
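The mutation can be reproduced in isolation; one non-mutating variant (a sketch with a hypothetical name) copies the first argument before concatenating:

```janet
# array/concat mutates its first argument:
(def m1 @[@[1 2]])
(array/concat m1 @[@[3 4]])
# m1 is now @[@[1 2] @[3 4]] -- a join built directly on this
# gives different results on repeated calls.

# Copying first keeps the join side-effect free (hypothetical sketch):
(defn join-rows* [a b]
  (array/concat (array/slice a) b))
```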


(assert (m-approx= (matmul m3 (ident (rows m3)))
m3)
"matmul identity left")
Contributor Author

Fails here.

Contributor Author

Fixed. Caused by stateful computation due to join-rows not being idempotent (array/concat mutates first argument).

Contributor Author

Thanks @sogaiu for digging into it.

@hyiltiz
Contributor Author

hyiltiz commented Feb 11, 2024

New changes:

  • Added a few more tests and passed all tests
  • Removed redundancies in utilities such as transpose, dot, mul, matmul
  • Removed the original dot, which was not used anywhere except as a helper for mul, and which actually implemented matrix multiplication rather than the dot product ("dot product" is not conventionally defined for matrices). A simple implementation of dot is provided, defined as the inner product of two (row) vectors of equal dimension.
  • Renamed trans-v to row->col and made it idempotent.
  • The mul implementation currently mutates its first argument, and several existing functions rely on that. mul also accepts a scalar, a vector (confusingly, a row vector, but it will be treated as a column; see earlier discussions), and a matrix. To avoid confusion from this and from the stateful nature of mul, please adopt matmul or dot for all new code.
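An idempotent row->col can be sketched by wrapping elements only when they are not already rows (a sketch; the PR's actual implementation may differ):

```janet
(defn row->col [v]
  # If elements are already indexed (rows), v is already a column.
  (if (indexed? (get v 0))
    v
    (map array v)))

# (row->col [0 1 2])            -> @[@[0] @[1] @[2]]
# (row->col (row->col [0 1 2])) -> @[@[0] @[1] @[2]] (idempotent)
```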

@hyiltiz hyiltiz marked this pull request as ready for review February 11, 2024 03:17
@hyiltiz hyiltiz changed the title Draft: basic matrix operations until QR and SVD algorithms [WIP] Basic matrix operations until QR and SVD algorithms Feb 11, 2024
@hyiltiz hyiltiz changed the title Basic matrix operations until QR and SVD algorithms Feature: Basic matrix operations until QR and SVD algorithms Feb 11, 2024
@sogaiu
Contributor

sogaiu commented Feb 12, 2024

Some minor comments:

  • Maybe the (let ...) here can be removed?
  • I think the wrapping (do ...) here is not needed. Similar thing here and here.
  • The m at the end of this line feels on the hidden side to me; maybe it could go on a line by itself?
  • Are you intending to address this TODO?
  • There is some minor tidying that could be done via the janet-format script (included in a spork installation) which I think will remove trailing whitespace and format things a bit.
  • For this bit I think instead of:
    (set (cells x)
         (*
           bc
           (math/pow p x)
           (math/pow (- 1 p) (- t x))))
    it makes sense to do at least (maybe some other folks would go further):
    (set (cells x)
         (* bc
            (math/pow p x)
            (math/pow (- 1 p) (- t x))))

@hyiltiz
Contributor Author

hyiltiz commented Feb 14, 2024

Adjusted for all of the points. Some of the points refer to functions such as sop and binomial-distribution that are not part of this PR but are part of the module touched by this PR. Happy to fix.

@sogaiu
Contributor

sogaiu commented Feb 14, 2024

Ah, sorry about those unrelated bits. Not sure what to do about those. Will think on it.

Hope to look in more detail soon, but wondering about this change. I'm not too familiar with GH CI, but is this intentional?

@hyiltiz
Contributor Author

hyiltiz commented Feb 14, 2024

Yes; that change should allow this PR (and all other incoming PRs) to run the linting and checks without requiring a repo admin to click Approve. Less friction for PR contributions. It probably should've been a separate PR, but it is just a single line of change, so may as well...

@@ -3,7 +3,7 @@ name: Test
on:
push:
branches: [ master ]
pull_request:
Member

This should be part of the separate PR.

Contributor

That's my feeling as well, but I think it's really up to bakpakin to decide.

I do think that separating makes later history easier to understand. I know we don't all follow this course of action all of the time though (^^;

Possibly reversion is simpler too? Not so sure about this point though.

Member

I would say it is just a good manner not to "hide" surprises :-).

TBH I am definitely on the side of having that CI config in. I just do not like the sneaky way :-).

@sogaiu
Copy link
Contributor

sogaiu commented Feb 15, 2024

There is a small issue with the tests.

The following diff may fix it:

diff --git a/test/suite-math.janet b/test/suite-math.janet
index 1950c79..edaa477 100644
--- a/test/suite-math.janet
+++ b/test/suite-math.janet
@@ -420,90 +420,90 @@
       res-svd (svd m3)
       U (res-svd :U)
       S (res-svd :S)
-      V (res-svd :V)
-      (assert (deep= m23 m23)
-              "deep= matrix")
-
-      (assert (deep= (flipud m23)
-                     @[@[4 5 6] @[1 2 3]])
-              "flipud")
-
-      (assert (deep= (fliplr m23)
-                     @[@[3 2 1] @[6 5 4]])
-              "fliplr")
-
-      (assert (deep= (join-rows m3 m23)
-                     @[@[1 2 3]
-                       @[4 5 6]
-                       @[7 8 9]
-                       @[1 2 3]
-                       @[4 5 6]])
-              "join-rows")
-
-      (assert (deep= (join-cols m23 m23)
-                     @[@[1 2 3 1 2 3]
-                       @[4 5 6 4 5 6]])
-              "join-cols")
-
-      (assert (m-approx= (res1-m3 :Q)
-                         @[@[-0.123091490979333 -0.492365963917331 -0.861640436855329]
-                           @[-0.492365963917331 0.784145597779528 -0.377745203885826]
-                           @[-0.861640436855329 -0.377745203885826 0.338945893199805]])
-              "qr1-q")
-
-      (assert (m-approx= (res1-m3 :m^)
-                         @[@[-0.0859655700236277 -0.171931140047257]
-                           @[-0.90043974754135 -1.8008794950827]])
-              "qr1-m")
-
-      (assert (m-approx= (res-m3 :Q)
-                         @[@[-0.123091490979333 0.904534033733291 0.408248290463864]
-                           @[-0.492365963917331 0.301511344577765 -0.816496580927726]
-                           @[-0.861640436855329 -0.301511344577764 0.408248290463863]])
-              "qr-q")
-
-      (assert (m-approx= (res-m3 :R)
-                         @[@[-8.12403840463596 -9.60113629638795 -11.0782341881399]
-                           @[-8.88178419700125e-16 0.90453403373329 1.80906806746658]
-                           @[-8.88178419700125e-16 -4.44089209850063e-16 8.88178419700125e-16]])
-              "qr-r")
-
-      (assert (m-approx= U
-                         @[@[0.214837238368396 -0.887230688346371 0.408248290463863]
-                           @[0.520587389464737 -0.249643952988298 -0.816496580927726]
-                           @[0.826337540561078 0.387942782369775 0.408248290463863]])
-              "svd-U")
-
-      (assert (m-approx= S
-                         @[@[16.8481033526142 0 0]
-                           @[-1.1642042401554e-237 -1.06836951455471 0]
-                           @[-6.42285339593621e-323 0 3.62597321469472e-16]])
-
-              "svd-S")
-
-      (assert (m-approx= V
-                         @[@[0.479671177877771 -0.776690990321559 0.408248290463863]
-                           @[0.572367793972062 -0.0756864701045582 -0.816496580927726]
-                           @[0.665064410066353 0.625318050112442 0.408248290463863]])
-              "svd-U")
-
-      (assert (m-approx= (matmul m3 (ident (rows m3)))
-                         m3)
-              "matmul identity left")
-
-      (assert (m-approx= (matmul (ident (rows m3)) m3)
-                         m3)
-              "matmul identity right")
-
-      (assert (m-approx= m3 (matmul (res-m3 :Q) (res-m3 :R)))
-              "qr-square decompose")
-
-      (assert (m-approx= m23 (matmul (res-m23 :Q) (res-m23 :R)))
-              "qr-non-square decompose")
-
-      (assert (m-approx= m3 (reduce matmul (ident (rows U))
-                                    (array U S (trans V))))
-              "svd-USV' decompose")])
+      V (res-svd :V)]
+  (assert (deep= m23 m23)
+          "deep= matrix")
+
+  (assert (deep= (flipud m23)
+                 @[@[4 5 6] @[1 2 3]])
+          "flipud")
+
+  (assert (deep= (fliplr m23)
+                 @[@[3 2 1] @[6 5 4]])
+          "fliplr")
+
+  (assert (deep= (join-rows m3 m23)
+                 @[@[1 2 3]
+                   @[4 5 6]
+                   @[7 8 9]
+                   @[1 2 3]
+                   @[4 5 6]])
+          "join-rows")
+
+  (assert (deep= (join-cols m23 m23)
+                 @[@[1 2 3 1 2 3]
+                   @[4 5 6 4 5 6]])
+          "join-cols")
+
+  (assert (m-approx= (res1-m3 :Q)
+                     @[@[-0.123091490979333 -0.492365963917331 -0.861640436855329]
+                       @[-0.492365963917331 0.784145597779528 -0.377745203885826]
+                       @[-0.861640436855329 -0.377745203885826 0.338945893199805]])
+          "qr1-q")
+
+  (assert (m-approx= (res1-m3 :m^)
+                     @[@[-0.0859655700236277 -0.171931140047257]
+                       @[-0.90043974754135 -1.8008794950827]])
+          "qr1-m")
+
+  (assert (m-approx= (res-m3 :Q)
+                     @[@[-0.123091490979333 0.904534033733291 0.408248290463864]
+                       @[-0.492365963917331 0.301511344577765 -0.816496580927726]
+                       @[-0.861640436855329 -0.301511344577764 0.408248290463863]])
+          "qr-q")
+
+  (assert (m-approx= (res-m3 :R)
+                     @[@[-8.12403840463596 -9.60113629638795 -11.0782341881399]
+                       @[-8.88178419700125e-16 0.90453403373329 1.80906806746658]
+                       @[-8.88178419700125e-16 -4.44089209850063e-16 8.88178419700125e-16]])
+          "qr-r")
+
+  (assert (m-approx= U
+                     @[@[0.214837238368396 -0.887230688346371 0.408248290463863]
+                       @[0.520587389464737 -0.249643952988298 -0.816496580927726]
+                       @[0.826337540561078 0.387942782369775 0.408248290463863]])
+          "svd-U")
+
+  (assert (m-approx= S
+                     @[@[16.8481033526142 0 0]
+                       @[-1.1642042401554e-237 -1.06836951455471 0]
+                       @[-6.42285339593621e-323 0 3.62597321469472e-16]])
+
+          "svd-S")
+
+  (assert (m-approx= V
+                     @[@[0.479671177877771 -0.776690990321559 0.408248290463863]
+                       @[0.572367793972062 -0.0756864701045582 -0.816496580927726]
+                       @[0.665064410066353 0.625318050112442 0.408248290463863]])
+          "svd-U")
+
+  (assert (m-approx= (matmul m3 (ident (rows m3)))
+                     m3)
+          "matmul identity left")
+
+  (assert (m-approx= (matmul (ident (rows m3)) m3)
+                     m3)
+          "matmul identity right")
+
+  (assert (m-approx= m3 (matmul (res-m3 :Q) (res-m3 :R)))
+          "qr-square decompose")
+
+  (assert (m-approx= m23 (matmul (res-m23 :Q) (res-m23 :R)))
+          "qr-non-square decompose")
+
+  (assert (m-approx= m3 (reduce matmul (ident (rows U))
+                                (array U S (trans V))))
+          "svd-USV' decompose"))
 
 
 (assert (= 10 (perm @[@[1 2]

@sogaiu
Contributor

sogaiu commented Feb 15, 2024

With the diff above, tests pass for me.


Sorry about the sop and binomial-distribution bits -- I thought I had only commented on lines that had changes on them.

Not sure what happened there (^^;

@hyiltiz
Contributor Author

hyiltiz commented Feb 16, 2024

Patch adopted. Separated workflow change into separate PR: #178.

@sogaiu
Contributor

sogaiu commented Feb 16, 2024

00f3760 passed all tests here 👍

@bakpakin bakpakin merged commit caf9220 into janet-lang:master Feb 16, 2024
1 check passed
@hyiltiz hyiltiz deleted the matrix branch February 19, 2024 17:02