Content: Specify output size calculations #582

inexorabletash · 2024-02-24T00:50:03Z

This covers:

Reduction ops (including argMin/argMax)
conv2d
convTranspose2d
Pooling ops (which rely on conv2d)

Fixes #500

inexorabletash · 2024-02-24T00:57:13Z

This is basically a translation of the Chromium C++ implementation into spec-ese, although parameter validation remains separated out. That can be revisited. And like the Chromium implementation, there's a lot of logic sharing among the ops.

We could also go further with helper ops, e.g. unpacking and re-packing operand layouts given an MLOperandLayout

zolkis · 2024-02-24T13:10:35Z

Wow, this is big! Thanks!

inexorabletash · 2024-02-24T22:28:12Z

Wow, this is big! Thanks!

Yeah... I didn't know what I was getting myself into.

index.bs

huningxin · 2024-02-26T01:37:37Z

index.bs

+  </summary>
+  <div class=algorithm-steps>
+    1. Let |effectiveFilterSize| be ( |filterSize| - 1 ) * |dilation| + 1.
+    1. Let |outputSize| be ( |inputSize| - |effectiveFilterSize| + |beginningPadding| + |endingPadding| ) / |stride| + 1.


Should we handle any errors? Like |outputSize| is overflow or underflow?

That's a good question... the specs I've been involved in usually operate in an abstract space with infinite precision, unlimited range, etc., with very limited checking.

I noticed the Chromium prototype impl is using checked math (i.e. testing underflow/overflow), and given the sizes that ML deals with it, it seems like this is going to be a practical concern.

We can spec this any way we want... ideas welcome. We should look at other specs for inspiration, too.

Hmm, I've always found when implementing specs (like bidi, line breaking, vertical layout...) that I was grateful they left out that degree of fine validation and instead focused on the algorithm itself. Any validation specific to the nature of the operators (e.g. maybe an operator only supports even sizes) belongs in there, but too much otherwise muddies an already complex document. Maybe we can even call that out somewhere in a common section, that the implementation should handle overflow/underflow/safe limit checks, but not as extra prose for each operator.

index.bs

fdwr

😮 This must have taken a while. It's certainly an improvement over "Let outputShape be the result of invoking the underlying implementation for calculating output dimensions, given options". Approved after typo fix. Thanks J.

index.bs

inexorabletash · 2024-02-27T19:13:26Z

Marking as draft. Once #587 merges I'll push a rebased version. I have it locally and it's much smaller; most of the lines now handle the nchw/nhwc layout unpacking/packing.

This covers: * Reduction ops (including argMin/argMax) * conv2d * convTranspose2d * Pooling ops (which rely on conv2d) Fixes webmachinelearning#500

Co-authored-by: Ningxin Hu <[email protected]>

inexorabletash · 2024-02-27T21:53:37Z

Okay, rebased - sorry about the forced push, but it should be ready for another review now.

huningxin

LGTM, thanks much!

SHA: 69e7ad6 Reason: push, by huningxin Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

huningxin reviewed Feb 26, 2024

View reviewed changes

inexorabletash force-pushed the content-output-shapes branch from f66b135 to 15aabf7 Compare February 26, 2024 16:59

fdwr reviewed Feb 27, 2024

View reviewed changes

index.bs Show resolved Hide resolved

fdwr approved these changes Feb 27, 2024

View reviewed changes

index.bs Outdated Show resolved Hide resolved

index.bs Show resolved Hide resolved

inexorabletash marked this pull request as draft February 27, 2024 19:04

inexorabletash and others added 7 commits February 27, 2024 13:47

Content: Specify output size calculations

4fa6341

This covers: * Reduction ops (including argMin/argMax) * conv2d * convTranspose2d * Pooling ops (which rely on conv2d) Fixes webmachinelearning#500

Update index.bs

c252161

Co-authored-by: Ningxin Hu <[email protected]>

Update index.bs

9a9df16

Co-authored-by: Ningxin Hu <[email protected]>

calculate pool -> calculate pool2d

089afc0

pull axes validation back out of size calc

3a03033

rebase on no-autopad

2fd03fa

future-proof single dimension output size calc algorithm names

2f9868c

inexorabletash force-pushed the content-output-shapes branch from 5989deb to 2f9868c Compare February 27, 2024 21:51

inexorabletash marked this pull request as ready for review February 27, 2024 21:52

inexorabletash mentioned this pull request Feb 27, 2024

Rename inputSize variables as inputRank in algorithms #588

Closed

fdwr approved these changes Feb 27, 2024

View reviewed changes

huningxin approved these changes Feb 28, 2024

View reviewed changes

huningxin merged commit 69e7ad6 into webmachinelearning:main Feb 28, 2024
1 check passed

github-actions bot added a commit that referenced this pull request Feb 28, 2024

Merge pull request #582 from inexorabletash/content-output-shapes

25e708e

SHA: 69e7ad6 Reason: push, by huningxin Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

inexorabletash deleted the content-output-shapes branch February 28, 2024 02:26

inexorabletash mentioned this pull request Mar 21, 2024

Need clarify scale factor for resample2d #610

Closed

inexorabletash mentioned this pull request Apr 5, 2024

Behavior for numeric overflows/underflows/etc in algorithms #636

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Content: Specify output size calculations #582

Content: Specify output size calculations #582

inexorabletash commented Feb 24, 2024 •

edited by pr-preview bot

Loading

inexorabletash commented Feb 24, 2024

zolkis commented Feb 24, 2024

inexorabletash commented Feb 24, 2024

huningxin Feb 26, 2024

inexorabletash Feb 26, 2024

fdwr Feb 27, 2024 •

edited

Loading

fdwr left a comment •

edited

Loading

inexorabletash commented Feb 27, 2024 •

edited

Loading

inexorabletash commented Feb 27, 2024

huningxin left a comment

Content: Specify output size calculations #582

Content: Specify output size calculations #582

Conversation

inexorabletash commented Feb 24, 2024 • edited by pr-preview bot Loading

inexorabletash commented Feb 24, 2024

zolkis commented Feb 24, 2024

inexorabletash commented Feb 24, 2024

huningxin Feb 26, 2024

Choose a reason for hiding this comment

inexorabletash Feb 26, 2024

Choose a reason for hiding this comment

fdwr Feb 27, 2024 • edited Loading

Choose a reason for hiding this comment

fdwr left a comment • edited Loading

Choose a reason for hiding this comment

inexorabletash commented Feb 27, 2024 • edited Loading

inexorabletash commented Feb 27, 2024

huningxin left a comment

Choose a reason for hiding this comment

inexorabletash commented Feb 24, 2024 •

edited by pr-preview bot

Loading

fdwr Feb 27, 2024 •

edited

Loading

fdwr left a comment •

edited

Loading

inexorabletash commented Feb 27, 2024 •

edited

Loading