New content: Add definition for shape broadcasting #534

inexorabletash · 2024-01-26T22:54:28Z

This change introduces a new section for Algorithms, following APIs, to collect algorithms referenced throughout the specification.

A section for Broadcasting is introduced, which defines broadcasting shapes and gives an explicit algorithm matching WebNN implementations of NumPy's General Broadcasting Rules. Definitions for "broadcastable" and "unidirectionally broadcastable" are introduced. The previous definition of "broadcast-shapes" is removed in favor of these new algorithms.

For #324, #462, and potentially #523.

Preview | Diff

inexorabletash · 2024-01-26T22:56:09Z

I'm still pretty hazy on definitions, so please look at this very skeptically.

Re: introducing a new "Algorithms" section - this is a pattern I've adopted from other specs. I thought I'd give a whirl here and see what everyone thinks. It's easy enough to drop and put these definitions anywhere else to keep the overall spec structure unchanged.

inexorabletash · 2024-01-26T23:50:11Z

~~Hold off on reviews, I was confusing unidirectional (mentioned in several places in the spec) and bidirectional (mentioned in the Chromium implementation). Fridays, am I right?~~

inexorabletash · 2024-01-27T00:14:06Z

Okay, this is updated to match the Chromium impl, I think. But... that may not be what's desired @a-sully mentions that unidirectional broadcasting seems specific to ONNX and other frameworks do things to work around this during export.

a-sully · 2024-01-27T01:00:25Z

AFAICT ONNX is an outlier in making a distinction between "unidirectional" and "multidirectional" broadcasting. This distinction is not made by:

Numpy: https://numpy.org/doc/stable/user/basics.broadcasting.html
PyTorch: https://pytorch.org/docs/stable/notes/broadcasting.html (which references Numpy)
Tensorflow/XLA/StableHLO: https://www.tensorflow.org/xla/broadcasting (which also references Numpy, though it has some other restrictions)

The "unidirectional broadcastable" constraint of some ONNX ops (e.g. prelu()) requires workarounds when exporting from other formats to ONNX - like in this example of using TVM to export PyTorch to ONNX: pytorch/pytorch#70570 (comment). The folks on that issue discovered that the "unidirectional broadcastable" constraint for prelu() was not documented (which has since been fixed) and that there were inconsistencies between ONNX implementations (which... hopefully has also been fixed ¯\(ツ)/¯ )

Numpy's broadcasting rules are the clear standard across the industry and IMHO is the standard that we should expose to the web. It then becomes up to the user agent to ensure that the constraints of the underlying framework (e.g. unidirectional broadcasting for ONNX, lack of inferred broadcasting specifications for XLA) are satisfied.

wacky6 · 2024-01-27T10:00:22Z

We follow Numpy broadcast rules for ops (in the chromium implementation), I think it's the golden "standard". +1 to adopt this throughout the spec (I didn't realize the spec matmul referred to an implementation defined broadcast 😱).

@a-sully mentions that unidirectional broadcasting seems specific to ONNX and other frameworks do things to work around this during export

Is ONNX unidirectional broadcast issue easy to workaround? The linked prelu() patch seems "easy enough".

Should we (from the spec's perspective) expect implementations (e.g. chromium DML backend)to do asid easy workarounds?

huningxin

LGTM with a comment, thanks much!

index.bs

huningxin · 2024-02-02T04:32:01Z

@fdwr and @wchao1115 , please also take a look. Thanks!

fdwr

Thank you Josh.

index.bs

fdwr · 2024-02-10T06:17:14Z

The unidirectional broadcastable constraint of some ONNX ops (e.g. prelu() requires workarounds when exporting from other formats ...

Huh, I would have presumed prelu was bidirectionally broadcastable too (TIL...). For operator behavior changes, I'd like it to be separate from this CR that's just adding definitions.

Is ONNX unidirectional broadcast issue easy to workaround? The linked prelu() patch seems "easy enough".

Yep, easy with a little reshape of leading 1's.

Should we (from the spec's perspective) expect implementations (e.g. chromium DML backend)to do asid easy workarounds?

Whichever broadcasting behavior we have for prelu, unidirectional or bidirectional, it's trivial either way for backends to insert leading 1's as needed (which the Chromium DML backend already does for DML, since DML expects elementwise tensor descriptions of each input to have been massaged to consistent rank first - any higher level concepts like broadcasting already resolved). In the worst case, backends can utilize whatever logic they have for expand to pre-expand tensors before forwarding them along.

inexorabletash · 2024-02-12T17:52:16Z

History was getting messy - I force-pushed a new squashed commit. It rolls in #564 to get the build working, but I'll rebase once that merges.

This change introduces a new section for Algorithms, following APIs, to collect algorithms referenced throughout the specification. A section for Broadcasting is introduced, which defines broadcasting shapes and gives an explicit algorithm matching WebNN implementations of NumPy's General Broadcasting Rules. Definitions for "broadcastable" and "unidirectionally broadcastable" are introduced. The previous definition of "broadcast-shapes" is removed in favor of these new algorithms. Use broadcasting definition in expand(), rather than bespoke steps For webmachinelearning#324, webmachinelearning#378, webmachinelearning#462, and potentially webmachinelearning#523. Co-authored-by: Dwayne Robinson <[email protected]>

fdwr · 2024-02-13T00:07:27Z

I force-pushed a new squashed commit.

Josh: 🤔 ~~Shoot, I'm unsure now if I merged your triangular CR via squash commit like I was supposed to.~~ Appears it was a single commit.

Except larger branch integrations where we want to retain history, that's the general policy for cleaner history. Then the CR author doesn't need to ever explicitly squash their changes, because force pushes in GitHub yield this unfortunate result for reviewers to minimize the review delta since last time...

fdwr

LGTM. 👞✨

inexorabletash · 2024-02-13T00:25:40Z

No worries... I tend to squash locally, force-push, and always review the full diff anyway to see what the actual change is going to be. I'll try to refrain from squashing/force-pushing here though, going forward!

fdwr

One more tiny fix after re-review, which I'll just (per your earlier permission) submit and complete this. Thanks Josh.

index.bs

SHA: a26b587 Reason: push, by fdwr Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

inexorabletash mentioned this pull request Jan 26, 2024

Wording change: Tighten up output shape calculation algorithms #523

Merged

inexorabletash force-pushed the content-broadcast-shapes branch 2 times, most recently from 8438403 to 74aef99 Compare January 27, 2024 00:09

inexorabletash force-pushed the content-broadcast-shapes branch from 74aef99 to 0712df2 Compare January 27, 2024 18:58

inexorabletash mentioned this pull request Feb 1, 2024

Reserve asserts only for non-obvious things #455

Closed

huningxin approved these changes Feb 2, 2024

View reviewed changes

index.bs Outdated Show resolved Hide resolved

inexorabletash force-pushed the content-broadcast-shapes branch 3 times, most recently from 1b63ff7 to cf22546 Compare February 7, 2024 17:42

inexorabletash requested a review from fdwr February 9, 2024 16:57

fdwr reviewed Feb 10, 2024

View reviewed changes

index.bs Outdated Show resolved Hide resolved

index.bs Outdated Show resolved Hide resolved

index.bs Show resolved Hide resolved

index.bs Outdated Show resolved Hide resolved

index.bs Outdated Show resolved Hide resolved

index.bs Show resolved Hide resolved

inexorabletash force-pushed the content-broadcast-shapes branch from 312a9e8 to 962ec76 Compare February 12, 2024 17:50

inexorabletash force-pushed the content-broadcast-shapes branch from 962ec76 to 7ce286f Compare February 12, 2024 21:54

fdwr approved these changes Feb 13, 2024

View reviewed changes

index.bs Outdated Show resolved Hide resolved

Fix prelu parameter order

145bff1

fdwr merged commit a26b587 into webmachinelearning:main Feb 13, 2024
2 checks passed

github-actions bot added a commit that referenced this pull request Feb 13, 2024

New content: Add definition for shape broadcasting (#534)

91d51f9

SHA: a26b587 Reason: push, by fdwr Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

inexorabletash deleted the content-broadcast-shapes branch February 13, 2024 06:09

a-sully mentioned this pull request Feb 28, 2024

Consider adopting new broadcasting rules #590

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New content: Add definition for shape broadcasting #534

New content: Add definition for shape broadcasting #534

inexorabletash commented Jan 26, 2024 •

edited by pr-preview bot

Loading

inexorabletash commented Jan 26, 2024

inexorabletash commented Jan 26, 2024 •

edited

Loading

inexorabletash commented Jan 27, 2024

a-sully commented Jan 27, 2024

wacky6 commented Jan 27, 2024

huningxin left a comment

huningxin commented Feb 2, 2024

fdwr left a comment

fdwr commented Feb 10, 2024 •

edited

Loading

inexorabletash commented Feb 12, 2024

fdwr commented Feb 13, 2024 •

edited

Loading

fdwr left a comment

inexorabletash commented Feb 13, 2024

fdwr left a comment •

edited

Loading

New content: Add definition for shape broadcasting #534

New content: Add definition for shape broadcasting #534

Conversation

inexorabletash commented Jan 26, 2024 • edited by pr-preview bot Loading

inexorabletash commented Jan 26, 2024

inexorabletash commented Jan 26, 2024 • edited Loading

inexorabletash commented Jan 27, 2024

a-sully commented Jan 27, 2024

wacky6 commented Jan 27, 2024

huningxin left a comment

Choose a reason for hiding this comment

huningxin commented Feb 2, 2024

fdwr left a comment

Choose a reason for hiding this comment

fdwr commented Feb 10, 2024 • edited Loading

inexorabletash commented Feb 12, 2024

fdwr commented Feb 13, 2024 • edited Loading

fdwr left a comment

Choose a reason for hiding this comment

inexorabletash commented Feb 13, 2024

fdwr left a comment • edited Loading

Choose a reason for hiding this comment

inexorabletash commented Jan 26, 2024 •

edited by pr-preview bot

Loading

inexorabletash commented Jan 26, 2024 •

edited

Loading

fdwr commented Feb 10, 2024 •

edited

Loading

fdwr commented Feb 13, 2024 •

edited

Loading

fdwr left a comment •

edited

Loading