
Conventions: Use "rank" for variable names, when appropriate #627

Merged

Conversation

@inexorabletash (Member) commented Mar 28, 2024

Fixes #588



@inexorabletash inexorabletash requested a review from fdwr March 28, 2024 21:11
@inexorabletash (Member Author)

@fdwr - can you take a look?

I think the remaining uses of "size" in variable names refer to element counts or magnitudes (e.g. width and height). Maybe I missed some? I was on the fence about the broadcast algorithms that still use sizeA/sizeB. I'm happy to convert those over, but wanted your opinion.

If you want to search the rendered spec for "size", I find it handy to remove the IDL and code samples first, e.g. by running this in the console: [...document.querySelectorAll('pre')].forEach(n=>n.remove())

@fdwr (Collaborator) commented Mar 29, 2024

Thanks Josh 🙂. Scanning for all "size" occurrences, I only found one other operator to replace with rank, plus some dubious size-related statements. Do these proposed edits look valid to you?

batchNormalization's current wording kinda makes it sound like the bias can only be a scalar (1 element) rather than a 1D array:

    1. If |options|.{{MLBatchNormalizationOptions/scale}} [=map/exists=]:
-       1. If its [=list/size=] is not 1, then [=exception/throw=] a {{TypeError}}.
+       1. If its [=MLOperand/rank=] is not 1, then [=exception/throw=] a {{TypeError}}.
        1. If |options|.{{MLBatchNormalizationOptions/scale}}'s [=MLOperand/shape=][0] is not equal to |input|'s [=MLOperand/shape=][|options|.{{MLBatchNormalizationOptions/axis}}], then [=exception/throw=] a {{TypeError}}.
    1. If |options|.{{MLBatchNormalizationOptions/bias}} [=map/exists=]:
-       1. If its [=list/size=] is not 1, then [=exception/throw=] a {{TypeError}}.
+       1. If its [=MLOperand/rank=] is not 1, then [=exception/throw=] a {{TypeError}}.
        1. If |options|.{{MLBatchNormalizationOptions/bias}}'s [=MLOperand/shape=][0] is not equal to |input|'s [=MLOperand/shape=][|options|.{{MLBatchNormalizationOptions/axis}}], then [=exception/throw=] a {{TypeError}}.
    1. *Make graph connections:*
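The corrected checks above can be sketched in JavaScript (an illustrative sketch only; the function and parameter names are assumptions, not spec identifiers): scale and bias must be 1-D operands whose single dimension matches the input's size along the chosen axis.

```javascript
// Sketch of the corrected batchNormalization validation for scale/bias.
// operandShape: shape of the scale or bias operand (array of dimensions).
// inputShape:   shape of the input operand.
// axis:         the batchNormalization axis option.
// Returns true when the operand passes both checks above.
function validateScaleOrBias(operandShape, inputShape, axis) {
  // Rank must be 1 (a 1-D array, not "list size 1").
  if (operandShape.length !== 1) return false;
  // Its single dimension must match the input's size along `axis`.
  if (operandShape[0] !== inputShape[axis]) return false;
  return true;
}

// validateScaleOrBias([3], [2, 3, 4, 5], 1)    → true  (1-D, length 3)
// validateScaleOrBias([3, 1], [2, 3, 4, 5], 1) → false (rank 2, not 1)
```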

Since the rest of these are size related, but not about the rank, would you rather these be a separate CR?

reduce currently sounds like it preserves any input dimensions of length 1, but actually it sets reduced output dimensions to length 1. Also list size looks wrong because it's not about the list size, but rather the dimension values inside the list:

    : <dfn>keepDimensions</dfn>
    ::
-        If true, retains reduced dimensions with [=list/size=] 1.
+        If true, the output has the same rank as the input, setting any reduced dimensions to size 1.
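To make the proposed wording concrete, here is an illustrative sketch (names are assumptions, not spec text) of how keepDimensions affects a reduction's output shape: the output keeps the input's rank and each reduced dimension becomes size 1, rather than dimensions of size 1 being "retained".

```javascript
// Compute a reduction's output shape.
// inputShape:     the input operand's shape (array of dimensions).
// axes:           indices of the dimensions being reduced.
// keepDimensions: if true, reduced dimensions are kept with size 1.
function reduceOutputShape(inputShape, axes, keepDimensions) {
  const output = [];
  for (let i = 0; i < inputShape.length; i++) {
    if (axes.includes(i)) {
      // Reduced dimension: kept as size 1, or dropped entirely.
      if (keepDimensions) output.push(1);
    } else {
      output.push(inputShape[i]);
    }
  }
  return output;
}

// reduceOutputShape([2, 3, 4], [1], true)  → [2, 1, 4] (same rank as input)
// reduceOutputShape([2, 3, 4], [1], false) → [2, 4]
```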

slice checking the list size for 0 is redundant because there's a check below that the list size == the input rank anyway (and arguably rejecting an empty slice list for scalars is wrong anyway). I believe this statement was intended to check each dimension length != 0 inside the list rather than the list size:

<details open algorithm>
  <summary>
    The <dfn method for=MLGraphBuilder>slice(|input|, |starts|, |sizes|)</dfn> method steps are:
  </summary>
    1. If [=MLGraphBuilder/validating operand=] with [=this=] and |input| returns false, then [=exception/throw=] a {{TypeError}}.
-   1. If |sizes|'s [=list/size=] is 0, then [=exception/throw=] a {{TypeError}}.
+   1. If any of |sizes|'s have size 0, then [=exception/throw=] a {{TypeError}}.
    1. If |starts|'s [=list/size=] and |sizes|'s [=list/size=] are not both equal to |input|'s [=MLOperand/rank=], then [=exception/throw=] a {{TypeError}}.
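The intended distinction — checking the dimension values inside the list, not the list's size — can be sketched like this (an illustrative sketch; the function name and boolean-return shape are assumptions, since the spec throws a TypeError instead):

```javascript
// Sketch of the intended slice() validation.
// inputRank: the input operand's rank.
// starts, sizes: arrays of per-dimension slice offsets and lengths.
function validateSlice(inputRank, starts, sizes) {
  // Intended check: reject any dimension length of 0 *inside* sizes,
  // not a sizes list whose own size is 0.
  if (sizes.some((size) => size === 0)) return false;
  // Both lists must match the input's rank (this also rejects an
  // empty list for a non-scalar input, making a separate check redundant).
  if (starts.length !== inputRank || sizes.length !== inputRank) return false;
  return true;
}

// validateSlice(2, [0, 1], [2, 0]) → false (a requested size is 0)
// validateSlice(2, [0, 1], [2, 3]) → true
```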

I'm okay with sizeA here because it's actually about the shape's size. Though, somebody reported to me that "Let sizeA be A's size" was confusing because it sounded like it meant tensor A (since we often use capital letters for tensors), but in this case A is actually a shape. How about:

- To unidirectionally broadcast the shapes A and B, perform the following steps. A and B are [lists](https://infra.spec.whatwg.org/#list) of positive integers, representing the dimensions of tensors, and the steps return a new [list](https://infra.spec.whatwg.org/#list) of positive integers, or failure.
+ To unidirectionally broadcast the shapes shapeA and shapeB, perform the following steps. shapeA and shapeB are [lists](https://infra.spec.whatwg.org/#list) of positive integers, representing the dimensions of tensors, and the steps return a new [list](https://infra.spec.whatwg.org/#list) of positive integers, or failure.

- 1. Let |sizeA| be |A|'s [=list/size=].
+ 1. Let |sizeA| be |shapeA|'s [=list/size=].
- 1. Let |sizeB| be |B|'s [=list/size=].
+ 1. Let |sizeB| be |shapeB|'s [=list/size=].
1. If |sizeB| > |sizeA|, then return failure.
- 1. Let |paddedB| be a [=list/clone=] of |B|.
+ 1. Let |paddedB| be a [=list/clone=] of |shapeB|.
1. While |paddedB|'s [=list/size=] is less than |sizeA|, [=list/prepend=] 1 to |paddedB|.
1. Let |outputShape| be a new [=/list=].
1. [=list/For each=] |index| in [=the range=] 0 to |sizeA|, exclusive:
-    1. Let |dimA| be |A|[|index|].
+    1. Let |dimA| be |shapeA|[|index|].
    1. Let |dimB| be |paddedB|[|index|].
    1. If |dimA| is not equal to |dimB| and |dimA| is not equal to 1, then return failure.
    1. [=list/Append=] |dimA| to |outputShape|.
1. Return |outputShape|.
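The renamed algorithm above transcribes directly into JavaScript (an illustrative sketch mirroring the quoted steps verbatim, not spec code; the function name is an assumption, and null stands in for "failure"):

```javascript
// Unidirectionally broadcast shapeA and shapeB, per the steps above.
// shapeA, shapeB: arrays of positive integers (tensor dimensions).
// Returns the output shape, or null for failure.
function unidirectionalBroadcast(shapeA, shapeB) {
  const sizeA = shapeA.length;
  const sizeB = shapeB.length;
  if (sizeB > sizeA) return null;
  // Clone shapeB, then prepend 1s until it reaches sizeA.
  const paddedB = Array(sizeA - sizeB).fill(1).concat(shapeB);
  const outputShape = [];
  for (let index = 0; index < sizeA; index++) {
    const dimA = shapeA[index];
    const dimB = paddedB[index];
    // Mirrors the quoted failure condition exactly.
    if (dimA !== dimB && dimA !== 1) return null;
    outputShape.push(dimA);
  }
  return outputShape;
}

// unidirectionalBroadcast([2, 3], [2, 3]) → [2, 3]
// unidirectionalBroadcast([2], [2, 3])    → null (sizeB > sizeA)
```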

@inexorabletash (Member Author)

I only found one other operator to replace with rank

Just to confirm: this was referring to the batchNormalization issues, not a separate issue?

would you rather these be a separate CR?

Nah, too much noise, this is all related.

, plus some dubious size-related statements. Do these proposed edits look valid to you?

Yes - incorporated verbatim in 38f9261, except in slice():

-   1. If |sizes|'s [=list/size=] is 0, then [=exception/throw=] a {{TypeError}}.
+   1. If any of |sizes|'s have size 0, then [=exception/throw=] a {{TypeError}}.

I made this change instead:

-    1. If |sizes|'s [=list/size=] is 0, then [=exception/throw=] a {{TypeError}}.
+    1. If any of |sizes|'s [=list/items=] are 0, then [=exception/throw=] a {{TypeError}}.

.. and made sure |A| became |shapeA| etc. everywhere. I'll add a convention note.

@fdwr (Collaborator) left a comment:

LGTM JB. TY. 😎

@fdwr (Collaborator) commented Apr 3, 2024

this was referring to the batchNormalization issues, not a separate issue?

Yep, it looked like you found every other case of size -> rank for other operators.

@huningxin Any thoughts on this one?

@inexorabletash inexorabletash requested a review from huningxin April 3, 2024 19:48
@huningxin (Contributor) left a comment:

LGTM, thanks @inexorabletash !

@huningxin huningxin merged commit e52f163 into webmachinelearning:main Apr 4, 2024
2 checks passed
github-actions bot added a commit that referenced this pull request Apr 4, 2024
SHA: e52f163
Reason: push, by huningxin

@inexorabletash inexorabletash deleted the conventions-rank-not-size branch April 4, 2024 18:22

Successfully merging this pull request may close these issues.

Rename inputSize variables as inputRank in algorithms
3 participants