
Clarify intermediate values (activations) vs MLActivation vs operations #335

Closed
zolkis opened this issue Feb 1, 2023 · 5 comments

Comments

@zolkis
Collaborator

zolkis commented Feb 1, 2023

Given the recent consecutive changes, I think we should review and sort out the use of the following terms in the spec:

  • intermediate values (activations)
  • MLActivation interface (former MLOperator)
  • operations (both ops and activation functions).

The latter are supposed to have functional semantics: they hold no state and do not change the state of other objects. Therefore they are fine being represented as op-building functions on MLGraphBuilder.
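Since both kinds end up as methods on MLGraphBuilder, the distinction is easiest to see in code. Below is a toy model (illustrative only; the class names and record shapes are made up, not the real WebNN API): op methods consume and produce operand records, while an activation helper returns a stateless, name-only descriptor that a fusable op accepts via its options.

```javascript
// Toy model of the ops-vs-activations split; NOT the real WebNN API.
class ToyActivation {
  constructor(name) { this.name = name; } // stateless: just a label
}

class ToyGraphBuilder {
  // An op: consumes operands, returns a new operand (a graph node).
  add(a, b) { return { op: 'add', inputs: [a, b] }; }

  // A fusable op: can take an activation as an option.
  conv2d(input, filter, options = {}) {
    return { op: 'conv2d', inputs: [input, filter],
             activation: options.activation?.name ?? null };
  }

  // An activation helper: returns a descriptor, not an operand.
  relu() { return new ToyActivation('relu'); }
}

const builder = new ToyGraphBuilder();
const conv = builder.conv2d({ op: 'input' }, { op: 'constant' },
                            { activation: builder.relu() });
console.log(conv.activation); // 'relu'
```

The point of the toy is that `relu()` here produces no graph node and mutates nothing; it is merely a tag that the `conv2d` op interprets, which is what "functional semantics" buys us.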

However,

  1. Should we consider marking the activation functions among ops?
  2. Would a separate MLOperator definition still be useful, given that it continues to be used in implementation(s)?

(Possibly a V2 discussion, but it would be nice to sort out in V1. The text definitely needs checking in V1.)
If there is a clear explanation, we just need to update the spec text, explainer, etc.

@zolkis
Collaborator Author

zolkis commented Mar 1, 2023

Having worked on several algorithms and having studied the implementation, I don't find the current usage of MLActivation sufficiently clear.

I suggest we revert to the MLOperator interface name, and use the name activation for arguments of type MLOperator, for instance in batchNormalization(), conv2d(), etc.

In addition, we need constructors for MLOperator and MLOperand (I have written them but haven't made a PR yet).

This would make the spec clearer and also track at least the CPU implementation more closely.

@huningxin
Contributor

@zolkis , thanks for your feedback.

There is a review-in-progress Chromium CL that exposes MLActivation and makes MLOperator internal. You may want to check it out. Thanks!

@zolkis
Collaborator Author

zolkis commented Mar 2, 2023

Thanks @huningxin.
We still have a problem in the spec: MLGraph is under-specified and MLActivation is an empty interface.
I have made an attempt to address the latter; it only requires an internal slot for a name and, possibly, an internal function.
We should also think about expressing the graph structure of MLGraph more exactly, so that we could refer to it in algorithms with more formalized prose than plain English. :)
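On that last point, one hypothetical way to give MLGraph an explicit structure that algorithm steps could reference (a sketch under assumed record shapes, not spec text): keep a node list in topological order, built by walking back from the outputs. Spec prose could then say things like "for each node of graph's nodes, in order".

```javascript
// Hypothetical internal representation for an MLGraph's structure.
// Operands are records of the form { op, inputs }; the graph is the
// list of reachable operands with inputs preceding their consumers.
function buildGraph(output) {
  const nodes = [];
  const seen = new Set();
  (function visit(operand) {        // post-order walk from the output
    if (seen.has(operand)) return;
    seen.add(operand);
    (operand.inputs ?? []).forEach(visit);
    nodes.push(operand);            // inputs are pushed before consumers
  })(output);
  return { nodes };                 // nodes are in topological order
}

const x = { op: 'input', inputs: [] };
const w = { op: 'constant', inputs: [] };
const y = { op: 'matmul', inputs: [x, w] };
const graph = buildGraph(y);
console.log(graph.nodes.map(n => n.op)); // ['input', 'constant', 'matmul']
```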

@inexorabletash
Member

@zolkis - do you think we need to keep this open, or is it covered by more specific issues e.g. #448 #457 #549 #552 ? (Many of which are overlapping/redundant too...)

@zolkis
Collaborator Author

zolkis commented Feb 22, 2024

This is an old problem/discussion that has been addressed from multiple sides during the past year. Closing this for now.

@zolkis zolkis closed this as completed Feb 22, 2024