
Adding MMOE & PLE #1173

Merged
marcromeyn merged 13 commits into main from torch/experts on Jul 11, 2023
Conversation

marcromeyn (Contributor) commented on Jul 3, 2023

Goals ⚽

This PR introduces mixture-of-experts (MMOE) and PLE/CGC blocks. With these we should be able to write a PyTorch version of the multi-task blog post.
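
For readers unfamiliar with MMoE, here is a minimal sketch of the idea in plain PyTorch. It is illustrative only: the `TinyMMoE` class, its names, and its shapes are not part of this PR or the Merlin API, they just show how per-task gates mix a shared pool of experts.

```python
import torch
import torch.nn as nn


class TinyMMoE(nn.Module):
    """Illustrative MMoE: each task mixes the same experts with its own gate."""

    def __init__(self, dim: int, num_experts: int, num_tasks: int):
        super().__init__()
        self.experts = nn.ModuleList(nn.Linear(dim, dim) for _ in range(num_experts))
        self.gates = nn.ModuleList(nn.Linear(dim, num_experts) for _ in range(num_tasks))

    def forward(self, x: torch.Tensor):
        # Stack expert outputs: (batch, num_experts, dim)
        expert_out = torch.stack([expert(x) for expert in self.experts], dim=1)
        outputs = []
        for gate in self.gates:
            # Per-task softmax gate over experts: (batch, num_experts, 1)
            weights = torch.softmax(gate(x), dim=-1).unsqueeze(-1)
            # Weighted sum of the experts for this task: (batch, dim)
            outputs.append((weights * expert_out).sum(dim=1))
        return outputs
```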

marcromeyn added the enhancement and area/pytorch labels on Jul 3, 2023
marcromeyn self-assigned this on Jul 3, 2023
github-actions bot commented on Jul 3, 2023

Documentation preview

https://nvidia-merlin.github.io/models/review/pr-1173

marcromeyn marked this pull request as ready for review on Jul 7, 2023, 09:14
gabrielspmoreira (Member) left a comment


The API is very clean. I debugged the tests to better understand it, and the implementation seems OK. I just added some optional suggestions.

def test_init_with_outputs(self):
    # Two binary tasks, each given its own tower, sharing one MMOE block of experts.
    outputs = mm.ParallelBlock({"a": mm.BinaryOutput(), "b": mm.BinaryOutput()})
    outputs.prepend_for_each(mm.MLPBlock([2]))
    outputs.prepend(MMOEBlock(mm.MLPBlock([2, 2]), 2, outputs))
gabrielspmoreira (Member):

The API is very clean! But building MTL models from the output to the input, while it works, might be counter-intuitive. Will that be the pattern for MTL models?
That said, I understand the challenges of building such MMOE models from the inputs: the gates depend on the number of experts, and the number of gates and towers depends on the number of outputs.

marcromeyn (Contributor, Author):

Yeah, it might make sense to add an MMOEOutputs or something that does this under the hood.
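
For illustration, a hypothetical MMOEOutputs-style helper could simply wrap the output-to-input wiring shown in the test above. This is not part of the PR; it only mirrors the call pattern from that snippet, and the exact Merlin signatures (`prepend_for_each`, `prepend`, `MMOEBlock`) are assumed to behave as they do there.

```python
def mmoe_outputs(expert_block, num_experts, outputs, tower_block=None):
    # Hypothetical convenience helper: hides the output-to-input construction.
    if tower_block is not None:
        # Give each task output its own tower, as in the test above.
        outputs.prepend_for_each(tower_block)
    # Prepend the MMOE block so the gates are derived from the task outputs.
    outputs.prepend(MMOEBlock(expert_block, num_experts, outputs))
    return outputs
```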

outputs : ParallelBlock
    The output block.
shared_gate : bool, optional
    If true, use a shared gate for all tasks. Defaults to False.
gabrielspmoreira (Member):

I think this argument name and its docstring are misleading. When num_shared_experts > 0, all task gates will use the shared experts, right?
This shared_gate argument seems to be responsible for making the output of the shared experts (the shortcut and experts keys) available from one CGCBlock to the next one in the PLE architecture. If that is the case, we could try to clarify that.
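
To make the distinction concrete, here is a rough CGC sketch in plain PyTorch. It is illustrative only and not the implementation in this PR: every task gate mixes that task's experts together with the shared experts, while shared_gate adds one extra gate whose mixed output is what a stacked PLE layer would hand to the next CGCBlock (assuming the reading of the argument above is correct).

```python
import torch
import torch.nn as nn


class TinyCGC(nn.Module):
    """Illustrative CGC layer: task gates see task + shared experts;
    an optional shared gate produces the input for the next CGC layer."""

    def __init__(self, dim, num_task_experts, num_shared_experts, num_tasks, shared_gate=False):
        super().__init__()
        self.shared = nn.ModuleList(nn.Linear(dim, dim) for _ in range(num_shared_experts))
        self.task_experts = nn.ModuleList(
            nn.ModuleList(nn.Linear(dim, dim) for _ in range(num_task_experts))
            for _ in range(num_tasks)
        )
        self.task_gates = nn.ModuleList(
            nn.Linear(dim, num_task_experts + num_shared_experts) for _ in range(num_tasks)
        )
        self.shared_gate = (
            nn.Linear(dim, num_tasks * num_task_experts + num_shared_experts)
            if shared_gate
            else None
        )

    def forward(self, x):
        shared_out = [expert(x) for expert in self.shared]
        outputs = {}
        for i, (experts, gate) in enumerate(zip(self.task_experts, self.task_gates)):
            # Each task gate mixes its own experts plus the shared experts.
            candidates = torch.stack([expert(x) for expert in experts] + shared_out, dim=1)
            weights = torch.softmax(gate(x), dim=-1).unsqueeze(-1)
            outputs[f"task_{i}"] = (weights * candidates).sum(dim=1)
        if self.shared_gate is not None:
            # The shared gate mixes every expert; the next PLE layer would
            # consume this representation on its shared path.
            all_out = torch.stack(
                [expert(x) for experts in self.task_experts for expert in experts] + shared_out,
                dim=1,
            )
            weights = torch.softmax(self.shared_gate(x), dim=-1).unsqueeze(-1)
            outputs["shared"] = (weights * all_out).sum(dim=1)
        return outputs
```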

"outputs": outputs,
}
super().__init__(*CGCBlock(shared_gate=True, **cgc_kwargs).repeat(depth - 1))
self.append(CGCBlock(**cgc_kwargs))
gabrielspmoreira (Member):

Good trick to avoid outputting the shared experts in the last layer.
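
Reusing the TinyCGC sketch above, the stacking pattern would look roughly like this. The routing is heavily simplified (real PLE also forwards the per-task outputs to the next layer); it only shows why the intermediate layers need the shared gate while the last one does not.

```python
class TinyPLE(nn.Module):
    """Illustrative PLE stack built from the TinyCGC sketch above."""

    def __init__(self, dim, depth, **cgc_kwargs):
        super().__init__()
        # Intermediate layers keep the shared gate so the shared path flows onward.
        layers = [TinyCGC(dim, shared_gate=True, **cgc_kwargs) for _ in range(depth - 1)]
        # The last layer drops the shared gate: it only emits the per-task outputs.
        layers.append(TinyCGC(dim, shared_gate=False, **cgc_kwargs))
        self.layers = nn.ModuleList(layers)

    def forward(self, x):
        for layer in self.layers[:-1]:
            # Simplified: feed only the mixed shared representation forward.
            x = layer(x)["shared"]
        return self.layers[-1](x)
```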

merlin/models/torch/blocks/experts.py (outdated comment thread, resolved)
marcromeyn merged commit d2113e8 into main on Jul 11, 2023
37 checks passed
marcromeyn deleted the torch/experts branch on Jul 11, 2023, 08:35