
Introducing BatchBlock #1192

Merged: 12 commits into main on Jul 11, 2023

Conversation

@marcromeyn (Contributor) commented Jul 10, 2023

Goals ⚽

BatchBlock will be used inside the Model to create the Batch object. It's also useful for things like masking & padding.
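The core idea can be sketched as follows. This is an illustrative, hypothetical rendering, not the actual Merlin API: the `Batch` and `BatchBlock` classes below only show the shape of the design, a module that packages feature and target dicts into a single batch object that later transforms (such as masking and padding) can operate on.

```python
import torch
from typing import Dict, Optional


class Batch:
    """Illustrative container pairing feature tensors with optional targets."""

    def __init__(
        self,
        features: Dict[str, torch.Tensor],
        targets: Optional[Dict[str, torch.Tensor]] = None,
    ):
        self.features = features
        self.targets = targets if targets is not None else {}


class BatchBlock(torch.nn.Module):
    """Illustrative block: builds a Batch from raw feature/target dicts."""

    def forward(
        self,
        features: Dict[str, torch.Tensor],
        targets: Optional[Dict[str, torch.Tensor]] = None,
    ) -> Batch:
        return Batch(features, targets)
```

Because the block is a `torch.nn.Module`, it can sit inside a model and be composed with other transform blocks.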

@marcromeyn marcromeyn added enhancement New feature or request area/pytorch labels Jul 10, 2023
@marcromeyn marcromeyn requested a review from sararb July 10, 2023 08:24
@marcromeyn marcromeyn self-assigned this Jul 10, 2023
@github-actions

Documentation preview

https://nvidia-merlin.github.io/models/review/pr-1192

@marcromeyn marcromeyn marked this pull request as ready for review July 10, 2023 08:46
@sararb (Contributor) left a comment


The PR looks good to me! I've just left some minor comments/questions.

```python
        A dictionary containing all the flattened features, targets, and sequences.
        """
        flat_dict: Dict[str, torch.Tensor] = self._flatten()
        dummy_tensor = torch.tensor(0)
```
@sararb: Can you explain why we need the `dummy_tensor` variable?

@marcromeyn (Jul 11, 2023): We never use it, but we need it to keep the dict typed as `Dict[str, torch.Tensor]`. We could store the original values instead, but that might take more memory, so I added the dummy. We only care about the keys of the original inputs.
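The trick can be sketched like this (`flatten_with_placeholders` is a hypothetical helper, not the actual implementation): every original input key is recorded, but each maps to the same 0-dim dummy tensor, which keeps the dict typed as `Dict[str, torch.Tensor]` (as scripting-friendly code requires) without holding on to the original values.

```python
import torch
from typing import Dict


def flatten_with_placeholders(inputs: Dict[str, torch.Tensor]) -> Dict[str, torch.Tensor]:
    # A single 0-dim placeholder keeps the value type torch.Tensor without
    # retaining references to the (possibly large) original tensors: only
    # the key set of the original inputs is preserved.
    dummy_tensor = torch.tensor(0)
    flat: Dict[str, torch.Tensor] = {}
    for key in inputs:
        flat["inputs." + key] = dummy_tensor
    return flat
```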

Files discussed:
merlin/models/torch/models/base.py
merlin/models/torch/predict.py
merlin/models/torch/transforms/sequences.py
```python
result = batch.flatten_as_dict(input_batch)
assert len(result) == 9  # input keys are considered
assert len([k for k in result if k.startswith("inputs.")]) == 4
```
@sararb: What is the difference between `inputs.` and `features.` keys?

@marcromeyn: `inputs.` is what went into the batch transformation; it is used by the mechanism that restores values of the batch that were not transformed by any of the branches.
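A sketch of that restore mechanism (a hypothetical helper using plain dicts for clarity, not the actual implementation): keys under `inputs.` record everything that entered the transformation, so any key missing from the transformed `features.` entries can be carried over unchanged.

```python
from typing import Dict


def restore_untransformed(flat: Dict[str, object]) -> Dict[str, object]:
    # Split the flattened dict by prefix.
    inputs = {k.split(".", 1)[1]: v for k, v in flat.items() if k.startswith("inputs.")}
    features = {k.split(".", 1)[1]: v for k, v in flat.items() if k.startswith("features.")}
    # Any input key that no branch re-emitted is assumed untouched: restore it.
    for name, value in inputs.items():
        features.setdefault(name, value)
    return features
```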


```python
def test_in_parallel(self):
    feat, target = torch.tensor([1, 2]), torch.tensor([3, 4])
    outputs = module_utils.module_test(
```
@sararb: +1 to how we can now create different preprocessing blocks for each subset of the inputs! I love it!!

@marcromeyn marcromeyn merged commit 3431321 into main Jul 11, 2023
@marcromeyn marcromeyn deleted the torch/batch-block branch July 11, 2023 16:58