Introducing structured model outputs #316

laserkelvin · 2024-11-12T19:30:15Z

This PR is semi-related to #104, allowing flexibility for wrappers and full implementations to potentially have a common output format that is also "backwards" compatible. It depends on ModelOutput from #315, so please review and merge that before this PR.

Added a private attribute of the base AbstractTask class (which graph models inherit) called __skip_output_heads__, which BaseTaskLitModules will check this to determine if output heads are needed or not. If we do not use them, we do not initialize the OutputHeads to save on parameters/ensure we don't have unused parameters.
I have refactored the MACEWrapper as an example of how to do this, and we should do so for other wrappers as well (e.g. CHGNet). I've also updated the sphinx documentation to detail how this can be used.
I've ensured that the regular workflow (with PyG EGNN) isn't broken, and included unit tests for MACEWrapper to run the forward pass and an end-to-end with ForceRegressionTask.

…put head

…finition While having the argument means we have choice, it does make it more difficult to control the logic

…elOutputs

Signed-off-by: Lee, Kin Long Kelvin <[email protected]>

laserkelvin · 2024-11-12T20:57:59Z

CI passed on relevant things

melo-gonzo

Looks good overall! Will be a nice improvement to have. Left one general comment about downstream workflows.

melo-gonzo · 2024-11-12T22:26:29Z

matsciml/models/base.py

+                if isinstance(encoder_outputs, Embeddings):
+                    embeddings = encoder_outputs
+                # for BYO output head cases
+                elif isinstance(encoder_outputs, ModelOutput):


A comment that may not be super important to consider right at this moment but should be noted - some workflows, such as serving models with OpenKIM's kusp to then run benchmarks, require that node energies are present. Some models (such as mace) can output these directly and they may be worth hanging onto as well.

I think they're included in ModelOutput - in the wrapper output they're stashed as node_energies. That's what you mean right?

laserkelvin added 30 commits October 17, 2024 11:09

refactor: making output head construction quasi-optional

6dc2090

fix: using moduledict instead of regular dict to pass assertion

b1b1749

refactor: making forward pass predictions if not embeddings

25d4321

refactor: stashing embeddings if included in encoder outputs

8cfc5e5

refactor: adding check to validate encoder outputs

1873f56

refactor: return encoder predictions directly for force regression

f2dbeb1

Merge branch 'main' into byo-outputs

8abd7a9

refactor: added private variable for abstract tasks to signal byo out…

9724f89

…put head

refactor: updating forward signature with non-embedding output

64c9aa5

Merge branch 'structured-model-output-type' into byo-outputs

78446c3

refactor: introducing model output data structure

09e10fe

Merge branch 'structured-model-output-type' into byo-outputs

fc28949

refactor: updated output signatures to include ModelOutput

9e70a70

refactor: allowing high level forward call to skip output heads

87e0c43

refactor: mapping model outputs to process embeddings

87c7d92

refactor: removing kwarg for using encoder_predictions in favor of de…

c075a55

…finition While having the argument means we have choice, it does make it more difficult to control the logic

refactor: changing forward signature to emit ModelOutput or dict

d9ba520

refactor: using getattr instead of dict lookup to support dict or Mod…

a8db278

…elOutputs

refactor: adding dict case in addition to getattr

74a14e2

refactor: making MACE wrapper emit model output structure

d5d88b2

refactor: flipping default state for forces in MACE to be disabled

69565a9

Merge branch 'structured-model-output-type' into byo-outputs

8d1a882

test: updating MACE wrapper test to work for model outputs

51d1f52

refactor: updating force regression to allow ModelOutput

d155f1c

fix: added missing macewrapper to pyg model namespace

408844b

Signed-off-by: Lee, Kin Long Kelvin <[email protected]>

fix: mapping unit shifts into MACE kwargs

2829cdd

test: adding unit test for BYO outputs represented by MACE

af0d9f1

docs: added documentation about using modeloutput data structure

60513a9

Signed-off-by: Lee, Kin Long Kelvin <[email protected]>

refactor: adding model output key check in ForceRegressionTask

3a091c2

refactor: skipping output kwargs if encoder doesn't use output heads

52dfeff

laserkelvin added enhancement New feature or request code maintenance Issue/PR for refactors, code clean up, etc. labels Nov 12, 2024

laserkelvin requested a review from melo-gonzo November 12, 2024 19:30

laserkelvin added 4 commits November 12, 2024 11:32

docs: updated design intention with model outputs

f2bbf30

Merge branch 'main' into byo-outputs

2c0b6b8

fix: correcting missing FAENet import in pyg namespace

abaf72f

Signed-off-by: Lee, Kin Long Kelvin <[email protected]>

fix: correcting model to encoder in expected attribute name

2e4a5ba

Merge branch 'main' into byo-outputs

a1e98ba

melo-gonzo approved these changes Nov 12, 2024

View reviewed changes

laserkelvin merged commit b522337 into IntelLabs:main Nov 12, 2024
2 of 3 checks passed

laserkelvin deleted the byo-outputs branch November 12, 2024 23:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introducing structured model outputs #316

Introducing structured model outputs #316

laserkelvin commented Nov 12, 2024 •

edited

Loading

laserkelvin commented Nov 12, 2024

melo-gonzo left a comment

melo-gonzo Nov 12, 2024

laserkelvin Nov 12, 2024

Introducing structured model outputs #316

Introducing structured model outputs #316

Conversation

laserkelvin commented Nov 12, 2024 • edited Loading

laserkelvin commented Nov 12, 2024

melo-gonzo left a comment

Choose a reason for hiding this comment

melo-gonzo Nov 12, 2024

Choose a reason for hiding this comment

laserkelvin Nov 12, 2024

Choose a reason for hiding this comment

laserkelvin commented Nov 12, 2024 •

edited

Loading