Wrap PyTorch models under TorchModule #147

Merged: 9 commits merged into main on Feb 9, 2024
Conversation

@bryanlimy (Member) commented Feb 6, 2024

  • new autoemulate/emulators/neural_networks module, where new PyTorch architectures can be defined by implementing TorchModule
  • NeuralNetTorch takes a string argument module, naming the module to initialize
  • moved set_random_seed to autoemulate/utils.py, since it might also be used by non-PyTorch modules

Addresses Issue #129.

@bryanlimy (Member, Author) commented Feb 6, 2024

The update fails the check_estimators_overwrite_params test in estimator_checks. I am not sure why: if we create a deepcopy of a model and fit the copy, why would we expect the fitted copy to still match the original model?

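For context, a rough paraphrase of what that sklearn check verifies (not the exact implementation): the parameters passed to __init__ must be unchanged after fit.

from copy import deepcopy

def fit_does_not_overwrite_params(estimator, X, y):
    # Paraphrased idea behind check_estimators_overwrite_params:
    # fitting must not mutate the parameters that were set in __init__.
    params_before = deepcopy(estimator.get_params())
    estimator.fit(X, y)
    return params_before == estimator.get_params()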
@bryanlimy requested a review from @mastoffel on February 6, 2024, 17:21
@mastoffel (Collaborator) left a comment:

Great work @bryanlimy! Just a few comments below.

  • the failed check_estimators_overwrite_params test is fine for now as long as everything else works. Could you add it to the _xfail_check dict in neural_net_torch.py to pass CI? (A sketch of one common way to declare this follows below.)

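One common way sklearn-compatible estimators mark such expected failures is an _xfail_checks entry returned from _more_tags; a minimal sketch, assuming a skorch-based estimator (the actual dict name and location in neural_net_torch.py may differ):

from skorch import NeuralNetRegressor

class NeuralNetTorch(NeuralNetRegressor):
    def _more_tags(self):
        # Tells sklearn's estimator checks to expect this check to fail.
        return {
            "_xfail_checks": {
                "check_estimators_overwrite_params": "known failure, see discussion above",
            }
        }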
autoemulate/compare.py (outdated review comment, resolved)
Comment on lines 9 to 15
def register(name):
    def add_to_dict(fn):
        global _MODULES
        _MODULES[name] = fn
        return fn

    return add_to_dict
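Presumably the decorator is applied to TorchModule subclasses so they can later be looked up by name, roughly like this (hypothetical usage):

@register("mlp")              # adds MLPModule to _MODULES under the key "mlp"
class MLPModule(TorchModule):
    ...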
@mastoffel (Collaborator):

I wonder whether global variables can lead to issues down the line (state management/testing) and whether it would be better to encapsulate this in a class. If you're frequently using globals and think it's fine, I'm ok with leaving this for the moment.

@bryanlimy (Member, Author):
Updated the get_module method to not use global variables.

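The updated code is not shown in this thread, but for illustration, one way to drop the module-level global is to keep the mapping as class state (hypothetical names, not the actual implementation):

class ModuleRegistry:
    """Holds the name -> module-class mapping as class state rather than a global dict."""

    _modules: dict = {}

    @classmethod
    def register(cls, name):
        def add_to_dict(fn):
            cls._modules[name] = fn
            return fn

        return add_to_dict

    @classmethod
    def get_module(cls, name):
        return cls._modules[name]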
autoemulate/emulators/neural_networks/neural_networks.py (outdated review comment, resolved)
Comment on lines 10 to 30
class MLPModule(TorchModule):
    def __init__(
        self,
        input_size: int = None,
        output_size: int = None,
        random_state: int = None,
        hidden_sizes: Tuple[int] = (100,),
    ):
        super(MLPModule, self).__init__(
            module_name="mlp",
            input_size=input_size,
            output_size=output_size,
            random_state=random_state,
        )
        modules = []
        for hidden_size in hidden_sizes:
            modules.append(nn.Linear(in_features=input_size, out_features=hidden_size))
            modules.append(nn.ReLU())
            input_size = hidden_size
        modules.append(nn.Linear(in_features=input_size, out_features=output_size))
        self.model = nn.Sequential(*modules)
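For context, the module above stacks a Linear/ReLU pair per entry in hidden_sizes plus a final output layer; with hypothetical sizes it would be instantiated like:

module = MLPModule(input_size=10, output_size=1, hidden_sizes=(64, 32))
# module.model is Sequential(Linear(10, 64), ReLU(), Linear(64, 32), ReLU(), Linear(32, 1))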
@mastoffel (Collaborator):

I wonder whether we should move the hyperparameter search space from neural_net_torch.py to here, as it will be quite specific for each PyTorch model.

@bryanlimy (Member, Author):

Yes, I think it makes sense that the hyperparameter settings live within each TorchModule.

@bryanlimy (Member, Author):

I have moved the get_grid_params method to TorchModule. A problem with this approach is that the module is not initialized when NeuralNetTorch is created; it is only initialized when we call fit for the first time. So if we call grid_params = nn_torch_model.get_grid_params() before a hyperparameter search, we get an error because self.module_ does not exist yet.

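As a sketch of that idea (placeholder values, not the actual search space), each TorchModule subclass could expose its own grid:

class MLPModule(TorchModule):
    ...

    def get_grid_params(self):
        # Hypothetical, architecture-specific search space.
        return {
            "module__hidden_sizes": [(50,), (100,), (100, 100)],
            "lr": [1e-3, 1e-2],
            "max_epochs": [10, 20],
        }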
@bryanlimy (Member, Author):

We can either always initialize the module in NeuralNetTorch.__init__, which would fail some cases in the estimator test suite, or initialize the module ourselves before running a hyperparameter search.

@mastoffel (Collaborator):

Yes I see, so this currently fails when running:

em = AutoEmulate()  
em.setup(X, y, model_subset=["NeuralNetTorch"], param_search=True)
em.compare()

with AttributeError: 'NeuralNetTorch' object has no attribute 'module_'

My feeling is that initializing the module in NeuralNetTorch.__init__ is fine and we just add the failed tests to "_xfail_checks", because this seems like what skorch is intending to do anyway.
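A minimal sketch of that option, assuming a skorch-style estimator where initialize() builds self.module_ (the real NeuralNetTorch __init__ takes more arguments, and get_module is the lookup added in this PR):

from skorch import NeuralNetRegressor

class NeuralNetTorch(NeuralNetRegressor):
    def __init__(self, module="mlp", **kwargs):
        super().__init__(module=get_module(module), **kwargs)
        self.initialize()  # build self.module_ eagerly instead of waiting for fit()

    def get_grid_params(self):
        # Now safe to call before fit() or a hyperparameter search.
        return self.module_.get_grid_params()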

@mastoffel (Collaborator):

@bryanlimy sorry one more comment: Would you mind adding a few docstrings?

github-actions bot (Contributor) commented Feb 9, 2024

Coverage report

Lines missing coverage, by file:

  autoemulate/utils.py: 322-323
  autoemulate/emulators/neural_net_torch.py: 75, 117
  autoemulate/emulators/neural_networks/__init__.py: none
  autoemulate/emulators/neural_networks/base.py: 28, 31
  autoemulate/emulators/neural_networks/get_module.py: 15-16
  autoemulate/emulators/neural_networks/mlp.py: 37-58
  tests/test_emulators.py: none

This report was generated by python-coverage-comment-action

@codecov-commenter commented Feb 9, 2024

Codecov Report

Attention: 16 lines in your changes are missing coverage. Please review.

Comparison is base (d6adb8b) 91.72% compared to head (44582e6) 91.54%.
Report is 15 commits behind head on main.

Files                                                 Patch %   Missing lines
autoemulate/emulators/neural_networks/mlp.py           71.42%   8
autoemulate/emulators/neural_net_torch.py              77.77%   2
autoemulate/emulators/neural_networks/base.py          86.66%   2
autoemulate/emulators/neural_networks/get_module.py    80.00%   2
autoemulate/utils.py                                   80.00%   2
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #147      +/-   ##
==========================================
- Coverage   91.72%   91.54%   -0.18%     
==========================================
  Files          36       40       +4     
  Lines        1691     1739      +48     
==========================================
+ Hits         1551     1592      +41     
- Misses        140      147       +7     

☔ View full report in Codecov by Sentry.

@bryanlimy merged commit a70aa6e into main on Feb 9, 2024 (5 checks passed).
@bryanlimy deleted the pytorch_wrapper branch on February 9, 2024, 16:40.