feat(Optuna): Allow for parsing of Choice Nodes #290

berombau · 2024-10-29T16:17:24Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Fixes #289.

Minimal Example / How should this PR be tested?

There are some extra tests, see pytest -k optuna.

Any other comments?

Seems to be working. Will open new issues if I run into problems with it.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

eddiebergman · 2024-10-29T19:10:29Z

src/amltk/optimization/optimizers/optuna.py

+            # filter all parameters given the made choices
+            filtered_workspace = {k: v for k, v in workspace.items() if (
+                ("__choice__" not in k) and
+                (not any(c in k for c in delete_other_options))


This line might need a second look. I had a bug a while back with something similar, where the c was also present in some other k in another part of the pipeline that shouldn't have been affected by the given choice.

I'm on my phone so I can't write a concrete example, but see the overall review comment for more.
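
A minimal sketch of the kind of collision described above. The workspace keys, the `:` separator, and both helper functions are hypothetical and only illustrate the concern; they are not taken from the PR. The first helper mirrors the substring test in the diff, the second compares whole key segments instead.

```python
# Hypothetical flattened workspace; key names are illustrative only.
workspace = {
    "pipeline:estimator:__choice__": "svm",
    "pipeline:estimator:svm:C": 1.0,
    "pipeline:estimator:rf:n_estimators": 100,
    # Unrelated component whose name happens to contain the substring "rf".
    "pipeline:perf_scaler:with_mean": True,
}
delete_other_options = ["rf"]  # options that the choice did NOT select


def filter_by_substring(workspace, unwanted):
    """Same shape as the comprehension in the diff: plain substring test."""
    return {
        k: v
        for k, v in workspace.items()
        if "__choice__" not in k and not any(c in k for c in unwanted)
    }


def filter_by_segment(workspace, unwanted, sep=":"):
    """Assumed alternative: only drop keys where an unwanted name is a whole segment."""
    unwanted_set = set(unwanted)
    return {
        k: v
        for k, v in workspace.items()
        if "__choice__" not in k and unwanted_set.isdisjoint(k.split(sep))
    }


substring_result = filter_by_substring(workspace, delete_other_options)
segment_result = filter_by_segment(workspace, delete_other_options)
```

With the substring test, `pipeline:perf_scaler:with_mean` is dropped even though it has nothing to do with the choice; the segment comparison keeps it.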

eddiebergman · 2024-10-29T19:16:05Z

Looks really good, clean and well done, many thanks!

I would ask that the actual sampling logic be factored into its own function, which is then tested a bit more as to its sampling behaviour.

This wasn't required for the other optimizers as I relied on them testing their own functionality, i.e. I don't want to test that other optimizers have their internals correct.

However here, since we are the ones implementing the sampling procedure, it might be good to have it factored out and testable.

You can make it a private function of the module outside of the class as generally I don't think a user should be exposed to it.

For the actual study object, you can choose any sampler that makes sense to you!

Things to test are what was mentioned in the comment, but also maybe that deeper, more hierarchical pipelines work too. If you can correctly sample for this pipeline as well as a test for the aforementioned issue, I'd be happy to hit merge!

If you can't get around to it, just let me know and I will try to finish it off!

berombau · 2024-10-30T13:08:46Z

I fixed some stuff and added an OptunaSearchSpace class. It does the Choice parsing and has a sample_configuration() function.

The main issue is probably still this filtering of the parameters for the unwanted choices. Now it just looks for ':{name_of_unwanted_choice}:'. Using the name of the node had trouble with flat vs. non-flat hierarchies and with the Sequence node name, which seems to have some random elements. So we probably need to store the flat option in OptunaSearchSpace and do some smarter parsing for Sequence nodes.

A pipeline where two Choice components have the same options is probably not supported yet. I was just wondering whether there is anything else, besides the two points above, that this filter logic should take into account.
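
To make the shared-option case concrete, here is a small sketch with hypothetical keys and helper names (none of this is the PR's code). It assumes the full path of each Choice is known, which is exactly what the flat vs. non-flat and unnamed-Sequence points above complicate. Matching on ':{name}:' alone cannot tell the two Choices apart, while scoping the match to the Choice's own prefix can.

```python
# Two hypothetical Choice nodes that both offer "svm" and "rf".
workspace = {
    "pipeline:first:svm:C": 1.0,
    "pipeline:first:rf:n_estimators": 100,
    "pipeline:second:svm:C": 2.0,
    "pipeline:second:rf:n_estimators": 50,
}
# "first" selected svm (so rf is unwanted); "second" selected rf (so svm is unwanted).
unwanted = {"pipeline:first": ["rf"], "pipeline:second": ["svm"]}


def filter_by_name(workspace, unwanted):
    """Match ':{name}:' anywhere in the key, ignoring which Choice it belongs to."""
    names = [name for opts in unwanted.values() for name in opts]
    return {
        k: v for k, v in workspace.items()
        if not any(f":{n}:" in k for n in names)
    }


def filter_by_prefix(workspace, unwanted):
    """Assumed alternative: only drop keys under the owning Choice's prefix."""
    prefixes = tuple(
        f"{choice}:{name}:" for choice, opts in unwanted.items() for name in opts
    )
    return {k: v for k, v in workspace.items() if not k.startswith(prefixes)}


by_name = filter_by_name(workspace, unwanted)
by_prefix = filter_by_prefix(workspace, unwanted)
```

Here the name-based filter removes every parameter, because each key contains either ':rf:' or ':svm:', whereas the prefix-scoped filter keeps the parameters of the option each Choice actually selected.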

eddiebergman · 2024-10-30T16:15:21Z

The random elements are usually just a result of not giving a component a name. To do the configuring properly, I had to give them some identifier.

In any case, happy to push this through once the tests pass. I can turn the remaining thing into an issue so this can be merged!

Very appreciative of your work!

berombau · 2024-10-30T16:33:26Z

Thanks for the help! It's a really nice toolkit and codebase, was very glad when I found it 👍

mypy crashes in my pre-commit, so I'm running it locally.

mypy.....................................................................Failed
- hook id: mypy
- exit code: 2

tests/scheduling/test_scheduler.py:116: error: INTERNAL ERROR -- Please try using mypy master on GitHub:
https://mypy.readthedocs.io/en/stable/common_issues.html#using-a-development-mypy-build
Please report a bug at https://github.com/python/mypy/issues
version: 1.8.0

eddiebergman · 2024-10-30T18:52:43Z

Aye, I'm aware of this issue, it's very annoying. I can't figure out what causes it, but clearing the cache sometimes seems to fix it -_-

begin on support for Choice in Optuna parsing

a57a126

eddiebergman requested changes Oct 29, 2024

berombau changed the title ~~begin on support for Choice in Optuna parsing~~ support for Choice in Optuna optimizer Oct 30, 2024

improve and move Choice logic from optimizer to parser

7d79816

berombau changed the title ~~support for Choice in Optuna optimizer~~ feat(Optuna): Allow for parsing of Choice Nodes Oct 30, 2024

add pre-commit changes

39e002b

berombau requested a review from eddiebergman October 30, 2024 16:31

eddiebergman merged commit 5221d80 into automl:main Oct 31, 2024
3 of 4 checks passed
