added branch composition to the modelsystem normalizer #76

JFRudzinski · 2024-05-28T11:00:11Z

No description provided.

coveralls · 2024-05-28T11:01:17Z

Pull Request Test Coverage Report for Build 9268882832

Details

0 of 0 changed or added relevant lines in 0 files are covered.
10 unchanged lines in 1 file lost coverage.
Overall coverage decreased (-0.3%) to 98.22%

Files with Coverage Reduction	New Missed Lines	%
tests/conftest.py	10	89.62%

Totals
Change from base Build 9254335937:	-0.3%
Covered Lines:	607
Relevant Lines:	618

💛 - Coveralls

JFRudzinski · 2024-05-29T09:17:53Z

@JosePizarro3, @Bernadette-Mohr
I added the attribute composition_formula to ModelSystem, which describes the children of each ModelSystem with the notation X(n)Y(m), where X, Y are the branch labels and n,m are the number of repeating branches with the same label. This is applied during normalization only for representative systems. At the lowest level of the hierarchy, i.e., when a model system has no model_system subsection, composition formula then corresponds to the normal chemical formula in terms of the atom_labels.

I am not attached to this notation, but I think it is useful for understanding what each branch is containing. Let me know what you think. There are TODOs and ?'s that warrant addressing, please have a look at those.

JosePizarro3

Very good, just some minor reorganization comments.

I am mainly interested on moving the couple of functions you have under normalize() directly as class methods and combine them into something called resolve_composition_formula. Still, you will need to define a specific function inside there to do the recursion.

I will also change the .get('...') methods for our typical style to resolve quantities and sub-sections, e.g., system.model_system instead of system.get('model_system'). I think the second would return a dictionary, but I'd might be wrong.

Let me know if you need help or something is not clear.

src/nomad_simulations/model_system.py

src/nomad_simulations/utils/utils.py

src/nomad_simulations/model_system.py

tests/test_model_system.py

JFRudzinski · 2024-05-30T12:47:16Z

@JosePizarro3 In terms of system.model_system instead of system.get('model_system'), they return the same thing if system.model_system exists. However, the .get() returns None if it doesn't exist, whereas the former breaks with an error. I guess it is somehow guaranteed that these archive sections are populated with all their possible attributes, but I am just used to using .get() to be safe.

I will change it, just wanted to clarify.

JosePizarro3 · 2024-05-30T12:53:09Z

@JosePizarro3 In terms of system.model_system instead of system.get('model_system'), they return the same thing if system.model_system exists. However, the .get() returns None if it doesn't exist, whereas the former breaks with an error. I guess it is somehow guaranteed that these archive sections are populated with all their possible attributes, but I am just used to using .get() to be safe.

I will change it, just wanted to clarify.

Yeah, this is actually kind of annoying. @TLCFEM @ladinesa could this be implemented or is there any reason why we don't want system.model_system in the example above to return None?

ladinesa · 2024-05-30T13:32:01Z

@JosePizarro3 In terms of system.model_system instead of system.get('model_system'), they return the same thing if system.model_system exists. However, the .get() returns None if it doesn't exist, whereas the former breaks with an error. I guess it is somehow guaranteed that these archive sections are populated with all their possible attributes, but I am just used to using .get() to be safe.
I will change it, just wanted to clarify.

Yeah, this is actually kind of annoying. @TLCFEM @ladinesa could this be implemented or is there any reason why we don't want system.model_system in the example above to return None?

model_system since it is repeating subsection should return an empty list right?

JFRudzinski · 2024-05-30T13:39:16Z

@JosePizarro3 In terms of system.model_system instead of system.get('model_system'), they return the same thing if system.model_system exists. However, the .get() returns None if it doesn't exist, whereas the former breaks with an error. I guess it is somehow guaranteed that these archive sections are populated with all their possible attributes, but I am just used to using .get() to be safe.
I will change it, just wanted to clarify.

Yeah, this is actually kind of annoying. @TLCFEM @ladinesa could this be implemented or is there any reason why we don't want system.model_system in the example above to return None?

model_system since it is repeating subsection should return an empty list right?

I actually wasn't saying it doesn't. I was asking if all metainfo sections and attributes are automatically populated with None (or empty lists) so that you don't ever need to do .get() because you won't get an error. Is that true? At what point are they populated, at instantiation?

ladinesa · 2024-05-30T13:45:58Z

@JosePizarro3 In terms of system.model_system instead of system.get('model_system'), they return the same thing if system.model_system exists. However, the .get() returns None if it doesn't exist, whereas the former breaks with an error. I guess it is somehow guaranteed that these archive sections are populated with all their possible attributes, but I am just used to using .get() to be safe.
I will change it, just wanted to clarify.

Yeah, this is actually kind of annoying. @TLCFEM @ladinesa could this be implemented or is there any reason why we don't want system.model_system in the example above to return None?

model_system since it is repeating subsection should return an empty list right?

I actually wasn't saying it doesn't. I was asking if all metainfo sections and attributes are automatically populated with None (or empty lists) so that you don't ever need to do .get() because you won't get an error. Is that true? At what point are they populated, at instantiation?

yes it should be the case, I have not encoutered a case where I need to use .get

JFRudzinski · 2024-06-04T07:55:56Z

@JosePizarro3 This is ready for another review. No rush though, you can leave it till after the project meeting if you don't have time.

I did make some improvements to the testing, I know it's not exactly what you had in mind, and I still feel that it is not ideal in the sense that the generation of the hierarchy is sort of complicated (I guess I could move this to some template generator?), but I am not exactly sure an alternative.

That being said, I think the functionality is much better now in the sense that I actually test a bunch of cases where quantities might be missing, and I tried to document this in the description.

JosePizarro3

Very good, just some final minor details that will make your life easier when writing docstrings for functions.

I really liked the testing, it made it very easy to understand the implementation, and I am surprised that this actually decreased the coverage... I have to ask the details about this package, it is very strange sometimes.

src/nomad_simulations/general.py

src/nomad_simulations/utils/utils.py

tests/test_model_system.py

JFRudzinski · 2024-06-04T12:10:42Z

@JosePizarro3 Thanks for the tips, I think I have addressed everything. Let me know if you think we are ready to consider merging.

btw - have you applied ruff to this repo or do I need to do it manually? My auto-ruff formatting is not instigated for this repo, not sure if that is something that I need to address locally or something with the repo.

JosePizarro3 · 2024-06-04T12:14:24Z

@JosePizarro3 Thanks for the tips, I think I have addressed everything. Let me know if you think we are ready to consider merging.

btw - have you applied ruff to this repo or do I need to do it manually? My auto-ruff formatting is not instigated for this repo, not sure if that is something that I need to address locally or something with the repo.

So in the pyproject.toml ruff is added, so autoformatting should work. You also have a couple of files with conflicts in develop, maybe you need to rebase.

coveralls · 2024-06-04T12:29:37Z

Pull Request Test Coverage Report for Build 9367142112

Details

49 of 49 (100.0%) changed or added relevant lines in 1 file are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.07%) to 98.749%

Totals
Change from base Build 9365151737:	0.07%
Covered Lines:	947
Relevant Lines:	959

💛 - Coveralls

JFRudzinski · 2024-06-04T12:31:32Z

@JosePizarro3 Thanks for the tips, I think I have addressed everything. Let me know if you think we are ready to consider merging.
btw - have you applied ruff to this repo or do I need to do it manually? My auto-ruff formatting is not instigated for this repo, not sure if that is something that I need to address locally or something with the repo.

So in the pyproject.toml ruff is added, so autoformatting should work. You also have a couple of files with conflicts in develop, maybe you need to rebase.

ok rebased and applied ruff manually (not sure what's going on there)...looks good?

JosePizarro3 · 2024-06-04T12:48:29Z

Yeah, please, merge.

Maybe there is something off with your virtual environment and the rules applied (I am guessing some conflict on that direction). We can check it out in the office this week, it should be automatically applied when saving a file.

JFRudzinski · 2024-06-04T12:57:22Z

Yeah, please, merge.

Maybe there is something off with your virtual environment and the rules applied (I am guessing some conflict on that direction). We can check it out in the office this week, it should be automatically applied when saving a file.

cool, thanks a lot for all your help with this!

JFRudzinski linked an issue May 28, 2024 that may be closed by this pull request

ModelSystem for H5MD #73

Closed

JFRudzinski requested review from JosePizarro3 and Bernadette-Mohr May 29, 2024 09:12

JosePizarro3 reviewed May 30, 2024

View reviewed changes

JFRudzinski requested a review from JosePizarro3 June 4, 2024 07:52

JosePizarro3 approved these changes Jun 4, 2024

View reviewed changes

jrudz added 10 commits June 4, 2024 14:23

added branch composition to the modelsystem normalizer

9329eb5

fixed attribute name mistake

af52af4

added test for composition formula

1426d8f

resolve composition formula outside of normalize within ModelSystem

ea171a0

moved resolve composition formula up to Simulations()

90d08c1

save point

484f047

finalized movement of resolve formulas and expanded tests

d5aaac8

added doc strings

853b583

removed logger and unused typing

6ce4348

some typos

c60c66c

JFRudzinski force-pushed the 73-modelsystem-for-h5md branch from 3799690 to c60c66c Compare June 4, 2024 12:25

applied ruff

faac37c

ruff formatting

165144c

JFRudzinski merged commit e032021 into develop Jun 4, 2024
4 checks passed

JFRudzinski deleted the 73-modelsystem-for-h5md branch June 4, 2024 12:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added branch composition to the modelsystem normalizer #76

added branch composition to the modelsystem normalizer #76

JFRudzinski commented May 28, 2024

coveralls commented May 28, 2024 •

edited

Loading

JFRudzinski commented May 29, 2024

JosePizarro3 left a comment

JFRudzinski commented May 30, 2024

JosePizarro3 commented May 30, 2024

ladinesa commented May 30, 2024

JFRudzinski commented May 30, 2024

ladinesa commented May 30, 2024

JFRudzinski commented Jun 4, 2024

JosePizarro3 left a comment

JFRudzinski commented Jun 4, 2024

JosePizarro3 commented Jun 4, 2024

coveralls commented Jun 4, 2024 •

edited

Loading

JFRudzinski commented Jun 4, 2024

JosePizarro3 commented Jun 4, 2024

JFRudzinski commented Jun 4, 2024

added branch composition to the modelsystem normalizer #76

added branch composition to the modelsystem normalizer #76

Conversation

JFRudzinski commented May 28, 2024

coveralls commented May 28, 2024 • edited Loading

Pull Request Test Coverage Report for Build 9268882832

Details

💛 - Coveralls

JFRudzinski commented May 29, 2024

JosePizarro3 left a comment

Choose a reason for hiding this comment

JFRudzinski commented May 30, 2024

JosePizarro3 commented May 30, 2024

ladinesa commented May 30, 2024

JFRudzinski commented May 30, 2024

ladinesa commented May 30, 2024

JFRudzinski commented Jun 4, 2024

JosePizarro3 left a comment

Choose a reason for hiding this comment

JFRudzinski commented Jun 4, 2024

JosePizarro3 commented Jun 4, 2024

coveralls commented Jun 4, 2024 • edited Loading

Pull Request Test Coverage Report for Build 9367142112

Details

💛 - Coveralls

JFRudzinski commented Jun 4, 2024

JosePizarro3 commented Jun 4, 2024

JFRudzinski commented Jun 4, 2024

coveralls commented May 28, 2024 •

edited

Loading

coveralls commented Jun 4, 2024 •

edited

Loading