
Implement on-the-fly descriptor calculation #630

Draft
wants to merge 17 commits into develop
Conversation

@RandomDefaultUser (Member) commented Jan 7, 2025

This PR gives MALA the capability to compute descriptors on the fly. This allows for more refined hyperparameter optimization of descriptor-related hyperparameters, since the training-free methods we initially envisioned may not (yet) be sufficient in many cases. The starting point for these on-the-fly methods is the .json files the DataConverter class can generate.

Since this is a pretty fundamental change in the MALA pipeline, it requires thorough testing. Namely, the following capabilities have to be implemented:

  • Single CPU/GPU training:
    • RAM based training
    • RAM based training w/ checkpointing
    • Lazy loading training
    • Lazy loading training w/ checkpointing
    • Lazy loading prefetch training
    • Lazy loading prefetch training w/ checkpointing
  • DDP training:
    • RAM based training
    • RAM based training w/ checkpointing
    • Lazy loading training
    • Lazy loading training w/ checkpointing
  • Adapt Tester class
  • Write Example
  • Write Documentation
  • Rename "additional data" in the DataConverter class
  • Add Shuffling to this on-the-fly calculation
  • Integrate new functionality in test suite
  • Add an automatic detection of provided data

@RandomDefaultUser added the enhancement label on Jan 7, 2025
@RandomDefaultUser self-assigned this on Jan 7, 2025
@RandomDefaultUser marked this pull request as draft on January 7, 2025 11:50
@RandomDefaultUser (Member, Author) commented:
I just looked into the DDP side of things. The problem seems to be that DDP allocates the GPUs for its own usage, so when LAMMPS wants to use them, they are not free. This should still be solvable, but it requires further modifications to the code. I will first address the other open issues and return to this one later, possibly in a separate PR.
