Add Cat-B calibration to OSA #292

marialainez · 2024-05-28T14:46:45Z

No description provided.

morcuended

Looks good overall, I left a few first set of comments. Thanks also for adapting the provenance tracking!

src/osa/configs/sequencer.cfg

src/osa/scripts/datasequence.py

Co-authored-by: Daniel Morcuende <[email protected]>

morcuended

Looks good to me. Some final comments inline

src/osa/scripts/datasequence.py

morcuended

It looks good to me. In the current implementation, you have to make sure to change the DL1b subdirectory prod id in another cfg (e.g. CatB_tailcut84) to differentiate it from the original DL1b files (produced only assuming Cat A calibration).

Alternatively, you could implement it in a way that if you activate the application of Cat B, the DL1b subdirectory is automatically changed to include the CatB prefix.

Nonetheless, from my side, this is good to go already.

The failing test is not related, it is just due to the datetime reported in the sequencer table title.

morcuended

I have one concern about adding the creation of CatB and application to DL1a. See comment inline

morcuended · 2024-10-01T14:34:25Z

src/osa/scripts/datasequence.py

+    if run_str[-4:] != "0000":
+        log.debug(f"{run_str} is not the first subrun of the run, so the script "
+            "onsite_create_cat_B_calibration_file will not be launched for this subrun.")
+
+        catB_calibration_file = get_catB_calibration_filename(int(run_str[:5]))
+        n = 0
+        n_max = 10
+        while not catB_calibration_file.exists() and n<=n_max:
+            time.sleep(120)
+            n += 1
+        return 0


Here, if the subrun is not the file 0000 the processing will be kept sleeping until the file is available, right? It is not very convenient, but I see it as the most straightforward way in the current OSA scheme in which all analysis steps run within a single batch job for a given subrun.

Setting slurm dependencies would require changing the way OSA submits jobs to something more modular: one job per subrun per analysis step, hence dependencies can be set. In this way, you could also decouple the different time/memory requirements of each step. And maybe everything would be more efficient

so you suggest to change OSA to submit one job per subrun?

The way you have implemented it, it can probably work, but I think is prone to problems during processing.
The current scheme has the drawback of not being flexible for mixing processing at subrun and run levels. Normal processing goes subrun-wise, then cat B coefficients calculation is done on a run-wise basis. The same applies to e.g. DL3 production, once you have the DL2 merged run-wise, you would like to run it that way. That's why datasequence has this limitation

r0_dl1 (subrun-wise) -> calculate_catB (run-wise) -> dl1ab (subrun-wise) -> merge dl1 (run-wise) -> dl1_dl2 (run-wise) -> dl2_dl3 (run-wise)

Each step would be a job with dependency on the previous stage.

However, this would be a major change in OSA I think. I do not know where to go from this point

probably keep the current implementation and test it for some time

We discussed it a week ago,, I had the same objections you do have. I suggested implementing dependencies in the individual slurm jobs of an array job, but as Maria pointed out, this means waiting for the full completion of the 0000 subrun (DL1a, DL1ab). It is clearly a bottle neck. Is catB calculated only with a subrun or does it use the information all subruns?

I think the bottleneck is there anyway with both schemes, you'd have to wait.

Cat B calibration coefficient is calculated runwise, I'd say using the information from all interleaved events across the entire run. If these coefficients are to be applied I see no other way than waiting until the calibration B file is produced. The thing with dependencies is that they automatically link steps, without having to do the sleep until it finishes. Maybe it can be decoupled into two things: first the production of CatB calib files and second the possible application to the data (this is to be checked with Crab data).

I think it is fine as it is. Give it a try.

For the moment, we can work this way, but later on, I also prefer your proposal. Consider this just another step in the sequencer, not in the data sequence, another calibration step. Build a script that just produces the cat_B calibration for a run. It will depend on the ped_cal calibration, and the data sequence jobs of a given run will depend on the cta_b jobs for this run.

…wice)

marialainez and others added 10 commits May 28, 2024 16:00

Add new step in datasequence to produce the CatB calibration files

4915cc1

add r0-dir to the calibration scripts (e.g. in case R0G is used)

53d57b3

Add CatB calibration file as input in dl1ab step

0b43696

adapt configuration file

95290da

add docstring

0828b26

adapt tests

763ec64

add argument in catB_calibration function

a3f0e1a

adapt tests

9e9193c

fix small issues

5ac3770

adapt tests

889508c

morcuended reviewed Jun 21, 2024

View reviewed changes

src/osa/configs/sequencer.cfg Outdated Show resolved Hide resolved

src/osa/scripts/datasequence.py Outdated Show resolved Hide resolved

marialainez and others added 3 commits June 26, 2024 12:01

Update src/osa/configs/sequencer.cfg

2d3e2bd

Co-authored-by: Daniel Morcuende <[email protected]>

adapt name of Cat-A and Cat-B calibration dir to the cfg file

eefba65

use lstchain major version as calib_prod_id

f483be3

morcuended reviewed Jul 23, 2024

View reviewed changes

src/osa/scripts/datasequence.py Outdated Show resolved Hide resolved

src/osa/scripts/datasequence.py Show resolved Hide resolved

marialainez added 2 commits August 5, 2024 18:01

add option in the cfg to apply or not catB calibration

34c8ab0

add cat A calibration run number

c67cdd5

morcuended approved these changes Aug 8, 2024

View reviewed changes

marialainez added 7 commits September 23, 2024 13:09

add filters as an argument of the catB calibration command

c3f8608

correct name of catB_calib_dir

ccf89db

wait until the catB file is created if subrun != 0000

f8b0edd

adapt tests

e94a15a

remove unused imports

a9ebfdb

adapt tests

d8df805

adapt historylevel function

37062cd

morcuended reviewed Oct 1, 2024

View reviewed changes

add --no-dl1ab option to process data with catB (sequencer launched t…

d220c81

…wice)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Cat-B calibration to OSA #292

Add Cat-B calibration to OSA #292

marialainez commented May 28, 2024

morcuended left a comment

morcuended left a comment

morcuended left a comment •

edited

Loading

morcuended left a comment

morcuended Oct 1, 2024

marialainez Oct 1, 2024

morcuended Oct 1, 2024

morcuended Oct 1, 2024

contrera Oct 1, 2024

morcuended Oct 1, 2024

contrera Oct 1, 2024

Add Cat-B calibration to OSA #292

Are you sure you want to change the base?

Add Cat-B calibration to OSA #292

Conversation

marialainez commented May 28, 2024

morcuended left a comment

Choose a reason for hiding this comment

morcuended left a comment

Choose a reason for hiding this comment

morcuended left a comment • edited Loading

Choose a reason for hiding this comment

morcuended left a comment

Choose a reason for hiding this comment

morcuended Oct 1, 2024

Choose a reason for hiding this comment

marialainez Oct 1, 2024

Choose a reason for hiding this comment

morcuended Oct 1, 2024

Choose a reason for hiding this comment

morcuended Oct 1, 2024

Choose a reason for hiding this comment

contrera Oct 1, 2024

Choose a reason for hiding this comment

morcuended Oct 1, 2024

Choose a reason for hiding this comment

contrera Oct 1, 2024

Choose a reason for hiding this comment

morcuended left a comment •

edited

Loading