Rerun of i2MassChroQ on Ion Level module fails #459

JuliaS92 · 2024-11-28T09:12:21Z

Describe the bug
Downloading the input_df.csv from the Public runs and reloading that as new data raises an error.

To Reproduce
Steps to reproduce the behavior:

Download input_df.csv for i2MassChroQ__20240904_071654
Submit the same file as i2MassChroQ software result
Hit parse and bench

Expected behavior
This should reproduce the results generated from the original input made to create the public run.

Screenshots

File "/mnt/data/git/ProteoBench/webinterface/pages/base_pages/quant.py", line 469, in execute_proteobench
    result_performance, all_datapoints, input_df = self.run_benchmarking_process()
                                                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/data/git/ProteoBench/webinterface/pages/base_pages/quant.py", line 495, in run_benchmarking_process
    return self.ionmodule.benchmarking(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/ubuntu/micromamba/envs/proteobench/lib/python3.12/site-packages/proteobench/modules/dda_quant_ion/dda_quant_ion_module.py", line 90, in benchmarking
    raise ParseSettingsError(f"Error parsing the input file: {e}")
ParseSettingsError: Error parsing the input file: 'ProForma'

Desktop (please complete the following information):

OS: OSX Sonoma
Browser Firefox
ProteoBench version 0.5.1

The text was updated successfully, but these errors were encountered:

RobbinBouwmeester · 2024-11-28T09:22:21Z

Hi Julia,

Not entirely sure, but is the input_df.csv direct output of i2MassChroQ? There are multiple files you should be able to retrieve via the download, not all of them are direct outputs of the tool. Some of the files are formatted intermediates by ProteoBench.

JuliaS92 · 2024-11-28T09:26:04Z

It's either that or the params.csv or result_performance.csv. The direct input is not available, at least through the interface. If it is an intermediate format, we should make sure it loads the same way as the original input, especially for testing and rerunning of benchmarks.

RobbinBouwmeester · 2024-11-28T09:28:39Z

Unfortunately it does not load intermediate files, and I do not think we should support that via de webinterface. We should however support downloading of the raw input files. @julianu is this currently not possible?

RobbinBouwmeester · 2024-11-28T09:31:06Z

This relates to #458?

julianu · 2024-11-28T09:40:10Z

All data, that is stored on the server, can be downloaded via:
https://proteobench.cubimed.rub.de/datasets/ (maybe someone should put this into the docs?)

For the DDA modules, also the download function works, as far as I see.
I am not entirely sure whether the "df_input.csv" is the "raw", I just link everything that is stored right now.

Edit:
DIA has a bug right now... I will fix this.

JuliaS92 · 2024-11-28T09:47:25Z

Regarding putting it in the documentation also see the other issue: #457
For rerunning all datasets we need to be able to rerun from the input_df.csv files, if those are the only ones automatically generated.

RobbinBouwmeester · 2024-11-28T15:52:37Z

Regarding putting it in the documentation also see the other issue: #457 For rerunning all datasets we need to be able to rerun from the input_df.csv files, if those are the only ones automatically generated.

In my opinion it would be better to run it from the raw input. So, as mentioned before there is no need to run it from the input_df.csv. Main reason is that if we change anything in the parsing we will not be able to re-use the results.

mlocardpaulet · 2024-11-29T08:15:22Z

II may be wrong but I think that "input_df" is the raw input.
And I agree, we should re-run from the raw input.
I wonder if the issue could come from a change in i2masschroq outputs? I mean, there were so many back and forth with the developer that maybe something changed between the first point that was submitted and now?
If you want @JuliaS92 we can see this together.

RobbinBouwmeester · 2024-11-29T08:58:35Z

"input_df" is unfortunately not the raw input. See: https://proteobench.cubimed.rub.de/datasets/

RobbinBouwmeester · 2024-11-30T08:19:44Z

Should be mostly fixed in #462

RobbinBouwmeester · 2024-12-09T08:43:11Z

Only thing left is to zip files when storing.

RobbinBouwmeester · 2024-12-09T13:17:39Z

Is done and released in v0.5.5

RobbinBouwmeester closed this as completed Dec 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rerun of i2MassChroQ on Ion Level module fails #459

Rerun of i2MassChroQ on Ion Level module fails #459

JuliaS92 commented Nov 28, 2024

RobbinBouwmeester commented Nov 28, 2024

JuliaS92 commented Nov 28, 2024

RobbinBouwmeester commented Nov 28, 2024

RobbinBouwmeester commented Nov 28, 2024

julianu commented Nov 28, 2024 •

edited

Loading

JuliaS92 commented Nov 28, 2024

RobbinBouwmeester commented Nov 28, 2024

mlocardpaulet commented Nov 29, 2024

RobbinBouwmeester commented Nov 29, 2024

RobbinBouwmeester commented Nov 30, 2024

RobbinBouwmeester commented Dec 9, 2024

RobbinBouwmeester commented Dec 9, 2024

Rerun of i2MassChroQ on Ion Level module fails #459

Rerun of i2MassChroQ on Ion Level module fails #459

Comments

JuliaS92 commented Nov 28, 2024

RobbinBouwmeester commented Nov 28, 2024

JuliaS92 commented Nov 28, 2024

RobbinBouwmeester commented Nov 28, 2024

RobbinBouwmeester commented Nov 28, 2024

julianu commented Nov 28, 2024 • edited Loading

JuliaS92 commented Nov 28, 2024

RobbinBouwmeester commented Nov 28, 2024

mlocardpaulet commented Nov 29, 2024

RobbinBouwmeester commented Nov 29, 2024

RobbinBouwmeester commented Nov 30, 2024

RobbinBouwmeester commented Dec 9, 2024

RobbinBouwmeester commented Dec 9, 2024

julianu commented Nov 28, 2024 •

edited

Loading