Creation of a held-out test data for active learning training phase validation #33

Open
7 tasks done
rohanbanerjee opened this issue Feb 22, 2024 · 3 comments
rohanbanerjee commented Feb 22, 2024

Description

I've been thinking of ways to evaluate our active learning training phases (mentioned in #29). We have trained a baseline model on the good-quality (✅) segmentations, and we are using this model to predict the SC on other images, manually correcting the predictions as needed.

Our overall goal with active learning is to see whether the model can adapt to lower-quality images, and we expect performance to improve after each active learning phase. For this, imo, we should have a separate held-out test set on which we can test our trained models.

How do we build this test set:

  1. We split each dataset in an 80:20 ratio: 80% for training and 20% for testing.
  2. We concatenate all the 20% sets into one held-out test set (illustrated in the figure below).
  3. After each active learning training phase, we evaluate the model on the test set created in step 2.

Figure: Creation of held-out test set for testing active learning phases
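
A minimal sketch of steps 1 and 2, assuming each dataset is simply a list of subject IDs. The dataset names, the `split_dataset` and `build_held_out` helpers, and the fixed seed are hypothetical and only here for illustration; they are not taken from the project's code:

```python
import random


def split_dataset(subjects, test_fraction=0.2, seed=0):
    """Shuffle one dataset's subjects and split them into (train, test)."""
    rng = random.Random(seed)      # fixed seed so the split is reproducible
    shuffled = sorted(subjects)    # sort first so input order does not matter
    rng.shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def build_held_out(datasets, test_fraction=0.2, seed=0):
    """Split every dataset 80:20 and pool the 20% parts into one test set."""
    train, held_out = {}, []
    for name, subjects in sorted(datasets.items()):
        train[name], test = split_dataset(subjects, test_fraction, seed)
        # Keep the dataset name so pooled subjects stay traceable.
        held_out.extend((name, subject) for subject in test)
    return train, held_out


# Hypothetical datasets, for illustration only.
datasets = {
    "dataset-A": [f"sub-{i:02d}" for i in range(1, 11)],
    "dataset-B": [f"sub-{i:02d}" for i in range(1, 6)],
}
train, held_out = build_held_out(datasets)
```

Splitting per dataset before pooling keeps every dataset represented in the held-out set in proportion to its size. The pooled list should then stay fixed across all active learning phases so that results from different phases remain comparable.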

Ref: SCIseg paper

Tasks:

@rohanbanerjee rohanbanerjee self-assigned this Feb 23, 2024
@rohanbanerjee rohanbanerjee changed the title from "Need for a held-out test data" to "Creation of a held-out test data for active learning training phase validation" Feb 23, 2024
rohanbanerjee commented Mar 5, 2024

Uploading the QC for the held-out test set:
qc_held_out_test.zip

I went through the QC; the report is below:
20240305_held_out_qc_report-RB.zip

@rohanbanerjee

Update:
We are currently using a small subset of the held-out test data. The list of subjects is below:
participant_held-out.tsv

This dataset, which includes the manual corrections, can be found at: https://drive.google.com/file/d/18SiXA8RWzCo6TBovC8umZxWcsZW1soIV/view?usp=sharing

The QC for the above data can be found below:

qc_held_out_test_manually_corrected.zip

@rohanbanerjee

  • Upload the corrected data to OpenNeuro
