Merge pull request #250 from sammsaski/formalise25fixes

FormaliSE 2025 AE Fixes
verivital · Dec 6, 2024 · 1ecabc6 · 1ecabc6
2 parents 97fe0cd + a4c1496
commit 1ecabc6
Show file tree

Hide file tree

Showing 4 changed files with 31 additions and 178 deletions.
diff --git a/code/nnv/examples/Submission/FORMALISE2025/README.md b/code/nnv/examples/Submission/FORMALISE2025/README.md
@@ -2,17 +2,19 @@
 
 This artifact is used to reproduce the results in _Robustness Verification of Video Classification Neural Networks_.
 
-All results from _Robustness Verification of Video Classification Neural Networks_ were captured using an Apple M1 Max 10-core [email protected]×10 with 64GB of RAM. The times reported to run the smoke test and reproduce results are approximated based on this computer.
+All results from _Robustness Verification of Video Classification Neural Networks_ were captured using an `Apple M1 Max 10-core [email protected]×10` with 64GB of RAM. The times reported to run the smoke test and reproduce results are approximated based on this computer.
 
 # Requirements
 
 The following resources are required to run this artifact:
 
 - MATLAB 2024a with NNV and npy-matlab installed and added to the path.
-- conda environment with Python v3.11. Install rquirements from requirements.txt. Make sure to install the source files.
+- A conda environment with Python v3.11. Install rquirements from `requirements.txt`. Make sure to install the source files.
 
 # Installation
 
+This section describes all of the necessary steps for installing tools, dependencies, etc. needed to reproduce the artifact. For the remainder of these instructions, when the `FORMALISE2025` directory is referred to, this really means the directory at the following path: `nnv/code/nnv/examples/Submission/FORMALISE2025`.
+
 1. Clone NNV and npy-matlab and install them ensuring that both have been added to the MATLAB path.
 
 ```
@@ -21,19 +23,31 @@ git clone --recursive https://github.com/verivital/nnv.git
 git clone https://github.com/kwikteam/npy-matlab.git
 ```
 
-Follow the instructions under heading `# Installation:` on [NNV's README](https://github.com/verivital/nnv/blob/master/README.md). Note, if using a computer with an Apple silicon CPU, then it may be necessary to remove a line from NNV's `install.m` script. The following line
+Follow the instructions under heading `# Installation:` on [NNV's README](https://github.com/verivital/nnv/blob/master/README.md). Note, if using a computer with an Apple silicon CPU, then it may be necessary to remove the following line from NNV's `install.m` script:
 
 ```matlab
 tbxmanager install lcp hysdel cddmex clpmex glpkmex fourier sedumi;
 ```
 
-can be removed/commented out.
+Either comment it out or remove it.
 
 For `npy-matlab`, add and save the path to MATLAB. Again, instructions are available on [`npy-matlab`'s README](https://github.com/kwikteam/npy-matlab/blob/master/README.md).
 
-2. Download the following dataset files from here into the `FORMALISE2025/data` folder:
+For installing both of these tools, it will be necessary to run with administrator privileges so that you can save them to the MATLAB path.
+
+2. Download the following dataset files from [here](https://drive.google.com/drive/folders/1sXRtSObHLBTeKVss2IA-NGPKljirgD8D?usp=drive_link) into the `FORMALISE2025/data` folder so that the file tree now looks like
+
+```pseudo
+FORMALISE2025
+├── data
+│   ├── ZoomIn
+│   ├── ZoomOut
+│   ├── GTSRB
+│   └── STMNIST
+...
+```
 
--
+Note that the `GTSRB` data folder is quite large, so it will likely be downloaded in separate parts. Please make sure the organization of the folder is the same as in the download link.
 
 3. Create a conda environment with `Python v3.11` and install the requirements from `requirements.txt`. Additionally, install the source files to the environment. Both can be done by running the following commands from the root directory (`FORMALISE2025`):
 
@@ -45,9 +59,9 @@ For `npy-matlab`, add and save the path to MATLAB. Again, instructions are avail
    pip install -e .
    ```
 
-4. Modify the `verify.py` file to add the path to your NNV and npy-matlab directories (the repositories that were cloned earlier). For the `npy-matlab` repository, you'll want to reference the subfolder in the directory also called `npy-matlab`, e.g. `/some/path/to/npy-matlab/npy-matlab`.
+4. Modify the `.env` file to add the path to your NNV and npy-matlab directories (the repositories that were cloned earlier). For the `npy-matlab` repository, you'll want to reference the subfolder in the directory also called `npy-matlab`, e.g. `/some/path/to/npy-matlab/npy-matlab`.
 
-5.
+5. With all of these steps done, you are now ready to begin reproducing the artifacts.
 
 # Smoke Test Instructions (~30 minutes)
 
@@ -69,35 +83,35 @@ chmod +x run_smoketest.sh && ./run_smoketest.sh
 **********************************************
 ```
 
-# Reproducing the Full Results
+# Reproducing a Subset of the Results
 
-Assuming the average runtime for the experiments remains as shown in the paper, then it will take approximately 9-10 days to reproduce the results. As such, there will also be an intermediary where we verify 1/10 samples verified in the full experiments. Those instructions will be in the next section. For now, we discuss how to reproduce the full results. The instructions will be the exact same for running the smoke test, only we use a different script.
+Assuming the average runtime for the experiments remains as shown in the paper, then it will take approximately 9-10 days to reproduce the results. For that reason, this set of instructions is for reproducing a subset of the results. In this case, we verify every tenth sample from what was done for the original verification. Generating the reachable output range plots remains the same.
 
 1. Open a terminal and navigate to the `FORMALISE2025` directory.
 2. Make sure the conda environment with the installed dependencies is activated.
-3. Run the following command to begin reproducing the artifacts:
+3. Run the following command to begin reproducing a subset of the artifacts:
 
 ```bash
-chmod +x run_vvn.sh && ./run_vvn.sh
+chmod +x run_subset_vvn.sh && run_subset_vvn.sh
 ```
 
-# Reproducing a Subset of the Results
+# Reproducing the Full Results
 
-This set of instructions is for reproducing a subset of the results. In this case, we verify every tenth sample from what was done for the original verification. Generating the reachable output range plots remains the same.
+This set of instructions describes how to reproduce the full results. The instructions will be just like the previous parts only we use a different script.
 
 1. Open a terminal and navigate to the `FORMALISE2025` directory.
 2. Make sure the conda environment with the installed dependencies is activated.
-3. Run the following command to begin reproducing a subset of the artifacts:
+3. Run the following command to begin reproducing the artifacts:
 
 ```bash
-chmod +x run_subset_vvn.sh && run_subset_vvn.sh
+chmod +x run_vvn.sh && ./run_vvn.sh
 ```
 
 # Results
 
 | Artifact | Filepath                                                                  | Description                                                                                                                                                     |
 | -------- | ------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------- |
-| Table II | `results/table2.txt                                                       | A table of results (PSRV and average runtime) for all experiments.                                                                                              |
+| Table 2  | `results/table2.txt`                                                      | A table of results (PSRV and average runtime) for all experiments.                                                                                              |
 | Fig. 7   | `results/reach_stmnist_plot.png` and `results/reach_bad_stmnist_plot.png` | Plots of the reachable output ranges on a sample of STMNIST with different epsilon values (one robust, the other non-robust).                                   |
 | Fig. 8   | `results/runtime_comp.png`                                                | A visualization of how average runtime changes for the Zoom In, Zoom Out, and GTSRB datasets as the number of frames and magnitude of the perturbation changes. |
 

diff --git a/code/nnv/examples/Submission/FORMALISE2025/src/vvn.m b/code/nnv/examples/Submission/FORMALISE2025/src/vvn.m
diff --git a/code/nnv/examples/Submission/FORMALISE2025/src/vvn/testing/check_verification.m b/code/nnv/examples/Submission/FORMALISE2025/src/vvn/testing/check_verification.m
diff --git a/code/nnv/examples/Submission/FORMALISE2025/src/vvn/testing/convert_results_to_json.m b/code/nnv/examples/Submission/FORMALISE2025/src/vvn/testing/convert_results_to_json.m