update docs
jmargutt committed Jan 10, 2024
1 parent 92a0e11 commit 9740527
Showing 2 changed files with 89 additions and 11 deletions.
98 changes: 88 additions & 10 deletions docs/SOP.md
@@ -1,21 +1,99 @@
# ADA Standard Operating Procedures
These Standard Operating Procedures (SOP) are meant to guide 510 in using ADA during emergencies.

## Should I run ADA?
* There **must** be a clear request by the IFRC SIMS/IM Coordinator - in case of an international response - or by the National Society (NS) counterpart, because we want this analysis to be useful and used.
* The relevant agencies - [Copernicus](https://emergency.copernicus.eu/mapping/list-of-activations-rapid), [UNOSAT](https://unosat.org/products/), or [others](https://data.humdata.org/search?q=damage+assessment) - **must not** already be producing damage assessments for this emergency, or there must be a clear gap in the spatial coverage of those assessments, because we don't want to duplicate efforts.
* ...

## Can I run ADA?
* High-resolution (<0.6 m/pixel) optical satellite imagery of the affected area, both pre- and post-disaster, **must** be available.
* The pre- and post-disaster imagery **must** spatially overlap, since ADA needs both images for each building.
* There **should** be building information (polygons) already available from [OpenStreetMap](https://www.openstreetmap.org/), [Microsoft](https://github.com/microsoft/GlobalMLBuildingFootprints/blob/main/examples/example_building_footprints.ipynb), or [Google](https://sites.research.google/open-buildings/#download).
  * if not, buildings can be detected automatically using [ABD](https://github.com/rodekruis/ada-collection/tree/master/abd_model), but the quality of the results strongly depends on building density and is lower in densely built-up areas.

## How do I run ADA?

### Prerequisites
1. Access to the BitWarden collection `Damage Assessment`
2. Access to the resource group `510Global-ADA` with role `Contributor` or higher
3. [QGIS](https://www.qgis.org/en/site/index.html) installed locally

### 1. Get the imagery
The first step is to load the satellite imagery into the `operations` container of the datalake storage account `adadatalakestorage`.

Log in to the VM and mount the container on the directory `data` with
```commandline
sudo blobfuse ~/data --tmp-path=/mnt/resource/blobfusetmp --config-file=~/blobfuse/fuse_connection_adadatalake.cfg -o attr_timeout=240 -o entry_timeout=240 -o negative_timeout=120 -o allow_other
```
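
The connection config referenced above follows blobfuse's standard key-value format; a minimal sketch, with the account and container names taken from this SOP and the account key stored in the BitWarden collection:
```
accountName adadatalakestorage
accountKey <storage-account-key>
containerName operations
```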

If you received a link to the images (e.g. from another agency), simply download and extract them in `data`.
```commandline
mkdir ~/data/<event-name>
cd ~/data/<event-name>
wget <link-to-my-images>
```
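
If the link serves a compressed archive, extract it after downloading; a sketch assuming a zip file (`<archive-name>` is a placeholder):
```commandline
unzip <archive-name>.zip
```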

If you have the images on your local machine, simply upload them to the data lake storage, in the `operations` container.

> [!TIP]
> Use [Azure Storage Explorer](https://azure.microsoft.com/en-us/products/storage/storage-explorer) to upload and organize images if you're not familiar with the command line.

> [!CAUTION]
> Make sure that the images are divided into two sub-folders called `pre-event` and `post-event`.
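
For illustration, the expected folder structure can be created up front with:
```commandline
mkdir -p ~/data/<event-name>/pre-event ~/data/<event-name>/post-event
```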

If you want to download the images from Maxar open data:
1. go to https://www.maxar.com/open-data
2. browse to the relevant event
3. copy the name of the event from the URL (e.g. "typhoon-mangkhut" from https://www.maxar.com/open-data/typhoon-mangkhut)
4. download the images with
```commandline
load-images --disaster <event-name> --dest ~/data/<event-name>
```
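
For the typhoon-mangkhut example above, that would be:
```commandline
load-images --disaster typhoon-mangkhut --dest ~/data/typhoon-mangkhut
```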

### 2. Check the imagery
Verify that the imagery is cloud-free and that building damage is visible. This can be done locally by downloading the images and visualizing them in QGIS.
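
To quickly inspect an image's extent, resolution, and bands without opening QGIS, GDAL can be used directly on the VM (assuming `gdalinfo` is installed):
```commandline
gdalinfo ~/data/<event-name>/pre-event/<image-name>.tif
```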

> [!TIP]
> NOT YET TESTED! Allegedly, you can visualize images from Maxar Open data using [opengeos/solara-maxar](https://github.com/opengeos/solara-maxar).

### 3. Get the building footprints
The next step is to get a vector file (.geojson) with the buildings in the affected area.
* Check if OpenStreetMap (OSM) buildings are good enough; if so, download them for each image with the command below (a loop sketch over multiple images follows after this list):
```commandline
get-osm-buildings --raster ~/data/<event-name>/pre-event/<image-name>.tif
```

* if OSM is not good enough, check [Google buildings](https://sites.research.google/open-buildings/#download)
* if Google is also not good enough, check [Microsoft buildings](https://github.com/microsoft/GlobalMLBuildingFootprints/blob/main/examples/example_building_footprints.ipynb)
* if Microsoft is also not good enough, run [automated-building-detection](https://github.com/rodekruis/automated-building-detection?tab=readme-ov-file#end-to-end-example)
* place the output `buildings.geojson` in `~/data/<event-name>`
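
Since `get-osm-buildings` takes one raster at a time, here is a minimal sketch for looping over all pre-event images and checking the resulting footprint file; it assumes `.tif` images, GDAL's `ogrinfo` installed on the VM, and the merged output placed as above (exact output naming may differ):
```commandline
# download OSM footprints once per pre-event image
for img in ~/data/<event-name>/pre-event/*.tif; do
  get-osm-buildings --raster "$img"
done
# print a summary (feature count, extent, CRS) of the footprint file
ogrinfo -al -so ~/data/<event-name>/buildings.geojson
```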

### 4. Run ADA
* [OPTIONAL] Copy images from the datalake storage to the VM (processing is faster locally)
```
cp -r ~/data/<event-name> ~/<event-name>
```
* prepare data for caladrius (damage classification model)
```
cd ~/<event-name>
prepare-data --data . --buildings buildings.geojson --dest caladrius
```
* run caladrius
```
conda activate cal
CUDA_VISIBLE_DEVICES="0" python ~/caladrius/caladrius/run.py --run-name run --data-path caladrius --model-type attentive --model-path ~/data/caladrius_att_effnet4_v1.pkl --checkpoint-path caladrius/runs --batch-size 2 --number-of-workers 4 --classification-loss-type f1 --output-type classification --inference
```
* prepare the final vector file with building polygons and caladrius' predictions
```
conda activate base
final-layer --builds buildings.geojson --damage caladrius/runs/run-input_size_32-learning_rate_0.001-batch_size_2/predictions/run-split_inference-epoch_001-model_attentive-predictions.txt --out buildings-predictions.geojson
```
* copy it back to the datalake, download it locally (see the azcopy sketch after this list) and visualize it with QGIS
```
cp buildings-predictions.geojson ~/data/<event-name>
```
* create a map and share it
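
For the local download, [azcopy](https://learn.microsoft.com/en-us/azure/storage/common/storage-use-azcopy-v10) is a command-line alternative to Azure Storage Explorer; a sketch, assuming you have a SAS token for the `operations` container:
```commandline
azcopy copy "https://adadatalakestorage.blob.core.windows.net/operations/<event-name>/buildings-predictions.geojson?<sas-token>" .
```
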
2 changes: 1 addition & 1 deletion docs/ada-training-commands.sh
@@ -23,7 +23,7 @@ sudo blobfuse training-data --tmp-path=/mnt/resource/blobfusetmp --config-file=
## Getting Images
# The next step is to check that satellite images are available that cover the relevant area, that the pre and post images overlap, and that they are of sufficient quality.
# This can be quite a hassle since we are working with big files here. Downloading the images and visualizing them in QGIS is a lot of work! (Is there a QGIS plugin to directly connect to Azure Blob Storage?)
# Have a good look at the file names of the satellite images, they often correspond to geographical areas and can already tell you which pre and post images might overlap.
# Again, take into account that this might actually take some time.

# A common source for good satellite images would be Maxar. If they have images available, the first step would be to manually upload them to the datalake (divided into pre- and post-event) OR download them from Maxar open data
