Merge pull request #36 from oncoray/dev_1.3.0

Release version 2.0.0
oncoray · Oct 27, 2023 · 2041519 · 2041519
2 parents b0f82dc + 2b9b428
commit 2041519
Show file tree

Hide file tree

Showing 3,413 changed files with 38,221 additions and 13,193 deletions.
diff --git a/.gitignore b/.gitignore
@@ -4,4 +4,8 @@ mirp/__pycache__/
 *.pyc
 test/data/reference_values/.Rhistory
 test/.Rhistory
-test/data/temp
+test/data/temp
+/test/data/configuration_files/temp_test_config_data.xml
+/mirp.egg-info/
+/dist/
+docs_source/build/*
diff --git a/MANIFEST.in b/MANIFEST.in
@@ -1,4 +1,2 @@
 include *.md
-include *.txt
-recursive-include docs *.md
-recursive-include docs *.txt
+include LICENSE.txt
diff --git a/NEWS.md b/NEWS.md
@@ -1,3 +1,67 @@
+# Version 2.0.0
+
+## Major changes
+
+- MIRP was previously configured using two `xml` files: [`config_data.xml`](mirp/config_data.xml) for configuring
+  directories, data to be read, etc., and [`config_settings.xml`](mirp/config_settings.xml) for configuring experiments.
+  While these two files can still be used, MIRP can now be configured directly, without using these files.
+
+- The main functions of MIRP (`mainFunctions.py`) have all been re-implemented.
+  - `mainFunctions.extract_features` is now `extractFeaturesAndImages.extract_features` (functional form) or
+    `extractFeaturesAndImages.extract_features_generator` (generator). The replacements allow for both writing
+    feature values to a directory and returning them as function output. 
+  - `mainFunctions.extract_images_to_nifti` is now `extractFeaturesAndImages.extract_images` (functional form) or
+     `extractFeaturesAndImages.extract_images_generator` (generator). The replacements allow for both writing 
+     images to a directory (e.g., in NIfTI or numpy format) and returning them as function output.
+  - `mainFunctions.extract_images_for_deep_learning` has been replaced by 
+    `deepLearningPreprocessing.deep_learning_preprocessing` (functional form) and 
+    `deepLearningPreprocessing.deep_learning_preprocessing_generator` (generator).
+  - `mainFunctions.get_file_structure_parameters` and `mainFunctions.parse_file_structure` are deprecated, as the
+    the file import system used in version 2 no longer requires a rigid directory structure.
+  - `mainFunctions.get_roi_labels` is now `extractMaskLabels.extract_mask_labels`.
+  - `mainFunctions.get_image_acquisition_parameters` is now `extractImageParameters.extract_image_parameters`.
+
+- MIRP previously relied on `ImageClass` and `RoiClass` objects. These have been completely replaced by `GenericImage`
+  (and its subclasses, e.g. `CTImage`) and `BaseMask` objects, respectively. New image modalities can be added as
+  subclass of `GenericImage` in the `mirp.images` submodule.
+
+- File import, e.g. from DICOM or NIfTI files, in was previously implemented in an ad-hoc manner, and required a rigid
+  directory structure. Now, file import is implemented using an object-oriented approach, and directory structures 
+  are more flexible. File import of new modalities can be implemented as a relevant subclass of `ImageFile`.
+
+- MIRP uses type hinting, and makes use of the `Self` type hint introduced in Python 3.11. MIRP therefore requires 
+  Python 3.11 or later.
+
+## Minor changes
+- MIRP now uses the `ray` package for parallel processing.
+
+# Version 1.3.0 (dev - unreleased)
+
+## Minor changes
+- `SimpleITK` has been removed as a dependency. Handling of non-DICOM imaging is now done through `itk` itself.
+- Rotation - as a perturbation or augmentation operation - is now performed as part of the interpolation process. 
+  Previously, rotation was implemented using `scipy.ndimage.rotate`. This, combined with any translation or 
+  interpolation operation would involve two interpolation steps. Aside from removing a computationally intensive 
+  step, this also prevents unnecessary image degradation through the interpolation process. The new implementation 
+  operates using affine matrix transformations.
+- Discretisation of intensities after filtering (i.e. intensities of response maps) now uses a *fixed bin number* 
+  method with 16 bins by default. Previously, no default was set, which could lead to unintended results. These 
+  parameters can be manually specified using the `response_map_discretisation_method`, 
+  `response_map_discretisation_bin_width`, and `response_map_discretisation_n_bins` arguments; or alternatively 
+  using the `discretisation_method`, `discretisation_bin_width` and `discretisation_n_bins` parameters of the 
+  `img_transform` section of the settings configuration file. 
+
+## Fixes
+
+- Fixed a deprecation warning caused by `slic` of the `scikit-image` module.
+- Fixed incorrect merging of contours of the same region of interest (ROI) in the same slice. Previously, each contour 
+  was converted to a mask individually, and merged with the segmentation mask using `OR` operations. This functions 
+  perfectly for contours that represent separate objects spatially. However, holes in RTSTRUCT objects are not 
+  always represented by a single contour. They can also be represented by a separate contour (of the same region of 
+  interest) that is contained within a larger contour. For those RTSTRUCT objects, holes would disappear. This has 
+  now been fixed by first collecting all contours of a ROI for each slice, prior to converted them to a segmentation 
+  mask.
+
 # Version 1.2.0
 
 ## Major changes

diff --git a/README.md b/README.md
@@ -1,19 +1,165 @@
-*2023-09-12: Version 2.0 is coming soon. Version 2.0 is major refactoring of the current code, with a focus on better usability and documentation.*
+<img src="https://raw.githubusercontent.com/oncoray/mirp/master/icon/mirp.svg" align="right" width="120"/>
 
 # Medical Image Radiomics Processor
 
-The Medical Image Radiomics Processor (MIRP) is an IBSI-compliant radiomics processor that is used to extract image biomarkers from medical imaging. MIRP is an end-to-end framework featuring parallel processing that is configured using `xml` files.
+Medical Image Radiomics Processor (MIRP) is an IBSI-compliant python package for medical image analysis.
+MIRP focuses on radiomics applications and supports computation of features for conventional radiomics
+and image processing for deep-learning applications.
+
+## Installing MIRP
+MIRP is available from PyPI and can be installed using `pip`, or other installer tools:
+
+```commandline
+pip install mirp
+```
+
+## Transitioning to version 2
+
+Version 2 is a major refactoring of the previous code base. For users this brings the following noticeable changes:
+
+- MIRP was previously configured using two `xml` files: [`config_data.xml`](mirp/config_data.xml) for configuring
+  directories, data to be read, etc., and [`config_settings.xml`](mirp/config_settings.xml) for configuring experiments.
+  While these two files can still be used, MIRP can now be configured directly, without using these files.
+- The main functions of MIRP (`mainFunctions.py`) have all been re-implemented.
+  - `mainFunctions.extract_features` is now `extract_features` (functional form) or
+    `extract_features_generator` (generator). The replacements allow for both writing
+    feature values to a directory and returning them as function output. 
+  - `mainFunctions.extract_images_to_nifti` is now `extract_images` (functional form) or
+     `extract_images_generator` (generator). The replacements allow for both writing 
+     images to a directory (e.g., in NIfTI or numpy format) and returning them as function output.
+  - `mainFunctions.extract_images_for_deep_learning` has been replaced by 
+    `deep_learning_preprocessing` (functional form) and 
+    `deep_learning_preprocessing_generator` (generator).
+  - `mainFunctions.get_file_structure_parameters` and `mainFunctions.parse_file_structure` are deprecated, as the
+    the file import system used in version 2 no longer requires a rigid directory structure.
+  - `mainFunctions.get_roi_labels` is now `extract_mask_labels`.
+  - `mainFunctions.get_image_acquisition_parameters` is now `extract_image_parameters`.
+
+For advanced users and developers, the following changes are relevant:
+- MIRP previously relied on `ImageClass` and `RoiClass` objects. These have been completely replaced by `GenericImage`
+  (and its subclasses, e.g. `CTImage`) and `BaseMask` objects, respectively. New image modalities can be added as
+  subclass of `GenericImage` in the `mirp.images` submodule.
+- File import, e.g. from DICOM or NIfTI files, in version 1 was implemented in an ad-hoc manner, and required a rigid
+  directory structure. Since version 2, file import is implemented using an object-oriented approach, and directory
+  structures are more flexible. File import of new modalities can be implemented as a relevant subclass of `ImageFile`.
+- MIRP uses type hinting, and makes use of the `Self` type hint introduced in Python 3.11. MIRP 
+  therefore requires Python 3.11 or later.
+- MIRP now uses the `ray` package for parallel processing.
+
+## Examples - Computing Radiomics Features
+
+MIRP can be used to compute quantitative features from regions of interest in images in an IBSI-compliant manner 
+using a standardized workflow This requires both images and masks. MIRP can process DICOM, NIfTI, NRRD and numpy 
+images. Masks are DICOM radiotherapy structure sets (RTSTRUCT), or volumetric data with integer labels (e.g. 1, 2, 
+etc.).
+
+Below is a minimal working example for extracting features from a single image file and its mask.
+
+```python
+from mirp import extract_features
+
+feature_data = extract_features(
+    image="path to image",
+    mask="path to mask",
+    base_discretisation_method="fixed_bin_number",
+    base_discretisation_n_bins=32
+)
+```
+Instead of providing the path to the image (`"path_to_image"`), a numpy image can be provided, and the same goes for 
+`"path to mask"`. The disadvantage of doing so is that voxel spacing cannot be determined. 
+
+MIRP also supports processing images and masks for multiple samples (e.g., patients). The syntax is much the same, 
+but depending on the file type and directory structure, additional arguments need to be specified. For example, 
+assume that files are organised in subfolders for each sample, i.e. `main_folder / sample_name / subfolder`. The 
+minimal working example is then:
+
+```python
+from mirp import extract_features
+
+feature_data = extract_features(
+    image="path to main image directory",
+    mask="path to main mask directory",
+    image_sub_folder="image subdirectory structure relative to main image directory",
+    mask_sub_folder="mask subdirectory structure relative to main mask directory",
+    base_discretisation_method="fixed_bin_number",
+    base_discretisation_n_bins=32
+)
+```
+The above example will compute features sequentially. MIRP supports parallel processing using the `ray` package. 
+Feature computation can be parallelized by specifying the `num_cpus` argument, e.g. `num_cpus=2` for two CPU threads.
+
+## Examples - Image Preprocessing for Deep Learning
+Deep learning-based radiomics is an alternative to using predefined quantitative features. MIRP supports 
+preprocessing of images and masks using the same standardized workflow that is used for computing features.
+
+Below is a minimal working example for preprocessing deep learning images. Note that MIRP uses the numpy notation 
+for indexing, i.e. indices are ordered [*z*, *y*, *x*].
+
+```python
+from mirp import deep_learning_preprocessing
+
+processed_images = deep_learning_preprocessing(
+    image="path to image",
+    mask="path to mask",
+    crop_size=[50, 224, 224]
+)
+```
+
+## Examples - Summarising Image Metadata
+
+MIRP can also summarise image metadata. This is particularly relevant for DICOM files that have considerable 
+metadata. Other files, e.g. NIfTI, only have metadata related to position and spacing of the image.
+
+Below is a minimal working example for extracting metadata from a single image file.
+```python
+from mirp import extract_image_parameters
+
+image_parameters = extract_image_parameters(
+    image="path to image"
+)
+```
+
+MIRP also supports extracting metadata from multiple files. For example, assume that files are organised in 
+subfolders for each sample, i.e. `main_folder / sample_name / subfolder`. The minimal working example is then:
+```python
+from mirp import extract_image_parameters
+
+image_parameters = extract_image_parameters(
+    image="path to main image directory",
+    image_sub_folder="image subdirectory structure relative to main image directory"
+)
+```
+
+## Examples - Finding labels
+
+MIRP can identify which labels are present in masks. For a single mask file, labels can be retrieved as follows:
+```python
+from mirp import extract_mask_labels
+
+mask_labels = extract_mask_labels(
+    mask="path to mask"
+)
+```
+
+MIRP supports extracting labels from multiple masks. For example, assume that files are organised in subfolders for 
+each sample, i.e. `main_folder / sample_name / subfolder`. The minimal working example is then:
+```python
+from mirp import extract_mask_labels
+mask_labels = extract_mask_labels(
+    mask="path to main mask directory",
+    mask_sub_folder="mask subdirectory structure relative to main mask directory"
+)
+```
 
 # Citation info
-If you use MIRP, please cite the following work:
+A publication for MIRP is forthcoming. For now, please cite the following work:
 ```Zwanenburg A, Leger S, Agolli L, Pilz K, Troost EG, Richter C, Löck S. Assessing robustness of radiomic features by image perturbation. Scientific reports. 2019 Jan 24;9(1):614.```
 
-# Usage
-To be documented.
-
-# Developers
+# Developers and contributors
 
-MIRP was developed by:
+MIRP is developed by:
 * Alex Zwanenburg
+
+We would like thank the following contributors:
 * Stefan Leger
 * Sebastian Starke
diff --git a/VERSION.txt b/VERSION.txt
@@ -1 +1 @@
-1.2.0
+2.0.0
diff --git a/conda/conda_build_config.yaml b/conda/conda_build_config.yaml
@@ -1,5 +1,2 @@
 python:
-  - 3.7
-  - 3.8
-  - 3.9
-  - 3.10
+  - 3.11
diff --git a/conda/meta.yaml b/conda/meta.yaml
@@ -1,6 +1,6 @@
 package:
   name: "mirp"
-  version: "1.2.0"
+  version: "2.0.0"
 
 source:
   path: ../
@@ -16,10 +16,10 @@ requirements:
     - pandas >=1.0.0
     - pydicom >=1.3.0
     - pywavelets >=1.0.0
-    - scikit-image >=0.15.0
-    - SimpleITK >=1.2.0
+    - scikit-image >=0.19.0
     - itk>=5.1.0
     - matplotlib >= 3.0.0
+    - ray >= 2.7.0
 
   run:
     - python
@@ -28,10 +28,10 @@ requirements:
     - pandas >=1.0.0
     - pydicom >=1.3.0
     - pywavelets >=1.0.0
-    - scikit-image >=0.15.0
-    - SimpleITK >=1.2.0
+    - scikit-image >=0.19.0
     - itk>=5.1.0
     - matplotlib >= 3.0.0
+    - ray >= 2.7.0
 
 about:
   home: https://github.com/oncoray/mirp

diff --git a/docs/.buildinfo b/docs/.buildinfo
@@ -0,0 +1,4 @@
+# Sphinx build info version 1
+# This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done.
+config: 9c9bf8ce1e6064b92ac7bf61594b8542
+tags: 645f666f9bcd5a90fca523b33c5a78b7
diff --git a/mirp/imageFilters/riesz.py → docs/.nojekyll b/mirp/imageFilters/riesz.py → docs/.nojekyll
-Original file line number
+Diff line change
@@ -1,5 +1,2 @@
     python:
-      - 3.7
-      - 3.8
-      - 3.9
-      - 3.10
+      - 3.11