add support for raw images

Browse files

Files changed (5) hide show

README.md +19 -14
configs/inference.json +13 -8
configs/metadata.json +2 -1
configs/train.json +1 -1
docs/README.md +19 -14

README.md CHANGED Viewed

@@ -15,32 +15,39 @@ LUNA16 is a public dataset of CT lung nodule detection. Using raw CT scans, the
 Disclaimer: We are not the host of the data. Please make sure to read the requirements and usage policies of the data and give credit to the authors of the dataset!
-## Data
 The dataset we are experimenting in this example is LUNA16 (https://luna16.grand-challenge.org/Home/), which is based on [LIDC/IDRI database](https://wiki.cancerimagingarchive.net/display/Public/LIDC-IDRI) [3,4,5].
 LUNA16 is a public dataset of CT lung nodule detection. Using raw CT scans, the goal is to identify locations of possible nodules, and to assign a probability for being a nodule to each location.
 Disclaimer: We are not the host of the data. Please make sure to read the requirements and usage policies of the data and give credit to the authors of the dataset! We acknowledge the National Cancer Institute and the Foundation for the National Institutes of Health, and their critical role in the creation of the free publicly available LIDC/IDRI Database used in this study.
 We follow the official 10-fold data splitting from LUNA16 challenge and generate data split json files using the script from [nnDetection](https://github.com/MIC-DKFZ/nnDetection/blob/main/projects/Task016_Luna/scripts/prepare.py).
-The resulted json files can be downloaded from https://github.com/Project-MONAI/MONAI-extra-test-data/releases/download/0.8.1/LUNA16_datasplit-20220615T233840Z-001.zip.
 In these files, the values of "box" are the ground truth boxes in world coordinate.
 The raw CT images in LUNA16 have various of voxel sizes. The first step is to resample them to the same voxel size.
-In this model, we resampled them into 0.703125 x 0.703125 x 1.25 mm. The code of resampling can be found in Section 3.1 of https://github.com/Project-MONAI/tutorials/tree/main/detection
-## Training configuration
 The training was performed with at least 12GB-memory GPUs.
 Actual Model Input: 192 x 192 x 80
-## Input and output formats
 Input: list of 1 channel 3D CT patches
 Output: dictionary of classification and box regression loss in training mode;
 list of dictionary of predicted box, classification label, and classification score in evaluation mode.
-## Scores
 The script to compute FROC sensitivity value on inference results can be found in https://github.com/Project-MONAI/tutorials/tree/main/detection
 This model achieves the following FROC sensitivity value on the validation data (our own split from the training dataset):
@@ -53,28 +60,26 @@ This model achieves the following FROC sensitivity value on the validation data
 **Table 1**. The FROC sensitivity values at the predefined false positive per scan thresholds of the LUNA16 challenge.
-## commands example
 Execute training:
 ```
 python -m monai.bundle run training --meta_file configs/metadata.json --config_file configs/train.json --logging_file configs/logging.conf
 ```
 Override the `train` config to execute evaluation with the trained model:
 ```
 python -m monai.bundle run evaluating --meta_file configs/metadata.json --config_file "['configs/train.json','configs/evaluate.json']" --logging_file configs/logging.conf
 ```
-Execute inference:
 ```
 python -m monai.bundle run evaluating --meta_file configs/metadata.json --config_file configs/inference.json --logging_file configs/logging.conf
 ```
-Note that in inference.json, the transform "AffineBoxToWorldCoordinated" in "postprocessing" has `"affine_lps_to_ras": true`.
-This depends on the input images. It is possible that your inference dataset should set "affine_lps_to_ras": false.
-Please set it as `true` only when the original images were read by itkreader with affine_lps_to_ras=True.
 # Disclaimer

 Disclaimer: We are not the host of the data. Please make sure to read the requirements and usage policies of the data and give credit to the authors of the dataset!
+## 1. Data
+### 1.1 Data description
 The dataset we are experimenting in this example is LUNA16 (https://luna16.grand-challenge.org/Home/), which is based on [LIDC/IDRI database](https://wiki.cancerimagingarchive.net/display/Public/LIDC-IDRI) [3,4,5].
 LUNA16 is a public dataset of CT lung nodule detection. Using raw CT scans, the goal is to identify locations of possible nodules, and to assign a probability for being a nodule to each location.
 Disclaimer: We are not the host of the data. Please make sure to read the requirements and usage policies of the data and give credit to the authors of the dataset! We acknowledge the National Cancer Institute and the Foundation for the National Institutes of Health, and their critical role in the creation of the free publicly available LIDC/IDRI Database used in this study.
+### 1.2 10-fold data splitting
 We follow the official 10-fold data splitting from LUNA16 challenge and generate data split json files using the script from [nnDetection](https://github.com/MIC-DKFZ/nnDetection/blob/main/projects/Task016_Luna/scripts/prepare.py).
+Please download the resulted json files from https://github.com/Project-MONAI/MONAI-extra-test-data/releases/download/0.8.1/LUNA16_datasplit-20220615T233840Z-001.zip.
 In these files, the values of "box" are the ground truth boxes in world coordinate.
+### 1.3 Data resampling
 The raw CT images in LUNA16 have various of voxel sizes. The first step is to resample them to the same voxel size.
+In this model, we resampled them into 0.703125 x 0.703125 x 1.25 mm.
+Please following the instruction in Section 3.1 of https://github.com/Project-MONAI/tutorials/tree/main/detection to do the resampling.
+## 2. Training configuration
 The training was performed with at least 12GB-memory GPUs.
 Actual Model Input: 192 x 192 x 80
+## 3. Input and output formats
 Input: list of 1 channel 3D CT patches
 Output: dictionary of classification and box regression loss in training mode;
 list of dictionary of predicted box, classification label, and classification score in evaluation mode.
+## 4. Results and Scores
 The script to compute FROC sensitivity value on inference results can be found in https://github.com/Project-MONAI/tutorials/tree/main/detection
 This model achieves the following FROC sensitivity value on the validation data (our own split from the training dataset):
 **Table 1**. The FROC sensitivity values at the predefined false positive per scan thresholds of the LUNA16 challenge.
+## 5. Commands example
 Execute training:
 ```
 python -m monai.bundle run training --meta_file configs/metadata.json --config_file configs/train.json --logging_file configs/logging.conf
 ```
 Override the `train` config to execute evaluation with the trained model:
 ```
 python -m monai.bundle run evaluating --meta_file configs/metadata.json --config_file "['configs/train.json','configs/evaluate.json']" --logging_file configs/logging.conf
 ```
+Execute inference on resampled LUNA16 images (resampled following Section 3.1 of https://github.com/Project-MONAI/tutorials/tree/main/detection) by setting `"whether_raw_luna16": false` in `inference.json`:
 ```
 python -m monai.bundle run evaluating --meta_file configs/metadata.json --config_file configs/inference.json --logging_file configs/logging.conf
 ```
+With the same command, we can execute inference on raw LUNA16 images by setting `"whether_raw_luna16": true` in `inference.json`. Remember to also set `"data_list_file_path": "$@bundle_root + '/LUNA16_datasplit/original/dataset_fold0.json'"` and change `"data_file_base_dir"`.
+Note that in inference.json, the transform "LoadImaged" in "preprocessing" and "AffineBoxToWorldCoordinated" in "postprocessing" has `"affine_lps_to_ras": true`.
+This depends on the input images. LUNA16 needs `"affine_lps_to_ras": true`.
+It is possible that your inference dataset should set `"affine_lps_to_ras": false`.
 # Disclaimer

configs/inference.json CHANGED Viewed

@@ -1,4 +1,6 @@
 {
     "imports": [
         "$import glob",
         "$import os"
@@ -6,7 +8,7 @@
     "bundle_root": "./",
     "ckpt_dir": "$@bundle_root + '/models'",
     "output_dir": "$@bundle_root + '/eval'",
-    "data_list_file_path": "$@bundle_root + '/annotation/dataset_fold0.json'",
     "data_file_base_dir": "/home/canz/Projects/datasets/LUNA16/93176/Images_resample",
     "test_datalist": "$monai.data.load_decathlon_datalist(@data_list_file_path, is_segmentation=True, data_list_key='validation', base_dir=@data_file_base_dir)",
     "device": "$torch.device('cuda:0' if torch.cuda.is_available() else 'cpu')",
@@ -71,16 +73,18 @@
         "_target_": "Compose",
         "transforms": [
             {
-                "_target_": "DeleteItemsd",
-                "keys": [
-                    "box",
-                    "label"
-                ]
             },
             {
                 "_target_": "LoadImaged",
                 "keys": "image",
-                "meta_key_postfix": "meta_dict"
             },
             {
                 "_target_": "EnsureChannelFirstd",
@@ -99,7 +103,8 @@
                     0.703125,
                     0.703125,
                     1.25
-                ]
             },
             {
                 "_target_": "ScaleIntensityRanged",

 {
+    "whether_raw_luna16": false,
+    "whether_resampled_luna16": "$(not @whether_raw_luna16)",
     "imports": [
         "$import glob",
         "$import os"
     "bundle_root": "./",
     "ckpt_dir": "$@bundle_root + '/models'",
     "output_dir": "$@bundle_root + '/eval'",
+    "data_list_file_path": "$@bundle_root + '/LUNA16_datasplit/dataset_fold0.json'",
     "data_file_base_dir": "/home/canz/Projects/datasets/LUNA16/93176/Images_resample",
     "test_datalist": "$monai.data.load_decathlon_datalist(@data_list_file_path, is_segmentation=True, data_list_key='validation', base_dir=@data_file_base_dir)",
     "device": "$torch.device('cuda:0' if torch.cuda.is_available() else 'cpu')",
         "_target_": "Compose",
         "transforms": [
             {
+                "_target_": "LoadImaged",
+                "keys": "image",
+                "meta_key_postfix": "meta_dict",
+                "_disabled_": "@whether_raw_luna16"
             },
             {
                 "_target_": "LoadImaged",
                 "keys": "image",
+                "meta_key_postfix": "meta_dict",
+                "reader": "itkreader",
+                "affine_lps_to_ras": true,
+                "_disabled_": "@whether_resampled_luna16"
             },
             {
                 "_target_": "EnsureChannelFirstd",
                     0.703125,
                     0.703125,
                     1.25
+                ],
+                "_disabled_": "@whether_resampled_luna16"
             },
             {
                 "_target_": "ScaleIntensityRanged",

configs/metadata.json CHANGED Viewed

@@ -1,7 +1,8 @@
 {
     "schema": "https://github.com/Project-MONAI/MONAI-extra-test-data/releases/download/0.8.1/meta_schema_20220324.json",
-    "version": "0.3.0",
     "changelog": {
         "0.3.0": "update license files",
         "0.2.0": "unify naming",
         "0.1.1": "add reference for LIDC dataset",

 {
     "schema": "https://github.com/Project-MONAI/MONAI-extra-test-data/releases/download/0.8.1/meta_schema_20220324.json",
+    "version": "0.4.0",
     "changelog": {
+        "0.4.0": "add support for raw images",
         "0.3.0": "update license files",
         "0.2.0": "unify naming",
         "0.1.1": "add reference for LIDC dataset",

configs/train.json CHANGED Viewed

@@ -6,7 +6,7 @@
     "bundle_root": "./",
     "ckpt_dir": "$@bundle_root + '/models'",
     "output_dir": "$@bundle_root + '/eval'",
-    "data_list_file_path": "$@bundle_root + '/annotation/dataset_fold0.json'",
     "data_file_base_dir": "/home/canz/Projects/datasets/LUNA16/93176/Images_resample",
     "train_datalist": "$monai.data.load_decathlon_datalist(@data_list_file_path, is_segmentation=True, data_list_key='training', base_dir=@data_file_base_dir)",
     "device": "$torch.device('cuda:0' if torch.cuda.is_available() else 'cpu')",

     "bundle_root": "./",
     "ckpt_dir": "$@bundle_root + '/models'",
     "output_dir": "$@bundle_root + '/eval'",
+    "data_list_file_path": "$@bundle_root + '/LUNA16_datasplit/dataset_fold0.json'",
     "data_file_base_dir": "/home/canz/Projects/datasets/LUNA16/93176/Images_resample",
     "train_datalist": "$monai.data.load_decathlon_datalist(@data_list_file_path, is_segmentation=True, data_list_key='training', base_dir=@data_file_base_dir)",
     "device": "$torch.device('cuda:0' if torch.cuda.is_available() else 'cpu')",

docs/README.md CHANGED Viewed

@@ -8,32 +8,39 @@ LUNA16 is a public dataset of CT lung nodule detection. Using raw CT scans, the
 Disclaimer: We are not the host of the data. Please make sure to read the requirements and usage policies of the data and give credit to the authors of the dataset!
-## Data
 The dataset we are experimenting in this example is LUNA16 (https://luna16.grand-challenge.org/Home/), which is based on [LIDC/IDRI database](https://wiki.cancerimagingarchive.net/display/Public/LIDC-IDRI) [3,4,5].
 LUNA16 is a public dataset of CT lung nodule detection. Using raw CT scans, the goal is to identify locations of possible nodules, and to assign a probability for being a nodule to each location.
 Disclaimer: We are not the host of the data. Please make sure to read the requirements and usage policies of the data and give credit to the authors of the dataset! We acknowledge the National Cancer Institute and the Foundation for the National Institutes of Health, and their critical role in the creation of the free publicly available LIDC/IDRI Database used in this study.
 We follow the official 10-fold data splitting from LUNA16 challenge and generate data split json files using the script from [nnDetection](https://github.com/MIC-DKFZ/nnDetection/blob/main/projects/Task016_Luna/scripts/prepare.py).
-The resulted json files can be downloaded from https://github.com/Project-MONAI/MONAI-extra-test-data/releases/download/0.8.1/LUNA16_datasplit-20220615T233840Z-001.zip.
 In these files, the values of "box" are the ground truth boxes in world coordinate.
 The raw CT images in LUNA16 have various of voxel sizes. The first step is to resample them to the same voxel size.
-In this model, we resampled them into 0.703125 x 0.703125 x 1.25 mm. The code of resampling can be found in Section 3.1 of https://github.com/Project-MONAI/tutorials/tree/main/detection
-## Training configuration
 The training was performed with at least 12GB-memory GPUs.
 Actual Model Input: 192 x 192 x 80
-## Input and output formats
 Input: list of 1 channel 3D CT patches
 Output: dictionary of classification and box regression loss in training mode;
 list of dictionary of predicted box, classification label, and classification score in evaluation mode.
-## Scores
 The script to compute FROC sensitivity value on inference results can be found in https://github.com/Project-MONAI/tutorials/tree/main/detection
 This model achieves the following FROC sensitivity value on the validation data (our own split from the training dataset):
@@ -46,28 +53,26 @@ This model achieves the following FROC sensitivity value on the validation data
 **Table 1**. The FROC sensitivity values at the predefined false positive per scan thresholds of the LUNA16 challenge.
-## commands example
 Execute training:
 ```
 python -m monai.bundle run training --meta_file configs/metadata.json --config_file configs/train.json --logging_file configs/logging.conf
 ```
 Override the `train` config to execute evaluation with the trained model:
 ```
 python -m monai.bundle run evaluating --meta_file configs/metadata.json --config_file "['configs/train.json','configs/evaluate.json']" --logging_file configs/logging.conf
 ```
-Execute inference:
 ```
 python -m monai.bundle run evaluating --meta_file configs/metadata.json --config_file configs/inference.json --logging_file configs/logging.conf
 ```
-Note that in inference.json, the transform "AffineBoxToWorldCoordinated" in "postprocessing" has `"affine_lps_to_ras": true`.
-This depends on the input images. It is possible that your inference dataset should set "affine_lps_to_ras": false.
-Please set it as `true` only when the original images were read by itkreader with affine_lps_to_ras=True.
 # Disclaimer

 Disclaimer: We are not the host of the data. Please make sure to read the requirements and usage policies of the data and give credit to the authors of the dataset!
+## 1. Data
+### 1.1 Data description
 The dataset we are experimenting in this example is LUNA16 (https://luna16.grand-challenge.org/Home/), which is based on [LIDC/IDRI database](https://wiki.cancerimagingarchive.net/display/Public/LIDC-IDRI) [3,4,5].
 LUNA16 is a public dataset of CT lung nodule detection. Using raw CT scans, the goal is to identify locations of possible nodules, and to assign a probability for being a nodule to each location.
 Disclaimer: We are not the host of the data. Please make sure to read the requirements and usage policies of the data and give credit to the authors of the dataset! We acknowledge the National Cancer Institute and the Foundation for the National Institutes of Health, and their critical role in the creation of the free publicly available LIDC/IDRI Database used in this study.
+### 1.2 10-fold data splitting
 We follow the official 10-fold data splitting from LUNA16 challenge and generate data split json files using the script from [nnDetection](https://github.com/MIC-DKFZ/nnDetection/blob/main/projects/Task016_Luna/scripts/prepare.py).
+Please download the resulted json files from https://github.com/Project-MONAI/MONAI-extra-test-data/releases/download/0.8.1/LUNA16_datasplit-20220615T233840Z-001.zip.
 In these files, the values of "box" are the ground truth boxes in world coordinate.
+### 1.3 Data resampling
 The raw CT images in LUNA16 have various of voxel sizes. The first step is to resample them to the same voxel size.
+In this model, we resampled them into 0.703125 x 0.703125 x 1.25 mm.
+Please following the instruction in Section 3.1 of https://github.com/Project-MONAI/tutorials/tree/main/detection to do the resampling.
+## 2. Training configuration
 The training was performed with at least 12GB-memory GPUs.
 Actual Model Input: 192 x 192 x 80
+## 3. Input and output formats
 Input: list of 1 channel 3D CT patches
 Output: dictionary of classification and box regression loss in training mode;
 list of dictionary of predicted box, classification label, and classification score in evaluation mode.
+## 4. Results and Scores
 The script to compute FROC sensitivity value on inference results can be found in https://github.com/Project-MONAI/tutorials/tree/main/detection
 This model achieves the following FROC sensitivity value on the validation data (our own split from the training dataset):
 **Table 1**. The FROC sensitivity values at the predefined false positive per scan thresholds of the LUNA16 challenge.
+## 5. Commands example
 Execute training:
 ```
 python -m monai.bundle run training --meta_file configs/metadata.json --config_file configs/train.json --logging_file configs/logging.conf
 ```
 Override the `train` config to execute evaluation with the trained model:
 ```
 python -m monai.bundle run evaluating --meta_file configs/metadata.json --config_file "['configs/train.json','configs/evaluate.json']" --logging_file configs/logging.conf
 ```
+Execute inference on resampled LUNA16 images (resampled following Section 3.1 of https://github.com/Project-MONAI/tutorials/tree/main/detection) by setting `"whether_raw_luna16": false` in `inference.json`:
 ```
 python -m monai.bundle run evaluating --meta_file configs/metadata.json --config_file configs/inference.json --logging_file configs/logging.conf
 ```
+With the same command, we can execute inference on raw LUNA16 images by setting `"whether_raw_luna16": true` in `inference.json`. Remember to also set `"data_list_file_path": "$@bundle_root + '/LUNA16_datasplit/original/dataset_fold0.json'"` and change `"data_file_base_dir"`.
+Note that in inference.json, the transform "LoadImaged" in "preprocessing" and "AffineBoxToWorldCoordinated" in "postprocessing" has `"affine_lps_to_ras": true`.
+This depends on the input images. LUNA16 needs `"affine_lps_to_ras": true`.
+It is possible that your inference dataset should set `"affine_lps_to_ras": false`.
 # Disclaimer