jamesdolezal commited on
Commit
9815a01
1 Parent(s): 9f1ce61

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +57 -0
README.md CHANGED
@@ -1,3 +1,60 @@
1
  ---
2
  license: gpl-3.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: gpl-3.0
3
  ---
4
+
5
+ # Breast Estrogen Receptor (ER) GAN v1 Model Card
6
+ This model card describes a model associated with a manuscript that is currently under review. Links to the manuscript will be provided once publicly available.
7
+
8
+ ## Model Details
9
+ - **Developed by:** James Dolezal
10
+ - **Model type:** Generative adversarial network
11
+ - **Language(s):** English
12
+ - **License:** GPL-3.0
13
+ - **Model Description:** This is a StyleGAN2 model that can generate synthetic H&E pathologic images of breast cancer. The GAN is conditioned on estrogen receptor (ER) status as determined by immunohistochemical testing, with categories ER-negative (=0) and ER-positive (=1).
14
+ - **Image processing:** This model generates images at 512 x 512 px resolution and was trained on lossless (PNG) pathologic images at 400 x 400 μm magnification.
15
+ - **Resources for more information:** [GitHub Repository](https://github.com/jamesdolezal/histologic-sheep)
16
+
17
+ # Uses
18
+
19
+ ## Examples
20
+ This model is a [StyleGAN2](https://github.com/NVlabs/stylegan3) model and can be used with any StyleGAN-compatible scripts and tools. The [GitHub repository](https://github.com/jamesdolezal/histologic-sheep) associated with his model includes detailed information on how to interface with the GAN, generate images, and perform class blending via embedding interpolation.
21
+
22
+ ## Direct Use
23
+ This model is intended for research purposes only. Possible research areas and tasks include
24
+
25
+ - Applications in educational settings.
26
+ - Research on pathology classification models for breast cancer.
27
+
28
+ Excluded uses are described below.
29
+
30
+ ### Misuse and Out-of-Scope Use
31
+ Output from this model should not be used in a clinical setting or be provided to patients, physicians, or any other health care members directly involved in their health care outside the context of an approved research protocol. Using the model in a clinical setting outside the context of an approved research protocol is a misuse of this model. This includes influencing a patient's health care treatment in any way based on output from this model.
32
+
33
+ ### Limitations
34
+
35
+ The model does not generate images reflective of estrogen receptor status in a manner which controls for possible underlying biological bias, such tumor grade or histological subtype.
36
+
37
+ ### Bias
38
+ This model was trained on The Cancer Genome Atlas (TCGA), which contains patient data from communities and cultures which may not reflect the general population. This datasets is comprised of images from multiple institutions, which may introduce a potential source of bias from site-specific batch effects ([Howard, 2021](https://www.nature.com/articles/s41467-021-24698-1)).
39
+
40
+ ## Training
41
+
42
+ **Training Data**
43
+ The following dataset was used to train the model:
44
+
45
+ - The Cancer Genome Atlas (TCGA), THCA cohort (see next section)
46
+
47
+ This model was trained on a total of 1,048 slides, with 228 ER-negative tumor and 820 ER-positive tumors.
48
+
49
+ **Training Procedure**
50
+ Each whole-slide image was sectioned into smaller images in a grid-wise fashion in order to extract tiles from whole-slide images at 400 x 400 μm. Image tiles were extracted at the nearest downsample layer, and resized to 512 x 512 px using [Libvips](https://www.libvips.org/API/current/libvips-resample.html#vips-resize). During training, images are randomly flipped and rotated (90, 180, 270). Training is otherwise identical to the official StyleGAN2 implementation.
51
+
52
+ Additional training information:
53
+
54
+ - **Hardware:** 4 x A100 GPUs
55
+ - **Batch size:** 32
56
+ - **R1 gamma:** 1.6384
57
+ - **Training time:** 10,000 kimg
58
+
59
+ ## Evaluation Results
60
+ External evaluation results are currently under peer review and will be posted once publicly available.