Update model card with wavelets option
Browse files
README.md
CHANGED
@@ -76,6 +76,74 @@ The model achieved a classification accuracy of **81%** on the PANDA subset and
|
|
76 |
| **Hibou** | **83.1%** | 1.455e-06 | 0.10 |
|
77 |
| **Histoencoder** | 81.6% | **1.003e-06** | - |
|
78 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
79 |
## Limitations and Biases
|
80 |
Although this model was trained for a specific prostate histopathology analysis task, there are several limitations and biases:
|
81 |
- Performance may be affected by the quality of input images, particularly in cases of low resolution or noise.
|
|
|
76 |
| **Hibou** | **83.1%** | 1.455e-06 | 0.10 |
|
77 |
| **Histoencoder** | 81.6% | **1.003e-06** | - |
|
78 |
|
79 |
+
|
80 |
+
## Wavelet Decomposition
|
81 |
+
|
82 |
+
As previously mentioned, histopathology images are highly discontinuous, noisy, and often visually similar. Therefore, applying a filter to these images might help abstract their information, enabling more stable and potentially more effective training. This is why I believe that incorporating wavelet decomposition before the forward pass in our XCiT model could be a promising approach.
|
83 |
+
|
84 |
+
### Overview of 3D Wavelet Decomposition
|
85 |
+
|
86 |
+
3D wavelet decomposition is a method well-suited for analyzing volumetric data, such as \(224 \times 224 \times 3\) images, by extracting localized information at different spatial scales.
|
87 |
+
|
88 |
+
Wavelets are oscillating functions localized in time and space, used to decompose a signal \( f(x, y, z) \) into multiple scales and orientations. The 3D wavelet transform is defined as:
|
89 |
+
|
90 |
+
\[
|
91 |
+
W_\psi f(j, \theta, x, y, z) = f \ast \psi_{j, \theta}(x, y, z),
|
92 |
+
\]
|
93 |
+
|
94 |
+
where \( \psi_{j, \theta} \) is a 3D wavelet with:
|
95 |
+
|
96 |
+
- \( j \): a scale defining the spatial resolution,
|
97 |
+
- \( \theta \): a specific spatial orientation,
|
98 |
+
- \( \ast \): the 3D convolution operator.
|
99 |
+
|
100 |
+
Common 3D wavelets include Morlet and Haar wavelets, which are effective for capturing directional variations.
|
101 |
+
|
102 |
+
### 3D Scattering: Invariant Extension
|
103 |
+
|
104 |
+
3D scattering is a method related to wavelet decomposition that produces representations invariant to transformations (e.g., translation, rotation). This ensures that histopathology images are invariant in the wavelet coefficient domain, thereby enabling better generalization.
|
105 |
+
|
106 |
+
#### Step 1: Wavelet Decomposition
|
107 |
+
A 3D wavelet is applied to extract first-scale coefficients:
|
108 |
+
|
109 |
+
\[
|
110 |
+
U_1(x, y, z) = |f \ast \psi_{j_1, \theta_1}(x, y, z)|.
|
111 |
+
\]
|
112 |
+
|
113 |
+
#### Step 2: Higher-Level Coefficient Extraction
|
114 |
+
The coefficients \( U_1 \) are further transformed to capture secondary information:
|
115 |
+
|
116 |
+
\[
|
117 |
+
U_2(x, y, z) = |U_1 \ast \psi_{j_2, \theta_2}(x, y, z)|.
|
118 |
+
\]
|
119 |
+
|
120 |
+
This process can be repeated across multiple levels \( m \), forming a hierarchical cascade.
|
121 |
+
|
122 |
+
It is worth noting that these wavelet operations share similarities with CNNs, where convolution layers are applied. This highlights that wavelet decomposition is foundational to computer vision based on CNNs.
|
123 |
+
|
124 |
+
#### Step 3: Invariant Aggregation
|
125 |
+
At each level, a non-linear operator is applied to create invariant representations (the following is an example of such an operation):
|
126 |
+
|
127 |
+
\[
|
128 |
+
S_m = \int |U_m| \, dx \, dy \, dz.
|
129 |
+
\]
|
130 |
+
|
131 |
+
These \( S_m \) coefficients can then be used for downstream tasks.
|
132 |
+
|
133 |
+
Having introduced this idea, further testing is needed.
|
134 |
+
|
135 |
+
### Testing the Idea
|
136 |
+
|
137 |
+
We conducted small-scale experiments using Haar wavelets, considering a single decomposition scale and focusing on the "Approximation" of the image.
|
138 |
+
|
139 |
+
Despite these limitations, training revealed some potential. We tested this idea on the PANDA subset benchmark and **Bony_wave** achieved a 83% accuracy on the test.
|
140 |
+
|
141 |
+
|
142 |
+
|
143 |
+
|
144 |
+
|
145 |
+
|
146 |
+
|
147 |
## Limitations and Biases
|
148 |
Although this model was trained for a specific prostate histopathology analysis task, there are several limitations and biases:
|
149 |
- Performance may be affected by the quality of input images, particularly in cases of low resolution or noise.
|