emilioHugging commited on
Commit
b14fa26
·
verified ·
1 Parent(s): a25567e

Update model card with wavelets option

Browse files
Files changed (1) hide show
  1. README.md +68 -0
README.md CHANGED
@@ -76,6 +76,74 @@ The model achieved a classification accuracy of **81%** on the PANDA subset and
76
  | **Hibou** | **83.1%** | 1.455e-06 | 0.10 |
77
  | **Histoencoder** | 81.6% | **1.003e-06** | - |
78
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
79
  ## Limitations and Biases
80
  Although this model was trained for a specific prostate histopathology analysis task, there are several limitations and biases:
81
  - Performance may be affected by the quality of input images, particularly in cases of low resolution or noise.
 
76
  | **Hibou** | **83.1%** | 1.455e-06 | 0.10 |
77
  | **Histoencoder** | 81.6% | **1.003e-06** | - |
78
 
79
+
80
+ ## Wavelet Decomposition
81
+
82
+ As previously mentioned, histopathology images are highly discontinuous, noisy, and often visually similar. Therefore, applying a filter to these images might help abstract their information, enabling more stable and potentially more effective training. This is why I believe that incorporating wavelet decomposition before the forward pass in our XCiT model could be a promising approach.
83
+
84
+ ### Overview of 3D Wavelet Decomposition
85
+
86
+ 3D wavelet decomposition is a method well-suited for analyzing volumetric data, such as \(224 \times 224 \times 3\) images, by extracting localized information at different spatial scales.
87
+
88
+ Wavelets are oscillating functions localized in time and space, used to decompose a signal \( f(x, y, z) \) into multiple scales and orientations. The 3D wavelet transform is defined as:
89
+
90
+ \[
91
+ W_\psi f(j, \theta, x, y, z) = f \ast \psi_{j, \theta}(x, y, z),
92
+ \]
93
+
94
+ where \( \psi_{j, \theta} \) is a 3D wavelet with:
95
+
96
+ - \( j \): a scale defining the spatial resolution,
97
+ - \( \theta \): a specific spatial orientation,
98
+ - \( \ast \): the 3D convolution operator.
99
+
100
+ Common 3D wavelets include Morlet and Haar wavelets, which are effective for capturing directional variations.
101
+
102
+ ### 3D Scattering: Invariant Extension
103
+
104
+ 3D scattering is a method related to wavelet decomposition that produces representations invariant to transformations (e.g., translation, rotation). This ensures that histopathology images are invariant in the wavelet coefficient domain, thereby enabling better generalization.
105
+
106
+ #### Step 1: Wavelet Decomposition
107
+ A 3D wavelet is applied to extract first-scale coefficients:
108
+
109
+ \[
110
+ U_1(x, y, z) = |f \ast \psi_{j_1, \theta_1}(x, y, z)|.
111
+ \]
112
+
113
+ #### Step 2: Higher-Level Coefficient Extraction
114
+ The coefficients \( U_1 \) are further transformed to capture secondary information:
115
+
116
+ \[
117
+ U_2(x, y, z) = |U_1 \ast \psi_{j_2, \theta_2}(x, y, z)|.
118
+ \]
119
+
120
+ This process can be repeated across multiple levels \( m \), forming a hierarchical cascade.
121
+
122
+ It is worth noting that these wavelet operations share similarities with CNNs, where convolution layers are applied. This highlights that wavelet decomposition is foundational to computer vision based on CNNs.
123
+
124
+ #### Step 3: Invariant Aggregation
125
+ At each level, a non-linear operator is applied to create invariant representations (the following is an example of such an operation):
126
+
127
+ \[
128
+ S_m = \int |U_m| \, dx \, dy \, dz.
129
+ \]
130
+
131
+ These \( S_m \) coefficients can then be used for downstream tasks.
132
+
133
+ Having introduced this idea, further testing is needed.
134
+
135
+ ### Testing the Idea
136
+
137
+ We conducted small-scale experiments using Haar wavelets, considering a single decomposition scale and focusing on the "Approximation" of the image.
138
+
139
+ Despite these limitations, training revealed some potential. We tested this idea on the PANDA subset benchmark and **Bony_wave** achieved a 83% accuracy on the test.
140
+
141
+
142
+
143
+
144
+
145
+
146
+
147
  ## Limitations and Biases
148
  Although this model was trained for a specific prostate histopathology analysis task, there are several limitations and biases:
149
  - Performance may be affected by the quality of input images, particularly in cases of low resolution or noise.