Update README.md
Browse files
README.md
CHANGED
@@ -13,7 +13,7 @@ metrics:
|
|
13 |
|
14 |
This model is trained by the Netherlands Forensic Institute.
|
15 |
It can be used for linking snippets of exploded (heavy) fireworks to the type of firework that they originate from.
|
16 |
-
You may find an application that uses this model at [www.vuurwerkverkenner.nl](www.vuurwerkverkenner.nl).
|
17 |
|
18 |
## Architecture
|
19 |
|
@@ -26,13 +26,13 @@ Therefore, we require a one-shot model, which we construct as follows.
|
|
26 |
### Embedding model
|
27 |
|
28 |
First, we train an embedder that produces similar embeddings for snippets and wrappers of the same category, and dissimilar embeddings for different categories.
|
29 |
-
The embedding model is based on the Vision Transformer architecture (see [https://arxiv.org/abs/2010.11929
|
30 |
It has the following specifications:
|
31 |
|
32 |
* Model: ViT-B/32
|
33 |
* Input: RBG image of 640x640 pixels
|
34 |
* Output/embedding layer size: 512
|
35 |
-
* Training loss: TripletSemiHardLoss (see [https://www.tensorflow.org/addons/tutorials/losses_triplet
|
36 |
* Fixed learning rate of 0.000015 with Adam optimizer
|
37 |
* Epochs: 100
|
38 |
|
@@ -91,8 +91,8 @@ Overall, we find that the model performs very well for classes that are present
|
|
91 |
|
92 |
### Lab snippets
|
93 |
|
94 |
-
|
95 |
-
| Metric | Worst-case | Best-case | Worst-case | Best-case |
|
96 |
|--------------|------------|-----------|------------|-----------|
|
97 |
| Accuracy @ 1 | 0.21 | 0.99 | 0.64 | 0.99 |
|
98 |
| Accuracy @ 3 | 0.35 | 1.00 | 0.73 | 1.00 |
|
@@ -102,7 +102,7 @@ Overall, we find that the model performs very well for classes that are present
|
|
102 |
|
103 |
### Mock snippets
|
104 |
|
105 |
-
| Metric | Without
|
106 |
|--------------|------------------------|------------------------|
|
107 |
| Accuracy @ 1 | 0.99 | 0.99 |
|
108 |
| Accuracy @ 3 | 0.99 | 1.00 |
|
@@ -110,7 +110,7 @@ Overall, we find that the model performs very well for classes that are present
|
|
110 |
| Accuracy @ 10| 0.99 | 1.00 |
|
111 |
| Accuracy @ 25| 1.00 | 1.00 |
|
112 |
|
113 |
-
Note that the final model is trained on all data, so we expect performance to increase as compared to these metrics.
|
114 |
|
115 |
### Limitations
|
116 |
|
@@ -124,4 +124,4 @@ Moreover, when the type of firework under investigation is very new or rare, it
|
|
124 |
## Using the model
|
125 |
|
126 |
The model is intended to be used with the Vuurwerkverkenner application, which contains code for running the model.
|
127 |
-
The source code for the application may be found on (https://github.com/NetherlandsForensicInstitute/vuurwerkverkenner)
|
|
|
13 |
|
14 |
This model is trained by the Netherlands Forensic Institute.
|
15 |
It can be used for linking snippets of exploded (heavy) fireworks to the type of firework that they originate from.
|
16 |
+
You may find an application that uses this model at [www.vuurwerkverkenner.nl](https://www.vuurwerkverkenner.nl).
|
17 |
|
18 |
## Architecture
|
19 |
|
|
|
26 |
### Embedding model
|
27 |
|
28 |
First, we train an embedder that produces similar embeddings for snippets and wrappers of the same category, and dissimilar embeddings for different categories.
|
29 |
+
The embedding model is based on the Vision Transformer architecture (see [arXiv](https://arxiv.org/abs/2010.11929)).
|
30 |
It has the following specifications:
|
31 |
|
32 |
* Model: ViT-B/32
|
33 |
* Input: RBG image of 640x640 pixels
|
34 |
* Output/embedding layer size: 512
|
35 |
+
* Training loss: TripletSemiHardLoss (see [TensorFlow.org](https://www.tensorflow.org/addons/tutorials/losses_triplet)) with batch size 10 (2 anchors, 2 positives, 2 negatives)
|
36 |
* Fixed learning rate of 0.000015 with Adam optimizer
|
37 |
* Epochs: 100
|
38 |
|
|
|
91 |
|
92 |
### Lab snippets
|
93 |
|
94 |
+
|
95 |
+
| Metric | Worst-case (without text filter) | Best-case (without text filter) | Worst-case (with text filter) | Best-case (with text filter) |
|
96 |
|--------------|------------|-----------|------------|-----------|
|
97 |
| Accuracy @ 1 | 0.21 | 0.99 | 0.64 | 0.99 |
|
98 |
| Accuracy @ 3 | 0.35 | 1.00 | 0.73 | 1.00 |
|
|
|
102 |
|
103 |
### Mock snippets
|
104 |
|
105 |
+
| Metric | Without text filter | With text filter |
|
106 |
|--------------|------------------------|------------------------|
|
107 |
| Accuracy @ 1 | 0.99 | 0.99 |
|
108 |
| Accuracy @ 3 | 0.99 | 1.00 |
|
|
|
110 |
| Accuracy @ 10| 0.99 | 1.00 |
|
111 |
| Accuracy @ 25| 1.00 | 1.00 |
|
112 |
|
113 |
+
Note that the final model is trained on all data, so we expect performance to increase somewhat as compared to these metrics.
|
114 |
|
115 |
### Limitations
|
116 |
|
|
|
124 |
## Using the model
|
125 |
|
126 |
The model is intended to be used with the Vuurwerkverkenner application, which contains code for running the model.
|
127 |
+
The source code for the application may be found on [GitHub](https://github.com/NetherlandsForensicInstitute/vuurwerkverkenner).
|