File size: 1,992 Bytes
9ec0109
 
e4a2ac8
 
 
 
 
 
 
9ec0109
e4a2ac8
 
 
780ff80
 
 
8e008c9
780ff80
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
---
license: mit
datasets:
- deepghs/monochrome_danbooru
metrics:
- accuracy
pipeline_tag: image-classification
tags:
- art
---

The models used for determining whether an anime image is monochrome have a training size of 384.

|            Model            |  FLOPs | Accuracy |                                                        Confusion Matrix                                                       | Description                                                                                                     |
|:---------------------------:|:------:|:--------:|:-----------------------------------------------------------------------------------------------------------------------------:|-----------------------------------------------------------------------------------------------------------------|
|         caformer_s36        | 22.10G |  95.63%  |         [Confusion Matrix](https://huggingface.co/deepghs/monochrome_detect/blob/main/caformer_s36/plot_confusion.png)        | Model: caformer_s36 from timm                                                                                   |
|      caformer_s36_safe2     | 22.10G |  95.52%  |      [Confusion Matrix](https://huggingface.co/deepghs/monochrome_detect/blob/main/caformer_s36_safe2/plot_confusion.png)     | Model: caformer_s36 from timm, which have better precision and lower recall than caformer_s36                   |
|    mobilenetv3_large_100    |  0.63G |  95.40%  |    [Confusion Matrix](https://huggingface.co/deepghs/monochrome_detect/blob/main/mobilenetv3_large_100/plot_confusion.png)    | Model: mobilenetv3_large_100 from timm                                                                          |
| mobilenetv3_large_100_safe2 |  0.63G |  94.62%  | [Confusion Matrix](https://huggingface.co/deepghs/monochrome_detect/blob/main/mobilenetv3_large_100_safe2/plot_confusion.png) | Model: mobilenetv3_large_100 from timm, which have better precision and lower recall than mobilenetv3_large_100 |