Spaces:
Running
Running
A newer version of the Gradio SDK is available:
5.29.1
metadata
title: ImgutilsVideoProcessor
emoji: 🖼️
colorFrom: indigo
colorTo: pink
sdk: gradio
sdk_version: 5.29.0
app_file: app.py
pinned: true
license: mit
short_description: 'Video Processor: Detection, Classification, Analysis'
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
deepghs/anime_person_detection
Model | FLOPS | Params | F1 Score | Threshold | F1 Plot | Confusion | Labels |
---|---|---|---|---|---|---|---|
person_detect_v1.2_s | 3.49k | 11.1M | 0.86 | 0.295 | plot | confusion | person |
person_detect_v1.3_s | 3.49k | 11.1M | 0.86 | 0.324 | plot | confusion | person |
person_detect_v0_x | 31.3k | 68.1M | N/A | N/A | N/A | N/A | person |
person_detect_v0_m | 9.53k | 25.8M | 0.85 | 0.424 | plot | confusion | person |
person_detect_v0_s | 3.49k | 11.1M | N/A | N/A | N/A | N/A | person |
person_detect_v1.1_s | 3.49k | 11.1M | 0.86 | 0.384 | plot | confusion | person |
person_detect_v1.1_n | 898 | 3.01M | 0.85 | 0.327 | plot | confusion | person |
person_detect_v1.1_m | 9.53k | 25.8M | 0.87 | 0.348 | plot | confusion | person |
person_detect_v1_m | 9.53k | 25.8M | 0.86 | 0.351 | plot | confusion | person |
deepghs/anime_halfbody_detection
Model | FLOPS | Params | F1 Score | Threshold | F1 Plot | Confusion | Labels |
---|---|---|---|---|---|---|---|
halfbody_detect_v1.0_n | 898 | 3.01M | 0.94 | 0.512 | plot | confusion | halfbody |
halfbody_detect_v1.0_s | 3.49k | 11.1M | 0.95 | 0.577 | plot | confusion | halfbody |
halfbody_detect_v0.4_s | 3.49k | 11.1M | 0.93 | 0.517 | plot | confusion | halfbody |
halfbody_detect_v0.3_s | 3.49k | 11.1M | 0.92 | 0.222 | plot | confusion | halfbody |
halfbody_detect_v0.2_s | 3.49k | 11.1M | 0.94 | 0.548 | plot | confusion | halfbody |
deepghs/anime_head_detection
Model | Type | FLOPS | Params | F1 Score | Threshold | precision(B) | recall(B) | mAP50(B) | mAP50-95(B) | F1 Plot | Confusion | Labels |
---|---|---|---|---|---|---|---|---|---|---|---|---|
head_detect_v2.0_x_yv11 | yolo | 195G | 56.9M | 0.93 | 0.458 | 0.95942 | 0.90938 | 0.96853 | 0.78938 | plot | confusion | head |
head_detect_v2.0_l_yv11 | yolo | 87.3G | 25.3M | 0.93 | 0.432 | 0.95557 | 0.90905 | 0.9661 | 0.78709 | plot | confusion | head |
head_detect_v2.0_m_yv11 | yolo | 68.2G | 20.1M | 0.93 | 0.42 | 0.95383 | 0.90658 | 0.96485 | 0.78511 | plot | confusion | head |
head_detect_v2.0_s_yv11 | yolo | 21.5G | 9.43M | 0.92 | 0.383 | 0.954 | 0.89512 | 0.95789 | 0.77753 | plot | confusion | head |
head_detect_v2.0_n_yv11 | yolo | 6.44G | 2.59M | 0.91 | 0.365 | 0.94815 | 0.87169 | 0.9452 | 0.75835 | plot | confusion | head |
head_detect_v2.0_x | yolo | 227G | 61.6M | 0.93 | 0.459 | 0.95378 | 0.91123 | 0.96593 | 0.78767 | plot | confusion | head |
head_detect_v2.0_l | yolo | 146G | 39.5M | 0.93 | 0.379 | 0.95124 | 0.91264 | 0.96458 | 0.78627 | plot | confusion | head |
head_detect_v2.0_m | yolo | 79.1G | 25.9M | 0.93 | 0.397 | 0.95123 | 0.90701 | 0.96403 | 0.78342 | plot | confusion | head |
head_detect_v2.0_s | yolo | 28.6G | 11.1M | 0.92 | 0.413 | 0.95556 | 0.89197 | 0.95799 | 0.77833 | plot | confusion | head |
head_detect_v2.0_n | yolo | 8.19G | 3.01M | 0.91 | 0.368 | 0.94633 | 0.87046 | 0.94361 | 0.75764 | plot | confusion | head |
head_detect_v1.6_x_rtdetr | rtdetr | 232G | 67.3M | 0.93 | 0.559 | 0.95316 | 0.91697 | 0.96556 | 0.76682 | plot | confusion | head |
head_detect_v1.6_l_rtdetr | rtdetr | 108G | 32.8M | 0.93 | 0.53 | 0.95113 | 0.90956 | 0.96218 | 0.76201 | plot | confusion | head |
head_detect_v1.6_s_yv11 | yolo | 21.5G | 9.43M | 0.93 | 0.42 | 0.95273 | 0.90558 | 0.96327 | 0.78566 | plot | confusion | head |
head_detect_v1.6_n_yv11 | yolo | 6.44G | 2.59M | 0.92 | 0.385 | 0.95561 | 0.87798 | 0.95086 | 0.76765 | plot | confusion | head |
head_detect_v1.6_s_yv9 | yolo | 22.7G | 6.32M | 0.93 | 0.419 | 0.95464 | 0.90425 | 0.9627 | 0.78663 | plot | confusion | head |
head_detect_v1.6_t_yv9 | yolo | 6.7G | 1.77M | 0.91 | 0.332 | 0.94968 | 0.8792 | 0.95069 | 0.76789 | plot | confusion | head |
head_detect_v1.6_x | yolo | 258G | 68.2M | 0.94 | 0.448 | 0.9546 | 0.91873 | 0.96878 | 0.79502 | plot | confusion | head |
head_detect_v1.6_l | yolo | 165G | 43.6M | 0.94 | 0.458 | 0.95733 | 0.92018 | 0.96868 | 0.79428 | plot | confusion | head |
head_detect_v1.6_s_yv10 | yolo | 24.8G | 8.07M | 0.93 | 0.406 | 0.95424 | 0.90074 | 0.96201 | 0.78713 | plot | confusion | head |
head_detect_v1.6_n_yv10 | yolo | 8.39G | 2.71M | 0.91 | 0.374 | 0.94845 | 0.87492 | 0.9503 | 0.77059 | plot | confusion | head |
head_detect_v1.6_s | yolo | 28.6G | 11.1M | 0.93 | 0.381 | 0.95333 | 0.90587 | 0.96241 | 0.78688 | plot | confusion | head |
head_detect_v1.6_n | yolo | 8.19G | 3.01M | 0.92 | 0.38 | 0.94835 | 0.88436 | 0.95051 | 0.76766 | plot | confusion | head |
head_detect_v1.5_s | yolo | 28.6G | 11.1M | 0.94 | 0.453 | 0.96014 | 0.92275 | 0.96829 | 0.80674 | plot | confusion | head |
head_detect_v1.5_n | yolo | 8.19G | 3.01M | 0.93 | 0.396 | 0.95719 | 0.90511 | 0.9612 | 0.78841 | plot | confusion | head |
head_detect_v1.4_s | yolo | 28.6G | 11.1M | 0.94 | 0.472 | 0.96275 | 0.91875 | 0.96812 | 0.80417 | plot | confusion | head |
head_detect_v1.4_n | yolo | 8.19G | 3.01M | 0.93 | 0.396 | 0.9557 | 0.90559 | 0.96075 | 0.78689 | plot | confusion | head |
head_detect_v1.3_s | yolo | 28.6G | 11.1M | 0.94 | 0.423 | 0.95734 | 0.9257 | 0.97037 | 0.80391 | plot | confusion | head |
head_detect_v1.3_n | yolo | 8.19G | 3.01M | 0.93 | 0.409 | 0.95254 | 0.90674 | 0.96258 | 0.7844 | plot | confusion | head |
head_detect_v1.2_s | yolo | 28.6G | 11.1M | 0.94 | 0.415 | 0.95756 | 0.9271 | 0.97097 | 0.80514 | plot | confusion | head |
head_detect_v1.2_n | yolo | 8.19G | 3.01M | 0.93 | 0.471 | 0.96309 | 0.89766 | 0.9647 | 0.78928 | plot | confusion | head |
head_detect_v1.1_s | yolo | 28.6G | 11.1M | 0.94 | 0.485 | 0.96191 | 0.91892 | 0.97069 | 0.80182 | plot | confusion | head |
head_detect_v1.0_l | yolo | 165G | 43.6M | 0.94 | 0.579 | 0.95881 | 0.91532 | 0.96561 | 0.81417 | plot | confusion | head |
head_detect_v1.0_x | yolo | 258G | 68.2M | 0.94 | 0.567 | 0.9597 | 0.91947 | 0.96682 | 0.8154 | plot | confusion | head |
head_detect_v1.0_m | yolo | 79.1G | 25.9M | 0.94 | 0.489 | 0.95805 | 0.9196 | 0.96632 | 0.81383 | plot | confusion | head |
head_detect_v1.0_s | yolo | 28.6G | 11.1M | 0.93 | 0.492 | 0.95267 | 0.91355 | 0.96245 | 0.80371 | plot | confusion | head |
head_detect_v1.0_n | yolo | 8.19G | 3.01M | 0.92 | 0.375 | 0.93999 | 0.9002 | 0.95509 | 0.7849 | plot | confusion | head |
head_detect_v0.5_s | yolo | 28.6G | 11.1M | 0.92 | 0.415 | 0.93908 | 0.9034 | 0.95697 | 0.77514 | plot | confusion | head |
head_detect_v0.5_n | yolo | 8.19G | 3.01M | 0.91 | 0.446 | 0.93834 | 0.88034 | 0.94784 | 0.75251 | plot | confusion | head |
head_detect_v0.5_s_pruned | yolo | 28.6G | 11.1M | 0.93 | 0.472 | 0.95455 | 0.89865 | 0.9584 | 0.79968 | plot | confusion | head |
head_detect_v0.5_n_pruned | yolo | 8.19G | 3.01M | 0.91 | 0.523 | 0.95254 | 0.8743 | 0.95049 | 0.7807 | plot | confusion | head |
head_detect_v0.5_m_pruned | yolo | 79.1G | 25.9M | 0.94 | 0.52 | 0.9609 | 0.91365 | 0.96501 | 0.81322 | plot | confusion | head |
head_detect_v0.4_s | yolo | 28.6G | 11.1M | 0.92 | 0.405 | 0.93314 | 0.90274 | 0.95727 | 0.77193 | plot | confusion | head |
head_detect_v0.4_s_fp | yolo | 28.6G | 11.1M | 0.91 | 0.445 | 0.93181 | 0.89113 | 0.95002 | 0.76302 | plot | confusion | head |
head_detect_v0.3_s | yolo | 28.6G | 11.1M | 0.91 | 0.406 | 0.92457 | 0.90351 | 0.95785 | 0.78912 | plot | confusion | head |
head_detect_v0.2_s_plus | yolo | 28.6G | 11.1M | 0.91 | 0.594 | 0.94239 | 0.8774 | 0.94909 | 0.77986 | plot | confusion | head |
head_detect_v0.2_s | yolo | 28.6G | 11.1M | 0.9 | 0.461 | 0.91861 | 0.8898 | 0.94765 | 0.77541 | plot | confusion | head |
head_detect_v0.1_s | yolo | 28.6G | 11.1M | 0.9 | 0.504 | 0.91576 | 0.88662 | 0.94213 | 0.7713 | plot | confusion | head |
head_detect_v0_n | yolo | 8.19G | 3.01M | 0.9 | 0.316 | N/A | N/A | N/A | N/A | plot | confusion | head |
head_detect_v0_s | yolo | 28.6G | 11.1M | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | head |
deepghs/ccip
Model | F1 Score | Precision | Recall | Threshold | Cluster_2 | Cluster_Free |
---|---|---|---|---|---|---|
ccip-caformer_b36-24 | 0.940925 | 0.938254 | 0.943612 | 0.213231 | 0.89508 | 0.957017 |
ccip-caformer-24-randaug-pruned | 0.917211 | 0.933481 | 0.901499 | 0.178475 | 0.890366 | 0.922375 |
ccip-v2-caformer_s36-10 | 0.906422 | 0.932779 | 0.881513 | 0.207757 | 0.874592 | 0.89241 |
ccip-caformer-6-randaug-pruned_fp32 | 0.878403 | 0.893648 | 0.863669 | 0.195122 | 0.810176 | 0.897904 |
ccip-caformer-5_fp32 | 0.864363 | 0.90155 | 0.830121 | 0.183973 | 0.792051 | 0.862289 |
ccip-caformer-4_fp32 | 0.844967 | 0.870553 | 0.820842 | 0.18367 | 0.795565 | 0.868133 |
ccip-caformer_query-12 | 0.823928 | 0.871122 | 0.781585 | 0.141308 | 0.787237 | 0.809426 |
ccip-caformer-23_randaug_fp32 | 0.81625 | 0.854134 | 0.781585 | 0.136797 | 0.745697 | 0.8068 |
ccip-caformer-2-randaug-pruned_fp32 | 0.78561 | 0.800148 | 0.771592 | 0.171053 | 0.686617 | 0.728195 |
ccip-caformer-2_fp32 | 0.755125 | 0.790172 | 0.723055 | 0.141275 | 0.64977 | 0.718516 |
- The calculation of
F1 Score
,Precision
, andRecall
considers "the characters in both images are the same" as a positive case.Threshold
is determined by finding the maximum value on the F1 Score curve. Cluster_2
represents the approximate optimal clustering solution obtained by tuning the eps value in DBSCAN clustering algorithm with min_samples set to2
, and evaluating the similarity between the obtained clusters and the true distribution using therandom_adjust_score
.Cluster_Free
represents the approximate optimal solution obtained by tuning themax_eps
andmin_samples
values in the OPTICS clustering algorithm, and evaluating the similarity between the obtained clusters and the true distribution using therandom_adjust_score
.
deepghs/anime_aesthetic
Name | FLOPS | Params | Accuracy | AUC | Confusion | Labels |
---|---|---|---|---|---|---|
caformer_s36_v0_ls0.2 | 22.10G | 37.22M | 34.68% | 0.7725 | confusion | masterpiece , best , great , good , normal , low , worst |
swinv2pv3_v0_448_ls0.2 | 46.20G | 65.94M | 40.32% | 0.8188 | confusion | masterpiece , best , great , good , normal , low , worst |
swinv2pv3_v0_448_ls0.2_x | 46.20G | 65.94M | 40.88% | 0.8214 | confusion | masterpiece , best , great , good , normal , low , worst |
deepghs/anime_face_detection
Model | FLOPS | Params | F1 Score | Threshold | F1 Plot | Confusion | Labels |
---|---|---|---|---|---|---|---|
face_detect_v1.4_n | 898 | 3.01M | 0.94 | 0.278 | plot | confusion | face |
face_detect_v1.4_s | 3.49k | 11.1M | 0.95 | 0.307 | plot | confusion | face |
face_detect_v1.3_n | 898 | 3.01M | 0.93 | 0.305 | plot | confusion | face |
face_detect_v1.2_s | 3.49k | 11.1M | 0.93 | 0.222 | plot | confusion | face |
face_detect_v1.3_s | 3.49k | 11.1M | 0.93 | 0.259 | plot | confusion | face |
face_detect_v1_s | 3.49k | 11.1M | 0.95 | 0.446 | plot | confusion | face |
face_detect_v1_n | 898 | 3.01M | 0.95 | 0.458 | plot | confusion | face |
face_detect_v0_n | 898 | 3.01M | 0.97 | 0.428 | plot | confusion | face |
face_detect_v0_s | 3.49k | 11.1M | N/A | N/A | N/A | N/A | face |
face_detect_v1.1_n | 898 | 3.01M | 0.94 | 0.373 | plot | confusion | face |
face_detect_v1.1_s | 3.49k | 11.1M | 0.94 | 0.405 | plot | confusion | face |
deepghs/real_person_detection
Model | Type | FLOPS | Params | F1 Score | Threshold | precision(B) | recall(B) | mAP50(B) | mAP50-95(B) | F1 Plot | Confusion | Labels |
---|---|---|---|---|---|---|---|---|---|---|---|---|
person_detect_v0_l_yv11 | yolo | 87.3G | 25.3M | 0.79 | 0.359 | 0.84037 | 0.74055 | 0.82796 | 0.57272 | plot | confusion | person |
person_detect_v0_m_yv11 | yolo | 68.2G | 20.1M | 0.78 | 0.351 | 0.83393 | 0.73614 | 0.82195 | 0.56267 | plot | confusion | person |
person_detect_v0_s_yv11 | yolo | 21.5G | 9.43M | 0.75 | 0.344 | 0.82356 | 0.6967 | 0.79224 | 0.52304 | plot | confusion | person |
person_detect_v0_n_yv11 | yolo | 6.44G | 2.59M | 0.71 | 0.325 | 0.80096 | 0.64148 | 0.74612 | 0.46875 | plot | confusion | person |
person_detect_v0_l | yolo | 165G | 43.6M | 0.79 | 0.359 | 0.83674 | 0.74182 | 0.82536 | 0.57022 | plot | confusion | person |
person_detect_v0_m | yolo | 79.1G | 25.9M | 0.78 | 0.363 | 0.83439 | 0.72529 | 0.81314 | 0.55388 | plot | confusion | person |
person_detect_v0_s | yolo | 28.6G | 11.1M | 0.76 | 0.346 | 0.82522 | 0.69696 | 0.79105 | 0.52201 | plot | confusion | person |
person_detect_v0_n | yolo | 8.19G | 3.01M | 0.72 | 0.32 | 0.80883 | 0.64552 | 0.74996 | 0.47272 | plot | confusion | person |
deepghs/real_head_detection
Model | Type | FLOPS | Params | F1 Score | Threshold | precision(B) | recall(B) | mAP50(B) | mAP50-95(B) | F1 Plot | Confusion | Labels |
---|---|---|---|---|---|---|---|---|---|---|---|---|
head_detect_v0_l_yv11 | yolo | 87.3G | 25.3M | 0.81 | 0.199 | 0.90226 | 0.72872 | 0.81049 | 0.5109 | plot | confusion | head |
head_detect_v0_m_yv11 | yolo | 68.2G | 20.1M | 0.8 | 0.206 | 0.89855 | 0.72654 | 0.80704 | 0.50804 | plot | confusion | head |
head_detect_v0_s_yv11 | yolo | 21.5G | 9.43M | 0.78 | 0.187 | 0.88726 | 0.69234 | 0.77518 | 0.47825 | plot | confusion | head |
head_detect_v0_n_yv11 | yolo | 6.44G | 2.59M | 0.74 | 0.14 | 0.87359 | 0.64011 | 0.73393 | 0.44118 | plot | confusion | head |
head_detect_v0_l | yolo | 165G | 43.6M | 0.81 | 0.234 | 0.89921 | 0.74092 | 0.81715 | 0.51615 | plot | confusion | head |
head_detect_v0_m | yolo | 79.1G | 25.9M | 0.8 | 0.228 | 0.90006 | 0.72646 | 0.80614 | 0.50586 | plot | confusion | head |
head_detect_v0_s | yolo | 28.6G | 11.1M | 0.78 | 0.182 | 0.89224 | 0.69382 | 0.77804 | 0.48067 | plot | confusion | head |
head_detect_v0_n | yolo | 8.19G | 3.01M | 0.74 | 0.172 | 0.8728 | 0.64823 | 0.73865 | 0.44501 | plot | confusion | head |
deepghs/real_face_detection
Model | Type | FLOPS | Params | F1 Score | Threshold | precision(B) | recall(B) | mAP50(B) | mAP50-95(B) | F1 Plot | Confusion | Labels |
---|---|---|---|---|---|---|---|---|---|---|---|---|
face_detect_v0_s_yv12 | yolo | 21.5G | 9.25M | 0.74 | 0.272 | 0.86931 | 0.6404 | 0.73074 | 0.42652 | plot | confusion | face |
face_detect_v0_n_yv12 | yolo | 6.48G | 2.57M | 0.7 | 0.258 | 0.85246 | 0.59089 | 0.6793 | 0.39182 | plot | confusion | face |
face_detect_v0_l_yv11 | yolo | 87.3G | 25.3M | 0.77 | 0.291 | 0.88458 | 0.67474 | 0.76666 | 0.45722 | plot | confusion | face |
face_detect_v0_m_yv11 | yolo | 68.2G | 20.1M | 0.76 | 0.262 | 0.87947 | 0.67315 | 0.76073 | 0.45288 | plot | confusion | face |
face_detect_v0_s_yv11 | yolo | 21.5G | 9.43M | 0.73 | 0.271 | 0.87001 | 0.63572 | 0.72683 | 0.42706 | plot | confusion | face |
face_detect_v0_n_yv11 | yolo | 6.44G | 2.59M | 0.7 | 0.263 | 0.86044 | 0.58577 | 0.67641 | 0.38975 | plot | confusion | face |
face_detect_v0_l | yolo | 165G | 43.6M | 0.76 | 0.277 | 0.87894 | 0.67335 | 0.76313 | 0.4532 | plot | confusion | face |
face_detect_v0_m | yolo | 79.1G | 25.9M | 0.75 | 0.277 | 0.87687 | 0.66265 | 0.75114 | 0.44262 | plot | confusion | face |
face_detect_v0_s | yolo | 28.6G | 11.1M | 0.73 | 0.282 | 0.86932 | 0.63557 | 0.72494 | 0.42219 | plot | confusion | face |
face_detect_v0_n | yolo | 8.19G | 3.01M | 0.7 | 0.257 | 0.85337 | 0.58877 | 0.67471 | 0.38692 | plot | confusion | face |