---
title: ImgutilsVideoProcessor
emoji: 🖼️
colorFrom: indigo
colorTo: pink
sdk: gradio
sdk_version: 5.29.0
app_file: app.py
pinned: true
license: mit
short_description: 'Video Processor: Detection, Classification, Analysis'
---

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference


deepghs/anime_person_detection

| Model | FLOPS | Params | F1 Score | Threshold | F1 Plot | Confusion | Labels |
|---|---|---|---|---|---|---|---|
| person_detect_v1.2_s | 3.49k | 11.1M | 0.86 | 0.295 | plot | confusion | person |
| person_detect_v1.3_s | 3.49k | 11.1M | 0.86 | 0.324 | plot | confusion | person |
| person_detect_v0_x | 31.3k | 68.1M | N/A | N/A | N/A | N/A | person |
| person_detect_v0_m | 9.53k | 25.8M | 0.85 | 0.424 | plot | confusion | person |
| person_detect_v0_s | 3.49k | 11.1M | N/A | N/A | N/A | N/A | person |
| person_detect_v1.1_s | 3.49k | 11.1M | 0.86 | 0.384 | plot | confusion | person |
| person_detect_v1.1_n | 898 | 3.01M | 0.85 | 0.327 | plot | confusion | person |
| person_detect_v1.1_m | 9.53k | 25.8M | 0.87 | 0.348 | plot | confusion | person |
| person_detect_v1_m | 9.53k | 25.8M | 0.86 | 0.351 | plot | confusion | person |

deepghs/anime_halfbody_detection

| Model | FLOPS | Params | F1 Score | Threshold | F1 Plot | Confusion | Labels |
|---|---|---|---|---|---|---|---|
| halfbody_detect_v1.0_n | 898 | 3.01M | 0.94 | 0.512 | plot | confusion | halfbody |
| halfbody_detect_v1.0_s | 3.49k | 11.1M | 0.95 | 0.577 | plot | confusion | halfbody |
| halfbody_detect_v0.4_s | 3.49k | 11.1M | 0.93 | 0.517 | plot | confusion | halfbody |
| halfbody_detect_v0.3_s | 3.49k | 11.1M | 0.92 | 0.222 | plot | confusion | halfbody |
| halfbody_detect_v0.2_s | 3.49k | 11.1M | 0.94 | 0.548 | plot | confusion | halfbody |

deepghs/anime_head_detection

| Model | Type | FLOPS | Params | F1 Score | Threshold | precision(B) | recall(B) | mAP50(B) | mAP50-95(B) | F1 Plot | Confusion | Labels |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| head_detect_v2.0_x_yv11 | yolo | 195G | 56.9M | 0.93 | 0.458 | 0.95942 | 0.90938 | 0.96853 | 0.78938 | plot | confusion | head |
| head_detect_v2.0_l_yv11 | yolo | 87.3G | 25.3M | 0.93 | 0.432 | 0.95557 | 0.90905 | 0.9661 | 0.78709 | plot | confusion | head |
| head_detect_v2.0_m_yv11 | yolo | 68.2G | 20.1M | 0.93 | 0.42 | 0.95383 | 0.90658 | 0.96485 | 0.78511 | plot | confusion | head |
| head_detect_v2.0_s_yv11 | yolo | 21.5G | 9.43M | 0.92 | 0.383 | 0.954 | 0.89512 | 0.95789 | 0.77753 | plot | confusion | head |
| head_detect_v2.0_n_yv11 | yolo | 6.44G | 2.59M | 0.91 | 0.365 | 0.94815 | 0.87169 | 0.9452 | 0.75835 | plot | confusion | head |
| head_detect_v2.0_x | yolo | 227G | 61.6M | 0.93 | 0.459 | 0.95378 | 0.91123 | 0.96593 | 0.78767 | plot | confusion | head |
| head_detect_v2.0_l | yolo | 146G | 39.5M | 0.93 | 0.379 | 0.95124 | 0.91264 | 0.96458 | 0.78627 | plot | confusion | head |
| head_detect_v2.0_m | yolo | 79.1G | 25.9M | 0.93 | 0.397 | 0.95123 | 0.90701 | 0.96403 | 0.78342 | plot | confusion | head |
| head_detect_v2.0_s | yolo | 28.6G | 11.1M | 0.92 | 0.413 | 0.95556 | 0.89197 | 0.95799 | 0.77833 | plot | confusion | head |
| head_detect_v2.0_n | yolo | 8.19G | 3.01M | 0.91 | 0.368 | 0.94633 | 0.87046 | 0.94361 | 0.75764 | plot | confusion | head |
| head_detect_v1.6_x_rtdetr | rtdetr | 232G | 67.3M | 0.93 | 0.559 | 0.95316 | 0.91697 | 0.96556 | 0.76682 | plot | confusion | head |
| head_detect_v1.6_l_rtdetr | rtdetr | 108G | 32.8M | 0.93 | 0.53 | 0.95113 | 0.90956 | 0.96218 | 0.76201 | plot | confusion | head |
| head_detect_v1.6_s_yv11 | yolo | 21.5G | 9.43M | 0.93 | 0.42 | 0.95273 | 0.90558 | 0.96327 | 0.78566 | plot | confusion | head |
| head_detect_v1.6_n_yv11 | yolo | 6.44G | 2.59M | 0.92 | 0.385 | 0.95561 | 0.87798 | 0.95086 | 0.76765 | plot | confusion | head |
| head_detect_v1.6_s_yv9 | yolo | 22.7G | 6.32M | 0.93 | 0.419 | 0.95464 | 0.90425 | 0.9627 | 0.78663 | plot | confusion | head |
| head_detect_v1.6_t_yv9 | yolo | 6.7G | 1.77M | 0.91 | 0.332 | 0.94968 | 0.8792 | 0.95069 | 0.76789 | plot | confusion | head |
| head_detect_v1.6_x | yolo | 258G | 68.2M | 0.94 | 0.448 | 0.9546 | 0.91873 | 0.96878 | 0.79502 | plot | confusion | head |
| head_detect_v1.6_l | yolo | 165G | 43.6M | 0.94 | 0.458 | 0.95733 | 0.92018 | 0.96868 | 0.79428 | plot | confusion | head |
| head_detect_v1.6_s_yv10 | yolo | 24.8G | 8.07M | 0.93 | 0.406 | 0.95424 | 0.90074 | 0.96201 | 0.78713 | plot | confusion | head |
| head_detect_v1.6_n_yv10 | yolo | 8.39G | 2.71M | 0.91 | 0.374 | 0.94845 | 0.87492 | 0.9503 | 0.77059 | plot | confusion | head |
| head_detect_v1.6_s | yolo | 28.6G | 11.1M | 0.93 | 0.381 | 0.95333 | 0.90587 | 0.96241 | 0.78688 | plot | confusion | head |
| head_detect_v1.6_n | yolo | 8.19G | 3.01M | 0.92 | 0.38 | 0.94835 | 0.88436 | 0.95051 | 0.76766 | plot | confusion | head |
| head_detect_v1.5_s | yolo | 28.6G | 11.1M | 0.94 | 0.453 | 0.96014 | 0.92275 | 0.96829 | 0.80674 | plot | confusion | head |
| head_detect_v1.5_n | yolo | 8.19G | 3.01M | 0.93 | 0.396 | 0.95719 | 0.90511 | 0.9612 | 0.78841 | plot | confusion | head |
| head_detect_v1.4_s | yolo | 28.6G | 11.1M | 0.94 | 0.472 | 0.96275 | 0.91875 | 0.96812 | 0.80417 | plot | confusion | head |
| head_detect_v1.4_n | yolo | 8.19G | 3.01M | 0.93 | 0.396 | 0.9557 | 0.90559 | 0.96075 | 0.78689 | plot | confusion | head |
| head_detect_v1.3_s | yolo | 28.6G | 11.1M | 0.94 | 0.423 | 0.95734 | 0.9257 | 0.97037 | 0.80391 | plot | confusion | head |
| head_detect_v1.3_n | yolo | 8.19G | 3.01M | 0.93 | 0.409 | 0.95254 | 0.90674 | 0.96258 | 0.7844 | plot | confusion | head |
| head_detect_v1.2_s | yolo | 28.6G | 11.1M | 0.94 | 0.415 | 0.95756 | 0.9271 | 0.97097 | 0.80514 | plot | confusion | head |
| head_detect_v1.2_n | yolo | 8.19G | 3.01M | 0.93 | 0.471 | 0.96309 | 0.89766 | 0.9647 | 0.78928 | plot | confusion | head |
| head_detect_v1.1_s | yolo | 28.6G | 11.1M | 0.94 | 0.485 | 0.96191 | 0.91892 | 0.97069 | 0.80182 | plot | confusion | head |
| head_detect_v1.0_l | yolo | 165G | 43.6M | 0.94 | 0.579 | 0.95881 | 0.91532 | 0.96561 | 0.81417 | plot | confusion | head |
| head_detect_v1.0_x | yolo | 258G | 68.2M | 0.94 | 0.567 | 0.9597 | 0.91947 | 0.96682 | 0.8154 | plot | confusion | head |
| head_detect_v1.0_m | yolo | 79.1G | 25.9M | 0.94 | 0.489 | 0.95805 | 0.9196 | 0.96632 | 0.81383 | plot | confusion | head |
| head_detect_v1.0_s | yolo | 28.6G | 11.1M | 0.93 | 0.492 | 0.95267 | 0.91355 | 0.96245 | 0.80371 | plot | confusion | head |
| head_detect_v1.0_n | yolo | 8.19G | 3.01M | 0.92 | 0.375 | 0.93999 | 0.9002 | 0.95509 | 0.7849 | plot | confusion | head |
| head_detect_v0.5_s | yolo | 28.6G | 11.1M | 0.92 | 0.415 | 0.93908 | 0.9034 | 0.95697 | 0.77514 | plot | confusion | head |
| head_detect_v0.5_n | yolo | 8.19G | 3.01M | 0.91 | 0.446 | 0.93834 | 0.88034 | 0.94784 | 0.75251 | plot | confusion | head |
| head_detect_v0.5_s_pruned | yolo | 28.6G | 11.1M | 0.93 | 0.472 | 0.95455 | 0.89865 | 0.9584 | 0.79968 | plot | confusion | head |
| head_detect_v0.5_n_pruned | yolo | 8.19G | 3.01M | 0.91 | 0.523 | 0.95254 | 0.8743 | 0.95049 | 0.7807 | plot | confusion | head |
| head_detect_v0.5_m_pruned | yolo | 79.1G | 25.9M | 0.94 | 0.52 | 0.9609 | 0.91365 | 0.96501 | 0.81322 | plot | confusion | head |
| head_detect_v0.4_s | yolo | 28.6G | 11.1M | 0.92 | 0.405 | 0.93314 | 0.90274 | 0.95727 | 0.77193 | plot | confusion | head |
| head_detect_v0.4_s_fp | yolo | 28.6G | 11.1M | 0.91 | 0.445 | 0.93181 | 0.89113 | 0.95002 | 0.76302 | plot | confusion | head |
| head_detect_v0.3_s | yolo | 28.6G | 11.1M | 0.91 | 0.406 | 0.92457 | 0.90351 | 0.95785 | 0.78912 | plot | confusion | head |
| head_detect_v0.2_s_plus | yolo | 28.6G | 11.1M | 0.91 | 0.594 | 0.94239 | 0.8774 | 0.94909 | 0.77986 | plot | confusion | head |
| head_detect_v0.2_s | yolo | 28.6G | 11.1M | 0.9 | 0.461 | 0.91861 | 0.8898 | 0.94765 | 0.77541 | plot | confusion | head |
| head_detect_v0.1_s | yolo | 28.6G | 11.1M | 0.9 | 0.504 | 0.91576 | 0.88662 | 0.94213 | 0.7713 | plot | confusion | head |
| head_detect_v0_n | yolo | 8.19G | 3.01M | 0.9 | 0.316 | N/A | N/A | N/A | N/A | plot | confusion | head |
| head_detect_v0_s | yolo | 28.6G | 11.1M | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | head |

deepghs/ccip

| Model | F1 Score | Precision | Recall | Threshold | Cluster_2 | Cluster_Free |
|---|---|---|---|---|---|---|
| ccip-caformer_b36-24 | 0.940925 | 0.938254 | 0.943612 | 0.213231 | 0.89508 | 0.957017 |
| ccip-caformer-24-randaug-pruned | 0.917211 | 0.933481 | 0.901499 | 0.178475 | 0.890366 | 0.922375 |
| ccip-v2-caformer_s36-10 | 0.906422 | 0.932779 | 0.881513 | 0.207757 | 0.874592 | 0.89241 |
| ccip-caformer-6-randaug-pruned_fp32 | 0.878403 | 0.893648 | 0.863669 | 0.195122 | 0.810176 | 0.897904 |
| ccip-caformer-5_fp32 | 0.864363 | 0.90155 | 0.830121 | 0.183973 | 0.792051 | 0.862289 |
| ccip-caformer-4_fp32 | 0.844967 | 0.870553 | 0.820842 | 0.18367 | 0.795565 | 0.868133 |
| ccip-caformer_query-12 | 0.823928 | 0.871122 | 0.781585 | 0.141308 | 0.787237 | 0.809426 |
| ccip-caformer-23_randaug_fp32 | 0.81625 | 0.854134 | 0.781585 | 0.136797 | 0.745697 | 0.8068 |
| ccip-caformer-2-randaug-pruned_fp32 | 0.78561 | 0.800148 | 0.771592 | 0.171053 | 0.686617 | 0.728195 |
| ccip-caformer-2_fp32 | 0.755125 | 0.790172 | 0.723055 | 0.141275 | 0.64977 | 0.718516 |

  • F1 Score, Precision, and Recall treat "the characters in both images are the same" as the positive case. Threshold is the value at which the F1 Score curve reaches its maximum.
  • Cluster_2 is the score of the approximately optimal clustering found by tuning the eps value of the DBSCAN clustering algorithm with min_samples fixed at 2. The resulting clusters are compared with the true distribution using random_adjust_score (most likely the adjusted Rand score).
  • Cluster_Free is the score of the approximately optimal clustering found by tuning the max_eps and min_samples values of the OPTICS clustering algorithm. The resulting clusters are compared with the true distribution using the same random_adjust_score.
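
The notes above can be illustrated with a small, self-contained sketch. It is not the evaluation code behind the table. It assumes that a lower difference value means two images are more likely to show the same character, that random_adjust_score refers to the adjusted Rand index, and that DBSCAN noise points are all given the single label -1 when scoring. The function names (`best_f1_threshold`, `adjusted_rand_index`, `dbscan`, `tune_eps`) and the toy data are hypothetical.

```python
from collections import Counter
from math import comb


def best_f1_threshold(differences, is_same):
    """Return (threshold, f1) with the highest F1 over all candidate thresholds.

    Assumption: a pair is predicted "same character" when difference <= threshold.
    """
    best_threshold, best_f1 = None, 0.0
    for t in sorted(set(differences)):
        tp = sum(d <= t and s for d, s in zip(differences, is_same))
        fp = sum(d <= t and not s for d, s in zip(differences, is_same))
        fn = sum(d > t and s for d, s in zip(differences, is_same))
        if tp == 0:
            continue
        precision, recall = tp / (tp + fp), tp / (tp + fn)
        f1 = 2 * precision * recall / (precision + recall)
        if f1 > best_f1:
            best_threshold, best_f1 = t, f1
    return best_threshold, best_f1


def adjusted_rand_index(labels_true, labels_pred):
    """Adjusted Rand index between two labelings of the same items."""
    n = len(labels_true)
    sum_cells = sum(comb(c, 2) for c in Counter(zip(labels_true, labels_pred)).values())
    sum_true = sum(comb(c, 2) for c in Counter(labels_true).values())
    sum_pred = sum(comb(c, 2) for c in Counter(labels_pred).values())
    expected = sum_true * sum_pred / comb(n, 2)
    max_index = (sum_true + sum_pred) / 2
    if max_index == expected:  # both labelings are trivial
        return 1.0
    return (sum_cells - expected) / (max_index - expected)


def dbscan(dist, eps, min_samples=2):
    """Minimal DBSCAN on a precomputed distance matrix; noise is labeled -1."""
    n = len(dist)
    labels = [None] * n  # None means "not visited yet"
    cluster = -1
    for i in range(n):
        if labels[i] is not None:
            continue
        neighbors = [j for j in range(n) if dist[i][j] <= eps]
        if len(neighbors) < min_samples:  # the point counts as its own neighbor
            labels[i] = -1
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in neighbors if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:  # former noise becomes a border point
                labels[j] = cluster
                continue
            if labels[j] is not None:
                continue
            labels[j] = cluster
            reachable = [k for k in range(n) if dist[j][k] <= eps]
            if len(reachable) >= min_samples:  # core point: keep expanding
                queue.extend(reachable)
    return labels


def tune_eps(dist, labels_true, eps_candidates):
    """Return (eps, score) for the candidate eps with the highest adjusted Rand index."""
    scored = [(adjusted_rand_index(labels_true, dbscan(dist, eps)), eps)
              for eps in eps_candidates]
    best_score, best_eps = max(scored)
    return best_eps, best_score


# Toy pairwise data: difference values and whether each pair shows the same character.
differences = [0.05, 0.10, 0.15, 0.20, 0.30, 0.40]
is_same = [True, True, True, False, True, False]
print(best_f1_threshold(differences, is_same))

# Toy 1-D "features": two tight groups plus one outlier treated as its own character.
points = [0, 1, 2, 10, 11, 12, 50]
labels_true = [0, 0, 0, 1, 1, 1, 2]
dist = [[abs(a - b) for b in points] for a in points]
print(tune_eps(dist, labels_true, [0.5, 1.5, 9, 40]))
```

The same sweep idea would apply to Cluster_Free by replacing the DBSCAN call with an OPTICS implementation and searching over both max_eps and min_samples; that part is left out to keep the sketch short.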

deepghs/anime_aesthetic

| Name | FLOPS | Params | Accuracy | AUC | Confusion | Labels |
|---|---|---|---|---|---|---|
| caformer_s36_v0_ls0.2 | 22.10G | 37.22M | 34.68% | 0.7725 | confusion | masterpiece, best, great, good, normal, low, worst |
| swinv2pv3_v0_448_ls0.2 | 46.20G | 65.94M | 40.32% | 0.8188 | confusion | masterpiece, best, great, good, normal, low, worst |
| swinv2pv3_v0_448_ls0.2_x | 46.20G | 65.94M | 40.88% | 0.8214 | confusion | masterpiece, best, great, good, normal, low, worst |

deepghs/anime_face_detection

| Model | FLOPS | Params | F1 Score | Threshold | F1 Plot | Confusion | Labels |
|---|---|---|---|---|---|---|---|
| face_detect_v1.4_n | 898 | 3.01M | 0.94 | 0.278 | plot | confusion | face |
| face_detect_v1.4_s | 3.49k | 11.1M | 0.95 | 0.307 | plot | confusion | face |
| face_detect_v1.3_n | 898 | 3.01M | 0.93 | 0.305 | plot | confusion | face |
| face_detect_v1.2_s | 3.49k | 11.1M | 0.93 | 0.222 | plot | confusion | face |
| face_detect_v1.3_s | 3.49k | 11.1M | 0.93 | 0.259 | plot | confusion | face |
| face_detect_v1_s | 3.49k | 11.1M | 0.95 | 0.446 | plot | confusion | face |
| face_detect_v1_n | 898 | 3.01M | 0.95 | 0.458 | plot | confusion | face |
| face_detect_v0_n | 898 | 3.01M | 0.97 | 0.428 | plot | confusion | face |
| face_detect_v0_s | 3.49k | 11.1M | N/A | N/A | N/A | N/A | face |
| face_detect_v1.1_n | 898 | 3.01M | 0.94 | 0.373 | plot | confusion | face |
| face_detect_v1.1_s | 3.49k | 11.1M | 0.94 | 0.405 | plot | confusion | face |

deepghs/real_person_detection

| Model | Type | FLOPS | Params | F1 Score | Threshold | precision(B) | recall(B) | mAP50(B) | mAP50-95(B) | F1 Plot | Confusion | Labels |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| person_detect_v0_l_yv11 | yolo | 87.3G | 25.3M | 0.79 | 0.359 | 0.84037 | 0.74055 | 0.82796 | 0.57272 | plot | confusion | person |
| person_detect_v0_m_yv11 | yolo | 68.2G | 20.1M | 0.78 | 0.351 | 0.83393 | 0.73614 | 0.82195 | 0.56267 | plot | confusion | person |
| person_detect_v0_s_yv11 | yolo | 21.5G | 9.43M | 0.75 | 0.344 | 0.82356 | 0.6967 | 0.79224 | 0.52304 | plot | confusion | person |
| person_detect_v0_n_yv11 | yolo | 6.44G | 2.59M | 0.71 | 0.325 | 0.80096 | 0.64148 | 0.74612 | 0.46875 | plot | confusion | person |
| person_detect_v0_l | yolo | 165G | 43.6M | 0.79 | 0.359 | 0.83674 | 0.74182 | 0.82536 | 0.57022 | plot | confusion | person |
| person_detect_v0_m | yolo | 79.1G | 25.9M | 0.78 | 0.363 | 0.83439 | 0.72529 | 0.81314 | 0.55388 | plot | confusion | person |
| person_detect_v0_s | yolo | 28.6G | 11.1M | 0.76 | 0.346 | 0.82522 | 0.69696 | 0.79105 | 0.52201 | plot | confusion | person |
| person_detect_v0_n | yolo | 8.19G | 3.01M | 0.72 | 0.32 | 0.80883 | 0.64552 | 0.74996 | 0.47272 | plot | confusion | person |

deepghs/real_head_detection

| Model | Type | FLOPS | Params | F1 Score | Threshold | precision(B) | recall(B) | mAP50(B) | mAP50-95(B) | F1 Plot | Confusion | Labels |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| head_detect_v0_l_yv11 | yolo | 87.3G | 25.3M | 0.81 | 0.199 | 0.90226 | 0.72872 | 0.81049 | 0.5109 | plot | confusion | head |
| head_detect_v0_m_yv11 | yolo | 68.2G | 20.1M | 0.8 | 0.206 | 0.89855 | 0.72654 | 0.80704 | 0.50804 | plot | confusion | head |
| head_detect_v0_s_yv11 | yolo | 21.5G | 9.43M | 0.78 | 0.187 | 0.88726 | 0.69234 | 0.77518 | 0.47825 | plot | confusion | head |
| head_detect_v0_n_yv11 | yolo | 6.44G | 2.59M | 0.74 | 0.14 | 0.87359 | 0.64011 | 0.73393 | 0.44118 | plot | confusion | head |
| head_detect_v0_l | yolo | 165G | 43.6M | 0.81 | 0.234 | 0.89921 | 0.74092 | 0.81715 | 0.51615 | plot | confusion | head |
| head_detect_v0_m | yolo | 79.1G | 25.9M | 0.8 | 0.228 | 0.90006 | 0.72646 | 0.80614 | 0.50586 | plot | confusion | head |
| head_detect_v0_s | yolo | 28.6G | 11.1M | 0.78 | 0.182 | 0.89224 | 0.69382 | 0.77804 | 0.48067 | plot | confusion | head |
| head_detect_v0_n | yolo | 8.19G | 3.01M | 0.74 | 0.172 | 0.8728 | 0.64823 | 0.73865 | 0.44501 | plot | confusion | head |

deepghs/real_face_detection

| Model | Type | FLOPS | Params | F1 Score | Threshold | precision(B) | recall(B) | mAP50(B) | mAP50-95(B) | F1 Plot | Confusion | Labels |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| face_detect_v0_s_yv12 | yolo | 21.5G | 9.25M | 0.74 | 0.272 | 0.86931 | 0.6404 | 0.73074 | 0.42652 | plot | confusion | face |
| face_detect_v0_n_yv12 | yolo | 6.48G | 2.57M | 0.7 | 0.258 | 0.85246 | 0.59089 | 0.6793 | 0.39182 | plot | confusion | face |
| face_detect_v0_l_yv11 | yolo | 87.3G | 25.3M | 0.77 | 0.291 | 0.88458 | 0.67474 | 0.76666 | 0.45722 | plot | confusion | face |
| face_detect_v0_m_yv11 | yolo | 68.2G | 20.1M | 0.76 | 0.262 | 0.87947 | 0.67315 | 0.76073 | 0.45288 | plot | confusion | face |
| face_detect_v0_s_yv11 | yolo | 21.5G | 9.43M | 0.73 | 0.271 | 0.87001 | 0.63572 | 0.72683 | 0.42706 | plot | confusion | face |
| face_detect_v0_n_yv11 | yolo | 6.44G | 2.59M | 0.7 | 0.263 | 0.86044 | 0.58577 | 0.67641 | 0.38975 | plot | confusion | face |
| face_detect_v0_l | yolo | 165G | 43.6M | 0.76 | 0.277 | 0.87894 | 0.67335 | 0.76313 | 0.4532 | plot | confusion | face |
| face_detect_v0_m | yolo | 79.1G | 25.9M | 0.75 | 0.277 | 0.87687 | 0.66265 | 0.75114 | 0.44262 | plot | confusion | face |
| face_detect_v0_s | yolo | 28.6G | 11.1M | 0.73 | 0.282 | 0.86932 | 0.63557 | 0.72494 | 0.42219 | plot | confusion | face |
| face_detect_v0_n | yolo | 8.19G | 3.01M | 0.7 | 0.257 | 0.85337 | 0.58877 | 0.67471 | 0.38692 | plot | confusion | face |