metadata
language: en
tags:
- video-classification
license: apache-2.0
datasets:
- ucf101
metrics:
- accuracy
- top-5-accuracy
pipeline_tag: video-classification
model-index:
- name: i3d-kinetics-400
results:
- task:
type: video-classification
name: Video Classification
dataset:
name: UCF101
type: ucf101
metrics:
- name: Accuracy
type: accuracy
value: 0.95
- name: Top-5 Accuracy
type: top-5-accuracy
value: 0.95
I3D Kinetics-400
This model is a fine-tuned version of the Inflated 3D Convnet model for action recognition, trained on the Kinetics-400 dataset.
Model Description
The I3D (Inflated 3D Convnet) model is designed for video classification tasks. It extends 2D convolutions to 3D, enabling the model to capture spatiotemporal features from video frames.
Intended Uses
The model can be used for action recognition in videos. It is particularly suited for tasks involving the classification of human activities.
Training Data
The model was fine-tuned on the UCF101 dataset, which consists of 13,320 videos belonging to 101 action categories.
Performance
The model achieves an accuracy of 90% and a top-5 accuracy of 95% on the UCF101 test set.
Example Usage
from transformers import pipeline
model = pipeline("video-classification", model="Mouwiya/i3d-kinetics-400")
# Example video path
video_path = "path_to_your_video.mp4"
# Perform video classification
results = model(video_path)
print(results)