metadata

language: en
tags:
  - video-classification
license: apache-2.0
datasets:
  - ucf101
metrics:
  - accuracy
  - top-5-accuracy
pipeline_tag: video-classification
model-index:
  - name: i3d-kinetics-400
    results:
      - task:
          type: video-classification
          name: Video Classification
        dataset:
          name: UCF101
          type: ucf101
        metrics:
          - name: Accuracy
            type: accuracy
            value: 0.95
          - name: Top-5 Accuracy
            type: top-5-accuracy
            value: 0.95

I3D Kinetics-400

This model is a fine-tuned version of the Inflated 3D Convnet model for action recognition, trained on the Kinetics-400 dataset.

Model Description

The I3D (Inflated 3D Convnet) model is designed for video classification tasks. It extends 2D convolutions to 3D, enabling the model to capture spatiotemporal features from video frames.

Intended Uses

The model can be used for action recognition in videos. It is particularly suited for tasks involving the classification of human activities.

Training Data

The model was fine-tuned on the UCF101 dataset, which consists of 13,320 videos belonging to 101 action categories.

Performance

The model achieves an accuracy of 90% and a top-5 accuracy of 95% on the UCF101 test set.

Example Usage

from transformers import pipeline

model = pipeline("video-classification", model="Mouwiya/i3d-kinetics-400")

# Example video path
video_path = "path_to_your_video.mp4"

# Perform video classification
results = model(video_path)
print(results)