Edit model card

Pose Estimation: front,side,back

Model description

This model predicts the person's body position relative to the camera: front, side, back. It was trained on Lucy in the Sky images.

This model is a fine-tuned version of google/vit-base-patch16-224-in21k.

Training and evaluation data

It achieves the following results on the evaluation set:

  • Loss: 0.2524
  • Accuracy: 0.9355

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 16
  • eval_batch_size: 16
  • seed: 42
  • gradient_accumulation_steps: 4
  • total_train_batch_size: 64
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_ratio: 0.1
  • num_epochs: 20

Framework versions

  • Transformers 4.34.0
  • Pytorch 2.0.1+cu118
  • Datasets 2.14.5
  • Tokenizers 0.14.0
Downloads last month
14
Inference API
Drag image file here or click to browse from your device
This model can be loaded on Inference API (serverless).

Finetuned from

Space using LucyintheSky/pose-estimation-front-side-back 1