pi05-base-mini-chair-openpi

Full fine-tune of Physical Intelligence pi0.5 (pi05_base) trained with openpi (PyTorch backend) on JonathanGiegold/mini-chair-base-put-only.

Files

  • model.safetensors — fine-tuned weights (final step 12,963)
  • metadata.pt — openpi checkpoint metadata
  • assets/JonathanGiegold/mini-chair-base-put-only-v21/norm_stats.json — required state/action normalization stats for inference
  • training_summary.png — training curves (loss / grad-norm / LR)

Optimizer state is intentionally omitted (inference/deploy only).

Training curves

training summary

Raw per-step loss shows spikes from flow-matching noise-level sampling; the smoothed trend falls 0.31 -> ~0.035 and is flat at the end. Gradient-norm spikes coincide with those steps but were absorbed by gradient clipping (norm 1.0, fired on ~2% of steps), so the optimizer stayed stable throughout.

Model

Architecture pi0.5 (PaliGemma gemma_2b + action expert gemma_300m)
Action horizon 16
State / action dim 14 (padded to 32)
Precision bfloat16
Cameras (3) context, wrist_left, wrist_right

Data

Dataset JonathanGiegold/mini-chair-base-put-only (LeRobot v2.1 on-box conversion)
Transforms repack + delta joint actions (mask [Tx6, F, Tx6, F]), prompt_from_task
Normalization quantile (norm_stats computed on this dataset)

Hyperparameters

Steps 12,964 (5 epochs)
Batch size 32
Num workers 8
Seed 42
EMA disabled
Optimizer AdamW
  betas (b1, b2) 0.9, 0.95
  eps 1e-8
  weight_decay 1e-10
  grad clip norm 1.0
LR schedule cosine decay with warmup
  warmup steps 1,000
  peak LR 2.5e-5
  decay steps 12,964
  floor LR 2.5e-6

Run

Hardware 1x H100 80GB
Wall time 11h 29m (3.19 s/step)
VRAM ~41.6 GB
Final smoothed loss ~0.035
Downloads last month

-

Downloads are not tracked for this model. How to track
Safetensors
Model size
4B params
Tensor type
F32
·
BF16
·
Video Preview
loading