pi05-base-mini-chair-openpi

Full fine-tune of Physical Intelligence pi0.5 (pi05_base) trained with openpi (PyTorch backend) on JonathanGiegold/mini-chair-base-put-only.

Files

model.safetensors — fine-tuned weights (final step 12,963)
metadata.pt — openpi checkpoint metadata
assets/JonathanGiegold/mini-chair-base-put-only-v21/norm_stats.json — required state/action normalization stats for inference
training_summary.png — training curves (loss / grad-norm / LR)

Optimizer state is intentionally omitted (inference/deploy only).

Training curves

Raw per-step loss shows spikes from flow-matching noise-level sampling; the smoothed trend falls 0.31 -> ~0.035 and is flat at the end. Gradient-norm spikes coincide with those steps but were absorbed by gradient clipping (norm 1.0, fired on ~2% of steps), so the optimizer stayed stable throughout.

Model


Architecture	pi0.5 (PaliGemma `gemma_2b` + action expert `gemma_300m`)
Action horizon	16
State / action dim	14 (padded to 32)
Precision	bfloat16
Cameras (3)	context, wrist_left, wrist_right

Data


Dataset	`JonathanGiegold/mini-chair-base-put-only` (LeRobot v2.1 on-box conversion)
Transforms	repack + delta joint actions (mask [Tx6, F, Tx6, F]), prompt_from_task
Normalization	quantile (norm_stats computed on this dataset)

Hyperparameters


Steps	12,964 (5 epochs)
Batch size	32
Num workers	8
Seed	42
EMA	disabled
Optimizer	AdamW
betas (b1, b2)	0.9, 0.95
eps	1e-8
weight_decay	1e-10
grad clip norm	1.0
LR schedule	cosine decay with warmup
warmup steps	1,000
peak LR	2.5e-5
decay steps	12,964
floor LR	2.5e-6

Run


Hardware	1x H100 80GB
Wall time	~~11h 29m (~~3.19 s/step)
VRAM	~41.6 GB
Final smoothed loss	~0.035

Downloads last month: -; Downloads are not tracked for this model. How to track

Safetensors

Model size

4B params

Tensor type

F32

BF16

Video Preview

Robotics