pi05-organize-tools-trossen-ai

A Ο€β‚€.β‚… (pi0.5) policy fine-tuned with openpi (Physical Intelligence's JAX trainer) for the organize tools task on the Trossen AI Stationary bimanual platform.

Framework: openpi (JAX/XLA) β€” this is not a LeRobot/PyTorch checkpoint. Load it with the openpi runtime, not lerobot.

Inputs

  • Dataset: TrossenRoboticsCommunity/trossen_ai_stationary_organize_tools β€” LeRobot v2 format (openpi requires v2; v3 is rejected).
  • Base model: Ο€β‚€.β‚… (pi0.5) pretrained checkpoint from Physical Intelligence, LoRA fine-tuned.
  • Training code: TrossenRobotics/openpi
  • Hardware: Trossen AI Stationary (bimanual), trained locally on an RTX 5090.
  • Action/state space: 14-dim (per assets/trossen/norm_stats.json).

Configuration

  • Steps: 100,000
  • Batch size: 8
  • Fine-tuning: LoRA
  • Norm stats: assets/trossen/norm_stats.json (auto-computed at train time).

How it was trained

Fine-tuned locally on an RTX 5090 using the openpi JAX trainer (LoRA, batch size 8, 100K steps). openpi's first-class Ο€ implementation generally outperforms LeRobot's PyTorch Ο€ policies. Exported as the final orbax checkpoint (params/ = weights, train_state/ = optimizer state for resuming).

How to use

git clone https://github.com/TrossenRobotics/openpi
# follow openpi serving/inference instructions; point the policy at this repo's params/
# recommended inference rate: 50 Hz

Outputs

  • Best checkpoint: step 100,000
  • Checkpoint exported: 2025-11-05

Evaluation

Versions

  • r1 (2025-11-05) β€” initial openpi checkpoint export.
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Collection including TrossenRoboticsCommunity/pi05-organize-tools-trossen-ai