Fire Boy MiniCPM-V 4.6 VLA Router Artifacts
Creator And Submission Links
| Item | Link |
|---|---|
| Creator | Sanjay Prasad H S (sanjuhs) |
| GitHub repo | sanjuhs/build-small-hackathon-v1 |
| Canonical HF Space repo | build-small-hackathon/toy-room-v3 |
| Canonical live Space | https://build-small-hackathon-toy-room-v3.hf.space/toy-v3 |
| Personal HF Space mirror | sanjuhs/toy-room-v3 |
| Personal live Space mirror | https://sanjuhs-toy-room-v3.hf.space/toy-v3 |
| YouTube demo | https://youtu.be/rwRIXLwaLmI |
| Canonical model repo | build-small-hackathon/fireboy-minicpm-v-4-6-vla |
| Personal model mirror | sanjuhs/fireboy-minicpm-v-4-6-vla |
This repository backs the Toy Room v3 embodied-action demo:
- Space: https://build-small-hackathon-toy-room-v3.hf.space/toy-v3
- GitHub: https://github.com/sanjuhs/build-small-hackathon-v1
- YouTube demo: https://youtu.be/rwRIXLwaLmI
- VLA research page: https://build-small-hackathon-toy-room-v3.hf.space/vla-research
- Policy gallery: https://build-small-hackathon-toy-room-v3.hf.space/fireboy-policy-gallery
- Dataset/artifacts: https://huggingface.co/datasets/build-small-hackathon/fireboy-vla-rollout-artifacts
The shipped live route uses MiniCPM-V 4.6 as the vision-language backbone, freezes the backbone, mean-pools the 1024-d vision-language representation, and trains a small skill/parameter head. The router emits a bounded contract:
walk_torun_aroundpick_upfind_and_eat_berry
Toy Room v3 calls this route first through src/vla_router_policy.py, then dispatches the selected skill into the MuJoCo policy registry and retargets the proof rollout into visible Fire Boy movement.
What Is Included
checkpoints/: trained VLA heads, LoRA/action-head experiments, eval JSON, summaries, and embedding caches.runtime-policies/: small MuJoCo policy artifacts needed by the Toy Room v3 runtime.docs/: research notes copied from the source project.
The final judge-facing router currently loads:
checkpoints/fireboy_minicpm_vla_skill_param_head/minicpm_vla_skill_param_head.pt
and dispatches against the skill-parameter rows in:
checkpoints/fireboy_minicpm_vla_skill_param_head/fireboy_vla_skill_params_allskill.jsonl
Runtime Mapping
flowchart LR
Player["Toy Room command"] --> API["/api/pet-action"]
API --> Router["Modal fireboy-vla-router\nMiniCPM-V 4.6"]
Router --> Head["frozen embedding + skill/param head"]
Head --> Dispatch["MuJoCo policy registry"]
Dispatch --> Action["retargeted PET JSON"]
Action --> Toy["Fire Boy moves, picks up, eats berry"]
License
The adapter heads, exported policy artifacts, cards, docs, and generated evidence in this repository are released under the MIT license. The upstream MiniCPM-V base model is not redistributed here and remains governed by its own upstream license and model card.
Model tree for build-small-hackathon/fireboy-minicpm-v-4-6-vla
Base model
openbmb/MiniCPM-V-4.6