Commit History
complete almost 78e43a9
complete almost 04b615f
pin vllm<=0.18.0 to fix TRL importance-sampling ratio collapse 8464e70
Drop max_prompt_length from GRPOConfig (removed in newer TRL) 1ecce94
Add GRPO training scaffolding (Module 5 rollout_func pattern) + plot artifacts caea08d
Merge branch 'main' of https://github.com/Hard007ik/ShopManagerEng d11ffe8
shop manage eng phase 2 first 048f186
Add LICENSE file 3b9e063 unverified
Hardik Makwana committed on
Delete Apache License from LICENSE file cb8199f unverified
Hardik Makwana committed on
Include Apache License 2.0 2348f11 unverified
Hardik Makwana committed on