Commit History

updated environment with multi-agent DB integration
c996cea

hard007ik committed on

complete almost
78e43a9

hard007ik committed on

complete almost
04b615f

hard007ik committed on

pin vllm<=0.18.0 to fix TRL importance-sampling ratio collapse
8464e70

hard007ik committed on

Drop max_prompt_length from GRPOConfig (removed in newer TRL)
1ecce94

hard007ik committed on

Add GRPO training scaffolding (Module 5 rollout_func pattern) + plot artifacts
caea08d

hard007ik committed on

Merge branch 'main' of https://github.com/Hard007ik/ShopManagerEng
d11ffe8

hard007ik committed on

shop manage eng phase 2 first
048f186

hard007ik committed on

Add LICENSE file
3b9e063
unverified

Hardik Makwana committed on

Delete Apache License from LICENSE file
cb8199f
unverified

Hardik Makwana committed on

Include Apache License 2.0
2348f11
unverified

Hardik Makwana committed on

solve inference again 2
135f6f5

hard007ik committed on

solve inference again
8927ef0

hard007ik committed on

again inference updated
2f31737

hard007ik committed on

resolving inference
78a47d4

hard007ik committed on

inference updated for 3 task
28cbbf9

hard007ik committed on

yaml updated
35bb9d3

hard007ik committed on

updated server/app.py as per submit validation
5c3c008

hard007ik committed on

updated url in inference.py
b4b27d7

hard007ik committed on

fix: resolve relative import error for Docker deployment
95a7814

hard007ik committed on

feat: jewelry shop RL environment with market, warehouse, and showroom phases
5c6ca01

hard007ik committed on