A LoRA-based implementation of AlpacaFarm RLHF PPO
More details in https://github.com/SimengSun/alpaca_farm_lora
- Downloads last month
- 0
Unable to determine this model's library. Check the
docs
.
A LoRA-based implementation of AlpacaFarm RLHF PPO
More details in https://github.com/SimengSun/alpaca_farm_lora