Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
Ashutosh Baheti
abaheti95
Follow
0 followers
·
2 following
https://abaheti95.github.io/
wat_the_fun
abaheti95
AI & ML interests
Reinforcement Learning, Open-domain Dialog Systems, Chatbots
Organizations
Papers
1
arxiv:
2305.14718
models
4
Sort: Recently updated
abaheti95/dpo_qlora_hh
Updated
Oct 4, 2023
•
1
abaheti95/a_lol_kl_good_prioirty_qlora_hh
Updated
Oct 4, 2023
abaheti95/a_lol_good_prioirty_qlora_hh
Updated
Oct 4, 2023
abaheti95/a_lol_seq_good_prioirty_qlora_hh
Updated
Oct 4, 2023
datasets
None public yet