Ashutosh Baheti

abaheti95

https://abaheti95.github.io/

AI & ML interests

Reinforcement Learning, Open-domain Dialog Systems, Chatbots

Papers 1

arxiv:2305.14718

Models 4

abaheti95/dpo_qlora_hh

Updated Oct 4, 2023 • 1

abaheti95/a_lol_kl_good_prioirty_qlora_hh

Updated Oct 4, 2023

abaheti95/a_lol_good_prioirty_qlora_hh

Updated Oct 4, 2023

abaheti95/a_lol_seq_good_prioirty_qlora_hh

Updated Oct 4, 2023

Datasets

None public yet