amirabdullah19852020/gpt-neo-125m_utility_reward • Reinforcement Learning • Updated Feb 10, 2024 • 8
amirabdullah19852020/pythia-70m_sentiment_reward • Reinforcement Learning • Updated Feb 10, 2024 • 10
amirabdullah19852020/pythia-160m_sentiment_reward • Reinforcement Learning • Updated Feb 10, 2024 • 8
amirabdullah19852020/gpt-neo-125m_sentiment_reward • Reinforcement Learning • Updated Feb 10, 2024 • 8
amirabdullah19852020/pythia-160m_utility_reward • Reinforcement Learning • Updated Feb 10, 2024 • 8
amirabdullah19852020/pythia-70m_utility_reward • Reinforcement Learning • Updated Feb 10, 2024 • 14