dshin/flan-t5-ppo-user-f-batch-size-8-epoch-2-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 7
dshin/flan-t5-ppo-user-h-batch-size-8-epoch-2-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 7
dshin/flan-t5-ppo-user-f-batch-size-8-epoch-3-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 6
dshin/flan-t5-ppo-user-e-batch-size-8-epoch-2-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 7
dshin/flan-t5-ppo-user-h-batch-size-8-epoch-3-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 7
dshin/flan-t5-ppo-user-f-batch-size-8-epoch-4-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 7
dshin/flan-t5-ppo-user-e-batch-size-8-epoch-3-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 7
dshin/flan-t5-ppo-user-h-batch-size-8-epoch-4-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 7
dshin/flan-t5-ppo-user-e-batch-size-8-epoch-4-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 7
Zekunli/flan-t5-large-extraction-cnndm_8000-all-loss-ep10 Text2Text Generation • Updated Mar 13, 2023 • 10