nabeelshan/rlhf-gpt2-pipeline

Text Generation

reinforcement-learning

instruction-tuning

rlhf-gpt2-pipeline/ppo_aligned_final/merges.txt

Nabeel Shan

Add tokenizer files

b461de7 2 months ago

456 kB
