annaovesnaatatt
/

martin-arguments

Feature Extraction

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

annaovesnaatatt commited on Sep 12, 2023

Commit

6a5a3f6

•

1 Parent(s): 4bf9fce

Create README.md

Files changed (1) hide show

README.md +1 -0

README.md ADDED Viewed

	@@ -0,0 +1 @@


1	+ Testig reward model for RLHF on 1000 examples from [Anthropic/hh-rlhf](https://huggingface.co/datasets/Anthropic/hh-rlhf).