blakenp
/

Qwen2.5-1.5B-Policy

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Qwen2.5-1.5B-Policy / vocab.json

blakenp's picture

rlhf_qwen2.5 0.5B

6923b38 verified 5 days ago

history contribute delete

2.78 MB

File too large to display, you can check the raw version instead.