Rauhan
/

Qwen2.5-3B-GRPO-GSM325

Text Generation

reinforcement-learning

mathematical-reasoning

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Qwen2.5-3B-GRPO-GSM325 / vocab.json

Rauhan's picture

Upload tokenizer

85baed1 verified about 1 month ago

history contribute delete

2.78 MB

File too large to display, you can check the raw version instead.