jwhj
/

Qwen2.5-Math-1.5B-OREO

Model card Files Files and versions Community

jwhj commited on Jan 20

Commit

97ef421

·

verified ·

1 Parent(s): ab53ac8

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -4,7 +4,7 @@ Source code for [Offline Reinforcement Learning for LLM Multi-Step Reasoning](ht
 Model: [Policy](https://huggingface.co/jwhj/Qwen2.5-Math-1.5B-OREO) | [Value](https://huggingface.co/jwhj/Qwen2.5-Math-1.5B-OREO-Value)
-<img src="./OREO.png" alt="Image description" width="50%" />
 # Installation

 Model: [Policy](https://huggingface.co/jwhj/Qwen2.5-Math-1.5B-OREO) | [Value](https://huggingface.co/jwhj/Qwen2.5-Math-1.5B-OREO-Value)
+<img src="https://raw.githubusercontent.com/jwhj/OREO/refs/heads/main/OREO.png" alt="Image description" width="50%" />
 # Installation