namish10 committed on
Commit
012f151
·
verified ·
1 Parent(s): ec86d65

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +35 -0
README.md ADDED
@@ -0,0 +1,35 @@
+ # ContextFlow RL Doubt Predictor
+
+ ## Overview
+ This is the trained reinforcement learning model for the ContextFlow doubt prediction system.
+
+ ## Model Details
+ - **Algorithm**: GRPO (Group Relative Policy Optimization) + Q-Learning
+ - **State Dimension**: 64 features
+ - **Action Dimension**: 10 doubt prediction actions
+ - **Policy Version**: 50
+ - **Training Samples**: 200
+
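The model details above name GRPO as part of the training algorithm. As a rough illustration only, the sketch below shows the group-relative advantage normalization that GRPO is known for: rewards sampled within one group are centered on the group mean and scaled by the group standard deviation. The function name, the `eps` term, and the reward values are assumptions made for this example; how this model combines GRPO with Q-Learning is not described in the README.

```python
import statistics

NUM_ACTIONS = 10  # action dimension stated in the model details


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group: (r - mean) / (std + eps)."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)  # population standard deviation of the group
    return [(r - mean) / (std + eps) for r in rewards]


# Hypothetical rewards for four actions sampled in the same state
rewards = [1.0, 0.0, 0.5, 0.5]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Actions that scored above the group average receive a positive advantage and those below receive a negative one, so no separate value network is needed to provide a baseline.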
+ ## Usage
+ ```python
+ import pickle
+ from huggingface_hub import hf_hub_download
+
+ # Download the checkpoint from the Hugging Face Hub
+ path = hf_hub_download(repo_id='namish10/contextflow-rl', filename='checkpoint.pkl')
+
+ # Load the checkpoint (only unpickle files from sources you trust)
+ with open(path, 'rb') as f:
+     checkpoint = pickle.load(f)
+
+ print(f"Policy version: {checkpoint.policy_version}")
+ ```
+
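The usage snippet reads `checkpoint.policy_version` as an attribute, which works only if the checkpoint was pickled as an object, and unpickling an object requires its class to be importable in the loading environment. Since the README does not document the checkpoint layout, the sketch below shows one defensive way to read a field whether the loaded value is a dict or an object. The helper `get_field` and the stand-in checkpoints are hypothetical and are not part of this repository.

```python
from types import SimpleNamespace


def get_field(checkpoint, name, default=None):
    """Read a field from a checkpoint stored either as a dict or as an object."""
    if isinstance(checkpoint, dict):
        return checkpoint.get(name, default)
    return getattr(checkpoint, name, default)


# Stand-ins for a loaded checkpoint (hypothetical; the real layout is not documented)
as_dict = {"policy_version": 50}
as_object = SimpleNamespace(policy_version=50)

print(f"Policy version: {get_field(as_dict, 'policy_version')}")
print(f"Policy version: {get_field(as_object, 'policy_version')}")
```

With a real checkpoint, the same call would be `get_field(checkpoint, "policy_version")` after the `pickle.load` step.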
+ ## Citation
+ ```bibtex
+ @software{contextflow_rl,
+   title={ContextFlow RL Doubt Predictor},
+   author={ContextFlow Team},
+   year={2026}
+ }
+ ```