libertininick
commited on
Commit
•
16c88aa
1
Parent(s):
51a6401
Upload folder using huggingface_hub
Browse files- README.md +49 -0
- eval_results.json +9 -0
- model.joblib +3 -0
- public.pem +3 -0
- signature.txt +1 -0
README.md
ADDED
@@ -0,0 +1,49 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
tags:
|
3 |
+
- FrozenLake-v1-4x4-slippery
|
4 |
+
- q-learning
|
5 |
+
- reinforcement-learning
|
6 |
+
- custom-implementation
|
7 |
+
model-index:
|
8 |
+
- name: q-table-frozen-lake
|
9 |
+
results:
|
10 |
+
- task:
|
11 |
+
type: reinforcement-learning
|
12 |
+
name: reinforcement-learning
|
13 |
+
dataset:
|
14 |
+
name: FrozenLake-v1-4x4-slippery
|
15 |
+
type: FrozenLake-v1-4x4-slippery
|
16 |
+
metrics:
|
17 |
+
- type: mean_reward
|
18 |
+
value: 0.75 +/- 0.43
|
19 |
+
name: mean_reward
|
20 |
+
verified: false
|
21 |
+
---
|
22 |
+
|
23 |
+
# **Q-Learning** Agent playing **FrozenLake-v1**
|
24 |
+
|
25 |
+
This is a trained **Q-Learning** agent playing **FrozenLake-v1**.
|
26 |
+
|
27 |
+
## Usage
|
28 |
+
|
29 |
+
```python
|
30 |
+
import gymnasium as gym
|
31 |
+
from huggingface_hub import snapshot_download
|
32 |
+
from r2seedo.io import load_n_verify_model
|
33 |
+
|
34 |
+
# Download model snapshot from Hugging Face Hub
|
35 |
+
repo_local_path = snapshot_download(
|
36 |
+
repo_id="libertininick/q-table-frozen-lake",
|
37 |
+
local_dir="path/to/download",
|
38 |
+
)
|
39 |
+
|
40 |
+
# Load the model from the snapshot
|
41 |
+
model = load_n_verify_model(repo_local_path)
|
42 |
+
|
43 |
+
# Create the environment
|
44 |
+
env = env_slippery = gym.make(
|
45 |
+
id='FrozenLake-v1',
|
46 |
+
map_name='4x4',
|
47 |
+
is_slippery=True,
|
48 |
+
)
|
49 |
+
```
|
eval_results.json
ADDED
@@ -0,0 +1,9 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"env_id": "FrozenLake-v1",
|
3 |
+
"map_name": "4x4",
|
4 |
+
"is_slippery": true,
|
5 |
+
"num_episodes": 100,
|
6 |
+
"mean_reward": 0.75,
|
7 |
+
"std_reward": 0.4330127018922193,
|
8 |
+
"eval_datetime": "2024-04-07T13:29:45.128562"
|
9 |
+
}
|
model.joblib
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:cef8c3672a107117f84b09cbfd4aec86ddf0a50141513b447147d309123f977f
|
3 |
+
size 1107
|
public.pem
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
-----BEGIN PUBLIC KEY-----
|
2 |
+
MCowBQYDK2VwAyEA7cLQ3Lj0Gjq/yJAVJg65ndUeHIuW6S2HVmRlXUX7TGA=
|
3 |
+
-----END PUBLIC KEY-----
|
signature.txt
ADDED
@@ -0,0 +1 @@
|
|
|
|
|
1 |
+
cbe2b968d6638e685d79dc8388b41d4e3911d31f318c35321275943fe23593d7d74533b8e7633b2903242261d473986ba4060ff5dee28aaa519d3b1d8b0a8a05
|