initial model
Browse filesThis view is limited to 50 files because it contains too many changes.
See raw diff
- README.md +138 -0
- results/tau_agent_A1_2M/Tau-A1-2M.onnx +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1199744.onnx +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1199744.pt +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1299958.onnx +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1299958.pt +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1399744.onnx +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1399744.pt +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1499776.onnx +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1499776.pt +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1599808.onnx +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1599808.pt +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1699840.onnx +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1699840.pt +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1799808.onnx +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1799808.pt +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1899840.onnx +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1899840.pt +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1999872.onnx +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-1999872.pt +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-2005504.onnx +3 -0
- results/tau_agent_A1_2M/checkpoints/TauAgent-2005504.pt +3 -0
- results/tau_agent_A1_2M/checkpoints/checkpoint.pt +3 -0
- results/tau_agent_A1_2M/configuration.yaml +93 -0
- results/tau_agent_A3_1M/Tau-A3-1M.onnx +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-1001575.onnx +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-1001575.pt +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-12324.onnx +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-12324.pt +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-199903.onnx +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-199903.pt +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-28282.onnx +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-28282.pt +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-299879.onnx +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-299879.pt +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-399831.onnx +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-399831.pt +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-499989.onnx +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-499989.pt +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-599755.onnx +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-599755.pt +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-699907.onnx +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-699907.pt +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-799975.onnx +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-799975.pt +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-899787.onnx +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-899787.pt +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-999987.onnx +3 -0
- results/tau_agent_A3_1M/checkpoints/TauAgent-999987.pt +3 -0
- results/tau_agent_A3_1M/checkpoints/checkpoint.pt +3 -0
README.md
CHANGED
@@ -1,3 +1,141 @@
|
|
1 |
---
|
2 |
license: mit
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
---
|
4 |
+
|
5 |
+
# Tau LLM Unity ML Agents Project
|
6 |
+
|
7 |
+
Welcome to the Tau LLM Unity ML Agents Project repository! This project focuses on training reinforcement learning agents using Unity ML-Agents and the PPO algorithm. Our goal is to optimize the performance of the agents through various configurations and training runs.
|
8 |
+
|
9 |
+
## Project Overview
|
10 |
+
|
11 |
+
This repository contains the code and configurations for training agents in a Unity environment using the Proximal Policy Optimization (PPO) algorithm. The agents are designed to learn and adapt to their environment, improving their performance over time.
|
12 |
+
|
13 |
+
### Key Features
|
14 |
+
|
15 |
+
- **Reinforcement Learning**: Utilizes the PPO algorithm for training agents.
|
16 |
+
- **Unity ML-Agents**: Integrates with Unity ML-Agents for a seamless training experience.
|
17 |
+
- **Custom Reward Functions**: Implements gradient-based reward functions for nuanced feedback.
|
18 |
+
- **Memory Networks**: Incorporates memory networks to handle temporal dependencies.
|
19 |
+
- **TensorBoard Integration**: Monitors training progress and performance using TensorBoard.
|
20 |
+
|
21 |
+
## Configuration
|
22 |
+
|
23 |
+
Below is the configuration used for training the agents:
|
24 |
+
|
25 |
+
```yaml
|
26 |
+
behaviors:
|
27 |
+
TauAgent:
|
28 |
+
trainer_type: ppo
|
29 |
+
hyperparameters:
|
30 |
+
batch_size: 256
|
31 |
+
buffer_size: 4096
|
32 |
+
learning_rate: 0.00003
|
33 |
+
beta: 0.005
|
34 |
+
epsilon: 0.2
|
35 |
+
lambd: 0.95
|
36 |
+
num_epoch: 10
|
37 |
+
learning_rate_schedule: linear
|
38 |
+
network_settings:
|
39 |
+
normalize: true
|
40 |
+
hidden_units: 256
|
41 |
+
num_layers: 4
|
42 |
+
vis_encode_type: simple
|
43 |
+
memory:
|
44 |
+
memory_size: 256
|
45 |
+
sequence_length: 256
|
46 |
+
num_layers: 4
|
47 |
+
reward_signals:
|
48 |
+
extrinsic:
|
49 |
+
gamma: 0.99
|
50 |
+
strength: 1.0
|
51 |
+
curiosity:
|
52 |
+
gamma: 0.995
|
53 |
+
strength: 0.1
|
54 |
+
network_settings:
|
55 |
+
normalize: true
|
56 |
+
hidden_units: 256
|
57 |
+
num_layers: 4
|
58 |
+
learning_rate: 0.00003
|
59 |
+
keep_checkpoints: 10
|
60 |
+
checkpoint_interval: 100000
|
61 |
+
threaded: true
|
62 |
+
max_steps: 3000000
|
63 |
+
time_horizon: 256
|
64 |
+
summary_freq: 10000
|
65 |
+
```
|
66 |
+
|
67 |
+
## Model Naming Convention
|
68 |
+
|
69 |
+
The models in this repository follow the naming convention `Tau_<series>_<max_steps>`. This helps in easily identifying the series and the number of training steps for each model.
|
70 |
+
|
71 |
+
## Getting Started
|
72 |
+
|
73 |
+
### Prerequisites
|
74 |
+
|
75 |
+
- Unity 6
|
76 |
+
- Unity ML-Agents Toolkit
|
77 |
+
- Python 3.10.11
|
78 |
+
- PyTorch
|
79 |
+
- Transformers
|
80 |
+
|
81 |
+
### Installation
|
82 |
+
|
83 |
+
1. Clone the repository:
|
84 |
+
```bash
|
85 |
+
git clone https://github.com/yourusername/tau-llm-unity-ml-agents.git
|
86 |
+
cd tau-llm-unity-ml-agents
|
87 |
+
```
|
88 |
+
|
89 |
+
2. Install the required Python packages:
|
90 |
+
```bash
|
91 |
+
pip install -r requirements.txt
|
92 |
+
```
|
93 |
+
|
94 |
+
3. Open the Unity project:
|
95 |
+
- Launch Unity Hub and open the project folder.
|
96 |
+
|
97 |
+
### Training the Agent
|
98 |
+
|
99 |
+
To start training the agent, run the following command:
|
100 |
+
```bash
|
101 |
+
mlagents-learn config/trainer_config.yaml --run-id=run1
|
102 |
+
```
|
103 |
+
|
104 |
+
### Monitoring Training
|
105 |
+
|
106 |
+
You can monitor the training progress using TensorBoard:
|
107 |
+
```bash
|
108 |
+
tensorboard --logdir=results --port=6006
|
109 |
+
```
|
110 |
+
|
111 |
+
## Results
|
112 |
+
|
113 |
+
The training results, including the average reward and cumulative reward, can be visualized using TensorBoard. The graphs below show the performance of the agent over time:
|
114 |
+
|
115 |
+
![Average Reward](path/to/average_reward.png)
|
116 |
+
![Cumulative Reward](path/to/cumulative_reward.png)
|
117 |
+
|
118 |
+
## Citation
|
119 |
+
|
120 |
+
If you use this project in your research, please cite it as follows:
|
121 |
+
|
122 |
+
```bibtex
|
123 |
+
@misc{Tau,
|
124 |
+
author = {K. Rawson},
|
125 |
+
title = {Tau LLM Unity ML Agents Project},
|
126 |
+
year = {2024},
|
127 |
+
publisher = {GitHub},
|
128 |
+
journal = {GitHub repository},
|
129 |
+
howpublished = {\url{https://github.com/p3nGu1nZz/Tau}},
|
130 |
+
}
|
131 |
+
```
|
132 |
+
|
133 |
+
## License
|
134 |
+
|
135 |
+
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
|
136 |
+
|
137 |
+
## Acknowledgments
|
138 |
+
|
139 |
+
- Unity ML-Agents Toolkit
|
140 |
+
- TensorFlow and PyTorch communities
|
141 |
+
- Hugging Face for hosting the model repository
|
results/tau_agent_A1_2M/Tau-A1-2M.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:08931e19cffa93c14fed86e9bb88278424715303928d4761bf3dcc257fdde73d
|
3 |
+
size 2186395
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1199744.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b24d7a70f3f708362ccd3b35ccbf309d81696c379a5c2111810676ffda6c9c3d
|
3 |
+
size 2186395
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1199744.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4f556a11f0fea58e2cc679cf2f9ad6e86403425ced6496f25188080f8f29bc8e
|
3 |
+
size 15534256
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1299958.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:bb6f5d1ee696b00963d7cb00a10b924fdf123f5eb46b618ba006117c7d843919
|
3 |
+
size 2186395
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1299958.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0c0244046f46126c14c82f60ef46b799325fb0f35205cec04ccec9141784a93c
|
3 |
+
size 15534256
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1399744.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9d0b419212303e89f05b6d735d04bb392166df4dd491fd0036ea2fce40a3abd6
|
3 |
+
size 2186395
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1399744.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c120a1fa8108b6b9558d9667fe949808b3e16394d499693e20392d2ea1f6c28e
|
3 |
+
size 15534256
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1499776.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f424ea2d9e633119050d04d95e1079bee5e8c3a1a9fee31282ca95855bd7d885
|
3 |
+
size 2186395
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1499776.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:912696d11a6837fc71783e311bed38195e1dda57fe4123a64141db5e96083ba3
|
3 |
+
size 15534256
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1599808.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:56f777fafa9cc0919950a0231834f75954a17c4e07b9bcf7c6b2b3dbc5426c41
|
3 |
+
size 2186395
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1599808.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f311e89bfe1d1fa8a578efe91d9ceafece8fa50a349a0721013634ff0e664ef9
|
3 |
+
size 15534256
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1699840.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:90533537979d9abeb815e499a777474c4c0c66e3068c3e9de39c17512f6cd35c
|
3 |
+
size 2186395
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1699840.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:09f5ead3bb039e49e211a2ca8d7afa788223c1ed2b9883342efc46ef66799982
|
3 |
+
size 15534256
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1799808.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:de1ea3ba5c8d90ce7be467ee2871441c2dbb220e8761d14b8c3d70439bc9ad7b
|
3 |
+
size 2186395
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1799808.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:32def0f45b71e638ddd1ad302b620df2526e4099b0bc65c9f4b1ec7a2737b092
|
3 |
+
size 15534256
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1899840.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9677e46c8e368b0e5f5a3aa982ca3949bf1f4489fa58ae55cea8801e56563aba
|
3 |
+
size 2186395
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1899840.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a9cacd7cdf76e019c35c7618a719b7c58d5b468f7a47d136dc4d1dcea7ede6b7
|
3 |
+
size 15534256
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1999872.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5eb6959372271405f646cec449cddcc6d19f604a7d02b5422b02aa7035aa9906
|
3 |
+
size 2186395
|
results/tau_agent_A1_2M/checkpoints/TauAgent-1999872.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:cfc7b6a20afe2601acf9523584f01e56a6a62f274dff5066b5b53ae4621953aa
|
3 |
+
size 15534256
|
results/tau_agent_A1_2M/checkpoints/TauAgent-2005504.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:08931e19cffa93c14fed86e9bb88278424715303928d4761bf3dcc257fdde73d
|
3 |
+
size 2186395
|
results/tau_agent_A1_2M/checkpoints/TauAgent-2005504.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:260196c3491aca9156c57f3d29bba9cb40b9655acd53407f09075f191df035ae
|
3 |
+
size 15534256
|
results/tau_agent_A1_2M/checkpoints/checkpoint.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:37080da1574efbcf39b5938261f36738c44437f2ceb72501058d86a7ffe8d386
|
3 |
+
size 15533332
|
results/tau_agent_A1_2M/configuration.yaml
ADDED
@@ -0,0 +1,93 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
default_settings: null
|
2 |
+
behaviors:
|
3 |
+
TauAgent:
|
4 |
+
trainer_type: ppo
|
5 |
+
hyperparameters:
|
6 |
+
batch_size: 256
|
7 |
+
buffer_size: 4096
|
8 |
+
learning_rate: 3.0e-05
|
9 |
+
beta: 0.005
|
10 |
+
epsilon: 0.2
|
11 |
+
lambd: 0.95
|
12 |
+
num_epoch: 7
|
13 |
+
shared_critic: false
|
14 |
+
learning_rate_schedule: linear
|
15 |
+
beta_schedule: linear
|
16 |
+
epsilon_schedule: linear
|
17 |
+
checkpoint_interval: 100000
|
18 |
+
network_settings:
|
19 |
+
normalize: true
|
20 |
+
hidden_units: 256
|
21 |
+
num_layers: 4
|
22 |
+
vis_encode_type: simple
|
23 |
+
memory:
|
24 |
+
sequence_length: 256
|
25 |
+
memory_size: 256
|
26 |
+
goal_conditioning_type: hyper
|
27 |
+
deterministic: false
|
28 |
+
reward_signals:
|
29 |
+
extrinsic:
|
30 |
+
gamma: 0.99
|
31 |
+
strength: 1.0
|
32 |
+
network_settings:
|
33 |
+
normalize: false
|
34 |
+
hidden_units: 128
|
35 |
+
num_layers: 2
|
36 |
+
vis_encode_type: simple
|
37 |
+
memory: null
|
38 |
+
goal_conditioning_type: hyper
|
39 |
+
deterministic: false
|
40 |
+
curiosity:
|
41 |
+
gamma: 0.995
|
42 |
+
strength: 0.1
|
43 |
+
network_settings:
|
44 |
+
normalize: true
|
45 |
+
hidden_units: 256
|
46 |
+
num_layers: 4
|
47 |
+
vis_encode_type: simple
|
48 |
+
memory: null
|
49 |
+
goal_conditioning_type: hyper
|
50 |
+
deterministic: false
|
51 |
+
learning_rate: 0.0003
|
52 |
+
encoding_size: null
|
53 |
+
init_path: null
|
54 |
+
keep_checkpoints: 10
|
55 |
+
even_checkpoints: false
|
56 |
+
max_steps: 2000000
|
57 |
+
time_horizon: 256
|
58 |
+
summary_freq: 10000
|
59 |
+
threaded: true
|
60 |
+
self_play: null
|
61 |
+
behavioral_cloning: null
|
62 |
+
env_settings:
|
63 |
+
env_path: .\Build
|
64 |
+
env_args: null
|
65 |
+
base_port: 5005
|
66 |
+
num_envs: 1
|
67 |
+
num_areas: 1
|
68 |
+
timeout_wait: 300
|
69 |
+
seed: -1
|
70 |
+
max_lifetime_restarts: 10
|
71 |
+
restarts_rate_limit_n: 1
|
72 |
+
restarts_rate_limit_period_s: 60
|
73 |
+
engine_settings:
|
74 |
+
width: 84
|
75 |
+
height: 84
|
76 |
+
quality_level: 5
|
77 |
+
time_scale: 20
|
78 |
+
target_frame_rate: -1
|
79 |
+
capture_frame_rate: 60
|
80 |
+
no_graphics: false
|
81 |
+
environment_parameters: null
|
82 |
+
checkpoint_settings:
|
83 |
+
run_id: tau_agent_ppo_A1
|
84 |
+
initialize_from: null
|
85 |
+
load_model: false
|
86 |
+
resume: false
|
87 |
+
force: true
|
88 |
+
train_model: false
|
89 |
+
inference: false
|
90 |
+
results_dir: results
|
91 |
+
torch_settings:
|
92 |
+
device: cuda
|
93 |
+
debug: false
|
results/tau_agent_A3_1M/Tau-A3-1M.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9c5d20133e25c7b1f17c9fe045373f12448adfba0a341d2ce0ab683dc0a505e9
|
3 |
+
size 1983173
|
results/tau_agent_A3_1M/checkpoints/TauAgent-1001575.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9c5d20133e25c7b1f17c9fe045373f12448adfba0a341d2ce0ab683dc0a505e9
|
3 |
+
size 1983173
|
results/tau_agent_A3_1M/checkpoints/TauAgent-1001575.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2c1e3f84038d1be138512d0c2b168f8fed07c9e123765c992dbb88ce19ad9729
|
3 |
+
size 23269214
|
results/tau_agent_A3_1M/checkpoints/TauAgent-12324.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:eaf958d6b87edcf7ebcfbfe28296a04bbdb8f8a5fad6aa1b8f23bcf747cd89d1
|
3 |
+
size 1983173
|
results/tau_agent_A3_1M/checkpoints/TauAgent-12324.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c98eb2737900135b3e52175a3b2c94a6d3bd3c88d1b85667f32af788be5b6075
|
3 |
+
size 23268710
|
results/tau_agent_A3_1M/checkpoints/TauAgent-199903.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4f024ccd7748fde0b0b7e8fa59d06951bf15fcefb555457746ee03d2f7a90bbc
|
3 |
+
size 1983173
|
results/tau_agent_A3_1M/checkpoints/TauAgent-199903.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:1b77bbf92946f07c4ea7dba982ce78ad57c00ee5a9bca7edd7271d7918bbbda7
|
3 |
+
size 23268962
|
results/tau_agent_A3_1M/checkpoints/TauAgent-28282.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:cd48d742cff7e006aa71e8dcb2c31c31c74bc9ae03d54b76559c3b3dd8745c61
|
3 |
+
size 1983173
|
results/tau_agent_A3_1M/checkpoints/TauAgent-28282.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8513da1f1353113fd99c51dd85e61f57e4dfb4b8d31b644d0f7af73ac4c3b47e
|
3 |
+
size 23268710
|
results/tau_agent_A3_1M/checkpoints/TauAgent-299879.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3ff77dadbb9a1fafc12a4946bdd5c63f4e6f43fa1763cff4dd0929fa3d499a2b
|
3 |
+
size 1983173
|
results/tau_agent_A3_1M/checkpoints/TauAgent-299879.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b03039f2bdfbaa4625a6f13b8580c72c43bef5bfd9025e9f915e2bf3a51b8dfa
|
3 |
+
size 23268962
|
results/tau_agent_A3_1M/checkpoints/TauAgent-399831.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f0f215196437e7d6a5e1757fb59a21b4ea1cda1a3b7af937a427659a375b9a0d
|
3 |
+
size 1983173
|
results/tau_agent_A3_1M/checkpoints/TauAgent-399831.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a0bd14a4a2ab99d335f034d318f7c372770c62ea615a4e2804bd8994de6b8050
|
3 |
+
size 23268962
|
results/tau_agent_A3_1M/checkpoints/TauAgent-499989.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:187b76bb76c34cba0dac85f49f58a7ea5bde3e4a23f9bfc899e2023cdbba3e70
|
3 |
+
size 1983173
|
results/tau_agent_A3_1M/checkpoints/TauAgent-499989.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:782a3ce94817b5f93f82f245128eabd0e5944a6f1ce158921c7cb500cc53c1d2
|
3 |
+
size 23268962
|
results/tau_agent_A3_1M/checkpoints/TauAgent-599755.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:545ee682b7953e6ed53cb351c7e17d16fcd665bf5d9a9f3cb1e32452bceaa760
|
3 |
+
size 1983173
|
results/tau_agent_A3_1M/checkpoints/TauAgent-599755.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:53a8c5e1897e7aa448878fc936a0b0968addbffe7e0b73cfa17999fd75de4aa6
|
3 |
+
size 23268962
|
results/tau_agent_A3_1M/checkpoints/TauAgent-699907.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f5f5612c9190f8d25f0b2e1ae27d72aca029f7962cd1d1c9c2605296389979b4
|
3 |
+
size 1983173
|
results/tau_agent_A3_1M/checkpoints/TauAgent-699907.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:01a48a1e97204e86b79e57dde69e65a2e156df2159595e7583f69355a2726e41
|
3 |
+
size 23268962
|
results/tau_agent_A3_1M/checkpoints/TauAgent-799975.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3946bf8a6a03014256bedd19b4965ec552903ba24a711ac5b3ae5bfd96e18f82
|
3 |
+
size 1983173
|
results/tau_agent_A3_1M/checkpoints/TauAgent-799975.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:50854837ea697627700f4b23582acd5083cbc41be015f5177bec6fa417a75141
|
3 |
+
size 23268962
|
results/tau_agent_A3_1M/checkpoints/TauAgent-899787.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3834f38a6da9d23a037c64da280ad1b87c77d9228ffc461ccc7c312a0f147c38
|
3 |
+
size 1983173
|
results/tau_agent_A3_1M/checkpoints/TauAgent-899787.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5c4867e2a04b63b7412b26745df14b83b2a4465dfcd36b37018158fa2b685661
|
3 |
+
size 23268962
|
results/tau_agent_A3_1M/checkpoints/TauAgent-999987.onnx
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3f599e796aa05601ba2a7a5c54f4c702594601fa5c5d3861f60964129a4d4109
|
3 |
+
size 1983173
|
results/tau_agent_A3_1M/checkpoints/TauAgent-999987.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5d7607c7b79859970d0162c13f77a89730ac548341ae0438001612c2ab0745ab
|
3 |
+
size 23268962
|
results/tau_agent_A3_1M/checkpoints/checkpoint.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:94b764243f51b496724081db5f3d7a11d270dc706ec1c5635abfc81dc20cfb0b
|
3 |
+
size 23267702
|