AndiB93 committed on
Commit
8042066
1 Parent(s): 9ab19b3

Create README.md

  1. README.md +49 -0
README.md ADDED
@@ -0,0 +1,49 @@
+ ---
+ license: apache-2.0
+ language:
+ - en
+ pipeline_tag: reinforcement-learning
+ tags:
+ - web
+ - game
+ - CosmicVoyage
+ ---
+ This model is a reinforcement learning agent trained to play the web-based game Cosmic Voyager autonomously. It was trained with the Proximal Policy Optimization (PPO) algorithm to maximize in-game performance.
+
+ Training Configuration:
+
+ Algorithm: Proximal Policy Optimization (PPO)
+ Policy: Convolutional Neural Network (CnnPolicy)
+ Learning Rate: 5e-5
+ Batch Size: 256
+ Number of Steps per Update (n_steps): 2048
+ Number of Epochs: 20
+ Maximum Gradient Norm (max_grad_norm): 0.75
+ Discount Factor (gamma): 0.95
+ GAE Lambda (gae_lambda): 0.95
+ Clip Range: 0.1
+ Entropy Coefficient (ent_coef): 0.02
+ Target KL Divergence (target_kl): 0.025
+ Total Timesteps: 3,000,000
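The parameter names listed above (CnnPolicy, n_steps, ent_coef, target_kl) match the PPO implementation in Stable-Baselines3, although the README does not name the library. Assuming that library, a minimal sketch of how the listed values could be passed to the constructor follows. The helper names and the single-environment default (`n_envs=1`) are illustrative assumptions, not part of the original training script.

```python
# Hyperparameters copied from the list above. The keyword names assume
# the Stable-Baselines3 PPO constructor; the README does not name the library.
PPO_KWARGS = {
    "learning_rate": 5e-5,
    "batch_size": 256,
    "n_steps": 2048,
    "n_epochs": 20,
    "max_grad_norm": 0.75,
    "gamma": 0.95,
    "gae_lambda": 0.95,
    "clip_range": 0.1,
    "ent_coef": 0.02,
    "target_kl": 0.025,
}
TOTAL_TIMESTEPS = 3_000_000


def minibatches_per_epoch(n_envs=1):
    """Minibatches per optimisation epoch (n_envs=1 is an assumption)."""
    rollout_size = PPO_KWARGS["n_steps"] * n_envs
    return rollout_size // PPO_KWARGS["batch_size"]


def build_model(env):
    """Create an untrained PPO agent for an image-observation environment."""
    from stable_baselines3 import PPO  # lazy import: third-party dependency

    return PPO("CnnPolicy", env, **PPO_KWARGS)
```

With a single environment, each 2048-step rollout is split into 8 minibatches of 256 samples, so 20 epochs give at most 160 gradient steps per update; the `target_kl` setting may end an update early. Training would then be started with `build_model(env).learn(total_timesteps=TOTAL_TIMESTEPS)`.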
+ Policy Architecture:
+
+ Feature Extractor Dimensions: 1024
+ Network Architecture:
+ Policy Network (pi): [1024, 512, 256]
+ Value Function Network (vf): [1024, 512, 256]
+ Activation Function: LeakyReLU
+ Image Normalization: Disabled
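The architecture settings above could be expressed as a Stable-Baselines3 `policy_kwargs` dictionary, sketched below. This assumes the library's default CNN feature extractor widened to 1024 output features; the README does not say whether a custom extractor was used. `mlp_param_count` is a hypothetical helper added only to show the size of each head.

```python
# Layer sizes copied from the README.
FEATURES_DIM = 1024
NET_ARCH = {"pi": [1024, 512, 256], "vf": [1024, 512, 256]}


def mlp_param_count(input_dim, hidden_sizes):
    """Weights plus biases of a fully connected stack (output layer excluded)."""
    total = 0
    for width in hidden_sizes:
        total += input_dim * width + width
        input_dim = width
    return total


def build_policy_kwargs():
    """Assumed Stable-Baselines3 `policy_kwargs`; requires PyTorch."""
    import torch.nn as nn  # lazy import: third-party dependency

    return {
        # Assumption: default CNN extractor with a 1024-dimensional output.
        "features_extractor_kwargs": {"features_dim": FEATURES_DIM},
        "net_arch": NET_ARCH,
        "activation_fn": nn.LeakyReLU,
        # Disables the division by 255 that the library applies by default.
        "normalize_images": False,
    }
```

Under these assumptions, the hidden layers of each head hold `mlp_param_count(FEATURES_DIM, NET_ARCH["pi"])` = 1,705,728 parameters, not counting the final action or value output layer.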
+ Environment Configuration:
+
+ Observation Dimensions: Adjusted to fit the game's requirements
+ Frame Stacking: Implemented to provide temporal context
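Frame stacking gives the policy several consecutive frames at once, so it can infer motion that a single image cannot show. The README does not state how many frames were stacked or how the stack was built; the sketch below uses a hypothetical depth of 4 and repeats the first frame at reset, which is one common choice.

```python
from collections import deque


class FrameStacker:
    """Keep the most recent `n_frames` observations as one stacked tuple."""

    def __init__(self, n_frames=4):  # 4 is a hypothetical value
        self.frames = deque(maxlen=n_frames)

    def reset(self, first_frame):
        """Start an episode by repeating the first frame to fill the stack."""
        self.frames.clear()
        for _ in range(self.frames.maxlen):
            self.frames.append(first_frame)
        return tuple(self.frames)

    def push(self, frame):
        """Add the newest frame; the oldest one is dropped automatically."""
        self.frames.append(frame)
        return tuple(self.frames)
```

With image observations, the stacked frames would typically be concatenated along the channel axis before being passed to the CNN. Stable-Baselines3 offers `VecFrameStack` for this purpose, though whether it was used here is not stated.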
+ Usage:
+
+ This model is designed to be integrated into the Cosmic Voyager game, enabling autonomous gameplay. For integration details and deployment instructions, please refer to the accompanying documentation.
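As a rough illustration of what integration might look like, the sketch below loads a saved agent and plays one episode. It assumes a Stable-Baselines3 checkpoint and a Gymnasium-style environment whose observations already match the training setup (including frame stacking). The checkpoint path and the environment object are hypothetical and must come from the accompanying documentation.

```python
def load_agent(checkpoint_path):
    """Load a saved PPO agent; `checkpoint_path` is supplied by the user."""
    from stable_baselines3 import PPO  # lazy import: third-party dependency

    return PPO.load(checkpoint_path)


def run_episode(model, env, max_steps=10_000):
    """Play one episode greedily and return the summed reward."""
    obs, _info = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        # deterministic=True picks the most likely action instead of sampling.
        action, _state = model.predict(obs, deterministic=True)
        obs, reward, terminated, truncated, _info = env.step(action)
        total_reward += float(reward)
        if terminated or truncated:
            break
    return total_reward
```

`run_episode` only relies on `model.predict` and the `reset`/`step` interface, so it can be exercised with a stub environment before connecting it to the real game.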
+
+ Training Monitoring:
+
+ Training progress and metrics were tracked with Weights & Biases in the project 'Cosmic Voyager RL' under the entity 'andiB1293'.
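For readers who want to track their own runs in the same way, a minimal sketch of opening a Weights & Biases run follows. The project and entity names are taken from the README; the helper functions are hypothetical, and the `entity` override exists because writing to the author's entity would require access to it.

```python
# Names taken from the README.
WANDB_SETTINGS = {"project": "Cosmic Voyager RL", "entity": "andiB1293"}


def run_settings(entity=None):
    """Return W&B settings, optionally replacing the entity with your own."""
    settings = dict(WANDB_SETTINGS)  # copy so the defaults stay unchanged
    if entity is not None:
        settings["entity"] = entity
    return settings


def start_run(config, entity=None):
    """Open a W&B run that records `config` (e.g. the PPO hyperparameters)."""
    import wandb  # lazy import: third-party dependency

    return wandb.init(config=config, **run_settings(entity))
```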
+
+ Disclaimer:
+
+ This model is tailored specifically for the Cosmic Voyager game environment. Performance in different settings or games may vary. Users are advised to test the model thoroughly in their specific use cases.