Vijay Shrivarshan Vijayaraja committed: Update README.md

This project implements a Deep Q-Network (DQN) to train an agent to solve the Lunar Lander environment from OpenAI Gym. The goal is to teach the agent to safely control a lunar lander to land on the moon's surface by interacting with the environment.

The project includes:

- A fully implemented DQN algorithm.
- Real-time visualization of the training process using Pygame.
- Dynamic plotting of training progress using Matplotlib.

---
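At the heart of the DQN algorithm above sits a replay memory. The sketch below is a minimal illustration of that idea, not the exact code from this repository; the `BUFFER_SIZE` and `BATCH_SIZE` names simply mirror the hyperparameters listed later in this README, and the 8-dimensional state / 4 discrete actions match the Lunar Lander environment.

```python
import random
from collections import deque, namedtuple

import numpy as np

# Names and values mirror the README's hyperparameter defaults.
BUFFER_SIZE = int(1e5)
BATCH_SIZE = 64

Transition = namedtuple("Transition", ["state", "action", "reward", "next_state", "done"])

class ReplayBuffer:
    """Fixed-size memory of past transitions, sampled uniformly at random.

    Random sampling decorrelates consecutive experiences, which is one of the
    key ingredients that stabilizes DQN training.
    """

    def __init__(self, buffer_size=BUFFER_SIZE):
        self.memory = deque(maxlen=buffer_size)  # oldest transitions fall off

    def add(self, state, action, reward, next_state, done):
        self.memory.append(Transition(state, action, reward, next_state, done))

    def sample(self, batch_size=BATCH_SIZE):
        batch = random.sample(self.memory, k=batch_size)
        # Stack each field into one array so the network sees a whole minibatch.
        states = np.stack([t.state for t in batch])
        actions = np.array([t.action for t in batch])
        rewards = np.array([t.reward for t in batch], dtype=np.float32)
        next_states = np.stack([t.next_state for t in batch])
        dones = np.array([t.done for t in batch], dtype=np.float32)
        return states, actions, rewards, next_states, dones

    def __len__(self):
        return len(self.memory)

# LunarLander has an 8-dimensional state and 4 discrete actions.
buffer = ReplayBuffer()
rng = np.random.default_rng(0)
for _ in range(200):
    buffer.add(rng.normal(size=8), int(rng.integers(4)), 0.0, rng.normal(size=8), False)
states, actions, rewards, next_states, dones = buffer.sample()
print(states.shape, actions.shape)  # (64, 8) (64,)
```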
You can modify the following hyperparameters in the script to customize training:

**Learning Rate:** LR (default: 5e-4)

**Batch Size:** BATCH_SIZE (default: 64)

**Discount Factor (Gamma):** GAMMA (default: 0.99)

**Replay Buffer Size:** BUFFER_SIZE (default: 1e5)

**Target Network Update Rate:** TAU (default: 1e-3)

**Update Frequency:** UPDATE_EVERY (default: 4)

---
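To see how GAMMA, TAU, and UPDATE_EVERY interact, here is a small sketch under the assumption that the script follows the standard DQN recipe (this is not a copy of its code): the agent computes a one-step TD target discounted by GAMMA, triggers a learning step every UPDATE_EVERY environment steps, and softly blends the local weights into the target network at rate TAU.

```python
import numpy as np

# Defaults copied from the hyperparameter list above.
GAMMA = 0.99
TAU = 1e-3
UPDATE_EVERY = 4

def td_target(rewards, max_next_q, dones, gamma=GAMMA):
    """One-step TD target: r + gamma * max_a' Q_target(s', a'), zeroed at episode end."""
    return rewards + gamma * max_next_q * (1.0 - dones)

def soft_update(target_w, local_w, tau=TAU):
    """Polyak averaging: theta_target <- tau * theta_local + (1 - tau) * theta_target."""
    return [tau * l + (1.0 - tau) * t for l, t in zip(local_w, target_w)]

# A learning step is only triggered every UPDATE_EVERY environment steps:
learn_steps = [t for t in range(1, 17) if t % UPDATE_EVERY == 0]
print(learn_steps)  # [4, 8, 12, 16]

# Mid-episode transition vs. terminal transition (done=1 drops the bootstrap term):
y = td_target(np.array([1.0, 1.0]), np.array([10.0, 10.0]), np.array([0.0, 1.0]))
# y[0] = 1 + 0.99 * 10 = 10.9, y[1] = 1.0

# One soft update moves the target weight only a tiny step toward the local weight.
new_target = soft_update([np.array([0.0])], [np.array([1.0])])
```

The small TAU means the target network trails the local network slowly, which keeps the TD targets from chasing a moving estimate too quickly.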