CliffWalking Q-Learning Agent

This repository contains a Q-learning agent trained on the CliffWalking-v0 environment from Gymnasium. The agent learns to navigate the cliff, avoiding falling into the cliff zone while reaching the goal with minimal penalties. The Q-learning algorithm is implemented with epsilon-greedy exploration and updates the Q-table based on state-action-reward transitions.

Files:

  • train.py: The main script that trains the Q-learning agent.
  • cliffWalking_qtable.npy: The saved Q-table after training.
  • replay.mp4: A video of the agent's performance after training.

Training Details:

  • Environment: CliffWalking-v0 (Gymnasium)
  • Episodes: 30,000
  • Learning Rate (α): 0.2
  • Discount Factor (γ): 0.97
  • Epsilon (ε): 0.2 (exploration vs exploitation trade-off)

The agent starts by exploring the environment randomly and gradually learns the optimal path to avoid falling off the cliff while reaching the goal.

How to Run:

1. Install Dependencies:

Make sure you have the required packages installed:

pip install gymnasium numpy imageio[ffmpeg]

2. Training the Agent:

To train the agent, run the script train.py:

python train.py
Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading