CliffWalking Q-Learning Agent

This repository contains a Q-learning agent trained on the CliffWalking-v0 environment from Gymnasium. The agent learns to navigate the cliff, avoiding falling into the cliff zone while reaching the goal with minimal penalties. The Q-learning algorithm is implemented with epsilon-greedy exploration and updates the Q-table based on state-action-reward transitions.

Files:

train.py: The main script that trains the Q-learning agent.
cliffWalking_qtable.npy: The saved Q-table after training.
replay.mp4: A video of the agent's performance after training.

Training Details:

Environment: CliffWalking-v0 (Gymnasium)
Episodes: 30,000
Learning Rate (α): 0.2
Discount Factor (γ): 0.97
Epsilon (ε): 0.2 (exploration vs exploitation trade-off)

The agent starts by exploring the environment randomly and gradually learns the optimal path to avoid falling off the cliff while reaching the goal.

How to Run:

1. Install Dependencies:

Make sure you have the required packages installed:

pip install gymnasium numpy imageio[ffmpeg]

2. Training the Agent:

To train the agent, run the script train.py:

python train.py