PPO-Taxi-v3 / README.md
zap-thamm's picture
Upload of a new agent
16078a4 verified
|
raw
history blame
1.66 kB
metadata
tags:
  - Taxi-v3
  - reinforcement-learning
  - rl-framework
model-index:
  - name: PPO-Taxi-v3
    results:
      - task:
          type: reinforcement-learning
          name: reinforcement-learning
        dataset:
          name: Taxi-v3
          type: Taxi-v3
        metrics:
          - type: mean_reward
            value: 7.72 +/- 2.66
            name: mean_reward
            verified: false

PPO agent playing on Taxi-v3

This is a trained model of an agent playing on the environment Taxi-v3. The agent was trained with a PPO algorithm and evaluated for 100 episodes. See further agent and evaluation metadata in the according README section.

Import

The Python module used for training and uploading/downloading is rl-framework. It is an easy-to-read, plug-and-use Reinforcement Learning framework and provides standardized interfaces and implementations to various Reinforcement Learning methods and environments.

Also it provides connectors for the upload and download to popular model version control systems, including the HuggingFace Hub.

Usage


from rl_framework import StableBaselinesAgent, StableBaselinesAlgorithm

# Create new agent instance
agent = StableBaselinesAgent(
    algorithm=StableBaselinesAlgorithm.PPO
    algorithm_parameters={
        ...
    },
)

# Download existing agent from HF Hub
repository_id = "zap-thamm/PPO-Taxi-v3"
file_name = "algorithm.zip"
agent.download(repository_id=repository_id, filename=file_name)

Further examples can be found in the exploration section of the rl-framework repository.