A2C Training
Browse files- README.md +37 -0
- a2c.zip +3 -0
- a2c/_stable_baselines3_version +1 -0
- a2c/data +0 -0
- a2c/policy.optimizer.pth +3 -0
- a2c/policy.pth +3 -0
- a2c/pytorch_variables.pth +3 -0
- a2c/system_info.txt +9 -0
- config.json +0 -0
- results.json +1 -0
    	
        README.md
    ADDED
    
    | @@ -0,0 +1,37 @@ | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | 
|  | |
| 1 | 
            +
            ---
         | 
| 2 | 
            +
            library_name: stable-baselines3
         | 
| 3 | 
            +
            tags:
         | 
| 4 | 
            +
            - ALE/Pong-v5
         | 
| 5 | 
            +
            - deep-reinforcement-learning
         | 
| 6 | 
            +
            - reinforcement-learning
         | 
| 7 | 
            +
            - stable-baselines3
         | 
| 8 | 
            +
            model-index:
         | 
| 9 | 
            +
            - name: A2C
         | 
| 10 | 
            +
              results:
         | 
| 11 | 
            +
              - task:
         | 
| 12 | 
            +
                  type: reinforcement-learning
         | 
| 13 | 
            +
                  name: reinforcement-learning
         | 
| 14 | 
            +
                dataset:
         | 
| 15 | 
            +
                  name: ALE/Pong-v5
         | 
| 16 | 
            +
                  type: ALE/Pong-v5
         | 
| 17 | 
            +
                metrics:
         | 
| 18 | 
            +
                - type: mean_reward
         | 
| 19 | 
            +
                  value: -19.80 +/- 3.60
         | 
| 20 | 
            +
                  name: mean_reward
         | 
| 21 | 
            +
                  verified: false
         | 
| 22 | 
            +
            ---
         | 
| 23 | 
            +
             | 
| 24 | 
            +
            # **A2C** Agent playing **ALE/Pong-v5**
         | 
| 25 | 
            +
            This is a trained model of a **A2C** agent playing **ALE/Pong-v5**
         | 
| 26 | 
            +
            using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
         | 
| 27 | 
            +
             | 
| 28 | 
            +
            ## Usage (with Stable-baselines3)
         | 
| 29 | 
            +
            TODO: Add your code
         | 
| 30 | 
            +
             | 
| 31 | 
            +
             | 
| 32 | 
            +
            ```python
         | 
| 33 | 
            +
            from stable_baselines3 import ...
         | 
| 34 | 
            +
            from huggingface_sb3 import load_from_hub
         | 
| 35 | 
            +
             | 
| 36 | 
            +
            ...
         | 
| 37 | 
            +
            ```
         | 
    	
        a2c.zip
    ADDED
    
    | @@ -0,0 +1,3 @@ | |
|  | |
|  | |
|  | 
|  | |
| 1 | 
            +
            version https://git-lfs.github.com/spec/v1
         | 
| 2 | 
            +
            oid sha256:a1906d638ed85911deb713783cfd2e74ea1163f97e7d764a8d5908122ac3de02
         | 
| 3 | 
            +
            size 13592914
         | 
    	
        a2c/_stable_baselines3_version
    ADDED
    
    | @@ -0,0 +1 @@ | |
|  | 
|  | |
| 1 | 
            +
            2.3.2
         | 
    	
        a2c/data
    ADDED
    
    | The diff for this file is too large to render. 
		See raw diff | 
|  | 
    	
        a2c/policy.optimizer.pth
    ADDED
    
    | @@ -0,0 +1,3 @@ | |
|  | |
|  | |
|  | 
|  | |
| 1 | 
            +
            version https://git-lfs.github.com/spec/v1
         | 
| 2 | 
            +
            oid sha256:3d79d6ce5fa28ae8a36c906d21321d1346749970651d2560b890950d58893d44
         | 
| 3 | 
            +
            size 6733134
         | 
    	
        a2c/policy.pth
    ADDED
    
    | @@ -0,0 +1,3 @@ | |
|  | |
|  | |
|  | 
|  | |
| 1 | 
            +
            version https://git-lfs.github.com/spec/v1
         | 
| 2 | 
            +
            oid sha256:57a8cb7338609e0f6611d16b77387858732ac6f0ee1f3e0d199c3aa9c1064f24
         | 
| 3 | 
            +
            size 6733298
         | 
    	
        a2c/pytorch_variables.pth
    ADDED
    
    | @@ -0,0 +1,3 @@ | |
|  | |
|  | |
|  | 
|  | |
| 1 | 
            +
            version https://git-lfs.github.com/spec/v1
         | 
| 2 | 
            +
            oid sha256:ebdad4b9cfe9cd22a3abadb5623bf7bb1f6eb2e408740245eb3f2044b0adc018
         | 
| 3 | 
            +
            size 864
         | 
    	
        a2c/system_info.txt
    ADDED
    
    | @@ -0,0 +1,9 @@ | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | 
|  | |
| 1 | 
            +
            - OS: macOS-14.7-arm64-arm-64bit Darwin Kernel Version 23.6.0: Wed Jul 31 20:48:52 PDT 2024; root:xnu-10063.141.1.700.5~1/RELEASE_ARM64_T6020
         | 
| 2 | 
            +
            - Python: 3.9.19
         | 
| 3 | 
            +
            - Stable-Baselines3: 2.3.2
         | 
| 4 | 
            +
            - PyTorch: 2.4.0
         | 
| 5 | 
            +
            - GPU Enabled: False
         | 
| 6 | 
            +
            - Numpy: 2.0.2
         | 
| 7 | 
            +
            - Cloudpickle: 3.0.0
         | 
| 8 | 
            +
            - Gymnasium: 0.29.1
         | 
| 9 | 
            +
            - OpenAI Gym: 0.26.2
         | 
    	
        config.json
    ADDED
    
    | The diff for this file is too large to render. 
		See raw diff | 
|  | 
    	
        results.json
    ADDED
    
    | @@ -0,0 +1 @@ | |
|  | 
|  | |
| 1 | 
            +
            {"mean_reward": -19.8, "std_reward": 3.6, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2024-09-27T20:21:45.357551"}
         | 
