msgerasyov commited on
Commit
fd98a85
1 Parent(s): a71bd48

Upload . with huggingface_hub

Browse files
Files changed (3) hide show
  1. README.md +1 -1
  2. replay.mp4 +2 -2
  3. sf_log.txt +190 -0
README.md CHANGED
@@ -15,7 +15,7 @@ model-index:
15
  type: doom_health_gathering_supreme
16
  metrics:
17
  - type: mean_reward
18
- value: 16.78 +/- 4.67
19
  name: mean_reward
20
  verified: false
21
  ---
 
15
  type: doom_health_gathering_supreme
16
  metrics:
17
  - type: mean_reward
18
+ value: 14.73 +/- 4.56
19
  name: mean_reward
20
  verified: false
21
  ---
replay.mp4 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:21ca11d2ad2dfa15a797f79297af88482451bf6965fdfb8bc144d90614787b39
3
- size 32950546
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:222bc36f66fe45af3bd2c7b3626e94d418d48d3a903ab89d1ca2799d53f273c9
3
+ size 28961573
sf_log.txt CHANGED
@@ -5442,3 +5442,193 @@ vizdoom.vizdoom.SignalException: Signal SIGINT received. ViZDoom instance has be
5442
  [2023-02-25 21:30:00,554][00219] Avg episode rewards: #0: 44.874, true rewards: #0: 16.775
5443
  [2023-02-25 21:30:00,557][00219] Avg episode reward: 44.874, avg true_objective: 16.775
5444
  [2023-02-25 21:31:35,010][00219] Replay video saved to /content/train_dir/default_experiment/replay.mp4!
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5442
  [2023-02-25 21:30:00,554][00219] Avg episode rewards: #0: 44.874, true rewards: #0: 16.775
5443
  [2023-02-25 21:30:00,557][00219] Avg episode reward: 44.874, avg true_objective: 16.775
5444
  [2023-02-25 21:31:35,010][00219] Replay video saved to /content/train_dir/default_experiment/replay.mp4!
5445
+ [2023-02-25 21:31:38,070][00219] The model has been pushed to https://huggingface.co/msgerasyov/rl_course_vizdoom_health_gathering_supreme
5446
+ [2023-02-25 21:31:40,245][00219] Loading existing experiment configuration from /content/train_dir/default_experiment/config.json
5447
+ [2023-02-25 21:31:40,247][00219] Overriding arg 'num_workers' with value 1 passed from command line
5448
+ [2023-02-25 21:31:40,250][00219] Adding new argument 'no_render'=True that is not in the saved config file!
5449
+ [2023-02-25 21:31:40,253][00219] Adding new argument 'save_video'=True that is not in the saved config file!
5450
+ [2023-02-25 21:31:40,254][00219] Adding new argument 'video_frames'=1000000000.0 that is not in the saved config file!
5451
+ [2023-02-25 21:31:40,260][00219] Adding new argument 'video_name'=None that is not in the saved config file!
5452
+ [2023-02-25 21:31:40,261][00219] Adding new argument 'max_num_frames'=100000 that is not in the saved config file!
5453
+ [2023-02-25 21:31:40,263][00219] Adding new argument 'max_num_episodes'=10 that is not in the saved config file!
5454
+ [2023-02-25 21:31:40,264][00219] Adding new argument 'push_to_hub'=True that is not in the saved config file!
5455
+ [2023-02-25 21:31:40,268][00219] Adding new argument 'hf_repository'='msgerasyov/rl_course_vizdoom_health_gathering_supreme' that is not in the saved config file!
5456
+ [2023-02-25 21:31:40,269][00219] Adding new argument 'policy_index'=0 that is not in the saved config file!
5457
+ [2023-02-25 21:31:40,271][00219] Adding new argument 'eval_deterministic'=False that is not in the saved config file!
5458
+ [2023-02-25 21:31:40,272][00219] Adding new argument 'train_script'=None that is not in the saved config file!
5459
+ [2023-02-25 21:31:40,274][00219] Adding new argument 'enjoy_script'=None that is not in the saved config file!
5460
+ [2023-02-25 21:31:40,275][00219] Using frameskip 1 and render_action_repeat=4 for evaluation
5461
+ [2023-02-25 21:31:40,303][00219] RunningMeanStd input shape: (3, 72, 128)
5462
+ [2023-02-25 21:31:40,306][00219] RunningMeanStd input shape: (1,)
5463
+ [2023-02-25 21:31:40,337][00219] ConvEncoder: input_channels=3
5464
+ [2023-02-25 21:31:40,431][00219] Conv encoder output size: 512
5465
+ [2023-02-25 21:31:40,437][00219] Policy head output size: 512
5466
+ [2023-02-25 21:31:40,468][00219] Loading state from checkpoint /content/train_dir/default_experiment/checkpoint_p0/checkpoint_000007466_30580736.pth...
5467
+ [2023-02-25 21:31:40,958][00219] Num frames 100...
5468
+ [2023-02-25 21:31:41,067][00219] Num frames 200...
5469
+ [2023-02-25 21:31:41,184][00219] Num frames 300...
5470
+ [2023-02-25 21:31:41,296][00219] Num frames 400...
5471
+ [2023-02-25 21:31:41,424][00219] Num frames 500...
5472
+ [2023-02-25 21:31:41,540][00219] Num frames 600...
5473
+ [2023-02-25 21:31:41,674][00219] Num frames 700...
5474
+ [2023-02-25 21:31:41,841][00219] Avg episode rewards: #0: 17.680, true rewards: #0: 7.680
5475
+ [2023-02-25 21:31:41,844][00219] Avg episode reward: 17.680, avg true_objective: 7.680
5476
+ [2023-02-25 21:31:41,903][00219] Num frames 800...
5477
+ [2023-02-25 21:31:42,013][00219] Num frames 900...
5478
+ [2023-02-25 21:31:42,126][00219] Num frames 1000...
5479
+ [2023-02-25 21:31:42,245][00219] Num frames 1100...
5480
+ [2023-02-25 21:31:42,363][00219] Num frames 1200...
5481
+ [2023-02-25 21:31:42,483][00219] Num frames 1300...
5482
+ [2023-02-25 21:31:42,597][00219] Num frames 1400...
5483
+ [2023-02-25 21:31:42,716][00219] Num frames 1500...
5484
+ [2023-02-25 21:31:42,838][00219] Num frames 1600...
5485
+ [2023-02-25 21:31:42,958][00219] Num frames 1700...
5486
+ [2023-02-25 21:31:43,070][00219] Num frames 1800...
5487
+ [2023-02-25 21:31:43,184][00219] Num frames 1900...
5488
+ [2023-02-25 21:31:43,337][00219] Avg episode rewards: #0: 21.920, true rewards: #0: 9.920
5489
+ [2023-02-25 21:31:43,339][00219] Avg episode reward: 21.920, avg true_objective: 9.920
5490
+ [2023-02-25 21:31:43,361][00219] Num frames 2000...
5491
+ [2023-02-25 21:31:43,483][00219] Num frames 2100...
5492
+ [2023-02-25 21:31:43,597][00219] Num frames 2200...
5493
+ [2023-02-25 21:31:43,760][00219] Num frames 2300...
5494
+ [2023-02-25 21:31:43,976][00219] Num frames 2400...
5495
+ [2023-02-25 21:31:44,140][00219] Num frames 2500...
5496
+ [2023-02-25 21:31:44,397][00219] Num frames 2600...
5497
+ [2023-02-25 21:31:44,588][00219] Num frames 2700...
5498
+ [2023-02-25 21:31:44,865][00219] Num frames 2800...
5499
+ [2023-02-25 21:31:45,077][00219] Num frames 2900...
5500
+ [2023-02-25 21:31:45,278][00219] Num frames 3000...
5501
+ [2023-02-25 21:31:45,462][00219] Num frames 3100...
5502
+ [2023-02-25 21:31:45,688][00219] Num frames 3200...
5503
+ [2023-02-25 21:31:45,936][00219] Num frames 3300...
5504
+ [2023-02-25 21:31:46,118][00219] Num frames 3400...
5505
+ [2023-02-25 21:31:46,292][00219] Num frames 3500...
5506
+ [2023-02-25 21:31:46,470][00219] Num frames 3600...
5507
+ [2023-02-25 21:31:46,639][00219] Num frames 3700...
5508
+ [2023-02-25 21:31:46,967][00219] Num frames 3800...
5509
+ [2023-02-25 21:31:47,107][00219] Avg episode rewards: #0: 32.713, true rewards: #0: 12.713
5510
+ [2023-02-25 21:31:47,112][00219] Avg episode reward: 32.713, avg true_objective: 12.713
5511
+ [2023-02-25 21:31:47,420][00219] Num frames 3900...
5512
+ [2023-02-25 21:31:47,881][00219] Num frames 4000...
5513
+ [2023-02-25 21:31:48,301][00219] Num frames 4100...
5514
+ [2023-02-25 21:31:48,730][00219] Num frames 4200...
5515
+ [2023-02-25 21:31:49,382][00219] Num frames 4300...
5516
+ [2023-02-25 21:31:49,744][00219] Num frames 4400...
5517
+ [2023-02-25 21:31:50,103][00219] Num frames 4500...
5518
+ [2023-02-25 21:31:50,485][00219] Num frames 4600...
5519
+ [2023-02-25 21:31:50,894][00219] Num frames 4700...
5520
+ [2023-02-25 21:31:51,299][00219] Num frames 4800...
5521
+ [2023-02-25 21:31:51,492][00219] Num frames 4900...
5522
+ [2023-02-25 21:31:51,638][00219] Avg episode rewards: #0: 31.097, true rewards: #0: 12.347
5523
+ [2023-02-25 21:31:51,644][00219] Avg episode reward: 31.097, avg true_objective: 12.347
5524
+ [2023-02-25 21:31:51,772][00219] Num frames 5000...
5525
+ [2023-02-25 21:31:51,964][00219] Num frames 5100...
5526
+ [2023-02-25 21:31:52,190][00219] Num frames 5200...
5527
+ [2023-02-25 21:31:52,325][00219] Num frames 5300...
5528
+ [2023-02-25 21:31:52,436][00219] Num frames 5400...
5529
+ [2023-02-25 21:31:52,558][00219] Num frames 5500...
5530
+ [2023-02-25 21:31:52,674][00219] Num frames 5600...
5531
+ [2023-02-25 21:31:52,795][00219] Num frames 5700...
5532
+ [2023-02-25 21:31:52,908][00219] Num frames 5800...
5533
+ [2023-02-25 21:31:53,023][00219] Num frames 5900...
5534
+ [2023-02-25 21:31:53,161][00219] Avg episode rewards: #0: 30.140, true rewards: #0: 11.940
5535
+ [2023-02-25 21:31:53,163][00219] Avg episode reward: 30.140, avg true_objective: 11.940
5536
+ [2023-02-25 21:31:53,206][00219] Num frames 6000...
5537
+ [2023-02-25 21:31:53,334][00219] Num frames 6100...
5538
+ [2023-02-25 21:31:53,453][00219] Num frames 6200...
5539
+ [2023-02-25 21:31:53,573][00219] Num frames 6300...
5540
+ [2023-02-25 21:31:53,686][00219] Num frames 6400...
5541
+ [2023-02-25 21:31:53,819][00219] Num frames 6500...
5542
+ [2023-02-25 21:31:53,930][00219] Num frames 6600...
5543
+ [2023-02-25 21:31:54,047][00219] Num frames 6700...
5544
+ [2023-02-25 21:31:54,159][00219] Num frames 6800...
5545
+ [2023-02-25 21:31:54,276][00219] Num frames 6900...
5546
+ [2023-02-25 21:31:54,389][00219] Num frames 7000...
5547
+ [2023-02-25 21:31:54,505][00219] Num frames 7100...
5548
+ [2023-02-25 21:31:54,623][00219] Num frames 7200...
5549
+ [2023-02-25 21:31:54,744][00219] Num frames 7300...
5550
+ [2023-02-25 21:31:54,864][00219] Num frames 7400...
5551
+ [2023-02-25 21:31:54,984][00219] Num frames 7500...
5552
+ [2023-02-25 21:31:55,105][00219] Num frames 7600...
5553
+ [2023-02-25 21:31:55,226][00219] Num frames 7700...
5554
+ [2023-02-25 21:31:55,345][00219] Num frames 7800...
5555
+ [2023-02-25 21:31:55,461][00219] Num frames 7900...
5556
+ [2023-02-25 21:31:55,583][00219] Num frames 8000...
5557
+ [2023-02-25 21:31:55,719][00219] Avg episode rewards: #0: 34.283, true rewards: #0: 13.450
5558
+ [2023-02-25 21:31:55,721][00219] Avg episode reward: 34.283, avg true_objective: 13.450
5559
+ [2023-02-25 21:31:55,769][00219] Num frames 8100...
5560
+ [2023-02-25 21:31:55,886][00219] Num frames 8200...
5561
+ [2023-02-25 21:31:55,998][00219] Num frames 8300...
5562
+ [2023-02-25 21:31:56,114][00219] Num frames 8400...
5563
+ [2023-02-25 21:31:56,229][00219] Num frames 8500...
5564
+ [2023-02-25 21:31:56,357][00219] Num frames 8600...
5565
+ [2023-02-25 21:31:56,469][00219] Num frames 8700...
5566
+ [2023-02-25 21:31:56,578][00219] Num frames 8800...
5567
+ [2023-02-25 21:31:56,693][00219] Num frames 8900...
5568
+ [2023-02-25 21:31:56,812][00219] Num frames 9000...
5569
+ [2023-02-25 21:31:56,930][00219] Num frames 9100...
5570
+ [2023-02-25 21:31:57,042][00219] Num frames 9200...
5571
+ [2023-02-25 21:31:57,159][00219] Num frames 9300...
5572
+ [2023-02-25 21:31:57,270][00219] Num frames 9400...
5573
+ [2023-02-25 21:31:57,388][00219] Num frames 9500...
5574
+ [2023-02-25 21:31:57,501][00219] Num frames 9600...
5575
+ [2023-02-25 21:31:57,618][00219] Num frames 9700...
5576
+ [2023-02-25 21:31:57,730][00219] Num frames 9800...
5577
+ [2023-02-25 21:31:57,808][00219] Avg episode rewards: #0: 36.454, true rewards: #0: 14.026
5578
+ [2023-02-25 21:31:57,810][00219] Avg episode reward: 36.454, avg true_objective: 14.026
5579
+ [2023-02-25 21:31:57,909][00219] Num frames 9900...
5580
+ [2023-02-25 21:31:58,021][00219] Num frames 10000...
5581
+ [2023-02-25 21:31:58,136][00219] Num frames 10100...
5582
+ [2023-02-25 21:31:58,248][00219] Num frames 10200...
5583
+ [2023-02-25 21:31:58,365][00219] Num frames 10300...
5584
+ [2023-02-25 21:31:58,478][00219] Num frames 10400...
5585
+ [2023-02-25 21:31:58,589][00219] Num frames 10500...
5586
+ [2023-02-25 21:31:58,707][00219] Num frames 10600...
5587
+ [2023-02-25 21:31:58,829][00219] Num frames 10700...
5588
+ [2023-02-25 21:31:58,943][00219] Num frames 10800...
5589
+ [2023-02-25 21:31:59,057][00219] Num frames 10900...
5590
+ [2023-02-25 21:31:59,175][00219] Num frames 11000...
5591
+ [2023-02-25 21:31:59,290][00219] Num frames 11100...
5592
+ [2023-02-25 21:31:59,408][00219] Num frames 11200...
5593
+ [2023-02-25 21:31:59,520][00219] Num frames 11300...
5594
+ [2023-02-25 21:31:59,631][00219] Num frames 11400...
5595
+ [2023-02-25 21:31:59,742][00219] Num frames 11500...
5596
+ [2023-02-25 21:31:59,854][00219] Avg episode rewards: #0: 38.057, true rewards: #0: 14.432
5597
+ [2023-02-25 21:31:59,856][00219] Avg episode reward: 38.057, avg true_objective: 14.432
5598
+ [2023-02-25 21:31:59,920][00219] Num frames 11600...
5599
+ [2023-02-25 21:32:00,036][00219] Num frames 11700...
5600
+ [2023-02-25 21:32:00,147][00219] Num frames 11800...
5601
+ [2023-02-25 21:32:00,267][00219] Num frames 11900...
5602
+ [2023-02-25 21:32:00,382][00219] Num frames 12000...
5603
+ [2023-02-25 21:32:00,494][00219] Num frames 12100...
5604
+ [2023-02-25 21:32:00,607][00219] Num frames 12200...
5605
+ [2023-02-25 21:32:00,736][00219] Num frames 12300...
5606
+ [2023-02-25 21:32:00,861][00219] Num frames 12400...
5607
+ [2023-02-25 21:32:00,975][00219] Num frames 12500...
5608
+ [2023-02-25 21:32:01,094][00219] Num frames 12600...
5609
+ [2023-02-25 21:32:01,207][00219] Num frames 12700...
5610
+ [2023-02-25 21:32:01,340][00219] Num frames 12800...
5611
+ [2023-02-25 21:32:01,503][00219] Num frames 12900...
5612
+ [2023-02-25 21:32:01,670][00219] Num frames 13000...
5613
+ [2023-02-25 21:32:01,840][00219] Num frames 13100...
5614
+ [2023-02-25 21:32:02,001][00219] Num frames 13200...
5615
+ [2023-02-25 21:32:02,171][00219] Num frames 13300...
5616
+ [2023-02-25 21:32:02,341][00219] Num frames 13400...
5617
+ [2023-02-25 21:32:02,519][00219] Num frames 13500...
5618
+ [2023-02-25 21:32:02,695][00219] Num frames 13600...
5619
+ [2023-02-25 21:32:02,826][00219] Avg episode rewards: #0: 39.828, true rewards: #0: 15.162
5620
+ [2023-02-25 21:32:02,828][00219] Avg episode reward: 39.828, avg true_objective: 15.162
5621
+ [2023-02-25 21:32:02,941][00219] Num frames 13700...
5622
+ [2023-02-25 21:32:03,101][00219] Num frames 13800...
5623
+ [2023-02-25 21:32:03,269][00219] Num frames 13900...
5624
+ [2023-02-25 21:32:03,443][00219] Num frames 14000...
5625
+ [2023-02-25 21:32:03,620][00219] Num frames 14100...
5626
+ [2023-02-25 21:32:03,789][00219] Num frames 14200...
5627
+ [2023-02-25 21:32:03,988][00219] Num frames 14300...
5628
+ [2023-02-25 21:32:04,170][00219] Num frames 14400...
5629
+ [2023-02-25 21:32:04,353][00219] Num frames 14500...
5630
+ [2023-02-25 21:32:04,530][00219] Num frames 14600...
5631
+ [2023-02-25 21:32:04,716][00219] Num frames 14700...
5632
+ [2023-02-25 21:32:04,839][00219] Avg episode rewards: #0: 38.834, true rewards: #0: 14.734
5633
+ [2023-02-25 21:32:04,842][00219] Avg episode reward: 38.834, avg true_objective: 14.734
5634
+ [2023-02-25 21:33:34,041][00219] Replay video saved to /content/train_dir/default_experiment/replay.mp4!