2022-02-28 22:32:44,053 INFO MainThread:234672 [internal.py:wandb_internal():89] W&B internal server running at pid: 234672, started at: 2022-02-28 22:32:44.053045 2022-02-28 22:32:44,055 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: check_version 2022-02-28 22:32:44,056 INFO WriterThread:234672 [datastore.py:open_for_write():77] open: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/run-2ay2wvge.wandb 2022-02-28 22:32:44,057 DEBUG SenderThread:234672 [sender.py:send():235] send: header 2022-02-28 22:32:44,057 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: check_version 2022-02-28 22:32:44,125 DEBUG SenderThread:234672 [sender.py:send():235] send: run 2022-02-28 22:32:44,219 INFO SenderThread:234672 [dir_watcher.py:__init__():169] watching files in: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files 2022-02-28 22:32:44,219 INFO SenderThread:234672 [sender.py:_start_run_threads():809] run started: 2ay2wvge with start time 1646087563 2022-02-28 22:32:44,219 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:32:44,219 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:32:44,220 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: run_start 2022-02-28 22:32:44,225 DEBUG HandlerThread:234672 [meta.py:__init__():36] meta init 2022-02-28 22:32:44,226 DEBUG HandlerThread:234672 [meta.py:__init__():50] meta init done 2022-02-28 22:32:44,226 DEBUG HandlerThread:234672 [meta.py:probe():210] probe 2022-02-28 22:32:44,232 DEBUG HandlerThread:234672 [meta.py:_setup_git():200] setup git 2022-02-28 22:32:44,247 DEBUG HandlerThread:234672 [meta.py:_setup_git():207] setup git done 2022-02-28 22:32:44,247 DEBUG HandlerThread:234672 [meta.py:_save_pip():54] save pip 2022-02-28 22:32:44,248 DEBUG HandlerThread:234672 [meta.py:_save_pip():68] save pip done 2022-02-28 22:32:44,248 DEBUG HandlerThread:234672 [meta.py:probe():248] probe done 2022-02-28 22:32:44,351 DEBUG SenderThread:234672 [sender.py:send():235] send: files 2022-02-28 22:32:44,351 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-metadata.json with policy now 2022-02-28 22:32:44,355 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:32:44,356 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:32:44,400 DEBUG SenderThread:234672 [sender.py:send():235] send: config 2022-02-28 22:32:44,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:32:44,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:32:44,401 WARNING SenderThread:234672 [sender.py:send_metric():902] Seen metric with glob (shouldnt happen) 2022-02-28 22:32:44,599 INFO Thread-11 :234672 [upload_job.py:push():137] Uploaded file /tmp/tmpmt5j1akwwandb/rrdwz1yo-wandb-metadata.json 2022-02-28 22:32:45,221 INFO Thread-8 :234672 [dir_watcher.py:_on_file_created():217] file/dir created: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:32:45,221 INFO Thread-8 :234672 [dir_watcher.py:_on_file_created():217] file/dir created: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-metadata.json 2022-02-28 22:32:45,221 INFO Thread-8 :234672 [dir_watcher.py:_on_file_created():217] file/dir created: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/requirements.txt 2022-02-28 22:32:45,222 INFO Thread-8 :234672 [dir_watcher.py:_on_file_created():217] file/dir created: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:32:47,220 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:32:51,221 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:32:51,605 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:32:51,605 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:32:51,605 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:32:51,605 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:32:51,605 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:32:51,606 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:32:52,222 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:32:53,222 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:32:57,223 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:32:57,864 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:32:57,865 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:32:57,865 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:32:58,224 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:32:59,224 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:32:59,633 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:32:59,633 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:33:03,225 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:04,310 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:33:04,310 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:33:04,312 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:33:05,226 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:05,227 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:33:09,228 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:10,527 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:33:10,528 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:33:10,530 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:33:11,228 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:11,229 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:33:12,229 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:12,660 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:33:14,785 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:33:14,785 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:33:15,230 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/config.yaml 2022-02-28 22:33:16,230 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:16,579 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:33:16,580 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:33:16,580 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:33:17,230 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:33:18,231 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:22,232 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:22,716 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:33:22,716 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:33:22,718 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:33:23,233 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:33:24,233 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:28,235 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:28,748 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:33:28,748 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:33:28,750 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:33:29,235 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:33:30,072 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:33:30,073 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:33:30,235 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:34,237 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:34,688 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:33:34,689 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:33:34,690 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:33:35,237 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:33:36,237 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:37,238 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:38,238 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:40,639 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:33:40,639 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:33:40,641 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:33:41,239 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:33:42,240 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:43,069 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:33:43,240 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:45,135 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:33:45,136 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:33:46,241 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:46,506 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:33:46,507 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:33:46,507 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:33:47,241 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:33:48,242 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:49,242 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:51,243 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:52,469 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:33:52,469 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:33:52,471 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:33:53,243 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:33:55,244 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:57,245 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:33:58,415 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:33:58,416 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:33:58,416 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:33:59,245 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:34:00,171 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:34:00,172 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:34:01,246 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:03,247 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:04,259 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:34:04,259 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:34:04,260 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:34:05,247 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:05,248 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:34:09,249 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:10,054 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:34:10,054 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:34:10,054 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:34:10,249 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:34:11,250 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:13,439 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:34:15,207 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:34:15,207 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:34:15,251 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:15,855 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:34:15,855 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:34:15,855 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:34:16,251 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:34:17,252 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:21,253 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:21,294 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:34:21,294 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:34:21,294 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:34:22,254 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:34:23,254 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:25,255 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:26,989 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:34:26,990 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:34:26,991 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:34:27,256 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:27,256 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:34:29,256 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:30,257 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:30,281 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:34:30,281 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:34:32,258 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:32,437 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:34:32,437 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:34:32,438 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:34:33,258 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:33,258 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:34:34,258 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:36,259 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:37,941 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:34:37,941 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:34:37,942 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:34:38,260 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:34:39,260 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:40,260 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:42,261 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:43,426 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:34:43,427 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:34:43,427 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:34:43,822 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:34:44,262 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:44,262 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:34:45,262 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:45,540 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:34:45,540 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:34:48,263 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:48,828 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:34:48,829 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:34:48,829 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:34:49,263 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:34:50,264 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:52,264 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:54,178 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:34:54,178 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:34:54,179 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:34:54,265 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:34:56,266 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:57,266 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:59,267 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:34:59,541 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:34:59,541 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:34:59,541 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:35:00,267 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:35:00,645 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:35:00,646 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:35:01,267 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:02,268 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:03,268 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:04,878 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:35:04,879 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:35:04,879 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:35:05,269 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:35:06,269 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:07,270 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:09,270 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:10,084 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:35:10,084 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:35:10,085 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:35:10,270 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:35:11,271 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:12,271 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:14,192 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:35:15,272 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:15,293 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:35:15,294 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:35:15,294 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:35:15,938 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:35:15,939 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:35:16,273 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:35:17,273 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:18,273 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:19,274 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:20,486 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:35:20,486 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:35:20,486 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:35:21,274 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:21,275 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:35:22,275 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:25,543 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:35:25,543 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:35:25,544 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:35:26,276 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:26,276 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:35:28,277 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:30,277 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:30,638 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:35:30,638 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:35:30,638 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:35:31,253 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:35:31,253 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:35:31,278 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:35:32,278 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:34,279 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:35,707 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:35:35,708 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:35:35,708 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:35:36,279 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:35:38,280 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:40,281 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:40,743 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:35:40,744 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:35:40,744 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:35:41,281 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:35:42,281 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:44,282 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:44,576 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:35:45,646 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:35:45,646 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:35:45,647 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:35:46,283 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:46,283 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:35:46,562 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:35:46,562 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:35:48,283 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:50,284 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:50,523 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:35:50,524 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:35:50,524 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:35:51,284 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:35:52,285 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:55,286 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:55,318 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:35:55,318 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:35:55,318 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:35:56,286 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:56,286 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:35:57,287 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:35:59,287 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:00,027 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:00,027 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:00,028 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:00,288 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:01,288 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:01,725 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:36:01,725 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:36:03,289 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:04,636 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:04,636 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:04,637 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:05,289 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:05,290 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:07,290 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:09,194 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:09,194 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:09,195 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:09,291 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:09,291 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:11,292 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:13,292 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:13,520 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:13,520 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:13,521 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:14,293 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:14,978 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:36:15,293 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:16,764 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:36:16,764 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:36:17,294 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:17,732 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:17,732 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:17,733 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:18,294 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:19,294 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:21,295 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:21,804 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:21,805 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:21,805 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:22,295 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:23,296 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:25,296 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:25,678 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:25,679 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:25,679 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:26,297 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:27,297 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:29,289 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:29,289 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:29,290 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:29,298 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:29,298 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:31,299 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:31,810 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:36:31,810 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:36:32,595 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:32,595 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:32,596 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:33,299 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:33,300 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:35,300 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:35,643 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:35,643 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:35,643 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:36,301 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:37,301 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:38,297 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:38,297 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:38,298 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:38,301 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:39,302 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:40,621 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:40,621 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:40,621 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:41,302 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:41,303 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:42,698 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:42,699 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:42,699 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:43,303 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:43,303 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:44,554 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:44,554 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:44,554 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:45,304 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:45,304 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:45,350 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:36:46,148 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:46,148 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:46,149 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:46,304 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:46,996 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:36:46,997 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:36:47,304 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:48,133 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,138 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,144 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,144 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,144 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,144 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,144 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,149 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,149 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,155 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,155 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,160 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,165 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,170 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,176 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,189 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,199 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,199 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,199 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,199 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,199 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,199 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,200 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,200 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,200 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,200 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,200 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,200 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,200 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,200 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,200 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,200 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,206 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,216 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,216 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,216 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,216 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,216 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,216 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,217 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,217 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,217 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,217 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,217 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,222 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,222 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,222 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,222 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,222 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,223 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,223 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,223 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,223 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,223 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,223 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,228 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,228 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,228 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,228 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,228 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,229 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,229 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,229 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,229 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,229 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,234 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,239 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,239 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,240 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,240 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,240 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,245 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,245 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,245 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,245 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,245 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,246 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,246 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,246 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,246 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,246 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,246 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,246 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,246 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,251 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,257 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,262 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,262 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,262 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,262 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,262 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,262 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,262 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,262 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,268 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,268 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,268 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,268 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,268 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,276 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,276 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,276 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,276 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,276 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,277 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,277 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,277 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,277 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,277 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,277 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,277 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,277 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,277 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,277 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,277 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,278 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,278 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,278 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,278 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,278 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,278 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,278 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,278 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,278 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,278 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,278 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,278 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,279 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,279 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,279 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,279 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,279 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,279 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,279 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,279 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,279 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,279 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,279 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,279 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,280 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,280 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,280 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,280 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,280 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,280 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,280 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,280 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,280 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,280 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,280 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,281 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,281 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,281 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,281 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,281 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,281 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,281 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,281 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,281 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,281 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,281 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,281 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,282 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,282 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,282 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,282 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,282 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,282 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,282 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,282 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,282 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,282 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,282 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,282 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,283 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,283 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,283 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,283 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,283 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,283 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,283 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,283 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,283 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,283 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,283 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,283 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,283 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,284 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,284 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,284 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,284 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,284 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,284 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,284 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,284 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,284 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,284 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,284 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,284 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,285 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,285 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,285 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,285 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,285 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,285 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,285 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,285 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,285 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,285 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,285 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,285 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,286 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,286 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,286 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,286 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,286 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,286 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,286 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,286 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,286 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,286 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,286 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,287 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,287 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,287 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,287 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,287 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,287 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,287 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,287 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,287 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,287 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,287 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,287 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,288 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,289 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,290 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,291 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,292 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,293 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,294 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,295 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,296 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,296 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,296 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,296 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,296 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,297 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,297 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,297 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,297 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,297 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,297 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,297 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,297 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,297 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,297 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,297 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,298 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,299 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,300 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,301 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,302 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,303 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,304 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,305 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,305 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,305 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,305 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,305 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,305 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,305 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,305 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,305 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,305 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,305 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,305 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,306 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,306 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,306 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,306 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,306 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,306 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,306 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,306 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,307 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,307 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,307 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,307 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,307 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,307 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,307 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,307 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,307 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,307 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,307 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,308 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,308 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,308 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,308 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,308 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,308 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,308 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,308 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,308 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,308 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,309 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,309 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,309 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,309 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,309 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,309 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,309 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,309 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,309 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,309 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,309 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,310 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,310 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,310 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,310 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,310 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,310 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,310 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,310 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,310 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,311 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,311 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,311 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,311 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,311 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,311 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,311 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,311 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,311 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,311 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,312 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,312 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,312 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,312 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,312 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,312 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,312 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,312 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,312 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,312 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,312 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,313 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,313 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,313 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,313 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,313 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,313 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,313 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,313 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,313 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,314 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,314 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,314 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,314 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,314 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,314 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,314 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,314 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,314 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,314 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,314 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,315 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,315 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,315 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,315 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,315 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,315 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,315 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,315 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,315 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,315 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,315 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,316 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,316 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,316 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,316 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,316 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,316 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,316 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,316 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,316 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,316 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,317 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,317 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,317 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,317 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,317 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,317 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,317 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,317 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,317 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,317 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,317 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,318 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,318 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,318 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,318 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,318 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,318 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,318 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,318 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,318 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,318 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,319 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,319 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,319 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,319 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,319 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,319 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,319 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,319 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,319 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,319 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,319 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,320 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,320 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,320 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,320 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,320 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,320 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,320 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,320 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,320 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,320 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,321 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,321 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,321 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,321 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,321 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,321 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,321 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,321 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,321 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,321 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,322 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,322 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,322 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,322 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,322 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,322 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,322 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,322 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,322 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,322 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,323 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,323 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,323 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,323 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,323 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,323 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,323 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,323 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,323 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,323 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,324 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,324 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,324 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,324 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,324 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,324 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,324 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,324 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,324 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,324 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,324 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,325 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,325 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,325 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,325 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,325 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,325 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,325 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,325 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,325 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,325 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,326 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,326 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,326 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,326 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,326 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,326 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,326 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,326 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,326 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,326 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,327 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,327 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,327 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,327 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,327 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,327 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,327 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,327 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,327 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,327 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,328 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,328 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,328 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,328 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,328 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,328 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,328 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,328 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,328 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,328 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,328 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,329 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,329 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,329 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,329 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,329 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,329 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,329 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,329 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,329 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,329 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,329 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,330 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,330 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,330 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,330 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,330 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,330 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,330 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,330 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,330 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,330 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,331 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,331 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,331 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,331 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,331 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,331 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,331 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,331 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,331 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,331 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,332 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,332 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,332 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,332 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,332 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,332 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,332 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,332 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,332 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,332 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,333 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,333 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,333 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,333 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,333 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,333 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,333 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,333 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,333 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,333 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,334 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,334 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,334 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,334 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,334 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,334 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,334 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,334 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,334 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,334 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,335 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,335 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,335 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,335 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,335 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,335 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,335 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,335 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,335 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,335 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,336 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,336 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,336 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,336 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,336 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,336 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,336 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,336 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,336 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,336 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,336 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,336 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,336 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,337 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,338 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,339 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,340 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,341 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,342 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,343 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,344 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,345 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,346 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,347 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,348 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,349 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,350 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,351 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,352 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,353 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,354 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,355 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,356 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,357 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,358 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,359 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,360 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,361 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,362 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,363 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,364 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,365 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,366 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,367 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,368 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,369 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,370 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,371 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,372 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,373 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,374 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,375 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,376 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,377 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,378 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,379 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,380 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,381 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,382 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,383 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,384 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,385 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,386 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,387 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,388 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,389 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,390 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,391 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,392 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,393 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,394 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,395 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,396 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,397 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,398 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,399 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,400 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,401 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,402 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,403 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,404 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,405 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,406 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,407 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,408 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,409 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,410 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,411 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,412 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,413 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,414 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,415 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,416 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,417 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,418 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,419 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,420 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,421 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,422 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,423 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,424 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,425 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,426 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,427 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,428 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,429 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,430 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,431 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,432 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,433 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,434 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,435 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,436 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,437 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,438 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,439 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,439 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,439 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,439 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,439 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,439 DEBUG SenderThread:234672 [sender.py:send():235] send: metric 2022-02-28 22:36:48,439 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:48,525 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:48,610 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:49,336 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:49,336 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:36:53,337 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:54,467 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:36:54,519 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:36:54,603 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:36:55,338 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:36:55,338 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:37:00,340 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:00,573 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:37:00,624 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:37:00,711 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:37:01,340 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:37:02,058 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:37:02,059 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:37:02,340 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:03,341 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:06,342 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:06,626 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:37:06,678 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:37:06,763 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:37:07,342 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:07,343 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:37:08,343 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:12,344 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:12,663 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:37:12,714 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:37:12,804 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:37:13,344 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:37:14,345 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:15,864 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:37:16,345 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:17,121 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:37:17,122 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:37:18,436 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/config.yaml 2022-02-28 22:37:18,527 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:37:18,579 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:37:18,658 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:37:19,436 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:37:20,437 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:22,438 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:24,442 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:37:24,498 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:37:24,576 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:37:25,471 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:37:26,471 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:28,472 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:30,361 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:37:30,412 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:37:30,499 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:37:31,493 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:37:32,181 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:37:32,182 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:37:32,493 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:33,493 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:35,494 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:36,211 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:37:36,263 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:37:36,343 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:37:36,494 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:37:37,495 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:38,495 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:41,496 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:42,007 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:37:42,059 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:37:42,149 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:37:42,496 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:37:43,497 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:44,497 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:46,527 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:37:47,226 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:37:47,226 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:37:47,498 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:47,863 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:37:47,916 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:37:48,000 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:37:48,499 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:48,499 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:37:49,499 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:51,500 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:53,577 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:37:53,628 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:37:53,709 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:37:54,501 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:54,501 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:37:55,501 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:57,502 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:37:59,251 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:37:59,300 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:37:59,381 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:37:59,503 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:38:00,503 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:01,504 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:02,319 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:38:02,320 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:38:04,505 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:04,837 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:38:04,900 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:38:04,990 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:38:05,505 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:38:06,506 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:07,506 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:10,451 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:38:10,505 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:38:10,583 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:10,585 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:38:11,583 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:38:12,584 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:13,584 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:14,585 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:15,957 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:38:16,012 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:38:16,102 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:38:16,585 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:16,585 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:38:17,240 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:38:17,478 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:38:17,479 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:38:17,585 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:20,586 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:21,389 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:38:21,440 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:38:21,519 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:38:21,587 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:38:22,587 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:23,587 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:24,588 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:26,948 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:38:27,001 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:38:27,088 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:38:27,589 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:38:28,589 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:29,590 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:31,590 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:32,512 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:38:32,564 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:38:32,646 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:38:32,744 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:38:32,745 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:38:33,645 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:38:35,646 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:37,647 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:37,941 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:38:37,991 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:38:38,079 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:38:38,647 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:38:39,647 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:41,648 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:43,289 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:38:43,340 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:38:43,425 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:38:43,648 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:38:45,649 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:47,650 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:47,727 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:38:47,900 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:38:47,902 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:38:48,615 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:38:48,667 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:38:48,746 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:38:49,683 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:49,683 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:38:53,684 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:54,004 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:38:54,055 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:38:54,136 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:38:54,685 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:38:55,685 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:57,686 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:38:59,276 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:38:59,327 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:38:59,410 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:38:59,686 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:39:00,687 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:01,687 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:03,021 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:39:03,022 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:39:04,521 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:39:04,575 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:39:04,659 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:39:04,688 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:04,688 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:39:05,688 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:06,689 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:08,690 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:09,743 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:39:09,795 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:39:09,877 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:39:10,690 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:10,690 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:39:11,691 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:14,692 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:14,938 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:39:14,982 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:39:15,068 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:39:15,692 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:15,692 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:39:16,692 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:18,227 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:39:18,256 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:39:18,257 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:39:18,693 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:19,984 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:39:20,035 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:39:20,117 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:39:20,694 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:20,694 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:39:21,694 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:25,051 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:39:25,100 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:39:25,182 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:39:25,695 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:25,696 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:39:27,696 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:29,697 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:30,179 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:39:30,242 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:39:30,325 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:39:30,697 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:39:31,698 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:32,698 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:33,578 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:39:33,579 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:39:35,202 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:39:35,259 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:39:35,351 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:39:35,699 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:35,699 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:39:36,699 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:37,700 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:39,700 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:40,155 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:39:40,204 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:39:40,291 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:39:40,701 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:39:41,701 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:42,701 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:43,702 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:45,003 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:39:45,076 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:39:45,164 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:39:45,703 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:39:46,703 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:47,703 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:48,641 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:39:48,755 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:39:48,756 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:39:49,704 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:49,867 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:39:49,921 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:39:50,007 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:39:50,705 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:39:51,705 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:52,705 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:53,706 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:54,669 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:39:54,721 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:39:54,803 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:39:55,727 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:55,727 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:39:56,727 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:57,728 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:59,253 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:39:59,304 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:39:59,388 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:39:59,729 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:39:59,729 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:00,729 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:03,730 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:03,801 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:40:03,803 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:40:03,869 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:03,922 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:04,005 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:04,731 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:04,731 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:05,731 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:07,732 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:08,353 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:08,424 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:08,508 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:08,732 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:08,732 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:09,732 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:11,733 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:12,678 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:12,729 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:12,810 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:13,808 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:13,809 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:15,809 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:16,802 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:16,854 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:16,935 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:17,855 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:17,855 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:18,860 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:40:18,861 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:40:19,206 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:40:19,856 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:20,645 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:20,698 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:20,778 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:20,856 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:21,856 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:23,857 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:24,300 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:24,351 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:24,432 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:24,858 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:25,858 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:27,807 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:27,863 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:27,945 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:27,948 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:28,945 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:29,946 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:31,085 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:31,138 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:31,265 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:31,946 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:31,947 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:33,947 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:33,973 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:40:33,974 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:40:34,092 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:34,152 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:34,243 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:34,948 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:35,948 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:36,893 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:36,946 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:37,031 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:38,029 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:38,029 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:39,415 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:39,469 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:39,554 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:40,030 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:40,030 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:41,611 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:41,664 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:41,749 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:42,030 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:42,031 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:43,564 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:43,617 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:43,702 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:44,031 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:44,031 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:45,290 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:45,344 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:45,433 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:46,032 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:46,032 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:47,350 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:47,533 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:47,623 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:48,032 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:48,033 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:49,019 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:40:49,020 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:40:49,033 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:49,882 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:40:50,033 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:53,034 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:53,727 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:53,780 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:53,864 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:40:54,035 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:40:55,035 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:56,035 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:59,036 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:40:59,675 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:40:59,729 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:40:59,825 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:41:00,037 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:41:01,037 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:02,038 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:04,117 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:41:04,118 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:41:05,039 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:05,699 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:41:05,744 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:41:05,835 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:41:06,039 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:41:07,039 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:08,040 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:11,041 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:11,533 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:41:11,586 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:41:11,693 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:41:12,041 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:41:13,042 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:14,042 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:17,043 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:17,423 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:41:17,475 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:41:17,556 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:41:18,043 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:41:19,044 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:19,179 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:41:19,180 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:41:20,044 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:20,354 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:41:21,044 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:23,247 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:41:23,301 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:41:23,384 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:41:24,046 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:24,046 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:41:25,046 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:27,047 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:29,029 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:41:29,082 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:41:29,166 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:41:30,083 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:30,083 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:41:32,083 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:34,084 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:34,219 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:41:34,220 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:41:34,849 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:41:34,906 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:41:34,994 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:41:35,084 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:41:36,085 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:37,085 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:40,086 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:40,665 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:41:40,719 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:41:40,804 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:41:41,086 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:41:42,087 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:43,087 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:46,088 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:46,448 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:41:46,499 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:41:46,581 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:41:47,088 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:41:48,089 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:49,089 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:49,440 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:41:49,441 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:41:50,090 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:50,762 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:41:52,016 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:41:52,068 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:41:52,158 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:41:53,157 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:41:54,157 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:55,158 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:56,158 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:57,601 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:41:57,654 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:41:57,735 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:41:58,159 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:41:58,159 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:41:59,159 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:02,160 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:03,263 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:42:03,316 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:42:03,400 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:42:04,161 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:04,161 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:42:04,546 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:42:04,547 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:42:05,161 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:08,836 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:42:08,891 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:42:08,977 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:42:09,162 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:09,163 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:42:11,163 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:13,164 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:14,392 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:42:14,445 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:42:14,529 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:42:15,164 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:15,165 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:42:17,165 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:19,166 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:19,646 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:42:19,647 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:42:19,988 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:42:20,039 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:42:20,121 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:42:20,166 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:42:21,165 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:42:21,166 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:23,167 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:25,321 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:42:25,377 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:42:25,461 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:42:26,168 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:42:27,168 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:29,169 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:30,704 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:42:30,757 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:42:30,843 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:42:31,169 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:42:32,170 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:34,171 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:34,700 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:42:34,702 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:42:35,970 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:42:36,023 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:42:36,110 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:42:36,171 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:36,171 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:42:38,172 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:40,173 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:41,369 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:42:41,430 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:42:41,517 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:42:42,173 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:42:44,174 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:46,175 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:46,628 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:42:46,705 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:42:46,791 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:42:47,175 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:42:48,175 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:49,835 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:42:49,836 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:42:50,176 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:51,692 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:42:51,881 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:42:51,933 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:42:52,016 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:42:52,177 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:42:54,178 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:56,178 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:42:57,058 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:42:57,112 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:42:57,196 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:42:57,197 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:42:58,196 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:02,197 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:02,219 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:43:02,269 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:43:02,354 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:43:03,198 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:43:04,198 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:05,104 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:43:05,106 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:43:06,199 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:07,306 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:43:07,363 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:43:07,477 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:43:08,199 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:08,200 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:43:12,201 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:12,455 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:43:12,511 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:43:12,599 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:43:13,201 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:43:14,201 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:16,202 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:17,451 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:43:17,504 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:43:17,586 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:43:18,203 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:18,203 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:43:20,203 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:20,403 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:43:20,404 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:43:21,204 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:22,266 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:43:22,314 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:43:22,368 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:43:22,452 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:43:23,204 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:43:24,205 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:25,205 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:27,171 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:43:27,226 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:43:27,237 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:27,311 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:43:28,227 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:28,227 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:43:29,227 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:31,228 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:32,044 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:43:32,101 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:43:32,190 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:43:32,228 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:43:33,229 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:35,229 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:35,695 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:43:35,696 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:43:36,867 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:43:36,941 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:43:37,027 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:43:37,230 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:43:39,231 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:41,232 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:41,587 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:43:41,641 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:43:41,723 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:43:42,232 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:43:43,232 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:45,233 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:46,300 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:43:46,353 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:43:46,437 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:43:47,234 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:47,234 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:43:49,234 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:50,742 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:43:50,744 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:43:50,961 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:43:51,029 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:43:51,115 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:43:51,235 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:51,235 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:43:52,753 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:43:53,236 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:54,236 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:55,471 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:43:55,543 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:43:55,668 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:43:56,237 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:56,237 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:43:57,237 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:58,237 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:43:59,946 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:43:59,999 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:00,084 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:00,238 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:00,238 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:01,239 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:02,239 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:04,239 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:04,310 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:04,363 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:04,446 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:05,240 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:05,240 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:05,792 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:44:05,793 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:44:06,240 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:08,241 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:08,445 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:08,499 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:08,584 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:09,241 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:09,242 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:10,242 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:12,242 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:12,460 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:12,522 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:12,611 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:13,243 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:13,243 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:14,243 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:16,244 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:16,366 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:16,419 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:16,506 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:17,245 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:18,245 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:19,991 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:20,044 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:20,127 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:20,246 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:20,246 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:20,840 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:44:20,842 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:44:22,247 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:23,244 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:44:23,327 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:23,382 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:23,465 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:24,247 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:24,248 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:26,248 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:26,438 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:26,494 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:26,581 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:27,248 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:28,249 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:29,324 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:29,377 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:29,460 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:30,249 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:30,250 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:31,816 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:31,872 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:31,954 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:32,250 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:32,250 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:34,052 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:34,106 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:34,189 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:34,251 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:34,251 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:35,889 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:44:35,891 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:44:36,104 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:36,175 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:36,254 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:36,256 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:37,255 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:37,924 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:37,976 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:38,061 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:38,255 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:38,255 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:39,531 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:39,584 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:39,672 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:40,256 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:40,256 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:41,532 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:41,707 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:41,791 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:42,256 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:42,257 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:44,257 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:46,258 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:47,817 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:47,871 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:47,957 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:48,259 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:50,259 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:50,968 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:44:50,970 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:44:52,260 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:53,817 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:53,871 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:53,956 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:44:54,092 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:44:54,261 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:54,261 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:44:56,261 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:59,262 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:44:59,810 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:44:59,862 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:44:59,942 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:45:00,263 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:45:01,263 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:02,264 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:05,265 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:05,626 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:45:05,699 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:45:05,778 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:45:06,147 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:45:06,149 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:45:06,265 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:45:07,265 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:08,266 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:09,266 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:11,370 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:45:11,421 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:45:11,520 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:45:12,267 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:12,268 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:45:13,268 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:15,268 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:17,048 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:45:17,101 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:45:17,186 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:45:17,269 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:45:18,269 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:19,270 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:21,271 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:21,312 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:45:21,314 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:45:22,762 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:45:22,812 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:45:22,893 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:45:23,271 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:23,272 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:45:24,272 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:24,531 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:45:26,272 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:28,273 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:28,495 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:45:28,547 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:45:28,631 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:45:29,274 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:45:30,274 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:32,275 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:34,213 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:45:34,266 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:45:34,348 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:45:35,347 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:45:36,347 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:36,358 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:45:36,360 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:45:37,348 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:38,348 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:39,855 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:45:39,914 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:45:39,995 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:45:40,349 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:45:41,350 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:42,350 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:44,351 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:45,415 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:45:45,473 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:45:45,556 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:45:46,352 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:46,352 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:45:47,352 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:50,353 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:50,910 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:45:50,979 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:45:51,061 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:45:51,353 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:45:51,545 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:45:51,546 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:45:52,354 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:53,354 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:54,355 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:55,163 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:45:56,512 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:45:56,565 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:45:56,651 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:45:57,356 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:45:57,356 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:45:58,356 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:00,357 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:02,098 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:46:02,151 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:46:02,233 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:46:02,358 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:46:03,358 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:06,704 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:46:06,705 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:46:07,359 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:07,599 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:46:07,650 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:46:07,734 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:46:08,360 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:46:09,360 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:11,361 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:13,077 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:46:13,151 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:46:13,233 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:46:13,361 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:46:15,362 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:17,363 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:18,562 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:46:18,616 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:46:18,701 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:46:19,363 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:19,364 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:46:21,838 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:46:21,839 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:46:23,365 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:23,945 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:46:23,998 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:46:24,082 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:46:24,365 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:46:25,365 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:25,626 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:46:26,366 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:27,366 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:29,298 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:46:29,349 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:46:29,434 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:46:30,432 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:46:31,433 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:32,433 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:33,433 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:34,543 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:46:34,596 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:46:34,679 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:46:35,434 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:35,434 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:46:36,435 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:37,291 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:46:37,292 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:46:37,435 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:39,673 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:46:39,725 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:46:39,831 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:46:40,436 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:46:42,437 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:44,437 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:44,872 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:46:44,922 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:46:45,006 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:46:45,438 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:46:46,438 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:50,026 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:46:50,075 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:46:50,157 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:46:50,439 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:50,440 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:46:52,440 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:52,755 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:46:52,757 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:46:54,441 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:54,980 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:46:55,034 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:46:55,116 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:46:55,441 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:46:56,072 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:46:56,441 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:46:59,984 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:47:00,038 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:47:00,120 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:47:00,443 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:00,443 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:47:02,443 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:04,444 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:04,913 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:47:04,964 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:47:05,047 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:47:05,444 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:47:06,445 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:08,010 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:47:08,011 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:47:08,445 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:09,912 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:47:09,965 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:47:10,091 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:47:10,446 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:10,446 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:47:12,447 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:14,447 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:14,747 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:47:14,798 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:47:14,879 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:47:15,448 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:47:16,448 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:18,449 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:19,453 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:47:19,505 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:47:19,585 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:47:20,501 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:20,501 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:47:22,501 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:23,159 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:47:23,160 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:47:24,191 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:47:24,243 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:47:24,326 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:47:24,502 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:47:25,503 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:26,585 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:47:27,503 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:28,938 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:47:28,992 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:47:29,079 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:47:29,504 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:29,504 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:47:31,504 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:33,505 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:33,707 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:47:33,757 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:47:33,839 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:47:34,505 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:47:35,506 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:37,506 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:38,207 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:47:38,209 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:47:38,307 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:47:38,360 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:47:38,443 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:47:38,507 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:47:39,507 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:41,508 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:42,850 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:47:42,902 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:47:42,984 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:47:43,509 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:43,509 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:47:45,509 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:46,510 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:47,298 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:47:47,349 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:47:47,433 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:47:47,510 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:47:48,510 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:49,511 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:50,511 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:51,678 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:47:51,739 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:47:51,820 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:47:52,512 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:52,512 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:47:53,259 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:47:53,261 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:47:53,512 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:54,513 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:55,929 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:47:55,986 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:47:56,070 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:47:56,513 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:56,514 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:47:57,013 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:47:57,514 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:47:58,514 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:00,172 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:00,223 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:00,306 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:00,515 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:00,515 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:01,515 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:02,516 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:04,222 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:04,274 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:04,355 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:04,516 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:04,517 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:05,517 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:06,517 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:08,113 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:08,168 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:08,248 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:08,335 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:48:08,336 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:48:08,518 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:08,518 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:09,518 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:10,519 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:11,794 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:11,846 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:11,930 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:12,519 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:12,520 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:13,520 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:14,520 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:15,260 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:15,314 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:15,398 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:15,520 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:16,521 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:17,521 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:18,492 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:18,542 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:18,568 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:18,650 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:19,543 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:19,543 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:20,543 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:21,507 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:21,560 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:21,641 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:22,560 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:22,561 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:23,554 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:48:23,555 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:48:23,561 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:24,270 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:24,326 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:24,408 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:24,561 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:24,561 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:25,561 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:26,562 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:26,801 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:26,854 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:26,939 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:27,562 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:27,563 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:27,695 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:48:28,563 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:29,147 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:29,198 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:29,279 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:29,563 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:30,564 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:31,160 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:31,212 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:31,295 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:31,564 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:31,564 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:32,564 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:32,902 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:32,951 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:33,038 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:33,565 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:33,565 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:34,565 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:34,937 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:35,114 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:35,200 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:35,566 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:36,566 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:38,755 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:48:38,756 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:48:39,567 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:40,979 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:41,061 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:41,147 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:41,568 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:42,568 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:43,569 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:45,569 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:46,972 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:47,026 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:47,113 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:47,570 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:48,570 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:49,571 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:51,571 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:52,855 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:52,927 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:53,034 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:53,572 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:48:54,132 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:48:54,134 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:48:54,572 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:55,573 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:57,574 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:58,221 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:48:58,786 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:48:58,839 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:48:58,926 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:48:59,574 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:48:59,575 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:49:00,575 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:01,575 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:03,576 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:04,497 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:49:04,572 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:49:04,658 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:49:05,656 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:05,657 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:49:06,657 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:09,238 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:49:09,239 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:49:09,658 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:10,318 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:49:10,373 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:49:10,488 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:49:10,658 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:49:11,658 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:12,659 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:15,660 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:16,001 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:49:16,056 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:49:16,142 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:49:16,660 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:16,661 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:49:17,661 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:19,661 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:21,692 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:49:21,748 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:49:21,833 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:49:22,662 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:22,662 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:49:23,662 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:24,294 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:49:24,296 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:49:25,663 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:27,302 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:49:27,355 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:49:27,439 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:49:27,664 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:49:28,664 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:28,783 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:49:29,665 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:30,665 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:32,666 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:32,926 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:49:32,979 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:49:33,068 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:49:33,666 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:49:34,667 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:35,667 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:38,623 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:49:38,680 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:49:38,695 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:38,788 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:49:39,354 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:49:39,355 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:49:39,685 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:39,686 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:49:40,686 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:42,687 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:44,232 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:49:44,284 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:49:44,369 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:49:44,687 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:49:45,688 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:46,688 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:48,689 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:49,723 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:49:49,808 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:49:49,897 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:49:50,689 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:50,690 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:49:51,690 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:54,595 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:49:54,596 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:49:54,691 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:55,198 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:49:55,276 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:49:55,369 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:49:55,691 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:49:56,692 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:57,692 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:49:59,456 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:50:00,623 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:50:00,697 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:50:00,780 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:00,782 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:50:01,724 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:01,724 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:50:02,724 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:04,725 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:06,056 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:50:06,109 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:50:06,194 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:50:06,726 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:06,726 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:50:07,726 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:09,729 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:50:09,730 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:50:10,727 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:11,340 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:50:11,394 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:50:11,482 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:50:11,727 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:50:12,728 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:13,728 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:14,728 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:16,750 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:50:16,799 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:50:16,888 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:50:17,729 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:17,730 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:50:18,730 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:20,731 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:22,097 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:50:22,175 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:50:22,317 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:50:22,732 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:22,732 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:50:24,732 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:25,053 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:50:25,054 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:50:27,346 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:50:27,402 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:50:27,510 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:50:27,734 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:27,734 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:50:28,734 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:29,734 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:29,984 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:50:31,735 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:32,455 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:50:32,509 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:50:32,594 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:50:32,735 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:50:33,736 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:34,736 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:35,737 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:37,487 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:50:37,540 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:50:37,627 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:50:37,737 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:37,737 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:50:38,737 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:39,738 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:40,372 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:50:40,373 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:50:41,739 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:42,523 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:50:42,598 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:50:42,682 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:50:42,739 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:50:43,739 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:44,740 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:45,740 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:47,503 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:50:47,557 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:50:47,648 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:50:47,741 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:50:48,741 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:49,742 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:51,743 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:52,545 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:50:52,590 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:50:52,678 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:50:52,743 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:50:53,743 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:54,744 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:55,652 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:50:55,653 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:50:55,744 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:57,512 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:50:57,575 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:50:57,664 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:50:57,745 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:50:58,745 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:50:59,745 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:00,524 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:51:01,746 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:02,550 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:51:02,606 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:51:02,692 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:51:02,746 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:51:04,747 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:06,748 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:07,410 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:51:07,464 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:51:07,550 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:51:07,748 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:51:08,749 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:10,817 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:51:10,818 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:51:12,205 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:51:12,261 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:51:12,348 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:51:12,750 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:12,750 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:51:14,751 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:16,752 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:17,021 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:51:17,075 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:51:17,162 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:51:17,752 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:51:18,752 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:19,753 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:20,753 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:21,705 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:51:21,760 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:51:21,851 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:51:22,766 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:22,767 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:51:23,767 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:24,767 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:25,892 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:51:25,893 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:51:26,494 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:51:26,548 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:51:26,633 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:51:26,768 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:51:28,769 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:29,769 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:30,769 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:31,105 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:51:31,158 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:51:31,243 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:51:31,292 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:51:31,770 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:51:32,770 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:33,770 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:34,771 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:35,632 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:51:35,686 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:51:35,775 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:51:36,774 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:36,774 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:51:37,774 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:38,775 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:40,419 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:51:40,476 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:51:40,564 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:51:40,775 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:40,776 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:51:41,072 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:51:41,073 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:51:41,776 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:43,776 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:44,532 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:51:44,615 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:51:44,701 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:51:44,777 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:51:45,777 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:47,778 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:48,786 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:51:48,840 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:51:48,928 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:51:49,779 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:49,779 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:51:53,055 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:51:53,111 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:51:53,203 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:51:53,780 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:53,780 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:51:55,780 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:56,142 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:51:56,144 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:51:57,136 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:51:57,190 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:51:57,278 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:51:57,781 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:51:57,782 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:51:59,782 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:00,991 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:01,047 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:01,133 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:01,783 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:01,883 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:52:02,783 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:04,712 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:04,768 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:04,855 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:04,857 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:05,856 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:06,856 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:08,108 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:08,156 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:08,269 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:08,857 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:08,857 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:10,857 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:11,195 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:11,254 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:11,339 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:52:11,343 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:11,344 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:52:11,858 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:12,858 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:13,974 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:14,027 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:14,116 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:14,859 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:14,859 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:16,534 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:16,589 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:16,675 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:16,860 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:16,860 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:18,855 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:18,908 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:18,908 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:18,994 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:19,903 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:20,875 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:20,926 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:20,937 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:21,011 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:21,927 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:22,614 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:22,667 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:22,770 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:22,928 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:22,928 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:24,218 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:24,271 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:24,357 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:24,928 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:24,929 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:26,235 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:26,405 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:26,487 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:26,540 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:52:26,541 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:52:26,929 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:26,929 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:28,930 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:30,930 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:32,434 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:32,488 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:32,576 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:32,621 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:52:32,931 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:32,931 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:34,932 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:36,932 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:38,442 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:38,502 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:38,588 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:38,933 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:40,934 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:41,858 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:52:41,860 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:52:42,934 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:44,308 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:44,360 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:44,445 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:44,935 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:44,936 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:46,936 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:48,937 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:50,213 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:50,266 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:50,351 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:50,937 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:50,938 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:52,938 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:55,939 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:56,013 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:52:56,068 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:52:56,178 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:52:56,940 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:52:56,940 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:52:57,197 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:52:57,199 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:52:57,940 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:01,845 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:53:01,913 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:53:01,998 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:02,000 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:53:02,999 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:02,999 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:53:03,152 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:53:03,999 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:06,000 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:07,526 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:53:07,581 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:53:07,667 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:53:08,000 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:53:09,001 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:10,001 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:12,002 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:12,275 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:53:12,276 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:53:13,199 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:53:13,253 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:53:13,339 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:53:14,003 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:14,003 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:53:15,003 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:18,004 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:18,914 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:53:18,960 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:53:19,045 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:53:20,044 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:20,044 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:53:21,044 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:24,045 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:24,593 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:53:24,648 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:53:24,779 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:53:25,046 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:53:26,046 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:27,047 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:27,328 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:53:27,330 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:53:28,047 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:30,144 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:53:30,198 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:53:30,283 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:53:31,048 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:31,048 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:53:32,048 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:33,754 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:53:34,049 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:35,605 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:53:35,677 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:53:35,762 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:53:36,050 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:53:37,050 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:38,050 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:41,051 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:41,141 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:53:41,194 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:53:41,306 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:53:42,052 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:53:42,526 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:53:42,528 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:53:43,052 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:44,052 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:45,053 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:46,727 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:53:46,810 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:53:46,899 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:53:47,053 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:53:48,054 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:49,054 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:51,055 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:52,162 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:53:52,216 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:53:52,317 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:53:53,056 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:53,056 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:53:54,056 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:57,057 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:53:57,585 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:53:57,638 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:53:57,722 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:53:57,815 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:53:57,816 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:53:58,058 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:53:59,058 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:00,058 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:01,059 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:02,984 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:54:03,050 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:54:03,133 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:54:04,132 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:04,132 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:54:04,272 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:54:05,132 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:07,133 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:08,243 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:54:08,296 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:54:08,384 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:54:09,134 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:09,134 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:54:10,134 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:12,977 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:54:12,979 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:54:13,135 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:13,657 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:54:13,710 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:54:13,793 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:54:14,136 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:54:15,137 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:16,137 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:18,138 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:18,963 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:54:19,021 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:54:19,103 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:54:19,138 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:54:20,139 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:24,140 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:24,153 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:54:24,207 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:54:24,289 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:54:25,141 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:54:26,141 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:28,021 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:54:28,022 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:54:28,142 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:29,242 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:54:29,293 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:54:29,386 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:54:30,143 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:30,143 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:54:31,143 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:34,144 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:34,398 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:54:34,453 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:54:34,538 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:54:34,890 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:54:35,145 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:54:36,145 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:37,145 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:38,146 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:39,565 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:54:39,619 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:54:39,705 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:54:40,147 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:54:41,147 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:43,079 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:54:43,081 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:54:44,543 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:54:44,598 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:54:44,682 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:54:45,148 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:45,149 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:54:47,149 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:49,150 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:49,637 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:54:49,689 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:54:49,776 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:54:50,150 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:54:51,151 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:54,597 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:54:54,651 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:54:54,739 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:54:55,152 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:55,152 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:54:57,153 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:58,176 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:54:58,178 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:54:59,153 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:54:59,576 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:54:59,630 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:54:59,714 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:00,154 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:01,154 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:03,155 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:04,581 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:55:04,643 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:55:04,755 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:05,155 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:05,156 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:05,317 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:55:07,156 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:09,157 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:09,529 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:55:09,580 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:55:09,666 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:10,157 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:11,158 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:13,158 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:13,255 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:55:13,256 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:55:14,319 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:55:14,371 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:55:14,452 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:15,159 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:15,160 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:17,160 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:19,083 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:55:19,137 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:55:19,222 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:20,221 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:21,221 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:23,222 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:23,759 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:55:23,812 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:55:23,897 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:24,222 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:25,222 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:27,223 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:28,301 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:55:28,303 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:55:28,363 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:55:28,405 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:55:28,489 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:29,224 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:29,224 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:32,225 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:32,921 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:55:32,974 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:55:33,057 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:33,225 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:34,226 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:35,840 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:55:36,226 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:37,411 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:55:37,461 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:55:37,544 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:38,227 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:38,227 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:40,228 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:41,741 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:55:41,795 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:55:41,904 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:42,229 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:42,229 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:43,412 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:55:43,414 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:55:44,229 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:45,958 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:55:46,011 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:55:46,096 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:46,230 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:46,230 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:48,231 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:49,231 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:49,989 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:55:50,043 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:55:50,126 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:50,231 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:51,232 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:52,232 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:53,232 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:53,873 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:55:53,926 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:55:54,012 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:54,233 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:55,233 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:56,233 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:57,234 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:55:57,488 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:55:57,546 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:55:57,633 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:55:58,234 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:55:58,459 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:55:58,461 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:55:59,235 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:00,235 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:00,872 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:00,926 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:01,010 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:01,235 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:01,236 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:02,236 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:03,236 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:04,011 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:04,087 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:04,169 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:04,236 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:05,237 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:06,237 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:06,428 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:56:06,903 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:06,955 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:07,035 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:07,238 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:07,238 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:08,238 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:09,238 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:09,573 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:09,625 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:09,707 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:10,239 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:11,239 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:12,000 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:12,055 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:12,145 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:12,239 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:12,240 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:13,240 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:13,504 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:56:13,505 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:56:14,097 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:14,152 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:14,234 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:14,240 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:14,240 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:15,240 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:15,978 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:16,030 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:16,115 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:16,241 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:16,241 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:17,241 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:17,556 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:17,608 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:17,712 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:18,242 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:18,242 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:19,242 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:19,559 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:19,738 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:19,820 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:20,242 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:20,242 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:21,242 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:22,243 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:23,243 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:25,573 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:25,626 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:25,712 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:26,244 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:27,245 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:28,245 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:28,771 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:56:28,772 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:56:29,245 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:31,506 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:31,559 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:31,688 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:32,246 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:32,247 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:34,247 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:36,248 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:36,986 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:56:37,382 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:37,435 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:37,522 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:38,249 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:38,249 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:40,249 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:42,250 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:43,208 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:43,262 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:43,352 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:44,268 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:44,269 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:44,299 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:56:44,300 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:56:48,270 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:48,978 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:49,032 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:49,121 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:49,270 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:50,271 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:51,271 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:54,272 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:54,766 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:56:54,818 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:56:54,917 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:56:55,273 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:56:56,273 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:57,274 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:56:59,407 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:56:59,408 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:57:00,275 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:00,521 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:57:00,578 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:57:00,663 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:57:01,275 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:57:02,275 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:03,276 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:04,276 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:06,250 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:57:06,327 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:57:06,410 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:57:07,323 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:07,324 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:57:07,449 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:57:08,324 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:10,324 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:11,989 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:57:12,044 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:57:12,133 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:57:12,325 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:57:13,325 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:14,455 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:57:14,456 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:57:17,327 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:17,690 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:57:17,759 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:57:17,852 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:57:18,327 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:57:19,328 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:23,269 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:57:23,333 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:57:23,354 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:23,444 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:57:24,344 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:57:25,344 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:27,345 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:28,814 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:57:28,871 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:57:28,958 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:57:29,346 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:57:29,518 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:57:29,519 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:57:31,347 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:33,347 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:34,212 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:57:34,267 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:57:34,352 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:57:34,358 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:57:35,353 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:37,353 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:38,016 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:57:39,354 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:39,615 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:57:39,669 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:57:39,756 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:57:40,354 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:57:41,355 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:42,355 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:43,355 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:44,761 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:57:44,763 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:57:45,050 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:57:45,103 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:57:45,190 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:57:45,356 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:57:46,356 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:47,357 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:49,357 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:50,469 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:57:50,515 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:57:50,602 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:57:51,358 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:51,359 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:57:52,359 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:55,359 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:56,050 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:57:56,105 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:57:56,192 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:57:56,361 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:57:57,362 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:58,362 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:59,363 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:57:59,844 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:57:59,845 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:58:01,358 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:58:01,403 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:58:01,490 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:58:02,403 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:02,403 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:58:04,404 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:06,405 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:06,576 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:58:06,630 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:58:06,719 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:58:07,405 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:58:08,405 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:08,604 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:58:10,406 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:11,876 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:58:11,930 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:58:12,014 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:58:12,407 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:12,407 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:58:14,407 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:14,995 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:58:14,996 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:58:16,408 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:17,098 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:58:17,177 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:58:17,264 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:58:17,409 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:58:18,409 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:20,410 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:22,283 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:58:22,336 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:58:22,429 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:58:23,427 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:58:24,427 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:26,428 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:27,427 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:58:27,482 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:58:27,568 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:58:28,482 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:28,483 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:58:30,293 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:58:30,295 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:58:30,483 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:32,497 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:58:32,552 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:58:32,640 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:58:33,484 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:33,484 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:58:35,485 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:37,462 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:58:37,517 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:58:37,517 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:37,604 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:58:38,518 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:58:39,065 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:58:39,518 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:41,519 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:42,446 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:58:42,500 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:58:42,585 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:58:43,584 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:58:45,574 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:58:45,575 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:58:45,585 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:47,382 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:58:47,469 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:58:47,555 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:58:47,585 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:47,585 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:58:49,586 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:51,587 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:52,267 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:58:52,320 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:58:52,408 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:58:52,587 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:58:53,587 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:55,588 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:57,023 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:58:57,079 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:58:57,167 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:58:57,589 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:58:57,589 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:00,725 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:59:00,726 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:59:01,590 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:01,856 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:01,913 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:01,999 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:02,590 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:03,591 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:05,591 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:06,650 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:06,705 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:06,790 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:07,592 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:07,593 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:08,593 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:09,551 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:59:10,593 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:11,339 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:11,393 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:11,481 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:11,594 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:12,594 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:15,771 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:59:15,772 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:59:15,967 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:16,022 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:16,107 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:16,595 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:16,596 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:18,596 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:20,513 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:20,563 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:20,651 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:20,653 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:21,651 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:22,652 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:24,652 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:24,961 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:25,013 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:25,102 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:25,652 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:26,653 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:28,654 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:29,255 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:29,308 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:29,395 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:29,654 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:30,654 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:30,815 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:59:30,817 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:59:32,655 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:33,559 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:33,611 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:33,693 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:34,692 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:34,694 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:36,693 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:37,714 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:37,767 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:37,855 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:38,694 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:38,694 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:40,116 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 22:59:40,695 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:41,720 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:41,774 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:41,859 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:42,695 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:42,696 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:44,696 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:45,522 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:45,575 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:45,662 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:45,696 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:45,942 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 22:59:45,944 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 22:59:46,697 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:48,697 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:49,108 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:49,163 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:49,250 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:49,697 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:50,698 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:52,518 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:52,594 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:52,704 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:52,706 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:53,705 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:54,705 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:55,694 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:55,745 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:55,745 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:55,830 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:56,746 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:56,746 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:57,746 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 22:59:58,574 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 22:59:58,630 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 22:59:58,716 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 22:59:58,746 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 22:59:59,747 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:01,051 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:00:01,052 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:00:01,213 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:01,269 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:01,358 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:01,747 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:01,748 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:03,610 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:03,687 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:03,842 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:03,843 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:04,842 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:05,725 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:05,799 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:05,884 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:05,886 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:06,885 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:07,589 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:07,644 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:07,732 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:07,885 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:07,885 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:09,180 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:09,235 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:09,321 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:09,886 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:09,886 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:10,659 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:00:11,264 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:11,456 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:11,539 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:11,887 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:11,887 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:13,887 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:15,888 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:16,101 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:00:16,101 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:00:17,259 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:17,313 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:17,399 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:17,889 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:19,890 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:21,890 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:23,144 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:23,195 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:23,278 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:23,891 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:23,891 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:25,892 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:27,892 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:28,921 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:28,973 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:29,058 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:29,893 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:29,893 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:31,149 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:00:31,150 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:00:33,894 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:34,698 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:34,753 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:34,838 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:34,895 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:35,895 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:37,896 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:40,431 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:40,487 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:40,573 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:40,897 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:41,059 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:00:41,085 WARNING FileStreamThread:234672 [file_stream.py:request_with_retry():594] requests_with_retry encountered retryable exception: ('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer')). func: >, args: ('https://api.wandb.ai/files/sanchit-gandhi/huggingface/2ay2wvge/file_stream',), kwargs: {'json': {'files': {'wandb-summary.json': {'offset': 0, 'content': ['{"train/loss": 4.1062, "train/learning_rate": 7.0200000000000006e-06, "train/epoch": 0.35, "train/global_step": 355, "_runtime": 1677, "_timestamp": 1646089240, "_step": 354, "gradients/decoder.transformer.ln_f.weight": {"_type": "histogram", "values": [10.0, 39.0, 264.0, 483.0, 192.0, 27.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-15.277154922485352, -10.82435417175293, -6.371552467346191, -1.9187507629394531, 2.5340499877929688, 6.986850738525391, 11.439653396606445, 15.892454147338867, 20.34525489807129, 24.79805564880371, 29.250858306884766, 33.70365905761719, 38.15645980834961, 42.60926055908203, 47.06206512451172, 51.514862060546875, 55.96766662597656, 60.420467376708984, 64.8732681274414, 69.3260726928711, 73.77886962890625, 78.23167419433594, 82.68447875976562, 87.13727569580078, 91.59007263183594, 96.04287719726562, 100.49567413330078, 104.94847869873047, 109.40127563476562, 113.85408020019531, 118.306884765625, 122.75968170166016, 127.21247863769531, 131.665283203125, 136.1180877685547, 140.5708770751953, 145.023681640625, 149.4764862060547, 153.92929077148438, 158.382080078125, 162.8348846435547, 167.28768920898438, 171.74049377441406, 176.1932830810547, 180.64608764648438, 185.09889221191406, 189.55169677734375, 194.00448608398438, 198.45730590820312, 202.9101104736328, 207.3629150390625, 211.81570434570312, 216.2685089111328, 220.7213134765625, 225.1741180419922, 229.62692260742188, 234.0797119140625, 238.5325164794922, 242.98532104492188, 247.4381103515625, 251.8909149169922, 256.3437194824219, 260.7965087890625, 265.24932861328125, 269.7021179199219]}, "gradients/decoder.transformer.ln_f.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 0.0, 1.0, 0.0, 5.0, 3.0, 2.0, 1.0, 4.0, 7.0, 7.0, 7.0, 7.0, 13.0, 10.0, 14.0, 20.0, 12.0, 21.0, 25.0, 30.0, 21.0, 37.0, 38.0, 37.0, 29.0, 35.0, 39.0, 57.0, 40.0, 45.0, 35.0, 48.0, 36.0, 32.0, 27.0, 24.0, 27.0, 25.0, 23.0, 30.0, 18.0, 19.0, 17.0, 13.0, 19.0, 9.0, 9.0, 6.0, 9.0, 7.0, 3.0, 3.0, 3.0, 3.0, 1.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0], "bins": [-44.952396392822266, -43.53903579711914, -42.125675201416016, -40.71231460571289, -39.298954010009766, -37.885597229003906, -36.47223663330078, -35.058876037597656, -33.64551544189453, -32.232154846191406, -30.81879425048828, -29.405433654785156, -27.992074966430664, -26.57871437072754, -25.165353775024414, -23.751995086669922, -22.338632583618164, -20.92527198791504, -19.511911392211914, -18.098552703857422, -16.685192108154297, -15.271831512451172, -13.858470916748047, -12.445111274719238, -11.031750679016113, -9.618390083312988, -8.20503044128418, -6.791669845581055, -5.378309726715088, -3.964949607849121, -2.551589012145996, -1.1382293701171875, 0.2751312255859375, 1.6884914636611938, 3.10185170173645, 4.515212059020996, 5.928572177886963, 7.34193229675293, 8.755292892456055, 10.168652534484863, 11.582013130187988, 12.995373725891113, 14.408733367919922, 15.822093963623047, 17.235454559326172, 18.648815155029297, 20.062175750732422, 21.475534439086914, 22.88889503479004, 24.302255630493164, 25.71561622619629, 27.12897491455078, 28.542335510253906, 29.95569610595703, 31.369056701660156, 32.78241729736328, 34.195777893066406, 35.60913848876953, 37.022499084472656, 38.43585968017578, 39.849220275878906, 41.26258087158203, 42.675941467285156, 44.089298248291016, 45.50265884399414]}, "gradients/decoder.transformer.h.23.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 2.0, 5.0, 3.0, 3.0, 2.0, 8.0, 8.0, 6.0, 10.0, 13.0, 4.0, 12.0, 18.0, 22.0, 19.0, 19.0, 23.0, 31.0, 27.0, 28.0, 19.0, 42.0, 33.0, 42.0, 38.0, 47.0, 59.0, 50.0, 41.0, 34.0, 34.0, 32.0, 32.0, 27.0, 30.0, 23.0, 23.0, 18.0, 20.0, 16.0, 13.0, 11.0, 12.0, 13.0, 6.0, 9.0, 6.0, 6.0, 5.0, 4.0, 3.0, 3.0, 3.0, 2.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.9609375, -3.8385009765625, -3.716064453125, -3.5936279296875, -3.47119140625, -3.3487548828125, -3.226318359375, -3.1038818359375, -2.9814453125, -2.8590087890625, -2.736572265625, -2.6141357421875, -2.49169921875, -2.3692626953125, -2.246826171875, -2.1243896484375, -2.001953125, -1.8795166015625, -1.757080078125, -1.6346435546875, -1.51220703125, -1.3897705078125, -1.267333984375, -1.1448974609375, -1.0224609375, -0.9000244140625, -0.777587890625, -0.6551513671875, -0.53271484375, -0.4102783203125, -0.287841796875, -0.1654052734375, -0.04296875, 0.0794677734375, 0.201904296875, 0.3243408203125, 0.44677734375, 0.5692138671875, 0.691650390625, 0.8140869140625, 0.9365234375, 1.0589599609375, 1.181396484375, 1.3038330078125, 1.42626953125, 1.5487060546875, 1.671142578125, 1.7935791015625, 1.916015625, 2.0384521484375, 2.160888671875, 2.2833251953125, 2.40576171875, 2.5281982421875, 2.650634765625, 2.7730712890625, 2.8955078125, 3.0179443359375, 3.140380859375, 3.2628173828125, 3.38525390625, 3.5076904296875, 3.630126953125, 3.7525634765625, 3.875]}, "gradients/decoder.transformer.h.23.mlp.c_proj.weight": {"_type": "histogram", "values": [4.0, 3.0, 4.0, 3.0, 6.0, 7.0, 21.0, 13.0, 11.0, 13.0, 21.0, 20.0, 36.0, 42.0, 55.0, 61.0, 91.0, 96.0, 136.0, 183.0, 262.0, 325.0, 384.0, 550.0, 830.0, 1313.0, 2078.0, 3574.0, 7154.0, 18124.0, 77818.0, 682302.0, 2368741.0, 881023.0, 108809.0, 21833.0, 8051.0, 3742.0, 2140.0, 1336.0, 819.0, 557.0, 439.0, 284.0, 250.0, 159.0, 111.0, 112.0, 59.0, 58.0, 50.0, 42.0, 32.0, 34.0, 31.0, 10.0, 12.0, 9.0, 7.0, 2.0, 4.0, 1.0, 4.0, 3.0], "bins": [-10.2890625, -9.97314453125, -9.6572265625, -9.34130859375, -9.025390625, -8.70947265625, -8.3935546875, -8.07763671875, -7.76171875, -7.44580078125, -7.1298828125, -6.81396484375, -6.498046875, -6.18212890625, -5.8662109375, -5.55029296875, -5.234375, -4.91845703125, -4.6025390625, -4.28662109375, -3.970703125, -3.65478515625, -3.3388671875, -3.02294921875, -2.70703125, -2.39111328125, -2.0751953125, -1.75927734375, -1.443359375, -1.12744140625, -0.8115234375, -0.49560546875, -0.1796875, 0.13623046875, 0.4521484375, 0.76806640625, 1.083984375, 1.39990234375, 1.7158203125, 2.03173828125, 2.34765625, 2.66357421875, 2.9794921875, 3.29541015625, 3.611328125, 3.92724609375, 4.2431640625, 4.55908203125, 4.875, 5.19091796875, 5.5068359375, 5.82275390625, 6.138671875, 6.45458984375, 6.7705078125, 7.08642578125, 7.40234375, 7.71826171875, 8.0341796875, 8.35009765625, 8.666015625, 8.98193359375, 9.2978515625, 9.61376953125, 9.9296875]}, "gradients/decoder.transformer.h.23.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 3.0, 2.0, 5.0, 0.0, 6.0, 3.0, 11.0, 10.0, 10.0, 11.0, 10.0, 18.0, 31.0, 21.0, 52.0, 76.0, 107.0, 174.0, 253.0, 376.0, 517.0, 614.0, 504.0, 370.0, 249.0, 178.0, 128.0, 96.0, 66.0, 47.0, 29.0, 23.0, 20.0, 18.0, 5.0, 7.0, 11.0, 9.0, 5.0, 3.0, 1.0, 1.0, 2.0, 2.0, 2.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-16.296875, -15.810791015625, -15.32470703125, -14.838623046875, -14.3525390625, -13.866455078125, -13.38037109375, -12.894287109375, -12.408203125, -11.922119140625, -11.43603515625, -10.949951171875, -10.4638671875, -9.977783203125, -9.49169921875, -9.005615234375, -8.51953125, -8.033447265625, -7.54736328125, -7.061279296875, -6.5751953125, -6.089111328125, -5.60302734375, -5.116943359375, -4.630859375, -4.144775390625, -3.65869140625, -3.172607421875, -2.6865234375, -2.200439453125, -1.71435546875, -1.228271484375, -0.7421875, -0.256103515625, 0.22998046875, 0.716064453125, 1.2021484375, 1.688232421875, 2.17431640625, 2.660400390625, 3.146484375, 3.632568359375, 4.11865234375, 4.604736328125, 5.0908203125, 5.576904296875, 6.06298828125, 6.549072265625, 7.03515625, 7.521240234375, 8.00732421875, 8.493408203125, 8.9794921875, 9.465576171875, 9.95166015625, 10.437744140625, 10.923828125, 11.409912109375, 11.89599609375, 12.382080078125, 12.8681640625, 13.354248046875, 13.84033203125, 14.326416015625, 14.8125]}, "gradients/decoder.transformer.h.23.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 2.0, 3.0, 2.0, 1.0, 10.0, 5.0, 11.0, 9.0, 14.0, 15.0, 27.0, 25.0, 44.0, 59.0, 76.0, 123.0, 164.0, 266.0, 480.0, 860.0, 2306.0, 25238.0, 4034410.0, 124196.0, 3194.0, 1177.0, 622.0, 322.0, 188.0, 118.0, 82.0, 65.0, 60.0, 19.0, 21.0, 16.0, 16.0, 13.0, 10.0, 8.0, 4.0, 7.0, 3.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-58.46875, -56.62109375, -54.7734375, -52.92578125, -51.078125, -49.23046875, -47.3828125, -45.53515625, -43.6875, -41.83984375, -39.9921875, -38.14453125, -36.296875, -34.44921875, -32.6015625, -30.75390625, -28.90625, -27.05859375, -25.2109375, -23.36328125, -21.515625, -19.66796875, -17.8203125, -15.97265625, -14.125, -12.27734375, -10.4296875, -8.58203125, -6.734375, -4.88671875, -3.0390625, -1.19140625, 0.65625, 2.50390625, 4.3515625, 6.19921875, 8.046875, 9.89453125, 11.7421875, 13.58984375, 15.4375, 17.28515625, 19.1328125, 20.98046875, 22.828125, 24.67578125, 26.5234375, 28.37109375, 30.21875, 32.06640625, 33.9140625, 35.76171875, 37.609375, 39.45703125, 41.3046875, 43.15234375, 45.0, 46.84765625, 48.6953125, 50.54296875, 52.390625, 54.23828125, 56.0859375, 57.93359375, 59.78125]}, "gradients/decoder.transformer.h.23.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 5.0, 109.0, 653.0, 234.0, 13.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-303.5772399902344, -297.3272399902344, -291.0772705078125, -284.8272705078125, -278.5772705078125, -272.3272705078125, -266.0773010253906, -259.8273010253906, -253.57730102539062, -247.3273162841797, -241.0773162841797, -234.82733154296875, -228.57733154296875, -222.3273468017578, -216.07736206054688, -209.82736206054688, -203.57737731933594, -197.327392578125, -191.077392578125, -184.82740783691406, -178.57740783691406, -172.32742309570312, -166.07742309570312, -159.8274383544922, -153.57745361328125, -147.3274688720703, -141.0774688720703, -134.82748413085938, -128.57748413085938, -122.32749938964844, -116.07750701904297, -109.8275146484375, -103.57750701904297, -97.3275146484375, -91.07752227783203, -84.82752990722656, -78.57754516601562, -72.32754516601562, -66.07756042480469, -59.82756805419922, -53.57757568359375, -47.32758331298828, -41.07759094238281, -34.82760238647461, -28.57761001586914, -22.327617645263672, -16.07762908935547, -9.82763671875, -3.5776443481445312, 2.672347068786621, 8.922338485717773, 15.17232894897461, 21.422321319580078, 27.672313690185547, 33.92230224609375, 40.17229461669922, 46.42228698730469, 52.672279357910156, 58.922271728515625, 65.17225646972656, 71.42225646972656, 77.6722412109375, 83.92223358154297, 90.17222595214844, 96.4222183227539]}, "gradients/decoder.transformer.h.23.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 3.0, 1.0, 1.0, 5.0, 10.0, 6.0, 6.0, 7.0, 7.0, 11.0, 15.0, 11.0, 17.0, 23.0, 30.0, 19.0, 23.0, 22.0, 30.0, 34.0, 31.0, 41.0, 32.0, 31.0, 39.0, 32.0, 33.0, 55.0, 41.0, 44.0, 38.0, 38.0, 45.0, 33.0, 27.0, 22.0, 24.0, 16.0, 22.0, 17.0, 8.0, 11.0, 13.0, 9.0, 7.0, 4.0, 6.0, 4.0, 3.0, 2.0, 4.0, 1.0, 2.0, 2.0, 1.0], "bins": [-53.098175048828125, -51.56222915649414, -50.02627944946289, -48.490333557128906, -46.954383850097656, -45.41843795776367, -43.88248825073242, -42.34654235839844, -40.81059265136719, -39.2746467590332, -37.73869705200195, -36.20275115966797, -34.66680145263672, -33.130855560302734, -31.594905853271484, -30.0589599609375, -28.523012161254883, -26.987064361572266, -25.45111656188965, -23.91516876220703, -22.379220962524414, -20.843273162841797, -19.307327270507812, -17.771377563476562, -16.235431671142578, -14.699483871459961, -13.163536071777344, -11.627588272094727, -10.09164047241211, -8.555692672729492, -7.019745826721191, -5.483798027038574, -3.9478492736816406, -2.4119014739990234, -0.8759539127349854, 0.6599936485290527, 2.19594144821167, 3.731889247894287, 5.267836570739746, 6.803784370422363, 8.33973217010498, 9.875679969787598, 11.411627769470215, 12.947574615478516, 14.483522415161133, 16.01947021484375, 17.555418014526367, 19.091365814208984, 20.6273136138916, 22.16326141357422, 23.699209213256836, 25.235157012939453, 26.77110481262207, 28.307052612304688, 29.842998504638672, 31.378948211669922, 32.914894104003906, 34.45083999633789, 35.98678970336914, 37.522735595703125, 39.058685302734375, 40.59463119506836, 42.13058090209961, 43.666526794433594, 45.202476501464844]}, "gradients/decoder.transformer.h.23.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 5.0, 2.0, 6.0, 3.0, 11.0, 3.0, 2.0, 8.0, 10.0, 12.0, 7.0, 23.0, 21.0, 18.0, 20.0, 33.0, 25.0, 32.0, 37.0, 34.0, 44.0, 38.0, 43.0, 43.0, 43.0, 48.0, 43.0, 50.0, 34.0, 35.0, 34.0, 25.0, 23.0, 24.0, 26.0, 26.0, 16.0, 18.0, 19.0, 12.0, 15.0, 5.0, 7.0, 7.0, 10.0, 4.0, 6.0, 2.0, 3.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-4.42578125, -4.2813720703125, -4.136962890625, -3.9925537109375, -3.84814453125, -3.7037353515625, -3.559326171875, -3.4149169921875, -3.2705078125, -3.1260986328125, -2.981689453125, -2.8372802734375, -2.69287109375, -2.5484619140625, -2.404052734375, -2.2596435546875, -2.115234375, -1.9708251953125, -1.826416015625, -1.6820068359375, -1.53759765625, -1.3931884765625, -1.248779296875, -1.1043701171875, -0.9599609375, -0.8155517578125, -0.671142578125, -0.5267333984375, -0.38232421875, -0.2379150390625, -0.093505859375, 0.0509033203125, 0.1953125, 0.3397216796875, 0.484130859375, 0.6285400390625, 0.77294921875, 0.9173583984375, 1.061767578125, 1.2061767578125, 1.3505859375, 1.4949951171875, 1.639404296875, 1.7838134765625, 1.92822265625, 2.0726318359375, 2.217041015625, 2.3614501953125, 2.505859375, 2.6502685546875, 2.794677734375, 2.9390869140625, 3.08349609375, 3.2279052734375, 3.372314453125, 3.5167236328125, 3.6611328125, 3.8055419921875, 3.949951171875, 4.0943603515625, 4.23876953125, 4.3831787109375, 4.527587890625, 4.6719970703125, 4.81640625]}, "gradients/decoder.transformer.h.23.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 6.0, 5.0, 12.0, 14.0, 19.0, 31.0, 42.0, 60.0, 96.0, 153.0, 221.0, 290.0, 466.0, 626.0, 953.0, 1405.0, 2074.0, 3165.0, 4830.0, 7410.0, 11393.0, 17880.0, 28417.0, 47238.0, 84857.0, 181850.0, 344518.0, 134386.0, 68072.0, 39717.0, 24475.0, 15291.0, 9693.0, 6383.0, 4112.0, 2779.0, 1826.0, 1147.0, 874.0, 594.0, 340.0, 269.0, 185.0, 137.0, 83.0, 56.0, 39.0, 18.0, 22.0, 17.0, 13.0, 3.0, 2.0, 2.0, 1.0, 2.0, 2.0], "bins": [-1.2509765625, -1.21343994140625, -1.1759033203125, -1.13836669921875, -1.100830078125, -1.06329345703125, -1.0257568359375, -0.98822021484375, -0.95068359375, -0.91314697265625, -0.8756103515625, -0.83807373046875, -0.800537109375, -0.76300048828125, -0.7254638671875, -0.68792724609375, -0.650390625, -0.61285400390625, -0.5753173828125, -0.53778076171875, -0.500244140625, -0.46270751953125, -0.4251708984375, -0.38763427734375, -0.35009765625, -0.31256103515625, -0.2750244140625, -0.23748779296875, -0.199951171875, -0.16241455078125, -0.1248779296875, -0.08734130859375, -0.0498046875, -0.01226806640625, 0.0252685546875, 0.06280517578125, 0.100341796875, 0.13787841796875, 0.1754150390625, 0.21295166015625, 0.25048828125, 0.28802490234375, 0.3255615234375, 0.36309814453125, 0.400634765625, 0.43817138671875, 0.4757080078125, 0.51324462890625, 0.55078125, 0.58831787109375, 0.6258544921875, 0.66339111328125, 0.700927734375, 0.73846435546875, 0.7760009765625, 0.81353759765625, 0.85107421875, 0.88861083984375, 0.9261474609375, 0.96368408203125, 1.001220703125, 1.03875732421875, 1.0762939453125, 1.11383056640625, 1.1513671875]}, "gradients/decoder.transformer.h.23.crossattention.c_attn.bias": {"_type": "histogram", "values": [3.0, 2.0, 3.0, 1.0, 0.0, 3.0, 2.0, 3.0, 5.0, 7.0, 12.0, 10.0, 10.0, 13.0, 16.0, 12.0, 14.0, 23.0, 23.0, 25.0, 36.0, 40.0, 27.0, 35.0, 32.0, 42.0, 27.0, 41.0, 1070.0, 40.0, 33.0, 39.0, 32.0, 50.0, 37.0, 31.0, 35.0, 28.0, 28.0, 25.0, 19.0, 18.0, 18.0, 10.0, 12.0, 10.0, 12.0, 8.0, 5.0, 6.0, 2.0, 4.0, 3.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.599609375, -2.50946044921875, -2.4193115234375, -2.32916259765625, -2.239013671875, -2.14886474609375, -2.0587158203125, -1.96856689453125, -1.87841796875, -1.78826904296875, -1.6981201171875, -1.60797119140625, -1.517822265625, -1.42767333984375, -1.3375244140625, -1.24737548828125, -1.1572265625, -1.06707763671875, -0.9769287109375, -0.88677978515625, -0.796630859375, -0.70648193359375, -0.6163330078125, -0.52618408203125, -0.43603515625, -0.34588623046875, -0.2557373046875, -0.16558837890625, -0.075439453125, 0.01470947265625, 0.1048583984375, 0.19500732421875, 0.28515625, 0.37530517578125, 0.4654541015625, 0.55560302734375, 0.645751953125, 0.73590087890625, 0.8260498046875, 0.91619873046875, 1.00634765625, 1.09649658203125, 1.1866455078125, 1.27679443359375, 1.366943359375, 1.45709228515625, 1.5472412109375, 1.63739013671875, 1.7275390625, 1.81768798828125, 1.9078369140625, 1.99798583984375, 2.088134765625, 2.17828369140625, 2.2684326171875, 2.35858154296875, 2.44873046875, 2.53887939453125, 2.6290283203125, 2.71917724609375, 2.809326171875, 2.89947509765625, 2.9896240234375, 3.07977294921875, 3.169921875]}, "gradients/decoder.transformer.h.23.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 2.0, 5.0, 4.0, 7.0, 9.0, 14.0, 18.0, 24.0, 35.0, 54.0, 105.0, 165.0, 220.0, 401.0, 680.0, 1249.0, 2063.0, 3749.0, 6558.0, 12055.0, 22558.0, 45082.0, 92285.0, 221497.0, 1435320.0, 127789.0, 60298.0, 29879.0, 15506.0, 8465.0, 4709.0, 2647.0, 1512.0, 857.0, 522.0, 298.0, 186.0, 104.0, 64.0, 39.0, 25.0, 26.0, 23.0, 10.0, 7.0, 4.0, 3.0, 3.0, 1.0, 1.0, 2.0, 3.0, 3.0], "bins": [-1.7314453125, -1.6822662353515625, -1.633087158203125, -1.5839080810546875, -1.53472900390625, -1.4855499267578125, -1.436370849609375, -1.3871917724609375, -1.3380126953125, -1.2888336181640625, -1.239654541015625, -1.1904754638671875, -1.14129638671875, -1.0921173095703125, -1.042938232421875, -0.9937591552734375, -0.944580078125, -0.8954010009765625, -0.846221923828125, -0.7970428466796875, -0.74786376953125, -0.6986846923828125, -0.649505615234375, -0.6003265380859375, -0.5511474609375, -0.5019683837890625, -0.452789306640625, -0.4036102294921875, -0.35443115234375, -0.3052520751953125, -0.256072998046875, -0.2068939208984375, -0.15771484375, -0.1085357666015625, -0.059356689453125, -0.0101776123046875, 0.03900146484375, 0.0881805419921875, 0.137359619140625, 0.1865386962890625, 0.2357177734375, 0.2848968505859375, 0.334075927734375, 0.3832550048828125, 0.43243408203125, 0.4816131591796875, 0.530792236328125, 0.5799713134765625, 0.629150390625, 0.6783294677734375, 0.727508544921875, 0.7766876220703125, 0.82586669921875, 0.8750457763671875, 0.924224853515625, 0.9734039306640625, 1.0225830078125, 1.0717620849609375, 1.120941162109375, 1.1701202392578125, 1.21929931640625, 1.2684783935546875, 1.317657470703125, 1.3668365478515625, 1.416015625]}, "gradients/decoder.transformer.h.23.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 2.0, 2.0, 3.0, 3.0, 5.0, 3.0, 11.0, 11.0, 16.0, 24.0, 27.0, 34.0, 45.0, 79.0, 94.0, 91.0, 104.0, 103.0, 89.0, 65.0, 43.0, 48.0, 24.0, 28.0, 12.0, 10.0, 7.0, 1.0, 9.0, 4.0, 4.0, 1.0, 3.0, 0.0, 4.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0010557174682617188, -0.0010241195559501648, -0.0009925216436386108, -0.0009609237313270569, -0.0009293258190155029, -0.000897727906703949, -0.000866129994392395, -0.0008345320820808411, -0.0008029341697692871, -0.0007713362574577332, -0.0007397383451461792, -0.0007081404328346252, -0.0006765425205230713, -0.0006449446082115173, -0.0006133466958999634, -0.0005817487835884094, -0.0005501508712768555, -0.0005185529589653015, -0.00048695504665374756, -0.0004553571343421936, -0.00042375922203063965, -0.0003921613097190857, -0.00036056339740753174, -0.0003289654850959778, -0.00029736757278442383, -0.0002657696604728699, -0.00023417174816131592, -0.00020257383584976196, -0.000170975923538208, -0.00013937801122665405, -0.0001077800989151001, -7.618218660354614e-05, -4.458427429199219e-05, -1.2986361980438232e-05, 1.8611550331115723e-05, 5.020946264266968e-05, 8.180737495422363e-05, 0.00011340528726577759, 0.00014500319957733154, 0.0001766011118888855, 0.00020819902420043945, 0.0002397969365119934, 0.00027139484882354736, 0.0003029927611351013, 0.0003345906734466553, 0.00036618858575820923, 0.0003977864980697632, 0.00042938441038131714, 0.0004609823226928711, 0.000492580235004425, 0.000524178147315979, 0.000555776059627533, 0.0005873739719390869, 0.0006189718842506409, 0.0006505697965621948, 0.0006821677088737488, 0.0007137656211853027, 0.0007453635334968567, 0.0007769614458084106, 0.0008085593581199646, 0.0008401572704315186, 0.0008717551827430725, 0.0009033530950546265, 0.0009349510073661804, 0.0009665489196777344]}, "gradients/decoder.transformer.h.23.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 2.0, 5.0, 1.0, 3.0, 5.0, 9.0, 12.0, 10.0, 9.0, 22.0, 25.0, 36.0, 61.0, 98.0, 159.0, 215.0, 353.0, 701.0, 5142.0, 1039413.0, 996.0, 465.0, 295.0, 185.0, 109.0, 63.0, 53.0, 30.0, 22.0, 18.0, 14.0, 9.0, 10.0, 2.0, 5.0, 1.0, 1.0, 1.0, 3.0, 0.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0203399658203125, -0.01968860626220703, -0.019037246704101562, -0.018385887145996094, -0.017734527587890625, -0.017083168029785156, -0.016431808471679688, -0.01578044891357422, -0.01512908935546875, -0.014477729797363281, -0.013826370239257812, -0.013175010681152344, -0.012523651123046875, -0.011872291564941406, -0.011220932006835938, -0.010569572448730469, -0.009918212890625, -0.009266853332519531, -0.008615493774414062, -0.007964134216308594, -0.007312774658203125, -0.006661415100097656, -0.0060100555419921875, -0.005358695983886719, -0.00470733642578125, -0.004055976867675781, -0.0034046173095703125, -0.0027532577514648438, -0.002101898193359375, -0.0014505386352539062, -0.0007991790771484375, -0.00014781951904296875, 0.0005035400390625, 0.0011548995971679688, 0.0018062591552734375, 0.0024576187133789062, 0.003108978271484375, 0.0037603378295898438, 0.0044116973876953125, 0.005063056945800781, 0.00571441650390625, 0.006365776062011719, 0.0070171356201171875, 0.007668495178222656, 0.008319854736328125, 0.008971214294433594, 0.009622573852539062, 0.010273933410644531, 0.01092529296875, 0.011576652526855469, 0.012228012084960938, 0.012879371643066406, 0.013530731201171875, 0.014182090759277344, 0.014833450317382812, 0.015484809875488281, 0.01613616943359375, 0.01678752899169922, 0.017438888549804688, 0.018090248107910156, 0.018741607666015625, 0.019392967224121094, 0.020044326782226562, 0.02069568634033203, 0.0213470458984375]}, "gradients/decoder.transformer.h.23.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 8.0, 859.0, 150.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.003812056966125965, -0.0037229156587272882, -0.0036337743513286114, -0.003544632811099291, -0.003455491503700614, -0.003366350196301937, -0.0032772086560726166, -0.0031880673486739397, -0.003098926041275263, -0.003009784733876586, -0.002920643426477909, -0.0028315018862485886, -0.0027423605788499117, -0.002653219271451235, -0.0025640777312219143, -0.0024749364238232374, -0.0023857951164245605, -0.0022966538090258837, -0.002207512501627207, -0.0021183709613978863, -0.0020292296539992094, -0.0019400883466005325, -0.0018509469227865338, -0.0017618054989725351, -0.0016726641915738583, -0.0015835228841751814, -0.0014943814603611827, -0.001405240036547184, -0.0013160987291485071, -0.0012269574217498302, -0.0011378159979358315, -0.0010486745741218328, -0.0009595331503078341, -0.0008703917847014964, -0.0007812504190951586, -0.0006921090534888208, -0.000602967687882483, -0.0005138263222761452, -0.00042468495666980743, -0.00033554359106346965, -0.00024640222545713186, -0.00015726085985079408, -6.811949424445629e-05, 2.1021871361881495e-05, 0.00011016323696821928, 0.00019930460257455707, 0.00028844596818089485, 0.00037758733378723264, 0.0004667286993935704, 0.0005558700649999082, 0.000645011430606246, 0.0007341527962125838, 0.0008232941618189216, 0.0009124355274252594, 0.0010015768930315971, 0.001090718200430274, 0.0011798596242442727, 0.0012690010480582714, 0.0013581423554569483, 0.0014472836628556252, 0.0015364250866696239, 0.0016255665104836226, 0.0017147078178822994, 0.0018038491252809763, 0.001892990549094975]}, "gradients/decoder.transformer.h.23.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 1.0, 3.0, 2.0, 3.0, 4.0, 6.0, 5.0, 9.0, 11.0, 14.0, 12.0, 22.0, 23.0, 29.0, 22.0, 30.0, 29.0, 45.0, 41.0, 38.0, 40.0, 40.0, 47.0, 44.0, 35.0, 48.0, 34.0, 34.0, 41.0, 49.0, 42.0, 36.0, 28.0, 37.0, 19.0, 19.0, 14.0, 8.0, 10.0, 7.0, 11.0, 10.0, 4.0, 2.0, 3.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 2.0], "bins": [-0.0004277825355529785, -0.00041530653834342957, -0.0004028305411338806, -0.00039035454392433167, -0.0003778785467147827, -0.00036540254950523376, -0.0003529265522956848, -0.00034045055508613586, -0.0003279745578765869, -0.00031549856066703796, -0.000303022563457489, -0.00029054656624794006, -0.0002780705690383911, -0.00026559457182884216, -0.0002531185746192932, -0.00024064257740974426, -0.0002281665802001953, -0.00021569058299064636, -0.0002032145857810974, -0.00019073858857154846, -0.0001782625913619995, -0.00016578659415245056, -0.0001533105969429016, -0.00014083459973335266, -0.0001283586025238037, -0.00011588260531425476, -0.00010340660810470581, -9.093061089515686e-05, -7.845461368560791e-05, -6.597861647605896e-05, -5.350261926651001e-05, -4.102662205696106e-05, -2.855062484741211e-05, -1.607462763786316e-05, -3.598630428314209e-06, 8.877366781234741e-06, 2.135336399078369e-05, 3.382936120033264e-05, 4.630535840988159e-05, 5.878135561943054e-05, 7.125735282897949e-05, 8.373335003852844e-05, 9.620934724807739e-05, 0.00010868534445762634, 0.00012116134166717529, 0.00013363733887672424, 0.0001461133360862732, 0.00015858933329582214, 0.0001710653305053711, 0.00018354132771492004, 0.000196017324924469, 0.00020849332213401794, 0.0002209693193435669, 0.00023344531655311584, 0.0002459213137626648, 0.00025839731097221375, 0.0002708733081817627, 0.00028334930539131165, 0.0002958253026008606, 0.00030830129981040955, 0.0003207772970199585, 0.00033325329422950745, 0.0003457292914390564, 0.00035820528864860535, 0.0003706812858581543]}, "gradients/decoder.transformer.h.23.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 5.0, 2.0, 6.0, 3.0, 11.0, 3.0, 2.0, 8.0, 10.0, 12.0, 7.0, 23.0, 21.0, 18.0, 20.0, 33.0, 25.0, 32.0, 37.0, 34.0, 44.0, 38.0, 43.0, 43.0, 43.0, 48.0, 43.0, 50.0, 34.0, 35.0, 34.0, 25.0, 23.0, 24.0, 26.0, 26.0, 16.0, 18.0, 19.0, 12.0, 15.0, 5.0, 7.0, 7.0, 10.0, 4.0, 6.0, 2.0, 3.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-4.42578125, -4.2813720703125, -4.136962890625, -3.9925537109375, -3.84814453125, -3.7037353515625, -3.559326171875, -3.4149169921875, -3.2705078125, -3.1260986328125, -2.981689453125, -2.8372802734375, -2.69287109375, -2.5484619140625, -2.404052734375, -2.2596435546875, -2.115234375, -1.9708251953125, -1.826416015625, -1.6820068359375, -1.53759765625, -1.3931884765625, -1.248779296875, -1.1043701171875, -0.9599609375, -0.8155517578125, -0.671142578125, -0.5267333984375, -0.38232421875, -0.2379150390625, -0.093505859375, 0.0509033203125, 0.1953125, 0.3397216796875, 0.484130859375, 0.6285400390625, 0.77294921875, 0.9173583984375, 1.061767578125, 1.2061767578125, 1.3505859375, 1.4949951171875, 1.639404296875, 1.7838134765625, 1.92822265625, 2.0726318359375, 2.217041015625, 2.3614501953125, 2.505859375, 2.6502685546875, 2.794677734375, 2.9390869140625, 3.08349609375, 3.2279052734375, 3.372314453125, 3.5167236328125, 3.6611328125, 3.8055419921875, 3.949951171875, 4.0943603515625, 4.23876953125, 4.3831787109375, 4.527587890625, 4.6719970703125, 4.81640625]}, "gradients/decoder.transformer.h.23.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 3.0, 1.0, 5.0, 2.0, 4.0, 7.0, 13.0, 8.0, 18.0, 20.0, 24.0, 35.0, 48.0, 66.0, 90.0, 130.0, 175.0, 256.0, 350.0, 467.0, 711.0, 1017.0, 1430.0, 2131.0, 3305.0, 5436.0, 10812.0, 35930.0, 883127.0, 71501.0, 13456.0, 6389.0, 3770.0, 2460.0, 1645.0, 1061.0, 761.0, 525.0, 385.0, 253.0, 224.0, 130.0, 113.0, 69.0, 57.0, 51.0, 21.0, 19.0, 17.0, 12.0, 9.0, 5.0, 5.0, 6.0, 0.0, 2.0, 4.0, 1.0, 0.0, 1.0, 1.0], "bins": [-36.71875, -35.55810546875, -34.3974609375, -33.23681640625, -32.076171875, -30.91552734375, -29.7548828125, -28.59423828125, -27.43359375, -26.27294921875, -25.1123046875, -23.95166015625, -22.791015625, -21.63037109375, -20.4697265625, -19.30908203125, -18.1484375, -16.98779296875, -15.8271484375, -14.66650390625, -13.505859375, -12.34521484375, -11.1845703125, -10.02392578125, -8.86328125, -7.70263671875, -6.5419921875, -5.38134765625, -4.220703125, -3.06005859375, -1.8994140625, -0.73876953125, 0.421875, 1.58251953125, 2.7431640625, 3.90380859375, 5.064453125, 6.22509765625, 7.3857421875, 8.54638671875, 9.70703125, 10.86767578125, 12.0283203125, 13.18896484375, 14.349609375, 15.51025390625, 16.6708984375, 17.83154296875, 18.9921875, 20.15283203125, 21.3134765625, 22.47412109375, 23.634765625, 24.79541015625, 25.9560546875, 27.11669921875, 28.27734375, 29.43798828125, 30.5986328125, 31.75927734375, 32.919921875, 34.08056640625, 35.2412109375, 36.40185546875, 37.5625]}, "gradients/decoder.transformer.h.23.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 1.0, 3.0, 5.0, 4.0, 7.0, 6.0, 12.0, 12.0, 9.0, 14.0, 18.0, 18.0, 24.0, 22.0, 21.0, 27.0, 24.0, 42.0, 31.0, 31.0, 36.0, 58.0, 90.0, 262.0, 1650.0, 190.0, 58.0, 46.0, 36.0, 38.0, 35.0, 33.0, 22.0, 27.0, 21.0, 15.0, 18.0, 23.0, 7.0, 10.0, 11.0, 7.0, 9.0, 6.0, 4.0, 7.0, 5.0, 2.0, 1.0, 0.0, 2.0, 0.0, 1.0, 2.0], "bins": [-14.9296875, -14.4954833984375, -14.061279296875, -13.6270751953125, -13.19287109375, -12.7586669921875, -12.324462890625, -11.8902587890625, -11.4560546875, -11.0218505859375, -10.587646484375, -10.1534423828125, -9.71923828125, -9.2850341796875, -8.850830078125, -8.4166259765625, -7.982421875, -7.5482177734375, -7.114013671875, -6.6798095703125, -6.24560546875, -5.8114013671875, -5.377197265625, -4.9429931640625, -4.5087890625, -4.0745849609375, -3.640380859375, -3.2061767578125, -2.77197265625, -2.3377685546875, -1.903564453125, -1.4693603515625, -1.03515625, -0.6009521484375, -0.166748046875, 0.2674560546875, 0.70166015625, 1.1358642578125, 1.570068359375, 2.0042724609375, 2.4384765625, 2.8726806640625, 3.306884765625, 3.7410888671875, 4.17529296875, 4.6094970703125, 5.043701171875, 5.4779052734375, 5.912109375, 6.3463134765625, 6.780517578125, 7.2147216796875, 7.64892578125, 8.0831298828125, 8.517333984375, 8.9515380859375, 9.3857421875, 9.8199462890625, 10.254150390625, 10.6883544921875, 11.12255859375, 11.5567626953125, 11.990966796875, 12.4251708984375, 12.859375]}, "gradients/decoder.transformer.h.23.attn.c_attn.weight": {"_type": "histogram", "values": [2.0, 0.0, 2.0, 1.0, 1.0, 1.0, 3.0, 3.0, 3.0, 10.0, 4.0, 5.0, 11.0, 12.0, 10.0, 13.0, 16.0, 12.0, 29.0, 19.0, 16.0, 28.0, 30.0, 46.0, 54.0, 87.0, 189.0, 630.0, 5452.0, 3130980.0, 6665.0, 708.0, 210.0, 99.0, 66.0, 40.0, 43.0, 30.0, 24.0, 24.0, 17.0, 16.0, 24.0, 16.0, 17.0, 9.0, 6.0, 9.0, 7.0, 6.0, 5.0, 5.0, 3.0, 3.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 2.0], "bins": [-58.71875, -56.72998046875, -54.7412109375, -52.75244140625, -50.763671875, -48.77490234375, -46.7861328125, -44.79736328125, -42.80859375, -40.81982421875, -38.8310546875, -36.84228515625, -34.853515625, -32.86474609375, -30.8759765625, -28.88720703125, -26.8984375, -24.90966796875, -22.9208984375, -20.93212890625, -18.943359375, -16.95458984375, -14.9658203125, -12.97705078125, -10.98828125, -8.99951171875, -7.0107421875, -5.02197265625, -3.033203125, -1.04443359375, 0.9443359375, 2.93310546875, 4.921875, 6.91064453125, 8.8994140625, 10.88818359375, 12.876953125, 14.86572265625, 16.8544921875, 18.84326171875, 20.83203125, 22.82080078125, 24.8095703125, 26.79833984375, 28.787109375, 30.77587890625, 32.7646484375, 34.75341796875, 36.7421875, 38.73095703125, 40.7197265625, 42.70849609375, 44.697265625, 46.68603515625, 48.6748046875, 50.66357421875, 52.65234375, 54.64111328125, 56.6298828125, 58.61865234375, 60.607421875, 62.59619140625, 64.5849609375, 66.57373046875, 68.5625]}, "gradients/decoder.transformer.h.23.ln_1.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 4.0, 407.0, 608.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-59.58740997314453, -49.696773529052734, -39.80613708496094, -29.915504455566406, -20.02486801147461, -10.134231567382812, -0.24359893798828125, 9.647041320800781, 19.537673950195312, 29.42831039428711, 39.318946838378906, 49.20957946777344, 59.100215911865234, 68.99085235595703, 78.88148498535156, 88.77212524414062, 98.66275787353516, 108.55339050292969, 118.44403076171875, 128.33465576171875, 138.2252960205078, 148.11593627929688, 158.00656127929688, 167.897216796875, 177.787841796875, 187.67848205566406, 197.56910705566406, 207.45974731445312, 217.3503875732422, 227.24102783203125, 237.13165283203125, 247.0222930908203, 256.9129333496094, 266.8035583496094, 276.6942138671875, 286.5848388671875, 296.4754638671875, 306.3661193847656, 316.2567443847656, 326.14739990234375, 336.03802490234375, 345.92864990234375, 355.8193054199219, 365.7099304199219, 375.6005554199219, 385.4912109375, 395.3818359375, 405.2724609375, 415.1630859375, 425.0537109375, 434.9443664550781, 444.8349914550781, 454.7256164550781, 464.61627197265625, 474.50689697265625, 484.39752197265625, 494.2881774902344, 504.1788024902344, 514.0694580078125, 523.9600830078125, 533.8507080078125, 543.7413330078125, 553.6319580078125, 563.5226440429688, 573.4132690429688]}, "gradients/decoder.transformer.h.23.ln_1.bias": {"_type": "histogram", "values": [5.0, 4.0, 4.0, 4.0, 3.0, 3.0, 3.0, 10.0, 8.0, 5.0, 11.0, 9.0, 19.0, 14.0, 16.0, 22.0, 23.0, 21.0, 21.0, 27.0, 27.0, 31.0, 29.0, 27.0, 30.0, 39.0, 37.0, 41.0, 26.0, 41.0, 36.0, 47.0, 45.0, 30.0, 34.0, 33.0, 32.0, 25.0, 28.0, 22.0, 15.0, 21.0, 20.0, 7.0, 12.0, 8.0, 7.0, 3.0, 7.0, 3.0, 6.0, 4.0, 3.0, 3.0, 2.0, 3.0, 0.0, 3.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-31.174589157104492, -29.939983367919922, -28.70537757873535, -27.47077178955078, -26.236164093017578, -25.00156021118164, -23.766952514648438, -22.532346725463867, -21.297740936279297, -20.063135147094727, -18.828529357910156, -17.593923568725586, -16.359317779541016, -15.124711036682129, -13.890104293823242, -12.655498504638672, -11.420892715454102, -10.186286926269531, -8.951681137084961, -7.717074394226074, -6.482468605041504, -5.247862815856934, -4.013256549835205, -2.7786502838134766, -1.5440444946289062, -0.30943846702575684, 0.9251675605773926, 2.159773588180542, 3.3943796157836914, 4.628985404968262, 5.86359167098999, 7.098197937011719, 8.332801818847656, 9.567407608032227, 10.802013397216797, 12.036620140075684, 13.271225929260254, 14.505831718444824, 15.740438461303711, 16.97504425048828, 18.20965003967285, 19.444255828857422, 20.678861618041992, 21.913467407226562, 23.148075103759766, 24.382678985595703, 25.617286682128906, 26.851892471313477, 28.086498260498047, 29.321104049682617, 30.555709838867188, 31.790315628051758, 33.02492141723633, 34.25952911376953, 35.49413299560547, 36.72874069213867, 37.963348388671875, 39.19795608520508, 40.432559967041016, 41.66716766357422, 42.901771545410156, 44.13637924194336, 45.3709831237793, 46.6055908203125, 47.84019470214844]}, "gradients/decoder.transformer.h.22.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 0.0, 3.0, 1.0, 3.0, 3.0, 10.0, 4.0, 3.0, 8.0, 4.0, 8.0, 8.0, 12.0, 10.0, 15.0, 31.0, 25.0, 24.0, 20.0, 34.0, 35.0, 34.0, 30.0, 50.0, 41.0, 44.0, 38.0, 44.0, 49.0, 36.0, 43.0, 39.0, 34.0, 28.0, 31.0, 27.0, 21.0, 21.0, 16.0, 24.0, 22.0, 13.0, 15.0, 12.0, 6.0, 8.0, 6.0, 7.0, 8.0, 3.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-4.70703125, -4.55645751953125, -4.4058837890625, -4.25531005859375, -4.104736328125, -3.95416259765625, -3.8035888671875, -3.65301513671875, -3.50244140625, -3.35186767578125, -3.2012939453125, -3.05072021484375, -2.900146484375, -2.74957275390625, -2.5989990234375, -2.44842529296875, -2.2978515625, -2.14727783203125, -1.9967041015625, -1.84613037109375, -1.695556640625, -1.54498291015625, -1.3944091796875, -1.24383544921875, -1.09326171875, -0.94268798828125, -0.7921142578125, -0.64154052734375, -0.490966796875, -0.34039306640625, -0.1898193359375, -0.03924560546875, 0.111328125, 0.26190185546875, 0.4124755859375, 0.56304931640625, 0.713623046875, 0.86419677734375, 1.0147705078125, 1.16534423828125, 1.31591796875, 1.46649169921875, 1.6170654296875, 1.76763916015625, 1.918212890625, 2.06878662109375, 2.2193603515625, 2.36993408203125, 2.5205078125, 2.67108154296875, 2.8216552734375, 2.97222900390625, 3.122802734375, 3.27337646484375, 3.4239501953125, 3.57452392578125, 3.72509765625, 3.87567138671875, 4.0262451171875, 4.17681884765625, 4.327392578125, 4.47796630859375, 4.6285400390625, 4.77911376953125, 4.9296875]}, "gradients/decoder.transformer.h.22.mlp.c_proj.weight": {"_type": "histogram", "values": [3.0, 1.0, 2.0, 1.0, 1.0, 0.0, 3.0, 5.0, 4.0, 4.0, 4.0, 6.0, 4.0, 7.0, 18.0, 20.0, 28.0, 29.0, 49.0, 60.0, 98.0, 152.0, 266.0, 433.0, 700.0, 1394.0, 2683.0, 5659.0, 13781.0, 42436.0, 681364.0, 3318202.0, 88889.0, 22199.0, 8477.0, 3631.0, 1675.0, 808.0, 446.0, 263.0, 168.0, 93.0, 56.0, 50.0, 29.0, 26.0, 23.0, 14.0, 9.0, 7.0, 4.0, 7.0, 2.0, 2.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-33.875, -32.7900390625, -31.705078125, -30.6201171875, -29.53515625, -28.4501953125, -27.365234375, -26.2802734375, -25.1953125, -24.1103515625, -23.025390625, -21.9404296875, -20.85546875, -19.7705078125, -18.685546875, -17.6005859375, -16.515625, -15.4306640625, -14.345703125, -13.2607421875, -12.17578125, -11.0908203125, -10.005859375, -8.9208984375, -7.8359375, -6.7509765625, -5.666015625, -4.5810546875, -3.49609375, -2.4111328125, -1.326171875, -0.2412109375, 0.84375, 1.9287109375, 3.013671875, 4.0986328125, 5.18359375, 6.2685546875, 7.353515625, 8.4384765625, 9.5234375, 10.6083984375, 11.693359375, 12.7783203125, 13.86328125, 14.9482421875, 16.033203125, 17.1181640625, 18.203125, 19.2880859375, 20.373046875, 21.4580078125, 22.54296875, 23.6279296875, 24.712890625, 25.7978515625, 26.8828125, 27.9677734375, 29.052734375, 30.1376953125, 31.22265625, 32.3076171875, 33.392578125, 34.4775390625, 35.5625]}, "gradients/decoder.transformer.h.22.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 6.0, 1.0, 2.0, 1.0, 6.0, 4.0, 10.0, 13.0, 20.0, 19.0, 26.0, 38.0, 58.0, 78.0, 95.0, 160.0, 299.0, 499.0, 747.0, 756.0, 452.0, 251.0, 178.0, 106.0, 68.0, 61.0, 39.0, 23.0, 18.0, 14.0, 13.0, 7.0, 6.0, 4.0, 0.0, 2.0, 5.0, 2.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-20.578125, -19.642822265625, -18.70751953125, -17.772216796875, -16.8369140625, -15.901611328125, -14.96630859375, -14.031005859375, -13.095703125, -12.160400390625, -11.22509765625, -10.289794921875, -9.3544921875, -8.419189453125, -7.48388671875, -6.548583984375, -5.61328125, -4.677978515625, -3.74267578125, -2.807373046875, -1.8720703125, -0.936767578125, -0.00146484375, 0.933837890625, 1.869140625, 2.804443359375, 3.73974609375, 4.675048828125, 5.6103515625, 6.545654296875, 7.48095703125, 8.416259765625, 9.3515625, 10.286865234375, 11.22216796875, 12.157470703125, 13.0927734375, 14.028076171875, 14.96337890625, 15.898681640625, 16.833984375, 17.769287109375, 18.70458984375, 19.639892578125, 20.5751953125, 21.510498046875, 22.44580078125, 23.381103515625, 24.31640625, 25.251708984375, 26.18701171875, 27.122314453125, 28.0576171875, 28.992919921875, 29.92822265625, 30.863525390625, 31.798828125, 32.734130859375, 33.66943359375, 34.604736328125, 35.5400390625, 36.475341796875, 37.41064453125, 38.345947265625, 39.28125]}, "gradients/decoder.transformer.h.22.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 3.0, 2.0, 2.0, 4.0, 3.0, 4.0, 8.0, 14.0, 14.0, 20.0, 33.0, 47.0, 69.0, 98.0, 188.0, 279.0, 492.0, 3615.0, 4184496.0, 3565.0, 527.0, 283.0, 173.0, 101.0, 79.0, 58.0, 28.0, 25.0, 24.0, 12.0, 8.0, 9.0, 2.0, 1.0, 3.0, 3.0, 4.0, 0.0, 1.0], "bins": [-216.375, -211.3994140625, -206.423828125, -201.4482421875, -196.47265625, -191.4970703125, -186.521484375, -181.5458984375, -176.5703125, -171.5947265625, -166.619140625, -161.6435546875, -156.66796875, -151.6923828125, -146.716796875, -141.7412109375, -136.765625, -131.7900390625, -126.814453125, -121.8388671875, -116.86328125, -111.8876953125, -106.912109375, -101.9365234375, -96.9609375, -91.9853515625, -87.009765625, -82.0341796875, -77.05859375, -72.0830078125, -67.107421875, -62.1318359375, -57.15625, -52.1806640625, -47.205078125, -42.2294921875, -37.25390625, -32.2783203125, -27.302734375, -22.3271484375, -17.3515625, -12.3759765625, -7.400390625, -2.4248046875, 2.55078125, 7.5263671875, 12.501953125, 17.4775390625, 22.453125, 27.4287109375, 32.404296875, 37.3798828125, 42.35546875, 47.3310546875, 52.306640625, 57.2822265625, 62.2578125, 67.2333984375, 72.208984375, 77.1845703125, 82.16015625, 87.1357421875, 92.111328125, 97.0869140625, 102.0625]}, "gradients/decoder.transformer.h.22.ln_2.weight": {"_type": "histogram", "values": [2.0, 0.0, 4.0, 142.0, 705.0, 163.0, 4.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-35.7835693359375, -27.871990203857422, -19.960412979125977, -12.048835754394531, -4.137256622314453, 3.774322509765625, 11.685897827148438, 19.597476959228516, 27.509056091308594, 35.42063522338867, 43.33221435546875, 51.24378967285156, 59.15536880493164, 67.06694793701172, 74.97852325439453, 82.89010620117188, 90.80168151855469, 98.7132568359375, 106.62483978271484, 114.53641510009766, 122.447998046875, 130.3595733642578, 138.27114868164062, 146.18272399902344, 154.09429931640625, 162.00587463378906, 169.91744995117188, 177.82904052734375, 185.74061584472656, 193.65219116210938, 201.5637664794922, 209.475341796875, 217.38693237304688, 225.2985076904297, 233.2100830078125, 241.12167358398438, 249.0332489013672, 256.94482421875, 264.85638427734375, 272.7679748535156, 280.6795654296875, 288.5911560058594, 296.5027160644531, 304.414306640625, 312.32586669921875, 320.2374572753906, 328.1490478515625, 336.06060791015625, 343.97216796875, 351.8837585449219, 359.7953186035156, 367.7069091796875, 375.61846923828125, 383.5300598144531, 391.441650390625, 399.35321044921875, 407.2648010253906, 415.1763916015625, 423.08795166015625, 430.9995422363281, 438.9111022949219, 446.82269287109375, 454.7342529296875, 462.6458435058594, 470.55743408203125]}, "gradients/decoder.transformer.h.22.ln_2.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 4.0, 3.0, 4.0, 5.0, 10.0, 21.0, 15.0, 22.0, 24.0, 24.0, 27.0, 26.0, 35.0, 38.0, 43.0, 41.0, 54.0, 40.0, 50.0, 54.0, 55.0, 50.0, 41.0, 43.0, 32.0, 36.0, 42.0, 32.0, 21.0, 16.0, 21.0, 18.0, 12.0, 11.0, 10.0, 9.0, 7.0, 5.0, 6.0, 2.0, 1.0, 2.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-82.56156158447266, -79.8252944946289, -77.08902740478516, -74.3527603149414, -71.61648559570312, -68.88021850585938, -66.14395141601562, -63.407684326171875, -60.671417236328125, -57.935150146484375, -55.198883056640625, -52.46261215209961, -49.72634506225586, -46.99007797241211, -44.253807067871094, -41.517539978027344, -38.781272888183594, -36.045005798339844, -33.308738708496094, -30.572467803955078, -27.836200714111328, -25.099933624267578, -22.363664627075195, -19.627395629882812, -16.891128540039062, -14.154860496520996, -11.41859245300293, -8.682324409484863, -5.946056365966797, -3.2097883224487305, -0.47352027893066406, 2.2627487182617188, 4.999015808105469, 7.735283851623535, 10.471551895141602, 13.207819938659668, 15.944087982177734, 18.680355072021484, 21.416624069213867, 24.15289306640625, 26.88916015625, 29.62542724609375, 32.3616943359375, 35.097965240478516, 37.834232330322266, 40.570499420166016, 43.30677032470703, 46.04303741455078, 48.77930450439453, 51.51557159423828, 54.25183868408203, 56.98810958862305, 59.7243766784668, 62.46064376831055, 65.19691467285156, 67.93318176269531, 70.66944885253906, 73.40571594238281, 76.14198303222656, 78.87825012207031, 81.61451721191406, 84.35079193115234, 87.0870590209961, 89.82332611083984, 92.5595932006836]}, "gradients/decoder.transformer.h.22.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 4.0, 2.0, 0.0, 6.0, 6.0, 6.0, 5.0, 2.0, 8.0, 4.0, 14.0, 7.0, 12.0, 10.0, 26.0, 24.0, 30.0, 31.0, 21.0, 33.0, 43.0, 42.0, 37.0, 39.0, 51.0, 41.0, 42.0, 43.0, 41.0, 48.0, 38.0, 33.0, 28.0, 28.0, 35.0, 19.0, 24.0, 22.0, 19.0, 15.0, 15.0, 15.0, 5.0, 8.0, 9.0, 7.0, 7.0, 3.0, 5.0, 2.0, 0.0, 0.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.76171875, -4.60430908203125, -4.4468994140625, -4.28948974609375, -4.132080078125, -3.97467041015625, -3.8172607421875, -3.65985107421875, -3.50244140625, -3.34503173828125, -3.1876220703125, -3.03021240234375, -2.872802734375, -2.71539306640625, -2.5579833984375, -2.40057373046875, -2.2431640625, -2.08575439453125, -1.9283447265625, -1.77093505859375, -1.613525390625, -1.45611572265625, -1.2987060546875, -1.14129638671875, -0.98388671875, -0.82647705078125, -0.6690673828125, -0.51165771484375, -0.354248046875, -0.19683837890625, -0.0394287109375, 0.11798095703125, 0.275390625, 0.43280029296875, 0.5902099609375, 0.74761962890625, 0.905029296875, 1.06243896484375, 1.2198486328125, 1.37725830078125, 1.53466796875, 1.69207763671875, 1.8494873046875, 2.00689697265625, 2.164306640625, 2.32171630859375, 2.4791259765625, 2.63653564453125, 2.7939453125, 2.95135498046875, 3.1087646484375, 3.26617431640625, 3.423583984375, 3.58099365234375, 3.7384033203125, 3.89581298828125, 4.05322265625, 4.21063232421875, 4.3680419921875, 4.52545166015625, 4.682861328125, 4.84027099609375, 4.9976806640625, 5.15509033203125, 5.3125]}, "gradients/decoder.transformer.h.22.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 1.0, 2.0, 1.0, 6.0, 3.0, 13.0, 12.0, 16.0, 19.0, 33.0, 56.0, 98.0, 136.0, 215.0, 384.0, 562.0, 976.0, 1697.0, 2810.0, 5012.0, 8632.0, 15533.0, 29427.0, 57835.0, 129428.0, 426571.0, 203716.0, 80029.0, 38744.0, 20473.0, 11041.0, 6213.0, 3572.0, 2134.0, 1263.0, 735.0, 455.0, 260.0, 156.0, 88.0, 67.0, 50.0, 18.0, 27.0, 11.0, 9.0, 8.0, 8.0, 4.0, 2.0, 3.0, 4.0, 0.0, 0.0, 0.0, 2.0], "bins": [-1.873046875, -1.8174896240234375, -1.761932373046875, -1.7063751220703125, -1.65081787109375, -1.5952606201171875, -1.539703369140625, -1.4841461181640625, -1.4285888671875, -1.3730316162109375, -1.317474365234375, -1.2619171142578125, -1.20635986328125, -1.1508026123046875, -1.095245361328125, -1.0396881103515625, -0.984130859375, -0.9285736083984375, -0.873016357421875, -0.8174591064453125, -0.76190185546875, -0.7063446044921875, -0.650787353515625, -0.5952301025390625, -0.5396728515625, -0.4841156005859375, -0.428558349609375, -0.3730010986328125, -0.31744384765625, -0.2618865966796875, -0.206329345703125, -0.1507720947265625, -0.09521484375, -0.0396575927734375, 0.015899658203125, 0.0714569091796875, 0.12701416015625, 0.1825714111328125, 0.238128662109375, 0.2936859130859375, 0.3492431640625, 0.4048004150390625, 0.460357666015625, 0.5159149169921875, 0.57147216796875, 0.6270294189453125, 0.682586669921875, 0.7381439208984375, 0.793701171875, 0.8492584228515625, 0.904815673828125, 0.9603729248046875, 1.01593017578125, 1.0714874267578125, 1.127044677734375, 1.1826019287109375, 1.2381591796875, 1.2937164306640625, 1.349273681640625, 1.4048309326171875, 1.46038818359375, 1.5159454345703125, 1.571502685546875, 1.6270599365234375, 1.6826171875]}, "gradients/decoder.transformer.h.22.crossattention.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 4.0, 1.0, 3.0, 2.0, 3.0, 5.0, 6.0, 6.0, 6.0, 11.0, 11.0, 6.0, 14.0, 26.0, 22.0, 22.0, 23.0, 34.0, 30.0, 30.0, 37.0, 45.0, 42.0, 42.0, 40.0, 37.0, 1063.0, 36.0, 33.0, 30.0, 30.0, 24.0, 38.0, 39.0, 26.0, 34.0, 34.0, 28.0, 19.0, 15.0, 15.0, 13.0, 14.0, 12.0, 10.0, 7.0, 6.0, 2.0, 3.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-3.056640625, -2.95751953125, -2.8583984375, -2.75927734375, -2.66015625, -2.56103515625, -2.4619140625, -2.36279296875, -2.263671875, -2.16455078125, -2.0654296875, -1.96630859375, -1.8671875, -1.76806640625, -1.6689453125, -1.56982421875, -1.470703125, -1.37158203125, -1.2724609375, -1.17333984375, -1.07421875, -0.97509765625, -0.8759765625, -0.77685546875, -0.677734375, -0.57861328125, -0.4794921875, -0.38037109375, -0.28125, -0.18212890625, -0.0830078125, 0.01611328125, 0.115234375, 0.21435546875, 0.3134765625, 0.41259765625, 0.51171875, 0.61083984375, 0.7099609375, 0.80908203125, 0.908203125, 1.00732421875, 1.1064453125, 1.20556640625, 1.3046875, 1.40380859375, 1.5029296875, 1.60205078125, 1.701171875, 1.80029296875, 1.8994140625, 1.99853515625, 2.09765625, 2.19677734375, 2.2958984375, 2.39501953125, 2.494140625, 2.59326171875, 2.6923828125, 2.79150390625, 2.890625, 2.98974609375, 3.0888671875, 3.18798828125, 3.287109375]}, "gradients/decoder.transformer.h.22.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 6.0, 5.0, 9.0, 10.0, 14.0, 28.0, 37.0, 60.0, 77.0, 148.0, 240.0, 370.0, 637.0, 1119.0, 1936.0, 3747.0, 6593.0, 12496.0, 24109.0, 48334.0, 100126.0, 243320.0, 1408620.0, 123933.0, 59360.0, 29177.0, 14967.0, 7902.0, 4203.0, 2371.0, 1313.0, 752.0, 425.0, 264.0, 147.0, 88.0, 62.0, 39.0, 32.0, 19.0, 8.0, 11.0, 8.0, 6.0, 5.0, 3.0, 2.0, 1.0, 4.0, 0.0, 0.0, 0.0, 2.0], "bins": [-1.7978515625, -1.7436065673828125, -1.689361572265625, -1.6351165771484375, -1.58087158203125, -1.5266265869140625, -1.472381591796875, -1.4181365966796875, -1.3638916015625, -1.3096466064453125, -1.255401611328125, -1.2011566162109375, -1.14691162109375, -1.0926666259765625, -1.038421630859375, -0.9841766357421875, -0.929931640625, -0.8756866455078125, -0.821441650390625, -0.7671966552734375, -0.71295166015625, -0.6587066650390625, -0.604461669921875, -0.5502166748046875, -0.4959716796875, -0.4417266845703125, -0.387481689453125, -0.3332366943359375, -0.27899169921875, -0.2247467041015625, -0.170501708984375, -0.1162567138671875, -0.06201171875, -0.0077667236328125, 0.046478271484375, 0.1007232666015625, 0.15496826171875, 0.2092132568359375, 0.263458251953125, 0.3177032470703125, 0.3719482421875, 0.4261932373046875, 0.480438232421875, 0.5346832275390625, 0.58892822265625, 0.6431732177734375, 0.697418212890625, 0.7516632080078125, 0.805908203125, 0.8601531982421875, 0.914398193359375, 0.9686431884765625, 1.02288818359375, 1.0771331787109375, 1.131378173828125, 1.1856231689453125, 1.2398681640625, 1.2941131591796875, 1.348358154296875, 1.4026031494140625, 1.45684814453125, 1.5110931396484375, 1.565338134765625, 1.6195831298828125, 1.673828125]}, "gradients/decoder.transformer.h.22.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 2.0, 0.0, 1.0, 2.0, 0.0, 1.0, 1.0, 3.0, 0.0, 7.0, 2.0, 7.0, 5.0, 4.0, 8.0, 10.0, 17.0, 21.0, 20.0, 20.0, 39.0, 39.0, 40.0, 42.0, 78.0, 60.0, 65.0, 78.0, 93.0, 59.0, 55.0, 52.0, 35.0, 32.0, 16.0, 19.0, 19.0, 8.0, 16.0, 9.0, 6.0, 8.0, 2.0, 2.0, 2.0, 3.0, 3.0, 0.0, 2.0, 4.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.0009136199951171875, -0.0008880943059921265, -0.0008625686168670654, -0.0008370429277420044, -0.0008115172386169434, -0.0007859915494918823, -0.0007604658603668213, -0.0007349401712417603, -0.0007094144821166992, -0.0006838887929916382, -0.0006583631038665771, -0.0006328374147415161, -0.0006073117256164551, -0.000581786036491394, -0.000556260347366333, -0.000530734658241272, -0.0005052089691162109, -0.0004796832799911499, -0.00045415759086608887, -0.00042863190174102783, -0.0004031062126159668, -0.00037758052349090576, -0.0003520548343658447, -0.0003265291452407837, -0.00030100345611572266, -0.0002754777669906616, -0.0002499520778656006, -0.00022442638874053955, -0.00019890069961547852, -0.00017337501049041748, -0.00014784932136535645, -0.0001223236322402954, -9.679794311523438e-05, -7.127225399017334e-05, -4.5746564865112305e-05, -2.022087574005127e-05, 5.304813385009766e-06, 3.08305025100708e-05, 5.6356191635131836e-05, 8.188188076019287e-05, 0.0001074075698852539, 0.00013293325901031494, 0.00015845894813537598, 0.000183984637260437, 0.00020951032638549805, 0.00023503601551055908, 0.0002605617046356201, 0.00028608739376068115, 0.0003116130828857422, 0.0003371387720108032, 0.00036266446113586426, 0.0003881901502609253, 0.00041371583938598633, 0.00043924152851104736, 0.0004647672176361084, 0.0004902929067611694, 0.0005158185958862305, 0.0005413442850112915, 0.0005668699741363525, 0.0005923956632614136, 0.0006179213523864746, 0.0006434470415115356, 0.0006689727306365967, 0.0006944984197616577, 0.0007200241088867188]}, "gradients/decoder.transformer.h.22.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 3.0, 0.0, 2.0, 3.0, 0.0, 1.0, 3.0, 3.0, 5.0, 5.0, 3.0, 5.0, 10.0, 11.0, 15.0, 18.0, 16.0, 23.0, 33.0, 64.0, 96.0, 129.0, 199.0, 610.0, 895977.0, 150232.0, 520.0, 196.0, 118.0, 58.0, 51.0, 32.0, 24.0, 26.0, 18.0, 11.0, 10.0, 6.0, 11.0, 5.0, 3.0, 3.0, 4.0, 3.0, 0.0, 1.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.0207061767578125, -0.019937992095947266, -0.01916980743408203, -0.018401622772216797, -0.017633438110351562, -0.016865253448486328, -0.016097068786621094, -0.01532888412475586, -0.014560699462890625, -0.01379251480102539, -0.013024330139160156, -0.012256145477294922, -0.011487960815429688, -0.010719776153564453, -0.009951591491699219, -0.009183406829833984, -0.00841522216796875, -0.007647037506103516, -0.006878852844238281, -0.006110668182373047, -0.0053424835205078125, -0.004574298858642578, -0.0038061141967773438, -0.0030379295349121094, -0.002269744873046875, -0.0015015602111816406, -0.0007333755493164062, 3.4809112548828125e-05, 0.0008029937744140625, 0.0015711784362792969, 0.0023393630981445312, 0.0031075477600097656, 0.003875732421875, 0.004643917083740234, 0.005412101745605469, 0.006180286407470703, 0.0069484710693359375, 0.007716655731201172, 0.008484840393066406, 0.00925302505493164, 0.010021209716796875, 0.01078939437866211, 0.011557579040527344, 0.012325763702392578, 0.013093948364257812, 0.013862133026123047, 0.014630317687988281, 0.015398502349853516, 0.01616668701171875, 0.016934871673583984, 0.01770305633544922, 0.018471240997314453, 0.019239425659179688, 0.020007610321044922, 0.020775794982910156, 0.02154397964477539, 0.022312164306640625, 0.02308034896850586, 0.023848533630371094, 0.024616718292236328, 0.025384902954101562, 0.026153087615966797, 0.02692127227783203, 0.027689456939697266, 0.0284576416015625]}, "gradients/decoder.transformer.h.22.ln_cross_attn.weight": {"_type": "histogram", "values": [7.0, 25.0, 62.0, 175.0, 290.0, 270.0, 127.0, 42.0, 17.0, 4.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-9.382082498632371e-05, -7.435216684825718e-05, -5.488350143423304e-05, -3.54148396581877e-05, -1.5946177882142365e-05, 3.5224802559241652e-06, 2.299114566994831e-05, 4.2459811083972454e-05, 6.192846922203898e-05, 8.139712736010551e-05, 0.00010086579277412966, 0.0001203344581881538, 0.00013980311632622033, 0.00015927177446428686, 0.00017874044715426862, 0.00019820910529233515, 0.00021767776343040168, 0.0002371464215684682, 0.00025661507970653474, 0.0002760837378446013, 0.00029555242508649826, 0.0003150210832245648, 0.0003344897413626313, 0.00035395839950069785, 0.0003734270576387644, 0.0003928957157768309, 0.00041236437391489744, 0.00043183303205296397, 0.0004513016901910305, 0.00047077034832909703, 0.0004902390064671636, 0.000509707722812891, 0.0005291763227432966, 0.0005486449808813632, 0.0005681136390194297, 0.0005875822971574962, 0.0006070509552955627, 0.0006265196134336293, 0.0006459882715716958, 0.0006654569879174232, 0.0006849255878478289, 0.0007043942459858954, 0.0007238629041239619, 0.0007433315622620285, 0.000762800220400095, 0.0007822688785381615, 0.000801737536676228, 0.0008212062530219555, 0.000840674911160022, 0.0008601435692980886, 0.0008796122274361551, 0.0008990808855742216, 0.0009185495437122881, 0.0009380182018503547, 0.0009574868599884212, 0.0009769555181264877, 0.0009964242344722152, 0.0010158929508179426, 0.0010353615507483482, 0.0010548302670940757, 0.0010742988670244813, 0.0010937675833702087, 0.0011132361833006144, 0.0011327048996463418, 0.0011521734995767474]}, "gradients/decoder.transformer.h.22.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 1.0, 4.0, 2.0, 4.0, 7.0, 9.0, 6.0, 18.0, 13.0, 21.0, 12.0, 19.0, 25.0, 23.0, 22.0, 24.0, 27.0, 21.0, 34.0, 32.0, 31.0, 37.0, 52.0, 41.0, 38.0, 37.0, 45.0, 39.0, 28.0, 37.0, 38.0, 30.0, 38.0, 32.0, 23.0, 22.0, 16.0, 18.0, 10.0, 14.0, 7.0, 10.0, 11.0, 13.0, 2.0, 8.0, 2.0, 1.0, 2.0, 1.0, 2.0, 4.0, 2.0, 1.0], "bins": [-0.00039637088775634766, -0.0003847982734441757, -0.0003732256591320038, -0.00036165304481983185, -0.0003500804305076599, -0.000338507816195488, -0.00032693520188331604, -0.0003153625875711441, -0.00030378997325897217, -0.00029221735894680023, -0.0002806447446346283, -0.00026907213032245636, -0.0002574995160102844, -0.0002459269016981125, -0.00023435428738594055, -0.00022278167307376862, -0.00021120905876159668, -0.00019963644444942474, -0.0001880638301372528, -0.00017649121582508087, -0.00016491860151290894, -0.000153345987200737, -0.00014177337288856506, -0.00013020075857639313, -0.00011862814426422119, -0.00010705552995204926, -9.548291563987732e-05, -8.391030132770538e-05, -7.233768701553345e-05, -6.076507270336151e-05, -4.9192458391189575e-05, -3.761984407901764e-05, -2.6047229766845703e-05, -1.4474615454673767e-05, -2.902001142501831e-06, 8.670613169670105e-06, 2.024322748184204e-05, 3.181584179401398e-05, 4.338845610618591e-05, 5.496107041835785e-05, 6.653368473052979e-05, 7.810629904270172e-05, 8.967891335487366e-05, 0.0001012515276670456, 0.00011282414197921753, 0.00012439675629138947, 0.0001359693706035614, 0.00014754198491573334, 0.00015911459922790527, 0.0001706872135400772, 0.00018225982785224915, 0.00019383244216442108, 0.00020540505647659302, 0.00021697767078876495, 0.0002285502851009369, 0.00024012289941310883, 0.00025169551372528076, 0.0002632681280374527, 0.00027484074234962463, 0.00028641335666179657, 0.0002979859709739685, 0.00030955858528614044, 0.0003211311995983124, 0.0003327038139104843, 0.00034427642822265625]}, "gradients/decoder.transformer.h.22.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 4.0, 2.0, 0.0, 6.0, 6.0, 6.0, 5.0, 2.0, 8.0, 4.0, 14.0, 7.0, 12.0, 10.0, 26.0, 24.0, 30.0, 31.0, 21.0, 33.0, 43.0, 42.0, 37.0, 39.0, 51.0, 41.0, 42.0, 43.0, 41.0, 48.0, 38.0, 33.0, 28.0, 28.0, 35.0, 19.0, 24.0, 22.0, 19.0, 15.0, 15.0, 15.0, 5.0, 8.0, 9.0, 7.0, 7.0, 3.0, 5.0, 2.0, 0.0, 0.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.76171875, -4.60430908203125, -4.4468994140625, -4.28948974609375, -4.132080078125, -3.97467041015625, -3.8172607421875, -3.65985107421875, -3.50244140625, -3.34503173828125, -3.1876220703125, -3.03021240234375, -2.872802734375, -2.71539306640625, -2.5579833984375, -2.40057373046875, -2.2431640625, -2.08575439453125, -1.9283447265625, -1.77093505859375, -1.613525390625, -1.45611572265625, -1.2987060546875, -1.14129638671875, -0.98388671875, -0.82647705078125, -0.6690673828125, -0.51165771484375, -0.354248046875, -0.19683837890625, -0.0394287109375, 0.11798095703125, 0.275390625, 0.43280029296875, 0.5902099609375, 0.74761962890625, 0.905029296875, 1.06243896484375, 1.2198486328125, 1.37725830078125, 1.53466796875, 1.69207763671875, 1.8494873046875, 2.00689697265625, 2.164306640625, 2.32171630859375, 2.4791259765625, 2.63653564453125, 2.7939453125, 2.95135498046875, 3.1087646484375, 3.26617431640625, 3.423583984375, 3.58099365234375, 3.7384033203125, 3.89581298828125, 4.05322265625, 4.21063232421875, 4.3680419921875, 4.52545166015625, 4.682861328125, 4.84027099609375, 4.9976806640625, 5.15509033203125, 5.3125]}, "gradients/decoder.transformer.h.22.attn.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 5.0, 2.0, 9.0, 4.0, 5.0, 13.0, 15.0, 24.0, 24.0, 28.0, 36.0, 54.0, 74.0, 72.0, 134.0, 192.0, 221.0, 379.0, 586.0, 1002.0, 1862.0, 3826.0, 9041.0, 24359.0, 78787.0, 305932.0, 444189.0, 119997.0, 34729.0, 12320.0, 5028.0, 2287.0, 1248.0, 668.0, 379.0, 298.0, 186.0, 142.0, 91.0, 79.0, 67.0, 44.0, 35.0, 12.0, 22.0, 19.0, 15.0, 11.0, 6.0, 4.0, 1.0, 5.0, 0.0, 3.0, 2.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.57421875, -4.42742919921875, -4.2806396484375, -4.13385009765625, -3.987060546875, -3.84027099609375, -3.6934814453125, -3.54669189453125, -3.39990234375, -3.25311279296875, -3.1063232421875, -2.95953369140625, -2.812744140625, -2.66595458984375, -2.5191650390625, -2.37237548828125, -2.2255859375, -2.07879638671875, -1.9320068359375, -1.78521728515625, -1.638427734375, -1.49163818359375, -1.3448486328125, -1.19805908203125, -1.05126953125, -0.90447998046875, -0.7576904296875, -0.61090087890625, -0.464111328125, -0.31732177734375, -0.1705322265625, -0.02374267578125, 0.123046875, 0.26983642578125, 0.4166259765625, 0.56341552734375, 0.710205078125, 0.85699462890625, 1.0037841796875, 1.15057373046875, 1.29736328125, 1.44415283203125, 1.5909423828125, 1.73773193359375, 1.884521484375, 2.03131103515625, 2.1781005859375, 2.32489013671875, 2.4716796875, 2.61846923828125, 2.7652587890625, 2.91204833984375, 3.058837890625, 3.20562744140625, 3.3524169921875, 3.49920654296875, 3.64599609375, 3.79278564453125, 3.9395751953125, 4.08636474609375, 4.233154296875, 4.37994384765625, 4.5267333984375, 4.67352294921875, 4.8203125]}, "gradients/decoder.transformer.h.22.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 1.0, 0.0, 3.0, 1.0, 0.0, 2.0, 3.0, 4.0, 3.0, 8.0, 7.0, 11.0, 6.0, 17.0, 18.0, 19.0, 16.0, 24.0, 26.0, 19.0, 42.0, 22.0, 35.0, 37.0, 50.0, 46.0, 76.0, 186.0, 1724.0, 184.0, 54.0, 47.0, 41.0, 25.0, 36.0, 26.0, 28.0, 36.0, 22.0, 26.0, 21.0, 16.0, 20.0, 19.0, 8.0, 7.0, 9.0, 11.0, 4.0, 1.0, 5.0, 2.0, 1.0, 4.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0, 3.0], "bins": [-16.65625, -16.128173828125, -15.60009765625, -15.072021484375, -14.5439453125, -14.015869140625, -13.48779296875, -12.959716796875, -12.431640625, -11.903564453125, -11.37548828125, -10.847412109375, -10.3193359375, -9.791259765625, -9.26318359375, -8.735107421875, -8.20703125, -7.678955078125, -7.15087890625, -6.622802734375, -6.0947265625, -5.566650390625, -5.03857421875, -4.510498046875, -3.982421875, -3.454345703125, -2.92626953125, -2.398193359375, -1.8701171875, -1.342041015625, -0.81396484375, -0.285888671875, 0.2421875, 0.770263671875, 1.29833984375, 1.826416015625, 2.3544921875, 2.882568359375, 3.41064453125, 3.938720703125, 4.466796875, 4.994873046875, 5.52294921875, 6.051025390625, 6.5791015625, 7.107177734375, 7.63525390625, 8.163330078125, 8.69140625, 9.219482421875, 9.74755859375, 10.275634765625, 10.8037109375, 11.331787109375, 11.85986328125, 12.387939453125, 12.916015625, 13.444091796875, 13.97216796875, 14.500244140625, 15.0283203125, 15.556396484375, 16.08447265625, 16.612548828125, 17.140625]}, "gradients/decoder.transformer.h.22.attn.c_attn.weight": {"_type": "histogram", "values": [2.0, 3.0, 0.0, 1.0, 1.0, 1.0, 3.0, 2.0, 1.0, 3.0, 5.0, 1.0, 4.0, 7.0, 10.0, 8.0, 9.0, 20.0, 19.0, 18.0, 19.0, 29.0, 37.0, 32.0, 33.0, 49.0, 77.0, 99.0, 162.0, 351.0, 946.0, 12686.0, 3121754.0, 7599.0, 817.0, 312.0, 162.0, 85.0, 75.0, 39.0, 43.0, 30.0, 29.0, 24.0, 17.0, 21.0, 13.0, 13.0, 10.0, 9.0, 12.0, 7.0, 2.0, 3.0, 4.0, 2.0, 0.0, 0.0, 4.0, 0.0, 1.0, 0.0, 2.0, 1.0], "bins": [-39.78125, -38.55517578125, -37.3291015625, -36.10302734375, -34.876953125, -33.65087890625, -32.4248046875, -31.19873046875, -29.97265625, -28.74658203125, -27.5205078125, -26.29443359375, -25.068359375, -23.84228515625, -22.6162109375, -21.39013671875, -20.1640625, -18.93798828125, -17.7119140625, -16.48583984375, -15.259765625, -14.03369140625, -12.8076171875, -11.58154296875, -10.35546875, -9.12939453125, -7.9033203125, -6.67724609375, -5.451171875, -4.22509765625, -2.9990234375, -1.77294921875, -0.546875, 0.67919921875, 1.9052734375, 3.13134765625, 4.357421875, 5.58349609375, 6.8095703125, 8.03564453125, 9.26171875, 10.48779296875, 11.7138671875, 12.93994140625, 14.166015625, 15.39208984375, 16.6181640625, 17.84423828125, 19.0703125, 20.29638671875, 21.5224609375, 22.74853515625, 23.974609375, 25.20068359375, 26.4267578125, 27.65283203125, 28.87890625, 30.10498046875, 31.3310546875, 32.55712890625, 33.783203125, 35.00927734375, 36.2353515625, 37.46142578125, 38.6875]}, "gradients/decoder.transformer.h.22.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 28.0, 428.0, 507.0, 46.0, 2.0, 2.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-76.03648376464844, -74.2300796508789, -72.4236831665039, -70.61727905273438, -68.81087493896484, -67.00447082519531, -65.19807434082031, -63.39167022705078, -61.58526611328125, -59.778865814208984, -57.97246170043945, -56.16606140136719, -54.359657287597656, -52.55325698852539, -50.746856689453125, -48.940452575683594, -47.13405227661133, -45.32765197753906, -43.52124786376953, -41.714847564697266, -39.908443450927734, -38.10204315185547, -36.29563903808594, -34.48923873901367, -32.682838439941406, -30.876436233520508, -29.07003402709961, -27.263633728027344, -25.457229614257812, -23.650829315185547, -21.84442710876465, -20.03802490234375, -18.231624603271484, -16.425222396850586, -14.618820190429688, -12.812418937683105, -11.006016731262207, -9.199614524841309, -7.393213272094727, -5.586811065673828, -3.7804088592529297, -1.9740068912506104, -0.16760492324829102, 1.6387968063354492, 3.4451990127563477, 5.251601219177246, 7.058002471923828, 8.864404678344727, 10.670806884765625, 12.477209091186523, 14.283611297607422, 16.090011596679688, 17.89641571044922, 19.702816009521484, 21.509218215942383, 23.31562042236328, 25.12202262878418, 26.928424835205078, 28.734827041625977, 30.541229248046875, 32.34762954711914, 34.15403366088867, 35.96043395996094, 37.76683807373047, 39.573238372802734]}, "gradients/decoder.transformer.h.22.ln_1.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 4.0, 2.0, 3.0, 4.0, 3.0, 4.0, 3.0, 3.0, 12.0, 10.0, 10.0, 6.0, 11.0, 18.0, 20.0, 16.0, 23.0, 21.0, 39.0, 22.0, 37.0, 34.0, 36.0, 32.0, 48.0, 25.0, 44.0, 41.0, 39.0, 41.0, 36.0, 44.0, 32.0, 36.0, 26.0, 33.0, 34.0, 25.0, 20.0, 17.0, 18.0, 12.0, 15.0, 15.0, 9.0, 7.0, 9.0, 3.0, 3.0, 2.0, 0.0, 3.0, 3.0, 1.0, 1.0, 3.0], "bins": [-56.42034912109375, -54.805362701416016, -53.19037628173828, -51.57538986206055, -49.96040344238281, -48.34541320800781, -46.73042678833008, -45.115440368652344, -43.50045394897461, -41.885467529296875, -40.27048110961914, -38.655494689941406, -37.040504455566406, -35.42552185058594, -33.81053161621094, -32.1955451965332, -30.58055877685547, -28.965572357177734, -27.3505859375, -25.735597610473633, -24.1206111907959, -22.505624771118164, -20.890636444091797, -19.275650024414062, -17.660663604736328, -16.045677185058594, -14.430689811706543, -12.815702438354492, -11.200716018676758, -9.585729598999023, -7.970742225646973, -6.355754852294922, -4.7407684326171875, -3.125781536102295, -1.5107946395874023, 0.10419225692749023, 1.7191791534423828, 3.334165573120117, 4.949152946472168, 6.564140319824219, 8.179126739501953, 9.794113159179688, 11.409100532531738, 13.024087905883789, 14.639074325561523, 16.254060745239258, 17.869049072265625, 19.48403549194336, 21.099021911621094, 22.714008331298828, 24.328994750976562, 25.94398307800293, 27.558969497680664, 29.1739559173584, 30.788944244384766, 32.4039306640625, 34.018917083740234, 35.63390350341797, 37.2488899230957, 38.86387634277344, 40.47886657714844, 42.093849182128906, 43.708839416503906, 45.32382583618164, 46.938812255859375]}, "gradients/decoder.transformer.h.21.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 3.0, 0.0, 2.0, 3.0, 4.0, 4.0, 7.0, 3.0, 4.0, 5.0, 6.0, 6.0, 8.0, 12.0, 9.0, 17.0, 16.0, 26.0, 21.0, 35.0, 29.0, 28.0, 39.0, 32.0, 35.0, 42.0, 47.0, 36.0, 47.0, 34.0, 51.0, 40.0, 43.0, 34.0, 32.0, 28.0, 30.0, 25.0, 27.0, 17.0, 23.0, 18.0, 7.0, 17.0, 12.0, 12.0, 7.0, 11.0, 5.0, 3.0, 6.0, 4.0, 2.0, 1.0, 1.0, 1.0, 0.0, 3.0, 1.0, 0.0, 0.0, 1.0], "bins": [-4.74609375, -4.59173583984375, -4.4373779296875, -4.28302001953125, -4.128662109375, -3.97430419921875, -3.8199462890625, -3.66558837890625, -3.51123046875, -3.35687255859375, -3.2025146484375, -3.04815673828125, -2.893798828125, -2.73944091796875, -2.5850830078125, -2.43072509765625, -2.2763671875, -2.12200927734375, -1.9676513671875, -1.81329345703125, -1.658935546875, -1.50457763671875, -1.3502197265625, -1.19586181640625, -1.04150390625, -0.88714599609375, -0.7327880859375, -0.57843017578125, -0.424072265625, -0.26971435546875, -0.1153564453125, 0.03900146484375, 0.193359375, 0.34771728515625, 0.5020751953125, 0.65643310546875, 0.810791015625, 0.96514892578125, 1.1195068359375, 1.27386474609375, 1.42822265625, 1.58258056640625, 1.7369384765625, 1.89129638671875, 2.045654296875, 2.20001220703125, 2.3543701171875, 2.50872802734375, 2.6630859375, 2.81744384765625, 2.9718017578125, 3.12615966796875, 3.280517578125, 3.43487548828125, 3.5892333984375, 3.74359130859375, 3.89794921875, 4.05230712890625, 4.2066650390625, 4.36102294921875, 4.515380859375, 4.66973876953125, 4.8240966796875, 4.97845458984375, 5.1328125]}, "gradients/decoder.transformer.h.21.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 2.0, 2.0, 5.0, 3.0, 1.0, 11.0, 8.0, 8.0, 11.0, 15.0, 13.0, 19.0, 26.0, 39.0, 48.0, 81.0, 86.0, 138.0, 282.0, 632.0, 2139.0, 12728.0, 245761.0, 2767235.0, 1105776.0, 52793.0, 4485.0, 984.0, 339.0, 188.0, 105.0, 80.0, 48.0, 44.0, 36.0, 20.0, 18.0, 23.0, 17.0, 10.0, 10.0, 6.0, 5.0, 3.0, 3.0, 2.0, 3.0, 1.0, 0.0, 2.0, 1.0, 2.0, 0.0, 1.0], "bins": [-16.5625, -16.07177734375, -15.5810546875, -15.09033203125, -14.599609375, -14.10888671875, -13.6181640625, -13.12744140625, -12.63671875, -12.14599609375, -11.6552734375, -11.16455078125, -10.673828125, -10.18310546875, -9.6923828125, -9.20166015625, -8.7109375, -8.22021484375, -7.7294921875, -7.23876953125, -6.748046875, -6.25732421875, -5.7666015625, -5.27587890625, -4.78515625, -4.29443359375, -3.8037109375, -3.31298828125, -2.822265625, -2.33154296875, -1.8408203125, -1.35009765625, -0.859375, -0.36865234375, 0.1220703125, 0.61279296875, 1.103515625, 1.59423828125, 2.0849609375, 2.57568359375, 3.06640625, 3.55712890625, 4.0478515625, 4.53857421875, 5.029296875, 5.52001953125, 6.0107421875, 6.50146484375, 6.9921875, 7.48291015625, 7.9736328125, 8.46435546875, 8.955078125, 9.44580078125, 9.9365234375, 10.42724609375, 10.91796875, 11.40869140625, 11.8994140625, 12.39013671875, 12.880859375, 13.37158203125, 13.8623046875, 14.35302734375, 14.84375]}, "gradients/decoder.transformer.h.21.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 2.0, 2.0, 4.0, 3.0, 9.0, 8.0, 18.0, 18.0, 28.0, 26.0, 52.0, 67.0, 94.0, 114.0, 154.0, 225.0, 373.0, 469.0, 624.0, 510.0, 404.0, 256.0, 202.0, 121.0, 80.0, 53.0, 48.0, 36.0, 30.0, 18.0, 11.0, 6.0, 6.0, 6.0, 3.0, 4.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-22.859375, -22.199951171875, -21.54052734375, -20.881103515625, -20.2216796875, -19.562255859375, -18.90283203125, -18.243408203125, -17.583984375, -16.924560546875, -16.26513671875, -15.605712890625, -14.9462890625, -14.286865234375, -13.62744140625, -12.968017578125, -12.30859375, -11.649169921875, -10.98974609375, -10.330322265625, -9.6708984375, -9.011474609375, -8.35205078125, -7.692626953125, -7.033203125, -6.373779296875, -5.71435546875, -5.054931640625, -4.3955078125, -3.736083984375, -3.07666015625, -2.417236328125, -1.7578125, -1.098388671875, -0.43896484375, 0.220458984375, 0.8798828125, 1.539306640625, 2.19873046875, 2.858154296875, 3.517578125, 4.177001953125, 4.83642578125, 5.495849609375, 6.1552734375, 6.814697265625, 7.47412109375, 8.133544921875, 8.79296875, 9.452392578125, 10.11181640625, 10.771240234375, 11.4306640625, 12.090087890625, 12.74951171875, 13.408935546875, 14.068359375, 14.727783203125, 15.38720703125, 16.046630859375, 16.7060546875, 17.365478515625, 18.02490234375, 18.684326171875, 19.34375]}, "gradients/decoder.transformer.h.21.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 0.0, 0.0, 0.0, 4.0, 2.0, 4.0, 7.0, 5.0, 21.0, 19.0, 25.0, 33.0, 40.0, 67.0, 95.0, 121.0, 189.0, 295.0, 595.0, 2059.0, 439873.0, 3744932.0, 4056.0, 695.0, 360.0, 214.0, 159.0, 122.0, 81.0, 71.0, 40.0, 24.0, 18.0, 34.0, 10.0, 5.0, 5.0, 4.0, 4.0, 3.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 2.0], "bins": [-83.625, -81.2451171875, -78.865234375, -76.4853515625, -74.10546875, -71.7255859375, -69.345703125, -66.9658203125, -64.5859375, -62.2060546875, -59.826171875, -57.4462890625, -55.06640625, -52.6865234375, -50.306640625, -47.9267578125, -45.546875, -43.1669921875, -40.787109375, -38.4072265625, -36.02734375, -33.6474609375, -31.267578125, -28.8876953125, -26.5078125, -24.1279296875, -21.748046875, -19.3681640625, -16.98828125, -14.6083984375, -12.228515625, -9.8486328125, -7.46875, -5.0888671875, -2.708984375, -0.3291015625, 2.05078125, 4.4306640625, 6.810546875, 9.1904296875, 11.5703125, 13.9501953125, 16.330078125, 18.7099609375, 21.08984375, 23.4697265625, 25.849609375, 28.2294921875, 30.609375, 32.9892578125, 35.369140625, 37.7490234375, 40.12890625, 42.5087890625, 44.888671875, 47.2685546875, 49.6484375, 52.0283203125, 54.408203125, 56.7880859375, 59.16796875, 61.5478515625, 63.927734375, 66.3076171875, 68.6875]}, "gradients/decoder.transformer.h.21.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 5.0, 66.0, 396.0, 439.0, 107.0, 6.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-47.977500915527344, -42.07735061645508, -36.17719650268555, -30.27704620361328, -24.376893997192383, -18.476741790771484, -12.576591491699219, -6.6764373779296875, -0.7762870788574219, 5.123864650726318, 11.024016380310059, 16.92416763305664, 22.82431983947754, 28.724472045898438, 34.6246223449707, 40.524776458740234, 46.4249267578125, 52.325077056884766, 58.2252311706543, 64.12538146972656, 70.0255355834961, 75.92568969726562, 81.82583618164062, 87.72599029541016, 93.62614440917969, 99.52629852294922, 105.42644500732422, 111.32659912109375, 117.22675323486328, 123.12690734863281, 129.0270538330078, 134.92721557617188, 140.8273468017578, 146.7274932861328, 152.62765502929688, 158.52780151367188, 164.42794799804688, 170.32810974121094, 176.22825622558594, 182.12841796875, 188.028564453125, 193.9287109375, 199.82887268066406, 205.72901916503906, 211.62916564941406, 217.52932739257812, 223.42947387695312, 229.32962036132812, 235.22976684570312, 241.12991333007812, 247.0300750732422, 252.9302215576172, 258.83038330078125, 264.73052978515625, 270.63067626953125, 276.53082275390625, 282.4309997558594, 288.3311462402344, 294.2312927246094, 300.1314697265625, 306.0316162109375, 311.9317626953125, 317.8319091796875, 323.7320556640625, 329.6322021484375]}, "gradients/decoder.transformer.h.21.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 3.0, 5.0, 2.0, 3.0, 3.0, 3.0, 7.0, 11.0, 10.0, 12.0, 21.0, 16.0, 14.0, 16.0, 19.0, 31.0, 23.0, 23.0, 34.0, 36.0, 35.0, 42.0, 42.0, 53.0, 40.0, 42.0, 32.0, 41.0, 39.0, 49.0, 40.0, 45.0, 24.0, 30.0, 26.0, 20.0, 24.0, 17.0, 19.0, 20.0, 12.0, 7.0, 11.0, 4.0, 2.0, 5.0, 2.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 3.0], "bins": [-75.74655151367188, -73.60922241210938, -71.47189331054688, -69.3345718383789, -67.1972427368164, -65.0599136352539, -62.92258834838867, -60.78526306152344, -58.64793395996094, -56.51060485839844, -54.3732795715332, -52.23595428466797, -50.09862518310547, -47.96129608154297, -45.823970794677734, -43.6866455078125, -41.54931640625, -39.4119873046875, -37.274662017822266, -35.13733673095703, -33.00000762939453, -30.862680435180664, -28.725353240966797, -26.58802604675293, -24.450698852539062, -22.313371658325195, -20.176044464111328, -18.03871726989746, -15.901390075683594, -13.764062881469727, -11.62673568725586, -9.489408493041992, -7.352088928222656, -5.214761734008789, -3.077434539794922, -0.9401073455810547, 1.1972198486328125, 3.3345470428466797, 5.471874237060547, 7.609201431274414, 9.746528625488281, 11.883855819702148, 14.021183013916016, 16.158510208129883, 18.29583740234375, 20.433164596557617, 22.570491790771484, 24.70781898498535, 26.84514617919922, 28.982473373413086, 31.119800567626953, 33.25712585449219, 35.39445495605469, 37.53178405761719, 39.66910934448242, 41.806434631347656, 43.943763732910156, 46.081092834472656, 48.21841812133789, 50.355743408203125, 52.493072509765625, 54.630401611328125, 56.76772689819336, 58.905052185058594, 61.042381286621094]}, "gradients/decoder.transformer.h.21.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 1.0, 2.0, 2.0, 3.0, 4.0, 2.0, 7.0, 4.0, 3.0, 4.0, 4.0, 6.0, 10.0, 18.0, 14.0, 16.0, 20.0, 15.0, 22.0, 32.0, 31.0, 34.0, 32.0, 45.0, 38.0, 34.0, 44.0, 53.0, 40.0, 37.0, 50.0, 43.0, 36.0, 31.0, 33.0, 22.0, 26.0, 22.0, 30.0, 14.0, 22.0, 17.0, 13.0, 17.0, 14.0, 12.0, 8.0, 8.0, 5.0, 3.0, 1.0, 5.0, 0.0, 4.0, 2.0, 1.0, 1.0, 2.0, 0.0, 1.0, 1.0], "bins": [-4.9609375, -4.803466796875, -4.64599609375, -4.488525390625, -4.3310546875, -4.173583984375, -4.01611328125, -3.858642578125, -3.701171875, -3.543701171875, -3.38623046875, -3.228759765625, -3.0712890625, -2.913818359375, -2.75634765625, -2.598876953125, -2.44140625, -2.283935546875, -2.12646484375, -1.968994140625, -1.8115234375, -1.654052734375, -1.49658203125, -1.339111328125, -1.181640625, -1.024169921875, -0.86669921875, -0.709228515625, -0.5517578125, -0.394287109375, -0.23681640625, -0.079345703125, 0.078125, 0.235595703125, 0.39306640625, 0.550537109375, 0.7080078125, 0.865478515625, 1.02294921875, 1.180419921875, 1.337890625, 1.495361328125, 1.65283203125, 1.810302734375, 1.9677734375, 2.125244140625, 2.28271484375, 2.440185546875, 2.59765625, 2.755126953125, 2.91259765625, 3.070068359375, 3.2275390625, 3.385009765625, 3.54248046875, 3.699951171875, 3.857421875, 4.014892578125, 4.17236328125, 4.329833984375, 4.4873046875, 4.644775390625, 4.80224609375, 4.959716796875, 5.1171875]}, "gradients/decoder.transformer.h.21.crossattention.c_proj.weight": {"_type": "histogram", "values": [3.0, 3.0, 1.0, 6.0, 10.0, 11.0, 18.0, 25.0, 39.0, 38.0, 69.0, 95.0, 135.0, 193.0, 251.0, 360.0, 519.0, 699.0, 999.0, 1439.0, 1995.0, 2886.0, 4180.0, 6054.0, 8761.0, 13163.0, 19494.0, 29880.0, 46534.0, 77361.0, 145831.0, 324173.0, 146854.0, 78142.0, 46996.0, 30010.0, 19650.0, 12901.0, 8857.0, 5997.0, 4148.0, 2878.0, 1986.0, 1406.0, 1019.0, 733.0, 516.0, 361.0, 274.0, 169.0, 131.0, 93.0, 74.0, 40.0, 39.0, 21.0, 23.0, 13.0, 4.0, 6.0, 5.0, 3.0, 1.0, 1.0], "bins": [-1.2099609375, -1.1715545654296875, -1.133148193359375, -1.0947418212890625, -1.05633544921875, -1.0179290771484375, -0.979522705078125, -0.9411163330078125, -0.9027099609375, -0.8643035888671875, -0.825897216796875, -0.7874908447265625, -0.74908447265625, -0.7106781005859375, -0.672271728515625, -0.6338653564453125, -0.595458984375, -0.5570526123046875, -0.518646240234375, -0.4802398681640625, -0.44183349609375, -0.4034271240234375, -0.365020751953125, -0.3266143798828125, -0.2882080078125, -0.2498016357421875, -0.211395263671875, -0.1729888916015625, -0.13458251953125, -0.0961761474609375, -0.057769775390625, -0.0193634033203125, 0.01904296875, 0.0574493408203125, 0.095855712890625, 0.1342620849609375, 0.17266845703125, 0.2110748291015625, 0.249481201171875, 0.2878875732421875, 0.3262939453125, 0.3647003173828125, 0.403106689453125, 0.4415130615234375, 0.47991943359375, 0.5183258056640625, 0.556732177734375, 0.5951385498046875, 0.633544921875, 0.6719512939453125, 0.710357666015625, 0.7487640380859375, 0.78717041015625, 0.8255767822265625, 0.863983154296875, 0.9023895263671875, 0.9407958984375, 0.9792022705078125, 1.017608642578125, 1.0560150146484375, 1.09442138671875, 1.1328277587890625, 1.171234130859375, 1.2096405029296875, 1.248046875]}, "gradients/decoder.transformer.h.21.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 3.0, 4.0, 2.0, 6.0, 8.0, 4.0, 5.0, 10.0, 14.0, 16.0, 15.0, 15.0, 28.0, 24.0, 31.0, 24.0, 37.0, 30.0, 31.0, 38.0, 32.0, 42.0, 42.0, 1062.0, 38.0, 57.0, 42.0, 39.0, 38.0, 30.0, 31.0, 44.0, 35.0, 18.0, 18.0, 24.0, 20.0, 20.0, 9.0, 12.0, 9.0, 4.0, 12.0, 6.0, 9.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-3.453125, -3.3465576171875, -3.239990234375, -3.1334228515625, -3.02685546875, -2.9202880859375, -2.813720703125, -2.7071533203125, -2.6005859375, -2.4940185546875, -2.387451171875, -2.2808837890625, -2.17431640625, -2.0677490234375, -1.961181640625, -1.8546142578125, -1.748046875, -1.6414794921875, -1.534912109375, -1.4283447265625, -1.32177734375, -1.2152099609375, -1.108642578125, -1.0020751953125, -0.8955078125, -0.7889404296875, -0.682373046875, -0.5758056640625, -0.46923828125, -0.3626708984375, -0.256103515625, -0.1495361328125, -0.04296875, 0.0635986328125, 0.170166015625, 0.2767333984375, 0.38330078125, 0.4898681640625, 0.596435546875, 0.7030029296875, 0.8095703125, 0.9161376953125, 1.022705078125, 1.1292724609375, 1.23583984375, 1.3424072265625, 1.448974609375, 1.5555419921875, 1.662109375, 1.7686767578125, 1.875244140625, 1.9818115234375, 2.08837890625, 2.1949462890625, 2.301513671875, 2.4080810546875, 2.5146484375, 2.6212158203125, 2.727783203125, 2.8343505859375, 2.94091796875, 3.0474853515625, 3.154052734375, 3.2606201171875, 3.3671875]}, "gradients/decoder.transformer.h.21.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 10.0, 9.0, 14.0, 6.0, 14.0, 25.0, 36.0, 49.0, 91.0, 148.0, 247.0, 416.0, 754.0, 1345.0, 2625.0, 4650.0, 8965.0, 17006.0, 34297.0, 70523.0, 157951.0, 1479280.0, 169716.0, 74463.0, 36155.0, 18022.0, 9390.0, 4859.0, 2751.0, 1367.0, 855.0, 465.0, 251.0, 123.0, 90.0, 62.0, 43.0, 22.0, 12.0, 6.0, 9.0, 9.0, 2.0, 6.0, 2.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.845703125, -1.78717041015625, -1.7286376953125, -1.67010498046875, -1.611572265625, -1.55303955078125, -1.4945068359375, -1.43597412109375, -1.37744140625, -1.31890869140625, -1.2603759765625, -1.20184326171875, -1.143310546875, -1.08477783203125, -1.0262451171875, -0.96771240234375, -0.9091796875, -0.85064697265625, -0.7921142578125, -0.73358154296875, -0.675048828125, -0.61651611328125, -0.5579833984375, -0.49945068359375, -0.44091796875, -0.38238525390625, -0.3238525390625, -0.26531982421875, -0.206787109375, -0.14825439453125, -0.0897216796875, -0.03118896484375, 0.02734375, 0.08587646484375, 0.1444091796875, 0.20294189453125, 0.261474609375, 0.32000732421875, 0.3785400390625, 0.43707275390625, 0.49560546875, 0.55413818359375, 0.6126708984375, 0.67120361328125, 0.729736328125, 0.78826904296875, 0.8468017578125, 0.90533447265625, 0.9638671875, 1.02239990234375, 1.0809326171875, 1.13946533203125, 1.197998046875, 1.25653076171875, 1.3150634765625, 1.37359619140625, 1.43212890625, 1.49066162109375, 1.5491943359375, 1.60772705078125, 1.666259765625, 1.72479248046875, 1.7833251953125, 1.84185791015625, 1.900390625]}, "gradients/decoder.transformer.h.21.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 5.0, 2.0, 0.0, 1.0, 3.0, 2.0, 3.0, 3.0, 2.0, 6.0, 10.0, 8.0, 14.0, 18.0, 20.0, 26.0, 19.0, 29.0, 35.0, 37.0, 49.0, 71.0, 62.0, 50.0, 52.0, 64.0, 55.0, 52.0, 45.0, 44.0, 55.0, 25.0, 31.0, 17.0, 16.0, 14.0, 17.0, 19.0, 13.0, 7.0, 2.0, 3.0, 3.0, 5.0, 3.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.00106048583984375, -0.00102977454662323, -0.00099906325340271, -0.0009683519601821899, -0.0009376406669616699, -0.0009069293737411499, -0.0008762180805206299, -0.0008455067873001099, -0.0008147954940795898, -0.0007840842008590698, -0.0007533729076385498, -0.0007226616144180298, -0.0006919503211975098, -0.0006612390279769897, -0.0006305277347564697, -0.0005998164415359497, -0.0005691051483154297, -0.0005383938550949097, -0.0005076825618743896, -0.00047697126865386963, -0.0004462599754333496, -0.0004155486822128296, -0.00038483738899230957, -0.00035412609577178955, -0.00032341480255126953, -0.0002927035093307495, -0.0002619922161102295, -0.00023128092288970947, -0.00020056962966918945, -0.00016985833644866943, -0.00013914704322814941, -0.0001084357500076294, -7.772445678710938e-05, -4.7013163566589355e-05, -1.6301870346069336e-05, 1.4409422874450684e-05, 4.51207160949707e-05, 7.583200931549072e-05, 0.00010654330253601074, 0.00013725459575653076, 0.00016796588897705078, 0.0001986771821975708, 0.00022938847541809082, 0.00026009976863861084, 0.00029081106185913086, 0.0003215223550796509, 0.0003522336483001709, 0.0003829449415206909, 0.00041365623474121094, 0.00044436752796173096, 0.000475078821182251, 0.000505790114402771, 0.000536501407623291, 0.000567212700843811, 0.0005979239940643311, 0.0006286352872848511, 0.0006593465805053711, 0.0006900578737258911, 0.0007207691669464111, 0.0007514804601669312, 0.0007821917533874512, 0.0008129030466079712, 0.0008436143398284912, 0.0008743256330490112, 0.0009050369262695312]}, "gradients/decoder.transformer.h.21.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 4.0, 5.0, 4.0, 4.0, 4.0, 7.0, 9.0, 17.0, 16.0, 21.0, 15.0, 16.0, 31.0, 47.0, 39.0, 50.0, 69.0, 102.0, 144.0, 289.0, 646.0, 276027.0, 769418.0, 659.0, 271.0, 175.0, 119.0, 89.0, 52.0, 43.0, 23.0, 26.0, 22.0, 18.0, 18.0, 16.0, 16.0, 7.0, 4.0, 6.0, 4.0, 2.0, 2.0, 4.0, 3.0, 2.0, 0.0, 2.0, 1.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-0.0225677490234375, -0.021863222122192383, -0.021158695220947266, -0.02045416831970215, -0.01974964141845703, -0.019045114517211914, -0.018340587615966797, -0.01763606071472168, -0.016931533813476562, -0.016227006912231445, -0.015522480010986328, -0.014817953109741211, -0.014113426208496094, -0.013408899307250977, -0.01270437240600586, -0.011999845504760742, -0.011295318603515625, -0.010590791702270508, -0.00988626480102539, -0.009181737899780273, -0.008477210998535156, -0.007772684097290039, -0.007068157196044922, -0.006363630294799805, -0.0056591033935546875, -0.00495457649230957, -0.004250049591064453, -0.003545522689819336, -0.0028409957885742188, -0.0021364688873291016, -0.0014319419860839844, -0.0007274150848388672, -2.288818359375e-05, 0.0006816387176513672, 0.0013861656188964844, 0.0020906925201416016, 0.0027952194213867188, 0.003499746322631836, 0.004204273223876953, 0.00490880012512207, 0.0056133270263671875, 0.006317853927612305, 0.007022380828857422, 0.007726907730102539, 0.008431434631347656, 0.009135961532592773, 0.00984048843383789, 0.010545015335083008, 0.011249542236328125, 0.011954069137573242, 0.01265859603881836, 0.013363122940063477, 0.014067649841308594, 0.014772176742553711, 0.015476703643798828, 0.016181230545043945, 0.016885757446289062, 0.01759028434753418, 0.018294811248779297, 0.018999338150024414, 0.01970386505126953, 0.02040839195251465, 0.021112918853759766, 0.021817445755004883, 0.02252197265625]}, "gradients/decoder.transformer.h.21.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 3.0, 9.0, 689.0, 319.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.003000686876475811, -0.0028889442328363657, -0.0027772013563662767, -0.0026654587127268314, -0.0025537158362567425, -0.002441973192617297, -0.002330230548977852, -0.002218487672507763, -0.002106744796037674, -0.0019950021523982286, -0.0018832592759281397, -0.0017715166322886944, -0.0016597737558186054, -0.0015480311121791601, -0.001436288352124393, -0.0013245455920696259, -0.0012128029484301805, -0.0011010601883754134, -0.0009893174283206463, -0.0008775747264735401, -0.0007658319664187729, -0.0006540892063640058, -0.0005423465045168996, -0.00043060374446213245, -0.0003188609844073653, -0.00020711823890451342, -9.537549340166152e-05, 1.636723754927516e-05, 0.0001281099976040423, 0.00023985275765880942, 0.00035159545950591564, 0.0004633382195606828, 0.0005750809796154499, 0.000686823739670217, 0.0007985664997249842, 0.0009103092015720904, 0.0010220520198345184, 0.0011337946634739637, 0.0012455374235287309, 0.001357280183583498, 0.0014690229436382651, 0.0015807657036930323, 0.0016925084637477994, 0.0018042512238025665, 0.0019159938674420118, 0.002027736743912101, 0.002139479387551546, 0.002251222264021635, 0.0023629649076610804, 0.0024747075513005257, 0.0025864504277706146, 0.00269819307141006, 0.002809935947880149, 0.002921678591519594, 0.003033421467989683, 0.0031451641116291285, 0.0032569067552685738, 0.003368649398908019, 0.003480392275378108, 0.0035921349190175533, 0.0037038777954876423, 0.0038156204391270876, 0.003927363082766533, 0.004039105959236622, 0.004150848835706711]}, "gradients/decoder.transformer.h.21.ln_cross_attn.bias": {"_type": "histogram", "values": [4.0, 0.0, 1.0, 2.0, 1.0, 4.0, 4.0, 5.0, 6.0, 7.0, 6.0, 6.0, 8.0, 16.0, 18.0, 17.0, 20.0, 23.0, 30.0, 27.0, 22.0, 27.0, 32.0, 32.0, 37.0, 39.0, 38.0, 41.0, 40.0, 43.0, 37.0, 26.0, 33.0, 48.0, 36.0, 33.0, 20.0, 26.0, 20.0, 27.0, 21.0, 15.0, 18.0, 17.0, 11.0, 13.0, 5.0, 12.0, 6.0, 6.0, 6.0, 6.0, 6.0, 4.0, 2.0, 2.0, 2.0, 3.0, 2.0, 0.0, 2.0, 0.0, 2.0, 1.0], "bins": [-0.0004120469093322754, -0.0003979327157139778, -0.00038381852209568024, -0.00036970432847738266, -0.0003555901348590851, -0.0003414759412407875, -0.00032736174762248993, -0.00031324755400419235, -0.0002991333603858948, -0.0002850191667675972, -0.0002709049731492996, -0.00025679077953100204, -0.00024267658591270447, -0.0002285623922944069, -0.00021444819867610931, -0.00020033400505781174, -0.00018621981143951416, -0.00017210561782121658, -0.000157991424202919, -0.00014387723058462143, -0.00012976303696632385, -0.00011564884334802628, -0.0001015346497297287, -8.742045611143112e-05, -7.330626249313354e-05, -5.919206887483597e-05, -4.507787525653839e-05, -3.0963681638240814e-05, -1.6849488019943237e-05, -2.7352944016456604e-06, 1.1378899216651917e-05, 2.5493092834949493e-05, 3.960728645324707e-05, 5.372148007154465e-05, 6.783567368984222e-05, 8.19498673081398e-05, 9.606406092643738e-05, 0.00011017825454473495, 0.00012429244816303253, 0.0001384066417813301, 0.00015252083539962769, 0.00016663502901792526, 0.00018074922263622284, 0.00019486341625452042, 0.000208977609872818, 0.00022309180349111557, 0.00023720599710941315, 0.0002513201907277107, 0.0002654343843460083, 0.0002795485779643059, 0.00029366277158260345, 0.00030777696520090103, 0.0003218911588191986, 0.0003360053524374962, 0.00035011954605579376, 0.00036423373967409134, 0.0003783479332923889, 0.0003924621269106865, 0.00040657632052898407, 0.00042069051414728165, 0.0004348047077655792, 0.0004489189013838768, 0.0004630330950021744, 0.00047714728862047195, 0.0004912614822387695]}, "gradients/decoder.transformer.h.21.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 1.0, 2.0, 2.0, 3.0, 4.0, 2.0, 7.0, 4.0, 3.0, 4.0, 4.0, 6.0, 10.0, 18.0, 14.0, 16.0, 20.0, 15.0, 22.0, 32.0, 31.0, 34.0, 32.0, 45.0, 38.0, 34.0, 44.0, 53.0, 40.0, 37.0, 50.0, 43.0, 36.0, 31.0, 33.0, 22.0, 26.0, 22.0, 30.0, 14.0, 22.0, 17.0, 13.0, 17.0, 14.0, 12.0, 8.0, 8.0, 5.0, 3.0, 1.0, 5.0, 0.0, 4.0, 2.0, 1.0, 1.0, 2.0, 0.0, 1.0, 1.0], "bins": [-4.9609375, -4.803466796875, -4.64599609375, -4.488525390625, -4.3310546875, -4.173583984375, -4.01611328125, -3.858642578125, -3.701171875, -3.543701171875, -3.38623046875, -3.228759765625, -3.0712890625, -2.913818359375, -2.75634765625, -2.598876953125, -2.44140625, -2.283935546875, -2.12646484375, -1.968994140625, -1.8115234375, -1.654052734375, -1.49658203125, -1.339111328125, -1.181640625, -1.024169921875, -0.86669921875, -0.709228515625, -0.5517578125, -0.394287109375, -0.23681640625, -0.079345703125, 0.078125, 0.235595703125, 0.39306640625, 0.550537109375, 0.7080078125, 0.865478515625, 1.02294921875, 1.180419921875, 1.337890625, 1.495361328125, 1.65283203125, 1.810302734375, 1.9677734375, 2.125244140625, 2.28271484375, 2.440185546875, 2.59765625, 2.755126953125, 2.91259765625, 3.070068359375, 3.2275390625, 3.385009765625, 3.54248046875, 3.699951171875, 3.857421875, 4.014892578125, 4.17236328125, 4.329833984375, 4.4873046875, 4.644775390625, 4.80224609375, 4.959716796875, 5.1171875]}, "gradients/decoder.transformer.h.21.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 3.0, 6.0, 2.0, 3.0, 9.0, 8.0, 5.0, 12.0, 20.0, 24.0, 42.0, 61.0, 95.0, 104.0, 194.0, 283.0, 464.0, 700.0, 1213.0, 2268.0, 4130.0, 8592.0, 19686.0, 49087.0, 140041.0, 454050.0, 238867.0, 75086.0, 28889.0, 12255.0, 5511.0, 2793.0, 1522.0, 914.0, 540.0, 351.0, 224.0, 147.0, 110.0, 73.0, 52.0, 34.0, 33.0, 22.0, 16.0, 4.0, 5.0, 3.0, 4.0, 3.0, 2.0, 3.0, 0.0, 1.0, 2.0, 1.0, 0.0, 1.0], "bins": [-3.35546875, -3.249664306640625, -3.14385986328125, -3.038055419921875, -2.9322509765625, -2.826446533203125, -2.72064208984375, -2.614837646484375, -2.509033203125, -2.403228759765625, -2.29742431640625, -2.191619873046875, -2.0858154296875, -1.980010986328125, -1.87420654296875, -1.768402099609375, -1.66259765625, -1.556793212890625, -1.45098876953125, -1.345184326171875, -1.2393798828125, -1.133575439453125, -1.02777099609375, -0.921966552734375, -0.816162109375, -0.710357666015625, -0.60455322265625, -0.498748779296875, -0.3929443359375, -0.287139892578125, -0.18133544921875, -0.075531005859375, 0.0302734375, 0.136077880859375, 0.24188232421875, 0.347686767578125, 0.4534912109375, 0.559295654296875, 0.66510009765625, 0.770904541015625, 0.876708984375, 0.982513427734375, 1.08831787109375, 1.194122314453125, 1.2999267578125, 1.405731201171875, 1.51153564453125, 1.617340087890625, 1.72314453125, 1.828948974609375, 1.93475341796875, 2.040557861328125, 2.1463623046875, 2.252166748046875, 2.35797119140625, 2.463775634765625, 2.569580078125, 2.675384521484375, 2.78118896484375, 2.886993408203125, 2.9927978515625, 3.098602294921875, 3.20440673828125, 3.310211181640625, 3.416015625]}, "gradients/decoder.transformer.h.21.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 4.0, 6.0, 5.0, 3.0, 11.0, 4.0, 12.0, 12.0, 12.0, 15.0, 25.0, 21.0, 24.0, 29.0, 36.0, 42.0, 40.0, 49.0, 58.0, 59.0, 175.0, 1865.0, 106.0, 40.0, 53.0, 40.0, 37.0, 36.0, 31.0, 39.0, 28.0, 22.0, 21.0, 19.0, 22.0, 8.0, 13.0, 7.0, 8.0, 10.0, 6.0, 0.0, 2.0, 1.0, 1.0, 5.0, 1.0, 2.0], "bins": [-23.109375, -22.4912109375, -21.873046875, -21.2548828125, -20.63671875, -20.0185546875, -19.400390625, -18.7822265625, -18.1640625, -17.5458984375, -16.927734375, -16.3095703125, -15.69140625, -15.0732421875, -14.455078125, -13.8369140625, -13.21875, -12.6005859375, -11.982421875, -11.3642578125, -10.74609375, -10.1279296875, -9.509765625, -8.8916015625, -8.2734375, -7.6552734375, -7.037109375, -6.4189453125, -5.80078125, -5.1826171875, -4.564453125, -3.9462890625, -3.328125, -2.7099609375, -2.091796875, -1.4736328125, -0.85546875, -0.2373046875, 0.380859375, 0.9990234375, 1.6171875, 2.2353515625, 2.853515625, 3.4716796875, 4.08984375, 4.7080078125, 5.326171875, 5.9443359375, 6.5625, 7.1806640625, 7.798828125, 8.4169921875, 9.03515625, 9.6533203125, 10.271484375, 10.8896484375, 11.5078125, 12.1259765625, 12.744140625, 13.3623046875, 13.98046875, 14.5986328125, 15.216796875, 15.8349609375, 16.453125]}, "gradients/decoder.transformer.h.21.attn.c_attn.weight": {"_type": "histogram", "values": [2.0, 1.0, 3.0, 2.0, 3.0, 1.0, 2.0, 6.0, 7.0, 7.0, 8.0, 11.0, 8.0, 22.0, 15.0, 25.0, 24.0, 26.0, 38.0, 57.0, 58.0, 92.0, 154.0, 323.0, 611.0, 2809.0, 2673919.0, 464093.0, 2045.0, 580.0, 261.0, 132.0, 96.0, 59.0, 41.0, 29.0, 30.0, 26.0, 16.0, 17.0, 9.0, 13.0, 10.0, 9.0, 5.0, 1.0, 7.0, 5.0, 3.0, 1.0, 2.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-27.96875, -26.93017578125, -25.8916015625, -24.85302734375, -23.814453125, -22.77587890625, -21.7373046875, -20.69873046875, -19.66015625, -18.62158203125, -17.5830078125, -16.54443359375, -15.505859375, -14.46728515625, -13.4287109375, -12.39013671875, -11.3515625, -10.31298828125, -9.2744140625, -8.23583984375, -7.197265625, -6.15869140625, -5.1201171875, -4.08154296875, -3.04296875, -2.00439453125, -0.9658203125, 0.07275390625, 1.111328125, 2.14990234375, 3.1884765625, 4.22705078125, 5.265625, 6.30419921875, 7.3427734375, 8.38134765625, 9.419921875, 10.45849609375, 11.4970703125, 12.53564453125, 13.57421875, 14.61279296875, 15.6513671875, 16.68994140625, 17.728515625, 18.76708984375, 19.8056640625, 20.84423828125, 21.8828125, 22.92138671875, 23.9599609375, 24.99853515625, 26.037109375, 27.07568359375, 28.1142578125, 29.15283203125, 30.19140625, 31.22998046875, 32.2685546875, 33.30712890625, 34.345703125, 35.38427734375, 36.4228515625, 37.46142578125, 38.5]}, "gradients/decoder.transformer.h.21.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 1.0, 2.0, 4.0, 5.0, 1.0, 12.0, 8.0, 20.0, 22.0, 27.0, 56.0, 61.0, 67.0, 91.0, 84.0, 119.0, 87.0, 79.0, 75.0, 58.0, 46.0, 27.0, 20.0, 20.0, 12.0, 2.0, 3.0, 2.0, 1.0, 3.0, 1.0], "bins": [-14.43320369720459, -14.134761810302734, -13.836319923400879, -13.537878036499023, -13.239436149597168, -12.940994262695312, -12.642552375793457, -12.344110488891602, -12.04566764831543, -11.747225761413574, -11.448783874511719, -11.150341987609863, -10.851900100708008, -10.553458213806152, -10.255016326904297, -9.956573486328125, -9.658132553100586, -9.35969066619873, -9.061248779296875, -8.76280689239502, -8.464365005493164, -8.165923118591309, -7.867480754852295, -7.5690388679504395, -7.270596981048584, -6.9721550941467285, -6.673713207244873, -6.375271320343018, -6.076828956604004, -5.778387069702148, -5.479945182800293, -5.1815032958984375, -4.883060455322266, -4.58461856842041, -4.286176681518555, -3.98773455619812, -3.6892926692962646, -3.390850782394409, -3.0924086570739746, -2.793966770172119, -2.4955248832702637, -2.197082996368408, -1.8986409902572632, -1.6001989841461182, -1.3017570972442627, -1.0033152103424072, -0.7048732042312622, -0.4064311981201172, -0.10798931121826172, 0.19045263528823853, 0.48889458179473877, 0.787336528301239, 1.0857784748077393, 1.3842203617095947, 1.6826623678207397, 1.9811043739318848, 2.2795462608337402, 2.5779881477355957, 2.876430034637451, 3.1748721599578857, 3.473314046859741, 3.7717559337615967, 4.070198059082031, 4.368639945983887, 4.667081832885742]}, "gradients/decoder.transformer.h.21.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 3.0, 3.0, 6.0, 3.0, 7.0, 5.0, 9.0, 5.0, 11.0, 13.0, 14.0, 13.0, 25.0, 27.0, 19.0, 36.0, 24.0, 34.0, 37.0, 38.0, 44.0, 48.0, 50.0, 48.0, 48.0, 42.0, 48.0, 49.0, 31.0, 32.0, 21.0, 36.0, 27.0, 24.0, 29.0, 17.0, 14.0, 10.0, 15.0, 12.0, 8.0, 7.0, 4.0, 5.0, 6.0, 3.0, 1.0, 4.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-62.50450134277344, -60.536373138427734, -58.5682487487793, -56.600120544433594, -54.631996154785156, -52.66386795043945, -50.695743560791016, -48.72761535644531, -46.759490966796875, -44.79136276245117, -42.823238372802734, -40.85511016845703, -38.886985778808594, -36.91885757446289, -34.95073318481445, -32.98260498046875, -31.01447868347168, -29.04635238647461, -27.07822608947754, -25.11009979248047, -23.1419734954834, -21.173847198486328, -19.205718994140625, -17.237594604492188, -15.2694673538208, -13.30134105682373, -11.33321475982666, -9.365087509155273, -7.396961688995361, -5.428834915161133, -3.4607086181640625, -1.4925823211669922, 0.4755439758300781, 2.4436702728271484, 4.411796569824219, 6.379923343658447, 8.34804916381836, 10.316176414489746, 12.284302711486816, 14.252429008483887, 16.22055435180664, 18.18868064880371, 20.15680694580078, 22.12493324279785, 24.093059539794922, 26.061187744140625, 28.029312133789062, 29.997440338134766, 31.965566635131836, 33.933692932128906, 35.90182113647461, 37.86994552612305, 39.83807373046875, 41.80619812011719, 43.77432632446289, 45.74245071411133, 47.71057891845703, 49.678707122802734, 51.64683151245117, 53.614959716796875, 55.58308410644531, 57.551212310791016, 59.51933670043945, 61.487464904785156, 63.455589294433594]}, "gradients/decoder.transformer.h.20.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 2.0, 2.0, 3.0, 2.0, 4.0, 4.0, 4.0, 7.0, 1.0, 5.0, 5.0, 8.0, 7.0, 17.0, 10.0, 23.0, 15.0, 20.0, 22.0, 26.0, 35.0, 30.0, 33.0, 41.0, 40.0, 29.0, 55.0, 32.0, 45.0, 33.0, 59.0, 37.0, 45.0, 27.0, 31.0, 24.0, 36.0, 21.0, 20.0, 22.0, 14.0, 18.0, 16.0, 17.0, 14.0, 10.0, 6.0, 11.0, 8.0, 4.0, 3.0, 2.0, 1.0, 3.0, 5.0, 1.0, 2.0, 0.0, 1.0, 3.0], "bins": [-5.17578125, -5.0174560546875, -4.859130859375, -4.7008056640625, -4.54248046875, -4.3841552734375, -4.225830078125, -4.0675048828125, -3.9091796875, -3.7508544921875, -3.592529296875, -3.4342041015625, -3.27587890625, -3.1175537109375, -2.959228515625, -2.8009033203125, -2.642578125, -2.4842529296875, -2.325927734375, -2.1676025390625, -2.00927734375, -1.8509521484375, -1.692626953125, -1.5343017578125, -1.3759765625, -1.2176513671875, -1.059326171875, -0.9010009765625, -0.74267578125, -0.5843505859375, -0.426025390625, -0.2677001953125, -0.109375, 0.0489501953125, 0.207275390625, 0.3656005859375, 0.52392578125, 0.6822509765625, 0.840576171875, 0.9989013671875, 1.1572265625, 1.3155517578125, 1.473876953125, 1.6322021484375, 1.79052734375, 1.9488525390625, 2.107177734375, 2.2655029296875, 2.423828125, 2.5821533203125, 2.740478515625, 2.8988037109375, 3.05712890625, 3.2154541015625, 3.373779296875, 3.5321044921875, 3.6904296875, 3.8487548828125, 4.007080078125, 4.1654052734375, 4.32373046875, 4.4820556640625, 4.640380859375, 4.7987060546875, 4.95703125]}, "gradients/decoder.transformer.h.20.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 3.0, 6.0, 1.0, 4.0, 3.0, 5.0, 5.0, 4.0, 3.0, 5.0, 8.0, 10.0, 11.0, 17.0, 14.0, 21.0, 18.0, 24.0, 34.0, 37.0, 25.0, 32.0, 47.0, 61.0, 193.0, 1555.0, 377972.0, 3794261.0, 18916.0, 528.0, 119.0, 52.0, 29.0, 23.0, 34.0, 27.0, 23.0, 26.0, 18.0, 14.0, 18.0, 13.0, 20.0, 10.0, 15.0, 7.0, 6.0, 4.0, 5.0, 2.0, 6.0, 0.0, 2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0], "bins": [-35.4375, -34.33984375, -33.2421875, -32.14453125, -31.046875, -29.94921875, -28.8515625, -27.75390625, -26.65625, -25.55859375, -24.4609375, -23.36328125, -22.265625, -21.16796875, -20.0703125, -18.97265625, -17.875, -16.77734375, -15.6796875, -14.58203125, -13.484375, -12.38671875, -11.2890625, -10.19140625, -9.09375, -7.99609375, -6.8984375, -5.80078125, -4.703125, -3.60546875, -2.5078125, -1.41015625, -0.3125, 0.78515625, 1.8828125, 2.98046875, 4.078125, 5.17578125, 6.2734375, 7.37109375, 8.46875, 9.56640625, 10.6640625, 11.76171875, 12.859375, 13.95703125, 15.0546875, 16.15234375, 17.25, 18.34765625, 19.4453125, 20.54296875, 21.640625, 22.73828125, 23.8359375, 24.93359375, 26.03125, 27.12890625, 28.2265625, 29.32421875, 30.421875, 31.51953125, 32.6171875, 33.71484375, 34.8125]}, "gradients/decoder.transformer.h.20.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 3.0, 2.0, 4.0, 4.0, 3.0, 6.0, 10.0, 15.0, 15.0, 30.0, 26.0, 41.0, 45.0, 53.0, 100.0, 121.0, 188.0, 274.0, 338.0, 499.0, 581.0, 503.0, 357.0, 259.0, 171.0, 114.0, 84.0, 61.0, 42.0, 29.0, 40.0, 15.0, 11.0, 11.0, 10.0, 7.0, 5.0, 8.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-16.0, -15.458251953125, -14.91650390625, -14.374755859375, -13.8330078125, -13.291259765625, -12.74951171875, -12.207763671875, -11.666015625, -11.124267578125, -10.58251953125, -10.040771484375, -9.4990234375, -8.957275390625, -8.41552734375, -7.873779296875, -7.33203125, -6.790283203125, -6.24853515625, -5.706787109375, -5.1650390625, -4.623291015625, -4.08154296875, -3.539794921875, -2.998046875, -2.456298828125, -1.91455078125, -1.372802734375, -0.8310546875, -0.289306640625, 0.25244140625, 0.794189453125, 1.3359375, 1.877685546875, 2.41943359375, 2.961181640625, 3.5029296875, 4.044677734375, 4.58642578125, 5.128173828125, 5.669921875, 6.211669921875, 6.75341796875, 7.295166015625, 7.8369140625, 8.378662109375, 8.92041015625, 9.462158203125, 10.00390625, 10.545654296875, 11.08740234375, 11.629150390625, 12.1708984375, 12.712646484375, 13.25439453125, 13.796142578125, 14.337890625, 14.879638671875, 15.42138671875, 15.963134765625, 16.5048828125, 17.046630859375, 17.58837890625, 18.130126953125, 18.671875]}, "gradients/decoder.transformer.h.20.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 3.0, 2.0, 4.0, 8.0, 9.0, 11.0, 23.0, 28.0, 58.0, 60.0, 88.0, 90.0, 117.0, 173.0, 332.0, 654.0, 3093.0, 3879648.0, 306870.0, 1487.0, 550.0, 295.0, 174.0, 128.0, 100.0, 58.0, 57.0, 57.0, 32.0, 19.0, 13.0, 15.0, 10.0, 9.0, 8.0, 5.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 2.0], "bins": [-87.375, -84.79296875, -82.2109375, -79.62890625, -77.046875, -74.46484375, -71.8828125, -69.30078125, -66.71875, -64.13671875, -61.5546875, -58.97265625, -56.390625, -53.80859375, -51.2265625, -48.64453125, -46.0625, -43.48046875, -40.8984375, -38.31640625, -35.734375, -33.15234375, -30.5703125, -27.98828125, -25.40625, -22.82421875, -20.2421875, -17.66015625, -15.078125, -12.49609375, -9.9140625, -7.33203125, -4.75, -2.16796875, 0.4140625, 2.99609375, 5.578125, 8.16015625, 10.7421875, 13.32421875, 15.90625, 18.48828125, 21.0703125, 23.65234375, 26.234375, 28.81640625, 31.3984375, 33.98046875, 36.5625, 39.14453125, 41.7265625, 44.30859375, 46.890625, 49.47265625, 52.0546875, 54.63671875, 57.21875, 59.80078125, 62.3828125, 64.96484375, 67.546875, 70.12890625, 72.7109375, 75.29296875, 77.875]}, "gradients/decoder.transformer.h.20.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 9.0, 181.0, 618.0, 193.0, 15.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-72.6153564453125, -64.99711608886719, -57.378875732421875, -49.76063919067383, -42.142398834228516, -34.5241584777832, -26.905921936035156, -19.287681579589844, -11.669441223144531, -4.051201820373535, 3.567037582397461, 11.18527603149414, 18.803516387939453, 26.421756744384766, 34.03999328613281, 41.658233642578125, 49.27647399902344, 56.89471435546875, 64.51295471191406, 72.13119506835938, 79.74943542480469, 87.36767578125, 94.98590850830078, 102.6041488647461, 110.2223892211914, 117.84062957763672, 125.45886993408203, 133.0771026611328, 140.69534301757812, 148.31358337402344, 155.93182373046875, 163.55006408691406, 171.16830444335938, 178.7865447998047, 186.40478515625, 194.0230255126953, 201.64126586914062, 209.25950622558594, 216.87774658203125, 224.4959716796875, 232.11422729492188, 239.7324676513672, 247.3507080078125, 254.9689483642578, 262.5871887207031, 270.2054138183594, 277.82366943359375, 285.44189453125, 293.06011962890625, 300.6783447265625, 308.2966003417969, 315.9148254394531, 323.5330810546875, 331.15130615234375, 338.7695617675781, 346.3877868652344, 354.00604248046875, 361.624267578125, 369.2425231933594, 376.8607482910156, 384.47900390625, 392.09722900390625, 399.7154846191406, 407.3337097167969, 414.95196533203125]}, "gradients/decoder.transformer.h.20.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 3.0, 5.0, 4.0, 4.0, 3.0, 5.0, 10.0, 5.0, 4.0, 5.0, 8.0, 16.0, 12.0, 14.0, 32.0, 22.0, 25.0, 33.0, 27.0, 37.0, 37.0, 39.0, 41.0, 39.0, 43.0, 40.0, 41.0, 42.0, 39.0, 33.0, 41.0, 48.0, 33.0, 29.0, 24.0, 16.0, 23.0, 26.0, 13.0, 31.0, 8.0, 15.0, 7.0, 8.0, 6.0, 5.0, 6.0, 3.0, 3.0, 1.0, 1.0, 1.0, 0.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-55.48941421508789, -53.65110778808594, -51.812801361083984, -49.97449493408203, -48.13618850708008, -46.297882080078125, -44.45957946777344, -42.621273040771484, -40.78296661376953, -38.94466018676758, -37.106353759765625, -35.26804733276367, -33.42974090576172, -31.5914363861084, -29.753129959106445, -27.914825439453125, -26.07651710510254, -24.238210678100586, -22.399904251098633, -20.561599731445312, -18.72329330444336, -16.884986877441406, -15.046680450439453, -13.208374977111816, -11.370068550109863, -9.53176212310791, -7.693456649780273, -5.85515022277832, -4.016844272613525, -2.1785383224487305, -0.34023189544677734, 1.4980735778808594, 3.3363800048828125, 5.174685955047607, 7.012991905212402, 8.851298332214355, 10.689603805541992, 12.527910232543945, 14.366216659545898, 16.20452117919922, 18.042827606201172, 19.881134033203125, 21.719440460205078, 23.55774688720703, 25.39605140686035, 27.234357833862305, 29.072664260864258, 30.910968780517578, 32.74927520751953, 34.587581634521484, 36.42588806152344, 38.26419448852539, 40.102500915527344, 41.94080352783203, 43.77911376953125, 45.61741638183594, 47.455726623535156, 49.29403305053711, 51.13233947753906, 52.970645904541016, 54.80895233154297, 56.647254943847656, 58.485565185546875, 60.32386779785156, 62.162174224853516]}, "gradients/decoder.transformer.h.20.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 2.0, 2.0, 2.0, 7.0, 4.0, 3.0, 6.0, 7.0, 6.0, 15.0, 7.0, 12.0, 23.0, 19.0, 14.0, 21.0, 29.0, 39.0, 37.0, 30.0, 39.0, 38.0, 39.0, 41.0, 45.0, 54.0, 38.0, 43.0, 45.0, 45.0, 34.0, 31.0, 28.0, 29.0, 26.0, 17.0, 23.0, 16.0, 12.0, 18.0, 19.0, 13.0, 6.0, 7.0, 3.0, 4.0, 4.0, 5.0, 2.0, 2.0, 2.0, 2.0, 0.0, 1.0, 3.0, 1.0], "bins": [-5.734375, -5.56292724609375, -5.3914794921875, -5.22003173828125, -5.048583984375, -4.87713623046875, -4.7056884765625, -4.53424072265625, -4.36279296875, -4.19134521484375, -4.0198974609375, -3.84844970703125, -3.677001953125, -3.50555419921875, -3.3341064453125, -3.16265869140625, -2.9912109375, -2.81976318359375, -2.6483154296875, -2.47686767578125, -2.305419921875, -2.13397216796875, -1.9625244140625, -1.79107666015625, -1.61962890625, -1.44818115234375, -1.2767333984375, -1.10528564453125, -0.933837890625, -0.76239013671875, -0.5909423828125, -0.41949462890625, -0.248046875, -0.07659912109375, 0.0948486328125, 0.26629638671875, 0.437744140625, 0.60919189453125, 0.7806396484375, 0.95208740234375, 1.12353515625, 1.29498291015625, 1.4664306640625, 1.63787841796875, 1.809326171875, 1.98077392578125, 2.1522216796875, 2.32366943359375, 2.4951171875, 2.66656494140625, 2.8380126953125, 3.00946044921875, 3.180908203125, 3.35235595703125, 3.5238037109375, 3.69525146484375, 3.86669921875, 4.03814697265625, 4.2095947265625, 4.38104248046875, 4.552490234375, 4.72393798828125, 4.8953857421875, 5.06683349609375, 5.23828125]}, "gradients/decoder.transformer.h.20.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 3.0, 3.0, 2.0, 4.0, 5.0, 7.0, 12.0, 21.0, 20.0, 36.0, 57.0, 64.0, 108.0, 176.0, 237.0, 358.0, 491.0, 808.0, 1190.0, 1794.0, 2739.0, 4199.0, 6408.0, 10184.0, 15831.0, 25140.0, 41872.0, 72065.0, 139274.0, 358576.0, 164324.0, 80479.0, 45865.0, 27583.0, 17158.0, 11038.0, 7052.0, 4538.0, 2992.0, 1932.0, 1290.0, 912.0, 543.0, 389.0, 233.0, 186.0, 115.0, 81.0, 58.0, 44.0, 19.0, 22.0, 15.0, 8.0, 6.0, 3.0, 2.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0], "bins": [-1.4140625, -1.3678436279296875, -1.321624755859375, -1.2754058837890625, -1.22918701171875, -1.1829681396484375, -1.136749267578125, -1.0905303955078125, -1.0443115234375, -0.9980926513671875, -0.951873779296875, -0.9056549072265625, -0.85943603515625, -0.8132171630859375, -0.766998291015625, -0.7207794189453125, -0.674560546875, -0.6283416748046875, -0.582122802734375, -0.5359039306640625, -0.48968505859375, -0.4434661865234375, -0.397247314453125, -0.3510284423828125, -0.3048095703125, -0.2585906982421875, -0.212371826171875, -0.1661529541015625, -0.11993408203125, -0.0737152099609375, -0.027496337890625, 0.0187225341796875, 0.06494140625, 0.1111602783203125, 0.157379150390625, 0.2035980224609375, 0.24981689453125, 0.2960357666015625, 0.342254638671875, 0.3884735107421875, 0.4346923828125, 0.4809112548828125, 0.527130126953125, 0.5733489990234375, 0.61956787109375, 0.6657867431640625, 0.712005615234375, 0.7582244873046875, 0.804443359375, 0.8506622314453125, 0.896881103515625, 0.9430999755859375, 0.98931884765625, 1.0355377197265625, 1.081756591796875, 1.1279754638671875, 1.1741943359375, 1.2204132080078125, 1.266632080078125, 1.3128509521484375, 1.35906982421875, 1.4052886962890625, 1.451507568359375, 1.4977264404296875, 1.5439453125]}, "gradients/decoder.transformer.h.20.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 3.0, 1.0, 3.0, 5.0, 4.0, 4.0, 7.0, 5.0, 10.0, 10.0, 10.0, 19.0, 21.0, 21.0, 15.0, 37.0, 33.0, 33.0, 36.0, 30.0, 28.0, 34.0, 53.0, 53.0, 1064.0, 41.0, 56.0, 32.0, 50.0, 27.0, 28.0, 24.0, 21.0, 29.0, 33.0, 17.0, 22.0, 17.0, 18.0, 17.0, 7.0, 9.0, 7.0, 13.0, 8.0, 8.0, 3.0, 6.0, 2.0, 2.0, 1.0, 3.0, 1.0, 1.0, 0.0, 1.0], "bins": [-3.607421875, -3.49755859375, -3.3876953125, -3.27783203125, -3.16796875, -3.05810546875, -2.9482421875, -2.83837890625, -2.728515625, -2.61865234375, -2.5087890625, -2.39892578125, -2.2890625, -2.17919921875, -2.0693359375, -1.95947265625, -1.849609375, -1.73974609375, -1.6298828125, -1.52001953125, -1.41015625, -1.30029296875, -1.1904296875, -1.08056640625, -0.970703125, -0.86083984375, -0.7509765625, -0.64111328125, -0.53125, -0.42138671875, -0.3115234375, -0.20166015625, -0.091796875, 0.01806640625, 0.1279296875, 0.23779296875, 0.34765625, 0.45751953125, 0.5673828125, 0.67724609375, 0.787109375, 0.89697265625, 1.0068359375, 1.11669921875, 1.2265625, 1.33642578125, 1.4462890625, 1.55615234375, 1.666015625, 1.77587890625, 1.8857421875, 1.99560546875, 2.10546875, 2.21533203125, 2.3251953125, 2.43505859375, 2.544921875, 2.65478515625, 2.7646484375, 2.87451171875, 2.984375, 3.09423828125, 3.2041015625, 3.31396484375, 3.423828125]}, "gradients/decoder.transformer.h.20.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 3.0, 2.0, 2.0, 2.0, 4.0, 6.0, 9.0, 12.0, 14.0, 17.0, 24.0, 25.0, 61.0, 84.0, 155.0, 217.0, 385.0, 694.0, 1161.0, 2082.0, 3627.0, 6380.0, 11703.0, 22363.0, 43636.0, 89154.0, 229860.0, 1448847.0, 117531.0, 56604.0, 28834.0, 14949.0, 7978.0, 4560.0, 2541.0, 1456.0, 907.0, 496.0, 278.0, 187.0, 110.0, 67.0, 44.0, 18.0, 20.0, 10.0, 8.0, 4.0, 7.0, 3.0, 4.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-1.88671875, -1.8262176513671875, -1.765716552734375, -1.7052154541015625, -1.64471435546875, -1.5842132568359375, -1.523712158203125, -1.4632110595703125, -1.4027099609375, -1.3422088623046875, -1.281707763671875, -1.2212066650390625, -1.16070556640625, -1.1002044677734375, -1.039703369140625, -0.9792022705078125, -0.918701171875, -0.8582000732421875, -0.797698974609375, -0.7371978759765625, -0.67669677734375, -0.6161956787109375, -0.555694580078125, -0.4951934814453125, -0.4346923828125, -0.3741912841796875, -0.313690185546875, -0.2531890869140625, -0.19268798828125, -0.1321868896484375, -0.071685791015625, -0.0111846923828125, 0.04931640625, 0.1098175048828125, 0.170318603515625, 0.2308197021484375, 0.29132080078125, 0.3518218994140625, 0.412322998046875, 0.4728240966796875, 0.5333251953125, 0.5938262939453125, 0.654327392578125, 0.7148284912109375, 0.77532958984375, 0.8358306884765625, 0.896331787109375, 0.9568328857421875, 1.017333984375, 1.0778350830078125, 1.138336181640625, 1.1988372802734375, 1.25933837890625, 1.3198394775390625, 1.380340576171875, 1.4408416748046875, 1.5013427734375, 1.5618438720703125, 1.622344970703125, 1.6828460693359375, 1.74334716796875, 1.8038482666015625, 1.864349365234375, 1.9248504638671875, 1.9853515625]}, "gradients/decoder.transformer.h.20.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 4.0, 3.0, 2.0, 6.0, 3.0, 7.0, 6.0, 9.0, 5.0, 4.0, 9.0, 17.0, 16.0, 19.0, 24.0, 26.0, 28.0, 39.0, 44.0, 65.0, 69.0, 107.0, 59.0, 85.0, 60.0, 52.0, 43.0, 36.0, 30.0, 21.0, 24.0, 16.0, 12.0, 9.0, 7.0, 10.0, 4.0, 6.0, 6.0, 6.0, 4.0, 5.0, 1.0, 5.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.0008206367492675781, -0.0007958412170410156, -0.0007710456848144531, -0.0007462501525878906, -0.0007214546203613281, -0.0006966590881347656, -0.0006718635559082031, -0.0006470680236816406, -0.0006222724914550781, -0.0005974769592285156, -0.0005726814270019531, -0.0005478858947753906, -0.0005230903625488281, -0.0004982948303222656, -0.0004734992980957031, -0.0004487037658691406, -0.0004239082336425781, -0.0003991127014160156, -0.0003743171691894531, -0.0003495216369628906, -0.0003247261047363281, -0.0002999305725097656, -0.0002751350402832031, -0.0002503395080566406, -0.00022554397583007812, -0.00020074844360351562, -0.00017595291137695312, -0.00015115737915039062, -0.00012636184692382812, -0.00010156631469726562, -7.677078247070312e-05, -5.1975250244140625e-05, -2.7179718017578125e-05, -2.384185791015625e-06, 2.2411346435546875e-05, 4.7206878662109375e-05, 7.200241088867188e-05, 9.679794311523438e-05, 0.00012159347534179688, 0.00014638900756835938, 0.00017118453979492188, 0.00019598007202148438, 0.00022077560424804688, 0.0002455711364746094, 0.0002703666687011719, 0.0002951622009277344, 0.0003199577331542969, 0.0003447532653808594, 0.0003695487976074219, 0.0003943443298339844, 0.0004191398620605469, 0.0004439353942871094, 0.0004687309265136719, 0.0004935264587402344, 0.0005183219909667969, 0.0005431175231933594, 0.0005679130554199219, 0.0005927085876464844, 0.0006175041198730469, 0.0006422996520996094, 0.0006670951843261719, 0.0006918907165527344, 0.0007166862487792969, 0.0007414817810058594, 0.0007662773132324219]}, "gradients/decoder.transformer.h.20.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 2.0, 3.0, 5.0, 1.0, 3.0, 1.0, 5.0, 4.0, 4.0, 4.0, 2.0, 5.0, 7.0, 7.0, 11.0, 9.0, 23.0, 34.0, 46.0, 49.0, 60.0, 107.0, 122.0, 238.0, 433.0, 2567.0, 1040210.0, 3445.0, 442.0, 230.0, 121.0, 86.0, 73.0, 51.0, 22.0, 30.0, 22.0, 14.0, 11.0, 13.0, 7.0, 6.0, 6.0, 7.0, 2.0, 1.0, 5.0, 4.0, 2.0, 2.0, 1.0, 3.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.0172119140625, -0.01662898063659668, -0.01604604721069336, -0.015463113784790039, -0.014880180358886719, -0.014297246932983398, -0.013714313507080078, -0.013131380081176758, -0.012548446655273438, -0.011965513229370117, -0.011382579803466797, -0.010799646377563477, -0.010216712951660156, -0.009633779525756836, -0.009050846099853516, -0.008467912673950195, -0.007884979248046875, -0.007302045822143555, -0.006719112396240234, -0.006136178970336914, -0.005553245544433594, -0.0049703121185302734, -0.004387378692626953, -0.003804445266723633, -0.0032215118408203125, -0.002638578414916992, -0.002055644989013672, -0.0014727115631103516, -0.0008897781372070312, -0.00030684471130371094, 0.0002760887145996094, 0.0008590221405029297, 0.00144195556640625, 0.0020248889923095703, 0.0026078224182128906, 0.003190755844116211, 0.0037736892700195312, 0.0043566226959228516, 0.004939556121826172, 0.005522489547729492, 0.0061054229736328125, 0.006688356399536133, 0.007271289825439453, 0.007854223251342773, 0.008437156677246094, 0.009020090103149414, 0.009603023529052734, 0.010185956954956055, 0.010768890380859375, 0.011351823806762695, 0.011934757232666016, 0.012517690658569336, 0.013100624084472656, 0.013683557510375977, 0.014266490936279297, 0.014849424362182617, 0.015432357788085938, 0.016015291213989258, 0.016598224639892578, 0.0171811580657959, 0.01776409149169922, 0.01834702491760254, 0.01892995834350586, 0.01951289176940918, 0.0200958251953125]}, "gradients/decoder.transformer.h.20.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 39.0, 277.0, 515.0, 168.0, 14.0, 3.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0013334067771211267, -0.0012828508624807, -0.0012322949478402734, -0.0011817390331998467, -0.0011311830021440983, -0.0010806270875036716, -0.001030071172863245, -0.0009795152582228184, -0.0009289593435823917, -0.0008784034289419651, -0.0008278475143015385, -0.0007772915414534509, -0.0007267356268130243, -0.0006761797121725976, -0.0006256237393245101, -0.0005750678246840835, -0.0005245119100436568, -0.0004739559954032302, -0.0004234000516589731, -0.000372844107914716, -0.00032228819327428937, -0.00027173227863386273, -0.00022117633488960564, -0.00017062039114534855, -0.00012006447650492191, -6.950854731258005e-05, -1.8952618120238185e-05, 3.160331107210368e-05, 8.215924026444554e-05, 0.00013271515490487218, 0.00018327109864912927, 0.00023382704239338636, 0.000284382957033813, 0.00033493887167423964, 0.00038549481541849673, 0.0004360507591627538, 0.00048660667380318046, 0.0005371625884436071, 0.0005877185612916946, 0.0006382744759321213, 0.0006888303905725479, 0.0007393863052129745, 0.0007899422198534012, 0.0008404981927014887, 0.0008910541073419154, 0.000941610021982342, 0.0009921659948304296, 0.0010427219094708562, 0.0010932778241112828, 0.0011438337387517095, 0.001194389653392136, 0.0012449455680325627, 0.0012955015990883112, 0.0013460575137287378, 0.0013966134283691645, 0.001447169343009591, 0.0014977252576500177, 0.0015482811722904444, 0.001598837086930871, 0.0016493930015712976, 0.0016999489162117243, 0.001750504830852151, 0.0018010608619078994, 0.001851616776548326, 0.0019021726911887527]}, "gradients/decoder.transformer.h.20.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 5.0, 1.0, 3.0, 4.0, 4.0, 7.0, 4.0, 3.0, 11.0, 15.0, 9.0, 16.0, 21.0, 23.0, 22.0, 22.0, 33.0, 34.0, 34.0, 29.0, 35.0, 50.0, 46.0, 43.0, 47.0, 40.0, 46.0, 36.0, 40.0, 42.0, 40.0, 42.0, 31.0, 35.0, 28.0, 29.0, 18.0, 14.0, 14.0, 9.0, 9.0, 4.0, 4.0, 3.0, 8.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.00042945146560668945, -0.00041625555604696274, -0.000403059646487236, -0.0003898637369275093, -0.0003766678273677826, -0.0003634719178080559, -0.00035027600824832916, -0.00033708009868860245, -0.00032388418912887573, -0.000310688279569149, -0.0002974923700094223, -0.0002842964604496956, -0.00027110055088996887, -0.00025790464133024216, -0.00024470873177051544, -0.00023151282221078873, -0.000218316912651062, -0.0002051210030913353, -0.00019192509353160858, -0.00017872918397188187, -0.00016553327441215515, -0.00015233736485242844, -0.00013914145529270172, -0.000125945545732975, -0.00011274963617324829, -9.955372661352158e-05, -8.635781705379486e-05, -7.316190749406815e-05, -5.996599793434143e-05, -4.6770088374614716e-05, -3.3574178814888e-05, -2.0378269255161285e-05, -7.18235969543457e-06, 6.013549864292145e-06, 1.920945942401886e-05, 3.2405368983745575e-05, 4.560127854347229e-05, 5.8797188103199005e-05, 7.199309766292572e-05, 8.518900722265244e-05, 9.838491678237915e-05, 0.00011158082634210587, 0.00012477673590183258, 0.0001379726454615593, 0.000151168555021286, 0.00016436446458101273, 0.00017756037414073944, 0.00019075628370046616, 0.00020395219326019287, 0.00021714810281991959, 0.0002303440123796463, 0.00024353992193937302, 0.00025673583149909973, 0.00026993174105882645, 0.00028312765061855316, 0.0002963235601782799, 0.0003095194697380066, 0.0003227153792977333, 0.00033591128885746, 0.00034910719841718674, 0.00036230310797691345, 0.00037549901753664017, 0.0003886949270963669, 0.0004018908366560936, 0.0004150867462158203]}, "gradients/decoder.transformer.h.20.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 2.0, 2.0, 2.0, 7.0, 4.0, 3.0, 6.0, 8.0, 5.0, 15.0, 7.0, 12.0, 23.0, 19.0, 14.0, 21.0, 29.0, 39.0, 37.0, 30.0, 39.0, 38.0, 39.0, 41.0, 45.0, 54.0, 38.0, 43.0, 45.0, 45.0, 34.0, 31.0, 28.0, 29.0, 26.0, 17.0, 23.0, 16.0, 12.0, 18.0, 19.0, 13.0, 6.0, 7.0, 3.0, 4.0, 4.0, 5.0, 2.0, 2.0, 2.0, 2.0, 0.0, 1.0, 3.0, 1.0], "bins": [-5.734375, -5.56292724609375, -5.3914794921875, -5.22003173828125, -5.048583984375, -4.87713623046875, -4.7056884765625, -4.53424072265625, -4.36279296875, -4.19134521484375, -4.0198974609375, -3.84844970703125, -3.677001953125, -3.50555419921875, -3.3341064453125, -3.16265869140625, -2.9912109375, -2.81976318359375, -2.6483154296875, -2.47686767578125, -2.305419921875, -2.13397216796875, -1.9625244140625, -1.79107666015625, -1.61962890625, -1.44818115234375, -1.2767333984375, -1.10528564453125, -0.933837890625, -0.76239013671875, -0.5909423828125, -0.41949462890625, -0.248046875, -0.07659912109375, 0.0948486328125, 0.26629638671875, 0.437744140625, 0.60919189453125, 0.7806396484375, 0.95208740234375, 1.12353515625, 1.29498291015625, 1.4664306640625, 1.63787841796875, 1.809326171875, 1.98077392578125, 2.1522216796875, 2.32366943359375, 2.4951171875, 2.66656494140625, 2.8380126953125, 3.00946044921875, 3.180908203125, 3.35235595703125, 3.5238037109375, 3.69525146484375, 3.86669921875, 4.03814697265625, 4.2095947265625, 4.38104248046875, 4.552490234375, 4.72393798828125, 4.8953857421875, 5.06683349609375, 5.23828125]}, "gradients/decoder.transformer.h.20.attn.c_proj.weight": {"_type": "histogram", "values": [3.0, 0.0, 1.0, 0.0, 3.0, 5.0, 3.0, 2.0, 6.0, 12.0, 12.0, 19.0, 40.0, 35.0, 60.0, 97.0, 142.0, 204.0, 294.0, 432.0, 756.0, 1312.0, 2209.0, 3949.0, 7978.0, 17344.0, 43932.0, 149032.0, 520676.0, 203381.0, 55753.0, 20968.0, 9239.0, 4582.0, 2392.0, 1449.0, 751.0, 496.0, 313.0, 211.0, 146.0, 95.0, 79.0, 50.0, 30.0, 19.0, 19.0, 11.0, 9.0, 3.0, 8.0, 5.0, 3.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.2421875, -3.12884521484375, -3.0155029296875, -2.90216064453125, -2.788818359375, -2.67547607421875, -2.5621337890625, -2.44879150390625, -2.33544921875, -2.22210693359375, -2.1087646484375, -1.99542236328125, -1.882080078125, -1.76873779296875, -1.6553955078125, -1.54205322265625, -1.4287109375, -1.31536865234375, -1.2020263671875, -1.08868408203125, -0.975341796875, -0.86199951171875, -0.7486572265625, -0.63531494140625, -0.52197265625, -0.40863037109375, -0.2952880859375, -0.18194580078125, -0.068603515625, 0.04473876953125, 0.1580810546875, 0.27142333984375, 0.384765625, 0.49810791015625, 0.6114501953125, 0.72479248046875, 0.838134765625, 0.95147705078125, 1.0648193359375, 1.17816162109375, 1.29150390625, 1.40484619140625, 1.5181884765625, 1.63153076171875, 1.744873046875, 1.85821533203125, 1.9715576171875, 2.08489990234375, 2.1982421875, 2.31158447265625, 2.4249267578125, 2.53826904296875, 2.651611328125, 2.76495361328125, 2.8782958984375, 2.99163818359375, 3.10498046875, 3.21832275390625, 3.3316650390625, 3.44500732421875, 3.558349609375, 3.67169189453125, 3.7850341796875, 3.89837646484375, 4.01171875]}, "gradients/decoder.transformer.h.20.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 2.0, 0.0, 2.0, 0.0, 5.0, 4.0, 5.0, 0.0, 2.0, 7.0, 5.0, 14.0, 4.0, 14.0, 14.0, 24.0, 23.0, 31.0, 18.0, 20.0, 21.0, 32.0, 36.0, 30.0, 38.0, 40.0, 59.0, 100.0, 1916.0, 119.0, 64.0, 45.0, 36.0, 46.0, 35.0, 36.0, 29.0, 23.0, 26.0, 32.0, 15.0, 12.0, 11.0, 19.0, 11.0, 7.0, 7.0, 5.0, 9.0, 3.0, 0.0, 3.0, 2.0, 1.0, 1.0, 2.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0], "bins": [-17.84375, -17.25927734375, -16.6748046875, -16.09033203125, -15.505859375, -14.92138671875, -14.3369140625, -13.75244140625, -13.16796875, -12.58349609375, -11.9990234375, -11.41455078125, -10.830078125, -10.24560546875, -9.6611328125, -9.07666015625, -8.4921875, -7.90771484375, -7.3232421875, -6.73876953125, -6.154296875, -5.56982421875, -4.9853515625, -4.40087890625, -3.81640625, -3.23193359375, -2.6474609375, -2.06298828125, -1.478515625, -0.89404296875, -0.3095703125, 0.27490234375, 0.859375, 1.44384765625, 2.0283203125, 2.61279296875, 3.197265625, 3.78173828125, 4.3662109375, 4.95068359375, 5.53515625, 6.11962890625, 6.7041015625, 7.28857421875, 7.873046875, 8.45751953125, 9.0419921875, 9.62646484375, 10.2109375, 10.79541015625, 11.3798828125, 11.96435546875, 12.548828125, 13.13330078125, 13.7177734375, 14.30224609375, 14.88671875, 15.47119140625, 16.0556640625, 16.64013671875, 17.224609375, 17.80908203125, 18.3935546875, 18.97802734375, 19.5625]}, "gradients/decoder.transformer.h.20.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 0.0, 2.0, 0.0, 2.0, 1.0, 0.0, 2.0, 3.0, 1.0, 4.0, 6.0, 6.0, 11.0, 7.0, 9.0, 17.0, 12.0, 15.0, 14.0, 36.0, 33.0, 36.0, 44.0, 61.0, 90.0, 131.0, 195.0, 362.0, 865.0, 5166.0, 2900640.0, 233644.0, 2688.0, 674.0, 271.0, 174.0, 104.0, 88.0, 64.0, 38.0, 36.0, 24.0, 27.0, 22.0, 20.0, 19.0, 9.0, 4.0, 11.0, 5.0, 9.0, 1.0, 0.0, 5.0, 5.0, 4.0, 1.0, 2.0, 2.0, 2.0], "bins": [-32.65625, -31.7197265625, -30.783203125, -29.8466796875, -28.91015625, -27.9736328125, -27.037109375, -26.1005859375, -25.1640625, -24.2275390625, -23.291015625, -22.3544921875, -21.41796875, -20.4814453125, -19.544921875, -18.6083984375, -17.671875, -16.7353515625, -15.798828125, -14.8623046875, -13.92578125, -12.9892578125, -12.052734375, -11.1162109375, -10.1796875, -9.2431640625, -8.306640625, -7.3701171875, -6.43359375, -5.4970703125, -4.560546875, -3.6240234375, -2.6875, -1.7509765625, -0.814453125, 0.1220703125, 1.05859375, 1.9951171875, 2.931640625, 3.8681640625, 4.8046875, 5.7412109375, 6.677734375, 7.6142578125, 8.55078125, 9.4873046875, 10.423828125, 11.3603515625, 12.296875, 13.2333984375, 14.169921875, 15.1064453125, 16.04296875, 16.9794921875, 17.916015625, 18.8525390625, 19.7890625, 20.7255859375, 21.662109375, 22.5986328125, 23.53515625, 24.4716796875, 25.408203125, 26.3447265625, 27.28125]}, "gradients/decoder.transformer.h.20.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 14.0, 348.0, 592.0, 60.0, 1.0, 1.0, 2.0], "bins": [-144.5220184326172, -142.07830810546875, -139.63461303710938, -137.19090270996094, -134.74720764160156, -132.30349731445312, -129.85980224609375, -127.41609191894531, -124.9723892211914, -122.5286865234375, -120.0849838256836, -117.64128112792969, -115.19757080078125, -112.75386810302734, -110.31016540527344, -107.86646270751953, -105.42276000976562, -102.97905731201172, -100.53535461425781, -98.09164428710938, -95.64794158935547, -93.20423889160156, -90.76053619384766, -88.31683349609375, -85.87312316894531, -83.4294204711914, -80.9857177734375, -78.54200744628906, -76.09830474853516, -73.65460205078125, -71.21089935302734, -68.76719665527344, -66.32349395751953, -63.879791259765625, -61.43608474731445, -58.99238204956055, -56.54867935180664, -54.10497283935547, -51.66127014160156, -49.217567443847656, -46.77386474609375, -44.330162048339844, -41.88645553588867, -39.442752838134766, -36.99905014038086, -34.55534362792969, -32.11164093017578, -29.667938232421875, -27.22423553466797, -24.78053092956543, -22.336828231811523, -19.893123626708984, -17.449420928955078, -15.005716323852539, -12.56201171875, -10.118309020996094, -7.674603462219238, -5.230899810791016, -2.7871956825256348, -0.3434915542602539, 2.1002120971679688, 4.543915748596191, 6.9876203536987305, 9.431323051452637, 11.875027656555176]}, "gradients/decoder.transformer.h.20.ln_1.bias": {"_type": "histogram", "values": [2.0, 2.0, 1.0, 4.0, 7.0, 5.0, 5.0, 6.0, 5.0, 10.0, 7.0, 12.0, 16.0, 15.0, 14.0, 21.0, 20.0, 25.0, 16.0, 22.0, 21.0, 22.0, 28.0, 51.0, 38.0, 34.0, 42.0, 52.0, 43.0, 34.0, 36.0, 41.0, 41.0, 35.0, 29.0, 30.0, 25.0, 25.0, 16.0, 24.0, 20.0, 21.0, 20.0, 15.0, 9.0, 10.0, 14.0, 7.0, 4.0, 5.0, 1.0, 3.0, 2.0, 3.0, 1.0, 0.0, 1.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-48.98492431640625, -47.245452880859375, -45.5059814453125, -43.766510009765625, -42.02703857421875, -40.287567138671875, -38.548095703125, -36.808624267578125, -35.06915283203125, -33.329681396484375, -31.5902099609375, -29.850738525390625, -28.11126708984375, -26.371795654296875, -24.632326126098633, -22.892854690551758, -21.153385162353516, -19.41391372680664, -17.674442291259766, -15.934971809387207, -14.195500373840332, -12.456028938293457, -10.716558456420898, -8.977087020874023, -7.237615585327148, -5.498144149780273, -3.7586731910705566, -2.01920223236084, -0.27973079681396484, 1.4597406387329102, 3.1992111206054688, 4.938682556152344, 6.678153991699219, 8.417625427246094, 10.157096862792969, 11.896567344665527, 13.636038780212402, 15.375510215759277, 17.114980697631836, 18.85445213317871, 20.593923568725586, 22.33339500427246, 24.072866439819336, 25.812335968017578, 27.551807403564453, 29.291278839111328, 31.030750274658203, 32.77022171020508, 34.50969314575195, 36.24916458129883, 37.9886360168457, 39.72810745239258, 41.46757888793945, 43.20705032348633, 44.94651794433594, 46.68598937988281, 48.42546081542969, 50.16493225097656, 51.90440368652344, 53.64387512207031, 55.38334655761719, 57.12281799316406, 58.86228942871094, 60.60176086425781, 62.34123229980469]}, "gradients/decoder.transformer.h.19.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 1.0, 3.0, 5.0, 6.0, 4.0, 4.0, 6.0, 6.0, 12.0, 8.0, 12.0, 15.0, 24.0, 19.0, 14.0, 29.0, 36.0, 25.0, 36.0, 41.0, 38.0, 41.0, 43.0, 38.0, 56.0, 45.0, 44.0, 47.0, 32.0, 43.0, 31.0, 33.0, 23.0, 33.0, 16.0, 23.0, 17.0, 14.0, 17.0, 17.0, 15.0, 10.0, 6.0, 6.0, 3.0, 2.0, 4.0, 4.0, 2.0, 3.0, 0.0, 2.0, 0.0, 4.0, 1.0], "bins": [-5.9140625, -5.73931884765625, -5.5645751953125, -5.38983154296875, -5.215087890625, -5.04034423828125, -4.8656005859375, -4.69085693359375, -4.51611328125, -4.34136962890625, -4.1666259765625, -3.99188232421875, -3.817138671875, -3.64239501953125, -3.4676513671875, -3.29290771484375, -3.1181640625, -2.94342041015625, -2.7686767578125, -2.59393310546875, -2.419189453125, -2.24444580078125, -2.0697021484375, -1.89495849609375, -1.72021484375, -1.54547119140625, -1.3707275390625, -1.19598388671875, -1.021240234375, -0.84649658203125, -0.6717529296875, -0.49700927734375, -0.322265625, -0.14752197265625, 0.0272216796875, 0.20196533203125, 0.376708984375, 0.55145263671875, 0.7261962890625, 0.90093994140625, 1.07568359375, 1.25042724609375, 1.4251708984375, 1.59991455078125, 1.774658203125, 1.94940185546875, 2.1241455078125, 2.29888916015625, 2.4736328125, 2.64837646484375, 2.8231201171875, 2.99786376953125, 3.172607421875, 3.34735107421875, 3.5220947265625, 3.69683837890625, 3.87158203125, 4.04632568359375, 4.2210693359375, 4.39581298828125, 4.570556640625, 4.74530029296875, 4.9200439453125, 5.09478759765625, 5.26953125]}, "gradients/decoder.transformer.h.19.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 5.0, 2.0, 4.0, 2.0, 6.0, 8.0, 7.0, 5.0, 9.0, 10.0, 20.0, 20.0, 30.0, 48.0, 64.0, 90.0, 157.0, 364.0, 1073.0, 4831.0, 35371.0, 370993.0, 2154583.0, 1434340.0, 170370.0, 17788.0, 2706.0, 675.0, 273.0, 137.0, 86.0, 64.0, 33.0, 27.0, 18.0, 23.0, 5.0, 10.0, 7.0, 6.0, 10.0, 3.0, 4.0, 4.0, 2.0, 4.0, 2.0, 1.0, 2.0], "bins": [-13.75, -13.3865966796875, -13.023193359375, -12.6597900390625, -12.29638671875, -11.9329833984375, -11.569580078125, -11.2061767578125, -10.8427734375, -10.4793701171875, -10.115966796875, -9.7525634765625, -9.38916015625, -9.0257568359375, -8.662353515625, -8.2989501953125, -7.935546875, -7.5721435546875, -7.208740234375, -6.8453369140625, -6.48193359375, -6.1185302734375, -5.755126953125, -5.3917236328125, -5.0283203125, -4.6649169921875, -4.301513671875, -3.9381103515625, -3.57470703125, -3.2113037109375, -2.847900390625, -2.4844970703125, -2.12109375, -1.7576904296875, -1.394287109375, -1.0308837890625, -0.66748046875, -0.3040771484375, 0.059326171875, 0.4227294921875, 0.7861328125, 1.1495361328125, 1.512939453125, 1.8763427734375, 2.23974609375, 2.6031494140625, 2.966552734375, 3.3299560546875, 3.693359375, 4.0567626953125, 4.420166015625, 4.7835693359375, 5.14697265625, 5.5103759765625, 5.873779296875, 6.2371826171875, 6.6005859375, 6.9639892578125, 7.327392578125, 7.6907958984375, 8.05419921875, 8.4176025390625, 8.781005859375, 9.1444091796875, 9.5078125]}, "gradients/decoder.transformer.h.19.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 0.0, 7.0, 3.0, 8.0, 2.0, 5.0, 16.0, 11.0, 21.0, 35.0, 42.0, 36.0, 65.0, 95.0, 114.0, 199.0, 277.0, 388.0, 554.0, 596.0, 492.0, 348.0, 240.0, 156.0, 108.0, 70.0, 56.0, 38.0, 28.0, 20.0, 21.0, 7.0, 6.0, 7.0, 3.0, 1.0, 3.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0], "bins": [-19.84375, -19.246337890625, -18.64892578125, -18.051513671875, -17.4541015625, -16.856689453125, -16.25927734375, -15.661865234375, -15.064453125, -14.467041015625, -13.86962890625, -13.272216796875, -12.6748046875, -12.077392578125, -11.47998046875, -10.882568359375, -10.28515625, -9.687744140625, -9.09033203125, -8.492919921875, -7.8955078125, -7.298095703125, -6.70068359375, -6.103271484375, -5.505859375, -4.908447265625, -4.31103515625, -3.713623046875, -3.1162109375, -2.518798828125, -1.92138671875, -1.323974609375, -0.7265625, -0.129150390625, 0.46826171875, 1.065673828125, 1.6630859375, 2.260498046875, 2.85791015625, 3.455322265625, 4.052734375, 4.650146484375, 5.24755859375, 5.844970703125, 6.4423828125, 7.039794921875, 7.63720703125, 8.234619140625, 8.83203125, 9.429443359375, 10.02685546875, 10.624267578125, 11.2216796875, 11.819091796875, 12.41650390625, 13.013916015625, 13.611328125, 14.208740234375, 14.80615234375, 15.403564453125, 16.0009765625, 16.598388671875, 17.19580078125, 17.793212890625, 18.390625]}, "gradients/decoder.transformer.h.19.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 5.0, 3.0, 5.0, 7.0, 3.0, 8.0, 21.0, 13.0, 24.0, 31.0, 39.0, 55.0, 94.0, 133.0, 200.0, 376.0, 1096.0, 154511.0, 4033207.0, 3151.0, 479.0, 258.0, 178.0, 122.0, 67.0, 60.0, 32.0, 41.0, 25.0, 12.0, 11.0, 7.0, 5.0, 4.0, 2.0, 4.0, 3.0, 1.0, 3.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-88.875, -85.9326171875, -82.990234375, -80.0478515625, -77.10546875, -74.1630859375, -71.220703125, -68.2783203125, -65.3359375, -62.3935546875, -59.451171875, -56.5087890625, -53.56640625, -50.6240234375, -47.681640625, -44.7392578125, -41.796875, -38.8544921875, -35.912109375, -32.9697265625, -30.02734375, -27.0849609375, -24.142578125, -21.2001953125, -18.2578125, -15.3154296875, -12.373046875, -9.4306640625, -6.48828125, -3.5458984375, -0.603515625, 2.3388671875, 5.28125, 8.2236328125, 11.166015625, 14.1083984375, 17.05078125, 19.9931640625, 22.935546875, 25.8779296875, 28.8203125, 31.7626953125, 34.705078125, 37.6474609375, 40.58984375, 43.5322265625, 46.474609375, 49.4169921875, 52.359375, 55.3017578125, 58.244140625, 61.1865234375, 64.12890625, 67.0712890625, 70.013671875, 72.9560546875, 75.8984375, 78.8408203125, 81.783203125, 84.7255859375, 87.66796875, 90.6103515625, 93.552734375, 96.4951171875, 99.4375]}, "gradients/decoder.transformer.h.19.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 5.0, 15.0, 25.0, 60.0, 128.0, 182.0, 204.0, 170.0, 118.0, 70.0, 25.0, 9.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-60.368309020996094, -57.78787612915039, -55.20744323730469, -52.627010345458984, -50.04657745361328, -47.46614074707031, -44.88570785522461, -42.305274963378906, -39.7248420715332, -37.1444091796875, -34.5639762878418, -31.98354148864746, -29.403108596801758, -26.822675704956055, -24.24224090576172, -21.661808013916016, -19.081375122070312, -16.50094223022461, -13.92050838470459, -11.34007453918457, -8.759641647338867, -6.179208755493164, -3.5987749099731445, -1.018341064453125, 1.5620918273925781, 4.1425251960754395, 6.722958564758301, 9.30339241027832, 11.883825302124023, 14.464258193969727, 17.044692993164062, 19.625125885009766, 22.205551147460938, 24.78598403930664, 27.366416931152344, 29.94685173034668, 32.52728271484375, 35.10771942138672, 37.68815231323242, 40.268585205078125, 42.84901809692383, 45.42945098876953, 48.009883880615234, 50.59031677246094, 53.170753479003906, 55.751182556152344, 58.33161926269531, 60.912052154541016, 63.49248504638672, 66.07292175292969, 68.65335083007812, 71.2337875366211, 73.81421661376953, 76.3946533203125, 78.97508239746094, 81.5555191040039, 84.13595581054688, 86.71639251708984, 89.29682159423828, 91.87725830078125, 94.45768737792969, 97.03812408447266, 99.6185531616211, 102.19898986816406, 104.7794189453125]}, "gradients/decoder.transformer.h.19.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 3.0, 2.0, 1.0, 5.0, 7.0, 5.0, 5.0, 8.0, 14.0, 13.0, 18.0, 18.0, 20.0, 30.0, 21.0, 26.0, 31.0, 37.0, 46.0, 39.0, 44.0, 40.0, 45.0, 49.0, 42.0, 31.0, 43.0, 41.0, 41.0, 37.0, 33.0, 35.0, 23.0, 29.0, 13.0, 25.0, 14.0, 16.0, 14.0, 8.0, 9.0, 8.0, 5.0, 7.0, 3.0, 3.0, 3.0, 3.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0], "bins": [-54.46820068359375, -52.644866943359375, -50.821533203125, -48.998199462890625, -47.174861907958984, -45.35152816772461, -43.528194427490234, -41.70486068725586, -39.88152313232422, -38.058189392089844, -36.23485565185547, -34.411521911621094, -32.58818435668945, -30.764850616455078, -28.941516876220703, -27.118183135986328, -25.294849395751953, -23.471515655517578, -21.64818000793457, -19.824846267700195, -18.001510620117188, -16.178176879882812, -14.354843139648438, -12.531508445739746, -10.708173751831055, -8.884839057922363, -7.06150484085083, -5.238170623779297, -3.4148359298706055, -1.591501235961914, 0.23183250427246094, 2.0551671981811523, 3.878498077392578, 5.7018327713012695, 7.525166988372803, 9.348501205444336, 11.171835899353027, 12.995170593261719, 14.818504333496094, 16.64183807373047, 18.465173721313477, 20.28850746154785, 22.11184310913086, 23.935176849365234, 25.75851058959961, 27.581846237182617, 29.405179977416992, 31.228515625, 33.051849365234375, 34.87518310546875, 36.698516845703125, 38.5218505859375, 40.34518814086914, 42.168521881103516, 43.99185562133789, 45.815189361572266, 47.638526916503906, 49.46186065673828, 51.285194396972656, 53.10852813720703, 54.93186569213867, 56.75519943237305, 58.57853317260742, 60.4018669128418, 62.22520065307617]}, "gradients/decoder.transformer.h.19.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 4.0, 1.0, 1.0, 3.0, 2.0, 4.0, 3.0, 4.0, 6.0, 5.0, 4.0, 12.0, 9.0, 7.0, 9.0, 15.0, 24.0, 23.0, 22.0, 21.0, 24.0, 32.0, 27.0, 38.0, 42.0, 43.0, 33.0, 37.0, 42.0, 41.0, 42.0, 45.0, 36.0, 43.0, 28.0, 34.0, 28.0, 40.0, 19.0, 18.0, 24.0, 19.0, 15.0, 11.0, 18.0, 13.0, 11.0, 7.0, 3.0, 6.0, 3.0, 4.0, 1.0, 2.0, 4.0, 1.0, 1.0, 2.0, 1.0, 2.0, 2.0, 1.0], "bins": [-5.1796875, -5.01409912109375, -4.8485107421875, -4.68292236328125, -4.517333984375, -4.35174560546875, -4.1861572265625, -4.02056884765625, -3.85498046875, -3.68939208984375, -3.5238037109375, -3.35821533203125, -3.192626953125, -3.02703857421875, -2.8614501953125, -2.69586181640625, -2.5302734375, -2.36468505859375, -2.1990966796875, -2.03350830078125, -1.867919921875, -1.70233154296875, -1.5367431640625, -1.37115478515625, -1.20556640625, -1.03997802734375, -0.8743896484375, -0.70880126953125, -0.543212890625, -0.37762451171875, -0.2120361328125, -0.04644775390625, 0.119140625, 0.28472900390625, 0.4503173828125, 0.61590576171875, 0.781494140625, 0.94708251953125, 1.1126708984375, 1.27825927734375, 1.44384765625, 1.60943603515625, 1.7750244140625, 1.94061279296875, 2.106201171875, 2.27178955078125, 2.4373779296875, 2.60296630859375, 2.7685546875, 2.93414306640625, 3.0997314453125, 3.26531982421875, 3.430908203125, 3.59649658203125, 3.7620849609375, 3.92767333984375, 4.09326171875, 4.25885009765625, 4.4244384765625, 4.59002685546875, 4.755615234375, 4.92120361328125, 5.0867919921875, 5.25238037109375, 5.41796875]}, "gradients/decoder.transformer.h.19.crossattention.c_proj.weight": {"_type": "histogram", "values": [2.0, 1.0, 5.0, 3.0, 5.0, 7.0, 12.0, 18.0, 28.0, 40.0, 66.0, 99.0, 141.0, 176.0, 241.0, 354.0, 484.0, 703.0, 990.0, 1491.0, 2083.0, 2923.0, 4213.0, 6035.0, 9074.0, 13711.0, 20609.0, 31666.0, 52834.0, 92467.0, 200504.0, 313973.0, 117750.0, 63924.0, 38378.0, 24226.0, 15636.0, 10490.0, 7101.0, 5026.0, 3284.0, 2282.0, 1621.0, 1126.0, 815.0, 544.0, 409.0, 332.0, 210.0, 156.0, 113.0, 70.0, 46.0, 33.0, 17.0, 8.0, 8.0, 2.0, 2.0, 3.0, 1.0, 2.0, 2.0, 1.0], "bins": [-1.2705078125, -1.229766845703125, -1.18902587890625, -1.148284912109375, -1.1075439453125, -1.066802978515625, -1.02606201171875, -0.985321044921875, -0.944580078125, -0.903839111328125, -0.86309814453125, -0.822357177734375, -0.7816162109375, -0.740875244140625, -0.70013427734375, -0.659393310546875, -0.61865234375, -0.577911376953125, -0.53717041015625, -0.496429443359375, -0.4556884765625, -0.414947509765625, -0.37420654296875, -0.333465576171875, -0.292724609375, -0.251983642578125, -0.21124267578125, -0.170501708984375, -0.1297607421875, -0.089019775390625, -0.04827880859375, -0.007537841796875, 0.033203125, 0.073944091796875, 0.11468505859375, 0.155426025390625, 0.1961669921875, 0.236907958984375, 0.27764892578125, 0.318389892578125, 0.359130859375, 0.399871826171875, 0.44061279296875, 0.481353759765625, 0.5220947265625, 0.562835693359375, 0.60357666015625, 0.644317626953125, 0.68505859375, 0.725799560546875, 0.76654052734375, 0.807281494140625, 0.8480224609375, 0.888763427734375, 0.92950439453125, 0.970245361328125, 1.010986328125, 1.051727294921875, 1.09246826171875, 1.133209228515625, 1.1739501953125, 1.214691162109375, 1.25543212890625, 1.296173095703125, 1.3369140625]}, "gradients/decoder.transformer.h.19.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 2.0, 3.0, 4.0, 5.0, 5.0, 5.0, 5.0, 13.0, 14.0, 8.0, 13.0, 19.0, 22.0, 32.0, 27.0, 32.0, 28.0, 40.0, 32.0, 44.0, 40.0, 40.0, 34.0, 1072.0, 46.0, 43.0, 41.0, 40.0, 42.0, 40.0, 32.0, 33.0, 31.0, 16.0, 16.0, 20.0, 17.0, 19.0, 13.0, 11.0, 12.0, 4.0, 3.0, 9.0, 3.0, 2.0, 3.0, 1.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0], "bins": [-3.712890625, -3.59796142578125, -3.4830322265625, -3.36810302734375, -3.253173828125, -3.13824462890625, -3.0233154296875, -2.90838623046875, -2.79345703125, -2.67852783203125, -2.5635986328125, -2.44866943359375, -2.333740234375, -2.21881103515625, -2.1038818359375, -1.98895263671875, -1.8740234375, -1.75909423828125, -1.6441650390625, -1.52923583984375, -1.414306640625, -1.29937744140625, -1.1844482421875, -1.06951904296875, -0.95458984375, -0.83966064453125, -0.7247314453125, -0.60980224609375, -0.494873046875, -0.37994384765625, -0.2650146484375, -0.15008544921875, -0.03515625, 0.07977294921875, 0.1947021484375, 0.30963134765625, 0.424560546875, 0.53948974609375, 0.6544189453125, 0.76934814453125, 0.88427734375, 0.99920654296875, 1.1141357421875, 1.22906494140625, 1.343994140625, 1.45892333984375, 1.5738525390625, 1.68878173828125, 1.8037109375, 1.91864013671875, 2.0335693359375, 2.14849853515625, 2.263427734375, 2.37835693359375, 2.4932861328125, 2.60821533203125, 2.72314453125, 2.83807373046875, 2.9530029296875, 3.06793212890625, 3.182861328125, 3.29779052734375, 3.4127197265625, 3.52764892578125, 3.642578125]}, "gradients/decoder.transformer.h.19.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 4.0, 1.0, 2.0, 8.0, 7.0, 5.0, 14.0, 22.0, 28.0, 42.0, 55.0, 95.0, 167.0, 289.0, 448.0, 813.0, 1478.0, 2582.0, 4778.0, 8739.0, 16628.0, 33522.0, 70097.0, 162389.0, 1483254.0, 167962.0, 72197.0, 34186.0, 17247.0, 8926.0, 4853.0, 2689.0, 1532.0, 831.0, 517.0, 288.0, 160.0, 105.0, 55.0, 51.0, 21.0, 22.0, 5.0, 8.0, 6.0, 6.0, 5.0, 1.0, 2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.966796875, -1.904388427734375, -1.84197998046875, -1.779571533203125, -1.7171630859375, -1.654754638671875, -1.59234619140625, -1.529937744140625, -1.467529296875, -1.405120849609375, -1.34271240234375, -1.280303955078125, -1.2178955078125, -1.155487060546875, -1.09307861328125, -1.030670166015625, -0.96826171875, -0.905853271484375, -0.84344482421875, -0.781036376953125, -0.7186279296875, -0.656219482421875, -0.59381103515625, -0.531402587890625, -0.468994140625, -0.406585693359375, -0.34417724609375, -0.281768798828125, -0.2193603515625, -0.156951904296875, -0.09454345703125, -0.032135009765625, 0.0302734375, 0.092681884765625, 0.15509033203125, 0.217498779296875, 0.2799072265625, 0.342315673828125, 0.40472412109375, 0.467132568359375, 0.529541015625, 0.591949462890625, 0.65435791015625, 0.716766357421875, 0.7791748046875, 0.841583251953125, 0.90399169921875, 0.966400146484375, 1.02880859375, 1.091217041015625, 1.15362548828125, 1.216033935546875, 1.2784423828125, 1.340850830078125, 1.40325927734375, 1.465667724609375, 1.528076171875, 1.590484619140625, 1.65289306640625, 1.715301513671875, 1.7777099609375, 1.840118408203125, 1.90252685546875, 1.964935302734375, 2.02734375]}, "gradients/decoder.transformer.h.19.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 0.0, 0.0, 3.0, 0.0, 3.0, 1.0, 2.0, 5.0, 6.0, 8.0, 7.0, 7.0, 13.0, 17.0, 30.0, 25.0, 44.0, 67.0, 123.0, 122.0, 117.0, 105.0, 74.0, 67.0, 50.0, 28.0, 26.0, 14.0, 15.0, 11.0, 6.0, 1.0, 3.0, 3.0, 3.0, 0.0, 1.0, 1.0, 3.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.0019245147705078125, -0.0018700659275054932, -0.0018156170845031738, -0.0017611682415008545, -0.0017067193984985352, -0.0016522705554962158, -0.0015978217124938965, -0.0015433728694915771, -0.0014889240264892578, -0.0014344751834869385, -0.0013800263404846191, -0.0013255774974822998, -0.0012711286544799805, -0.0012166798114776611, -0.0011622309684753418, -0.0011077821254730225, -0.0010533332824707031, -0.0009988844394683838, -0.0009444355964660645, -0.0008899867534637451, -0.0008355379104614258, -0.0007810890674591064, -0.0007266402244567871, -0.0006721913814544678, -0.0006177425384521484, -0.0005632936954498291, -0.0005088448524475098, -0.00045439600944519043, -0.0003999471664428711, -0.00034549832344055176, -0.0002910494804382324, -0.00023660063743591309, -0.00018215179443359375, -0.00012770295143127441, -7.325410842895508e-05, -1.8805265426635742e-05, 3.5643577575683594e-05, 9.009242057800293e-05, 0.00014454126358032227, 0.0001989901065826416, 0.00025343894958496094, 0.0003078877925872803, 0.0003623366355895996, 0.00041678547859191895, 0.0004712343215942383, 0.0005256831645965576, 0.000580132007598877, 0.0006345808506011963, 0.0006890296936035156, 0.000743478536605835, 0.0007979273796081543, 0.0008523762226104736, 0.000906825065612793, 0.0009612739086151123, 0.0010157227516174316, 0.001070171594619751, 0.0011246204376220703, 0.0011790692806243896, 0.001233518123626709, 0.0012879669666290283, 0.0013424158096313477, 0.001396864652633667, 0.0014513134956359863, 0.0015057623386383057, 0.001560211181640625]}, "gradients/decoder.transformer.h.19.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 3.0, 2.0, 1.0, 2.0, 4.0, 1.0, 10.0, 12.0, 16.0, 15.0, 27.0, 41.0, 55.0, 107.0, 168.0, 421.0, 393969.0, 652778.0, 445.0, 184.0, 107.0, 56.0, 37.0, 34.0, 17.0, 9.0, 8.0, 10.0, 5.0, 5.0, 0.0, 0.0, 3.0, 5.0, 2.0, 1.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.039520263671875, -0.03810930252075195, -0.036698341369628906, -0.03528738021850586, -0.03387641906738281, -0.032465457916259766, -0.03105449676513672, -0.029643535614013672, -0.028232574462890625, -0.026821613311767578, -0.02541065216064453, -0.023999691009521484, -0.022588729858398438, -0.02117776870727539, -0.019766807556152344, -0.018355846405029297, -0.01694488525390625, -0.015533924102783203, -0.014122962951660156, -0.01271200180053711, -0.011301040649414062, -0.009890079498291016, -0.008479118347167969, -0.007068157196044922, -0.005657196044921875, -0.004246234893798828, -0.0028352737426757812, -0.0014243125915527344, -1.33514404296875e-05, 0.0013976097106933594, 0.0028085708618164062, 0.004219532012939453, 0.0056304931640625, 0.007041454315185547, 0.008452415466308594, 0.00986337661743164, 0.011274337768554688, 0.012685298919677734, 0.014096260070800781, 0.015507221221923828, 0.016918182373046875, 0.018329143524169922, 0.01974010467529297, 0.021151065826416016, 0.022562026977539062, 0.02397298812866211, 0.025383949279785156, 0.026794910430908203, 0.02820587158203125, 0.029616832733154297, 0.031027793884277344, 0.03243875503540039, 0.03384971618652344, 0.035260677337646484, 0.03667163848876953, 0.03808259963989258, 0.039493560791015625, 0.04090452194213867, 0.04231548309326172, 0.043726444244384766, 0.04513740539550781, 0.04654836654663086, 0.047959327697753906, 0.04937028884887695, 0.05078125]}, "gradients/decoder.transformer.h.19.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 14.0, 76.0, 227.0, 326.0, 241.0, 94.0, 26.0, 9.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.0019064029911532998, -0.0018708306597545743, -0.001835258211940527, -0.0017996858805418015, -0.001764113549143076, -0.0017285411013290286, -0.001692968769930303, -0.0016573963221162558, -0.0016218239907175303, -0.0015862516593188047, -0.0015506792115047574, -0.001515106880106032, -0.0014795344322919846, -0.001443962100893259, -0.0014083897694945335, -0.001372817438095808, -0.0013372449902817607, -0.0013016726588830352, -0.0012661002110689878, -0.0012305278796702623, -0.0011949555482715368, -0.0011593831004574895, -0.001123810769058764, -0.0010882383212447166, -0.0010526659898459911, -0.0010170936584472656, -0.0009815212106332183, -0.0009459488792344928, -0.0009103764896281064, -0.0008748041000217199, -0.0008392317686229944, -0.000803659379016608, -0.0007680871058255434, -0.000732514716219157, -0.0006969423266127706, -0.000661369995214045, -0.0006257976056076586, -0.0005902252160012722, -0.0005546528846025467, -0.0005190804949961603, -0.0004835080762859434, -0.0004479357157833874, -0.000412363326177001, -0.0003767909365706146, -0.0003412185760680586, -0.00030564621556550264, -0.0002700738259591162, -0.00023450146545656025, -0.00019892907585017383, -0.00016335670079570264, -0.00012778432574123144, -9.221195068676025e-05, -5.663957563228905e-05, -2.1067200577817857e-05, 1.4505174476653337e-05, 5.0077534979209304e-05, 8.564992458559573e-05, 0.00012122229964006692, 0.00015679467469453812, 0.0001923670497490093, 0.0002279394248034805, 0.00026351178530603647, 0.0002990841749124229, 0.00033465653541497886, 0.0003702289250213653]}, "gradients/decoder.transformer.h.19.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 3.0, 2.0, 5.0, 2.0, 7.0, 3.0, 8.0, 11.0, 10.0, 12.0, 18.0, 17.0, 27.0, 23.0, 17.0, 21.0, 43.0, 36.0, 28.0, 46.0, 45.0, 46.0, 48.0, 36.0, 48.0, 59.0, 43.0, 37.0, 43.0, 36.0, 33.0, 29.0, 24.0, 17.0, 15.0, 15.0, 15.0, 21.0, 15.0, 11.0, 10.0, 16.0, 7.0, 1.0, 5.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0005411505699157715, -0.0005207424983382225, -0.0005003344267606735, -0.00047992635518312454, -0.00045951828360557556, -0.0004391102120280266, -0.0004187021404504776, -0.0003982940688729286, -0.00037788599729537964, -0.00035747792571783066, -0.0003370698541402817, -0.0003166617825627327, -0.0002962537109851837, -0.00027584563940763474, -0.00025543756783008575, -0.00023502949625253677, -0.0002146214246749878, -0.0001942133530974388, -0.00017380528151988983, -0.00015339720994234085, -0.00013298913836479187, -0.00011258106678724289, -9.217299520969391e-05, -7.176492363214493e-05, -5.135685205459595e-05, -3.0948780477046967e-05, -1.0540708899497986e-05, 9.867362678050995e-06, 3.0275434255599976e-05, 5.0683505833148956e-05, 7.109157741069794e-05, 9.149964898824692e-05, 0.0001119077205657959, 0.00013231579214334488, 0.00015272386372089386, 0.00017313193529844284, 0.00019354000687599182, 0.0002139480784535408, 0.00023435615003108978, 0.00025476422160863876, 0.00027517229318618774, 0.0002955803647637367, 0.0003159884363412857, 0.0003363965079188347, 0.00035680457949638367, 0.00037721265107393265, 0.00039762072265148163, 0.0004180287942290306, 0.0004384368658065796, 0.00045884493738412857, 0.00047925300896167755, 0.0004996610805392265, 0.0005200691521167755, 0.0005404772236943245, 0.0005608852952718735, 0.0005812933668494225, 0.0006017014384269714, 0.0006221095100045204, 0.0006425175815820694, 0.0006629256531596184, 0.0006833337247371674, 0.0007037417963147163, 0.0007241498678922653, 0.0007445579394698143, 0.0007649660110473633]}, "gradients/decoder.transformer.h.19.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 4.0, 1.0, 1.0, 3.0, 2.0, 4.0, 3.0, 4.0, 6.0, 5.0, 4.0, 12.0, 9.0, 7.0, 9.0, 15.0, 24.0, 23.0, 22.0, 21.0, 24.0, 32.0, 27.0, 38.0, 42.0, 43.0, 33.0, 37.0, 42.0, 41.0, 42.0, 45.0, 36.0, 43.0, 28.0, 34.0, 28.0, 40.0, 19.0, 18.0, 24.0, 19.0, 15.0, 11.0, 18.0, 13.0, 11.0, 7.0, 3.0, 6.0, 3.0, 4.0, 1.0, 2.0, 4.0, 1.0, 1.0, 2.0, 1.0, 2.0, 2.0, 1.0], "bins": [-5.1796875, -5.01409912109375, -4.8485107421875, -4.68292236328125, -4.517333984375, -4.35174560546875, -4.1861572265625, -4.02056884765625, -3.85498046875, -3.68939208984375, -3.5238037109375, -3.35821533203125, -3.192626953125, -3.02703857421875, -2.8614501953125, -2.69586181640625, -2.5302734375, -2.36468505859375, -2.1990966796875, -2.03350830078125, -1.867919921875, -1.70233154296875, -1.5367431640625, -1.37115478515625, -1.20556640625, -1.03997802734375, -0.8743896484375, -0.70880126953125, -0.543212890625, -0.37762451171875, -0.2120361328125, -0.04644775390625, 0.119140625, 0.28472900390625, 0.4503173828125, 0.61590576171875, 0.781494140625, 0.94708251953125, 1.1126708984375, 1.27825927734375, 1.44384765625, 1.60943603515625, 1.7750244140625, 1.94061279296875, 2.106201171875, 2.27178955078125, 2.4373779296875, 2.60296630859375, 2.7685546875, 2.93414306640625, 3.0997314453125, 3.26531982421875, 3.430908203125, 3.59649658203125, 3.7620849609375, 3.92767333984375, 4.09326171875, 4.25885009765625, 4.4244384765625, 4.59002685546875, 4.755615234375, 4.92120361328125, 5.0867919921875, 5.25238037109375, 5.41796875]}, "gradients/decoder.transformer.h.19.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 4.0, 1.0, 0.0, 4.0, 1.0, 4.0, 3.0, 4.0, 4.0, 8.0, 6.0, 17.0, 10.0, 9.0, 17.0, 30.0, 52.0, 65.0, 99.0, 181.0, 286.0, 562.0, 1120.0, 2262.0, 4720.0, 11403.0, 30211.0, 92976.0, 299120.0, 396095.0, 138113.0, 43419.0, 15514.0, 6422.0, 2832.0, 1378.0, 737.0, 339.0, 176.0, 115.0, 69.0, 51.0, 28.0, 14.0, 26.0, 15.0, 10.0, 8.0, 8.0, 3.0, 4.0, 2.0, 1.0, 1.0, 6.0, 0.0, 2.0, 1.0, 2.0, 2.0, 1.0, 1.0], "bins": [-3.177734375, -3.075775146484375, -2.97381591796875, -2.871856689453125, -2.7698974609375, -2.667938232421875, -2.56597900390625, -2.464019775390625, -2.362060546875, -2.260101318359375, -2.15814208984375, -2.056182861328125, -1.9542236328125, -1.852264404296875, -1.75030517578125, -1.648345947265625, -1.54638671875, -1.444427490234375, -1.34246826171875, -1.240509033203125, -1.1385498046875, -1.036590576171875, -0.93463134765625, -0.832672119140625, -0.730712890625, -0.628753662109375, -0.52679443359375, -0.424835205078125, -0.3228759765625, -0.220916748046875, -0.11895751953125, -0.016998291015625, 0.0849609375, 0.186920166015625, 0.28887939453125, 0.390838623046875, 0.4927978515625, 0.594757080078125, 0.69671630859375, 0.798675537109375, 0.900634765625, 1.002593994140625, 1.10455322265625, 1.206512451171875, 1.3084716796875, 1.410430908203125, 1.51239013671875, 1.614349365234375, 1.71630859375, 1.818267822265625, 1.92022705078125, 2.022186279296875, 2.1241455078125, 2.226104736328125, 2.32806396484375, 2.430023193359375, 2.531982421875, 2.633941650390625, 2.73590087890625, 2.837860107421875, 2.9398193359375, 3.041778564453125, 3.14373779296875, 3.245697021484375, 3.34765625]}, "gradients/decoder.transformer.h.19.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 2.0, 1.0, 2.0, 5.0, 7.0, 6.0, 12.0, 14.0, 20.0, 16.0, 24.0, 24.0, 28.0, 29.0, 35.0, 42.0, 48.0, 52.0, 60.0, 89.0, 1782.0, 270.0, 70.0, 54.0, 47.0, 54.0, 47.0, 34.0, 30.0, 22.0, 30.0, 17.0, 20.0, 12.0, 9.0, 6.0, 7.0, 12.0, 1.0, 4.0, 8.0, 3.0, 2.0, 1.0, 1.0, 2.0, 2.0, 1.0], "bins": [-26.109375, -25.400390625, -24.69140625, -23.982421875, -23.2734375, -22.564453125, -21.85546875, -21.146484375, -20.4375, -19.728515625, -19.01953125, -18.310546875, -17.6015625, -16.892578125, -16.18359375, -15.474609375, -14.765625, -14.056640625, -13.34765625, -12.638671875, -11.9296875, -11.220703125, -10.51171875, -9.802734375, -9.09375, -8.384765625, -7.67578125, -6.966796875, -6.2578125, -5.548828125, -4.83984375, -4.130859375, -3.421875, -2.712890625, -2.00390625, -1.294921875, -0.5859375, 0.123046875, 0.83203125, 1.541015625, 2.25, 2.958984375, 3.66796875, 4.376953125, 5.0859375, 5.794921875, 6.50390625, 7.212890625, 7.921875, 8.630859375, 9.33984375, 10.048828125, 10.7578125, 11.466796875, 12.17578125, 12.884765625, 13.59375, 14.302734375, 15.01171875, 15.720703125, 16.4296875, 17.138671875, 17.84765625, 18.556640625, 19.265625]}, "gradients/decoder.transformer.h.19.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 4.0, 1.0, 0.0, 1.0, 6.0, 6.0, 4.0, 4.0, 4.0, 10.0, 5.0, 11.0, 7.0, 11.0, 23.0, 22.0, 35.0, 34.0, 52.0, 73.0, 111.0, 152.0, 282.0, 591.0, 2802.0, 2657891.0, 480139.0, 2119.0, 522.0, 254.0, 172.0, 92.0, 52.0, 49.0, 37.0, 29.0, 19.0, 20.0, 19.0, 15.0, 14.0, 7.0, 8.0, 4.0, 2.0, 2.0, 2.0, 1.0, 0.0, 1.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-31.296875, -30.135009765625, -28.97314453125, -27.811279296875, -26.6494140625, -25.487548828125, -24.32568359375, -23.163818359375, -22.001953125, -20.840087890625, -19.67822265625, -18.516357421875, -17.3544921875, -16.192626953125, -15.03076171875, -13.868896484375, -12.70703125, -11.545166015625, -10.38330078125, -9.221435546875, -8.0595703125, -6.897705078125, -5.73583984375, -4.573974609375, -3.412109375, -2.250244140625, -1.08837890625, 0.073486328125, 1.2353515625, 2.397216796875, 3.55908203125, 4.720947265625, 5.8828125, 7.044677734375, 8.20654296875, 9.368408203125, 10.5302734375, 11.692138671875, 12.85400390625, 14.015869140625, 15.177734375, 16.339599609375, 17.50146484375, 18.663330078125, 19.8251953125, 20.987060546875, 22.14892578125, 23.310791015625, 24.47265625, 25.634521484375, 26.79638671875, 27.958251953125, 29.1201171875, 30.281982421875, 31.44384765625, 32.605712890625, 33.767578125, 34.929443359375, 36.09130859375, 37.253173828125, 38.4150390625, 39.576904296875, 40.73876953125, 41.900634765625, 43.0625]}, "gradients/decoder.transformer.h.19.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 8.0, 10.0, 36.0, 101.0, 182.0, 212.0, 188.0, 157.0, 74.0, 25.0, 10.0, 8.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-29.64617919921875, -28.768712997436523, -27.891246795654297, -27.01378059387207, -26.136314392089844, -25.258848190307617, -24.38138198852539, -23.503917694091797, -22.626449584960938, -21.74898338317871, -20.871517181396484, -19.994050979614258, -19.11658477783203, -18.239118576049805, -17.361652374267578, -16.484188079833984, -15.606721878051758, -14.729255676269531, -13.851789474487305, -12.974323272705078, -12.096857070922852, -11.219390869140625, -10.341925621032715, -9.464459419250488, -8.586993217468262, -7.709527015686035, -6.832060813903809, -5.95459508895874, -5.077128887176514, -4.199662685394287, -3.3221969604492188, -2.444730758666992, -1.5672626495361328, -0.6897965669631958, 0.1876695156097412, 1.0651354789733887, 1.9426016807556152, 2.820067882537842, 3.69753360748291, 4.574999809265137, 5.452466011047363, 6.32993221282959, 7.207398414611816, 8.084863662719727, 8.962329864501953, 9.83979606628418, 10.717262268066406, 11.594728469848633, 12.47219467163086, 13.349660873413086, 14.227127075195312, 15.104593276977539, 15.982059478759766, 16.859525680541992, 17.73699188232422, 18.614456176757812, 19.491924285888672, 20.3693904876709, 21.246856689453125, 22.12432289123535, 23.001789093017578, 23.879255294799805, 24.75672149658203, 25.634185791015625, 26.51165199279785]}, "gradients/decoder.transformer.h.19.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 4.0, 3.0, 3.0, 4.0, 10.0, 6.0, 8.0, 3.0, 15.0, 14.0, 18.0, 19.0, 21.0, 21.0, 25.0, 35.0, 31.0, 43.0, 32.0, 38.0, 48.0, 46.0, 41.0, 53.0, 50.0, 44.0, 37.0, 34.0, 30.0, 30.0, 24.0, 31.0, 31.0, 16.0, 25.0, 23.0, 25.0, 16.0, 12.0, 6.0, 8.0, 7.0, 6.0, 2.0, 4.0, 4.0, 1.0, 3.0, 3.0, 2.0, 0.0, 2.0], "bins": [-69.41703033447266, -67.44730377197266, -65.47757720947266, -63.50784683227539, -61.53812026977539, -59.568389892578125, -57.598663330078125, -55.628936767578125, -53.659210205078125, -51.689483642578125, -49.71975326538086, -47.75002670288086, -45.78030014038086, -43.810569763183594, -41.840843200683594, -39.871116638183594, -37.90138626098633, -35.93165969848633, -33.96192932128906, -31.992202758789062, -30.022476196289062, -28.05274772644043, -26.083019256591797, -24.113292694091797, -22.143564224243164, -20.17383575439453, -18.20410919189453, -16.2343807220459, -14.264653205871582, -12.294925689697266, -10.325197219848633, -8.355469703674316, -6.385746002197266, -4.416018486022949, -2.4462904930114746, -0.4765625, 1.4931650161743164, 3.462892532348633, 5.432621002197266, 7.402348518371582, 9.372076034545898, 11.341803550720215, 13.311531066894531, 15.281259536743164, 17.250988006591797, 19.220714569091797, 21.19044303894043, 23.160171508789062, 25.129898071289062, 27.099626541137695, 29.069353103637695, 31.039081573486328, 33.00880813598633, 34.978538513183594, 36.948265075683594, 38.917991638183594, 40.887718200683594, 42.857444763183594, 44.82717514038086, 46.79690170288086, 48.76662826538086, 50.736358642578125, 52.706085205078125, 54.675811767578125, 56.64554214477539]}, "gradients/decoder.transformer.h.18.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 2.0, 1.0, 2.0, 3.0, 1.0, 2.0, 4.0, 3.0, 7.0, 4.0, 8.0, 7.0, 7.0, 9.0, 13.0, 10.0, 17.0, 23.0, 20.0, 29.0, 24.0, 29.0, 30.0, 39.0, 34.0, 30.0, 50.0, 40.0, 41.0, 30.0, 54.0, 38.0, 43.0, 42.0, 36.0, 34.0, 27.0, 31.0, 23.0, 24.0, 22.0, 12.0, 22.0, 15.0, 13.0, 15.0, 11.0, 6.0, 6.0, 7.0, 0.0, 5.0, 1.0, 3.0, 0.0, 3.0, 2.0, 3.0, 1.0, 2.0, 0.0, 1.0, 1.0], "bins": [-5.2265625, -5.05560302734375, -4.8846435546875, -4.71368408203125, -4.542724609375, -4.37176513671875, -4.2008056640625, -4.02984619140625, -3.85888671875, -3.68792724609375, -3.5169677734375, -3.34600830078125, -3.175048828125, -3.00408935546875, -2.8331298828125, -2.66217041015625, -2.4912109375, -2.32025146484375, -2.1492919921875, -1.97833251953125, -1.807373046875, -1.63641357421875, -1.4654541015625, -1.29449462890625, -1.12353515625, -0.95257568359375, -0.7816162109375, -0.61065673828125, -0.439697265625, -0.26873779296875, -0.0977783203125, 0.07318115234375, 0.244140625, 0.41510009765625, 0.5860595703125, 0.75701904296875, 0.927978515625, 1.09893798828125, 1.2698974609375, 1.44085693359375, 1.61181640625, 1.78277587890625, 1.9537353515625, 2.12469482421875, 2.295654296875, 2.46661376953125, 2.6375732421875, 2.80853271484375, 2.9794921875, 3.15045166015625, 3.3214111328125, 3.49237060546875, 3.663330078125, 3.83428955078125, 4.0052490234375, 4.17620849609375, 4.34716796875, 4.51812744140625, 4.6890869140625, 4.86004638671875, 5.031005859375, 5.20196533203125, 5.3729248046875, 5.54388427734375, 5.71484375]}, "gradients/decoder.transformer.h.18.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 4.0, 3.0, 3.0, 7.0, 6.0, 6.0, 10.0, 5.0, 10.0, 11.0, 9.0, 17.0, 18.0, 28.0, 31.0, 40.0, 47.0, 64.0, 92.0, 168.0, 530.0, 3087.0, 37998.0, 1082544.0, 2882811.0, 176219.0, 8707.0, 1077.0, 277.0, 111.0, 76.0, 44.0, 46.0, 27.0, 30.0, 24.0, 20.0, 13.0, 16.0, 14.0, 9.0, 11.0, 6.0, 3.0, 5.0, 2.0, 1.0, 3.0, 0.0, 2.0, 2.0, 2.0, 2.0, 2.0], "bins": [-19.296875, -18.733154296875, -18.16943359375, -17.605712890625, -17.0419921875, -16.478271484375, -15.91455078125, -15.350830078125, -14.787109375, -14.223388671875, -13.65966796875, -13.095947265625, -12.5322265625, -11.968505859375, -11.40478515625, -10.841064453125, -10.27734375, -9.713623046875, -9.14990234375, -8.586181640625, -8.0224609375, -7.458740234375, -6.89501953125, -6.331298828125, -5.767578125, -5.203857421875, -4.64013671875, -4.076416015625, -3.5126953125, -2.948974609375, -2.38525390625, -1.821533203125, -1.2578125, -0.694091796875, -0.13037109375, 0.433349609375, 0.9970703125, 1.560791015625, 2.12451171875, 2.688232421875, 3.251953125, 3.815673828125, 4.37939453125, 4.943115234375, 5.5068359375, 6.070556640625, 6.63427734375, 7.197998046875, 7.76171875, 8.325439453125, 8.88916015625, 9.452880859375, 10.0166015625, 10.580322265625, 11.14404296875, 11.707763671875, 12.271484375, 12.835205078125, 13.39892578125, 13.962646484375, 14.5263671875, 15.090087890625, 15.65380859375, 16.217529296875, 16.78125]}, "gradients/decoder.transformer.h.18.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 3.0, 7.0, 7.0, 9.0, 10.0, 18.0, 34.0, 43.0, 48.0, 80.0, 116.0, 145.0, 229.0, 302.0, 483.0, 544.0, 574.0, 403.0, 329.0, 207.0, 160.0, 98.0, 65.0, 51.0, 42.0, 26.0, 8.0, 12.0, 9.0, 6.0, 5.0, 3.0, 3.0, 2.0, 0.0, 2.0, 0.0, 1.0], "bins": [-24.765625, -24.17578125, -23.5859375, -22.99609375, -22.40625, -21.81640625, -21.2265625, -20.63671875, -20.046875, -19.45703125, -18.8671875, -18.27734375, -17.6875, -17.09765625, -16.5078125, -15.91796875, -15.328125, -14.73828125, -14.1484375, -13.55859375, -12.96875, -12.37890625, -11.7890625, -11.19921875, -10.609375, -10.01953125, -9.4296875, -8.83984375, -8.25, -7.66015625, -7.0703125, -6.48046875, -5.890625, -5.30078125, -4.7109375, -4.12109375, -3.53125, -2.94140625, -2.3515625, -1.76171875, -1.171875, -0.58203125, 0.0078125, 0.59765625, 1.1875, 1.77734375, 2.3671875, 2.95703125, 3.546875, 4.13671875, 4.7265625, 5.31640625, 5.90625, 6.49609375, 7.0859375, 7.67578125, 8.265625, 8.85546875, 9.4453125, 10.03515625, 10.625, 11.21484375, 11.8046875, 12.39453125, 12.984375]}, "gradients/decoder.transformer.h.18.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 4.0, 2.0, 2.0, 3.0, 6.0, 7.0, 6.0, 8.0, 22.0, 32.0, 36.0, 62.0, 54.0, 101.0, 120.0, 166.0, 354.0, 729.0, 4181.0, 3832153.0, 352965.0, 1777.0, 556.0, 312.0, 200.0, 117.0, 89.0, 70.0, 37.0, 39.0, 21.0, 16.0, 13.0, 7.0, 7.0, 3.0, 6.0, 3.0, 2.0, 5.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-60.4375, -57.90234375, -55.3671875, -52.83203125, -50.296875, -47.76171875, -45.2265625, -42.69140625, -40.15625, -37.62109375, -35.0859375, -32.55078125, -30.015625, -27.48046875, -24.9453125, -22.41015625, -19.875, -17.33984375, -14.8046875, -12.26953125, -9.734375, -7.19921875, -4.6640625, -2.12890625, 0.40625, 2.94140625, 5.4765625, 8.01171875, 10.546875, 13.08203125, 15.6171875, 18.15234375, 20.6875, 23.22265625, 25.7578125, 28.29296875, 30.828125, 33.36328125, 35.8984375, 38.43359375, 40.96875, 43.50390625, 46.0390625, 48.57421875, 51.109375, 53.64453125, 56.1796875, 58.71484375, 61.25, 63.78515625, 66.3203125, 68.85546875, 71.390625, 73.92578125, 76.4609375, 78.99609375, 81.53125, 84.06640625, 86.6015625, 89.13671875, 91.671875, 94.20703125, 96.7421875, 99.27734375, 101.8125]}, "gradients/decoder.transformer.h.18.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 7.0, 91.0, 533.0, 340.0, 42.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-92.80313873291016, -83.39895629882812, -73.99476623535156, -64.59058380126953, -55.1864013671875, -45.78221893310547, -36.37803268432617, -26.973846435546875, -17.569664001464844, -8.16547966003418, 1.2387046813964844, 10.642889022827148, 20.047073364257812, 29.451255798339844, 38.85544204711914, 48.25962829589844, 57.66381072998047, 67.0679931640625, 76.47218322753906, 85.8763656616211, 95.28054809570312, 104.68473052978516, 114.08891296386719, 123.49310302734375, 132.89727783203125, 142.3014678955078, 151.7056427001953, 161.10983276367188, 170.51400756835938, 179.91819763183594, 189.3223876953125, 198.7265625, 208.13076782226562, 217.5349578857422, 226.9391326904297, 236.34332275390625, 245.74749755859375, 255.1516876220703, 264.5558776855469, 273.9600524902344, 283.3642578125, 292.7684326171875, 302.1726379394531, 311.5768127441406, 320.9809875488281, 330.38519287109375, 339.78936767578125, 349.19354248046875, 358.59771728515625, 368.00189208984375, 377.4060974121094, 386.8102722167969, 396.2144470214844, 405.61865234375, 415.0228271484375, 424.427001953125, 433.8311767578125, 443.2353515625, 452.6395568847656, 462.0437316894531, 471.4479064941406, 480.85211181640625, 490.25628662109375, 499.66046142578125, 509.0646667480469]}, "gradients/decoder.transformer.h.18.ln_2.bias": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 4.0, 1.0, 2.0, 7.0, 3.0, 2.0, 12.0, 12.0, 15.0, 11.0, 19.0, 24.0, 29.0, 24.0, 27.0, 36.0, 30.0, 24.0, 38.0, 29.0, 30.0, 38.0, 44.0, 46.0, 50.0, 35.0, 43.0, 39.0, 38.0, 39.0, 30.0, 29.0, 24.0, 25.0, 29.0, 15.0, 18.0, 13.0, 14.0, 14.0, 10.0, 12.0, 8.0, 4.0, 6.0, 4.0, 4.0, 1.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-54.68250274658203, -52.85968017578125, -51.036861419677734, -49.21403884887695, -47.39122009277344, -45.568397521972656, -43.745574951171875, -41.92275619506836, -40.09993362426758, -38.2771110534668, -36.45429229736328, -34.6314697265625, -32.808650970458984, -30.985828399658203, -29.163007736206055, -27.340187072753906, -25.517366409301758, -23.69454574584961, -21.87172508239746, -20.048904418945312, -18.22608184814453, -16.403261184692383, -14.580440521240234, -12.75761890411377, -10.934798240661621, -9.111977577209473, -7.289155960083008, -5.466335296630859, -3.6435141563415527, -1.820693016052246, 0.0021276473999023438, 1.8249492645263672, 3.6477699279785156, 5.470591068267822, 7.293412208557129, 9.116232872009277, 10.939054489135742, 12.76187515258789, 14.584695816040039, 16.407516479492188, 18.23033905029297, 20.053159713745117, 21.875980377197266, 23.698802947998047, 25.521623611450195, 27.344444274902344, 29.167264938354492, 30.99008560180664, 32.812904357910156, 34.63572692871094, 36.45854568481445, 38.281368255615234, 40.10418701171875, 41.92700958251953, 43.74983215332031, 45.57265090942383, 47.39547348022461, 49.21829605102539, 51.041114807128906, 52.86393737792969, 54.6867561340332, 56.509578704833984, 58.3323974609375, 60.15522003173828, 61.97804260253906]}, "gradients/decoder.transformer.h.18.crossattention.c_proj.bias": {"_type": "histogram", "values": [3.0, 3.0, 2.0, 1.0, 2.0, 3.0, 5.0, 2.0, 4.0, 3.0, 8.0, 9.0, 12.0, 9.0, 10.0, 13.0, 15.0, 17.0, 23.0, 22.0, 26.0, 24.0, 24.0, 39.0, 44.0, 37.0, 46.0, 32.0, 44.0, 36.0, 59.0, 32.0, 43.0, 39.0, 37.0, 36.0, 26.0, 36.0, 23.0, 24.0, 18.0, 21.0, 19.0, 21.0, 12.0, 8.0, 10.0, 7.0, 5.0, 8.0, 2.0, 4.0, 4.0, 1.0, 0.0, 4.0, 2.0, 0.0, 3.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-5.2734375, -5.09552001953125, -4.9176025390625, -4.73968505859375, -4.561767578125, -4.38385009765625, -4.2059326171875, -4.02801513671875, -3.85009765625, -3.67218017578125, -3.4942626953125, -3.31634521484375, -3.138427734375, -2.96051025390625, -2.7825927734375, -2.60467529296875, -2.4267578125, -2.24884033203125, -2.0709228515625, -1.89300537109375, -1.715087890625, -1.53717041015625, -1.3592529296875, -1.18133544921875, -1.00341796875, -0.82550048828125, -0.6475830078125, -0.46966552734375, -0.291748046875, -0.11383056640625, 0.0640869140625, 0.24200439453125, 0.419921875, 0.59783935546875, 0.7757568359375, 0.95367431640625, 1.131591796875, 1.30950927734375, 1.4874267578125, 1.66534423828125, 1.84326171875, 2.02117919921875, 2.1990966796875, 2.37701416015625, 2.554931640625, 2.73284912109375, 2.9107666015625, 3.08868408203125, 3.2666015625, 3.44451904296875, 3.6224365234375, 3.80035400390625, 3.978271484375, 4.15618896484375, 4.3341064453125, 4.51202392578125, 4.68994140625, 4.86785888671875, 5.0457763671875, 5.22369384765625, 5.401611328125, 5.57952880859375, 5.7574462890625, 5.93536376953125, 6.11328125]}, "gradients/decoder.transformer.h.18.crossattention.c_proj.weight": {"_type": "histogram", "values": [3.0, 3.0, 3.0, 4.0, 3.0, 7.0, 10.0, 11.0, 18.0, 33.0, 49.0, 72.0, 119.0, 165.0, 263.0, 381.0, 542.0, 862.0, 1236.0, 1907.0, 2757.0, 4087.0, 6463.0, 9930.0, 15191.0, 24199.0, 39392.0, 68817.0, 131765.0, 354953.0, 174561.0, 84423.0, 47043.0, 28227.0, 17781.0, 11325.0, 7425.0, 4886.0, 3227.0, 2133.0, 1390.0, 971.0, 643.0, 445.0, 290.0, 199.0, 97.0, 101.0, 49.0, 38.0, 22.0, 21.0, 13.0, 7.0, 3.0, 4.0, 2.0, 0.0, 3.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.421875, -1.3739471435546875, -1.326019287109375, -1.2780914306640625, -1.23016357421875, -1.1822357177734375, -1.134307861328125, -1.0863800048828125, -1.0384521484375, -0.9905242919921875, -0.942596435546875, -0.8946685791015625, -0.84674072265625, -0.7988128662109375, -0.750885009765625, -0.7029571533203125, -0.655029296875, -0.6071014404296875, -0.559173583984375, -0.5112457275390625, -0.46331787109375, -0.4153900146484375, -0.367462158203125, -0.3195343017578125, -0.2716064453125, -0.2236785888671875, -0.175750732421875, -0.1278228759765625, -0.07989501953125, -0.0319671630859375, 0.015960693359375, 0.0638885498046875, 0.11181640625, 0.1597442626953125, 0.207672119140625, 0.2555999755859375, 0.30352783203125, 0.3514556884765625, 0.399383544921875, 0.4473114013671875, 0.4952392578125, 0.5431671142578125, 0.591094970703125, 0.6390228271484375, 0.68695068359375, 0.7348785400390625, 0.782806396484375, 0.8307342529296875, 0.878662109375, 0.9265899658203125, 0.974517822265625, 1.0224456787109375, 1.07037353515625, 1.1183013916015625, 1.166229248046875, 1.2141571044921875, 1.2620849609375, 1.3100128173828125, 1.357940673828125, 1.4058685302734375, 1.45379638671875, 1.5017242431640625, 1.549652099609375, 1.5975799560546875, 1.6455078125]}, "gradients/decoder.transformer.h.18.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 4.0, 4.0, 3.0, 5.0, 6.0, 11.0, 3.0, 6.0, 4.0, 11.0, 22.0, 17.0, 24.0, 30.0, 33.0, 39.0, 39.0, 48.0, 50.0, 52.0, 42.0, 1065.0, 41.0, 43.0, 33.0, 37.0, 33.0, 45.0, 34.0, 41.0, 38.0, 33.0, 22.0, 23.0, 22.0, 11.0, 23.0, 6.0, 17.0, 2.0, 6.0, 3.0, 1.0, 1.0, 3.0, 3.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.890625, -3.75640869140625, -3.6221923828125, -3.48797607421875, -3.353759765625, -3.21954345703125, -3.0853271484375, -2.95111083984375, -2.81689453125, -2.68267822265625, -2.5484619140625, -2.41424560546875, -2.280029296875, -2.14581298828125, -2.0115966796875, -1.87738037109375, -1.7431640625, -1.60894775390625, -1.4747314453125, -1.34051513671875, -1.206298828125, -1.07208251953125, -0.9378662109375, -0.80364990234375, -0.66943359375, -0.53521728515625, -0.4010009765625, -0.26678466796875, -0.132568359375, 0.00164794921875, 0.1358642578125, 0.27008056640625, 0.404296875, 0.53851318359375, 0.6727294921875, 0.80694580078125, 0.941162109375, 1.07537841796875, 1.2095947265625, 1.34381103515625, 1.47802734375, 1.61224365234375, 1.7464599609375, 1.88067626953125, 2.014892578125, 2.14910888671875, 2.2833251953125, 2.41754150390625, 2.5517578125, 2.68597412109375, 2.8201904296875, 2.95440673828125, 3.088623046875, 3.22283935546875, 3.3570556640625, 3.49127197265625, 3.62548828125, 3.75970458984375, 3.8939208984375, 4.02813720703125, 4.162353515625, 4.29656982421875, 4.4307861328125, 4.56500244140625, 4.69921875]}, "gradients/decoder.transformer.h.18.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 3.0, 2.0, 3.0, 1.0, 5.0, 6.0, 10.0, 21.0, 21.0, 39.0, 62.0, 88.0, 138.0, 267.0, 412.0, 750.0, 1331.0, 2483.0, 4823.0, 9215.0, 18998.0, 41224.0, 94556.0, 255864.0, 1443172.0, 122523.0, 52857.0, 24009.0, 11634.0, 5733.0, 3093.0, 1687.0, 939.0, 456.0, 267.0, 186.0, 98.0, 55.0, 34.0, 15.0, 24.0, 12.0, 11.0, 5.0, 4.0, 4.0, 3.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0], "bins": [-2.59765625, -2.52374267578125, -2.4498291015625, -2.37591552734375, -2.302001953125, -2.22808837890625, -2.1541748046875, -2.08026123046875, -2.00634765625, -1.93243408203125, -1.8585205078125, -1.78460693359375, -1.710693359375, -1.63677978515625, -1.5628662109375, -1.48895263671875, -1.4150390625, -1.34112548828125, -1.2672119140625, -1.19329833984375, -1.119384765625, -1.04547119140625, -0.9715576171875, -0.89764404296875, -0.82373046875, -0.74981689453125, -0.6759033203125, -0.60198974609375, -0.528076171875, -0.45416259765625, -0.3802490234375, -0.30633544921875, -0.232421875, -0.15850830078125, -0.0845947265625, -0.01068115234375, 0.063232421875, 0.13714599609375, 0.2110595703125, 0.28497314453125, 0.35888671875, 0.43280029296875, 0.5067138671875, 0.58062744140625, 0.654541015625, 0.72845458984375, 0.8023681640625, 0.87628173828125, 0.9501953125, 1.02410888671875, 1.0980224609375, 1.17193603515625, 1.245849609375, 1.31976318359375, 1.3936767578125, 1.46759033203125, 1.54150390625, 1.61541748046875, 1.6893310546875, 1.76324462890625, 1.837158203125, 1.91107177734375, 1.9849853515625, 2.05889892578125, 2.1328125]}, "gradients/decoder.transformer.h.18.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 2.0, 1.0, 3.0, 6.0, 4.0, 10.0, 14.0, 18.0, 14.0, 24.0, 20.0, 32.0, 35.0, 37.0, 34.0, 60.0, 55.0, 98.0, 91.0, 76.0, 61.0, 43.0, 34.0, 39.0, 38.0, 32.0, 24.0, 23.0, 17.0, 10.0, 12.0, 8.0, 7.0, 6.0, 5.0, 4.0, 4.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0009975433349609375, -0.0009656399488449097, -0.0009337365627288818, -0.000901833176612854, -0.0008699297904968262, -0.0008380264043807983, -0.0008061230182647705, -0.0007742196321487427, -0.0007423162460327148, -0.000710412859916687, -0.0006785094738006592, -0.0006466060876846313, -0.0006147027015686035, -0.0005827993154525757, -0.0005508959293365479, -0.00051899254322052, -0.0004870891571044922, -0.00045518577098846436, -0.0004232823848724365, -0.0003913789987564087, -0.00035947561264038086, -0.00032757222652435303, -0.0002956688404083252, -0.00026376545429229736, -0.00023186206817626953, -0.0001999586820602417, -0.00016805529594421387, -0.00013615190982818604, -0.0001042485237121582, -7.234513759613037e-05, -4.044175148010254e-05, -8.538365364074707e-06, 2.3365020751953125e-05, 5.526840686798096e-05, 8.717179298400879e-05, 0.00011907517910003662, 0.00015097856521606445, 0.00018288195133209229, 0.00021478533744812012, 0.00024668872356414795, 0.0002785921096801758, 0.0003104954957962036, 0.00034239888191223145, 0.0003743022680282593, 0.0004062056541442871, 0.00043810904026031494, 0.0004700124263763428, 0.0005019158124923706, 0.0005338191986083984, 0.0005657225847244263, 0.0005976259708404541, 0.0006295293569564819, 0.0006614327430725098, 0.0006933361291885376, 0.0007252395153045654, 0.0007571429014205933, 0.0007890462875366211, 0.0008209496736526489, 0.0008528530597686768, 0.0008847564458847046, 0.0009166598320007324, 0.0009485632181167603, 0.000980466604232788, 0.001012369990348816, 0.0010442733764648438]}, "gradients/decoder.transformer.h.18.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 2.0, 3.0, 4.0, 5.0, 3.0, 1.0, 6.0, 9.0, 9.0, 11.0, 13.0, 17.0, 24.0, 36.0, 38.0, 66.0, 105.0, 178.0, 358.0, 867.0, 884533.0, 160687.0, 789.0, 335.0, 164.0, 79.0, 51.0, 39.0, 42.0, 26.0, 14.0, 8.0, 7.0, 8.0, 7.0, 9.0, 2.0, 4.0, 2.0, 3.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.0288543701171875, -0.027978181838989258, -0.027101993560791016, -0.026225805282592773, -0.02534961700439453, -0.02447342872619629, -0.023597240447998047, -0.022721052169799805, -0.021844863891601562, -0.02096867561340332, -0.020092487335205078, -0.019216299057006836, -0.018340110778808594, -0.01746392250061035, -0.01658773422241211, -0.015711545944213867, -0.014835357666015625, -0.013959169387817383, -0.01308298110961914, -0.012206792831420898, -0.011330604553222656, -0.010454416275024414, -0.009578227996826172, -0.00870203971862793, -0.007825851440429688, -0.006949663162231445, -0.006073474884033203, -0.005197286605834961, -0.004321098327636719, -0.0034449100494384766, -0.0025687217712402344, -0.0016925334930419922, -0.00081634521484375, 5.984306335449219e-05, 0.0009360313415527344, 0.0018122196197509766, 0.0026884078979492188, 0.003564596176147461, 0.004440784454345703, 0.005316972732543945, 0.0061931610107421875, 0.00706934928894043, 0.007945537567138672, 0.008821725845336914, 0.009697914123535156, 0.010574102401733398, 0.01145029067993164, 0.012326478958129883, 0.013202667236328125, 0.014078855514526367, 0.01495504379272461, 0.01583123207092285, 0.016707420349121094, 0.017583608627319336, 0.018459796905517578, 0.01933598518371582, 0.020212173461914062, 0.021088361740112305, 0.021964550018310547, 0.02284073829650879, 0.02371692657470703, 0.024593114852905273, 0.025469303131103516, 0.026345491409301758, 0.0272216796875]}, "gradients/decoder.transformer.h.18.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 6.0, 41.0, 347.0, 516.0, 97.0, 8.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0035275027621537447, -0.0034557655453681946, -0.0033840283285826445, -0.0033122911117970943, -0.0032405538950115442, -0.003168816678225994, -0.003097079461440444, -0.003025342244654894, -0.0029536052606999874, -0.0028818680439144373, -0.002810130827128887, -0.002738393610343337, -0.002666656393557787, -0.002594919176772237, -0.0025231819599866867, -0.0024514449760317802, -0.0023797075264155865, -0.0023079703096300364, -0.0022362330928444862, -0.002164495876058936, -0.002092758659273386, -0.002021021442487836, -0.0019492843421176076, -0.0018775471253320575, -0.0018058099085465074, -0.0017340726917609572, -0.0016623354749754071, -0.001590598258189857, -0.0015188611578196287, -0.0014471239410340786, -0.0013753867242485285, -0.0013036495074629784, -0.001231912523508072, -0.0011601753067225218, -0.0010884380899369717, -0.0010167008731514215, -0.0009449637145735323, -0.0008732264977879822, -0.000801489339210093, -0.0007297521224245429, -0.0006580149056389928, -0.0005862776888534427, -0.0005145404720678926, -0.00044280331349000335, -0.00037106609670445323, -0.0002993288799189031, -0.00022759169223718345, -0.0001558545045554638, -8.411728776991367e-05, -1.2380085536278784e-05, 5.9357116697356105e-05, 0.000131094318930991, 0.00020283152116462588, 0.000274568737950176, 0.00034630592563189566, 0.0004180431133136153, 0.0004897803300991654, 0.0005615175468847156, 0.0006332547636702657, 0.0007049919222481549, 0.000776729139033705, 0.0008484663558192551, 0.0009202035143971443, 0.0009919407311826944, 0.0010636779479682446]}, "gradients/decoder.transformer.h.18.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 6.0, 1.0, 4.0, 4.0, 5.0, 1.0, 11.0, 8.0, 5.0, 12.0, 4.0, 19.0, 14.0, 11.0, 15.0, 23.0, 22.0, 29.0, 23.0, 18.0, 36.0, 39.0, 37.0, 39.0, 40.0, 45.0, 34.0, 36.0, 33.0, 41.0, 41.0, 26.0, 37.0, 36.0, 26.0, 23.0, 36.0, 23.0, 23.0, 15.0, 20.0, 19.0, 12.0, 16.0, 10.0, 6.0, 6.0, 9.0, 4.0, 7.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 5.0, 1.0, 0.0, 1.0], "bins": [-0.0004329085350036621, -0.00041897501796483994, -0.00040504150092601776, -0.0003911079838871956, -0.0003771744668483734, -0.00036324094980955124, -0.00034930743277072906, -0.0003353739157319069, -0.0003214403986930847, -0.00030750688165426254, -0.00029357336461544037, -0.0002796398475766182, -0.000265706330537796, -0.00025177281349897385, -0.00023783929646015167, -0.0002239057794213295, -0.00020997226238250732, -0.00019603874534368515, -0.00018210522830486298, -0.0001681717112660408, -0.00015423819422721863, -0.00014030467718839645, -0.00012637116014957428, -0.0001124376431107521, -9.850412607192993e-05, -8.457060903310776e-05, -7.063709199428558e-05, -5.670357495546341e-05, -4.2770057916641235e-05, -2.883654087781906e-05, -1.4903023838996887e-05, -9.695068001747131e-07, 1.2964010238647461e-05, 2.6897527277469635e-05, 4.083104431629181e-05, 5.476456135511398e-05, 6.869807839393616e-05, 8.263159543275833e-05, 9.65651124715805e-05, 0.00011049862951040268, 0.00012443214654922485, 0.00013836566358804703, 0.0001522991806268692, 0.00016623269766569138, 0.00018016621470451355, 0.00019409973174333572, 0.0002080332487821579, 0.00022196676582098007, 0.00023590028285980225, 0.0002498337998986244, 0.0002637673169374466, 0.00027770083397626877, 0.00029163435101509094, 0.0003055678680539131, 0.0003195013850927353, 0.00033343490213155746, 0.00034736841917037964, 0.0003613019362092018, 0.000375235453248024, 0.00038916897028684616, 0.00040310248732566833, 0.0004170360043644905, 0.0004309695214033127, 0.00044490303844213486, 0.00045883655548095703]}, "gradients/decoder.transformer.h.18.attn.c_proj.bias": {"_type": "histogram", "values": [3.0, 3.0, 2.0, 1.0, 2.0, 3.0, 5.0, 2.0, 4.0, 3.0, 8.0, 9.0, 12.0, 9.0, 10.0, 13.0, 15.0, 17.0, 23.0, 22.0, 26.0, 24.0, 24.0, 40.0, 43.0, 37.0, 46.0, 32.0, 44.0, 36.0, 59.0, 32.0, 43.0, 39.0, 37.0, 36.0, 26.0, 36.0, 23.0, 24.0, 18.0, 21.0, 19.0, 21.0, 12.0, 8.0, 10.0, 7.0, 5.0, 8.0, 2.0, 4.0, 4.0, 1.0, 0.0, 4.0, 2.0, 0.0, 3.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-5.2734375, -5.09552001953125, -4.9176025390625, -4.73968505859375, -4.561767578125, -4.38385009765625, -4.2059326171875, -4.02801513671875, -3.85009765625, -3.67218017578125, -3.4942626953125, -3.31634521484375, -3.138427734375, -2.96051025390625, -2.7825927734375, -2.60467529296875, -2.4267578125, -2.24884033203125, -2.0709228515625, -1.89300537109375, -1.715087890625, -1.53717041015625, -1.3592529296875, -1.18133544921875, -1.00341796875, -0.82550048828125, -0.6475830078125, -0.46966552734375, -0.291748046875, -0.11383056640625, 0.0640869140625, 0.24200439453125, 0.419921875, 0.59783935546875, 0.7757568359375, 0.95367431640625, 1.131591796875, 1.30950927734375, 1.4874267578125, 1.66534423828125, 1.84326171875, 2.02117919921875, 2.1990966796875, 2.37701416015625, 2.554931640625, 2.73284912109375, 2.9107666015625, 3.08868408203125, 3.2666015625, 3.44451904296875, 3.6224365234375, 3.80035400390625, 3.978271484375, 4.15618896484375, 4.3341064453125, 4.51202392578125, 4.68994140625, 4.86785888671875, 5.0457763671875, 5.22369384765625, 5.401611328125, 5.57952880859375, 5.7574462890625, 5.93536376953125, 6.11328125]}, "gradients/decoder.transformer.h.18.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 1.0, 1.0, 3.0, 1.0, 2.0, 6.0, 7.0, 6.0, 15.0, 26.0, 43.0, 44.0, 62.0, 113.0, 157.0, 206.0, 317.0, 503.0, 680.0, 997.0, 1470.0, 2216.0, 3216.0, 4847.0, 7508.0, 12538.0, 21805.0, 43421.0, 100642.0, 320403.0, 324302.0, 102170.0, 43847.0, 22076.0, 12294.0, 7714.0, 4808.0, 3244.0, 2193.0, 1455.0, 1009.0, 689.0, 472.0, 318.0, 209.0, 154.0, 95.0, 90.0, 37.0, 44.0, 37.0, 16.0, 11.0, 14.0, 8.0, 1.0, 2.0, 4.0, 0.0, 2.0, 1.0], "bins": [-3.3984375, -3.29547119140625, -3.1925048828125, -3.08953857421875, -2.986572265625, -2.88360595703125, -2.7806396484375, -2.67767333984375, -2.57470703125, -2.47174072265625, -2.3687744140625, -2.26580810546875, -2.162841796875, -2.05987548828125, -1.9569091796875, -1.85394287109375, -1.7509765625, -1.64801025390625, -1.5450439453125, -1.44207763671875, -1.339111328125, -1.23614501953125, -1.1331787109375, -1.03021240234375, -0.92724609375, -0.82427978515625, -0.7213134765625, -0.61834716796875, -0.515380859375, -0.41241455078125, -0.3094482421875, -0.20648193359375, -0.103515625, -0.00054931640625, 0.1024169921875, 0.20538330078125, 0.308349609375, 0.41131591796875, 0.5142822265625, 0.61724853515625, 0.72021484375, 0.82318115234375, 0.9261474609375, 1.02911376953125, 1.132080078125, 1.23504638671875, 1.3380126953125, 1.44097900390625, 1.5439453125, 1.64691162109375, 1.7498779296875, 1.85284423828125, 1.955810546875, 2.05877685546875, 2.1617431640625, 2.26470947265625, 2.36767578125, 2.47064208984375, 2.5736083984375, 2.67657470703125, 2.779541015625, 2.88250732421875, 2.9854736328125, 3.08843994140625, 3.19140625]}, "gradients/decoder.transformer.h.18.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 3.0, 0.0, 0.0, 2.0, 3.0, 1.0, 2.0, 5.0, 1.0, 5.0, 5.0, 17.0, 17.0, 14.0, 11.0, 16.0, 19.0, 22.0, 32.0, 32.0, 18.0, 31.0, 41.0, 30.0, 27.0, 45.0, 58.0, 147.0, 1761.0, 200.0, 75.0, 48.0, 35.0, 38.0, 45.0, 28.0, 20.0, 28.0, 29.0, 19.0, 23.0, 21.0, 14.0, 9.0, 21.0, 9.0, 9.0, 7.0, 7.0, 6.0, 2.0, 0.0, 1.0, 2.0, 2.0, 1.0, 0.0, 1.0, 2.0, 0.0, 1.0], "bins": [-17.890625, -17.326171875, -16.76171875, -16.197265625, -15.6328125, -15.068359375, -14.50390625, -13.939453125, -13.375, -12.810546875, -12.24609375, -11.681640625, -11.1171875, -10.552734375, -9.98828125, -9.423828125, -8.859375, -8.294921875, -7.73046875, -7.166015625, -6.6015625, -6.037109375, -5.47265625, -4.908203125, -4.34375, -3.779296875, -3.21484375, -2.650390625, -2.0859375, -1.521484375, -0.95703125, -0.392578125, 0.171875, 0.736328125, 1.30078125, 1.865234375, 2.4296875, 2.994140625, 3.55859375, 4.123046875, 4.6875, 5.251953125, 5.81640625, 6.380859375, 6.9453125, 7.509765625, 8.07421875, 8.638671875, 9.203125, 9.767578125, 10.33203125, 10.896484375, 11.4609375, 12.025390625, 12.58984375, 13.154296875, 13.71875, 14.283203125, 14.84765625, 15.412109375, 15.9765625, 16.541015625, 17.10546875, 17.669921875, 18.234375]}, "gradients/decoder.transformer.h.18.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 1.0, 0.0, 3.0, 3.0, 2.0, 3.0, 2.0, 10.0, 10.0, 9.0, 11.0, 15.0, 14.0, 20.0, 31.0, 31.0, 31.0, 58.0, 80.0, 103.0, 152.0, 198.0, 349.0, 709.0, 4810.0, 1267925.0, 1864134.0, 5111.0, 759.0, 318.0, 216.0, 159.0, 108.0, 65.0, 48.0, 44.0, 35.0, 29.0, 18.0, 18.0, 19.0, 17.0, 10.0, 8.0, 3.0, 6.0, 3.0, 2.0, 1.0, 2.0, 1.0, 3.0, 2.0, 0.0, 1.0, 2.0], "bins": [-37.5625, -36.4580078125, -35.353515625, -34.2490234375, -33.14453125, -32.0400390625, -30.935546875, -29.8310546875, -28.7265625, -27.6220703125, -26.517578125, -25.4130859375, -24.30859375, -23.2041015625, -22.099609375, -20.9951171875, -19.890625, -18.7861328125, -17.681640625, -16.5771484375, -15.47265625, -14.3681640625, -13.263671875, -12.1591796875, -11.0546875, -9.9501953125, -8.845703125, -7.7412109375, -6.63671875, -5.5322265625, -4.427734375, -3.3232421875, -2.21875, -1.1142578125, -0.009765625, 1.0947265625, 2.19921875, 3.3037109375, 4.408203125, 5.5126953125, 6.6171875, 7.7216796875, 8.826171875, 9.9306640625, 11.03515625, 12.1396484375, 13.244140625, 14.3486328125, 15.453125, 16.5576171875, 17.662109375, 18.7666015625, 19.87109375, 20.9755859375, 22.080078125, 23.1845703125, 24.2890625, 25.3935546875, 26.498046875, 27.6025390625, 28.70703125, 29.8115234375, 30.916015625, 32.0205078125, 33.125]}, "gradients/decoder.transformer.h.18.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 36.0, 278.0, 548.0, 141.0, 11.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-92.32762908935547, -89.78218078613281, -87.23673248291016, -84.6912841796875, -82.14583587646484, -79.60038757324219, -77.05493927001953, -74.50949096679688, -71.96403503417969, -69.41858673095703, -66.87313842773438, -64.32769012451172, -61.78224182128906, -59.236793518066406, -56.691341400146484, -54.14589309692383, -51.60044860839844, -49.05500030517578, -46.509552001953125, -43.96410369873047, -41.41865539550781, -38.873207092285156, -36.327754974365234, -33.78230667114258, -31.236858367919922, -28.691410064697266, -26.14596176147461, -23.60051155090332, -21.055063247680664, -18.509614944458008, -15.964165687561035, -13.418716430664062, -10.873268127441406, -8.32781982421875, -5.782370567321777, -3.236921787261963, -0.6914730072021484, 1.8539752960205078, 4.3994245529174805, 6.944873809814453, 9.49032211303711, 12.035770416259766, 14.581219673156738, 17.12666893005371, 19.672117233276367, 22.217565536499023, 24.763015747070312, 27.30846405029297, 29.853912353515625, 32.39936065673828, 34.94480895996094, 37.490257263183594, 40.03570556640625, 42.581153869628906, 45.12660598754883, 47.672054290771484, 50.21750259399414, 52.7629508972168, 55.30839920043945, 57.85384750366211, 60.39929962158203, 62.94474792480469, 65.49019622802734, 68.03564453125, 70.58109283447266]}, "gradients/decoder.transformer.h.18.ln_1.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 3.0, 3.0, 6.0, 3.0, 6.0, 7.0, 6.0, 9.0, 8.0, 14.0, 16.0, 21.0, 16.0, 18.0, 19.0, 22.0, 23.0, 34.0, 29.0, 37.0, 46.0, 41.0, 38.0, 42.0, 45.0, 41.0, 48.0, 46.0, 44.0, 25.0, 35.0, 43.0, 36.0, 27.0, 29.0, 10.0, 22.0, 15.0, 14.0, 12.0, 10.0, 8.0, 9.0, 10.0, 0.0, 5.0, 7.0, 4.0, 2.0, 3.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-52.57984924316406, -50.75518035888672, -48.930511474609375, -47.10584259033203, -45.28117752075195, -43.45650863647461, -41.631839752197266, -39.80717086791992, -37.98250198364258, -36.157833099365234, -34.33316421508789, -32.50849914550781, -30.683828353881836, -28.859161376953125, -27.03449249267578, -25.209823608398438, -23.385156631469727, -21.560487747192383, -19.735820770263672, -17.911151885986328, -16.086483001708984, -14.261815071105957, -12.43714714050293, -10.612478256225586, -8.787810325622559, -6.963141918182373, -5.1384735107421875, -3.31380558013916, -1.4891371726989746, 0.33553123474121094, 2.1601991653442383, 3.984868049621582, 5.809535980224609, 7.634204387664795, 9.45887279510498, 11.283540725708008, 13.108209609985352, 14.932877540588379, 16.757545471191406, 18.58221435546875, 20.406883239746094, 22.231552124023438, 24.05621910095215, 25.880887985229492, 27.705556869506836, 29.530223846435547, 31.35489273071289, 33.179561614990234, 35.00422668457031, 36.828895568847656, 38.653564453125, 40.478233337402344, 42.30289840698242, 44.127567291259766, 45.95223617553711, 47.77690505981445, 49.6015739440918, 51.42624282836914, 53.250911712646484, 55.07557678222656, 56.900245666503906, 58.72491455078125, 60.549583435058594, 62.37425231933594, 64.19892120361328]}, "gradients/decoder.transformer.h.17.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 3.0, 2.0, 1.0, 1.0, 4.0, 4.0, 3.0, 2.0, 5.0, 9.0, 6.0, 14.0, 6.0, 18.0, 9.0, 15.0, 20.0, 23.0, 20.0, 32.0, 34.0, 32.0, 36.0, 36.0, 38.0, 44.0, 38.0, 48.0, 48.0, 50.0, 39.0, 40.0, 38.0, 31.0, 49.0, 20.0, 27.0, 31.0, 17.0, 21.0, 20.0, 16.0, 8.0, 11.0, 11.0, 8.0, 6.0, 8.0, 2.0, 2.0, 4.0, 2.0, 0.0, 4.0, 1.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-5.46484375, -5.27679443359375, -5.0887451171875, -4.90069580078125, -4.712646484375, -4.52459716796875, -4.3365478515625, -4.14849853515625, -3.96044921875, -3.77239990234375, -3.5843505859375, -3.39630126953125, -3.208251953125, -3.02020263671875, -2.8321533203125, -2.64410400390625, -2.4560546875, -2.26800537109375, -2.0799560546875, -1.89190673828125, -1.703857421875, -1.51580810546875, -1.3277587890625, -1.13970947265625, -0.95166015625, -0.76361083984375, -0.5755615234375, -0.38751220703125, -0.199462890625, -0.01141357421875, 0.1766357421875, 0.36468505859375, 0.552734375, 0.74078369140625, 0.9288330078125, 1.11688232421875, 1.304931640625, 1.49298095703125, 1.6810302734375, 1.86907958984375, 2.05712890625, 2.24517822265625, 2.4332275390625, 2.62127685546875, 2.809326171875, 2.99737548828125, 3.1854248046875, 3.37347412109375, 3.5615234375, 3.74957275390625, 3.9376220703125, 4.12567138671875, 4.313720703125, 4.50177001953125, 4.6898193359375, 4.87786865234375, 5.06591796875, 5.25396728515625, 5.4420166015625, 5.63006591796875, 5.818115234375, 6.00616455078125, 6.1942138671875, 6.38226318359375, 6.5703125]}, "gradients/decoder.transformer.h.17.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 3.0, 0.0, 4.0, 0.0, 4.0, 5.0, 1.0, 6.0, 13.0, 12.0, 16.0, 25.0, 29.0, 32.0, 38.0, 66.0, 66.0, 96.0, 196.0, 253.0, 426.0, 691.0, 1242.0, 2242.0, 4572.0, 10241.0, 25813.0, 69927.0, 200910.0, 536122.0, 1101646.0, 1203087.0, 647089.0, 245882.0, 86858.0, 32579.0, 12562.0, 5415.0, 2619.0, 1402.0, 739.0, 412.0, 304.0, 172.0, 146.0, 85.0, 56.0, 49.0, 27.0, 29.0, 23.0, 23.0, 14.0, 5.0, 4.0, 9.0, 4.0, 4.0, 3.0, 0.0, 1.0, 1.0, 3.0], "bins": [-5.359375, -5.19244384765625, -5.0255126953125, -4.85858154296875, -4.691650390625, -4.52471923828125, -4.3577880859375, -4.19085693359375, -4.02392578125, -3.85699462890625, -3.6900634765625, -3.52313232421875, -3.356201171875, -3.18927001953125, -3.0223388671875, -2.85540771484375, -2.6884765625, -2.52154541015625, -2.3546142578125, -2.18768310546875, -2.020751953125, -1.85382080078125, -1.6868896484375, -1.51995849609375, -1.35302734375, -1.18609619140625, -1.0191650390625, -0.85223388671875, -0.685302734375, -0.51837158203125, -0.3514404296875, -0.18450927734375, -0.017578125, 0.14935302734375, 0.3162841796875, 0.48321533203125, 0.650146484375, 0.81707763671875, 0.9840087890625, 1.15093994140625, 1.31787109375, 1.48480224609375, 1.6517333984375, 1.81866455078125, 1.985595703125, 2.15252685546875, 2.3194580078125, 2.48638916015625, 2.6533203125, 2.82025146484375, 2.9871826171875, 3.15411376953125, 3.321044921875, 3.48797607421875, 3.6549072265625, 3.82183837890625, 3.98876953125, 4.15570068359375, 4.3226318359375, 4.48956298828125, 4.656494140625, 4.82342529296875, 4.9903564453125, 5.15728759765625, 5.32421875]}, "gradients/decoder.transformer.h.17.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 2.0, 2.0, 3.0, 5.0, 5.0, 3.0, 7.0, 7.0, 7.0, 8.0, 14.0, 12.0, 22.0, 32.0, 42.0, 44.0, 60.0, 68.0, 83.0, 115.0, 166.0, 225.0, 273.0, 335.0, 353.0, 397.0, 355.0, 355.0, 229.0, 210.0, 143.0, 102.0, 93.0, 75.0, 59.0, 40.0, 36.0, 28.0, 18.0, 15.0, 14.0, 7.0, 6.0, 6.0, 2.0, 3.0, 1.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-14.0859375, -13.6759033203125, -13.265869140625, -12.8558349609375, -12.44580078125, -12.0357666015625, -11.625732421875, -11.2156982421875, -10.8056640625, -10.3956298828125, -9.985595703125, -9.5755615234375, -9.16552734375, -8.7554931640625, -8.345458984375, -7.9354248046875, -7.525390625, -7.1153564453125, -6.705322265625, -6.2952880859375, -5.88525390625, -5.4752197265625, -5.065185546875, -4.6551513671875, -4.2451171875, -3.8350830078125, -3.425048828125, -3.0150146484375, -2.60498046875, -2.1949462890625, -1.784912109375, -1.3748779296875, -0.96484375, -0.5548095703125, -0.144775390625, 0.2652587890625, 0.67529296875, 1.0853271484375, 1.495361328125, 1.9053955078125, 2.3154296875, 2.7254638671875, 3.135498046875, 3.5455322265625, 3.95556640625, 4.3656005859375, 4.775634765625, 5.1856689453125, 5.595703125, 6.0057373046875, 6.415771484375, 6.8258056640625, 7.23583984375, 7.6458740234375, 8.055908203125, 8.4659423828125, 8.8759765625, 9.2860107421875, 9.696044921875, 10.1060791015625, 10.51611328125, 10.9261474609375, 11.336181640625, 11.7462158203125, 12.15625]}, "gradients/decoder.transformer.h.17.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 2.0, 5.0, 3.0, 2.0, 3.0, 7.0, 12.0, 11.0, 13.0, 27.0, 26.0, 43.0, 45.0, 57.0, 84.0, 116.0, 175.0, 288.0, 478.0, 2464.0, 1855896.0, 2330452.0, 2622.0, 528.0, 281.0, 179.0, 124.0, 100.0, 60.0, 41.0, 29.0, 34.0, 22.0, 13.0, 12.0, 10.0, 6.0, 5.0, 6.0, 4.0, 2.0, 2.0, 1.0, 1.0, 1.0, 0.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-73.1875, -70.9013671875, -68.615234375, -66.3291015625, -64.04296875, -61.7568359375, -59.470703125, -57.1845703125, -54.8984375, -52.6123046875, -50.326171875, -48.0400390625, -45.75390625, -43.4677734375, -41.181640625, -38.8955078125, -36.609375, -34.3232421875, -32.037109375, -29.7509765625, -27.46484375, -25.1787109375, -22.892578125, -20.6064453125, -18.3203125, -16.0341796875, -13.748046875, -11.4619140625, -9.17578125, -6.8896484375, -4.603515625, -2.3173828125, -0.03125, 2.2548828125, 4.541015625, 6.8271484375, 9.11328125, 11.3994140625, 13.685546875, 15.9716796875, 18.2578125, 20.5439453125, 22.830078125, 25.1162109375, 27.40234375, 29.6884765625, 31.974609375, 34.2607421875, 36.546875, 38.8330078125, 41.119140625, 43.4052734375, 45.69140625, 47.9775390625, 50.263671875, 52.5498046875, 54.8359375, 57.1220703125, 59.408203125, 61.6943359375, 63.98046875, 66.2666015625, 68.552734375, 70.8388671875, 73.125]}, "gradients/decoder.transformer.h.17.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 39.0, 200.0, 464.0, 265.0, 45.0, 1.0, 2.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-166.6898193359375, -159.6375274658203, -152.58523559570312, -145.53292846679688, -138.4806365966797, -131.4283447265625, -124.37605285644531, -117.32376098632812, -110.27146911621094, -103.21917724609375, -96.16687774658203, -89.11458587646484, -82.06229400634766, -75.00999450683594, -67.95770263671875, -60.90541076660156, -53.853111267089844, -46.80081558227539, -39.7485237121582, -32.69622802734375, -25.64393424987793, -18.59164047241211, -11.539344787597656, -4.487052917480469, 2.5652427673339844, 9.617536544799805, 16.669830322265625, 23.722126007080078, 30.7744197845459, 37.82671356201172, 44.87900924682617, 51.93130111694336, 58.98359680175781, 66.035888671875, 73.08818817138672, 80.1404800415039, 87.1927719116211, 94.24507141113281, 101.29736328125, 108.34965515136719, 115.40194702148438, 122.45423889160156, 129.50653076171875, 136.558837890625, 143.6111297607422, 150.66342163085938, 157.71571350097656, 164.76800537109375, 171.8203125, 178.8726043701172, 185.92489624023438, 192.97720336914062, 200.0294952392578, 207.081787109375, 214.1340789794922, 221.18637084960938, 228.23866271972656, 235.29095458984375, 242.34324645996094, 249.39553833007812, 256.4478454589844, 263.5001220703125, 270.55242919921875, 277.604736328125, 284.6570129394531]}, "gradients/decoder.transformer.h.17.ln_2.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 1.0, 1.0, 2.0, 4.0, 5.0, 4.0, 6.0, 10.0, 9.0, 8.0, 10.0, 11.0, 16.0, 22.0, 13.0, 16.0, 30.0, 42.0, 35.0, 26.0, 39.0, 26.0, 28.0, 39.0, 45.0, 41.0, 43.0, 28.0, 36.0, 37.0, 28.0, 37.0, 45.0, 25.0, 29.0, 26.0, 29.0, 27.0, 19.0, 20.0, 18.0, 14.0, 14.0, 2.0, 11.0, 7.0, 9.0, 4.0, 9.0, 2.0, 3.0, 5.0, 1.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-46.62152099609375, -45.07754898071289, -43.53357696533203, -41.98960876464844, -40.44563674926758, -38.90166473388672, -37.357696533203125, -35.813724517822266, -34.269752502441406, -32.72578048706055, -31.18181037902832, -29.637840270996094, -28.093868255615234, -26.549896240234375, -25.00592613220215, -23.461956024169922, -21.917984008789062, -20.374011993408203, -18.830041885375977, -17.28607177734375, -15.74209976196289, -14.198128700256348, -12.654157638549805, -11.110186576843262, -9.566215515136719, -8.022244453430176, -6.478273391723633, -4.93430233001709, -3.390331268310547, -1.846360206604004, -0.30238914489746094, 1.241581916809082, 2.785552978515625, 4.329524040222168, 5.873495101928711, 7.417466163635254, 8.961437225341797, 10.50540828704834, 12.049379348754883, 13.593350410461426, 15.137321472167969, 16.681293487548828, 18.225263595581055, 19.76923370361328, 21.31320571899414, 22.857177734375, 24.401147842407227, 25.945117950439453, 27.489089965820312, 29.033061981201172, 30.5770320892334, 32.121002197265625, 33.664974212646484, 35.208946228027344, 36.75291442871094, 38.2968864440918, 39.840858459472656, 41.384830474853516, 42.928802490234375, 44.47277069091797, 46.01674270629883, 47.56071472167969, 49.10468292236328, 50.64865493774414, 52.192626953125]}, "gradients/decoder.transformer.h.17.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 2.0, 2.0, 0.0, 0.0, 3.0, 4.0, 4.0, 4.0, 4.0, 3.0, 5.0, 8.0, 8.0, 20.0, 11.0, 14.0, 15.0, 18.0, 28.0, 25.0, 33.0, 33.0, 38.0, 34.0, 50.0, 34.0, 37.0, 44.0, 47.0, 47.0, 52.0, 39.0, 53.0, 42.0, 28.0, 31.0, 30.0, 25.0, 25.0, 12.0, 21.0, 8.0, 12.0, 16.0, 14.0, 11.0, 6.0, 5.0, 5.0, 2.0, 3.0, 1.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-6.16796875, -5.971435546875, -5.77490234375, -5.578369140625, -5.3818359375, -5.185302734375, -4.98876953125, -4.792236328125, -4.595703125, -4.399169921875, -4.20263671875, -4.006103515625, -3.8095703125, -3.613037109375, -3.41650390625, -3.219970703125, -3.0234375, -2.826904296875, -2.63037109375, -2.433837890625, -2.2373046875, -2.040771484375, -1.84423828125, -1.647705078125, -1.451171875, -1.254638671875, -1.05810546875, -0.861572265625, -0.6650390625, -0.468505859375, -0.27197265625, -0.075439453125, 0.12109375, 0.317626953125, 0.51416015625, 0.710693359375, 0.9072265625, 1.103759765625, 1.30029296875, 1.496826171875, 1.693359375, 1.889892578125, 2.08642578125, 2.282958984375, 2.4794921875, 2.676025390625, 2.87255859375, 3.069091796875, 3.265625, 3.462158203125, 3.65869140625, 3.855224609375, 4.0517578125, 4.248291015625, 4.44482421875, 4.641357421875, 4.837890625, 5.034423828125, 5.23095703125, 5.427490234375, 5.6240234375, 5.820556640625, 6.01708984375, 6.213623046875, 6.41015625]}, "gradients/decoder.transformer.h.17.crossattention.c_proj.weight": {"_type": "histogram", "values": [3.0, 0.0, 5.0, 9.0, 11.0, 14.0, 17.0, 33.0, 45.0, 60.0, 65.0, 111.0, 168.0, 219.0, 281.0, 416.0, 570.0, 786.0, 1209.0, 1640.0, 2104.0, 3149.0, 4229.0, 5846.0, 8546.0, 12470.0, 17831.0, 26353.0, 40503.0, 63535.0, 110549.0, 266065.0, 207910.0, 98101.0, 58349.0, 37241.0, 24264.0, 16621.0, 11512.0, 7909.0, 5674.0, 4037.0, 2790.0, 2028.0, 1511.0, 1130.0, 771.0, 515.0, 386.0, 292.0, 194.0, 146.0, 100.0, 79.0, 49.0, 33.0, 29.0, 27.0, 12.0, 13.0, 5.0, 2.0, 2.0, 2.0], "bins": [-1.283203125, -1.24298095703125, -1.2027587890625, -1.16253662109375, -1.122314453125, -1.08209228515625, -1.0418701171875, -1.00164794921875, -0.96142578125, -0.92120361328125, -0.8809814453125, -0.84075927734375, -0.800537109375, -0.76031494140625, -0.7200927734375, -0.67987060546875, -0.6396484375, -0.59942626953125, -0.5592041015625, -0.51898193359375, -0.478759765625, -0.43853759765625, -0.3983154296875, -0.35809326171875, -0.31787109375, -0.27764892578125, -0.2374267578125, -0.19720458984375, -0.156982421875, -0.11676025390625, -0.0765380859375, -0.03631591796875, 0.00390625, 0.04412841796875, 0.0843505859375, 0.12457275390625, 0.164794921875, 0.20501708984375, 0.2452392578125, 0.28546142578125, 0.32568359375, 0.36590576171875, 0.4061279296875, 0.44635009765625, 0.486572265625, 0.52679443359375, 0.5670166015625, 0.60723876953125, 0.6474609375, 0.68768310546875, 0.7279052734375, 0.76812744140625, 0.808349609375, 0.84857177734375, 0.8887939453125, 0.92901611328125, 0.96923828125, 1.00946044921875, 1.0496826171875, 1.08990478515625, 1.130126953125, 1.17034912109375, 1.2105712890625, 1.25079345703125, 1.291015625]}, "gradients/decoder.transformer.h.17.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 5.0, 2.0, 1.0, 5.0, 4.0, 3.0, 10.0, 10.0, 15.0, 8.0, 16.0, 24.0, 19.0, 23.0, 29.0, 30.0, 24.0, 31.0, 34.0, 43.0, 27.0, 46.0, 40.0, 54.0, 1073.0, 40.0, 36.0, 33.0, 36.0, 37.0, 48.0, 29.0, 33.0, 16.0, 21.0, 22.0, 21.0, 20.0, 15.0, 18.0, 10.0, 6.0, 5.0, 5.0, 2.0, 3.0, 4.0, 2.0, 5.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-4.35546875, -4.226409912109375, -4.09735107421875, -3.968292236328125, -3.8392333984375, -3.710174560546875, -3.58111572265625, -3.452056884765625, -3.322998046875, -3.193939208984375, -3.06488037109375, -2.935821533203125, -2.8067626953125, -2.677703857421875, -2.54864501953125, -2.419586181640625, -2.29052734375, -2.161468505859375, -2.03240966796875, -1.903350830078125, -1.7742919921875, -1.645233154296875, -1.51617431640625, -1.387115478515625, -1.258056640625, -1.128997802734375, -0.99993896484375, -0.870880126953125, -0.7418212890625, -0.612762451171875, -0.48370361328125, -0.354644775390625, -0.2255859375, -0.096527099609375, 0.03253173828125, 0.161590576171875, 0.2906494140625, 0.419708251953125, 0.54876708984375, 0.677825927734375, 0.806884765625, 0.935943603515625, 1.06500244140625, 1.194061279296875, 1.3231201171875, 1.452178955078125, 1.58123779296875, 1.710296630859375, 1.83935546875, 1.968414306640625, 2.09747314453125, 2.226531982421875, 2.3555908203125, 2.484649658203125, 2.61370849609375, 2.742767333984375, 2.871826171875, 3.000885009765625, 3.12994384765625, 3.259002685546875, 3.3880615234375, 3.517120361328125, 3.64617919921875, 3.775238037109375, 3.904296875]}, "gradients/decoder.transformer.h.17.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 3.0, 4.0, 4.0, 5.0, 3.0, 6.0, 11.0, 12.0, 28.0, 43.0, 46.0, 99.0, 177.0, 266.0, 466.0, 893.0, 1572.0, 2760.0, 5251.0, 10042.0, 19170.0, 38006.0, 79791.0, 183331.0, 1480555.0, 143425.0, 65365.0, 31411.0, 15986.0, 8430.0, 4518.0, 2381.0, 1328.0, 720.0, 376.0, 270.0, 131.0, 90.0, 52.0, 35.0, 32.0, 17.0, 9.0, 7.0, 4.0, 7.0, 2.0, 2.0, 4.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.1484375, -2.07769775390625, -2.0069580078125, -1.93621826171875, -1.865478515625, -1.79473876953125, -1.7239990234375, -1.65325927734375, -1.58251953125, -1.51177978515625, -1.4410400390625, -1.37030029296875, -1.299560546875, -1.22882080078125, -1.1580810546875, -1.08734130859375, -1.0166015625, -0.94586181640625, -0.8751220703125, -0.80438232421875, -0.733642578125, -0.66290283203125, -0.5921630859375, -0.52142333984375, -0.45068359375, -0.37994384765625, -0.3092041015625, -0.23846435546875, -0.167724609375, -0.09698486328125, -0.0262451171875, 0.04449462890625, 0.115234375, 0.18597412109375, 0.2567138671875, 0.32745361328125, 0.398193359375, 0.46893310546875, 0.5396728515625, 0.61041259765625, 0.68115234375, 0.75189208984375, 0.8226318359375, 0.89337158203125, 0.964111328125, 1.03485107421875, 1.1055908203125, 1.17633056640625, 1.2470703125, 1.31781005859375, 1.3885498046875, 1.45928955078125, 1.530029296875, 1.60076904296875, 1.6715087890625, 1.74224853515625, 1.81298828125, 1.88372802734375, 1.9544677734375, 2.02520751953125, 2.095947265625, 2.16668701171875, 2.2374267578125, 2.30816650390625, 2.37890625]}, "gradients/decoder.transformer.h.17.crossattention.q_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 3.0, 1.0, 2.0, 2.0, 1.0, 5.0, 3.0, 5.0, 5.0, 11.0, 4.0, 7.0, 12.0, 14.0, 10.0, 16.0, 16.0, 32.0, 29.0, 24.0, 36.0, 46.0, 58.0, 65.0, 65.0, 79.0, 62.0, 49.0, 67.0, 41.0, 32.0, 38.0, 28.0, 23.0, 22.0, 18.0, 18.0, 10.0, 13.0, 7.0, 10.0, 5.0, 3.0, 2.0, 3.0, 6.0, 3.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0009646415710449219, -0.0009372234344482422, -0.0009098052978515625, -0.0008823871612548828, -0.0008549690246582031, -0.0008275508880615234, -0.0008001327514648438, -0.0007727146148681641, -0.0007452964782714844, -0.0007178783416748047, -0.000690460205078125, -0.0006630420684814453, -0.0006356239318847656, -0.0006082057952880859, -0.0005807876586914062, -0.0005533695220947266, -0.0005259513854980469, -0.0004985332489013672, -0.0004711151123046875, -0.0004436969757080078, -0.0004162788391113281, -0.00038886070251464844, -0.00036144256591796875, -0.00033402442932128906, -0.0003066062927246094, -0.0002791881561279297, -0.00025177001953125, -0.0002243518829345703, -0.00019693374633789062, -0.00016951560974121094, -0.00014209747314453125, -0.00011467933654785156, -8.726119995117188e-05, -5.984306335449219e-05, -3.24249267578125e-05, -5.0067901611328125e-06, 2.2411346435546875e-05, 4.982948303222656e-05, 7.724761962890625e-05, 0.00010466575622558594, 0.00013208389282226562, 0.0001595020294189453, 0.000186920166015625, 0.0002143383026123047, 0.00024175643920898438, 0.00026917457580566406, 0.00029659271240234375, 0.00032401084899902344, 0.0003514289855957031, 0.0003788471221923828, 0.0004062652587890625, 0.0004336833953857422, 0.0004611015319824219, 0.0004885196685791016, 0.0005159378051757812, 0.0005433559417724609, 0.0005707740783691406, 0.0005981922149658203, 0.0006256103515625, 0.0006530284881591797, 0.0006804466247558594, 0.0007078647613525391, 0.0007352828979492188, 0.0007627010345458984, 0.0007901191711425781]}, "gradients/decoder.transformer.h.17.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 3.0, 4.0, 5.0, 3.0, 7.0, 2.0, 6.0, 8.0, 14.0, 22.0, 6.0, 22.0, 16.0, 38.0, 48.0, 54.0, 84.0, 141.0, 287.0, 568.0, 4240.0, 1034887.0, 6645.0, 657.0, 297.0, 155.0, 104.0, 55.0, 43.0, 24.0, 18.0, 22.0, 14.0, 16.0, 7.0, 9.0, 8.0, 1.0, 7.0, 2.0, 4.0, 2.0, 1.0, 1.0, 2.0, 1.0, 2.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 2.0], "bins": [-0.019256591796875, -0.018581867218017578, -0.017907142639160156, -0.017232418060302734, -0.016557693481445312, -0.01588296890258789, -0.015208244323730469, -0.014533519744873047, -0.013858795166015625, -0.013184070587158203, -0.012509346008300781, -0.01183462142944336, -0.011159896850585938, -0.010485172271728516, -0.009810447692871094, -0.009135723114013672, -0.00846099853515625, -0.007786273956298828, -0.007111549377441406, -0.006436824798583984, -0.0057621002197265625, -0.005087375640869141, -0.004412651062011719, -0.003737926483154297, -0.003063201904296875, -0.002388477325439453, -0.0017137527465820312, -0.0010390281677246094, -0.0003643035888671875, 0.0003104209899902344, 0.0009851455688476562, 0.0016598701477050781, 0.0023345947265625, 0.003009319305419922, 0.0036840438842773438, 0.004358768463134766, 0.0050334930419921875, 0.005708217620849609, 0.006382942199707031, 0.007057666778564453, 0.007732391357421875, 0.008407115936279297, 0.009081840515136719, 0.00975656509399414, 0.010431289672851562, 0.011106014251708984, 0.011780738830566406, 0.012455463409423828, 0.01313018798828125, 0.013804912567138672, 0.014479637145996094, 0.015154361724853516, 0.015829086303710938, 0.01650381088256836, 0.01717853546142578, 0.017853260040283203, 0.018527984619140625, 0.019202709197998047, 0.01987743377685547, 0.02055215835571289, 0.021226882934570312, 0.021901607513427734, 0.022576332092285156, 0.023251056671142578, 0.02392578125]}, "gradients/decoder.transformer.h.17.ln_cross_attn.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 4.0, 12.0, 27.0, 42.0, 99.0, 165.0, 195.0, 168.0, 135.0, 90.0, 42.0, 17.0, 12.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.0005094473017379642, -0.0004850604454986751, -0.00046067358925938606, -0.000436286733020097, -0.00041189987678080797, -0.0003875130205415189, -0.0003631261351983994, -0.0003387392789591104, -0.00031435242271982133, -0.0002899655664805323, -0.00026557871024124324, -0.00024119183945003897, -0.00021680498321074992, -0.00019241812697146088, -0.0001680312561802566, -0.00014364439994096756, -0.00011925754370167851, -9.487068746238947e-05, -7.048382394714281e-05, -4.609696043189615e-05, -2.1710104192607105e-05, 2.6767520466819406e-06, 2.7063622837886214e-05, 5.145047907717526e-05, 7.58373353164643e-05, 0.00010022419155575335, 0.0001246110477950424, 0.00014899791858624667, 0.00017338477482553571, 0.00019777163106482476, 0.00022215850185602903, 0.0002465453580953181, 0.0002709322143346071, 0.00029531907057389617, 0.0003197059268131852, 0.00034409278305247426, 0.0003684796392917633, 0.00039286649553105235, 0.00041725338087417185, 0.0004416402371134609, 0.00046602709335274994, 0.0004904139786958694, 0.0005148008349351585, 0.0005391876911744475, 0.0005635745474137366, 0.0005879614036530256, 0.0006123482598923147, 0.0006367351161316037, 0.0006611219723708928, 0.0006855088286101818, 0.0007098956848494709, 0.0007342825410887599, 0.0007586693973280489, 0.000783056253567338, 0.000807443168014288, 0.0008318299660459161, 0.000856216880492866, 0.0008806037367321551, 0.0009049905929714441, 0.0009293774492107332, 0.0009537643054500222, 0.0009781512198969722, 0.0010025380179286003, 0.0010269249323755503, 0.0010513117304071784]}, "gradients/decoder.transformer.h.17.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 3.0, 3.0, 3.0, 5.0, 2.0, 4.0, 9.0, 10.0, 14.0, 11.0, 11.0, 12.0, 13.0, 20.0, 25.0, 23.0, 29.0, 29.0, 35.0, 37.0, 35.0, 39.0, 51.0, 42.0, 42.0, 47.0, 37.0, 50.0, 51.0, 36.0, 23.0, 36.0, 36.0, 29.0, 31.0, 25.0, 15.0, 17.0, 10.0, 12.0, 10.0, 11.0, 13.0, 8.0, 3.0, 3.0, 1.0, 1.0, 4.0, 1.0, 1.0], "bins": [-0.0005670785903930664, -0.0005522938445210457, -0.000537509098649025, -0.0005227243527770042, -0.0005079396069049835, -0.0004931548610329628, -0.0004783701151609421, -0.00046358536928892136, -0.00044880062341690063, -0.0004340158775448799, -0.0004192311316728592, -0.00040444638580083847, -0.00038966163992881775, -0.00037487689405679703, -0.0003600921481847763, -0.0003453074023127556, -0.00033052265644073486, -0.00031573791056871414, -0.0003009531646966934, -0.0002861684188246727, -0.000271383672952652, -0.00025659892708063126, -0.00024181418120861053, -0.0002270294353365898, -0.0002122446894645691, -0.00019745994359254837, -0.00018267519772052765, -0.00016789045184850693, -0.0001531057059764862, -0.00013832096010446548, -0.00012353621423244476, -0.00010875146836042404, -9.396672248840332e-05, -7.91819766163826e-05, -6.439723074436188e-05, -4.9612484872341156e-05, -3.4827739000320435e-05, -2.0042993128299713e-05, -5.258247256278992e-06, 9.52649861574173e-06, 2.431124448776245e-05, 3.909599035978317e-05, 5.3880736231803894e-05, 6.866548210382462e-05, 8.345022797584534e-05, 9.823497384786606e-05, 0.00011301971971988678, 0.0001278044655919075, 0.00014258921146392822, 0.00015737395733594894, 0.00017215870320796967, 0.0001869434490799904, 0.0002017281949520111, 0.00021651294082403183, 0.00023129768669605255, 0.00024608243256807327, 0.000260867178440094, 0.0002756519243121147, 0.00029043667018413544, 0.00030522141605615616, 0.0003200061619281769, 0.0003347909078001976, 0.0003495756536722183, 0.00036436039954423904, 0.00037914514541625977]}, "gradients/decoder.transformer.h.17.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 2.0, 2.0, 0.0, 0.0, 3.0, 4.0, 4.0, 4.0, 4.0, 3.0, 5.0, 8.0, 8.0, 20.0, 11.0, 14.0, 15.0, 18.0, 28.0, 25.0, 33.0, 33.0, 38.0, 34.0, 50.0, 34.0, 37.0, 44.0, 47.0, 47.0, 52.0, 39.0, 53.0, 42.0, 28.0, 31.0, 30.0, 25.0, 25.0, 12.0, 21.0, 8.0, 12.0, 16.0, 14.0, 11.0, 6.0, 5.0, 5.0, 2.0, 3.0, 1.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-6.16796875, -5.971435546875, -5.77490234375, -5.578369140625, -5.3818359375, -5.185302734375, -4.98876953125, -4.792236328125, -4.595703125, -4.399169921875, -4.20263671875, -4.006103515625, -3.8095703125, -3.613037109375, -3.41650390625, -3.219970703125, -3.0234375, -2.826904296875, -2.63037109375, -2.433837890625, -2.2373046875, -2.040771484375, -1.84423828125, -1.647705078125, -1.451171875, -1.254638671875, -1.05810546875, -0.861572265625, -0.6650390625, -0.468505859375, -0.27197265625, -0.075439453125, 0.12109375, 0.317626953125, 0.51416015625, 0.710693359375, 0.9072265625, 1.103759765625, 1.30029296875, 1.496826171875, 1.693359375, 1.889892578125, 2.08642578125, 2.282958984375, 2.4794921875, 2.676025390625, 2.87255859375, 3.069091796875, 3.265625, 3.462158203125, 3.65869140625, 3.855224609375, 4.0517578125, 4.248291015625, 4.44482421875, 4.641357421875, 4.837890625, 5.034423828125, 5.23095703125, 5.427490234375, 5.6240234375, 5.820556640625, 6.01708984375, 6.213623046875, 6.41015625]}, "gradients/decoder.transformer.h.17.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 6.0, 3.0, 5.0, 3.0, 15.0, 12.0, 14.0, 27.0, 33.0, 57.0, 62.0, 117.0, 129.0, 296.0, 498.0, 987.0, 1962.0, 4580.0, 10094.0, 24673.0, 65858.0, 216465.0, 456626.0, 173252.0, 55471.0, 20821.0, 8849.0, 3880.0, 1738.0, 890.0, 456.0, 240.0, 134.0, 107.0, 63.0, 34.0, 29.0, 20.0, 18.0, 13.0, 6.0, 8.0, 1.0, 3.0, 3.0, 4.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-4.59375, -4.4520263671875, -4.310302734375, -4.1685791015625, -4.02685546875, -3.8851318359375, -3.743408203125, -3.6016845703125, -3.4599609375, -3.3182373046875, -3.176513671875, -3.0347900390625, -2.89306640625, -2.7513427734375, -2.609619140625, -2.4678955078125, -2.326171875, -2.1844482421875, -2.042724609375, -1.9010009765625, -1.75927734375, -1.6175537109375, -1.475830078125, -1.3341064453125, -1.1923828125, -1.0506591796875, -0.908935546875, -0.7672119140625, -0.62548828125, -0.4837646484375, -0.342041015625, -0.2003173828125, -0.05859375, 0.0831298828125, 0.224853515625, 0.3665771484375, 0.50830078125, 0.6500244140625, 0.791748046875, 0.9334716796875, 1.0751953125, 1.2169189453125, 1.358642578125, 1.5003662109375, 1.64208984375, 1.7838134765625, 1.925537109375, 2.0672607421875, 2.208984375, 2.3507080078125, 2.492431640625, 2.6341552734375, 2.77587890625, 2.9176025390625, 3.059326171875, 3.2010498046875, 3.3427734375, 3.4844970703125, 3.626220703125, 3.7679443359375, 3.90966796875, 4.0513916015625, 4.193115234375, 4.3348388671875, 4.4765625]}, "gradients/decoder.transformer.h.17.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 3.0, 3.0, 4.0, 0.0, 9.0, 5.0, 5.0, 9.0, 11.0, 12.0, 16.0, 30.0, 28.0, 25.0, 30.0, 30.0, 40.0, 37.0, 43.0, 46.0, 77.0, 133.0, 1783.0, 211.0, 72.0, 37.0, 59.0, 46.0, 46.0, 35.0, 18.0, 30.0, 27.0, 27.0, 13.0, 6.0, 12.0, 8.0, 11.0, 4.0, 8.0, 2.0, 1.0, 5.0, 4.0, 3.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-20.65625, -19.981201171875, -19.30615234375, -18.631103515625, -17.9560546875, -17.281005859375, -16.60595703125, -15.930908203125, -15.255859375, -14.580810546875, -13.90576171875, -13.230712890625, -12.5556640625, -11.880615234375, -11.20556640625, -10.530517578125, -9.85546875, -9.180419921875, -8.50537109375, -7.830322265625, -7.1552734375, -6.480224609375, -5.80517578125, -5.130126953125, -4.455078125, -3.780029296875, -3.10498046875, -2.429931640625, -1.7548828125, -1.079833984375, -0.40478515625, 0.270263671875, 0.9453125, 1.620361328125, 2.29541015625, 2.970458984375, 3.6455078125, 4.320556640625, 4.99560546875, 5.670654296875, 6.345703125, 7.020751953125, 7.69580078125, 8.370849609375, 9.0458984375, 9.720947265625, 10.39599609375, 11.071044921875, 11.74609375, 12.421142578125, 13.09619140625, 13.771240234375, 14.4462890625, 15.121337890625, 15.79638671875, 16.471435546875, 17.146484375, 17.821533203125, 18.49658203125, 19.171630859375, 19.8466796875, 20.521728515625, 21.19677734375, 21.871826171875, 22.546875]}, "gradients/decoder.transformer.h.17.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 5.0, 5.0, 2.0, 4.0, 6.0, 6.0, 14.0, 10.0, 13.0, 15.0, 19.0, 34.0, 43.0, 47.0, 57.0, 94.0, 118.0, 186.0, 242.0, 424.0, 1593.0, 71597.0, 3061876.0, 7414.0, 766.0, 350.0, 196.0, 144.0, 109.0, 79.0, 47.0, 43.0, 39.0, 29.0, 26.0, 13.0, 14.0, 9.0, 5.0, 6.0, 7.0, 4.0, 3.0, 2.0, 3.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-45.40625, -44.0419921875, -42.677734375, -41.3134765625, -39.94921875, -38.5849609375, -37.220703125, -35.8564453125, -34.4921875, -33.1279296875, -31.763671875, -30.3994140625, -29.03515625, -27.6708984375, -26.306640625, -24.9423828125, -23.578125, -22.2138671875, -20.849609375, -19.4853515625, -18.12109375, -16.7568359375, -15.392578125, -14.0283203125, -12.6640625, -11.2998046875, -9.935546875, -8.5712890625, -7.20703125, -5.8427734375, -4.478515625, -3.1142578125, -1.75, -0.3857421875, 0.978515625, 2.3427734375, 3.70703125, 5.0712890625, 6.435546875, 7.7998046875, 9.1640625, 10.5283203125, 11.892578125, 13.2568359375, 14.62109375, 15.9853515625, 17.349609375, 18.7138671875, 20.078125, 21.4423828125, 22.806640625, 24.1708984375, 25.53515625, 26.8994140625, 28.263671875, 29.6279296875, 30.9921875, 32.3564453125, 33.720703125, 35.0849609375, 36.44921875, 37.8134765625, 39.177734375, 40.5419921875, 41.90625]}, "gradients/decoder.transformer.h.17.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 13.0, 82.0, 339.0, 450.0, 115.0, 17.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-96.70613861083984, -94.51325988769531, -92.32038116455078, -90.12750244140625, -87.93463134765625, -85.74175262451172, -83.54887390136719, -81.35599517822266, -79.16311645507812, -76.9702377319336, -74.77735900878906, -72.58448791503906, -70.39160919189453, -68.19873046875, -66.00585174560547, -63.81297302246094, -61.62009811401367, -59.42721939086914, -57.234344482421875, -55.041465759277344, -52.84858703613281, -50.65570831298828, -48.462833404541016, -46.269954681396484, -44.07707977294922, -41.88420104980469, -39.69132614135742, -37.49844741821289, -35.30556869506836, -33.112693786621094, -30.919815063476562, -28.72693634033203, -26.534053802490234, -24.341176986694336, -22.148298263549805, -19.955421447753906, -17.762542724609375, -15.569665908813477, -13.376789093017578, -11.183911323547363, -8.991033554077148, -6.798155784606934, -4.605278491973877, -2.4124011993408203, -0.21952342987060547, 1.9733543395996094, 4.166231155395508, 6.359108924865723, 8.551986694335938, 10.744864463806152, 12.937742233276367, 15.130619049072266, 17.323497772216797, 19.516374588012695, 21.709251403808594, 23.902130126953125, 26.095006942749023, 28.287883758544922, 30.480762481689453, 32.67363739013672, 34.86651611328125, 37.05939483642578, 39.25227355957031, 41.44514846801758, 43.63802719116211]}, "gradients/decoder.transformer.h.17.ln_1.bias": {"_type": "histogram", "values": [2.0, 0.0, 3.0, 5.0, 4.0, 2.0, 2.0, 7.0, 9.0, 2.0, 6.0, 11.0, 7.0, 22.0, 6.0, 17.0, 19.0, 17.0, 20.0, 17.0, 22.0, 29.0, 28.0, 22.0, 32.0, 31.0, 29.0, 31.0, 37.0, 29.0, 50.0, 38.0, 29.0, 28.0, 40.0, 33.0, 45.0, 31.0, 33.0, 23.0, 25.0, 28.0, 19.0, 17.0, 18.0, 12.0, 11.0, 13.0, 11.0, 11.0, 6.0, 6.0, 4.0, 5.0, 9.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 2.0], "bins": [-40.42657470703125, -39.086795806884766, -37.74701690673828, -36.40723419189453, -35.06745529174805, -33.72767639160156, -32.38789367675781, -31.048114776611328, -29.708335876464844, -28.36855697631836, -27.028776168823242, -25.688995361328125, -24.34921646118164, -23.009437561035156, -21.66965675354004, -20.329875946044922, -18.990097045898438, -17.650318145751953, -16.310537338256836, -14.970757484436035, -13.630977630615234, -12.291197776794434, -10.951417922973633, -9.611638069152832, -8.271858215332031, -6.9320783615112305, -5.59229850769043, -4.252518653869629, -2.912738800048828, -1.5729589462280273, -0.23317909240722656, 1.1066007614135742, 2.446380615234375, 3.786160469055176, 5.125940322875977, 6.465720176696777, 7.805500030517578, 9.145279884338379, 10.48505973815918, 11.82483959197998, 13.164619445800781, 14.504399299621582, 15.844179153442383, 17.1839599609375, 18.523738861083984, 19.86351776123047, 21.203298568725586, 22.543079376220703, 23.882858276367188, 25.222637176513672, 26.56241798400879, 27.902198791503906, 29.24197769165039, 30.581756591796875, 31.921537399291992, 33.26131820678711, 34.601097106933594, 35.94087600708008, 37.28065490722656, 38.62043762207031, 39.9602165222168, 41.29999542236328, 42.63977813720703, 43.979557037353516, 45.3193359375]}, "gradients/decoder.transformer.h.16.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 5.0, 3.0, 3.0, 2.0, 7.0, 7.0, 10.0, 15.0, 15.0, 13.0, 17.0, 22.0, 25.0, 25.0, 29.0, 34.0, 30.0, 39.0, 32.0, 46.0, 45.0, 38.0, 46.0, 53.0, 47.0, 48.0, 49.0, 35.0, 35.0, 31.0, 30.0, 24.0, 31.0, 20.0, 14.0, 12.0, 10.0, 10.0, 21.0, 9.0, 4.0, 5.0, 5.0, 5.0, 1.0, 0.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-6.15625, -5.955078125, -5.75390625, -5.552734375, -5.3515625, -5.150390625, -4.94921875, -4.748046875, -4.546875, -4.345703125, -4.14453125, -3.943359375, -3.7421875, -3.541015625, -3.33984375, -3.138671875, -2.9375, -2.736328125, -2.53515625, -2.333984375, -2.1328125, -1.931640625, -1.73046875, -1.529296875, -1.328125, -1.126953125, -0.92578125, -0.724609375, -0.5234375, -0.322265625, -0.12109375, 0.080078125, 0.28125, 0.482421875, 0.68359375, 0.884765625, 1.0859375, 1.287109375, 1.48828125, 1.689453125, 1.890625, 2.091796875, 2.29296875, 2.494140625, 2.6953125, 2.896484375, 3.09765625, 3.298828125, 3.5, 3.701171875, 3.90234375, 4.103515625, 4.3046875, 4.505859375, 4.70703125, 4.908203125, 5.109375, 5.310546875, 5.51171875, 5.712890625, 5.9140625, 6.115234375, 6.31640625, 6.517578125, 6.71875]}, "gradients/decoder.transformer.h.16.mlp.c_proj.weight": {"_type": "histogram", "values": [2.0, 4.0, 1.0, 0.0, 2.0, 2.0, 0.0, 6.0, 3.0, 4.0, 5.0, 10.0, 8.0, 9.0, 11.0, 13.0, 19.0, 23.0, 14.0, 27.0, 36.0, 28.0, 49.0, 53.0, 86.0, 159.0, 314.0, 2603.0, 447856.0, 3716851.0, 24789.0, 610.0, 210.0, 125.0, 78.0, 50.0, 32.0, 34.0, 30.0, 20.0, 20.0, 11.0, 11.0, 18.0, 10.0, 16.0, 10.0, 5.0, 8.0, 6.0, 3.0, 2.0, 2.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-33.0, -31.87255859375, -30.7451171875, -29.61767578125, -28.490234375, -27.36279296875, -26.2353515625, -25.10791015625, -23.98046875, -22.85302734375, -21.7255859375, -20.59814453125, -19.470703125, -18.34326171875, -17.2158203125, -16.08837890625, -14.9609375, -13.83349609375, -12.7060546875, -11.57861328125, -10.451171875, -9.32373046875, -8.1962890625, -7.06884765625, -5.94140625, -4.81396484375, -3.6865234375, -2.55908203125, -1.431640625, -0.30419921875, 0.8232421875, 1.95068359375, 3.078125, 4.20556640625, 5.3330078125, 6.46044921875, 7.587890625, 8.71533203125, 9.8427734375, 10.97021484375, 12.09765625, 13.22509765625, 14.3525390625, 15.47998046875, 16.607421875, 17.73486328125, 18.8623046875, 19.98974609375, 21.1171875, 22.24462890625, 23.3720703125, 24.49951171875, 25.626953125, 26.75439453125, 27.8818359375, 29.00927734375, 30.13671875, 31.26416015625, 32.3916015625, 33.51904296875, 34.646484375, 35.77392578125, 36.9013671875, 38.02880859375, 39.15625]}, "gradients/decoder.transformer.h.16.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 3.0, 3.0, 4.0, 5.0, 14.0, 14.0, 23.0, 39.0, 55.0, 95.0, 144.0, 207.0, 292.0, 469.0, 595.0, 589.0, 499.0, 350.0, 234.0, 145.0, 103.0, 74.0, 49.0, 23.0, 24.0, 12.0, 9.0, 5.0, 3.0, 2.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-16.5625, -15.951904296875, -15.34130859375, -14.730712890625, -14.1201171875, -13.509521484375, -12.89892578125, -12.288330078125, -11.677734375, -11.067138671875, -10.45654296875, -9.845947265625, -9.2353515625, -8.624755859375, -8.01416015625, -7.403564453125, -6.79296875, -6.182373046875, -5.57177734375, -4.961181640625, -4.3505859375, -3.739990234375, -3.12939453125, -2.518798828125, -1.908203125, -1.297607421875, -0.68701171875, -0.076416015625, 0.5341796875, 1.144775390625, 1.75537109375, 2.365966796875, 2.9765625, 3.587158203125, 4.19775390625, 4.808349609375, 5.4189453125, 6.029541015625, 6.64013671875, 7.250732421875, 7.861328125, 8.471923828125, 9.08251953125, 9.693115234375, 10.3037109375, 10.914306640625, 11.52490234375, 12.135498046875, 12.74609375, 13.356689453125, 13.96728515625, 14.577880859375, 15.1884765625, 15.799072265625, 16.40966796875, 17.020263671875, 17.630859375, 18.241455078125, 18.85205078125, 19.462646484375, 20.0732421875, 20.683837890625, 21.29443359375, 21.905029296875, 22.515625]}, "gradients/decoder.transformer.h.16.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 4.0, 2.0, 2.0, 3.0, 3.0, 11.0, 13.0, 15.0, 14.0, 26.0, 28.0, 45.0, 50.0, 58.0, 91.0, 91.0, 145.0, 212.0, 352.0, 747.0, 2256.0, 253842.0, 3925162.0, 8556.0, 1070.0, 473.0, 295.0, 204.0, 113.0, 88.0, 69.0, 60.0, 47.0, 34.0, 25.0, 18.0, 11.0, 10.0, 13.0, 7.0, 8.0, 4.0, 5.0, 0.0, 3.0, 4.0, 3.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0], "bins": [-68.6875, -66.556640625, -64.42578125, -62.294921875, -60.1640625, -58.033203125, -55.90234375, -53.771484375, -51.640625, -49.509765625, -47.37890625, -45.248046875, -43.1171875, -40.986328125, -38.85546875, -36.724609375, -34.59375, -32.462890625, -30.33203125, -28.201171875, -26.0703125, -23.939453125, -21.80859375, -19.677734375, -17.546875, -15.416015625, -13.28515625, -11.154296875, -9.0234375, -6.892578125, -4.76171875, -2.630859375, -0.5, 1.630859375, 3.76171875, 5.892578125, 8.0234375, 10.154296875, 12.28515625, 14.416015625, 16.546875, 18.677734375, 20.80859375, 22.939453125, 25.0703125, 27.201171875, 29.33203125, 31.462890625, 33.59375, 35.724609375, 37.85546875, 39.986328125, 42.1171875, 44.248046875, 46.37890625, 48.509765625, 50.640625, 52.771484375, 54.90234375, 57.033203125, 59.1640625, 61.294921875, 63.42578125, 65.556640625, 67.6875]}, "gradients/decoder.transformer.h.16.ln_2.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 5.0, 8.0, 75.0, 388.0, 407.0, 122.0, 14.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-64.16033172607422, -56.15629196166992, -48.152252197265625, -40.14820861816406, -32.144168853759766, -24.14012908935547, -16.136085510253906, -8.13204574584961, -0.1280059814453125, 7.876034736633301, 15.880075454711914, 23.884117126464844, 31.88815689086914, 39.89219665527344, 47.896240234375, 55.9002799987793, 63.904319763183594, 71.90836334228516, 79.91239929199219, 87.91644287109375, 95.92048645019531, 103.92452239990234, 111.9285659790039, 119.93260192871094, 127.9366455078125, 135.94068908691406, 143.94473266601562, 151.94876098632812, 159.9528045654297, 167.95684814453125, 175.9608917236328, 183.96493530273438, 191.96896362304688, 199.97300720214844, 207.97705078125, 215.9810791015625, 223.98512268066406, 231.98916625976562, 239.9932098388672, 247.99725341796875, 256.00128173828125, 264.00531005859375, 272.0093688964844, 280.0133972167969, 288.0174560546875, 296.021484375, 304.0255126953125, 312.0295715332031, 320.03363037109375, 328.03765869140625, 336.0417175292969, 344.0457458496094, 352.0498046875, 360.0538330078125, 368.057861328125, 376.0619201660156, 384.0659484863281, 392.0699768066406, 400.07403564453125, 408.07806396484375, 416.0821228027344, 424.0861511230469, 432.0902099609375, 440.09423828125, 448.0982666015625]}, "gradients/decoder.transformer.h.16.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 3.0, 2.0, 4.0, 2.0, 9.0, 8.0, 1.0, 5.0, 11.0, 9.0, 10.0, 14.0, 27.0, 21.0, 23.0, 14.0, 29.0, 36.0, 32.0, 51.0, 40.0, 57.0, 38.0, 38.0, 34.0, 41.0, 34.0, 40.0, 47.0, 36.0, 51.0, 37.0, 29.0, 22.0, 29.0, 19.0, 18.0, 12.0, 20.0, 13.0, 10.0, 10.0, 8.0, 2.0, 3.0, 8.0, 2.0, 2.0, 1.0, 4.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0], "bins": [-50.43025207519531, -48.888065338134766, -47.34587860107422, -45.80369186401367, -44.261505126953125, -42.719322204589844, -41.1771354675293, -39.63494873046875, -38.0927619934082, -36.550575256347656, -35.00838851928711, -33.46620178222656, -31.92401695251465, -30.3818302154541, -28.839645385742188, -27.29745864868164, -25.755271911621094, -24.213085174560547, -22.6708984375, -21.128713607788086, -19.58652687072754, -18.044340133666992, -16.502155303955078, -14.959968566894531, -13.417781829833984, -11.875595092773438, -10.333409309387207, -8.791223526000977, -7.24903678894043, -5.706850528717041, -4.164664268493652, -2.622478485107422, -1.080291748046875, 0.46189451217651367, 2.0040807723999023, 3.546267032623291, 5.08845329284668, 6.630639553070068, 8.172825813293457, 9.715011596679688, 11.257198333740234, 12.799385070800781, 14.341570854187012, 15.883756637573242, 17.42594337463379, 18.968130111694336, 20.51031494140625, 22.052501678466797, 23.594688415527344, 25.13687515258789, 26.679061889648438, 28.22124671936035, 29.7634334564209, 31.305620193481445, 32.84780502319336, 34.389991760253906, 35.93217849731445, 37.474365234375, 39.01655197143555, 40.558738708496094, 42.100921630859375, 43.64310836791992, 45.18529510498047, 46.727481842041016, 48.26966857910156]}, "gradients/decoder.transformer.h.16.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 3.0, 2.0, 2.0, 4.0, 3.0, 3.0, 5.0, 8.0, 8.0, 10.0, 9.0, 16.0, 11.0, 14.0, 25.0, 27.0, 31.0, 32.0, 38.0, 29.0, 35.0, 33.0, 37.0, 39.0, 43.0, 46.0, 38.0, 52.0, 37.0, 49.0, 47.0, 29.0, 34.0, 33.0, 28.0, 20.0, 28.0, 19.0, 12.0, 10.0, 13.0, 14.0, 12.0, 4.0, 6.0, 3.0, 3.0, 4.0, 3.0, 2.0, 3.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 2.0], "bins": [-6.28125, -6.08349609375, -5.8857421875, -5.68798828125, -5.490234375, -5.29248046875, -5.0947265625, -4.89697265625, -4.69921875, -4.50146484375, -4.3037109375, -4.10595703125, -3.908203125, -3.71044921875, -3.5126953125, -3.31494140625, -3.1171875, -2.91943359375, -2.7216796875, -2.52392578125, -2.326171875, -2.12841796875, -1.9306640625, -1.73291015625, -1.53515625, -1.33740234375, -1.1396484375, -0.94189453125, -0.744140625, -0.54638671875, -0.3486328125, -0.15087890625, 0.046875, 0.24462890625, 0.4423828125, 0.64013671875, 0.837890625, 1.03564453125, 1.2333984375, 1.43115234375, 1.62890625, 1.82666015625, 2.0244140625, 2.22216796875, 2.419921875, 2.61767578125, 2.8154296875, 3.01318359375, 3.2109375, 3.40869140625, 3.6064453125, 3.80419921875, 4.001953125, 4.19970703125, 4.3974609375, 4.59521484375, 4.79296875, 4.99072265625, 5.1884765625, 5.38623046875, 5.583984375, 5.78173828125, 5.9794921875, 6.17724609375, 6.375]}, "gradients/decoder.transformer.h.16.crossattention.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 5.0, 6.0, 5.0, 7.0, 2.0, 19.0, 23.0, 27.0, 42.0, 70.0, 98.0, 158.0, 230.0, 384.0, 618.0, 1011.0, 1665.0, 2776.0, 4754.0, 8055.0, 14368.0, 26138.0, 48659.0, 96855.0, 245774.0, 353390.0, 116390.0, 57344.0, 30414.0, 16524.0, 9358.0, 5346.0, 3111.0, 1909.0, 1140.0, 669.0, 439.0, 280.0, 155.0, 125.0, 71.0, 43.0, 34.0, 25.0, 18.0, 9.0, 10.0, 5.0, 2.0, 4.0, 4.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-2.064453125, -2.00018310546875, -1.9359130859375, -1.87164306640625, -1.807373046875, -1.74310302734375, -1.6788330078125, -1.61456298828125, -1.55029296875, -1.48602294921875, -1.4217529296875, -1.35748291015625, -1.293212890625, -1.22894287109375, -1.1646728515625, -1.10040283203125, -1.0361328125, -0.97186279296875, -0.9075927734375, -0.84332275390625, -0.779052734375, -0.71478271484375, -0.6505126953125, -0.58624267578125, -0.52197265625, -0.45770263671875, -0.3934326171875, -0.32916259765625, -0.264892578125, -0.20062255859375, -0.1363525390625, -0.07208251953125, -0.0078125, 0.05645751953125, 0.1207275390625, 0.18499755859375, 0.249267578125, 0.31353759765625, 0.3778076171875, 0.44207763671875, 0.50634765625, 0.57061767578125, 0.6348876953125, 0.69915771484375, 0.763427734375, 0.82769775390625, 0.8919677734375, 0.95623779296875, 1.0205078125, 1.08477783203125, 1.1490478515625, 1.21331787109375, 1.277587890625, 1.34185791015625, 1.4061279296875, 1.47039794921875, 1.53466796875, 1.59893798828125, 1.6632080078125, 1.72747802734375, 1.791748046875, 1.85601806640625, 1.9202880859375, 1.98455810546875, 2.048828125]}, "gradients/decoder.transformer.h.16.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 2.0, 5.0, 6.0, 3.0, 3.0, 5.0, 10.0, 7.0, 7.0, 13.0, 13.0, 21.0, 27.0, 22.0, 31.0, 27.0, 39.0, 50.0, 42.0, 33.0, 52.0, 42.0, 1078.0, 49.0, 60.0, 44.0, 48.0, 40.0, 28.0, 41.0, 36.0, 16.0, 20.0, 21.0, 16.0, 18.0, 14.0, 8.0, 5.0, 8.0, 7.0, 4.0, 7.0, 6.0, 3.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.04296875, -3.90228271484375, -3.7615966796875, -3.62091064453125, -3.480224609375, -3.33953857421875, -3.1988525390625, -3.05816650390625, -2.91748046875, -2.77679443359375, -2.6361083984375, -2.49542236328125, -2.354736328125, -2.21405029296875, -2.0733642578125, -1.93267822265625, -1.7919921875, -1.65130615234375, -1.5106201171875, -1.36993408203125, -1.229248046875, -1.08856201171875, -0.9478759765625, -0.80718994140625, -0.66650390625, -0.52581787109375, -0.3851318359375, -0.24444580078125, -0.103759765625, 0.03692626953125, 0.1776123046875, 0.31829833984375, 0.458984375, 0.59967041015625, 0.7403564453125, 0.88104248046875, 1.021728515625, 1.16241455078125, 1.3031005859375, 1.44378662109375, 1.58447265625, 1.72515869140625, 1.8658447265625, 2.00653076171875, 2.147216796875, 2.28790283203125, 2.4285888671875, 2.56927490234375, 2.7099609375, 2.85064697265625, 2.9913330078125, 3.13201904296875, 3.272705078125, 3.41339111328125, 3.5540771484375, 3.69476318359375, 3.83544921875, 3.97613525390625, 4.1168212890625, 4.25750732421875, 4.398193359375, 4.53887939453125, 4.6795654296875, 4.82025146484375, 4.9609375]}, "gradients/decoder.transformer.h.16.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 0.0, 2.0, 2.0, 3.0, 6.0, 7.0, 14.0, 10.0, 17.0, 21.0, 33.0, 94.0, 111.0, 159.0, 313.0, 537.0, 867.0, 1687.0, 2885.0, 5450.0, 10330.0, 20879.0, 44616.0, 104598.0, 1420678.0, 305494.0, 95880.0, 41387.0, 19595.0, 9819.0, 5089.0, 2829.0, 1575.0, 852.0, 547.0, 297.0, 153.0, 103.0, 91.0, 29.0, 25.0, 17.0, 12.0, 7.0, 8.0, 5.0, 7.0, 1.0, 0.0, 1.0, 2.0, 1.0, 0.0, 1.0], "bins": [-2.705078125, -2.627685546875, -2.55029296875, -2.472900390625, -2.3955078125, -2.318115234375, -2.24072265625, -2.163330078125, -2.0859375, -2.008544921875, -1.93115234375, -1.853759765625, -1.7763671875, -1.698974609375, -1.62158203125, -1.544189453125, -1.466796875, -1.389404296875, -1.31201171875, -1.234619140625, -1.1572265625, -1.079833984375, -1.00244140625, -0.925048828125, -0.84765625, -0.770263671875, -0.69287109375, -0.615478515625, -0.5380859375, -0.460693359375, -0.38330078125, -0.305908203125, -0.228515625, -0.151123046875, -0.07373046875, 0.003662109375, 0.0810546875, 0.158447265625, 0.23583984375, 0.313232421875, 0.390625, 0.468017578125, 0.54541015625, 0.622802734375, 0.7001953125, 0.777587890625, 0.85498046875, 0.932373046875, 1.009765625, 1.087158203125, 1.16455078125, 1.241943359375, 1.3193359375, 1.396728515625, 1.47412109375, 1.551513671875, 1.62890625, 1.706298828125, 1.78369140625, 1.861083984375, 1.9384765625, 2.015869140625, 2.09326171875, 2.170654296875, 2.248046875]}, "gradients/decoder.transformer.h.16.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 4.0, 0.0, 3.0, 3.0, 3.0, 0.0, 6.0, 3.0, 2.0, 8.0, 6.0, 8.0, 8.0, 10.0, 17.0, 11.0, 25.0, 45.0, 48.0, 86.0, 90.0, 118.0, 121.0, 104.0, 67.0, 45.0, 35.0, 33.0, 14.0, 22.0, 17.0, 15.0, 6.0, 8.0, 7.0, 5.0, 4.0, 5.0, 0.0, 2.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.0016431808471679688, -0.0015916526317596436, -0.0015401244163513184, -0.0014885962009429932, -0.001437067985534668, -0.0013855397701263428, -0.0013340115547180176, -0.0012824833393096924, -0.0012309551239013672, -0.001179426908493042, -0.0011278986930847168, -0.0010763704776763916, -0.0010248422622680664, -0.0009733140468597412, -0.000921785831451416, -0.0008702576160430908, -0.0008187294006347656, -0.0007672011852264404, -0.0007156729698181152, -0.00066414475440979, -0.0006126165390014648, -0.0005610883235931396, -0.0005095601081848145, -0.00045803189277648926, -0.00040650367736816406, -0.00035497546195983887, -0.00030344724655151367, -0.0002519190311431885, -0.00020039081573486328, -0.00014886260032653809, -9.733438491821289e-05, -4.5806169509887695e-05, 5.7220458984375e-06, 5.7250261306762695e-05, 0.00010877847671508789, 0.00016030669212341309, 0.00021183490753173828, 0.0002633631229400635, 0.00031489133834838867, 0.00036641955375671387, 0.00041794776916503906, 0.00046947598457336426, 0.0005210041999816895, 0.0005725324153900146, 0.0006240606307983398, 0.000675588846206665, 0.0007271170616149902, 0.0007786452770233154, 0.0008301734924316406, 0.0008817017078399658, 0.000933229923248291, 0.0009847581386566162, 0.0010362863540649414, 0.0010878145694732666, 0.0011393427848815918, 0.001190871000289917, 0.0012423992156982422, 0.0012939274311065674, 0.0013454556465148926, 0.0013969838619232178, 0.001448512077331543, 0.0015000402927398682, 0.0015515685081481934, 0.0016030967235565186, 0.0016546249389648438]}, "gradients/decoder.transformer.h.16.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 2.0, 2.0, 0.0, 5.0, 2.0, 4.0, 10.0, 6.0, 9.0, 6.0, 22.0, 16.0, 25.0, 42.0, 65.0, 90.0, 140.0, 281.0, 776.0, 165961.0, 879346.0, 949.0, 347.0, 163.0, 88.0, 56.0, 43.0, 29.0, 18.0, 13.0, 11.0, 6.0, 8.0, 2.0, 2.0, 4.0, 2.0, 1.0, 1.0, 4.0, 2.0, 3.0, 3.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.035308837890625, -0.034207820892333984, -0.03310680389404297, -0.03200578689575195, -0.030904769897460938, -0.029803752899169922, -0.028702735900878906, -0.02760171890258789, -0.026500701904296875, -0.02539968490600586, -0.024298667907714844, -0.023197650909423828, -0.022096633911132812, -0.020995616912841797, -0.01989459991455078, -0.018793582916259766, -0.01769256591796875, -0.016591548919677734, -0.015490531921386719, -0.014389514923095703, -0.013288497924804688, -0.012187480926513672, -0.011086463928222656, -0.00998544692993164, -0.008884429931640625, -0.007783412933349609, -0.006682395935058594, -0.005581378936767578, -0.0044803619384765625, -0.003379344940185547, -0.0022783279418945312, -0.0011773109436035156, -7.62939453125e-05, 0.0010247230529785156, 0.0021257400512695312, 0.003226757049560547, 0.0043277740478515625, 0.005428791046142578, 0.006529808044433594, 0.007630825042724609, 0.008731842041015625, 0.00983285903930664, 0.010933876037597656, 0.012034893035888672, 0.013135910034179688, 0.014236927032470703, 0.015337944030761719, 0.016438961029052734, 0.01753997802734375, 0.018640995025634766, 0.01974201202392578, 0.020843029022216797, 0.021944046020507812, 0.023045063018798828, 0.024146080017089844, 0.02524709701538086, 0.026348114013671875, 0.02744913101196289, 0.028550148010253906, 0.029651165008544922, 0.030752182006835938, 0.03185319900512695, 0.03295421600341797, 0.034055233001708984, 0.03515625]}, "gradients/decoder.transformer.h.16.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 4.0, 12.0, 46.0, 150.0, 284.0, 293.0, 147.0, 53.0, 19.0, 5.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0005440074019134045, -0.0004947661655023694, -0.0004455248999875039, -0.0003962836635764688, -0.0003470423980616033, -0.00029780116165056825, -0.0002485599252395332, -0.00019931865972466767, -0.0001500774233136326, -0.00010083617235068232, -5.159492866368964e-05, -2.353684976696968e-06, 4.688756598625332e-05, 9.612881694920361e-05, 0.00014537005336023867, 0.0001946113188751042, 0.00024385255528613925, 0.0002930937916971743, 0.00034233505721203983, 0.0003915762936230749, 0.0004408175591379404, 0.0004900587955489755, 0.0005393000319600105, 0.0005885412683710456, 0.0006377825047820807, 0.0006870237411931157, 0.0007362649776041508, 0.0007855062140151858, 0.0008347475086338818, 0.0008839887450449169, 0.0009332299814559519, 0.000982471276074648, 0.001031712512485683, 0.001080953748896718, 0.001130194985307753, 0.0011794362217187881, 0.0012286774581298232, 0.0012779186945408583, 0.0013271600473672152, 0.0013764012837782502, 0.0014256425201892853, 0.0014748837566003203, 0.0015241249930113554, 0.0015733662294223905, 0.0016226074658334255, 0.0016718488186597824, 0.0017210899386554956, 0.0017703312914818525, 0.0018195724114775658, 0.0018688136478886008, 0.0019180548842996359, 0.0019672962371259928, 0.002016537357121706, 0.002065778709948063, 0.002115019829943776, 0.002164261182770133, 0.00221350253559649, 0.002262743888422847, 0.00231198500841856, 0.002361226361244917, 0.00241046748124063, 0.002459708834066987, 0.0025089499540627003, 0.002558191306889057, 0.0026074324268847704]}, "gradients/decoder.transformer.h.16.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 2.0, 3.0, 9.0, 4.0, 8.0, 11.0, 7.0, 18.0, 13.0, 18.0, 27.0, 25.0, 28.0, 17.0, 24.0, 36.0, 31.0, 32.0, 38.0, 43.0, 36.0, 35.0, 44.0, 33.0, 42.0, 28.0, 36.0, 52.0, 36.0, 34.0, 26.0, 28.0, 28.0, 17.0, 24.0, 16.0, 32.0, 11.0, 14.0, 9.0, 5.0, 5.0, 3.0, 8.0, 5.0, 3.0, 3.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 0.0, 1.0], "bins": [-0.0006296038627624512, -0.0006101317703723907, -0.0005906596779823303, -0.0005711875855922699, -0.0005517154932022095, -0.000532243400812149, -0.0005127713084220886, -0.0004932992160320282, -0.0004738271236419678, -0.00045435503125190735, -0.0004348829388618469, -0.0004154108464717865, -0.0003959387540817261, -0.00037646666169166565, -0.0003569945693016052, -0.0003375224769115448, -0.0003180503845214844, -0.00029857829213142395, -0.0002791061997413635, -0.0002596341073513031, -0.00024016201496124268, -0.00022068992257118225, -0.00020121783018112183, -0.0001817457377910614, -0.00016227364540100098, -0.00014280155301094055, -0.00012332946062088013, -0.0001038573682308197, -8.438527584075928e-05, -6.491318345069885e-05, -4.544109106063843e-05, -2.5968998670578003e-05, -6.496906280517578e-06, 1.2975186109542847e-05, 3.244727849960327e-05, 5.1919370889663696e-05, 7.139146327972412e-05, 9.086355566978455e-05, 0.00011033564805984497, 0.0001298077404499054, 0.00014927983283996582, 0.00016875192523002625, 0.00018822401762008667, 0.0002076961100101471, 0.00022716820240020752, 0.00024664029479026794, 0.00026611238718032837, 0.0002855844795703888, 0.0003050565719604492, 0.00032452866435050964, 0.00034400075674057007, 0.0003634728491306305, 0.0003829449415206909, 0.00040241703391075134, 0.00042188912630081177, 0.0004413612186908722, 0.0004608333110809326, 0.00048030540347099304, 0.0004997774958610535, 0.0005192495882511139, 0.0005387216806411743, 0.0005581937730312347, 0.0005776658654212952, 0.0005971379578113556, 0.000616610050201416]}, "gradients/decoder.transformer.h.16.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 3.0, 2.0, 2.0, 4.0, 3.0, 3.0, 5.0, 8.0, 8.0, 10.0, 9.0, 16.0, 11.0, 14.0, 25.0, 27.0, 31.0, 32.0, 38.0, 29.0, 35.0, 33.0, 37.0, 39.0, 43.0, 46.0, 38.0, 52.0, 37.0, 49.0, 47.0, 29.0, 34.0, 33.0, 28.0, 20.0, 28.0, 19.0, 12.0, 10.0, 13.0, 14.0, 12.0, 4.0, 6.0, 3.0, 3.0, 4.0, 3.0, 2.0, 3.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 2.0], "bins": [-6.28125, -6.08349609375, -5.8857421875, -5.68798828125, -5.490234375, -5.29248046875, -5.0947265625, -4.89697265625, -4.69921875, -4.50146484375, -4.3037109375, -4.10595703125, -3.908203125, -3.71044921875, -3.5126953125, -3.31494140625, -3.1171875, -2.91943359375, -2.7216796875, -2.52392578125, -2.326171875, -2.12841796875, -1.9306640625, -1.73291015625, -1.53515625, -1.33740234375, -1.1396484375, -0.94189453125, -0.744140625, -0.54638671875, -0.3486328125, -0.15087890625, 0.046875, 0.24462890625, 0.4423828125, 0.64013671875, 0.837890625, 1.03564453125, 1.2333984375, 1.43115234375, 1.62890625, 1.82666015625, 2.0244140625, 2.22216796875, 2.419921875, 2.61767578125, 2.8154296875, 3.01318359375, 3.2109375, 3.40869140625, 3.6064453125, 3.80419921875, 4.001953125, 4.19970703125, 4.3974609375, 4.59521484375, 4.79296875, 4.99072265625, 5.1884765625, 5.38623046875, 5.583984375, 5.78173828125, 5.9794921875, 6.17724609375, 6.375]}, "gradients/decoder.transformer.h.16.attn.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 4.0, 2.0, 4.0, 0.0, 3.0, 4.0, 7.0, 9.0, 10.0, 18.0, 30.0, 38.0, 84.0, 115.0, 180.0, 323.0, 554.0, 912.0, 1641.0, 2835.0, 5306.0, 9713.0, 18316.0, 37322.0, 81869.0, 216900.0, 394368.0, 152754.0, 62762.0, 29760.0, 14856.0, 7938.0, 4298.0, 2366.0, 1351.0, 772.0, 464.0, 268.0, 160.0, 99.0, 47.0, 37.0, 23.0, 11.0, 9.0, 7.0, 3.0, 5.0, 1.0, 2.0, 2.0, 1.0, 3.0, 0.0, 2.0, 1.0, 0.0, 2.0], "bins": [-3.646484375, -3.533721923828125, -3.42095947265625, -3.308197021484375, -3.1954345703125, -3.082672119140625, -2.96990966796875, -2.857147216796875, -2.744384765625, -2.631622314453125, -2.51885986328125, -2.406097412109375, -2.2933349609375, -2.180572509765625, -2.06781005859375, -1.955047607421875, -1.84228515625, -1.729522705078125, -1.61676025390625, -1.503997802734375, -1.3912353515625, -1.278472900390625, -1.16571044921875, -1.052947998046875, -0.940185546875, -0.827423095703125, -0.71466064453125, -0.601898193359375, -0.4891357421875, -0.376373291015625, -0.26361083984375, -0.150848388671875, -0.0380859375, 0.074676513671875, 0.18743896484375, 0.300201416015625, 0.4129638671875, 0.525726318359375, 0.63848876953125, 0.751251220703125, 0.864013671875, 0.976776123046875, 1.08953857421875, 1.202301025390625, 1.3150634765625, 1.427825927734375, 1.54058837890625, 1.653350830078125, 1.76611328125, 1.878875732421875, 1.99163818359375, 2.104400634765625, 2.2171630859375, 2.329925537109375, 2.44268798828125, 2.555450439453125, 2.668212890625, 2.780975341796875, 2.89373779296875, 3.006500244140625, 3.1192626953125, 3.232025146484375, 3.34478759765625, 3.457550048828125, 3.5703125]}, "gradients/decoder.transformer.h.16.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 3.0, 2.0, 6.0, 4.0, 8.0, 6.0, 8.0, 13.0, 12.0, 16.0, 16.0, 21.0, 35.0, 34.0, 22.0, 34.0, 23.0, 45.0, 50.0, 60.0, 115.0, 366.0, 1602.0, 108.0, 68.0, 60.0, 43.0, 29.0, 34.0, 30.0, 36.0, 25.0, 32.0, 15.0, 14.0, 17.0, 14.0, 9.0, 5.0, 9.0, 2.0, 8.0, 2.0, 2.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-24.0625, -23.356689453125, -22.65087890625, -21.945068359375, -21.2392578125, -20.533447265625, -19.82763671875, -19.121826171875, -18.416015625, -17.710205078125, -17.00439453125, -16.298583984375, -15.5927734375, -14.886962890625, -14.18115234375, -13.475341796875, -12.76953125, -12.063720703125, -11.35791015625, -10.652099609375, -9.9462890625, -9.240478515625, -8.53466796875, -7.828857421875, -7.123046875, -6.417236328125, -5.71142578125, -5.005615234375, -4.2998046875, -3.593994140625, -2.88818359375, -2.182373046875, -1.4765625, -0.770751953125, -0.06494140625, 0.640869140625, 1.3466796875, 2.052490234375, 2.75830078125, 3.464111328125, 4.169921875, 4.875732421875, 5.58154296875, 6.287353515625, 6.9931640625, 7.698974609375, 8.40478515625, 9.110595703125, 9.81640625, 10.522216796875, 11.22802734375, 11.933837890625, 12.6396484375, 13.345458984375, 14.05126953125, 14.757080078125, 15.462890625, 16.168701171875, 16.87451171875, 17.580322265625, 18.2861328125, 18.991943359375, 19.69775390625, 20.403564453125, 21.109375]}, "gradients/decoder.transformer.h.16.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 4.0, 3.0, 5.0, 5.0, 10.0, 7.0, 12.0, 14.0, 25.0, 28.0, 35.0, 47.0, 47.0, 90.0, 101.0, 163.0, 259.0, 423.0, 1149.0, 15939.0, 3095994.0, 28726.0, 1288.0, 450.0, 262.0, 168.0, 119.0, 91.0, 72.0, 32.0, 34.0, 23.0, 25.0, 13.0, 16.0, 11.0, 7.0, 8.0, 3.0, 3.0, 3.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 2.0, 0.0, 1.0], "bins": [-53.8125, -52.25537109375, -50.6982421875, -49.14111328125, -47.583984375, -46.02685546875, -44.4697265625, -42.91259765625, -41.35546875, -39.79833984375, -38.2412109375, -36.68408203125, -35.126953125, -33.56982421875, -32.0126953125, -30.45556640625, -28.8984375, -27.34130859375, -25.7841796875, -24.22705078125, -22.669921875, -21.11279296875, -19.5556640625, -17.99853515625, -16.44140625, -14.88427734375, -13.3271484375, -11.77001953125, -10.212890625, -8.65576171875, -7.0986328125, -5.54150390625, -3.984375, -2.42724609375, -0.8701171875, 0.68701171875, 2.244140625, 3.80126953125, 5.3583984375, 6.91552734375, 8.47265625, 10.02978515625, 11.5869140625, 13.14404296875, 14.701171875, 16.25830078125, 17.8154296875, 19.37255859375, 20.9296875, 22.48681640625, 24.0439453125, 25.60107421875, 27.158203125, 28.71533203125, 30.2724609375, 31.82958984375, 33.38671875, 34.94384765625, 36.5009765625, 38.05810546875, 39.615234375, 41.17236328125, 42.7294921875, 44.28662109375, 45.84375]}, "gradients/decoder.transformer.h.16.ln_1.weight": {"_type": "histogram", "values": [2.0, 801.0, 217.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-23.72690200805664, -11.05643081665039, 1.6140403747558594, 14.28451156616211, 26.95498275756836, 39.62545394897461, 52.29592514038086, 64.96640014648438, 77.63687133789062, 90.30734252929688, 102.97781372070312, 115.64828491210938, 128.31875610351562, 140.98922729492188, 153.65969848632812, 166.33016967773438, 179.00064086914062, 191.67111206054688, 204.34158325195312, 217.01205444335938, 229.68252563476562, 242.35299682617188, 255.02346801757812, 267.6939392089844, 280.3644104003906, 293.0348815917969, 305.7053527832031, 318.3758239746094, 331.0462951660156, 343.7167663574219, 356.3872375488281, 369.0577087402344, 381.7281494140625, 394.39862060546875, 407.069091796875, 419.73956298828125, 432.4100341796875, 445.08050537109375, 457.7509765625, 470.42144775390625, 483.0919189453125, 495.76239013671875, 508.432861328125, 521.1033325195312, 533.7738037109375, 546.4442749023438, 559.11474609375, 571.7852172851562, 584.4556884765625, 597.1261596679688, 609.796630859375, 622.4671020507812, 635.1375732421875, 647.8080444335938, 660.478515625, 673.1489868164062, 685.8194580078125, 698.4899291992188, 711.160400390625, 723.8308715820312, 736.5013427734375, 749.1718139648438, 761.84228515625, 774.5127563476562, 787.1832275390625]}, "gradients/decoder.transformer.h.16.ln_1.bias": {"_type": "histogram", "values": [4.0, 2.0, 0.0, 1.0, 2.0, 3.0, 3.0, 5.0, 3.0, 4.0, 6.0, 8.0, 6.0, 7.0, 13.0, 12.0, 10.0, 12.0, 19.0, 19.0, 21.0, 28.0, 28.0, 28.0, 27.0, 27.0, 34.0, 37.0, 44.0, 53.0, 35.0, 46.0, 37.0, 41.0, 30.0, 48.0, 39.0, 29.0, 36.0, 22.0, 31.0, 27.0, 21.0, 15.0, 13.0, 13.0, 10.0, 19.0, 11.0, 9.0, 4.0, 3.0, 2.0, 3.0, 3.0, 1.0, 3.0, 4.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-56.92397689819336, -55.117820739746094, -53.311668395996094, -51.50551223754883, -49.69935607910156, -47.89320373535156, -46.0870475769043, -44.28089141845703, -42.47473907470703, -40.668582916259766, -38.862430572509766, -37.0562744140625, -35.2501220703125, -33.443965911865234, -31.63780975341797, -29.831655502319336, -28.025501251220703, -26.21934700012207, -24.413192749023438, -22.607036590576172, -20.80088233947754, -18.994728088378906, -17.18857192993164, -15.382417678833008, -13.576263427734375, -11.770109176635742, -9.963953971862793, -8.157798767089844, -6.351644515991211, -4.545490264892578, -2.739335060119629, -0.9331798553466797, 0.8729705810546875, 2.6791253089904785, 4.4852800369262695, 6.2914347648620605, 8.097589492797852, 9.903743743896484, 11.709898948669434, 13.516054153442383, 15.322208404541016, 17.12836265563965, 18.93451690673828, 20.740673065185547, 22.54682731628418, 24.352981567382812, 26.159137725830078, 27.96529197692871, 29.771446228027344, 31.577600479125977, 33.38375473022461, 35.189910888671875, 36.996063232421875, 38.80221939086914, 40.608375549316406, 42.414527893066406, 44.22068405151367, 46.02684020996094, 47.83299255371094, 49.6391487121582, 51.44530487060547, 53.25145721435547, 55.057613372802734, 56.86376953125, 58.669921875]}, "gradients/decoder.transformer.h.15.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 0.0, 0.0, 2.0, 3.0, 3.0, 3.0, 3.0, 3.0, 4.0, 3.0, 7.0, 13.0, 14.0, 11.0, 15.0, 14.0, 20.0, 28.0, 35.0, 35.0, 33.0, 35.0, 36.0, 20.0, 39.0, 34.0, 36.0, 55.0, 41.0, 44.0, 51.0, 35.0, 34.0, 43.0, 27.0, 36.0, 30.0, 30.0, 26.0, 19.0, 15.0, 8.0, 16.0, 13.0, 10.0, 7.0, 7.0, 5.0, 2.0, 3.0, 2.0, 2.0, 4.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-6.26171875, -6.0599365234375, -5.858154296875, -5.6563720703125, -5.45458984375, -5.2528076171875, -5.051025390625, -4.8492431640625, -4.6474609375, -4.4456787109375, -4.243896484375, -4.0421142578125, -3.84033203125, -3.6385498046875, -3.436767578125, -3.2349853515625, -3.033203125, -2.8314208984375, -2.629638671875, -2.4278564453125, -2.22607421875, -2.0242919921875, -1.822509765625, -1.6207275390625, -1.4189453125, -1.2171630859375, -1.015380859375, -0.8135986328125, -0.61181640625, -0.4100341796875, -0.208251953125, -0.0064697265625, 0.1953125, 0.3970947265625, 0.598876953125, 0.8006591796875, 1.00244140625, 1.2042236328125, 1.406005859375, 1.6077880859375, 1.8095703125, 2.0113525390625, 2.213134765625, 2.4149169921875, 2.61669921875, 2.8184814453125, 3.020263671875, 3.2220458984375, 3.423828125, 3.6256103515625, 3.827392578125, 4.0291748046875, 4.23095703125, 4.4327392578125, 4.634521484375, 4.8363037109375, 5.0380859375, 5.2398681640625, 5.441650390625, 5.6434326171875, 5.84521484375, 6.0469970703125, 6.248779296875, 6.4505615234375, 6.65234375]}, "gradients/decoder.transformer.h.15.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 4.0, 3.0, 4.0, 7.0, 8.0, 13.0, 13.0, 16.0, 23.0, 30.0, 39.0, 44.0, 78.0, 83.0, 150.0, 229.0, 337.0, 527.0, 1101.0, 2132.0, 5194.0, 14623.0, 48632.0, 188721.0, 681076.0, 1494286.0, 1193975.0, 412348.0, 105783.0, 28528.0, 9180.0, 3449.0, 1497.0, 751.0, 469.0, 291.0, 172.0, 118.0, 91.0, 73.0, 55.0, 36.0, 21.0, 29.0, 14.0, 10.0, 8.0, 8.0, 2.0, 7.0, 5.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-7.1796875, -6.953857421875, -6.72802734375, -6.502197265625, -6.2763671875, -6.050537109375, -5.82470703125, -5.598876953125, -5.373046875, -5.147216796875, -4.92138671875, -4.695556640625, -4.4697265625, -4.243896484375, -4.01806640625, -3.792236328125, -3.56640625, -3.340576171875, -3.11474609375, -2.888916015625, -2.6630859375, -2.437255859375, -2.21142578125, -1.985595703125, -1.759765625, -1.533935546875, -1.30810546875, -1.082275390625, -0.8564453125, -0.630615234375, -0.40478515625, -0.178955078125, 0.046875, 0.272705078125, 0.49853515625, 0.724365234375, 0.9501953125, 1.176025390625, 1.40185546875, 1.627685546875, 1.853515625, 2.079345703125, 2.30517578125, 2.531005859375, 2.7568359375, 2.982666015625, 3.20849609375, 3.434326171875, 3.66015625, 3.885986328125, 4.11181640625, 4.337646484375, 4.5634765625, 4.789306640625, 5.01513671875, 5.240966796875, 5.466796875, 5.692626953125, 5.91845703125, 6.144287109375, 6.3701171875, 6.595947265625, 6.82177734375, 7.047607421875, 7.2734375]}, "gradients/decoder.transformer.h.15.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 2.0, 1.0, 1.0, 3.0, 6.0, 5.0, 6.0, 9.0, 4.0, 9.0, 9.0, 19.0, 20.0, 29.0, 38.0, 43.0, 45.0, 73.0, 88.0, 102.0, 163.0, 207.0, 239.0, 316.0, 371.0, 396.0, 381.0, 321.0, 262.0, 185.0, 179.0, 136.0, 97.0, 76.0, 65.0, 42.0, 36.0, 28.0, 14.0, 15.0, 10.0, 7.0, 10.0, 7.0, 5.0, 2.0, 3.0, 1.0, 0.0, 1.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-12.546875, -12.16064453125, -11.7744140625, -11.38818359375, -11.001953125, -10.61572265625, -10.2294921875, -9.84326171875, -9.45703125, -9.07080078125, -8.6845703125, -8.29833984375, -7.912109375, -7.52587890625, -7.1396484375, -6.75341796875, -6.3671875, -5.98095703125, -5.5947265625, -5.20849609375, -4.822265625, -4.43603515625, -4.0498046875, -3.66357421875, -3.27734375, -2.89111328125, -2.5048828125, -2.11865234375, -1.732421875, -1.34619140625, -0.9599609375, -0.57373046875, -0.1875, 0.19873046875, 0.5849609375, 0.97119140625, 1.357421875, 1.74365234375, 2.1298828125, 2.51611328125, 2.90234375, 3.28857421875, 3.6748046875, 4.06103515625, 4.447265625, 4.83349609375, 5.2197265625, 5.60595703125, 5.9921875, 6.37841796875, 6.7646484375, 7.15087890625, 7.537109375, 7.92333984375, 8.3095703125, 8.69580078125, 9.08203125, 9.46826171875, 9.8544921875, 10.24072265625, 10.626953125, 11.01318359375, 11.3994140625, 11.78564453125, 12.171875]}, "gradients/decoder.transformer.h.15.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 1.0, 4.0, 6.0, 7.0, 10.0, 9.0, 12.0, 20.0, 28.0, 28.0, 28.0, 41.0, 53.0, 66.0, 97.0, 97.0, 159.0, 215.0, 400.0, 878.0, 5917.0, 3516542.0, 665239.0, 2697.0, 631.0, 291.0, 204.0, 140.0, 102.0, 82.0, 49.0, 48.0, 35.0, 33.0, 18.0, 22.0, 14.0, 11.0, 14.0, 10.0, 4.0, 8.0, 6.0, 4.0, 4.0, 4.0, 1.0, 1.0, 1.0, 0.0, 3.0], "bins": [-70.625, -68.599609375, -66.57421875, -64.548828125, -62.5234375, -60.498046875, -58.47265625, -56.447265625, -54.421875, -52.396484375, -50.37109375, -48.345703125, -46.3203125, -44.294921875, -42.26953125, -40.244140625, -38.21875, -36.193359375, -34.16796875, -32.142578125, -30.1171875, -28.091796875, -26.06640625, -24.041015625, -22.015625, -19.990234375, -17.96484375, -15.939453125, -13.9140625, -11.888671875, -9.86328125, -7.837890625, -5.8125, -3.787109375, -1.76171875, 0.263671875, 2.2890625, 4.314453125, 6.33984375, 8.365234375, 10.390625, 12.416015625, 14.44140625, 16.466796875, 18.4921875, 20.517578125, 22.54296875, 24.568359375, 26.59375, 28.619140625, 30.64453125, 32.669921875, 34.6953125, 36.720703125, 38.74609375, 40.771484375, 42.796875, 44.822265625, 46.84765625, 48.873046875, 50.8984375, 52.923828125, 54.94921875, 56.974609375, 59.0]}, "gradients/decoder.transformer.h.15.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 0.0, 6.0, 12.0, 13.0, 61.0, 106.0, 175.0, 200.0, 200.0, 127.0, 64.0, 34.0, 12.0, 0.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-60.44974899291992, -57.365997314453125, -54.28224182128906, -51.198486328125, -48.1147346496582, -45.030982971191406, -41.947227478027344, -38.86347198486328, -35.779720306396484, -32.69596862792969, -29.612213134765625, -26.528459548950195, -23.444705963134766, -20.360952377319336, -17.277198791503906, -14.193445205688477, -11.109691619873047, -8.025938034057617, -4.9421844482421875, -1.8584308624267578, 1.2253227233886719, 4.309076309204102, 7.392829895019531, 10.476583480834961, 13.56033706665039, 16.64409065246582, 19.72784423828125, 22.81159782409668, 25.89535140991211, 28.97910499572754, 32.06285858154297, 35.14661407470703, 38.23036193847656, 41.314117431640625, 44.39786911010742, 47.48162078857422, 50.56537628173828, 53.649131774902344, 56.73288345336914, 59.81663513183594, 62.900390625, 65.98414611816406, 69.06790161132812, 72.15164947509766, 75.23540496826172, 78.31916046142578, 81.40290832519531, 84.48666381835938, 87.57041931152344, 90.6541748046875, 93.73793029785156, 96.8216781616211, 99.90543365478516, 102.98918914794922, 106.07293701171875, 109.15669250488281, 112.24044799804688, 115.32420349121094, 118.407958984375, 121.49170684814453, 124.5754623413086, 127.65921783447266, 130.7429656982422, 133.82672119140625, 136.9104766845703]}, "gradients/decoder.transformer.h.15.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 1.0, 1.0, 3.0, 3.0, 6.0, 0.0, 10.0, 5.0, 10.0, 9.0, 9.0, 15.0, 10.0, 20.0, 23.0, 22.0, 17.0, 27.0, 31.0, 29.0, 34.0, 36.0, 37.0, 45.0, 44.0, 53.0, 39.0, 48.0, 43.0, 30.0, 35.0, 36.0, 30.0, 37.0, 20.0, 40.0, 16.0, 24.0, 24.0, 19.0, 16.0, 13.0, 8.0, 2.0, 5.0, 10.0, 5.0, 3.0, 6.0, 1.0, 3.0, 0.0, 4.0, 0.0, 0.0, 0.0, 2.0], "bins": [-49.761070251464844, -48.26976013183594, -46.77845001220703, -45.287139892578125, -43.79582977294922, -42.30451965332031, -40.813209533691406, -39.321895599365234, -37.83058547973633, -36.33927536010742, -34.847965240478516, -33.35665512084961, -31.86534309387207, -30.374032974243164, -28.882722854614258, -27.39141082763672, -25.900102615356445, -24.40879249572754, -22.917482376098633, -21.426170349121094, -19.934860229492188, -18.44355010986328, -16.952239990234375, -15.460928916931152, -13.969618797302246, -12.47830867767334, -10.986997604370117, -9.495687484741211, -8.004377365112305, -6.513066291809082, -5.021756172180176, -3.530445098876953, -2.039134979248047, -0.547824501991272, 0.9434859752655029, 2.4347963333129883, 3.9261069297790527, 5.417417526245117, 6.908727645874023, 8.400038719177246, 9.891348838806152, 11.382658958435059, 12.873970031738281, 14.365280151367188, 15.856590270996094, 17.347900390625, 18.839210510253906, 20.330522537231445, 21.82183265686035, 23.313142776489258, 24.804452896118164, 26.295764923095703, 27.78707504272461, 29.278385162353516, 30.769695281982422, 32.26100540161133, 33.752315521240234, 35.24362564086914, 36.73493576049805, 38.22624588012695, 39.71755599975586, 41.20886993408203, 42.70018005371094, 44.191490173339844, 45.68280029296875]}, "gradients/decoder.transformer.h.15.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 5.0, 6.0, 4.0, 3.0, 6.0, 5.0, 9.0, 14.0, 15.0, 15.0, 17.0, 20.0, 29.0, 23.0, 44.0, 31.0, 33.0, 36.0, 37.0, 45.0, 34.0, 43.0, 36.0, 47.0, 30.0, 47.0, 40.0, 44.0, 28.0, 38.0, 29.0, 19.0, 28.0, 33.0, 24.0, 14.0, 20.0, 14.0, 8.0, 6.0, 5.0, 7.0, 3.0, 7.0, 3.0, 3.0, 3.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-6.26953125, -6.066650390625, -5.86376953125, -5.660888671875, -5.4580078125, -5.255126953125, -5.05224609375, -4.849365234375, -4.646484375, -4.443603515625, -4.24072265625, -4.037841796875, -3.8349609375, -3.632080078125, -3.42919921875, -3.226318359375, -3.0234375, -2.820556640625, -2.61767578125, -2.414794921875, -2.2119140625, -2.009033203125, -1.80615234375, -1.603271484375, -1.400390625, -1.197509765625, -0.99462890625, -0.791748046875, -0.5888671875, -0.385986328125, -0.18310546875, 0.019775390625, 0.22265625, 0.425537109375, 0.62841796875, 0.831298828125, 1.0341796875, 1.237060546875, 1.43994140625, 1.642822265625, 1.845703125, 2.048583984375, 2.25146484375, 2.454345703125, 2.6572265625, 2.860107421875, 3.06298828125, 3.265869140625, 3.46875, 3.671630859375, 3.87451171875, 4.077392578125, 4.2802734375, 4.483154296875, 4.68603515625, 4.888916015625, 5.091796875, 5.294677734375, 5.49755859375, 5.700439453125, 5.9033203125, 6.106201171875, 6.30908203125, 6.511962890625, 6.71484375]}, "gradients/decoder.transformer.h.15.crossattention.c_proj.weight": {"_type": "histogram", "values": [2.0, 4.0, 4.0, 3.0, 3.0, 5.0, 9.0, 15.0, 21.0, 40.0, 34.0, 66.0, 98.0, 128.0, 182.0, 257.0, 368.0, 558.0, 813.0, 1199.0, 1777.0, 2624.0, 4065.0, 6178.0, 9693.0, 14763.0, 23760.0, 38303.0, 61849.0, 109927.0, 250536.0, 246472.0, 108812.0, 61797.0, 37536.0, 23773.0, 15084.0, 9424.0, 6201.0, 3977.0, 2668.0, 1806.0, 1162.0, 813.0, 558.0, 373.0, 251.0, 173.0, 124.0, 90.0, 59.0, 54.0, 25.0, 11.0, 15.0, 11.0, 6.0, 8.0, 5.0, 1.0, 0.0, 1.0, 1.0, 1.0], "bins": [-1.5185546875, -1.4695587158203125, -1.420562744140625, -1.3715667724609375, -1.32257080078125, -1.2735748291015625, -1.224578857421875, -1.1755828857421875, -1.1265869140625, -1.0775909423828125, -1.028594970703125, -0.9795989990234375, -0.93060302734375, -0.8816070556640625, -0.832611083984375, -0.7836151123046875, -0.734619140625, -0.6856231689453125, -0.636627197265625, -0.5876312255859375, -0.53863525390625, -0.4896392822265625, -0.440643310546875, -0.3916473388671875, -0.3426513671875, -0.2936553955078125, -0.244659423828125, -0.1956634521484375, -0.14666748046875, -0.0976715087890625, -0.048675537109375, 0.0003204345703125, 0.04931640625, 0.0983123779296875, 0.147308349609375, 0.1963043212890625, 0.24530029296875, 0.2942962646484375, 0.343292236328125, 0.3922882080078125, 0.4412841796875, 0.4902801513671875, 0.539276123046875, 0.5882720947265625, 0.63726806640625, 0.6862640380859375, 0.735260009765625, 0.7842559814453125, 0.833251953125, 0.8822479248046875, 0.931243896484375, 0.9802398681640625, 1.02923583984375, 1.0782318115234375, 1.127227783203125, 1.1762237548828125, 1.2252197265625, 1.2742156982421875, 1.323211669921875, 1.3722076416015625, 1.42120361328125, 1.4701995849609375, 1.519195556640625, 1.5681915283203125, 1.6171875]}, "gradients/decoder.transformer.h.15.crossattention.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 2.0, 1.0, 1.0, 5.0, 5.0, 3.0, 5.0, 7.0, 6.0, 13.0, 8.0, 16.0, 15.0, 16.0, 23.0, 30.0, 25.0, 16.0, 32.0, 36.0, 31.0, 39.0, 49.0, 47.0, 49.0, 1063.0, 45.0, 41.0, 44.0, 40.0, 36.0, 33.0, 40.0, 24.0, 34.0, 34.0, 17.0, 18.0, 13.0, 10.0, 13.0, 14.0, 7.0, 17.0, 8.0, 3.0, 2.0, 3.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.65625, -3.5291748046875, -3.402099609375, -3.2750244140625, -3.14794921875, -3.0208740234375, -2.893798828125, -2.7667236328125, -2.6396484375, -2.5125732421875, -2.385498046875, -2.2584228515625, -2.13134765625, -2.0042724609375, -1.877197265625, -1.7501220703125, -1.623046875, -1.4959716796875, -1.368896484375, -1.2418212890625, -1.11474609375, -0.9876708984375, -0.860595703125, -0.7335205078125, -0.6064453125, -0.4793701171875, -0.352294921875, -0.2252197265625, -0.09814453125, 0.0289306640625, 0.156005859375, 0.2830810546875, 0.41015625, 0.5372314453125, 0.664306640625, 0.7913818359375, 0.91845703125, 1.0455322265625, 1.172607421875, 1.2996826171875, 1.4267578125, 1.5538330078125, 1.680908203125, 1.8079833984375, 1.93505859375, 2.0621337890625, 2.189208984375, 2.3162841796875, 2.443359375, 2.5704345703125, 2.697509765625, 2.8245849609375, 2.95166015625, 3.0787353515625, 3.205810546875, 3.3328857421875, 3.4599609375, 3.5870361328125, 3.714111328125, 3.8411865234375, 3.96826171875, 4.0953369140625, 4.222412109375, 4.3494873046875, 4.4765625]}, "gradients/decoder.transformer.h.15.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 2.0, 3.0, 2.0, 9.0, 8.0, 18.0, 20.0, 31.0, 40.0, 75.0, 91.0, 200.0, 291.0, 530.0, 872.0, 1657.0, 3055.0, 5600.0, 10343.0, 19957.0, 40595.0, 85642.0, 222134.0, 1464277.0, 126630.0, 57159.0, 27462.0, 14030.0, 7376.0, 3988.0, 2136.0, 1247.0, 684.0, 381.0, 217.0, 136.0, 82.0, 48.0, 29.0, 32.0, 12.0, 15.0, 7.0, 3.0, 5.0, 6.0, 2.0, 2.0, 1.0, 2.0, 0.0, 2.0], "bins": [-2.458984375, -2.389190673828125, -2.31939697265625, -2.249603271484375, -2.1798095703125, -2.110015869140625, -2.04022216796875, -1.970428466796875, -1.900634765625, -1.830841064453125, -1.76104736328125, -1.691253662109375, -1.6214599609375, -1.551666259765625, -1.48187255859375, -1.412078857421875, -1.34228515625, -1.272491455078125, -1.20269775390625, -1.132904052734375, -1.0631103515625, -0.993316650390625, -0.92352294921875, -0.853729248046875, -0.783935546875, -0.714141845703125, -0.64434814453125, -0.574554443359375, -0.5047607421875, -0.434967041015625, -0.36517333984375, -0.295379638671875, -0.2255859375, -0.155792236328125, -0.08599853515625, -0.016204833984375, 0.0535888671875, 0.123382568359375, 0.19317626953125, 0.262969970703125, 0.332763671875, 0.402557373046875, 0.47235107421875, 0.542144775390625, 0.6119384765625, 0.681732177734375, 0.75152587890625, 0.821319580078125, 0.89111328125, 0.960906982421875, 1.03070068359375, 1.100494384765625, 1.1702880859375, 1.240081787109375, 1.30987548828125, 1.379669189453125, 1.449462890625, 1.519256591796875, 1.58905029296875, 1.658843994140625, 1.7286376953125, 1.798431396484375, 1.86822509765625, 1.938018798828125, 2.0078125]}, "gradients/decoder.transformer.h.15.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 1.0, 9.0, 5.0, 6.0, 4.0, 11.0, 7.0, 12.0, 14.0, 13.0, 23.0, 29.0, 28.0, 29.0, 64.0, 79.0, 119.0, 132.0, 104.0, 53.0, 55.0, 46.0, 27.0, 29.0, 11.0, 22.0, 13.0, 14.0, 15.0, 8.0, 7.0, 5.0, 3.0, 2.0, 4.0, 3.0, 4.0, 0.0, 1.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0013170242309570312, -0.001270383596420288, -0.001223742961883545, -0.0011771023273468018, -0.0011304616928100586, -0.0010838210582733154, -0.0010371804237365723, -0.000990539789199829, -0.0009438991546630859, -0.0008972585201263428, -0.0008506178855895996, -0.0008039772510528564, -0.0007573366165161133, -0.0007106959819793701, -0.000664055347442627, -0.0006174147129058838, -0.0005707740783691406, -0.0005241334438323975, -0.0004774928092956543, -0.00043085217475891113, -0.00038421154022216797, -0.0003375709056854248, -0.00029093027114868164, -0.0002442896366119385, -0.0001976490020751953, -0.00015100836753845215, -0.00010436773300170898, -5.772709846496582e-05, -1.1086463928222656e-05, 3.555417060852051e-05, 8.219480514526367e-05, 0.00012883543968200684, 0.00017547607421875, 0.00022211670875549316, 0.00026875734329223633, 0.0003153979778289795, 0.00036203861236572266, 0.0004086792469024658, 0.000455319881439209, 0.0005019605159759521, 0.0005486011505126953, 0.0005952417850494385, 0.0006418824195861816, 0.0006885230541229248, 0.000735163688659668, 0.0007818043231964111, 0.0008284449577331543, 0.0008750855922698975, 0.0009217262268066406, 0.0009683668613433838, 0.001015007495880127, 0.0010616481304168701, 0.0011082887649536133, 0.0011549293994903564, 0.0012015700340270996, 0.0012482106685638428, 0.001294851303100586, 0.001341491937637329, 0.0013881325721740723, 0.0014347732067108154, 0.0014814138412475586, 0.0015280544757843018, 0.001574695110321045, 0.001621335744857788, 0.0016679763793945312]}, "gradients/decoder.transformer.h.15.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 2.0, 0.0, 3.0, 1.0, 1.0, 1.0, 3.0, 7.0, 4.0, 4.0, 8.0, 12.0, 13.0, 20.0, 17.0, 31.0, 28.0, 33.0, 69.0, 130.0, 236.0, 786.0, 801198.0, 244718.0, 680.0, 229.0, 99.0, 65.0, 48.0, 21.0, 24.0, 14.0, 12.0, 19.0, 7.0, 5.0, 7.0, 3.0, 2.0, 6.0, 0.0, 4.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.03924560546875, -0.0380549430847168, -0.036864280700683594, -0.03567361831665039, -0.03448295593261719, -0.033292293548583984, -0.03210163116455078, -0.030910968780517578, -0.029720306396484375, -0.028529644012451172, -0.02733898162841797, -0.026148319244384766, -0.024957656860351562, -0.02376699447631836, -0.022576332092285156, -0.021385669708251953, -0.02019500732421875, -0.019004344940185547, -0.017813682556152344, -0.01662302017211914, -0.015432357788085938, -0.014241695404052734, -0.013051033020019531, -0.011860370635986328, -0.010669708251953125, -0.009479045867919922, -0.008288383483886719, -0.007097721099853516, -0.0059070587158203125, -0.004716396331787109, -0.0035257339477539062, -0.002335071563720703, -0.0011444091796875, 4.6253204345703125e-05, 0.0012369155883789062, 0.0024275779724121094, 0.0036182403564453125, 0.004808902740478516, 0.005999565124511719, 0.007190227508544922, 0.008380889892578125, 0.009571552276611328, 0.010762214660644531, 0.011952877044677734, 0.013143539428710938, 0.01433420181274414, 0.015524864196777344, 0.016715526580810547, 0.01790618896484375, 0.019096851348876953, 0.020287513732910156, 0.02147817611694336, 0.022668838500976562, 0.023859500885009766, 0.02505016326904297, 0.026240825653076172, 0.027431488037109375, 0.028622150421142578, 0.02981281280517578, 0.031003475189208984, 0.03219413757324219, 0.03338479995727539, 0.034575462341308594, 0.0357661247253418, 0.036956787109375]}, "gradients/decoder.transformer.h.15.ln_cross_attn.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 6.0, 29.0, 160.0, 429.0, 310.0, 68.0, 12.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0019975234754383564, -0.001925585325807333, -0.0018536471761763096, -0.0017817090265452862, -0.0017097708769142628, -0.0016378327272832394, -0.001565894577652216, -0.0014939564280211926, -0.0014220182783901691, -0.0013500801287591457, -0.0012781419791281223, -0.001206203829497099, -0.0011342656798660755, -0.0010623275302350521, -0.0009903893806040287, -0.0009184512309730053, -0.0008465130813419819, -0.0007745749317109585, -0.0007026367820799351, -0.0006306986324489117, -0.0005587604828178883, -0.00048682233318686485, -0.00041488418355584145, -0.00034294603392481804, -0.00027100788429379463, -0.00019906973466277122, -0.00012713158503174782, -5.519343540072441e-05, 1.6744714230298996e-05, 8.86828638613224e-05, 0.0001606210134923458, 0.00023255916312336922, 0.00030449707992374897, 0.0003764352295547724, 0.0004483733791857958, 0.0005203115288168192, 0.0005922496784478426, 0.000664187828078866, 0.0007361259777098894, 0.0008080641273409128, 0.0008800022769719362, 0.0009519404266029596, 0.001023878576233983, 0.0010958167258650064, 0.0011677548754960299, 0.0012396930251270533, 0.0013116311747580767, 0.0013835693243891, 0.0014555074740201235, 0.0015274456236511469, 0.0015993837732821703, 0.0016713219229131937, 0.0017432600725442171, 0.0018151982221752405, 0.001887136371806264, 0.0019590745214372873, 0.0020310126710683107, 0.002102950820699334, 0.0021748889703303576, 0.002246827119961381, 0.0023187652695924044, 0.0023907034192234278, 0.002462641568854451, 0.0025345797184854746, 0.002606517868116498]}, "gradients/decoder.transformer.h.15.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 3.0, 2.0, 3.0, 1.0, 3.0, 4.0, 9.0, 6.0, 9.0, 6.0, 16.0, 22.0, 21.0, 18.0, 18.0, 28.0, 37.0, 31.0, 33.0, 39.0, 35.0, 42.0, 33.0, 44.0, 43.0, 40.0, 41.0, 48.0, 41.0, 32.0, 51.0, 28.0, 26.0, 29.0, 28.0, 23.0, 18.0, 17.0, 20.0, 17.0, 14.0, 6.0, 10.0, 6.0, 4.0, 3.0, 1.0, 1.0, 1.0, 2.0, 4.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0006338357925415039, -0.0006140219047665596, -0.0005942080169916153, -0.000574394129216671, -0.0005545802414417267, -0.0005347663536667824, -0.0005149524658918381, -0.0004951385781168938, -0.00047532469034194946, -0.00045551080256700516, -0.00043569691479206085, -0.00041588302701711655, -0.00039606913924217224, -0.00037625525146722794, -0.00035644136369228363, -0.0003366274759173393, -0.000316813588142395, -0.0002969997003674507, -0.0002771858125925064, -0.0002573719248175621, -0.0002375580370426178, -0.0002177441492676735, -0.0001979302614927292, -0.00017811637371778488, -0.00015830248594284058, -0.00013848859816789627, -0.00011867471039295197, -9.886082261800766e-05, -7.904693484306335e-05, -5.923304706811905e-05, -3.9419159293174744e-05, -1.9605271518230438e-05, 2.086162567138672e-07, 2.0022504031658173e-05, 3.983639180660248e-05, 5.9650279581546783e-05, 7.946416735649109e-05, 9.92780551314354e-05, 0.0001190919429063797, 0.000138905830681324, 0.0001587197184562683, 0.00017853360623121262, 0.00019834749400615692, 0.00021816138178110123, 0.00023797526955604553, 0.00025778915733098984, 0.00027760304510593414, 0.00029741693288087845, 0.00031723082065582275, 0.00033704470843076706, 0.00035685859620571136, 0.00037667248398065567, 0.0003964863717556, 0.0004163002595305443, 0.0004361141473054886, 0.0004559280350804329, 0.0004757419228553772, 0.0004955558106303215, 0.0005153696984052658, 0.0005351835861802101, 0.0005549974739551544, 0.0005748113617300987, 0.000594625249505043, 0.0006144391372799873, 0.0006342530250549316]}, "gradients/decoder.transformer.h.15.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 5.0, 6.0, 4.0, 3.0, 6.0, 5.0, 9.0, 14.0, 15.0, 15.0, 17.0, 20.0, 29.0, 23.0, 44.0, 31.0, 33.0, 36.0, 37.0, 45.0, 34.0, 43.0, 36.0, 47.0, 30.0, 47.0, 40.0, 44.0, 28.0, 38.0, 29.0, 19.0, 28.0, 34.0, 23.0, 14.0, 20.0, 14.0, 8.0, 6.0, 5.0, 7.0, 3.0, 7.0, 3.0, 3.0, 3.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-6.26953125, -6.066650390625, -5.86376953125, -5.660888671875, -5.4580078125, -5.255126953125, -5.05224609375, -4.849365234375, -4.646484375, -4.443603515625, -4.24072265625, -4.037841796875, -3.8349609375, -3.632080078125, -3.42919921875, -3.226318359375, -3.0234375, -2.820556640625, -2.61767578125, -2.414794921875, -2.2119140625, -2.009033203125, -1.80615234375, -1.603271484375, -1.400390625, -1.197509765625, -0.99462890625, -0.791748046875, -0.5888671875, -0.385986328125, -0.18310546875, 0.019775390625, 0.22265625, 0.425537109375, 0.62841796875, 0.831298828125, 1.0341796875, 1.237060546875, 1.43994140625, 1.642822265625, 1.845703125, 2.048583984375, 2.25146484375, 2.454345703125, 2.6572265625, 2.860107421875, 3.06298828125, 3.265869140625, 3.46875, 3.671630859375, 3.87451171875, 4.077392578125, 4.2802734375, 4.483154296875, 4.68603515625, 4.888916015625, 5.091796875, 5.294677734375, 5.49755859375, 5.700439453125, 5.9033203125, 6.106201171875, 6.30908203125, 6.511962890625, 6.71484375]}, "gradients/decoder.transformer.h.15.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 3.0, 4.0, 6.0, 9.0, 9.0, 18.0, 32.0, 44.0, 52.0, 93.0, 161.0, 253.0, 355.0, 620.0, 1177.0, 2194.0, 4553.0, 9890.0, 23833.0, 64983.0, 213248.0, 463455.0, 171645.0, 54616.0, 20238.0, 8651.0, 3834.0, 1943.0, 1111.0, 570.0, 361.0, 213.0, 133.0, 87.0, 57.0, 40.0, 27.0, 17.0, 15.0, 4.0, 2.0, 3.0, 2.0, 4.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-6.234375, -6.06317138671875, -5.8919677734375, -5.72076416015625, -5.549560546875, -5.37835693359375, -5.2071533203125, -5.03594970703125, -4.86474609375, -4.69354248046875, -4.5223388671875, -4.35113525390625, -4.179931640625, -4.00872802734375, -3.8375244140625, -3.66632080078125, -3.4951171875, -3.32391357421875, -3.1527099609375, -2.98150634765625, -2.810302734375, -2.63909912109375, -2.4678955078125, -2.29669189453125, -2.12548828125, -1.95428466796875, -1.7830810546875, -1.61187744140625, -1.440673828125, -1.26947021484375, -1.0982666015625, -0.92706298828125, -0.755859375, -0.58465576171875, -0.4134521484375, -0.24224853515625, -0.071044921875, 0.10015869140625, 0.2713623046875, 0.44256591796875, 0.61376953125, 0.78497314453125, 0.9561767578125, 1.12738037109375, 1.298583984375, 1.46978759765625, 1.6409912109375, 1.81219482421875, 1.9833984375, 2.15460205078125, 2.3258056640625, 2.49700927734375, 2.668212890625, 2.83941650390625, 3.0106201171875, 3.18182373046875, 3.35302734375, 3.52423095703125, 3.6954345703125, 3.86663818359375, 4.037841796875, 4.20904541015625, 4.3802490234375, 4.55145263671875, 4.72265625]}, "gradients/decoder.transformer.h.15.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 4.0, 3.0, 4.0, 6.0, 7.0, 5.0, 7.0, 11.0, 18.0, 20.0, 26.0, 32.0, 23.0, 32.0, 51.0, 36.0, 44.0, 51.0, 66.0, 129.0, 1507.0, 398.0, 125.0, 70.0, 59.0, 49.0, 42.0, 50.0, 28.0, 37.0, 23.0, 30.0, 14.0, 9.0, 15.0, 5.0, 7.0, 4.0, 5.0, 2.0, 3.0, 1.0, 3.0, 0.0, 4.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-22.546875, -21.767822265625, -20.98876953125, -20.209716796875, -19.4306640625, -18.651611328125, -17.87255859375, -17.093505859375, -16.314453125, -15.535400390625, -14.75634765625, -13.977294921875, -13.1982421875, -12.419189453125, -11.64013671875, -10.861083984375, -10.08203125, -9.302978515625, -8.52392578125, -7.744873046875, -6.9658203125, -6.186767578125, -5.40771484375, -4.628662109375, -3.849609375, -3.070556640625, -2.29150390625, -1.512451171875, -0.7333984375, 0.045654296875, 0.82470703125, 1.603759765625, 2.3828125, 3.161865234375, 3.94091796875, 4.719970703125, 5.4990234375, 6.278076171875, 7.05712890625, 7.836181640625, 8.615234375, 9.394287109375, 10.17333984375, 10.952392578125, 11.7314453125, 12.510498046875, 13.28955078125, 14.068603515625, 14.84765625, 15.626708984375, 16.40576171875, 17.184814453125, 17.9638671875, 18.742919921875, 19.52197265625, 20.301025390625, 21.080078125, 21.859130859375, 22.63818359375, 23.417236328125, 24.1962890625, 24.975341796875, 25.75439453125, 26.533447265625, 27.3125]}, "gradients/decoder.transformer.h.15.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 4.0, 3.0, 1.0, 4.0, 3.0, 9.0, 3.0, 10.0, 5.0, 18.0, 12.0, 19.0, 30.0, 39.0, 50.0, 64.0, 107.0, 139.0, 163.0, 259.0, 405.0, 829.0, 6193.0, 2980428.0, 152992.0, 2284.0, 532.0, 263.0, 202.0, 165.0, 119.0, 82.0, 67.0, 43.0, 41.0, 39.0, 26.0, 15.0, 12.0, 6.0, 9.0, 9.0, 4.0, 6.0, 2.0, 2.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-58.65625, -56.97119140625, -55.2861328125, -53.60107421875, -51.916015625, -50.23095703125, -48.5458984375, -46.86083984375, -45.17578125, -43.49072265625, -41.8056640625, -40.12060546875, -38.435546875, -36.75048828125, -35.0654296875, -33.38037109375, -31.6953125, -30.01025390625, -28.3251953125, -26.64013671875, -24.955078125, -23.27001953125, -21.5849609375, -19.89990234375, -18.21484375, -16.52978515625, -14.8447265625, -13.15966796875, -11.474609375, -9.78955078125, -8.1044921875, -6.41943359375, -4.734375, -3.04931640625, -1.3642578125, 0.32080078125, 2.005859375, 3.69091796875, 5.3759765625, 7.06103515625, 8.74609375, 10.43115234375, 12.1162109375, 13.80126953125, 15.486328125, 17.17138671875, 18.8564453125, 20.54150390625, 22.2265625, 23.91162109375, 25.5966796875, 27.28173828125, 28.966796875, 30.65185546875, 32.3369140625, 34.02197265625, 35.70703125, 37.39208984375, 39.0771484375, 40.76220703125, 42.447265625, 44.13232421875, 45.8173828125, 47.50244140625, 49.1875]}, "gradients/decoder.transformer.h.15.ln_1.weight": {"_type": "histogram", "values": [6.0, 85.0, 528.0, 356.0, 42.0, 1.0, 1.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-10.14566421508789, -6.601571083068848, -3.057478427886963, 0.4866142272949219, 4.030707359313965, 7.574800491333008, 11.118892669677734, 14.662986755371094, 18.20707893371582, 21.751171112060547, 25.295265197753906, 28.839357376098633, 32.38344955444336, 35.92754364013672, 39.47163391113281, 43.01573181152344, 46.55982208251953, 50.10391616821289, 53.648006439208984, 57.192100524902344, 60.7361946105957, 64.28028869628906, 67.82437896728516, 71.36846923828125, 74.91256713867188, 78.45665740966797, 82.0007553100586, 85.54484558105469, 89.08893585205078, 92.6330337524414, 96.1771240234375, 99.72122192382812, 103.26531219482422, 106.80940246582031, 110.35350036621094, 113.89759063720703, 117.44168090820312, 120.98577880859375, 124.52986907958984, 128.07395935058594, 131.61805725097656, 135.1621551513672, 138.70623779296875, 142.25033569335938, 145.79443359375, 149.33851623535156, 152.8826141357422, 156.4267120361328, 159.97079467773438, 163.514892578125, 167.05897521972656, 170.6030731201172, 174.1471710205078, 177.69125366210938, 181.2353515625, 184.77944946289062, 188.32354736328125, 191.86764526367188, 195.41172790527344, 198.95582580566406, 202.4999237060547, 206.04400634765625, 209.58810424804688, 213.1322021484375, 216.67628479003906]}, "gradients/decoder.transformer.h.15.ln_1.bias": {"_type": "histogram", "values": [3.0, 4.0, 3.0, 3.0, 0.0, 4.0, 2.0, 2.0, 5.0, 15.0, 3.0, 10.0, 19.0, 12.0, 16.0, 24.0, 30.0, 22.0, 22.0, 33.0, 51.0, 33.0, 40.0, 38.0, 42.0, 42.0, 48.0, 39.0, 48.0, 46.0, 50.0, 32.0, 46.0, 31.0, 37.0, 26.0, 19.0, 16.0, 20.0, 20.0, 14.0, 13.0, 7.0, 9.0, 4.0, 7.0, 3.0, 2.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-47.633689880371094, -45.84664535522461, -44.05959701538086, -42.272552490234375, -40.485504150390625, -38.69845962524414, -36.911415100097656, -35.124366760253906, -33.33732223510742, -31.550275802612305, -29.763229370117188, -27.976184844970703, -26.189138412475586, -24.40209197998047, -22.61504554748535, -20.827999114990234, -19.040952682495117, -17.25390625, -15.4668607711792, -13.679814338684082, -11.892768859863281, -10.105722427368164, -8.318675994873047, -6.531630516052246, -4.744584083557129, -2.95753812789917, -1.1704919338226318, 0.6165542602539062, 2.4036002159118652, 4.190646171569824, 5.977692604064941, 7.764738082885742, 9.55178451538086, 11.338830947875977, 13.125876426696777, 14.912922859191895, 16.699968338012695, 18.487014770507812, 20.27406120300293, 22.061107635498047, 23.84815216064453, 25.63519859313965, 27.422245025634766, 29.20928955078125, 30.996335983276367, 32.783382415771484, 34.57042694091797, 36.35747528076172, 38.14452362060547, 39.93156814575195, 41.7186164855957, 43.50566101074219, 45.29270935058594, 47.07975387573242, 48.866798400878906, 50.653846740722656, 52.44089126586914, 54.227935791015625, 56.014984130859375, 57.80202865600586, 59.58907699584961, 61.376121520996094, 63.163169860839844, 64.95021057128906, 66.73725891113281]}, "gradients/decoder.transformer.h.14.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 0.0, 4.0, 3.0, 4.0, 5.0, 3.0, 9.0, 10.0, 14.0, 6.0, 15.0, 21.0, 19.0, 23.0, 32.0, 37.0, 38.0, 26.0, 31.0, 30.0, 51.0, 45.0, 38.0, 35.0, 36.0, 48.0, 42.0, 51.0, 39.0, 37.0, 17.0, 37.0, 33.0, 29.0, 28.0, 23.0, 14.0, 13.0, 13.0, 14.0, 9.0, 6.0, 7.0, 3.0, 3.0, 4.0, 6.0, 0.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-6.6640625, -6.449951171875, -6.23583984375, -6.021728515625, -5.8076171875, -5.593505859375, -5.37939453125, -5.165283203125, -4.951171875, -4.737060546875, -4.52294921875, -4.308837890625, -4.0947265625, -3.880615234375, -3.66650390625, -3.452392578125, -3.23828125, -3.024169921875, -2.81005859375, -2.595947265625, -2.3818359375, -2.167724609375, -1.95361328125, -1.739501953125, -1.525390625, -1.311279296875, -1.09716796875, -0.883056640625, -0.6689453125, -0.454833984375, -0.24072265625, -0.026611328125, 0.1875, 0.401611328125, 0.61572265625, 0.829833984375, 1.0439453125, 1.258056640625, 1.47216796875, 1.686279296875, 1.900390625, 2.114501953125, 2.32861328125, 2.542724609375, 2.7568359375, 2.970947265625, 3.18505859375, 3.399169921875, 3.61328125, 3.827392578125, 4.04150390625, 4.255615234375, 4.4697265625, 4.683837890625, 4.89794921875, 5.112060546875, 5.326171875, 5.540283203125, 5.75439453125, 5.968505859375, 6.1826171875, 6.396728515625, 6.61083984375, 6.824951171875, 7.0390625]}, "gradients/decoder.transformer.h.14.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 6.0, 7.0, 8.0, 8.0, 9.0, 14.0, 17.0, 10.0, 22.0, 26.0, 29.0, 32.0, 45.0, 58.0, 67.0, 131.0, 259.0, 682.0, 4329.0, 155508.0, 3543288.0, 479923.0, 8095.0, 869.0, 283.0, 157.0, 87.0, 68.0, 56.0, 42.0, 23.0, 21.0, 16.0, 23.0, 9.0, 16.0, 9.0, 8.0, 5.0, 8.0, 3.0, 3.0, 5.0, 2.0, 2.0, 3.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-24.28125, -23.48828125, -22.6953125, -21.90234375, -21.109375, -20.31640625, -19.5234375, -18.73046875, -17.9375, -17.14453125, -16.3515625, -15.55859375, -14.765625, -13.97265625, -13.1796875, -12.38671875, -11.59375, -10.80078125, -10.0078125, -9.21484375, -8.421875, -7.62890625, -6.8359375, -6.04296875, -5.25, -4.45703125, -3.6640625, -2.87109375, -2.078125, -1.28515625, -0.4921875, 0.30078125, 1.09375, 1.88671875, 2.6796875, 3.47265625, 4.265625, 5.05859375, 5.8515625, 6.64453125, 7.4375, 8.23046875, 9.0234375, 9.81640625, 10.609375, 11.40234375, 12.1953125, 12.98828125, 13.78125, 14.57421875, 15.3671875, 16.16015625, 16.953125, 17.74609375, 18.5390625, 19.33203125, 20.125, 20.91796875, 21.7109375, 22.50390625, 23.296875, 24.08984375, 24.8828125, 25.67578125, 26.46875]}, "gradients/decoder.transformer.h.14.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 4.0, 1.0, 2.0, 2.0, 6.0, 6.0, 13.0, 11.0, 13.0, 25.0, 21.0, 46.0, 62.0, 92.0, 117.0, 185.0, 313.0, 448.0, 536.0, 550.0, 457.0, 381.0, 280.0, 175.0, 94.0, 69.0, 47.0, 32.0, 36.0, 22.0, 13.0, 9.0, 3.0, 6.0, 3.0, 1.0, 5.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-13.8671875, -13.3375244140625, -12.807861328125, -12.2781982421875, -11.74853515625, -11.2188720703125, -10.689208984375, -10.1595458984375, -9.6298828125, -9.1002197265625, -8.570556640625, -8.0408935546875, -7.51123046875, -6.9815673828125, -6.451904296875, -5.9222412109375, -5.392578125, -4.8629150390625, -4.333251953125, -3.8035888671875, -3.27392578125, -2.7442626953125, -2.214599609375, -1.6849365234375, -1.1552734375, -0.6256103515625, -0.095947265625, 0.4337158203125, 0.96337890625, 1.4930419921875, 2.022705078125, 2.5523681640625, 3.08203125, 3.6116943359375, 4.141357421875, 4.6710205078125, 5.20068359375, 5.7303466796875, 6.260009765625, 6.7896728515625, 7.3193359375, 7.8489990234375, 8.378662109375, 8.9083251953125, 9.43798828125, 9.9676513671875, 10.497314453125, 11.0269775390625, 11.556640625, 12.0863037109375, 12.615966796875, 13.1456298828125, 13.67529296875, 14.2049560546875, 14.734619140625, 15.2642822265625, 15.7939453125, 16.3236083984375, 16.853271484375, 17.3829345703125, 17.91259765625, 18.4422607421875, 18.971923828125, 19.5015869140625, 20.03125]}, "gradients/decoder.transformer.h.14.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 3.0, 6.0, 5.0, 1.0, 4.0, 3.0, 13.0, 7.0, 18.0, 23.0, 28.0, 27.0, 31.0, 40.0, 71.0, 66.0, 153.0, 167.0, 253.0, 575.0, 1694.0, 571076.0, 3615588.0, 2909.0, 581.0, 279.0, 181.0, 97.0, 86.0, 51.0, 75.0, 32.0, 32.0, 28.0, 17.0, 17.0, 11.0, 11.0, 8.0, 5.0, 7.0, 3.0, 8.0, 3.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-77.0, -74.525390625, -72.05078125, -69.576171875, -67.1015625, -64.626953125, -62.15234375, -59.677734375, -57.203125, -54.728515625, -52.25390625, -49.779296875, -47.3046875, -44.830078125, -42.35546875, -39.880859375, -37.40625, -34.931640625, -32.45703125, -29.982421875, -27.5078125, -25.033203125, -22.55859375, -20.083984375, -17.609375, -15.134765625, -12.66015625, -10.185546875, -7.7109375, -5.236328125, -2.76171875, -0.287109375, 2.1875, 4.662109375, 7.13671875, 9.611328125, 12.0859375, 14.560546875, 17.03515625, 19.509765625, 21.984375, 24.458984375, 26.93359375, 29.408203125, 31.8828125, 34.357421875, 36.83203125, 39.306640625, 41.78125, 44.255859375, 46.73046875, 49.205078125, 51.6796875, 54.154296875, 56.62890625, 59.103515625, 61.578125, 64.052734375, 66.52734375, 69.001953125, 71.4765625, 73.951171875, 76.42578125, 78.900390625, 81.375]}, "gradients/decoder.transformer.h.14.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 6.0, 20.0, 64.0, 140.0, 241.0, 261.0, 166.0, 81.0, 24.0, 7.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-83.1891860961914, -79.55642700195312, -75.92367553710938, -72.2909164428711, -68.65815734863281, -65.02540588378906, -61.39264678955078, -57.7598876953125, -54.127132415771484, -50.49437713623047, -46.86161804199219, -43.22886276245117, -39.596107482910156, -35.963348388671875, -32.33059310913086, -28.69783592224121, -25.065078735351562, -21.432321548461914, -17.799564361572266, -14.16680908203125, -10.534051895141602, -6.901294708251953, -3.2685394287109375, 0.36421775817871094, 3.9969749450683594, 7.62973165512085, 11.26248836517334, 14.895244598388672, 18.52800178527832, 22.16075897216797, 25.793514251708984, 29.426271438598633, 33.05903625488281, 36.69179153442383, 40.32455062866211, 43.957305908203125, 47.590065002441406, 51.22282028198242, 54.85557556152344, 58.48833465576172, 62.121089935302734, 65.75384521484375, 69.38660430908203, 73.01936340332031, 76.65211486816406, 80.28487396240234, 83.91763305664062, 87.55038452148438, 91.18314361572266, 94.81590270996094, 98.44865417480469, 102.08141326904297, 105.71417236328125, 109.346923828125, 112.97968292236328, 116.61244201660156, 120.24519348144531, 123.8779525756836, 127.51070404052734, 131.14346313476562, 134.77621459960938, 138.4089813232422, 142.04173278808594, 145.6744842529297, 149.3072509765625]}, "gradients/decoder.transformer.h.14.ln_2.bias": {"_type": "histogram", "values": [2.0, 0.0, 2.0, 1.0, 4.0, 1.0, 3.0, 5.0, 9.0, 5.0, 5.0, 9.0, 7.0, 17.0, 12.0, 21.0, 18.0, 24.0, 22.0, 16.0, 24.0, 26.0, 26.0, 32.0, 43.0, 43.0, 43.0, 36.0, 43.0, 35.0, 47.0, 45.0, 38.0, 39.0, 31.0, 32.0, 22.0, 29.0, 26.0, 23.0, 25.0, 16.0, 15.0, 16.0, 17.0, 8.0, 14.0, 5.0, 6.0, 7.0, 6.0, 7.0, 7.0, 2.0, 0.0, 2.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-39.503021240234375, -38.1512451171875, -36.799468994140625, -35.44769287109375, -34.095916748046875, -32.744136810302734, -31.39236068725586, -30.040584564208984, -28.68880844116211, -27.337032318115234, -25.98525619506836, -24.63347816467285, -23.281702041625977, -21.9299259185791, -20.578147888183594, -19.22637176513672, -17.874595642089844, -16.52281951904297, -15.171042442321777, -13.819265365600586, -12.467489242553711, -11.115713119506836, -9.763936042785645, -8.412158966064453, -7.060382843017578, -5.708606243133545, -4.356829643249512, -3.0050530433654785, -1.6532764434814453, -0.3014998435974121, 1.050276756286621, 2.4020538330078125, 3.7538299560546875, 5.105606555938721, 6.457383155822754, 7.809159755706787, 9.16093635559082, 10.512712478637695, 11.864489555358887, 13.216266632080078, 14.568042755126953, 15.919818878173828, 17.271595001220703, 18.62337303161621, 19.975149154663086, 21.32692527770996, 22.67870330810547, 24.030479431152344, 25.38225555419922, 26.734031677246094, 28.08580780029297, 29.437585830688477, 30.78936195373535, 32.14113998413086, 33.492916107177734, 34.84469223022461, 36.196468353271484, 37.54824447631836, 38.900020599365234, 40.25179672241211, 41.60357666015625, 42.955352783203125, 44.30712890625, 45.658905029296875, 47.01068115234375]}, "gradients/decoder.transformer.h.14.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 4.0, 4.0, 5.0, 2.0, 3.0, 7.0, 4.0, 10.0, 11.0, 18.0, 14.0, 20.0, 25.0, 17.0, 27.0, 21.0, 32.0, 37.0, 36.0, 45.0, 37.0, 46.0, 48.0, 35.0, 38.0, 49.0, 44.0, 32.0, 40.0, 33.0, 42.0, 35.0, 24.0, 24.0, 32.0, 20.0, 15.0, 13.0, 18.0, 10.0, 6.0, 9.0, 4.0, 6.0, 6.0, 3.0, 1.0, 2.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-6.87109375, -6.65704345703125, -6.4429931640625, -6.22894287109375, -6.014892578125, -5.80084228515625, -5.5867919921875, -5.37274169921875, -5.15869140625, -4.94464111328125, -4.7305908203125, -4.51654052734375, -4.302490234375, -4.08843994140625, -3.8743896484375, -3.66033935546875, -3.4462890625, -3.23223876953125, -3.0181884765625, -2.80413818359375, -2.590087890625, -2.37603759765625, -2.1619873046875, -1.94793701171875, -1.73388671875, -1.51983642578125, -1.3057861328125, -1.09173583984375, -0.877685546875, -0.66363525390625, -0.4495849609375, -0.23553466796875, -0.021484375, 0.19256591796875, 0.4066162109375, 0.62066650390625, 0.834716796875, 1.04876708984375, 1.2628173828125, 1.47686767578125, 1.69091796875, 1.90496826171875, 2.1190185546875, 2.33306884765625, 2.547119140625, 2.76116943359375, 2.9752197265625, 3.18927001953125, 3.4033203125, 3.61737060546875, 3.8314208984375, 4.04547119140625, 4.259521484375, 4.47357177734375, 4.6876220703125, 4.90167236328125, 5.11572265625, 5.32977294921875, 5.5438232421875, 5.75787353515625, 5.971923828125, 6.18597412109375, 6.4000244140625, 6.61407470703125, 6.828125]}, "gradients/decoder.transformer.h.14.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 6.0, 4.0, 5.0, 11.0, 10.0, 20.0, 29.0, 37.0, 62.0, 114.0, 168.0, 254.0, 418.0, 670.0, 1024.0, 1682.0, 2701.0, 4337.0, 7020.0, 11364.0, 18474.0, 30803.0, 54174.0, 100761.0, 227106.0, 309204.0, 122586.0, 63850.0, 36024.0, 21617.0, 12949.0, 7916.0, 4826.0, 3122.0, 1928.0, 1183.0, 786.0, 465.0, 319.0, 216.0, 111.0, 65.0, 58.0, 35.0, 21.0, 13.0, 7.0, 5.0, 2.0, 6.0, 4.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-1.8828125, -1.8242340087890625, -1.765655517578125, -1.7070770263671875, -1.64849853515625, -1.5899200439453125, -1.531341552734375, -1.4727630615234375, -1.4141845703125, -1.3556060791015625, -1.297027587890625, -1.2384490966796875, -1.17987060546875, -1.1212921142578125, -1.062713623046875, -1.0041351318359375, -0.945556640625, -0.8869781494140625, -0.828399658203125, -0.7698211669921875, -0.71124267578125, -0.6526641845703125, -0.594085693359375, -0.5355072021484375, -0.4769287109375, -0.4183502197265625, -0.359771728515625, -0.3011932373046875, -0.24261474609375, -0.1840362548828125, -0.125457763671875, -0.0668792724609375, -0.00830078125, 0.0502777099609375, 0.108856201171875, 0.1674346923828125, 0.22601318359375, 0.2845916748046875, 0.343170166015625, 0.4017486572265625, 0.4603271484375, 0.5189056396484375, 0.577484130859375, 0.6360626220703125, 0.69464111328125, 0.7532196044921875, 0.811798095703125, 0.8703765869140625, 0.928955078125, 0.9875335693359375, 1.046112060546875, 1.1046905517578125, 1.16326904296875, 1.2218475341796875, 1.280426025390625, 1.3390045166015625, 1.3975830078125, 1.4561614990234375, 1.514739990234375, 1.5733184814453125, 1.63189697265625, 1.6904754638671875, 1.749053955078125, 1.8076324462890625, 1.8662109375]}, "gradients/decoder.transformer.h.14.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 2.0, 5.0, 3.0, 4.0, 6.0, 6.0, 10.0, 9.0, 9.0, 13.0, 14.0, 16.0, 21.0, 19.0, 28.0, 25.0, 36.0, 33.0, 36.0, 33.0, 35.0, 52.0, 37.0, 40.0, 1069.0, 56.0, 41.0, 48.0, 42.0, 33.0, 25.0, 25.0, 23.0, 32.0, 20.0, 20.0, 17.0, 19.0, 14.0, 11.0, 12.0, 8.0, 9.0, 7.0, 4.0, 4.0, 5.0, 2.0, 2.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-4.0625, -3.92889404296875, -3.7952880859375, -3.66168212890625, -3.528076171875, -3.39447021484375, -3.2608642578125, -3.12725830078125, -2.99365234375, -2.86004638671875, -2.7264404296875, -2.59283447265625, -2.459228515625, -2.32562255859375, -2.1920166015625, -2.05841064453125, -1.9248046875, -1.79119873046875, -1.6575927734375, -1.52398681640625, -1.390380859375, -1.25677490234375, -1.1231689453125, -0.98956298828125, -0.85595703125, -0.72235107421875, -0.5887451171875, -0.45513916015625, -0.321533203125, -0.18792724609375, -0.0543212890625, 0.07928466796875, 0.212890625, 0.34649658203125, 0.4801025390625, 0.61370849609375, 0.747314453125, 0.88092041015625, 1.0145263671875, 1.14813232421875, 1.28173828125, 1.41534423828125, 1.5489501953125, 1.68255615234375, 1.816162109375, 1.94976806640625, 2.0833740234375, 2.21697998046875, 2.3505859375, 2.48419189453125, 2.6177978515625, 2.75140380859375, 2.885009765625, 3.01861572265625, 3.1522216796875, 3.28582763671875, 3.41943359375, 3.55303955078125, 3.6866455078125, 3.82025146484375, 3.953857421875, 4.08746337890625, 4.2210693359375, 4.35467529296875, 4.48828125]}, "gradients/decoder.transformer.h.14.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 2.0, 2.0, 7.0, 7.0, 6.0, 8.0, 14.0, 16.0, 25.0, 50.0, 69.0, 107.0, 176.0, 311.0, 512.0, 885.0, 1634.0, 2804.0, 5197.0, 9452.0, 17762.0, 34821.0, 70598.0, 163034.0, 1493259.0, 156033.0, 68279.0, 34060.0, 17243.0, 9168.0, 5047.0, 2784.0, 1610.0, 855.0, 520.0, 305.0, 180.0, 98.0, 57.0, 52.0, 31.0, 16.0, 14.0, 9.0, 8.0, 6.0, 6.0, 2.0, 3.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-2.451171875, -2.377960205078125, -2.30474853515625, -2.231536865234375, -2.1583251953125, -2.085113525390625, -2.01190185546875, -1.938690185546875, -1.865478515625, -1.792266845703125, -1.71905517578125, -1.645843505859375, -1.5726318359375, -1.499420166015625, -1.42620849609375, -1.352996826171875, -1.27978515625, -1.206573486328125, -1.13336181640625, -1.060150146484375, -0.9869384765625, -0.913726806640625, -0.84051513671875, -0.767303466796875, -0.694091796875, -0.620880126953125, -0.54766845703125, -0.474456787109375, -0.4012451171875, -0.328033447265625, -0.25482177734375, -0.181610107421875, -0.1083984375, -0.035186767578125, 0.03802490234375, 0.111236572265625, 0.1844482421875, 0.257659912109375, 0.33087158203125, 0.404083251953125, 0.477294921875, 0.550506591796875, 0.62371826171875, 0.696929931640625, 0.7701416015625, 0.843353271484375, 0.91656494140625, 0.989776611328125, 1.06298828125, 1.136199951171875, 1.20941162109375, 1.282623291015625, 1.3558349609375, 1.429046630859375, 1.50225830078125, 1.575469970703125, 1.648681640625, 1.721893310546875, 1.79510498046875, 1.868316650390625, 1.9415283203125, 2.014739990234375, 2.08795166015625, 2.161163330078125, 2.234375]}, "gradients/decoder.transformer.h.14.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 0.0, 1.0, 1.0, 2.0, 2.0, 4.0, 7.0, 5.0, 10.0, 11.0, 9.0, 22.0, 27.0, 25.0, 48.0, 55.0, 77.0, 95.0, 106.0, 97.0, 95.0, 72.0, 54.0, 50.0, 29.0, 26.0, 22.0, 13.0, 19.0, 7.0, 6.0, 2.0, 6.0, 3.0, 3.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.00106048583984375, -0.0010221302509307861, -0.0009837746620178223, -0.0009454190731048584, -0.0009070634841918945, -0.0008687078952789307, -0.0008303523063659668, -0.0007919967174530029, -0.0007536411285400391, -0.0007152855396270752, -0.0006769299507141113, -0.0006385743618011475, -0.0006002187728881836, -0.0005618631839752197, -0.0005235075950622559, -0.000485152006149292, -0.0004467964172363281, -0.00040844082832336426, -0.0003700852394104004, -0.0003317296504974365, -0.00029337406158447266, -0.0002550184726715088, -0.00021666288375854492, -0.00017830729484558105, -0.0001399517059326172, -0.00010159611701965332, -6.324052810668945e-05, -2.4884939193725586e-05, 1.3470649719238281e-05, 5.182623863220215e-05, 9.018182754516602e-05, 0.00012853741645812988, 0.00016689300537109375, 0.00020524859428405762, 0.00024360418319702148, 0.00028195977210998535, 0.0003203153610229492, 0.0003586709499359131, 0.00039702653884887695, 0.0004353821277618408, 0.0004737377166748047, 0.0005120933055877686, 0.0005504488945007324, 0.0005888044834136963, 0.0006271600723266602, 0.000665515661239624, 0.0007038712501525879, 0.0007422268390655518, 0.0007805824279785156, 0.0008189380168914795, 0.0008572936058044434, 0.0008956491947174072, 0.0009340047836303711, 0.000972360372543335, 0.0010107159614562988, 0.0010490715503692627, 0.0010874271392822266, 0.0011257827281951904, 0.0011641383171081543, 0.0012024939060211182, 0.001240849494934082, 0.001279205083847046, 0.0013175606727600098, 0.0013559162616729736, 0.0013942718505859375]}, "gradients/decoder.transformer.h.14.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 1.0, 3.0, 4.0, 5.0, 10.0, 10.0, 12.0, 22.0, 23.0, 30.0, 41.0, 64.0, 95.0, 180.0, 289.0, 1018.0, 838657.0, 206637.0, 838.0, 256.0, 124.0, 61.0, 42.0, 46.0, 26.0, 18.0, 10.0, 8.0, 12.0, 5.0, 5.0, 4.0, 4.0, 5.0, 2.0, 1.0, 0.0, 1.0, 2.0], "bins": [-0.037139892578125, -0.036254167556762695, -0.03536844253540039, -0.034482717514038086, -0.03359699249267578, -0.03271126747131348, -0.03182554244995117, -0.030939817428588867, -0.030054092407226562, -0.029168367385864258, -0.028282642364501953, -0.02739691734313965, -0.026511192321777344, -0.02562546730041504, -0.024739742279052734, -0.02385401725769043, -0.022968292236328125, -0.02208256721496582, -0.021196842193603516, -0.02031111717224121, -0.019425392150878906, -0.0185396671295166, -0.017653942108154297, -0.016768217086791992, -0.015882492065429688, -0.014996767044067383, -0.014111042022705078, -0.013225317001342773, -0.012339591979980469, -0.011453866958618164, -0.01056814193725586, -0.009682416915893555, -0.00879669189453125, -0.007910966873168945, -0.007025241851806641, -0.006139516830444336, -0.005253791809082031, -0.0043680667877197266, -0.003482341766357422, -0.002596616744995117, -0.0017108917236328125, -0.0008251667022705078, 6.0558319091796875e-05, 0.0009462833404541016, 0.0018320083618164062, 0.002717733383178711, 0.0036034584045410156, 0.00448918342590332, 0.005374908447265625, 0.00626063346862793, 0.007146358489990234, 0.008032083511352539, 0.008917808532714844, 0.009803533554077148, 0.010689258575439453, 0.011574983596801758, 0.012460708618164062, 0.013346433639526367, 0.014232158660888672, 0.015117883682250977, 0.01600360870361328, 0.016889333724975586, 0.01777505874633789, 0.018660783767700195, 0.0195465087890625]}, "gradients/decoder.transformer.h.14.ln_cross_attn.weight": {"_type": "histogram", "values": [2.0, 2.0, 1.0, 6.0, 107.0, 617.0, 259.0, 25.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0006217927439138293, -0.0005131011130288243, -0.00040440948214381933, -0.0002957178803626448, -0.0001870262494776398, -7.833464769646525e-05, 3.0356983188539743e-05, 0.00013904861407354474, 0.00024774024495854974, 0.00035643187584355474, 0.00046512350672855973, 0.0005738150794059038, 0.0006825067102909088, 0.0007911983411759138, 0.0008998899720609188, 0.0010085816029459238, 0.0011172732338309288, 0.0012259648647159338, 0.0013346564956009388, 0.0014433481264859438, 0.0015520397573709488, 0.0016607313882559538, 0.0017694230191409588, 0.0018781146500259638, 0.0019868062809109688, 0.0020954979117959738, 0.0022041895426809788, 0.0023128811735659838, 0.0024215728044509888, 0.0025302644353359938, 0.0026389560662209988, 0.0027476476971060038, 0.002856339095160365, 0.00296503072604537, 0.003073722356930375, 0.00318241398781538, 0.003291105618700385, 0.00339979724958539, 0.003508488880470395, 0.0036171805113554, 0.003725872142240405, 0.00383456377312541, 0.003943255171179771, 0.00405194703489542, 0.004160638432949781, 0.00426933029666543, 0.004378021694719791, 0.00448671355843544, 0.004595404956489801, 0.004704096354544163, 0.004812788218259811, 0.004921479616314173, 0.005030171480029821, 0.005138862878084183, 0.005247554741799831, 0.005356246139854193, 0.005464938003569841, 0.005573629401624203, 0.005682321265339851, 0.005791012663394213, 0.005899704527109861, 0.006008395925164223, 0.006117087788879871, 0.006225779186934233, 0.006334471050649881]}, "gradients/decoder.transformer.h.14.ln_cross_attn.bias": {"_type": "histogram", "values": [3.0, 0.0, 0.0, 3.0, 0.0, 6.0, 4.0, 1.0, 3.0, 6.0, 4.0, 6.0, 9.0, 10.0, 15.0, 14.0, 8.0, 12.0, 17.0, 24.0, 22.0, 25.0, 28.0, 34.0, 39.0, 43.0, 33.0, 35.0, 41.0, 40.0, 39.0, 47.0, 53.0, 40.0, 40.0, 31.0, 41.0, 24.0, 26.0, 23.0, 16.0, 22.0, 22.0, 17.0, 12.0, 17.0, 11.0, 12.0, 11.0, 5.0, 6.0, 5.0, 6.0, 3.0, 1.0, 1.0, 2.0, 2.0, 1.0, 2.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0003898739814758301, -0.00037703942507505417, -0.00036420486867427826, -0.00035137031227350235, -0.00033853575587272644, -0.00032570119947195053, -0.0003128666430711746, -0.0003000320866703987, -0.0002871975302696228, -0.0002743629738688469, -0.000261528417468071, -0.0002486938610672951, -0.00023585930466651917, -0.00022302474826574326, -0.00021019019186496735, -0.00019735563546419144, -0.00018452107906341553, -0.00017168652266263962, -0.0001588519662618637, -0.0001460174098610878, -0.0001331828534603119, -0.00012034829705953598, -0.00010751374065876007, -9.467918425798416e-05, -8.184462785720825e-05, -6.901007145643234e-05, -5.617551505565643e-05, -4.3340958654880524e-05, -3.0506402254104614e-05, -1.7671845853328705e-05, -4.837289452552795e-06, 7.997266948223114e-06, 2.0831823348999023e-05, 3.366637974977493e-05, 4.650093615055084e-05, 5.933549255132675e-05, 7.217004895210266e-05, 8.500460535287857e-05, 9.783916175365448e-05, 0.00011067371815443039, 0.0001235082745552063, 0.0001363428309559822, 0.00014917738735675812, 0.00016201194375753403, 0.00017484650015830994, 0.00018768105655908585, 0.00020051561295986176, 0.00021335016936063766, 0.00022618472576141357, 0.00023901928216218948, 0.0002518538385629654, 0.0002646883949637413, 0.0002775229513645172, 0.0002903575077652931, 0.00030319206416606903, 0.00031602662056684494, 0.00032886117696762085, 0.00034169573336839676, 0.00035453028976917267, 0.0003673648461699486, 0.0003801994025707245, 0.0003930339589715004, 0.0004058685153722763, 0.0004187030717730522, 0.0004315376281738281]}, "gradients/decoder.transformer.h.14.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 4.0, 4.0, 5.0, 2.0, 3.0, 7.0, 4.0, 10.0, 12.0, 17.0, 14.0, 20.0, 25.0, 17.0, 27.0, 21.0, 32.0, 37.0, 36.0, 45.0, 37.0, 46.0, 48.0, 35.0, 38.0, 49.0, 44.0, 32.0, 40.0, 33.0, 42.0, 35.0, 24.0, 24.0, 32.0, 20.0, 15.0, 13.0, 18.0, 10.0, 6.0, 9.0, 4.0, 6.0, 6.0, 3.0, 1.0, 2.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-6.87109375, -6.65704345703125, -6.4429931640625, -6.22894287109375, -6.014892578125, -5.80084228515625, -5.5867919921875, -5.37274169921875, -5.15869140625, -4.94464111328125, -4.7305908203125, -4.51654052734375, -4.302490234375, -4.08843994140625, -3.8743896484375, -3.66033935546875, -3.4462890625, -3.23223876953125, -3.0181884765625, -2.80413818359375, -2.590087890625, -2.37603759765625, -2.1619873046875, -1.94793701171875, -1.73388671875, -1.51983642578125, -1.3057861328125, -1.09173583984375, -0.877685546875, -0.66363525390625, -0.4495849609375, -0.23553466796875, -0.021484375, 0.19256591796875, 0.4066162109375, 0.62066650390625, 0.834716796875, 1.04876708984375, 1.2628173828125, 1.47686767578125, 1.69091796875, 1.90496826171875, 2.1190185546875, 2.33306884765625, 2.547119140625, 2.76116943359375, 2.9752197265625, 3.18927001953125, 3.4033203125, 3.61737060546875, 3.8314208984375, 4.04547119140625, 4.259521484375, 4.47357177734375, 4.6876220703125, 4.90167236328125, 5.11572265625, 5.32977294921875, 5.5438232421875, 5.75787353515625, 5.971923828125, 6.18597412109375, 6.4000244140625, 6.61407470703125, 6.828125]}, "gradients/decoder.transformer.h.14.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 4.0, 6.0, 7.0, 13.0, 15.0, 10.0, 17.0, 19.0, 32.0, 52.0, 70.0, 86.0, 131.0, 184.0, 267.0, 350.0, 556.0, 856.0, 1392.0, 2450.0, 4936.0, 12084.0, 36199.0, 135031.0, 557557.0, 211600.0, 54123.0, 16565.0, 6394.0, 2925.0, 1553.0, 1002.0, 648.0, 427.0, 254.0, 205.0, 144.0, 121.0, 91.0, 59.0, 29.0, 23.0, 21.0, 19.0, 18.0, 7.0, 7.0, 3.0, 5.0, 0.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0], "bins": [-7.53125, -7.2933349609375, -7.055419921875, -6.8175048828125, -6.57958984375, -6.3416748046875, -6.103759765625, -5.8658447265625, -5.6279296875, -5.3900146484375, -5.152099609375, -4.9141845703125, -4.67626953125, -4.4383544921875, -4.200439453125, -3.9625244140625, -3.724609375, -3.4866943359375, -3.248779296875, -3.0108642578125, -2.77294921875, -2.5350341796875, -2.297119140625, -2.0592041015625, -1.8212890625, -1.5833740234375, -1.345458984375, -1.1075439453125, -0.86962890625, -0.6317138671875, -0.393798828125, -0.1558837890625, 0.08203125, 0.3199462890625, 0.557861328125, 0.7957763671875, 1.03369140625, 1.2716064453125, 1.509521484375, 1.7474365234375, 1.9853515625, 2.2232666015625, 2.461181640625, 2.6990966796875, 2.93701171875, 3.1749267578125, 3.412841796875, 3.6507568359375, 3.888671875, 4.1265869140625, 4.364501953125, 4.6024169921875, 4.84033203125, 5.0782470703125, 5.316162109375, 5.5540771484375, 5.7919921875, 6.0299072265625, 6.267822265625, 6.5057373046875, 6.74365234375, 6.9815673828125, 7.219482421875, 7.4573974609375, 7.6953125]}, "gradients/decoder.transformer.h.14.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 2.0, 4.0, 4.0, 6.0, 9.0, 8.0, 11.0, 19.0, 21.0, 23.0, 28.0, 32.0, 39.0, 43.0, 51.0, 60.0, 47.0, 97.0, 238.0, 1686.0, 136.0, 88.0, 65.0, 59.0, 41.0, 46.0, 39.0, 27.0, 24.0, 16.0, 22.0, 21.0, 13.0, 11.0, 8.0, 6.0, 5.0, 2.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0], "bins": [-31.625, -30.819580078125, -30.01416015625, -29.208740234375, -28.4033203125, -27.597900390625, -26.79248046875, -25.987060546875, -25.181640625, -24.376220703125, -23.57080078125, -22.765380859375, -21.9599609375, -21.154541015625, -20.34912109375, -19.543701171875, -18.73828125, -17.932861328125, -17.12744140625, -16.322021484375, -15.5166015625, -14.711181640625, -13.90576171875, -13.100341796875, -12.294921875, -11.489501953125, -10.68408203125, -9.878662109375, -9.0732421875, -8.267822265625, -7.46240234375, -6.656982421875, -5.8515625, -5.046142578125, -4.24072265625, -3.435302734375, -2.6298828125, -1.824462890625, -1.01904296875, -0.213623046875, 0.591796875, 1.397216796875, 2.20263671875, 3.008056640625, 3.8134765625, 4.618896484375, 5.42431640625, 6.229736328125, 7.03515625, 7.840576171875, 8.64599609375, 9.451416015625, 10.2568359375, 11.062255859375, 11.86767578125, 12.673095703125, 13.478515625, 14.283935546875, 15.08935546875, 15.894775390625, 16.7001953125, 17.505615234375, 18.31103515625, 19.116455078125, 19.921875]}, "gradients/decoder.transformer.h.14.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 1.0, 5.0, 2.0, 10.0, 6.0, 11.0, 30.0, 48.0, 96.0, 197.0, 511.0, 2768.0, 3137727.0, 3379.0, 521.0, 198.0, 99.0, 46.0, 33.0, 12.0, 6.0, 5.0, 3.0, 5.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-94.5, -89.66015625, -84.8203125, -79.98046875, -75.140625, -70.30078125, -65.4609375, -60.62109375, -55.78125, -50.94140625, -46.1015625, -41.26171875, -36.421875, -31.58203125, -26.7421875, -21.90234375, -17.0625, -12.22265625, -7.3828125, -2.54296875, 2.296875, 7.13671875, 11.9765625, 16.81640625, 21.65625, 26.49609375, 31.3359375, 36.17578125, 41.015625, 45.85546875, 50.6953125, 55.53515625, 60.375, 65.21484375, 70.0546875, 74.89453125, 79.734375, 84.57421875, 89.4140625, 94.25390625, 99.09375, 103.93359375, 108.7734375, 113.61328125, 118.453125, 123.29296875, 128.1328125, 132.97265625, 137.8125, 142.65234375, 147.4921875, 152.33203125, 157.171875, 162.01171875, 166.8515625, 171.69140625, 176.53125, 181.37109375, 186.2109375, 191.05078125, 195.890625, 200.73046875, 205.5703125, 210.41015625, 215.25]}, "gradients/decoder.transformer.h.14.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 1.0, 185.0, 831.0, 3.0], "bins": [-1059.2291259765625, -1042.181884765625, -1025.134521484375, -1008.0872802734375, -991.0400390625, -973.9927368164062, -956.9454956054688, -939.898193359375, -922.8509521484375, -905.8036499023438, -888.7564086914062, -871.7091064453125, -854.661865234375, -837.6145629882812, -820.5673217773438, -803.52001953125, -786.4727783203125, -769.4254760742188, -752.3782348632812, -735.3309326171875, -718.28369140625, -701.2363891601562, -684.1891479492188, -667.141845703125, -650.0945434570312, -633.0472412109375, -616.0, -598.9526977539062, -581.9054565429688, -564.858154296875, -547.8109130859375, -530.7636108398438, -513.7164306640625, -496.6691589355469, -479.62188720703125, -462.5746154785156, -445.52734375, -428.4800720214844, -411.43280029296875, -394.385498046875, -377.3382263183594, -360.29095458984375, -343.2436828613281, -326.1964111328125, -309.1491394042969, -292.10186767578125, -275.0545654296875, -258.00732421875, -240.9600372314453, -223.9127655029297, -206.86549377441406, -189.81820678710938, -172.77093505859375, -155.72366333007812, -138.6763916015625, -121.62911987304688, -104.58185577392578, -87.53458404541016, -70.4873046875, -53.440032958984375, -36.39276123046875, -19.345489501953125, -2.2982101440429688, 14.749061584472656, 31.796335220336914]}, "gradients/decoder.transformer.h.14.ln_1.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 4.0, 4.0, 2.0, 8.0, 4.0, 4.0, 11.0, 4.0, 4.0, 20.0, 15.0, 18.0, 14.0, 21.0, 23.0, 24.0, 35.0, 32.0, 28.0, 33.0, 36.0, 34.0, 41.0, 51.0, 42.0, 40.0, 35.0, 43.0, 48.0, 44.0, 38.0, 28.0, 38.0, 28.0, 25.0, 20.0, 24.0, 21.0, 10.0, 16.0, 8.0, 10.0, 8.0, 10.0, 2.0, 2.0, 1.0, 1.0, 0.0, 2.0, 3.0, 0.0, 2.0], "bins": [-68.77639770507812, -66.82051086425781, -64.8646240234375, -62.90873718261719, -60.95284652709961, -58.9969596862793, -57.041072845458984, -55.08518600463867, -53.12929916381836, -51.17341232299805, -49.217525482177734, -47.261634826660156, -45.305747985839844, -43.34986114501953, -41.39397430419922, -39.438087463378906, -37.482200622558594, -35.52631378173828, -33.57042694091797, -31.614538192749023, -29.65865135192871, -27.702762603759766, -25.746875762939453, -23.79098892211914, -21.835098266601562, -19.87921142578125, -17.923322677612305, -15.967435836791992, -14.01154899597168, -12.05566120147705, -10.099773406982422, -8.14388656616211, -6.187999725341797, -4.232112407684326, -2.2762248516082764, -0.32033729553222656, 1.6355500221252441, 3.591437339782715, 5.547325134277344, 7.503211975097656, 9.459099769592285, 11.414987564086914, 13.370874404907227, 15.326762199401855, 17.282649993896484, 19.238536834716797, 21.19442367553711, 23.150310516357422, 25.106199264526367, 27.06208610534668, 29.017974853515625, 30.973861694335938, 32.92974853515625, 34.88563537597656, 36.841522216796875, 38.79740905761719, 40.753299713134766, 42.70918655395508, 44.66507339477539, 46.62096405029297, 48.57685089111328, 50.532737731933594, 52.488624572753906, 54.44451141357422, 56.40039825439453]}, "gradients/decoder.transformer.h.13.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 3.0, 1.0, 5.0, 6.0, 6.0, 9.0, 16.0, 17.0, 11.0, 20.0, 23.0, 20.0, 18.0, 23.0, 33.0, 30.0, 50.0, 49.0, 46.0, 48.0, 35.0, 48.0, 52.0, 47.0, 34.0, 36.0, 33.0, 50.0, 33.0, 26.0, 29.0, 23.0, 31.0, 15.0, 24.0, 12.0, 13.0, 5.0, 8.0, 9.0, 3.0, 3.0, 5.0, 3.0, 1.0, 2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-7.81640625, -7.57867431640625, -7.3409423828125, -7.10321044921875, -6.865478515625, -6.62774658203125, -6.3900146484375, -6.15228271484375, -5.91455078125, -5.67681884765625, -5.4390869140625, -5.20135498046875, -4.963623046875, -4.72589111328125, -4.4881591796875, -4.25042724609375, -4.0126953125, -3.77496337890625, -3.5372314453125, -3.29949951171875, -3.061767578125, -2.82403564453125, -2.5863037109375, -2.34857177734375, -2.11083984375, -1.87310791015625, -1.6353759765625, -1.39764404296875, -1.159912109375, -0.92218017578125, -0.6844482421875, -0.44671630859375, -0.208984375, 0.02874755859375, 0.2664794921875, 0.50421142578125, 0.741943359375, 0.97967529296875, 1.2174072265625, 1.45513916015625, 1.69287109375, 1.93060302734375, 2.1683349609375, 2.40606689453125, 2.643798828125, 2.88153076171875, 3.1192626953125, 3.35699462890625, 3.5947265625, 3.83245849609375, 4.0701904296875, 4.30792236328125, 4.545654296875, 4.78338623046875, 5.0211181640625, 5.25885009765625, 5.49658203125, 5.73431396484375, 5.9720458984375, 6.20977783203125, 6.447509765625, 6.68524169921875, 6.9229736328125, 7.16070556640625, 7.3984375]}, "gradients/decoder.transformer.h.13.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 2.0, 1.0, 0.0, 4.0, 7.0, 11.0, 11.0, 18.0, 10.0, 12.0, 21.0, 15.0, 29.0, 24.0, 26.0, 44.0, 24.0, 63.0, 143.0, 548.0, 3206.0, 105534.0, 3557091.0, 518952.0, 7129.0, 807.0, 192.0, 83.0, 46.0, 30.0, 33.0, 29.0, 22.0, 28.0, 17.0, 21.0, 7.0, 12.0, 12.0, 8.0, 4.0, 5.0, 3.0, 1.0, 6.0, 2.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 2.0], "bins": [-29.828125, -28.94189453125, -28.0556640625, -27.16943359375, -26.283203125, -25.39697265625, -24.5107421875, -23.62451171875, -22.73828125, -21.85205078125, -20.9658203125, -20.07958984375, -19.193359375, -18.30712890625, -17.4208984375, -16.53466796875, -15.6484375, -14.76220703125, -13.8759765625, -12.98974609375, -12.103515625, -11.21728515625, -10.3310546875, -9.44482421875, -8.55859375, -7.67236328125, -6.7861328125, -5.89990234375, -5.013671875, -4.12744140625, -3.2412109375, -2.35498046875, -1.46875, -0.58251953125, 0.3037109375, 1.18994140625, 2.076171875, 2.96240234375, 3.8486328125, 4.73486328125, 5.62109375, 6.50732421875, 7.3935546875, 8.27978515625, 9.166015625, 10.05224609375, 10.9384765625, 11.82470703125, 12.7109375, 13.59716796875, 14.4833984375, 15.36962890625, 16.255859375, 17.14208984375, 18.0283203125, 18.91455078125, 19.80078125, 20.68701171875, 21.5732421875, 22.45947265625, 23.345703125, 24.23193359375, 25.1181640625, 26.00439453125, 26.890625]}, "gradients/decoder.transformer.h.13.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 3.0, 0.0, 2.0, 2.0, 2.0, 3.0, 3.0, 2.0, 5.0, 12.0, 5.0, 12.0, 13.0, 17.0, 30.0, 23.0, 34.0, 43.0, 52.0, 64.0, 84.0, 111.0, 122.0, 151.0, 230.0, 258.0, 314.0, 347.0, 339.0, 364.0, 297.0, 249.0, 202.0, 143.0, 118.0, 93.0, 67.0, 57.0, 46.0, 40.0, 29.0, 23.0, 16.0, 17.0, 13.0, 4.0, 6.0, 5.0, 7.0, 2.0, 2.0, 1.0, 2.0, 2.0, 2.0, 0.0, 1.0], "bins": [-12.203125, -11.857177734375, -11.51123046875, -11.165283203125, -10.8193359375, -10.473388671875, -10.12744140625, -9.781494140625, -9.435546875, -9.089599609375, -8.74365234375, -8.397705078125, -8.0517578125, -7.705810546875, -7.35986328125, -7.013916015625, -6.66796875, -6.322021484375, -5.97607421875, -5.630126953125, -5.2841796875, -4.938232421875, -4.59228515625, -4.246337890625, -3.900390625, -3.554443359375, -3.20849609375, -2.862548828125, -2.5166015625, -2.170654296875, -1.82470703125, -1.478759765625, -1.1328125, -0.786865234375, -0.44091796875, -0.094970703125, 0.2509765625, 0.596923828125, 0.94287109375, 1.288818359375, 1.634765625, 1.980712890625, 2.32666015625, 2.672607421875, 3.0185546875, 3.364501953125, 3.71044921875, 4.056396484375, 4.40234375, 4.748291015625, 5.09423828125, 5.440185546875, 5.7861328125, 6.132080078125, 6.47802734375, 6.823974609375, 7.169921875, 7.515869140625, 7.86181640625, 8.207763671875, 8.5537109375, 8.899658203125, 9.24560546875, 9.591552734375, 9.9375]}, "gradients/decoder.transformer.h.13.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 4.0, 4.0, 1.0, 4.0, 7.0, 13.0, 8.0, 7.0, 15.0, 29.0, 16.0, 27.0, 42.0, 47.0, 51.0, 56.0, 71.0, 69.0, 89.0, 125.0, 154.0, 241.0, 293.0, 458.0, 1062.0, 22369.0, 4048622.0, 116749.0, 1632.0, 550.0, 319.0, 240.0, 180.0, 134.0, 129.0, 92.0, 84.0, 56.0, 47.0, 34.0, 35.0, 16.0, 23.0, 13.0, 15.0, 8.0, 12.0, 5.0, 10.0, 8.0, 7.0, 1.0, 6.0, 4.0, 1.0, 1.0, 0.0, 2.0, 1.0, 0.0, 4.0], "bins": [-53.90625, -52.08740234375, -50.2685546875, -48.44970703125, -46.630859375, -44.81201171875, -42.9931640625, -41.17431640625, -39.35546875, -37.53662109375, -35.7177734375, -33.89892578125, -32.080078125, -30.26123046875, -28.4423828125, -26.62353515625, -24.8046875, -22.98583984375, -21.1669921875, -19.34814453125, -17.529296875, -15.71044921875, -13.8916015625, -12.07275390625, -10.25390625, -8.43505859375, -6.6162109375, -4.79736328125, -2.978515625, -1.15966796875, 0.6591796875, 2.47802734375, 4.296875, 6.11572265625, 7.9345703125, 9.75341796875, 11.572265625, 13.39111328125, 15.2099609375, 17.02880859375, 18.84765625, 20.66650390625, 22.4853515625, 24.30419921875, 26.123046875, 27.94189453125, 29.7607421875, 31.57958984375, 33.3984375, 35.21728515625, 37.0361328125, 38.85498046875, 40.673828125, 42.49267578125, 44.3115234375, 46.13037109375, 47.94921875, 49.76806640625, 51.5869140625, 53.40576171875, 55.224609375, 57.04345703125, 58.8623046875, 60.68115234375, 62.5]}, "gradients/decoder.transformer.h.13.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 6.0, 28.0, 112.0, 221.0, 320.0, 213.0, 81.0, 27.0, 5.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-188.25103759765625, -184.0236053466797, -179.7961883544922, -175.56875610351562, -171.34133911132812, -167.11390686035156, -162.88648986816406, -158.6590576171875, -154.431640625, -150.20420837402344, -145.97679138183594, -141.74935913085938, -137.52194213867188, -133.2945098876953, -129.0670928955078, -124.83966064453125, -120.61223602294922, -116.38481140136719, -112.15738677978516, -107.92996215820312, -103.7025375366211, -99.47511291503906, -95.2476806640625, -91.020263671875, -86.79283142089844, -82.5654067993164, -78.33798217773438, -74.11055755615234, -69.88313293457031, -65.65570831298828, -61.428279876708984, -57.20085525512695, -52.973426818847656, -48.746002197265625, -44.518577575683594, -40.29115295410156, -36.06372833251953, -31.836301803588867, -27.608875274658203, -23.381450653076172, -19.15402603149414, -14.92660140991211, -10.699175834655762, -6.471750259399414, -2.244325637817383, 1.9830989837646484, 6.2105255126953125, 10.437950134277344, 14.665374755859375, 18.892799377441406, 23.120223999023438, 27.3476505279541, 31.575075149536133, 35.80249786376953, 40.02992630004883, 44.25735092163086, 48.48477554321289, 52.71220016479492, 56.93962478637695, 61.16705322265625, 65.39447784423828, 69.62190246582031, 73.84932708740234, 78.07675170898438, 82.3041763305664]}, "gradients/decoder.transformer.h.13.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 3.0, 3.0, 3.0, 1.0, 2.0, 2.0, 4.0, 14.0, 10.0, 8.0, 12.0, 11.0, 19.0, 23.0, 19.0, 30.0, 26.0, 24.0, 33.0, 26.0, 40.0, 37.0, 48.0, 38.0, 33.0, 37.0, 46.0, 36.0, 49.0, 44.0, 33.0, 32.0, 30.0, 26.0, 29.0, 26.0, 24.0, 25.0, 14.0, 18.0, 7.0, 14.0, 9.0, 7.0, 8.0, 5.0, 8.0, 2.0, 3.0, 5.0, 4.0, 2.0, 4.0, 2.0, 1.0, 1.0, 0.0, 0.0, 2.0], "bins": [-40.45892333984375, -39.16551208496094, -37.87209701538086, -36.57868576049805, -35.28527069091797, -33.991859436035156, -32.698448181152344, -31.4050350189209, -30.111621856689453, -28.818208694458008, -27.524795532226562, -26.23138427734375, -24.937971115112305, -23.64455795288086, -22.351146697998047, -21.0577335357666, -19.764320373535156, -18.47090721130371, -17.177494049072266, -15.884082794189453, -14.590669631958008, -13.297256469726562, -12.003844261169434, -10.710432052612305, -9.41701889038086, -8.123605728149414, -6.830193519592285, -5.536780834197998, -4.243368148803711, -2.949955463409424, -1.6565427780151367, -0.3631305694580078, 0.9302825927734375, 2.2236952781677246, 3.5171079635620117, 4.810520648956299, 6.103933334350586, 7.397346019744873, 8.69075870513916, 9.984170913696289, 11.277584075927734, 12.57099723815918, 13.864409446716309, 15.157821655273438, 16.451234817504883, 17.744647979736328, 19.03805923461914, 20.331472396850586, 21.62488555908203, 22.918298721313477, 24.211711883544922, 25.505123138427734, 26.79853630065918, 28.091949462890625, 29.385360717773438, 30.678773880004883, 31.972187042236328, 33.26559829711914, 34.55901336669922, 35.85242462158203, 37.145835876464844, 38.43925094604492, 39.732662200927734, 41.02607727050781, 42.319488525390625]}, "gradients/decoder.transformer.h.13.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 5.0, 3.0, 0.0, 7.0, 6.0, 9.0, 13.0, 9.0, 11.0, 22.0, 22.0, 24.0, 26.0, 33.0, 23.0, 38.0, 45.0, 53.0, 38.0, 43.0, 41.0, 42.0, 45.0, 53.0, 49.0, 41.0, 32.0, 40.0, 30.0, 36.0, 32.0, 31.0, 16.0, 12.0, 18.0, 15.0, 12.0, 12.0, 9.0, 4.0, 4.0, 4.0, 3.0, 3.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 0.0, 1.0], "bins": [-8.0234375, -7.78564453125, -7.5478515625, -7.31005859375, -7.072265625, -6.83447265625, -6.5966796875, -6.35888671875, -6.12109375, -5.88330078125, -5.6455078125, -5.40771484375, -5.169921875, -4.93212890625, -4.6943359375, -4.45654296875, -4.21875, -3.98095703125, -3.7431640625, -3.50537109375, -3.267578125, -3.02978515625, -2.7919921875, -2.55419921875, -2.31640625, -2.07861328125, -1.8408203125, -1.60302734375, -1.365234375, -1.12744140625, -0.8896484375, -0.65185546875, -0.4140625, -0.17626953125, 0.0615234375, 0.29931640625, 0.537109375, 0.77490234375, 1.0126953125, 1.25048828125, 1.48828125, 1.72607421875, 1.9638671875, 2.20166015625, 2.439453125, 2.67724609375, 2.9150390625, 3.15283203125, 3.390625, 3.62841796875, 3.8662109375, 4.10400390625, 4.341796875, 4.57958984375, 4.8173828125, 5.05517578125, 5.29296875, 5.53076171875, 5.7685546875, 6.00634765625, 6.244140625, 6.48193359375, 6.7197265625, 6.95751953125, 7.1953125]}, "gradients/decoder.transformer.h.13.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 4.0, 10.0, 5.0, 5.0, 14.0, 15.0, 26.0, 35.0, 72.0, 88.0, 126.0, 201.0, 307.0, 466.0, 792.0, 1099.0, 1751.0, 2807.0, 4401.0, 6935.0, 11404.0, 18178.0, 29990.0, 52122.0, 96714.0, 225827.0, 324280.0, 118937.0, 61312.0, 34591.0, 21277.0, 12918.0, 7898.0, 5071.0, 3120.0, 2081.0, 1326.0, 791.0, 506.0, 357.0, 268.0, 151.0, 97.0, 51.0, 42.0, 39.0, 22.0, 12.0, 13.0, 5.0, 1.0, 5.0, 0.0, 1.0, 3.0, 0.0, 0.0, 2.0], "bins": [-1.8681640625, -1.8100433349609375, -1.751922607421875, -1.6938018798828125, -1.63568115234375, -1.5775604248046875, -1.519439697265625, -1.4613189697265625, -1.4031982421875, -1.3450775146484375, -1.286956787109375, -1.2288360595703125, -1.17071533203125, -1.1125946044921875, -1.054473876953125, -0.9963531494140625, -0.938232421875, -0.8801116943359375, -0.821990966796875, -0.7638702392578125, -0.70574951171875, -0.6476287841796875, -0.589508056640625, -0.5313873291015625, -0.4732666015625, -0.4151458740234375, -0.357025146484375, -0.2989044189453125, -0.24078369140625, -0.1826629638671875, -0.124542236328125, -0.0664215087890625, -0.00830078125, 0.0498199462890625, 0.107940673828125, 0.1660614013671875, 0.22418212890625, 0.2823028564453125, 0.340423583984375, 0.3985443115234375, 0.4566650390625, 0.5147857666015625, 0.572906494140625, 0.6310272216796875, 0.68914794921875, 0.7472686767578125, 0.805389404296875, 0.8635101318359375, 0.921630859375, 0.9797515869140625, 1.037872314453125, 1.0959930419921875, 1.15411376953125, 1.2122344970703125, 1.270355224609375, 1.3284759521484375, 1.3865966796875, 1.4447174072265625, 1.502838134765625, 1.5609588623046875, 1.61907958984375, 1.6772003173828125, 1.735321044921875, 1.7934417724609375, 1.8515625]}, "gradients/decoder.transformer.h.13.crossattention.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 3.0, 2.0, 1.0, 2.0, 3.0, 4.0, 3.0, 10.0, 7.0, 3.0, 7.0, 13.0, 10.0, 25.0, 23.0, 29.0, 23.0, 31.0, 37.0, 31.0, 46.0, 32.0, 47.0, 36.0, 36.0, 50.0, 1065.0, 35.0, 28.0, 37.0, 36.0, 34.0, 40.0, 23.0, 31.0, 29.0, 27.0, 25.0, 23.0, 18.0, 16.0, 9.0, 12.0, 7.0, 6.0, 9.0, 6.0, 5.0, 1.0, 3.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.0390625, -3.8953857421875, -3.751708984375, -3.6080322265625, -3.46435546875, -3.3206787109375, -3.177001953125, -3.0333251953125, -2.8896484375, -2.7459716796875, -2.602294921875, -2.4586181640625, -2.31494140625, -2.1712646484375, -2.027587890625, -1.8839111328125, -1.740234375, -1.5965576171875, -1.452880859375, -1.3092041015625, -1.16552734375, -1.0218505859375, -0.878173828125, -0.7344970703125, -0.5908203125, -0.4471435546875, -0.303466796875, -0.1597900390625, -0.01611328125, 0.1275634765625, 0.271240234375, 0.4149169921875, 0.55859375, 0.7022705078125, 0.845947265625, 0.9896240234375, 1.13330078125, 1.2769775390625, 1.420654296875, 1.5643310546875, 1.7080078125, 1.8516845703125, 1.995361328125, 2.1390380859375, 2.28271484375, 2.4263916015625, 2.570068359375, 2.7137451171875, 2.857421875, 3.0010986328125, 3.144775390625, 3.2884521484375, 3.43212890625, 3.5758056640625, 3.719482421875, 3.8631591796875, 4.0068359375, 4.1505126953125, 4.294189453125, 4.4378662109375, 4.58154296875, 4.7252197265625, 4.868896484375, 5.0125732421875, 5.15625]}, "gradients/decoder.transformer.h.13.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 2.0, 0.0, 4.0, 2.0, 4.0, 5.0, 16.0, 10.0, 16.0, 23.0, 31.0, 56.0, 107.0, 140.0, 229.0, 378.0, 657.0, 1117.0, 1951.0, 3670.0, 6652.0, 12626.0, 24722.0, 50147.0, 106831.0, 287786.0, 1377782.0, 113007.0, 53434.0, 26313.0, 13462.0, 7040.0, 3810.0, 2136.0, 1188.0, 747.0, 393.0, 233.0, 153.0, 96.0, 50.0, 34.0, 22.0, 17.0, 17.0, 9.0, 7.0, 3.0, 2.0, 2.0, 2.0, 4.0, 0.0, 2.0], "bins": [-2.83203125, -2.753448486328125, -2.67486572265625, -2.596282958984375, -2.5177001953125, -2.439117431640625, -2.36053466796875, -2.281951904296875, -2.203369140625, -2.124786376953125, -2.04620361328125, -1.967620849609375, -1.8890380859375, -1.810455322265625, -1.73187255859375, -1.653289794921875, -1.57470703125, -1.496124267578125, -1.41754150390625, -1.338958740234375, -1.2603759765625, -1.181793212890625, -1.10321044921875, -1.024627685546875, -0.946044921875, -0.867462158203125, -0.78887939453125, -0.710296630859375, -0.6317138671875, -0.553131103515625, -0.47454833984375, -0.395965576171875, -0.3173828125, -0.238800048828125, -0.16021728515625, -0.081634521484375, -0.0030517578125, 0.075531005859375, 0.15411376953125, 0.232696533203125, 0.311279296875, 0.389862060546875, 0.46844482421875, 0.547027587890625, 0.6256103515625, 0.704193115234375, 0.78277587890625, 0.861358642578125, 0.93994140625, 1.018524169921875, 1.09710693359375, 1.175689697265625, 1.2542724609375, 1.332855224609375, 1.41143798828125, 1.490020751953125, 1.568603515625, 1.647186279296875, 1.72576904296875, 1.804351806640625, 1.8829345703125, 1.961517333984375, 2.04010009765625, 2.118682861328125, 2.197265625]}, "gradients/decoder.transformer.h.13.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 3.0, 2.0, 3.0, 1.0, 1.0, 2.0, 5.0, 9.0, 5.0, 6.0, 15.0, 12.0, 12.0, 11.0, 25.0, 36.0, 54.0, 62.0, 74.0, 94.0, 100.0, 116.0, 76.0, 71.0, 54.0, 36.0, 25.0, 21.0, 17.0, 11.0, 12.0, 5.0, 11.0, 10.0, 3.0, 4.0, 1.0, 4.0, 2.0, 4.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0012006759643554688, -0.0011558085680007935, -0.0011109411716461182, -0.0010660737752914429, -0.0010212063789367676, -0.0009763389825820923, -0.000931471586227417, -0.0008866041898727417, -0.0008417367935180664, -0.0007968693971633911, -0.0007520020008087158, -0.0007071346044540405, -0.0006622672080993652, -0.0006173998117446899, -0.0005725324153900146, -0.0005276650190353394, -0.00048279762268066406, -0.00043793022632598877, -0.0003930628299713135, -0.0003481954336166382, -0.0003033280372619629, -0.0002584606409072876, -0.0002135932445526123, -0.000168725848197937, -0.00012385845184326172, -7.899105548858643e-05, -3.412365913391113e-05, 1.074373722076416e-05, 5.561113357543945e-05, 0.00010047852993011475, 0.00014534592628479004, 0.00019021332263946533, 0.00023508071899414062, 0.0002799481153488159, 0.0003248155117034912, 0.0003696829080581665, 0.0004145503044128418, 0.0004594177007675171, 0.0005042850971221924, 0.0005491524934768677, 0.000594019889831543, 0.0006388872861862183, 0.0006837546825408936, 0.0007286220788955688, 0.0007734894752502441, 0.0008183568716049194, 0.0008632242679595947, 0.00090809166431427, 0.0009529590606689453, 0.0009978264570236206, 0.001042693853378296, 0.0010875612497329712, 0.0011324286460876465, 0.0011772960424423218, 0.001222163438796997, 0.0012670308351516724, 0.0013118982315063477, 0.001356765627861023, 0.0014016330242156982, 0.0014465004205703735, 0.0014913678169250488, 0.0015362352132797241, 0.0015811026096343994, 0.0016259700059890747, 0.00167083740234375]}, "gradients/decoder.transformer.h.13.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 2.0, 1.0, 2.0, 2.0, 3.0, 7.0, 4.0, 3.0, 9.0, 13.0, 8.0, 14.0, 18.0, 30.0, 43.0, 80.0, 126.0, 295.0, 1309.0, 1040783.0, 4962.0, 418.0, 166.0, 93.0, 54.0, 27.0, 26.0, 12.0, 11.0, 12.0, 7.0, 3.0, 6.0, 5.0, 4.0, 1.0, 1.0, 4.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.048919677734375, -0.047654151916503906, -0.04638862609863281, -0.04512310028076172, -0.043857574462890625, -0.04259204864501953, -0.04132652282714844, -0.040060997009277344, -0.03879547119140625, -0.037529945373535156, -0.03626441955566406, -0.03499889373779297, -0.033733367919921875, -0.03246784210205078, -0.031202316284179688, -0.029936790466308594, -0.0286712646484375, -0.027405738830566406, -0.026140213012695312, -0.02487468719482422, -0.023609161376953125, -0.02234363555908203, -0.021078109741210938, -0.019812583923339844, -0.01854705810546875, -0.017281532287597656, -0.016016006469726562, -0.014750480651855469, -0.013484954833984375, -0.012219429016113281, -0.010953903198242188, -0.009688377380371094, -0.0084228515625, -0.007157325744628906, -0.0058917999267578125, -0.004626274108886719, -0.003360748291015625, -0.0020952224731445312, -0.0008296966552734375, 0.00043582916259765625, 0.00170135498046875, 0.0029668807983398438, 0.0042324066162109375, 0.005497932434082031, 0.006763458251953125, 0.008028984069824219, 0.009294509887695312, 0.010560035705566406, 0.0118255615234375, 0.013091087341308594, 0.014356613159179688, 0.015622138977050781, 0.016887664794921875, 0.01815319061279297, 0.019418716430664062, 0.020684242248535156, 0.02194976806640625, 0.023215293884277344, 0.024480819702148438, 0.02574634552001953, 0.027011871337890625, 0.02827739715576172, 0.029542922973632812, 0.030808448791503906, 0.032073974609375]}, "gradients/decoder.transformer.h.13.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 25.0, 519.0, 449.0, 21.0, 5.0, 2.0], "bins": [-0.007656607311218977, -0.007528883405029774, -0.0074011594988405704, -0.007273435592651367, -0.007145711686462164, -0.007017987780272961, -0.006890263874083757, -0.006762539967894554, -0.006634816061705351, -0.006507092155516148, -0.006379368249326944, -0.006251644343137741, -0.006123920436948538, -0.0059961965307593346, -0.005868472624570131, -0.005740748718380928, -0.0056130243465304375, -0.005485300440341234, -0.005357576534152031, -0.005229852627962828, -0.005102128721773624, -0.004974404815584421, -0.004846680909395218, -0.004718957003206015, -0.004591233097016811, -0.004463509190827608, -0.004335785284638405, -0.004208061378449202, -0.004080337472259998, -0.003952613566070795, -0.003824889659881592, -0.0036971657536923885, -0.0035694423131644726, -0.0034417184069752693, -0.003313994500786066, -0.003186270594596863, -0.0030585466884076595, -0.0029308227822184563, -0.002803098876029253, -0.0026753749698400497, -0.002547650830820203, -0.0024199269246309996, -0.0022922030184417963, -0.002164479112252593, -0.0020367552060633898, -0.0019090312998741865, -0.0017813072772696614, -0.0016535833710804582, -0.0015258595813065767, -0.0013981356751173735, -0.0012704117689281702, -0.001142687862738967, -0.0010149639565497637, -0.0008872399921528995, -0.0007595160277560353, -0.0006317921215668321, -0.0005040681571699679, -0.00037634425098076463, -0.0002486203156877309, -0.00012089638039469719, 6.827525794506073e-06, 0.00013455143198370934, 0.0002622753963805735, 0.0003899993025697768, 0.00051772320875898]}, "gradients/decoder.transformer.h.13.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 3.0, 0.0, 0.0, 1.0, 2.0, 3.0, 3.0, 6.0, 8.0, 3.0, 15.0, 15.0, 16.0, 13.0, 21.0, 23.0, 23.0, 30.0, 34.0, 43.0, 28.0, 41.0, 48.0, 37.0, 48.0, 44.0, 48.0, 54.0, 48.0, 58.0, 45.0, 34.0, 37.0, 31.0, 22.0, 25.0, 18.0, 14.0, 19.0, 12.0, 10.0, 13.0, 4.0, 8.0, 3.0, 2.0, 0.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.0006244182586669922, -0.0006045559421181679, -0.0005846936255693436, -0.0005648313090205193, -0.000544968992471695, -0.0005251066759228706, -0.0005052443593740463, -0.000485382042825222, -0.0004655197262763977, -0.0004456574097275734, -0.0004257950931787491, -0.0004059327766299248, -0.00038607046008110046, -0.00036620814353227615, -0.00034634582698345184, -0.00032648351043462753, -0.0003066211938858032, -0.0002867588773369789, -0.0002668965607881546, -0.0002470342442393303, -0.00022717192769050598, -0.00020730961114168167, -0.00018744729459285736, -0.00016758497804403305, -0.00014772266149520874, -0.00012786034494638443, -0.00010799802839756012, -8.813571184873581e-05, -6.82733952999115e-05, -4.841107875108719e-05, -2.854876220226288e-05, -8.686445653438568e-06, 1.1175870895385742e-05, 3.103818744421005e-05, 5.090050399303436e-05, 7.076282054185867e-05, 9.062513709068298e-05, 0.0001104874536395073, 0.0001303497701883316, 0.00015021208673715591, 0.00017007440328598022, 0.00018993671983480453, 0.00020979903638362885, 0.00022966135293245316, 0.00024952366948127747, 0.0002693859860301018, 0.0002892483025789261, 0.0003091106191277504, 0.0003289729356765747, 0.000348835252225399, 0.00036869756877422333, 0.00038855988532304764, 0.00040842220187187195, 0.00042828451842069626, 0.00044814683496952057, 0.0004680091515183449, 0.0004878714680671692, 0.0005077337846159935, 0.0005275961011648178, 0.0005474584177136421, 0.0005673207342624664, 0.0005871830508112907, 0.000607045367360115, 0.0006269076839089394, 0.0006467700004577637]}, "gradients/decoder.transformer.h.13.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 5.0, 3.0, 0.0, 7.0, 6.0, 9.0, 13.0, 9.0, 11.0, 22.0, 22.0, 24.0, 26.0, 33.0, 23.0, 38.0, 45.0, 53.0, 38.0, 43.0, 41.0, 42.0, 45.0, 53.0, 49.0, 41.0, 32.0, 40.0, 30.0, 36.0, 32.0, 31.0, 16.0, 12.0, 18.0, 15.0, 12.0, 12.0, 9.0, 4.0, 4.0, 4.0, 3.0, 3.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 0.0, 1.0], "bins": [-8.0234375, -7.78564453125, -7.5478515625, -7.31005859375, -7.072265625, -6.83447265625, -6.5966796875, -6.35888671875, -6.12109375, -5.88330078125, -5.6455078125, -5.40771484375, -5.169921875, -4.93212890625, -4.6943359375, -4.45654296875, -4.21875, -3.98095703125, -3.7431640625, -3.50537109375, -3.267578125, -3.02978515625, -2.7919921875, -2.55419921875, -2.31640625, -2.07861328125, -1.8408203125, -1.60302734375, -1.365234375, -1.12744140625, -0.8896484375, -0.65185546875, -0.4140625, -0.17626953125, 0.0615234375, 0.29931640625, 0.537109375, 0.77490234375, 1.0126953125, 1.25048828125, 1.48828125, 1.72607421875, 1.9638671875, 2.20166015625, 2.439453125, 2.67724609375, 2.9150390625, 3.15283203125, 3.390625, 3.62841796875, 3.8662109375, 4.10400390625, 4.341796875, 4.57958984375, 4.8173828125, 5.05517578125, 5.29296875, 5.53076171875, 5.7685546875, 6.00634765625, 6.244140625, 6.48193359375, 6.7197265625, 6.95751953125, 7.1953125]}, "gradients/decoder.transformer.h.13.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 3.0, 5.0, 2.0, 8.0, 6.0, 7.0, 4.0, 13.0, 12.0, 19.0, 31.0, 38.0, 53.0, 74.0, 111.0, 140.0, 232.0, 298.0, 467.0, 693.0, 1075.0, 1778.0, 3091.0, 5594.0, 11042.0, 24225.0, 59263.0, 162070.0, 391689.0, 235828.0, 85681.0, 33545.0, 14720.0, 7067.0, 3741.0, 2148.0, 1262.0, 773.0, 545.0, 343.0, 241.0, 179.0, 115.0, 88.0, 73.0, 53.0, 22.0, 23.0, 19.0, 16.0, 20.0, 9.0, 7.0, 3.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 2.0], "bins": [-4.5234375, -4.38067626953125, -4.2379150390625, -4.09515380859375, -3.952392578125, -3.80963134765625, -3.6668701171875, -3.52410888671875, -3.38134765625, -3.23858642578125, -3.0958251953125, -2.95306396484375, -2.810302734375, -2.66754150390625, -2.5247802734375, -2.38201904296875, -2.2392578125, -2.09649658203125, -1.9537353515625, -1.81097412109375, -1.668212890625, -1.52545166015625, -1.3826904296875, -1.23992919921875, -1.09716796875, -0.95440673828125, -0.8116455078125, -0.66888427734375, -0.526123046875, -0.38336181640625, -0.2406005859375, -0.09783935546875, 0.044921875, 0.18768310546875, 0.3304443359375, 0.47320556640625, 0.615966796875, 0.75872802734375, 0.9014892578125, 1.04425048828125, 1.18701171875, 1.32977294921875, 1.4725341796875, 1.61529541015625, 1.758056640625, 1.90081787109375, 2.0435791015625, 2.18634033203125, 2.3291015625, 2.47186279296875, 2.6146240234375, 2.75738525390625, 2.900146484375, 3.04290771484375, 3.1856689453125, 3.32843017578125, 3.47119140625, 3.61395263671875, 3.7567138671875, 3.89947509765625, 4.042236328125, 4.18499755859375, 4.3277587890625, 4.47052001953125, 4.61328125]}, "gradients/decoder.transformer.h.13.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 0.0, 2.0, 1.0, 5.0, 4.0, 5.0, 6.0, 3.0, 9.0, 15.0, 26.0, 16.0, 27.0, 29.0, 20.0, 32.0, 33.0, 42.0, 45.0, 43.0, 73.0, 84.0, 322.0, 1643.0, 118.0, 61.0, 63.0, 57.0, 44.0, 35.0, 39.0, 25.0, 25.0, 23.0, 12.0, 18.0, 13.0, 9.0, 6.0, 8.0, 6.0, 2.0, 3.0, 2.0, 4.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 0.0, 1.0], "bins": [-23.625, -22.889892578125, -22.15478515625, -21.419677734375, -20.6845703125, -19.949462890625, -19.21435546875, -18.479248046875, -17.744140625, -17.009033203125, -16.27392578125, -15.538818359375, -14.8037109375, -14.068603515625, -13.33349609375, -12.598388671875, -11.86328125, -11.128173828125, -10.39306640625, -9.657958984375, -8.9228515625, -8.187744140625, -7.45263671875, -6.717529296875, -5.982421875, -5.247314453125, -4.51220703125, -3.777099609375, -3.0419921875, -2.306884765625, -1.57177734375, -0.836669921875, -0.1015625, 0.633544921875, 1.36865234375, 2.103759765625, 2.8388671875, 3.573974609375, 4.30908203125, 5.044189453125, 5.779296875, 6.514404296875, 7.24951171875, 7.984619140625, 8.7197265625, 9.454833984375, 10.18994140625, 10.925048828125, 11.66015625, 12.395263671875, 13.13037109375, 13.865478515625, 14.6005859375, 15.335693359375, 16.07080078125, 16.805908203125, 17.541015625, 18.276123046875, 19.01123046875, 19.746337890625, 20.4814453125, 21.216552734375, 21.95166015625, 22.686767578125, 23.421875]}, "gradients/decoder.transformer.h.13.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 2.0, 3.0, 3.0, 3.0, 4.0, 8.0, 13.0, 11.0, 13.0, 29.0, 35.0, 57.0, 79.0, 126.0, 209.0, 337.0, 578.0, 2094.0, 3100183.0, 39846.0, 896.0, 396.0, 243.0, 175.0, 119.0, 83.0, 61.0, 34.0, 24.0, 18.0, 15.0, 4.0, 5.0, 5.0, 3.0, 1.0, 1.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-66.375, -63.984375, -61.59375, -59.203125, -56.8125, -54.421875, -52.03125, -49.640625, -47.25, -44.859375, -42.46875, -40.078125, -37.6875, -35.296875, -32.90625, -30.515625, -28.125, -25.734375, -23.34375, -20.953125, -18.5625, -16.171875, -13.78125, -11.390625, -9.0, -6.609375, -4.21875, -1.828125, 0.5625, 2.953125, 5.34375, 7.734375, 10.125, 12.515625, 14.90625, 17.296875, 19.6875, 22.078125, 24.46875, 26.859375, 29.25, 31.640625, 34.03125, 36.421875, 38.8125, 41.203125, 43.59375, 45.984375, 48.375, 50.765625, 53.15625, 55.546875, 57.9375, 60.328125, 62.71875, 65.109375, 67.5, 69.890625, 72.28125, 74.671875, 77.0625, 79.453125, 81.84375, 84.234375, 86.625]}, "gradients/decoder.transformer.h.13.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 17.0, 917.0, 84.0, 2.0, 1.0], "bins": [-407.33349609375, -400.61187744140625, -393.8902282714844, -387.1686096191406, -380.4469909667969, -373.7253723144531, -367.00372314453125, -360.2821044921875, -353.56048583984375, -346.8388671875, -340.1172180175781, -333.3955993652344, -326.6739807128906, -319.9523620605469, -313.230712890625, -306.50909423828125, -299.7874755859375, -293.06585693359375, -286.3442077636719, -279.6225891113281, -272.9009704589844, -266.1793518066406, -259.45770263671875, -252.736083984375, -246.0144500732422, -239.29281616210938, -232.57119750976562, -225.8495635986328, -219.12794494628906, -212.40631103515625, -205.6846923828125, -198.9630584716797, -192.24142456054688, -185.51979064941406, -178.7981719970703, -172.0765380859375, -165.35491943359375, -158.63328552246094, -151.9116668701172, -145.19003295898438, -138.46841430664062, -131.7467803955078, -125.02516174316406, -118.30353546142578, -111.5819091796875, -104.86027526855469, -98.13865661621094, -91.41702270507812, -84.69540405273438, -77.9737777709961, -71.25215148925781, -64.53052520751953, -57.80889892578125, -51.0872688293457, -44.36564254760742, -37.64401626586914, -30.922388076782227, -24.200761795043945, -17.47913360595703, -10.75750732421875, -4.035881042480469, 2.6857471466064453, 9.407373428344727, 16.128999710083008, 22.85062599182129]}, "gradients/decoder.transformer.h.13.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 4.0, 1.0, 4.0, 3.0, 12.0, 10.0, 9.0, 6.0, 16.0, 13.0, 25.0, 23.0, 39.0, 28.0, 31.0, 39.0, 47.0, 50.0, 43.0, 51.0, 61.0, 54.0, 52.0, 36.0, 43.0, 47.0, 46.0, 42.0, 29.0, 36.0, 19.0, 22.0, 14.0, 15.0, 4.0, 10.0, 5.0, 5.0, 5.0, 3.0, 3.0, 3.0, 3.0, 1.0, 3.0, 1.0, 0.0, 2.0, 0.0, 0.0, 2.0], "bins": [-74.61872863769531, -72.40953063964844, -70.20033264160156, -67.99113464355469, -65.78193664550781, -63.57273864746094, -61.36354064941406, -59.15434265136719, -56.94514465332031, -54.73594665527344, -52.52674865722656, -50.31755065917969, -48.10835266113281, -45.89915466308594, -43.68995666503906, -41.48075866699219, -39.27156066894531, -37.06236267089844, -34.85316467285156, -32.64396667480469, -30.434768676757812, -28.225570678710938, -26.016372680664062, -23.807174682617188, -21.597976684570312, -19.388778686523438, -17.179580688476562, -14.970382690429688, -12.761184692382812, -10.551986694335938, -8.342788696289062, -6.1335906982421875, -3.9243850708007812, -1.7151870727539062, 0.49401092529296875, 2.7032089233398438, 4.912406921386719, 7.121604919433594, 9.330802917480469, 11.540000915527344, 13.749198913574219, 15.958396911621094, 18.16759490966797, 20.376792907714844, 22.58599090576172, 24.795188903808594, 27.00438690185547, 29.213584899902344, 31.42278289794922, 33.631980895996094, 35.84117889404297, 38.050376892089844, 40.25957489013672, 42.468772888183594, 44.67797088623047, 46.887168884277344, 49.09636688232422, 51.305564880371094, 53.51476287841797, 55.723960876464844, 57.93315887451172, 60.142356872558594, 62.35155487060547, 64.56075286865234, 66.76995086669922]}, "gradients/decoder.transformer.h.12.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 3.0, 5.0, 3.0, 5.0, 5.0, 9.0, 16.0, 11.0, 11.0, 21.0, 20.0, 25.0, 22.0, 27.0, 31.0, 31.0, 48.0, 42.0, 42.0, 37.0, 41.0, 43.0, 55.0, 39.0, 43.0, 48.0, 30.0, 40.0, 35.0, 32.0, 29.0, 30.0, 30.0, 20.0, 14.0, 12.0, 11.0, 16.0, 7.0, 9.0, 3.0, 5.0, 1.0, 3.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-8.1484375, -7.90924072265625, -7.6700439453125, -7.43084716796875, -7.191650390625, -6.95245361328125, -6.7132568359375, -6.47406005859375, -6.23486328125, -5.99566650390625, -5.7564697265625, -5.51727294921875, -5.278076171875, -5.03887939453125, -4.7996826171875, -4.56048583984375, -4.3212890625, -4.08209228515625, -3.8428955078125, -3.60369873046875, -3.364501953125, -3.12530517578125, -2.8861083984375, -2.64691162109375, -2.40771484375, -2.16851806640625, -1.9293212890625, -1.69012451171875, -1.450927734375, -1.21173095703125, -0.9725341796875, -0.73333740234375, -0.494140625, -0.25494384765625, -0.0157470703125, 0.22344970703125, 0.462646484375, 0.70184326171875, 0.9410400390625, 1.18023681640625, 1.41943359375, 1.65863037109375, 1.8978271484375, 2.13702392578125, 2.376220703125, 2.61541748046875, 2.8546142578125, 3.09381103515625, 3.3330078125, 3.57220458984375, 3.8114013671875, 4.05059814453125, 4.289794921875, 4.52899169921875, 4.7681884765625, 5.00738525390625, 5.24658203125, 5.48577880859375, 5.7249755859375, 5.96417236328125, 6.203369140625, 6.44256591796875, 6.6817626953125, 6.92095947265625, 7.16015625]}, "gradients/decoder.transformer.h.12.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 4.0, 1.0, 2.0, 7.0, 7.0, 11.0, 11.0, 8.0, 18.0, 29.0, 18.0, 28.0, 26.0, 42.0, 61.0, 104.0, 157.0, 344.0, 1105.0, 5928.0, 103319.0, 2431147.0, 1595914.0, 50734.0, 3811.0, 725.0, 274.0, 141.0, 63.0, 56.0, 37.0, 32.0, 26.0, 21.0, 18.0, 16.0, 12.0, 8.0, 6.0, 8.0, 5.0, 1.0, 4.0, 2.0, 2.0, 2.0, 1.0, 2.0, 1.0], "bins": [-23.375, -22.741455078125, -22.10791015625, -21.474365234375, -20.8408203125, -20.207275390625, -19.57373046875, -18.940185546875, -18.306640625, -17.673095703125, -17.03955078125, -16.406005859375, -15.7724609375, -15.138916015625, -14.50537109375, -13.871826171875, -13.23828125, -12.604736328125, -11.97119140625, -11.337646484375, -10.7041015625, -10.070556640625, -9.43701171875, -8.803466796875, -8.169921875, -7.536376953125, -6.90283203125, -6.269287109375, -5.6357421875, -5.002197265625, -4.36865234375, -3.735107421875, -3.1015625, -2.468017578125, -1.83447265625, -1.200927734375, -0.5673828125, 0.066162109375, 0.69970703125, 1.333251953125, 1.966796875, 2.600341796875, 3.23388671875, 3.867431640625, 4.5009765625, 5.134521484375, 5.76806640625, 6.401611328125, 7.03515625, 7.668701171875, 8.30224609375, 8.935791015625, 9.5693359375, 10.202880859375, 10.83642578125, 11.469970703125, 12.103515625, 12.737060546875, 13.37060546875, 14.004150390625, 14.6376953125, 15.271240234375, 15.90478515625, 16.538330078125, 17.171875]}, "gradients/decoder.transformer.h.12.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 2.0, 3.0, 4.0, 4.0, 0.0, 6.0, 10.0, 10.0, 12.0, 14.0, 12.0, 15.0, 35.0, 32.0, 61.0, 42.0, 74.0, 90.0, 118.0, 163.0, 197.0, 249.0, 311.0, 392.0, 409.0, 399.0, 330.0, 232.0, 209.0, 145.0, 106.0, 93.0, 74.0, 53.0, 37.0, 36.0, 19.0, 15.0, 17.0, 14.0, 9.0, 9.0, 9.0, 4.0, 4.0, 4.0, 4.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-9.953125, -9.5908203125, -9.228515625, -8.8662109375, -8.50390625, -8.1416015625, -7.779296875, -7.4169921875, -7.0546875, -6.6923828125, -6.330078125, -5.9677734375, -5.60546875, -5.2431640625, -4.880859375, -4.5185546875, -4.15625, -3.7939453125, -3.431640625, -3.0693359375, -2.70703125, -2.3447265625, -1.982421875, -1.6201171875, -1.2578125, -0.8955078125, -0.533203125, -0.1708984375, 0.19140625, 0.5537109375, 0.916015625, 1.2783203125, 1.640625, 2.0029296875, 2.365234375, 2.7275390625, 3.08984375, 3.4521484375, 3.814453125, 4.1767578125, 4.5390625, 4.9013671875, 5.263671875, 5.6259765625, 5.98828125, 6.3505859375, 6.712890625, 7.0751953125, 7.4375, 7.7998046875, 8.162109375, 8.5244140625, 8.88671875, 9.2490234375, 9.611328125, 9.9736328125, 10.3359375, 10.6982421875, 11.060546875, 11.4228515625, 11.78515625, 12.1474609375, 12.509765625, 12.8720703125, 13.234375]}, "gradients/decoder.transformer.h.12.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 3.0, 6.0, 8.0, 4.0, 5.0, 8.0, 14.0, 17.0, 22.0, 26.0, 35.0, 43.0, 61.0, 47.0, 87.0, 122.0, 143.0, 208.0, 313.0, 618.0, 2145.0, 3836362.0, 351291.0, 1127.0, 508.0, 279.0, 195.0, 151.0, 97.0, 77.0, 75.0, 45.0, 31.0, 26.0, 23.0, 11.0, 10.0, 17.0, 10.0, 7.0, 5.0, 5.0, 2.0, 2.0, 2.0, 1.0, 2.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-77.375, -74.783203125, -72.19140625, -69.599609375, -67.0078125, -64.416015625, -61.82421875, -59.232421875, -56.640625, -54.048828125, -51.45703125, -48.865234375, -46.2734375, -43.681640625, -41.08984375, -38.498046875, -35.90625, -33.314453125, -30.72265625, -28.130859375, -25.5390625, -22.947265625, -20.35546875, -17.763671875, -15.171875, -12.580078125, -9.98828125, -7.396484375, -4.8046875, -2.212890625, 0.37890625, 2.970703125, 5.5625, 8.154296875, 10.74609375, 13.337890625, 15.9296875, 18.521484375, 21.11328125, 23.705078125, 26.296875, 28.888671875, 31.48046875, 34.072265625, 36.6640625, 39.255859375, 41.84765625, 44.439453125, 47.03125, 49.623046875, 52.21484375, 54.806640625, 57.3984375, 59.990234375, 62.58203125, 65.173828125, 67.765625, 70.357421875, 72.94921875, 75.541015625, 78.1328125, 80.724609375, 83.31640625, 85.908203125, 88.5]}, "gradients/decoder.transformer.h.12.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 4.0, 8.0, 15.0, 17.0, 37.0, 49.0, 76.0, 128.0, 118.0, 149.0, 117.0, 115.0, 77.0, 47.0, 33.0, 14.0, 7.0, 1.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-63.00080871582031, -61.27339172363281, -59.54597473144531, -57.81855773925781, -56.09113693237305, -54.36371994018555, -52.63630294799805, -50.90888595581055, -49.18146514892578, -47.45404815673828, -45.72663116455078, -43.99921417236328, -42.271793365478516, -40.544376373291016, -38.816959381103516, -37.089542388916016, -35.362125396728516, -33.634708404541016, -31.907289505004883, -30.179872512817383, -28.45245361328125, -26.72503662109375, -24.99761962890625, -23.27020263671875, -21.542783737182617, -19.815366744995117, -18.087947845458984, -16.360530853271484, -14.633112907409668, -12.905694961547852, -11.178277969360352, -9.450860023498535, -7.723438262939453, -5.996020317077637, -4.2686028480529785, -2.5411853790283203, -0.8137674331665039, 0.9136505126953125, 2.6410675048828125, 4.368485450744629, 6.095903396606445, 7.823321342468262, 9.550739288330078, 11.278156280517578, 13.005574226379395, 14.732992172241211, 16.46040916442871, 18.187828063964844, 19.915245056152344, 21.642662048339844, 23.370080947875977, 25.097497940063477, 26.82491683959961, 28.55233383178711, 30.27975082397461, 32.00716781616211, 33.734588623046875, 35.462005615234375, 37.189422607421875, 38.916839599609375, 40.64426040649414, 42.37167739868164, 44.09909439086914, 45.82651138305664, 47.55392837524414]}, "gradients/decoder.transformer.h.12.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 1.0, 6.0, 5.0, 6.0, 3.0, 11.0, 11.0, 16.0, 27.0, 11.0, 18.0, 23.0, 22.0, 38.0, 33.0, 37.0, 49.0, 51.0, 51.0, 31.0, 49.0, 38.0, 44.0, 54.0, 48.0, 26.0, 30.0, 45.0, 27.0, 35.0, 32.0, 28.0, 15.0, 24.0, 18.0, 11.0, 13.0, 6.0, 4.0, 3.0, 5.0, 4.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-42.092201232910156, -40.60743713378906, -39.1226692199707, -37.63790512084961, -36.15313720703125, -34.668373107910156, -33.18360900878906, -31.698841094970703, -30.214075088500977, -28.72930908203125, -27.244543075561523, -25.759777069091797, -24.275012969970703, -22.790245056152344, -21.30548095703125, -19.820714950561523, -18.335948944091797, -16.85118293762207, -15.366416931152344, -13.881651878356934, -12.396885871887207, -10.91211986541748, -9.42735481262207, -7.942588806152344, -6.457822799682617, -4.973056793212891, -3.4882912635803223, -2.003525733947754, -0.5187597274780273, 0.9660062789916992, 2.4507713317871094, 3.935537338256836, 5.420307159423828, 6.905073165893555, 8.389839172363281, 9.874604225158691, 11.359370231628418, 12.844136238098145, 14.328901290893555, 15.813667297363281, 17.298433303833008, 18.783199310302734, 20.26796531677246, 21.752731323242188, 23.23749542236328, 24.72226333618164, 26.207027435302734, 27.69179344177246, 29.176559448242188, 30.661325454711914, 32.14609146118164, 33.630855560302734, 35.115623474121094, 36.60038757324219, 38.08515167236328, 39.56991958618164, 41.0546875, 42.539451599121094, 44.02421951293945, 45.50898361206055, 46.993751525878906, 48.478515625, 49.963279724121094, 51.44804763793945, 52.93281173706055]}, "gradients/decoder.transformer.h.12.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 5.0, 7.0, 3.0, 2.0, 5.0, 12.0, 19.0, 19.0, 27.0, 26.0, 22.0, 16.0, 34.0, 41.0, 41.0, 41.0, 41.0, 54.0, 50.0, 50.0, 48.0, 49.0, 48.0, 49.0, 51.0, 33.0, 26.0, 33.0, 38.0, 24.0, 24.0, 13.0, 12.0, 12.0, 6.0, 9.0, 4.0, 8.0, 3.0, 3.0, 3.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.484375, -8.2255859375, -7.966796875, -7.7080078125, -7.44921875, -7.1904296875, -6.931640625, -6.6728515625, -6.4140625, -6.1552734375, -5.896484375, -5.6376953125, -5.37890625, -5.1201171875, -4.861328125, -4.6025390625, -4.34375, -4.0849609375, -3.826171875, -3.5673828125, -3.30859375, -3.0498046875, -2.791015625, -2.5322265625, -2.2734375, -2.0146484375, -1.755859375, -1.4970703125, -1.23828125, -0.9794921875, -0.720703125, -0.4619140625, -0.203125, 0.0556640625, 0.314453125, 0.5732421875, 0.83203125, 1.0908203125, 1.349609375, 1.6083984375, 1.8671875, 2.1259765625, 2.384765625, 2.6435546875, 2.90234375, 3.1611328125, 3.419921875, 3.6787109375, 3.9375, 4.1962890625, 4.455078125, 4.7138671875, 4.97265625, 5.2314453125, 5.490234375, 5.7490234375, 6.0078125, 6.2666015625, 6.525390625, 6.7841796875, 7.04296875, 7.3017578125, 7.560546875, 7.8193359375, 8.078125]}, "gradients/decoder.transformer.h.12.crossattention.c_proj.weight": {"_type": "histogram", "values": [4.0, 3.0, 3.0, 2.0, 5.0, 6.0, 8.0, 13.0, 14.0, 26.0, 41.0, 54.0, 58.0, 115.0, 160.0, 223.0, 297.0, 462.0, 683.0, 1035.0, 1637.0, 2384.0, 3597.0, 5552.0, 8625.0, 13161.0, 20467.0, 32768.0, 54628.0, 97105.0, 222580.0, 304729.0, 114048.0, 62182.0, 36791.0, 23008.0, 14701.0, 9430.0, 6112.0, 3899.0, 2633.0, 1729.0, 1167.0, 731.0, 518.0, 369.0, 241.0, 178.0, 110.0, 95.0, 57.0, 43.0, 23.0, 10.0, 22.0, 11.0, 6.0, 6.0, 2.0, 5.0, 2.0, 0.0, 0.0, 2.0], "bins": [-1.8271484375, -1.7684326171875, -1.709716796875, -1.6510009765625, -1.59228515625, -1.5335693359375, -1.474853515625, -1.4161376953125, -1.357421875, -1.2987060546875, -1.239990234375, -1.1812744140625, -1.12255859375, -1.0638427734375, -1.005126953125, -0.9464111328125, -0.8876953125, -0.8289794921875, -0.770263671875, -0.7115478515625, -0.65283203125, -0.5941162109375, -0.535400390625, -0.4766845703125, -0.41796875, -0.3592529296875, -0.300537109375, -0.2418212890625, -0.18310546875, -0.1243896484375, -0.065673828125, -0.0069580078125, 0.0517578125, 0.1104736328125, 0.169189453125, 0.2279052734375, 0.28662109375, 0.3453369140625, 0.404052734375, 0.4627685546875, 0.521484375, 0.5802001953125, 0.638916015625, 0.6976318359375, 0.75634765625, 0.8150634765625, 0.873779296875, 0.9324951171875, 0.9912109375, 1.0499267578125, 1.108642578125, 1.1673583984375, 1.22607421875, 1.2847900390625, 1.343505859375, 1.4022216796875, 1.4609375, 1.5196533203125, 1.578369140625, 1.6370849609375, 1.69580078125, 1.7545166015625, 1.813232421875, 1.8719482421875, 1.9306640625]}, "gradients/decoder.transformer.h.12.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 2.0, 3.0, 5.0, 6.0, 8.0, 8.0, 9.0, 7.0, 19.0, 16.0, 12.0, 18.0, 26.0, 27.0, 22.0, 22.0, 36.0, 48.0, 32.0, 38.0, 35.0, 36.0, 40.0, 40.0, 1058.0, 49.0, 32.0, 27.0, 36.0, 39.0, 30.0, 29.0, 25.0, 32.0, 28.0, 22.0, 20.0, 15.0, 13.0, 14.0, 13.0, 12.0, 7.0, 5.0, 4.0, 2.0, 3.0, 2.0, 1.0, 3.0, 2.0, 2.0, 1.0], "bins": [-4.8828125, -4.7437744140625, -4.604736328125, -4.4656982421875, -4.32666015625, -4.1876220703125, -4.048583984375, -3.9095458984375, -3.7705078125, -3.6314697265625, -3.492431640625, -3.3533935546875, -3.21435546875, -3.0753173828125, -2.936279296875, -2.7972412109375, -2.658203125, -2.5191650390625, -2.380126953125, -2.2410888671875, -2.10205078125, -1.9630126953125, -1.823974609375, -1.6849365234375, -1.5458984375, -1.4068603515625, -1.267822265625, -1.1287841796875, -0.98974609375, -0.8507080078125, -0.711669921875, -0.5726318359375, -0.43359375, -0.2945556640625, -0.155517578125, -0.0164794921875, 0.12255859375, 0.2615966796875, 0.400634765625, 0.5396728515625, 0.6787109375, 0.8177490234375, 0.956787109375, 1.0958251953125, 1.23486328125, 1.3739013671875, 1.512939453125, 1.6519775390625, 1.791015625, 1.9300537109375, 2.069091796875, 2.2081298828125, 2.34716796875, 2.4862060546875, 2.625244140625, 2.7642822265625, 2.9033203125, 3.0423583984375, 3.181396484375, 3.3204345703125, 3.45947265625, 3.5985107421875, 3.737548828125, 3.8765869140625, 4.015625]}, "gradients/decoder.transformer.h.12.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 1.0, 4.0, 2.0, 2.0, 2.0, 4.0, 3.0, 6.0, 6.0, 15.0, 29.0, 31.0, 48.0, 50.0, 98.0, 164.0, 278.0, 482.0, 826.0, 1337.0, 2475.0, 4267.0, 8017.0, 14574.0, 28096.0, 55175.0, 113030.0, 1380484.0, 275568.0, 104247.0, 51255.0, 25885.0, 13709.0, 7466.0, 4019.0, 2313.0, 1306.0, 756.0, 399.0, 233.0, 194.0, 99.0, 59.0, 47.0, 20.0, 16.0, 18.0, 8.0, 8.0, 6.0, 4.0, 3.0, 2.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-2.203125, -2.12701416015625, -2.0509033203125, -1.97479248046875, -1.898681640625, -1.82257080078125, -1.7464599609375, -1.67034912109375, -1.59423828125, -1.51812744140625, -1.4420166015625, -1.36590576171875, -1.289794921875, -1.21368408203125, -1.1375732421875, -1.06146240234375, -0.9853515625, -0.90924072265625, -0.8331298828125, -0.75701904296875, -0.680908203125, -0.60479736328125, -0.5286865234375, -0.45257568359375, -0.37646484375, -0.30035400390625, -0.2242431640625, -0.14813232421875, -0.072021484375, 0.00408935546875, 0.0802001953125, 0.15631103515625, 0.232421875, 0.30853271484375, 0.3846435546875, 0.46075439453125, 0.536865234375, 0.61297607421875, 0.6890869140625, 0.76519775390625, 0.84130859375, 0.91741943359375, 0.9935302734375, 1.06964111328125, 1.145751953125, 1.22186279296875, 1.2979736328125, 1.37408447265625, 1.4501953125, 1.52630615234375, 1.6024169921875, 1.67852783203125, 1.754638671875, 1.83074951171875, 1.9068603515625, 1.98297119140625, 2.05908203125, 2.13519287109375, 2.2113037109375, 2.28741455078125, 2.363525390625, 2.43963623046875, 2.5157470703125, 2.59185791015625, 2.66796875]}, "gradients/decoder.transformer.h.12.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 4.0, 1.0, 5.0, 2.0, 7.0, 6.0, 6.0, 5.0, 8.0, 13.0, 8.0, 21.0, 23.0, 28.0, 37.0, 45.0, 79.0, 82.0, 123.0, 102.0, 77.0, 60.0, 59.0, 36.0, 27.0, 24.0, 24.0, 20.0, 19.0, 6.0, 5.0, 9.0, 7.0, 7.0, 9.0, 6.0, 4.0, 1.0, 2.0, 2.0, 1.0, 1.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0017757415771484375, -0.001722082495689392, -0.0016684234142303467, -0.0016147643327713013, -0.0015611052513122559, -0.0015074461698532104, -0.001453787088394165, -0.0014001280069351196, -0.0013464689254760742, -0.0012928098440170288, -0.0012391507625579834, -0.001185491681098938, -0.0011318325996398926, -0.0010781735181808472, -0.0010245144367218018, -0.0009708553552627563, -0.0009171962738037109, -0.0008635371923446655, -0.0008098781108856201, -0.0007562190294265747, -0.0007025599479675293, -0.0006489008665084839, -0.0005952417850494385, -0.0005415827035903931, -0.00048792362213134766, -0.00043426454067230225, -0.00038060545921325684, -0.0003269463777542114, -0.000273287296295166, -0.0002196282148361206, -0.0001659691333770752, -0.00011231005191802979, -5.8650970458984375e-05, -4.991888999938965e-06, 4.8667192459106445e-05, 0.00010232627391815186, 0.00015598535537719727, 0.00020964443683624268, 0.0002633035182952881, 0.0003169625997543335, 0.0003706216812133789, 0.0004242807626724243, 0.0004779398441314697, 0.0005315989255905151, 0.0005852580070495605, 0.000638917088508606, 0.0006925761699676514, 0.0007462352514266968, 0.0007998943328857422, 0.0008535534143447876, 0.000907212495803833, 0.0009608715772628784, 0.0010145306587219238, 0.0010681897401809692, 0.0011218488216400146, 0.00117550790309906, 0.0012291669845581055, 0.0012828260660171509, 0.0013364851474761963, 0.0013901442289352417, 0.0014438033103942871, 0.0014974623918533325, 0.001551121473312378, 0.0016047805547714233, 0.0016584396362304688]}, "gradients/decoder.transformer.h.12.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 0.0, 1.0, 4.0, 0.0, 3.0, 5.0, 6.0, 11.0, 8.0, 6.0, 9.0, 8.0, 11.0, 22.0, 28.0, 37.0, 45.0, 62.0, 95.0, 214.0, 753.0, 783790.0, 262269.0, 671.0, 197.0, 90.0, 61.0, 37.0, 27.0, 22.0, 13.0, 14.0, 6.0, 9.0, 5.0, 6.0, 4.0, 5.0, 5.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.043914794921875, -0.042496681213378906, -0.04107856750488281, -0.03966045379638672, -0.038242340087890625, -0.03682422637939453, -0.03540611267089844, -0.033987998962402344, -0.03256988525390625, -0.031151771545410156, -0.029733657836914062, -0.02831554412841797, -0.026897430419921875, -0.02547931671142578, -0.024061203002929688, -0.022643089294433594, -0.0212249755859375, -0.019806861877441406, -0.018388748168945312, -0.01697063446044922, -0.015552520751953125, -0.014134407043457031, -0.012716293334960938, -0.011298179626464844, -0.00988006591796875, -0.008461952209472656, -0.0070438385009765625, -0.005625724792480469, -0.004207611083984375, -0.0027894973754882812, -0.0013713836669921875, 4.673004150390625e-05, 0.00146484375, 0.0028829574584960938, 0.0043010711669921875, 0.005719184875488281, 0.007137298583984375, 0.008555412292480469, 0.009973526000976562, 0.011391639709472656, 0.01280975341796875, 0.014227867126464844, 0.015645980834960938, 0.01706409454345703, 0.018482208251953125, 0.01990032196044922, 0.021318435668945312, 0.022736549377441406, 0.0241546630859375, 0.025572776794433594, 0.026990890502929688, 0.02840900421142578, 0.029827117919921875, 0.03124523162841797, 0.03266334533691406, 0.034081459045410156, 0.03549957275390625, 0.036917686462402344, 0.03833580017089844, 0.03975391387939453, 0.041172027587890625, 0.04259014129638672, 0.04400825500488281, 0.045426368713378906, 0.046844482421875]}, "gradients/decoder.transformer.h.12.ln_cross_attn.weight": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 1.0, 23.0, 228.0, 576.0, 169.0, 15.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0006457806448452175, -0.0005444050184451044, -0.00044302939204499125, -0.0003416537947487086, -0.0002402781683485955, -0.00013890257105231285, -3.7526944652199745e-05, 6.384868174791336e-05, 0.00016522430814802647, 0.00026659993454813957, 0.0003679755609482527, 0.0004693511582445353, 0.0005707268137484789, 0.0006721023819409311, 0.0007734780083410442, 0.0008748536347411573, 0.0009762292611412704, 0.0010776048293337226, 0.0011789804557338357, 0.0012803560821339488, 0.001381731708534062, 0.001483107334934175, 0.0015844829613342881, 0.0016858585877344012, 0.0017872342141345143, 0.0018886098405346274, 0.0019899853505194187, 0.002091360976919532, 0.002192736603319645, 0.002294112229719758, 0.002395487856119871, 0.0024968634825199842, 0.002598239341750741, 0.002699614968150854, 0.002800990594550967, 0.0029023662209510803, 0.0030037418473511934, 0.0031051174737513065, 0.0032064931001514196, 0.0033078687265515327, 0.003409244352951646, 0.003510619979351759, 0.003611995605751872, 0.003713371232151985, 0.0038147468585520983, 0.003916122484952211, 0.004017497878521681, 0.004118873737752438, 0.004220249131321907, 0.0043216245248913765, 0.004423000384122133, 0.004524375777691603, 0.0046257516369223595, 0.004727127030491829, 0.004828502889722586, 0.004929878283292055, 0.005031254142522812, 0.005132629536092281, 0.005234005395323038, 0.0053353807888925076, 0.005436756648123264, 0.005538132041692734, 0.0056395079009234905, 0.00574088329449296, 0.005842259153723717]}, "gradients/decoder.transformer.h.12.ln_cross_attn.bias": {"_type": "histogram", "values": [3.0, 1.0, 1.0, 2.0, 1.0, 0.0, 2.0, 2.0, 3.0, 3.0, 1.0, 6.0, 6.0, 10.0, 11.0, 9.0, 12.0, 10.0, 15.0, 14.0, 22.0, 23.0, 21.0, 28.0, 21.0, 27.0, 36.0, 36.0, 30.0, 31.0, 42.0, 33.0, 39.0, 33.0, 40.0, 31.0, 39.0, 35.0, 36.0, 34.0, 23.0, 35.0, 28.0, 18.0, 14.0, 21.0, 17.0, 24.0, 17.0, 17.0, 6.0, 13.0, 7.0, 11.0, 3.0, 3.0, 6.0, 3.0, 1.0, 4.0, 1.0, 0.0, 2.0, 1.0], "bins": [-0.0006854534149169922, -0.0006651263684034348, -0.0006447993218898773, -0.0006244722753763199, -0.0006041452288627625, -0.000583818182349205, -0.0005634911358356476, -0.0005431640893220901, -0.0005228370428085327, -0.0005025099962949753, -0.00048218294978141785, -0.0004618559032678604, -0.000441528856754303, -0.00042120181024074554, -0.0004008747637271881, -0.0003805477172136307, -0.00036022067070007324, -0.0003398936241865158, -0.0003195665776729584, -0.00029923953115940094, -0.0002789124846458435, -0.00025858543813228607, -0.00023825839161872864, -0.0002179313451051712, -0.00019760429859161377, -0.00017727725207805634, -0.0001569502055644989, -0.00013662315905094147, -0.00011629611253738403, -9.59690660238266e-05, -7.564201951026917e-05, -5.531497299671173e-05, -3.49879264831543e-05, -1.4660879969596863e-05, 5.666166543960571e-06, 2.5993213057518005e-05, 4.632025957107544e-05, 6.664730608463287e-05, 8.697435259819031e-05, 0.00010730139911174774, 0.00012762844562530518, 0.0001479554921388626, 0.00016828253865242004, 0.00018860958516597748, 0.0002089366316795349, 0.00022926367819309235, 0.0002495907247066498, 0.0002699177712202072, 0.00029024481773376465, 0.0003105718642473221, 0.0003308989107608795, 0.00035122595727443695, 0.0003715530037879944, 0.0003918800503015518, 0.00041220709681510925, 0.0004325341433286667, 0.0004528611898422241, 0.00047318823635578156, 0.000493515282869339, 0.0005138423293828964, 0.0005341693758964539, 0.0005544964224100113, 0.0005748234689235687, 0.0005951505154371262, 0.0006154775619506836]}, "gradients/decoder.transformer.h.12.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 5.0, 7.0, 3.0, 2.0, 5.0, 12.0, 19.0, 19.0, 27.0, 26.0, 22.0, 16.0, 34.0, 41.0, 41.0, 41.0, 41.0, 54.0, 50.0, 50.0, 48.0, 49.0, 48.0, 49.0, 51.0, 33.0, 26.0, 33.0, 38.0, 24.0, 24.0, 13.0, 12.0, 12.0, 6.0, 9.0, 4.0, 8.0, 3.0, 3.0, 3.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.484375, -8.2255859375, -7.966796875, -7.7080078125, -7.44921875, -7.1904296875, -6.931640625, -6.6728515625, -6.4140625, -6.1552734375, -5.896484375, -5.6376953125, -5.37890625, -5.1201171875, -4.861328125, -4.6025390625, -4.34375, -4.0849609375, -3.826171875, -3.5673828125, -3.30859375, -3.0498046875, -2.791015625, -2.5322265625, -2.2734375, -2.0146484375, -1.755859375, -1.4970703125, -1.23828125, -0.9794921875, -0.720703125, -0.4619140625, -0.203125, 0.0556640625, 0.314453125, 0.5732421875, 0.83203125, 1.0908203125, 1.349609375, 1.6083984375, 1.8671875, 2.1259765625, 2.384765625, 2.6435546875, 2.90234375, 3.1611328125, 3.419921875, 3.6787109375, 3.9375, 4.1962890625, 4.455078125, 4.7138671875, 4.97265625, 5.2314453125, 5.490234375, 5.7490234375, 6.0078125, 6.2666015625, 6.525390625, 6.7841796875, 7.04296875, 7.3017578125, 7.560546875, 7.8193359375, 8.078125]}, "gradients/decoder.transformer.h.12.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 3.0, 3.0, 5.0, 8.0, 3.0, 4.0, 6.0, 18.0, 27.0, 27.0, 52.0, 62.0, 70.0, 112.0, 212.0, 349.0, 649.0, 1670.0, 4666.0, 15977.0, 83788.0, 600229.0, 283410.0, 41970.0, 9757.0, 2949.0, 1238.0, 526.0, 264.0, 159.0, 127.0, 61.0, 41.0, 32.0, 20.0, 17.0, 16.0, 10.0, 5.0, 12.0, 4.0, 3.0, 3.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.25, -7.99853515625, -7.7470703125, -7.49560546875, -7.244140625, -6.99267578125, -6.7412109375, -6.48974609375, -6.23828125, -5.98681640625, -5.7353515625, -5.48388671875, -5.232421875, -4.98095703125, -4.7294921875, -4.47802734375, -4.2265625, -3.97509765625, -3.7236328125, -3.47216796875, -3.220703125, -2.96923828125, -2.7177734375, -2.46630859375, -2.21484375, -1.96337890625, -1.7119140625, -1.46044921875, -1.208984375, -0.95751953125, -0.7060546875, -0.45458984375, -0.203125, 0.04833984375, 0.2998046875, 0.55126953125, 0.802734375, 1.05419921875, 1.3056640625, 1.55712890625, 1.80859375, 2.06005859375, 2.3115234375, 2.56298828125, 2.814453125, 3.06591796875, 3.3173828125, 3.56884765625, 3.8203125, 4.07177734375, 4.3232421875, 4.57470703125, 4.826171875, 5.07763671875, 5.3291015625, 5.58056640625, 5.83203125, 6.08349609375, 6.3349609375, 6.58642578125, 6.837890625, 7.08935546875, 7.3408203125, 7.59228515625, 7.84375]}, "gradients/decoder.transformer.h.12.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 3.0, 3.0, 0.0, 1.0, 1.0, 2.0, 6.0, 4.0, 9.0, 14.0, 11.0, 11.0, 19.0, 26.0, 27.0, 16.0, 30.0, 33.0, 30.0, 50.0, 41.0, 56.0, 215.0, 1828.0, 143.0, 53.0, 61.0, 53.0, 43.0, 36.0, 36.0, 29.0, 36.0, 23.0, 16.0, 23.0, 13.0, 9.0, 14.0, 11.0, 9.0, 5.0, 3.0, 2.0, 3.0, 1.0, 3.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-24.15625, -23.38671875, -22.6171875, -21.84765625, -21.078125, -20.30859375, -19.5390625, -18.76953125, -18.0, -17.23046875, -16.4609375, -15.69140625, -14.921875, -14.15234375, -13.3828125, -12.61328125, -11.84375, -11.07421875, -10.3046875, -9.53515625, -8.765625, -7.99609375, -7.2265625, -6.45703125, -5.6875, -4.91796875, -4.1484375, -3.37890625, -2.609375, -1.83984375, -1.0703125, -0.30078125, 0.46875, 1.23828125, 2.0078125, 2.77734375, 3.546875, 4.31640625, 5.0859375, 5.85546875, 6.625, 7.39453125, 8.1640625, 8.93359375, 9.703125, 10.47265625, 11.2421875, 12.01171875, 12.78125, 13.55078125, 14.3203125, 15.08984375, 15.859375, 16.62890625, 17.3984375, 18.16796875, 18.9375, 19.70703125, 20.4765625, 21.24609375, 22.015625, 22.78515625, 23.5546875, 24.32421875, 25.09375]}, "gradients/decoder.transformer.h.12.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 5.0, 4.0, 5.0, 5.0, 5.0, 9.0, 8.0, 19.0, 13.0, 19.0, 23.0, 30.0, 37.0, 65.0, 66.0, 104.0, 143.0, 214.0, 308.0, 547.0, 2195.0, 2601392.0, 537355.0, 1796.0, 480.0, 279.0, 167.0, 129.0, 66.0, 58.0, 43.0, 26.0, 24.0, 16.0, 18.0, 12.0, 7.0, 9.0, 3.0, 2.0, 1.0, 1.0, 6.0, 1.0, 1.0, 1.0, 3.0, 0.0, 2.0, 0.0, 1.0], "bins": [-64.6875, -62.8369140625, -60.986328125, -59.1357421875, -57.28515625, -55.4345703125, -53.583984375, -51.7333984375, -49.8828125, -48.0322265625, -46.181640625, -44.3310546875, -42.48046875, -40.6298828125, -38.779296875, -36.9287109375, -35.078125, -33.2275390625, -31.376953125, -29.5263671875, -27.67578125, -25.8251953125, -23.974609375, -22.1240234375, -20.2734375, -18.4228515625, -16.572265625, -14.7216796875, -12.87109375, -11.0205078125, -9.169921875, -7.3193359375, -5.46875, -3.6181640625, -1.767578125, 0.0830078125, 1.93359375, 3.7841796875, 5.634765625, 7.4853515625, 9.3359375, 11.1865234375, 13.037109375, 14.8876953125, 16.73828125, 18.5888671875, 20.439453125, 22.2900390625, 24.140625, 25.9912109375, 27.841796875, 29.6923828125, 31.54296875, 33.3935546875, 35.244140625, 37.0947265625, 38.9453125, 40.7958984375, 42.646484375, 44.4970703125, 46.34765625, 48.1982421875, 50.048828125, 51.8994140625, 53.75]}, "gradients/decoder.transformer.h.12.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 4.0, 154.0, 691.0, 166.0, 4.0, 0.0, 0.0, 3.0], "bins": [-175.4291534423828, -172.42959594726562, -169.43002319335938, -166.4304656982422, -163.430908203125, -160.4313507080078, -157.43179321289062, -154.43222045898438, -151.4326629638672, -148.43310546875, -145.43353271484375, -142.43397521972656, -139.43441772460938, -136.4348602294922, -133.435302734375, -130.43572998046875, -127.43617248535156, -124.43661499023438, -121.43704986572266, -118.43748474121094, -115.43792724609375, -112.43836975097656, -109.43880462646484, -106.43923950195312, -103.43968200683594, -100.44012451171875, -97.44055938720703, -94.44099426269531, -91.44143676757812, -88.44187927246094, -85.44231414794922, -82.4427490234375, -79.44319152832031, -76.44363403320312, -73.4440689086914, -70.44450378417969, -67.4449462890625, -64.44538879394531, -61.445823669433594, -58.44626235961914, -55.44670486450195, -52.4471435546875, -49.44758224487305, -46.448020935058594, -43.44845962524414, -40.44889831542969, -37.449337005615234, -34.44977569580078, -31.450214385986328, -28.450653076171875, -25.451091766357422, -22.45153045654297, -19.451969146728516, -16.452407836914062, -13.45284652709961, -10.453285217285156, -7.453723907470703, -4.45416259765625, -1.4546012878417969, 1.5449600219726562, 4.544521331787109, 7.5440826416015625, 10.543643951416016, 13.543205261230469, 16.542766571044922]}, "gradients/decoder.transformer.h.12.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 4.0, 4.0, 2.0, 4.0, 4.0, 7.0, 8.0, 11.0, 7.0, 13.0, 15.0, 19.0, 16.0, 22.0, 23.0, 24.0, 29.0, 28.0, 35.0, 33.0, 44.0, 40.0, 42.0, 39.0, 48.0, 38.0, 44.0, 40.0, 37.0, 38.0, 34.0, 51.0, 20.0, 28.0, 17.0, 19.0, 20.0, 16.0, 16.0, 15.0, 10.0, 11.0, 10.0, 5.0, 7.0, 6.0, 3.0, 6.0, 2.0, 2.0, 2.0, 1.0, 0.0, 1.0, 1.0, 1.0], "bins": [-54.37822341918945, -52.69472122192383, -51.01121520996094, -49.32771301269531, -47.64421081542969, -45.96070861816406, -44.27720642089844, -42.59370040893555, -40.91019821166992, -39.2266960144043, -37.543190002441406, -35.85968780517578, -34.176185607910156, -32.49268341064453, -30.809179306030273, -29.125675201416016, -27.44217300415039, -25.758670806884766, -24.075166702270508, -22.39166259765625, -20.708160400390625, -19.024658203125, -17.341154098510742, -15.6576509475708, -13.97414779663086, -12.290644645690918, -10.607141494750977, -8.923638343811035, -7.240135192871094, -5.556632041931152, -3.873128890991211, -2.1896257400512695, -0.5061264038085938, 1.1773767471313477, 2.860879898071289, 4.5443830490112305, 6.227886199951172, 7.911389350891113, 9.594892501831055, 11.278395652770996, 12.961898803710938, 14.645401954650879, 16.32890510559082, 18.012409210205078, 19.695911407470703, 21.379413604736328, 23.062917709350586, 24.746421813964844, 26.42992401123047, 28.113426208496094, 29.79693031311035, 31.48043441772461, 33.163936614990234, 34.84743881225586, 36.53094482421875, 38.214447021484375, 39.89794921875, 41.581451416015625, 43.26495361328125, 44.94845962524414, 46.631961822509766, 48.31546401977539, 49.99897003173828, 51.682472229003906, 53.36597442626953]}, "gradients/decoder.transformer.h.11.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 3.0, 0.0, 5.0, 2.0, 7.0, 4.0, 8.0, 9.0, 13.0, 21.0, 13.0, 36.0, 27.0, 18.0, 32.0, 27.0, 32.0, 46.0, 52.0, 45.0, 43.0, 59.0, 43.0, 41.0, 48.0, 52.0, 51.0, 38.0, 34.0, 28.0, 26.0, 35.0, 27.0, 15.0, 13.0, 17.0, 9.0, 7.0, 5.0, 10.0, 4.0, 4.0, 3.0, 2.0, 0.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.3671875, -8.1077880859375, -7.848388671875, -7.5889892578125, -7.32958984375, -7.0701904296875, -6.810791015625, -6.5513916015625, -6.2919921875, -6.0325927734375, -5.773193359375, -5.5137939453125, -5.25439453125, -4.9949951171875, -4.735595703125, -4.4761962890625, -4.216796875, -3.9573974609375, -3.697998046875, -3.4385986328125, -3.17919921875, -2.9197998046875, -2.660400390625, -2.4010009765625, -2.1416015625, -1.8822021484375, -1.622802734375, -1.3634033203125, -1.10400390625, -0.8446044921875, -0.585205078125, -0.3258056640625, -0.06640625, 0.1929931640625, 0.452392578125, 0.7117919921875, 0.97119140625, 1.2305908203125, 1.489990234375, 1.7493896484375, 2.0087890625, 2.2681884765625, 2.527587890625, 2.7869873046875, 3.04638671875, 3.3057861328125, 3.565185546875, 3.8245849609375, 4.083984375, 4.3433837890625, 4.602783203125, 4.8621826171875, 5.12158203125, 5.3809814453125, 5.640380859375, 5.8997802734375, 6.1591796875, 6.4185791015625, 6.677978515625, 6.9373779296875, 7.19677734375, 7.4561767578125, 7.715576171875, 7.9749755859375, 8.234375]}, "gradients/decoder.transformer.h.11.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 4.0, 5.0, 2.0, 5.0, 8.0, 10.0, 27.0, 22.0, 35.0, 43.0, 55.0, 64.0, 59.0, 101.0, 129.0, 211.0, 374.0, 964.0, 5285.0, 127266.0, 2834064.0, 1193684.0, 27977.0, 2337.0, 605.0, 274.0, 180.0, 120.0, 90.0, 48.0, 55.0, 45.0, 40.0, 22.0, 19.0, 12.0, 14.0, 10.0, 13.0, 4.0, 1.0, 2.0, 3.0, 2.0, 1.0, 3.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0], "bins": [-21.875, -21.2080078125, -20.541015625, -19.8740234375, -19.20703125, -18.5400390625, -17.873046875, -17.2060546875, -16.5390625, -15.8720703125, -15.205078125, -14.5380859375, -13.87109375, -13.2041015625, -12.537109375, -11.8701171875, -11.203125, -10.5361328125, -9.869140625, -9.2021484375, -8.53515625, -7.8681640625, -7.201171875, -6.5341796875, -5.8671875, -5.2001953125, -4.533203125, -3.8662109375, -3.19921875, -2.5322265625, -1.865234375, -1.1982421875, -0.53125, 0.1357421875, 0.802734375, 1.4697265625, 2.13671875, 2.8037109375, 3.470703125, 4.1376953125, 4.8046875, 5.4716796875, 6.138671875, 6.8056640625, 7.47265625, 8.1396484375, 8.806640625, 9.4736328125, 10.140625, 10.8076171875, 11.474609375, 12.1416015625, 12.80859375, 13.4755859375, 14.142578125, 14.8095703125, 15.4765625, 16.1435546875, 16.810546875, 17.4775390625, 18.14453125, 18.8115234375, 19.478515625, 20.1455078125, 20.8125]}, "gradients/decoder.transformer.h.11.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 3.0, 2.0, 1.0, 4.0, 3.0, 0.0, 1.0, 5.0, 1.0, 5.0, 10.0, 9.0, 16.0, 20.0, 22.0, 35.0, 38.0, 52.0, 72.0, 121.0, 164.0, 235.0, 293.0, 373.0, 476.0, 525.0, 443.0, 331.0, 227.0, 177.0, 108.0, 78.0, 61.0, 50.0, 35.0, 33.0, 15.0, 12.0, 8.0, 10.0, 5.0, 4.0, 1.0, 0.0, 4.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-16.375, -15.9442138671875, -15.513427734375, -15.0826416015625, -14.65185546875, -14.2210693359375, -13.790283203125, -13.3594970703125, -12.9287109375, -12.4979248046875, -12.067138671875, -11.6363525390625, -11.20556640625, -10.7747802734375, -10.343994140625, -9.9132080078125, -9.482421875, -9.0516357421875, -8.620849609375, -8.1900634765625, -7.75927734375, -7.3284912109375, -6.897705078125, -6.4669189453125, -6.0361328125, -5.6053466796875, -5.174560546875, -4.7437744140625, -4.31298828125, -3.8822021484375, -3.451416015625, -3.0206298828125, -2.58984375, -2.1590576171875, -1.728271484375, -1.2974853515625, -0.86669921875, -0.4359130859375, -0.005126953125, 0.4256591796875, 0.8564453125, 1.2872314453125, 1.718017578125, 2.1488037109375, 2.57958984375, 3.0103759765625, 3.441162109375, 3.8719482421875, 4.302734375, 4.7335205078125, 5.164306640625, 5.5950927734375, 6.02587890625, 6.4566650390625, 6.887451171875, 7.3182373046875, 7.7490234375, 8.1798095703125, 8.610595703125, 9.0413818359375, 9.47216796875, 9.9029541015625, 10.333740234375, 10.7645263671875, 11.1953125]}, "gradients/decoder.transformer.h.11.mlp.c_fc.weight": {"_type": "histogram", "values": [2.0, 2.0, 4.0, 3.0, 4.0, 9.0, 3.0, 7.0, 10.0, 13.0, 14.0, 18.0, 24.0, 39.0, 48.0, 59.0, 66.0, 80.0, 107.0, 139.0, 167.0, 230.0, 363.0, 545.0, 1711.0, 1212441.0, 2974188.0, 2061.0, 552.0, 345.0, 254.0, 176.0, 121.0, 98.0, 77.0, 58.0, 53.0, 36.0, 34.0, 36.0, 17.0, 23.0, 11.0, 9.0, 5.0, 14.0, 8.0, 1.0, 4.0, 0.0, 3.0, 3.0, 1.0, 1.0, 2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-59.84375, -57.54638671875, -55.2490234375, -52.95166015625, -50.654296875, -48.35693359375, -46.0595703125, -43.76220703125, -41.46484375, -39.16748046875, -36.8701171875, -34.57275390625, -32.275390625, -29.97802734375, -27.6806640625, -25.38330078125, -23.0859375, -20.78857421875, -18.4912109375, -16.19384765625, -13.896484375, -11.59912109375, -9.3017578125, -7.00439453125, -4.70703125, -2.40966796875, -0.1123046875, 2.18505859375, 4.482421875, 6.77978515625, 9.0771484375, 11.37451171875, 13.671875, 15.96923828125, 18.2666015625, 20.56396484375, 22.861328125, 25.15869140625, 27.4560546875, 29.75341796875, 32.05078125, 34.34814453125, 36.6455078125, 38.94287109375, 41.240234375, 43.53759765625, 45.8349609375, 48.13232421875, 50.4296875, 52.72705078125, 55.0244140625, 57.32177734375, 59.619140625, 61.91650390625, 64.2138671875, 66.51123046875, 68.80859375, 71.10595703125, 73.4033203125, 75.70068359375, 77.998046875, 80.29541015625, 82.5927734375, 84.89013671875, 87.1875]}, "gradients/decoder.transformer.h.11.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 8.0, 29.0, 131.0, 283.0, 320.0, 178.0, 56.0, 10.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-145.71664428710938, -141.57821655273438, -137.4397735595703, -133.3013458251953, -129.16290283203125, -125.02447509765625, -120.88603973388672, -116.74760437011719, -112.60916900634766, -108.47073364257812, -104.3322982788086, -100.19386291503906, -96.05543518066406, -91.9169921875, -87.778564453125, -83.64012908935547, -79.50169372558594, -75.3632583618164, -71.22482299804688, -67.08638763427734, -62.94795608520508, -58.80952072143555, -54.67108917236328, -50.53265380859375, -46.39421844482422, -42.25578308105469, -38.117347717285156, -33.97891616821289, -29.84048080444336, -25.702045440673828, -21.56361198425293, -17.42517852783203, -13.2867431640625, -9.148308753967285, -5.00987434387207, -0.8714399337768555, 3.2669944763183594, 7.405429840087891, 11.543863296508789, 15.682296752929688, 19.82073211669922, 23.95916748046875, 28.09760093688965, 32.23603439331055, 36.37446975708008, 40.51290512084961, 44.651336669921875, 48.789772033691406, 52.92820739746094, 57.06664276123047, 61.205078125, 65.34351348876953, 69.48194885253906, 73.62037658691406, 77.7588119506836, 81.89724731445312, 86.03568267822266, 90.17411804199219, 94.31255340576172, 98.45098876953125, 102.58941650390625, 106.72785949707031, 110.86628723144531, 115.00472259521484, 119.14315795898438]}, "gradients/decoder.transformer.h.11.ln_2.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 2.0, 1.0, 6.0, 1.0, 9.0, 6.0, 6.0, 18.0, 17.0, 22.0, 17.0, 27.0, 31.0, 35.0, 39.0, 38.0, 42.0, 50.0, 55.0, 74.0, 59.0, 44.0, 58.0, 53.0, 49.0, 44.0, 33.0, 40.0, 24.0, 31.0, 12.0, 15.0, 13.0, 11.0, 9.0, 6.0, 8.0, 7.0, 1.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-51.41609191894531, -49.62321472167969, -47.83034133911133, -46.03746795654297, -44.244590759277344, -42.45171356201172, -40.65884017944336, -38.865966796875, -37.073089599609375, -35.28021240234375, -33.48733901977539, -31.6944637298584, -29.901588439941406, -28.108713150024414, -26.315837860107422, -24.52296257019043, -22.730087280273438, -20.937211990356445, -19.144336700439453, -17.35146141052246, -15.558586120605469, -13.765710830688477, -11.972835540771484, -10.179960250854492, -8.3870849609375, -6.594209671020508, -4.801334381103516, -3.0084590911865234, -1.2155838012695312, 0.5772914886474609, 2.370166778564453, 4.163042068481445, 5.9559173583984375, 7.74879264831543, 9.541667938232422, 11.334543228149414, 13.127418518066406, 14.920293807983398, 16.71316909790039, 18.506044387817383, 20.298919677734375, 22.091794967651367, 23.88467025756836, 25.67754554748535, 27.470420837402344, 29.263296127319336, 31.056171417236328, 32.84904479980469, 34.64192199707031, 36.43479919433594, 38.2276725769043, 40.020545959472656, 41.81342315673828, 43.606300354003906, 45.399173736572266, 47.192047119140625, 48.98492431640625, 50.777801513671875, 52.570674896240234, 54.363548278808594, 56.15642547607422, 57.949302673339844, 59.7421760559082, 61.53504943847656, 63.32792663574219]}, "gradients/decoder.transformer.h.11.crossattention.c_proj.bias": {"_type": "histogram", "values": [3.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 5.0, 5.0, 4.0, 11.0, 3.0, 8.0, 5.0, 10.0, 15.0, 17.0, 19.0, 19.0, 15.0, 27.0, 39.0, 25.0, 34.0, 43.0, 38.0, 33.0, 51.0, 37.0, 39.0, 47.0, 41.0, 33.0, 45.0, 35.0, 41.0, 36.0, 24.0, 27.0, 23.0, 21.0, 21.0, 17.0, 22.0, 12.0, 14.0, 8.0, 7.0, 10.0, 5.0, 7.0, 3.0, 4.0, 0.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-6.5546875, -6.33599853515625, -6.1173095703125, -5.89862060546875, -5.679931640625, -5.46124267578125, -5.2425537109375, -5.02386474609375, -4.80517578125, -4.58648681640625, -4.3677978515625, -4.14910888671875, -3.930419921875, -3.71173095703125, -3.4930419921875, -3.27435302734375, -3.0556640625, -2.83697509765625, -2.6182861328125, -2.39959716796875, -2.180908203125, -1.96221923828125, -1.7435302734375, -1.52484130859375, -1.30615234375, -1.08746337890625, -0.8687744140625, -0.65008544921875, -0.431396484375, -0.21270751953125, 0.0059814453125, 0.22467041015625, 0.443359375, 0.66204833984375, 0.8807373046875, 1.09942626953125, 1.318115234375, 1.53680419921875, 1.7554931640625, 1.97418212890625, 2.19287109375, 2.41156005859375, 2.6302490234375, 2.84893798828125, 3.067626953125, 3.28631591796875, 3.5050048828125, 3.72369384765625, 3.9423828125, 4.16107177734375, 4.3797607421875, 4.59844970703125, 4.817138671875, 5.03582763671875, 5.2545166015625, 5.47320556640625, 5.69189453125, 5.91058349609375, 6.1292724609375, 6.34796142578125, 6.566650390625, 6.78533935546875, 7.0040283203125, 7.22271728515625, 7.44140625]}, "gradients/decoder.transformer.h.11.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 3.0, 4.0, 7.0, 4.0, 9.0, 10.0, 14.0, 18.0, 46.0, 43.0, 80.0, 105.0, 151.0, 196.0, 315.0, 448.0, 620.0, 969.0, 1428.0, 2090.0, 3255.0, 4961.0, 7739.0, 11893.0, 19160.0, 30174.0, 49927.0, 89638.0, 181778.0, 326659.0, 134311.0, 71036.0, 41240.0, 25091.0, 15858.0, 9999.0, 6555.0, 4263.0, 2793.0, 1844.0, 1232.0, 813.0, 560.0, 375.0, 255.0, 173.0, 126.0, 93.0, 63.0, 41.0, 39.0, 13.0, 18.0, 16.0, 6.0, 8.0, 2.0, 2.0, 2.0, 0.0, 1.0, 1.0], "bins": [-1.8115234375, -1.753662109375, -1.69580078125, -1.637939453125, -1.580078125, -1.522216796875, -1.46435546875, -1.406494140625, -1.3486328125, -1.290771484375, -1.23291015625, -1.175048828125, -1.1171875, -1.059326171875, -1.00146484375, -0.943603515625, -0.8857421875, -0.827880859375, -0.77001953125, -0.712158203125, -0.654296875, -0.596435546875, -0.53857421875, -0.480712890625, -0.4228515625, -0.364990234375, -0.30712890625, -0.249267578125, -0.19140625, -0.133544921875, -0.07568359375, -0.017822265625, 0.0400390625, 0.097900390625, 0.15576171875, 0.213623046875, 0.271484375, 0.329345703125, 0.38720703125, 0.445068359375, 0.5029296875, 0.560791015625, 0.61865234375, 0.676513671875, 0.734375, 0.792236328125, 0.85009765625, 0.907958984375, 0.9658203125, 1.023681640625, 1.08154296875, 1.139404296875, 1.197265625, 1.255126953125, 1.31298828125, 1.370849609375, 1.4287109375, 1.486572265625, 1.54443359375, 1.602294921875, 1.66015625, 1.718017578125, 1.77587890625, 1.833740234375, 1.8916015625]}, "gradients/decoder.transformer.h.11.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 2.0, 5.0, 7.0, 6.0, 2.0, 6.0, 9.0, 9.0, 10.0, 15.0, 14.0, 21.0, 15.0, 21.0, 11.0, 24.0, 23.0, 21.0, 37.0, 49.0, 34.0, 32.0, 35.0, 48.0, 40.0, 1064.0, 42.0, 38.0, 34.0, 38.0, 31.0, 24.0, 37.0, 37.0, 25.0, 19.0, 22.0, 30.0, 13.0, 11.0, 12.0, 9.0, 12.0, 9.0, 9.0, 5.0, 8.0, 3.0, 3.0, 3.0, 1.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0], "bins": [-4.40234375, -4.2669677734375, -4.131591796875, -3.9962158203125, -3.86083984375, -3.7254638671875, -3.590087890625, -3.4547119140625, -3.3193359375, -3.1839599609375, -3.048583984375, -2.9132080078125, -2.77783203125, -2.6424560546875, -2.507080078125, -2.3717041015625, -2.236328125, -2.1009521484375, -1.965576171875, -1.8302001953125, -1.69482421875, -1.5594482421875, -1.424072265625, -1.2886962890625, -1.1533203125, -1.0179443359375, -0.882568359375, -0.7471923828125, -0.61181640625, -0.4764404296875, -0.341064453125, -0.2056884765625, -0.0703125, 0.0650634765625, 0.200439453125, 0.3358154296875, 0.47119140625, 0.6065673828125, 0.741943359375, 0.8773193359375, 1.0126953125, 1.1480712890625, 1.283447265625, 1.4188232421875, 1.55419921875, 1.6895751953125, 1.824951171875, 1.9603271484375, 2.095703125, 2.2310791015625, 2.366455078125, 2.5018310546875, 2.63720703125, 2.7725830078125, 2.907958984375, 3.0433349609375, 3.1787109375, 3.3140869140625, 3.449462890625, 3.5848388671875, 3.72021484375, 3.8555908203125, 3.990966796875, 4.1263427734375, 4.26171875]}, "gradients/decoder.transformer.h.11.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 3.0, 1.0, 2.0, 2.0, 5.0, 4.0, 11.0, 12.0, 10.0, 19.0, 34.0, 44.0, 61.0, 100.0, 195.0, 314.0, 514.0, 832.0, 1470.0, 2601.0, 4339.0, 7537.0, 13446.0, 24670.0, 47065.0, 94677.0, 244192.0, 1415915.0, 116798.0, 56688.0, 28995.0, 15695.0, 8798.0, 5062.0, 2851.0, 1680.0, 1033.0, 581.0, 311.0, 215.0, 108.0, 87.0, 56.0, 33.0, 16.0, 21.0, 10.0, 7.0, 7.0, 4.0, 6.0, 5.0, 1.0, 1.0, 3.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.31640625, -2.24200439453125, -2.1676025390625, -2.09320068359375, -2.018798828125, -1.94439697265625, -1.8699951171875, -1.79559326171875, -1.72119140625, -1.64678955078125, -1.5723876953125, -1.49798583984375, -1.423583984375, -1.34918212890625, -1.2747802734375, -1.20037841796875, -1.1259765625, -1.05157470703125, -0.9771728515625, -0.90277099609375, -0.828369140625, -0.75396728515625, -0.6795654296875, -0.60516357421875, -0.53076171875, -0.45635986328125, -0.3819580078125, -0.30755615234375, -0.233154296875, -0.15875244140625, -0.0843505859375, -0.00994873046875, 0.064453125, 0.13885498046875, 0.2132568359375, 0.28765869140625, 0.362060546875, 0.43646240234375, 0.5108642578125, 0.58526611328125, 0.65966796875, 0.73406982421875, 0.8084716796875, 0.88287353515625, 0.957275390625, 1.03167724609375, 1.1060791015625, 1.18048095703125, 1.2548828125, 1.32928466796875, 1.4036865234375, 1.47808837890625, 1.552490234375, 1.62689208984375, 1.7012939453125, 1.77569580078125, 1.85009765625, 1.92449951171875, 1.9989013671875, 2.07330322265625, 2.147705078125, 2.22210693359375, 2.2965087890625, 2.37091064453125, 2.4453125]}, "gradients/decoder.transformer.h.11.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 2.0, 1.0, 3.0, 4.0, 6.0, 6.0, 5.0, 5.0, 8.0, 10.0, 10.0, 12.0, 16.0, 15.0, 19.0, 24.0, 23.0, 33.0, 50.0, 66.0, 66.0, 74.0, 84.0, 64.0, 73.0, 53.0, 54.0, 45.0, 31.0, 26.0, 24.0, 18.0, 19.0, 10.0, 9.0, 7.0, 7.0, 4.0, 3.0, 3.0, 8.0, 1.0, 3.0, 2.0, 1.0, 2.0, 3.0, 0.0, 1.0, 2.0, 0.0, 1.0, 2.0], "bins": [-0.0010089874267578125, -0.000979110598564148, -0.0009492337703704834, -0.0009193569421768188, -0.0008894801139831543, -0.0008596032857894897, -0.0008297264575958252, -0.0007998496294021606, -0.0007699728012084961, -0.0007400959730148315, -0.000710219144821167, -0.0006803423166275024, -0.0006504654884338379, -0.0006205886602401733, -0.0005907118320465088, -0.0005608350038528442, -0.0005309581756591797, -0.0005010813474655151, -0.0004712045192718506, -0.00044132769107818604, -0.0004114508628845215, -0.00038157403469085693, -0.0003516972064971924, -0.00032182037830352783, -0.0002919435501098633, -0.00026206672191619873, -0.00023218989372253418, -0.00020231306552886963, -0.00017243623733520508, -0.00014255940914154053, -0.00011268258094787598, -8.280575275421143e-05, -5.2928924560546875e-05, -2.3052096366882324e-05, 6.8247318267822266e-06, 3.670156002044678e-05, 6.657838821411133e-05, 9.645521640777588e-05, 0.00012633204460144043, 0.00015620887279510498, 0.00018608570098876953, 0.00021596252918243408, 0.00024583935737609863, 0.0002757161855697632, 0.00030559301376342773, 0.0003354698419570923, 0.00036534667015075684, 0.0003952234983444214, 0.00042510032653808594, 0.0004549771547317505, 0.00048485398292541504, 0.0005147308111190796, 0.0005446076393127441, 0.0005744844675064087, 0.0006043612957000732, 0.0006342381238937378, 0.0006641149520874023, 0.0006939917802810669, 0.0007238686084747314, 0.000753745436668396, 0.0007836222648620605, 0.0008134990930557251, 0.0008433759212493896, 0.0008732527494430542, 0.0009031295776367188]}, "gradients/decoder.transformer.h.11.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 3.0, 1.0, 0.0, 1.0, 1.0, 3.0, 3.0, 3.0, 2.0, 3.0, 3.0, 3.0, 3.0, 5.0, 6.0, 6.0, 11.0, 14.0, 18.0, 23.0, 44.0, 39.0, 56.0, 77.0, 142.0, 234.0, 568.0, 7345.0, 1034835.0, 3922.0, 522.0, 225.0, 139.0, 68.0, 47.0, 35.0, 32.0, 26.0, 11.0, 16.0, 15.0, 11.0, 12.0, 8.0, 7.0, 6.0, 1.0, 3.0, 5.0, 3.0, 3.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0], "bins": [-0.0240020751953125, -0.02326226234436035, -0.022522449493408203, -0.021782636642456055, -0.021042823791503906, -0.020303010940551758, -0.01956319808959961, -0.01882338523864746, -0.018083572387695312, -0.017343759536743164, -0.016603946685791016, -0.015864133834838867, -0.015124320983886719, -0.01438450813293457, -0.013644695281982422, -0.012904882431030273, -0.012165069580078125, -0.011425256729125977, -0.010685443878173828, -0.00994563102722168, -0.009205818176269531, -0.008466005325317383, -0.007726192474365234, -0.006986379623413086, -0.0062465667724609375, -0.005506753921508789, -0.004766941070556641, -0.004027128219604492, -0.0032873153686523438, -0.0025475025177001953, -0.0018076896667480469, -0.0010678768157958984, -0.00032806396484375, 0.00041174888610839844, 0.0011515617370605469, 0.0018913745880126953, 0.0026311874389648438, 0.003371000289916992, 0.004110813140869141, 0.004850625991821289, 0.0055904388427734375, 0.006330251693725586, 0.007070064544677734, 0.007809877395629883, 0.008549690246582031, 0.00928950309753418, 0.010029315948486328, 0.010769128799438477, 0.011508941650390625, 0.012248754501342773, 0.012988567352294922, 0.01372838020324707, 0.014468193054199219, 0.015208005905151367, 0.015947818756103516, 0.016687631607055664, 0.017427444458007812, 0.01816725730895996, 0.01890707015991211, 0.019646883010864258, 0.020386695861816406, 0.021126508712768555, 0.021866321563720703, 0.02260613441467285, 0.023345947265625]}, "gradients/decoder.transformer.h.11.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 6.0, 10.0, 22.0, 38.0, 61.0, 101.0, 123.0, 154.0, 123.0, 135.0, 100.0, 59.0, 37.0, 22.0, 13.0, 0.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.000758499139919877, -0.000737445370759815, -0.0007163916015997529, -0.0006953377742320299, -0.0006742840050719678, -0.0006532302359119058, -0.0006321764667518437, -0.0006111226975917816, -0.0005900689284317195, -0.0005690151592716575, -0.0005479613901115954, -0.0005269076209515333, -0.0005058537935838103, -0.00048480002442374825, -0.0004637462552636862, -0.0004426924861036241, -0.0004216386878397316, -0.0004005849186796695, -0.00037953112041577697, -0.0003584773512557149, -0.0003374235820956528, -0.00031636981293559074, -0.0002953160146716982, -0.00027426224551163614, -0.0002532084472477436, -0.0002321546635357663, -0.00021110089437570423, -0.00019004711066372693, -0.00016899334150366485, -0.00014793955779168755, -0.00012688577407971025, -0.00010583200491964817, -8.47782357595861e-05, -6.372445932356641e-05, -4.267067924956791e-05, -2.1616899175569415e-05, -5.631227395497262e-07, 2.0490653696469963e-05, 4.1544437408447266e-05, 6.259820656850934e-05, 8.365199028048664e-05, 0.00010470576671650633, 0.00012575954315252602, 0.00014681332686450332, 0.00016786711057648063, 0.0001889208797365427, 0.00020997466344852, 0.00023102843260858208, 0.0002520822163205594, 0.00027313598548062146, 0.000294189783744514, 0.00031524355290457606, 0.00033629732206463814, 0.0003573510912247002, 0.00037840488948859274, 0.0003994586586486548, 0.00042051245691254735, 0.0004415662260726094, 0.00046262002433650196, 0.00048367379349656403, 0.0005047275917604566, 0.0005257813609205186, 0.0005468351300805807, 0.0005678888992406428, 0.0005889426684007049]}, "gradients/decoder.transformer.h.11.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 2.0, 3.0, 5.0, 3.0, 4.0, 4.0, 2.0, 4.0, 10.0, 8.0, 11.0, 10.0, 23.0, 18.0, 17.0, 12.0, 20.0, 35.0, 34.0, 37.0, 36.0, 38.0, 41.0, 41.0, 34.0, 41.0, 39.0, 43.0, 51.0, 36.0, 43.0, 38.0, 33.0, 33.0, 31.0, 27.0, 17.0, 28.0, 23.0, 13.0, 13.0, 12.0, 9.0, 5.0, 6.0, 5.0, 9.0, 3.0, 2.0, 4.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.00041943788528442383, -0.0004041166976094246, -0.00038879550993442535, -0.0003734743222594261, -0.0003581531345844269, -0.00034283194690942764, -0.0003275107592344284, -0.00031218957155942917, -0.00029686838388442993, -0.0002815471962094307, -0.00026622600853443146, -0.0002509048208594322, -0.00023558363318443298, -0.00022026244550943375, -0.0002049412578344345, -0.00018962007015943527, -0.00017429888248443604, -0.0001589776948094368, -0.00014365650713443756, -0.00012833531945943832, -0.00011301413178443909, -9.769294410943985e-05, -8.237175643444061e-05, -6.705056875944138e-05, -5.172938108444214e-05, -3.64081934094429e-05, -2.1087005734443665e-05, -5.7658180594444275e-06, 9.55536961555481e-06, 2.4876557290554047e-05, 4.0197744965553284e-05, 5.551893264055252e-05, 7.084012031555176e-05, 8.6161307990551e-05, 0.00010148249566555023, 0.00011680368334054947, 0.0001321248710155487, 0.00014744605869054794, 0.00016276724636554718, 0.00017808843404054642, 0.00019340962171554565, 0.0002087308093905449, 0.00022405199706554413, 0.00023937318474054337, 0.0002546943724155426, 0.00027001556009054184, 0.0002853367477655411, 0.0003006579354405403, 0.00031597912311553955, 0.0003313003107905388, 0.000346621498465538, 0.00036194268614053726, 0.0003772638738155365, 0.00039258506149053574, 0.000407906249165535, 0.0004232274368405342, 0.00043854862451553345, 0.0004538698121905327, 0.0004691909998655319, 0.00048451218754053116, 0.0004998333752155304, 0.0005151545628905296, 0.0005304757505655289, 0.0005457969382405281, 0.0005611181259155273]}, "gradients/decoder.transformer.h.11.attn.c_proj.bias": {"_type": "histogram", "values": [3.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 5.0, 5.0, 4.0, 11.0, 3.0, 8.0, 5.0, 10.0, 15.0, 17.0, 19.0, 19.0, 15.0, 27.0, 39.0, 25.0, 34.0, 43.0, 38.0, 33.0, 51.0, 37.0, 39.0, 47.0, 41.0, 33.0, 45.0, 35.0, 41.0, 36.0, 24.0, 27.0, 23.0, 21.0, 21.0, 17.0, 22.0, 12.0, 14.0, 8.0, 7.0, 10.0, 5.0, 7.0, 3.0, 4.0, 0.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-6.5546875, -6.33599853515625, -6.1173095703125, -5.89862060546875, -5.679931640625, -5.46124267578125, -5.2425537109375, -5.02386474609375, -4.80517578125, -4.58648681640625, -4.3677978515625, -4.14910888671875, -3.930419921875, -3.71173095703125, -3.4930419921875, -3.27435302734375, -3.0556640625, -2.83697509765625, -2.6182861328125, -2.39959716796875, -2.180908203125, -1.96221923828125, -1.7435302734375, -1.52484130859375, -1.30615234375, -1.08746337890625, -0.8687744140625, -0.65008544921875, -0.431396484375, -0.21270751953125, 0.0059814453125, 0.22467041015625, 0.443359375, 0.66204833984375, 0.8807373046875, 1.09942626953125, 1.318115234375, 1.53680419921875, 1.7554931640625, 1.97418212890625, 2.19287109375, 2.41156005859375, 2.6302490234375, 2.84893798828125, 3.067626953125, 3.28631591796875, 3.5050048828125, 3.72369384765625, 3.9423828125, 4.16107177734375, 4.3797607421875, 4.59844970703125, 4.817138671875, 5.03582763671875, 5.2545166015625, 5.47320556640625, 5.69189453125, 5.91058349609375, 6.1292724609375, 6.34796142578125, 6.566650390625, 6.78533935546875, 7.0040283203125, 7.22271728515625, 7.44140625]}, "gradients/decoder.transformer.h.11.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 3.0, 1.0, 1.0, 5.0, 5.0, 8.0, 10.0, 20.0, 26.0, 43.0, 79.0, 103.0, 210.0, 326.0, 584.0, 1254.0, 3064.0, 8203.0, 25457.0, 105424.0, 528745.0, 292430.0, 56929.0, 15969.0, 5468.0, 2123.0, 956.0, 450.0, 266.0, 149.0, 104.0, 54.0, 29.0, 19.0, 19.0, 15.0, 7.0, 2.0, 5.0, 2.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-6.4296875, -6.21392822265625, -5.9981689453125, -5.78240966796875, -5.566650390625, -5.35089111328125, -5.1351318359375, -4.91937255859375, -4.70361328125, -4.48785400390625, -4.2720947265625, -4.05633544921875, -3.840576171875, -3.62481689453125, -3.4090576171875, -3.19329833984375, -2.9775390625, -2.76177978515625, -2.5460205078125, -2.33026123046875, -2.114501953125, -1.89874267578125, -1.6829833984375, -1.46722412109375, -1.25146484375, -1.03570556640625, -0.8199462890625, -0.60418701171875, -0.388427734375, -0.17266845703125, 0.0430908203125, 0.25885009765625, 0.474609375, 0.69036865234375, 0.9061279296875, 1.12188720703125, 1.337646484375, 1.55340576171875, 1.7691650390625, 1.98492431640625, 2.20068359375, 2.41644287109375, 2.6322021484375, 2.84796142578125, 3.063720703125, 3.27947998046875, 3.4952392578125, 3.71099853515625, 3.9267578125, 4.14251708984375, 4.3582763671875, 4.57403564453125, 4.789794921875, 5.00555419921875, 5.2213134765625, 5.43707275390625, 5.65283203125, 5.86859130859375, 6.0843505859375, 6.30010986328125, 6.515869140625, 6.73162841796875, 6.9473876953125, 7.16314697265625, 7.37890625]}, "gradients/decoder.transformer.h.11.attn.c_attn.bias": {"_type": "histogram", "values": [3.0, 0.0, 0.0, 2.0, 3.0, 0.0, 4.0, 6.0, 6.0, 6.0, 5.0, 15.0, 9.0, 17.0, 25.0, 23.0, 29.0, 19.0, 28.0, 44.0, 49.0, 34.0, 51.0, 39.0, 66.0, 125.0, 1657.0, 325.0, 89.0, 49.0, 44.0, 34.0, 29.0, 31.0, 39.0, 32.0, 33.0, 21.0, 16.0, 12.0, 13.0, 11.0, 5.0, 6.0, 8.0, 2.0, 2.0, 0.0, 2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-21.265625, -20.472412109375, -19.67919921875, -18.885986328125, -18.0927734375, -17.299560546875, -16.50634765625, -15.713134765625, -14.919921875, -14.126708984375, -13.33349609375, -12.540283203125, -11.7470703125, -10.953857421875, -10.16064453125, -9.367431640625, -8.57421875, -7.781005859375, -6.98779296875, -6.194580078125, -5.4013671875, -4.608154296875, -3.81494140625, -3.021728515625, -2.228515625, -1.435302734375, -0.64208984375, 0.151123046875, 0.9443359375, 1.737548828125, 2.53076171875, 3.323974609375, 4.1171875, 4.910400390625, 5.70361328125, 6.496826171875, 7.2900390625, 8.083251953125, 8.87646484375, 9.669677734375, 10.462890625, 11.256103515625, 12.04931640625, 12.842529296875, 13.6357421875, 14.428955078125, 15.22216796875, 16.015380859375, 16.80859375, 17.601806640625, 18.39501953125, 19.188232421875, 19.9814453125, 20.774658203125, 21.56787109375, 22.361083984375, 23.154296875, 23.947509765625, 24.74072265625, 25.533935546875, 26.3271484375, 27.120361328125, 27.91357421875, 28.706787109375, 29.5]}, "gradients/decoder.transformer.h.11.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 3.0, 0.0, 3.0, 1.0, 9.0, 6.0, 7.0, 14.0, 15.0, 13.0, 24.0, 26.0, 48.0, 61.0, 69.0, 94.0, 116.0, 157.0, 249.0, 375.0, 970.0, 35354.0, 3100479.0, 5764.0, 593.0, 326.0, 219.0, 144.0, 149.0, 119.0, 66.0, 41.0, 46.0, 40.0, 33.0, 23.0, 19.0, 12.0, 7.0, 8.0, 4.0, 3.0, 5.0, 2.0, 2.0, 3.0, 0.0, 1.0, 2.0], "bins": [-70.75, -68.8544921875, -66.958984375, -65.0634765625, -63.16796875, -61.2724609375, -59.376953125, -57.4814453125, -55.5859375, -53.6904296875, -51.794921875, -49.8994140625, -48.00390625, -46.1083984375, -44.212890625, -42.3173828125, -40.421875, -38.5263671875, -36.630859375, -34.7353515625, -32.83984375, -30.9443359375, -29.048828125, -27.1533203125, -25.2578125, -23.3623046875, -21.466796875, -19.5712890625, -17.67578125, -15.7802734375, -13.884765625, -11.9892578125, -10.09375, -8.1982421875, -6.302734375, -4.4072265625, -2.51171875, -0.6162109375, 1.279296875, 3.1748046875, 5.0703125, 6.9658203125, 8.861328125, 10.7568359375, 12.65234375, 14.5478515625, 16.443359375, 18.3388671875, 20.234375, 22.1298828125, 24.025390625, 25.9208984375, 27.81640625, 29.7119140625, 31.607421875, 33.5029296875, 35.3984375, 37.2939453125, 39.189453125, 41.0849609375, 42.98046875, 44.8759765625, 46.771484375, 48.6669921875, 50.5625]}, "gradients/decoder.transformer.h.11.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 3.0, 610.0, 403.0, 1.0, 2.0], "bins": [-400.2997131347656, -393.7281188964844, -387.15655517578125, -380.5849609375, -374.0133972167969, -367.4418029785156, -360.8702087402344, -354.29864501953125, -347.72705078125, -341.15545654296875, -334.5838928222656, -328.0122985839844, -321.44073486328125, -314.869140625, -308.29754638671875, -301.7259826660156, -295.1543884277344, -288.5827941894531, -282.01123046875, -275.43963623046875, -268.8680725097656, -262.2964782714844, -255.7248992919922, -249.1533203125, -242.5817413330078, -236.01016235351562, -229.43858337402344, -222.8669891357422, -216.29541015625, -209.7238311767578, -203.15225219726562, -196.58065795898438, -190.00906372070312, -183.43748474121094, -176.86590576171875, -170.2943115234375, -163.7227325439453, -157.15115356445312, -150.57957458496094, -144.00799560546875, -137.43641662597656, -130.86483764648438, -124.29325103759766, -117.72167205810547, -111.15008544921875, -104.57850646972656, -98.00692749023438, -91.43534088134766, -84.86375427246094, -78.29217529296875, -71.72058868408203, -65.14900970458984, -58.577423095703125, -52.00584411621094, -45.434261322021484, -38.86267852783203, -32.29109573364258, -25.719512939453125, -19.147930145263672, -12.576349258422852, -6.004766464233398, 0.5668144226074219, 7.138397216796875, 13.709980010986328, 20.28156280517578]}, "gradients/decoder.transformer.h.11.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 3.0, 3.0, 5.0, 5.0, 5.0, 11.0, 7.0, 11.0, 16.0, 19.0, 23.0, 29.0, 27.0, 33.0, 32.0, 42.0, 47.0, 58.0, 50.0, 52.0, 52.0, 38.0, 47.0, 47.0, 41.0, 42.0, 42.0, 40.0, 42.0, 25.0, 22.0, 20.0, 22.0, 17.0, 9.0, 10.0, 7.0, 5.0, 2.0, 2.0, 1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-64.42731475830078, -62.367454528808594, -60.307594299316406, -58.24773406982422, -56.18787384033203, -54.128013610839844, -52.06815719604492, -50.008296966552734, -47.94843673706055, -45.88857650756836, -43.82871627807617, -41.768856048583984, -39.70899963378906, -37.649139404296875, -35.58927917480469, -33.5294189453125, -31.469558715820312, -29.409698486328125, -27.349838256835938, -25.289979934692383, -23.230119705200195, -21.170259475708008, -19.110401153564453, -17.050540924072266, -14.990680694580078, -12.93082046508789, -10.87096118927002, -8.811101913452148, -6.751241683959961, -4.691381454467773, -2.6315221786499023, -0.5716629028320312, 1.488189697265625, 3.5480494499206543, 5.607909202575684, 7.667768955230713, 9.727628707885742, 11.78748893737793, 13.8473482131958, 15.907207489013672, 17.96706771850586, 20.026927947998047, 22.086788177490234, 24.14664649963379, 26.206506729125977, 28.266366958618164, 30.32622528076172, 32.386085510253906, 34.445945739746094, 36.50580596923828, 38.56566619873047, 40.625526428222656, 42.685386657714844, 44.74524688720703, 46.80510330200195, 48.86496353149414, 50.92482376098633, 52.984683990478516, 55.0445442199707, 57.10440444946289, 59.16426086425781, 61.22412109375, 63.28398132324219, 65.34384155273438, 67.40370178222656]}, "gradients/decoder.transformer.h.10.mlp.c_proj.bias": {"_type": "histogram", "values": [4.0, 2.0, 1.0, 0.0, 6.0, 2.0, 4.0, 5.0, 3.0, 6.0, 13.0, 7.0, 8.0, 10.0, 17.0, 17.0, 19.0, 14.0, 20.0, 26.0, 29.0, 26.0, 48.0, 34.0, 29.0, 34.0, 46.0, 32.0, 49.0, 38.0, 35.0, 30.0, 44.0, 47.0, 39.0, 27.0, 29.0, 26.0, 30.0, 19.0, 24.0, 22.0, 13.0, 16.0, 12.0, 9.0, 12.0, 6.0, 6.0, 9.0, 6.0, 2.0, 3.0, 0.0, 1.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-6.30078125, -6.08428955078125, -5.8677978515625, -5.65130615234375, -5.434814453125, -5.21832275390625, -5.0018310546875, -4.78533935546875, -4.56884765625, -4.35235595703125, -4.1358642578125, -3.91937255859375, -3.702880859375, -3.48638916015625, -3.2698974609375, -3.05340576171875, -2.8369140625, -2.62042236328125, -2.4039306640625, -2.18743896484375, -1.970947265625, -1.75445556640625, -1.5379638671875, -1.32147216796875, -1.10498046875, -0.88848876953125, -0.6719970703125, -0.45550537109375, -0.239013671875, -0.02252197265625, 0.1939697265625, 0.41046142578125, 0.626953125, 0.84344482421875, 1.0599365234375, 1.27642822265625, 1.492919921875, 1.70941162109375, 1.9259033203125, 2.14239501953125, 2.35888671875, 2.57537841796875, 2.7918701171875, 3.00836181640625, 3.224853515625, 3.44134521484375, 3.6578369140625, 3.87432861328125, 4.0908203125, 4.30731201171875, 4.5238037109375, 4.74029541015625, 4.956787109375, 5.17327880859375, 5.3897705078125, 5.60626220703125, 5.82275390625, 6.03924560546875, 6.2557373046875, 6.47222900390625, 6.688720703125, 6.90521240234375, 7.1217041015625, 7.33819580078125, 7.5546875]}, "gradients/decoder.transformer.h.10.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 2.0, 1.0, 1.0, 4.0, 2.0, 1.0, 10.0, 4.0, 8.0, 7.0, 14.0, 14.0, 15.0, 13.0, 17.0, 15.0, 24.0, 25.0, 30.0, 45.0, 86.0, 116.0, 261.0, 525.0, 1572.0, 10001.0, 437703.0, 3476968.0, 257089.0, 7354.0, 1243.0, 437.0, 227.0, 131.0, 82.0, 49.0, 31.0, 26.0, 28.0, 15.0, 15.0, 12.0, 21.0, 9.0, 8.0, 9.0, 5.0, 7.0, 2.0, 2.0, 4.0, 1.0, 1.0, 3.0, 3.0, 0.0, 0.0, 2.0, 0.0, 1.0, 1.0], "bins": [-22.234375, -21.50390625, -20.7734375, -20.04296875, -19.3125, -18.58203125, -17.8515625, -17.12109375, -16.390625, -15.66015625, -14.9296875, -14.19921875, -13.46875, -12.73828125, -12.0078125, -11.27734375, -10.546875, -9.81640625, -9.0859375, -8.35546875, -7.625, -6.89453125, -6.1640625, -5.43359375, -4.703125, -3.97265625, -3.2421875, -2.51171875, -1.78125, -1.05078125, -0.3203125, 0.41015625, 1.140625, 1.87109375, 2.6015625, 3.33203125, 4.0625, 4.79296875, 5.5234375, 6.25390625, 6.984375, 7.71484375, 8.4453125, 9.17578125, 9.90625, 10.63671875, 11.3671875, 12.09765625, 12.828125, 13.55859375, 14.2890625, 15.01953125, 15.75, 16.48046875, 17.2109375, 17.94140625, 18.671875, 19.40234375, 20.1328125, 20.86328125, 21.59375, 22.32421875, 23.0546875, 23.78515625, 24.515625]}, "gradients/decoder.transformer.h.10.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 2.0, 3.0, 3.0, 2.0, 3.0, 3.0, 5.0, 10.0, 9.0, 10.0, 13.0, 22.0, 31.0, 46.0, 54.0, 77.0, 91.0, 109.0, 201.0, 263.0, 360.0, 473.0, 500.0, 477.0, 376.0, 270.0, 176.0, 128.0, 89.0, 89.0, 54.0, 39.0, 31.0, 18.0, 15.0, 9.0, 6.0, 5.0, 5.0, 3.0, 1.0, 3.0, 2.0, 3.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-11.046875, -10.6142578125, -10.181640625, -9.7490234375, -9.31640625, -8.8837890625, -8.451171875, -8.0185546875, -7.5859375, -7.1533203125, -6.720703125, -6.2880859375, -5.85546875, -5.4228515625, -4.990234375, -4.5576171875, -4.125, -3.6923828125, -3.259765625, -2.8271484375, -2.39453125, -1.9619140625, -1.529296875, -1.0966796875, -0.6640625, -0.2314453125, 0.201171875, 0.6337890625, 1.06640625, 1.4990234375, 1.931640625, 2.3642578125, 2.796875, 3.2294921875, 3.662109375, 4.0947265625, 4.52734375, 4.9599609375, 5.392578125, 5.8251953125, 6.2578125, 6.6904296875, 7.123046875, 7.5556640625, 7.98828125, 8.4208984375, 8.853515625, 9.2861328125, 9.71875, 10.1513671875, 10.583984375, 11.0166015625, 11.44921875, 11.8818359375, 12.314453125, 12.7470703125, 13.1796875, 13.6123046875, 14.044921875, 14.4775390625, 14.91015625, 15.3427734375, 15.775390625, 16.2080078125, 16.640625]}, "gradients/decoder.transformer.h.10.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 1.0, 4.0, 6.0, 5.0, 11.0, 7.0, 13.0, 15.0, 25.0, 38.0, 39.0, 35.0, 54.0, 68.0, 78.0, 106.0, 149.0, 167.0, 226.0, 382.0, 659.0, 4620.0, 4106229.0, 78575.0, 1048.0, 498.0, 309.0, 194.0, 162.0, 125.0, 105.0, 62.0, 65.0, 47.0, 41.0, 28.0, 18.0, 21.0, 10.0, 15.0, 8.0, 13.0, 2.0, 4.0, 5.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-84.5625, -82.1259765625, -79.689453125, -77.2529296875, -74.81640625, -72.3798828125, -69.943359375, -67.5068359375, -65.0703125, -62.6337890625, -60.197265625, -57.7607421875, -55.32421875, -52.8876953125, -50.451171875, -48.0146484375, -45.578125, -43.1416015625, -40.705078125, -38.2685546875, -35.83203125, -33.3955078125, -30.958984375, -28.5224609375, -26.0859375, -23.6494140625, -21.212890625, -18.7763671875, -16.33984375, -13.9033203125, -11.466796875, -9.0302734375, -6.59375, -4.1572265625, -1.720703125, 0.7158203125, 3.15234375, 5.5888671875, 8.025390625, 10.4619140625, 12.8984375, 15.3349609375, 17.771484375, 20.2080078125, 22.64453125, 25.0810546875, 27.517578125, 29.9541015625, 32.390625, 34.8271484375, 37.263671875, 39.7001953125, 42.13671875, 44.5732421875, 47.009765625, 49.4462890625, 51.8828125, 54.3193359375, 56.755859375, 59.1923828125, 61.62890625, 64.0654296875, 66.501953125, 68.9384765625, 71.375]}, "gradients/decoder.transformer.h.10.ln_2.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 4.0, 5.0, 20.0, 45.0, 140.0, 247.0, 273.0, 178.0, 71.0, 26.0, 7.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-32.20957565307617, -29.051116943359375, -25.892656326293945, -22.734195709228516, -19.57573699951172, -16.417278289794922, -13.258817672729492, -10.100357055664062, -6.941898345947266, -3.7834386825561523, -0.6249790191650391, 2.533480644226074, 5.6919403076171875, 8.8503999710083, 12.008859634399414, 15.167320251464844, 18.32577896118164, 21.484237670898438, 24.642698287963867, 27.801158905029297, 30.959617614746094, 34.11807632446289, 37.27653503417969, 40.43499755859375, 43.59345626831055, 46.751914978027344, 49.910377502441406, 53.0688362121582, 56.227294921875, 59.3857536315918, 62.544212341308594, 65.70267486572266, 68.86112976074219, 72.01959228515625, 75.17804718017578, 78.33650970458984, 81.49496459960938, 84.65342712402344, 87.8118896484375, 90.97035217285156, 94.1288070678711, 97.28726959228516, 100.44572448730469, 103.60418701171875, 106.76264953613281, 109.92110443115234, 113.0795669555664, 116.23802185058594, 119.396484375, 122.55494689941406, 125.7134017944336, 128.87185668945312, 132.0303192138672, 135.18878173828125, 138.3472442626953, 141.50570678710938, 144.66415405273438, 147.82261657714844, 150.9810791015625, 154.1395263671875, 157.29798889160156, 160.45645141601562, 163.6149139404297, 166.77337646484375, 169.9318389892578]}, "gradients/decoder.transformer.h.10.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 3.0, 3.0, 2.0, 2.0, 3.0, 4.0, 3.0, 6.0, 11.0, 12.0, 14.0, 15.0, 13.0, 15.0, 20.0, 21.0, 28.0, 43.0, 35.0, 41.0, 40.0, 29.0, 35.0, 44.0, 38.0, 42.0, 45.0, 30.0, 24.0, 37.0, 38.0, 37.0, 29.0, 29.0, 27.0, 26.0, 23.0, 25.0, 20.0, 21.0, 19.0, 9.0, 10.0, 8.0, 9.0, 5.0, 4.0, 4.0, 3.0, 6.0, 3.0, 2.0, 1.0, 1.0, 0.0, 1.0], "bins": [-44.12298583984375, -42.827606201171875, -41.532222747802734, -40.236839294433594, -38.94145965576172, -37.646080017089844, -36.3506965637207, -35.05531311035156, -33.75993347167969, -32.46455383300781, -31.169170379638672, -29.873788833618164, -28.578407287597656, -27.28302574157715, -25.98764419555664, -24.692262649536133, -23.396881103515625, -22.101499557495117, -20.80611801147461, -19.5107364654541, -18.215354919433594, -16.919973373413086, -15.624591827392578, -14.32921028137207, -13.033828735351562, -11.738447189331055, -10.443065643310547, -9.147684097290039, -7.852302551269531, -6.556921005249023, -5.261539459228516, -3.966157913208008, -2.6707763671875, -1.3753948211669922, -0.08001327514648438, 1.2153682708740234, 2.5107498168945312, 3.806131362915039, 5.101512908935547, 6.396894454956055, 7.6922760009765625, 8.98765754699707, 10.283039093017578, 11.578420639038086, 12.873802185058594, 14.169183731079102, 15.46456527709961, 16.759946823120117, 18.055328369140625, 19.350709915161133, 20.64609146118164, 21.94147300720215, 23.236854553222656, 24.532236099243164, 25.827617645263672, 27.12299919128418, 28.418380737304688, 29.713762283325195, 31.009143829345703, 32.304527282714844, 33.59990692138672, 34.895286560058594, 36.190670013427734, 37.486053466796875, 38.78143310546875]}, "gradients/decoder.transformer.h.10.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 2.0, 3.0, 6.0, 4.0, 6.0, 4.0, 5.0, 6.0, 9.0, 10.0, 11.0, 9.0, 16.0, 17.0, 13.0, 29.0, 32.0, 36.0, 36.0, 45.0, 46.0, 40.0, 42.0, 35.0, 44.0, 46.0, 41.0, 41.0, 39.0, 42.0, 36.0, 25.0, 37.0, 16.0, 28.0, 27.0, 16.0, 26.0, 14.0, 17.0, 12.0, 7.0, 11.0, 12.0, 4.0, 3.0, 2.0, 2.0, 0.0, 2.0, 0.0, 2.0, 0.0, 0.0, 4.0], "bins": [-7.80859375, -7.57952880859375, -7.3504638671875, -7.12139892578125, -6.892333984375, -6.66326904296875, -6.4342041015625, -6.20513916015625, -5.97607421875, -5.74700927734375, -5.5179443359375, -5.28887939453125, -5.059814453125, -4.83074951171875, -4.6016845703125, -4.37261962890625, -4.1435546875, -3.91448974609375, -3.6854248046875, -3.45635986328125, -3.227294921875, -2.99822998046875, -2.7691650390625, -2.54010009765625, -2.31103515625, -2.08197021484375, -1.8529052734375, -1.62384033203125, -1.394775390625, -1.16571044921875, -0.9366455078125, -0.70758056640625, -0.478515625, -0.24945068359375, -0.0203857421875, 0.20867919921875, 0.437744140625, 0.66680908203125, 0.8958740234375, 1.12493896484375, 1.35400390625, 1.58306884765625, 1.8121337890625, 2.04119873046875, 2.270263671875, 2.49932861328125, 2.7283935546875, 2.95745849609375, 3.1865234375, 3.41558837890625, 3.6446533203125, 3.87371826171875, 4.102783203125, 4.33184814453125, 4.5609130859375, 4.78997802734375, 5.01904296875, 5.24810791015625, 5.4771728515625, 5.70623779296875, 5.935302734375, 6.16436767578125, 6.3934326171875, 6.62249755859375, 6.8515625]}, "gradients/decoder.transformer.h.10.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 6.0, 10.0, 5.0, 9.0, 13.0, 20.0, 35.0, 54.0, 66.0, 81.0, 141.0, 197.0, 264.0, 383.0, 545.0, 788.0, 1094.0, 1684.0, 2310.0, 3482.0, 4957.0, 7635.0, 11722.0, 18443.0, 29347.0, 48634.0, 83799.0, 160026.0, 335991.0, 139833.0, 75275.0, 44475.0, 27020.0, 17074.0, 11017.0, 7082.0, 4784.0, 3303.0, 2024.0, 1457.0, 1045.0, 724.0, 505.0, 336.0, 267.0, 165.0, 121.0, 100.0, 75.0, 44.0, 35.0, 23.0, 16.0, 12.0, 6.0, 5.0, 3.0, 1.0, 2.0, 2.0, 1.0], "bins": [-1.7197265625, -1.66497802734375, -1.6102294921875, -1.55548095703125, -1.500732421875, -1.44598388671875, -1.3912353515625, -1.33648681640625, -1.28173828125, -1.22698974609375, -1.1722412109375, -1.11749267578125, -1.062744140625, -1.00799560546875, -0.9532470703125, -0.89849853515625, -0.84375, -0.78900146484375, -0.7342529296875, -0.67950439453125, -0.624755859375, -0.57000732421875, -0.5152587890625, -0.46051025390625, -0.40576171875, -0.35101318359375, -0.2962646484375, -0.24151611328125, -0.186767578125, -0.13201904296875, -0.0772705078125, -0.02252197265625, 0.0322265625, 0.08697509765625, 0.1417236328125, 0.19647216796875, 0.251220703125, 0.30596923828125, 0.3607177734375, 0.41546630859375, 0.47021484375, 0.52496337890625, 0.5797119140625, 0.63446044921875, 0.689208984375, 0.74395751953125, 0.7987060546875, 0.85345458984375, 0.908203125, 0.96295166015625, 1.0177001953125, 1.07244873046875, 1.127197265625, 1.18194580078125, 1.2366943359375, 1.29144287109375, 1.34619140625, 1.40093994140625, 1.4556884765625, 1.51043701171875, 1.565185546875, 1.61993408203125, 1.6746826171875, 1.72943115234375, 1.7841796875]}, "gradients/decoder.transformer.h.10.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 5.0, 3.0, 4.0, 12.0, 6.0, 4.0, 9.0, 11.0, 12.0, 15.0, 9.0, 25.0, 37.0, 24.0, 36.0, 38.0, 33.0, 44.0, 56.0, 34.0, 46.0, 1068.0, 45.0, 39.0, 46.0, 51.0, 45.0, 27.0, 26.0, 32.0, 33.0, 24.0, 22.0, 23.0, 20.0, 11.0, 16.0, 10.0, 10.0, 8.0, 4.0, 4.0, 3.0, 4.0, 4.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.50390625, -4.3463134765625, -4.188720703125, -4.0311279296875, -3.87353515625, -3.7159423828125, -3.558349609375, -3.4007568359375, -3.2431640625, -3.0855712890625, -2.927978515625, -2.7703857421875, -2.61279296875, -2.4552001953125, -2.297607421875, -2.1400146484375, -1.982421875, -1.8248291015625, -1.667236328125, -1.5096435546875, -1.35205078125, -1.1944580078125, -1.036865234375, -0.8792724609375, -0.7216796875, -0.5640869140625, -0.406494140625, -0.2489013671875, -0.09130859375, 0.0662841796875, 0.223876953125, 0.3814697265625, 0.5390625, 0.6966552734375, 0.854248046875, 1.0118408203125, 1.16943359375, 1.3270263671875, 1.484619140625, 1.6422119140625, 1.7998046875, 1.9573974609375, 2.114990234375, 2.2725830078125, 2.43017578125, 2.5877685546875, 2.745361328125, 2.9029541015625, 3.060546875, 3.2181396484375, 3.375732421875, 3.5333251953125, 3.69091796875, 3.8485107421875, 4.006103515625, 4.1636962890625, 4.3212890625, 4.4788818359375, 4.636474609375, 4.7940673828125, 4.95166015625, 5.1092529296875, 5.266845703125, 5.4244384765625, 5.58203125]}, "gradients/decoder.transformer.h.10.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 0.0, 1.0, 4.0, 5.0, 2.0, 4.0, 9.0, 10.0, 19.0, 19.0, 37.0, 55.0, 85.0, 139.0, 203.0, 425.0, 682.0, 1234.0, 2266.0, 4169.0, 7856.0, 15075.0, 31167.0, 66230.0, 161641.0, 1493918.0, 173442.0, 70743.0, 33188.0, 16270.0, 8240.0, 4441.0, 2473.0, 1331.0, 726.0, 414.0, 234.0, 150.0, 86.0, 51.0, 26.0, 16.0, 17.0, 16.0, 12.0, 4.0, 3.0, 3.0, 1.0, 2.0, 1.0, 0.0, 1.0, 1.0], "bins": [-3.0625, -2.976318359375, -2.89013671875, -2.803955078125, -2.7177734375, -2.631591796875, -2.54541015625, -2.459228515625, -2.373046875, -2.286865234375, -2.20068359375, -2.114501953125, -2.0283203125, -1.942138671875, -1.85595703125, -1.769775390625, -1.68359375, -1.597412109375, -1.51123046875, -1.425048828125, -1.3388671875, -1.252685546875, -1.16650390625, -1.080322265625, -0.994140625, -0.907958984375, -0.82177734375, -0.735595703125, -0.6494140625, -0.563232421875, -0.47705078125, -0.390869140625, -0.3046875, -0.218505859375, -0.13232421875, -0.046142578125, 0.0400390625, 0.126220703125, 0.21240234375, 0.298583984375, 0.384765625, 0.470947265625, 0.55712890625, 0.643310546875, 0.7294921875, 0.815673828125, 0.90185546875, 0.988037109375, 1.07421875, 1.160400390625, 1.24658203125, 1.332763671875, 1.4189453125, 1.505126953125, 1.59130859375, 1.677490234375, 1.763671875, 1.849853515625, 1.93603515625, 2.022216796875, 2.1083984375, 2.194580078125, 2.28076171875, 2.366943359375, 2.453125]}, "gradients/decoder.transformer.h.10.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 4.0, 3.0, 4.0, 5.0, 7.0, 5.0, 4.0, 8.0, 9.0, 10.0, 5.0, 15.0, 14.0, 17.0, 17.0, 39.0, 51.0, 53.0, 46.0, 64.0, 72.0, 96.0, 77.0, 62.0, 49.0, 54.0, 42.0, 38.0, 18.0, 18.0, 16.0, 18.0, 17.0, 12.0, 9.0, 12.0, 6.0, 5.0, 4.0, 2.0, 2.0, 3.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.0014057159423828125, -0.0013637840747833252, -0.0013218522071838379, -0.0012799203395843506, -0.0012379884719848633, -0.001196056604385376, -0.0011541247367858887, -0.0011121928691864014, -0.001070261001586914, -0.0010283291339874268, -0.0009863972663879395, -0.0009444653987884521, -0.0009025335311889648, -0.0008606016635894775, -0.0008186697959899902, -0.0007767379283905029, -0.0007348060607910156, -0.0006928741931915283, -0.000650942325592041, -0.0006090104579925537, -0.0005670785903930664, -0.0005251467227935791, -0.0004832148551940918, -0.0004412829875946045, -0.0003993511199951172, -0.0003574192523956299, -0.0003154873847961426, -0.0002735555171966553, -0.00023162364959716797, -0.00018969178199768066, -0.00014775991439819336, -0.00010582804679870605, -6.389617919921875e-05, -2.1964311599731445e-05, 1.996755599975586e-05, 6.189942359924316e-05, 0.00010383129119873047, 0.00014576315879821777, 0.00018769502639770508, 0.00022962689399719238, 0.0002715587615966797, 0.000313490629196167, 0.0003554224967956543, 0.0003973543643951416, 0.0004392862319946289, 0.0004812180995941162, 0.0005231499671936035, 0.0005650818347930908, 0.0006070137023925781, 0.0006489455699920654, 0.0006908774375915527, 0.00073280930519104, 0.0007747411727905273, 0.0008166730403900146, 0.000858604907989502, 0.0009005367755889893, 0.0009424686431884766, 0.0009844005107879639, 0.0010263323783874512, 0.0010682642459869385, 0.0011101961135864258, 0.001152127981185913, 0.0011940598487854004, 0.0012359917163848877, 0.001277923583984375]}, "gradients/decoder.transformer.h.10.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 2.0, 0.0, 3.0, 2.0, 3.0, 3.0, 6.0, 6.0, 7.0, 8.0, 14.0, 21.0, 22.0, 23.0, 19.0, 33.0, 49.0, 53.0, 91.0, 152.0, 431.0, 7213.0, 1038501.0, 1174.0, 283.0, 132.0, 69.0, 47.0, 48.0, 30.0, 27.0, 10.0, 19.0, 12.0, 10.0, 8.0, 6.0, 5.0, 5.0, 3.0, 1.0, 6.0, 2.0, 4.0, 3.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0], "bins": [-0.037384033203125, -0.03626203536987305, -0.035140037536621094, -0.03401803970336914, -0.03289604187011719, -0.031774044036865234, -0.03065204620361328, -0.029530048370361328, -0.028408050537109375, -0.027286052703857422, -0.02616405487060547, -0.025042057037353516, -0.023920059204101562, -0.02279806137084961, -0.021676063537597656, -0.020554065704345703, -0.01943206787109375, -0.018310070037841797, -0.017188072204589844, -0.01606607437133789, -0.014944076538085938, -0.013822078704833984, -0.012700080871582031, -0.011578083038330078, -0.010456085205078125, -0.009334087371826172, -0.008212089538574219, -0.007090091705322266, -0.0059680938720703125, -0.004846096038818359, -0.0037240982055664062, -0.002602100372314453, -0.0014801025390625, -0.0003581047058105469, 0.0007638931274414062, 0.0018858909606933594, 0.0030078887939453125, 0.004129886627197266, 0.005251884460449219, 0.006373882293701172, 0.007495880126953125, 0.008617877960205078, 0.009739875793457031, 0.010861873626708984, 0.011983871459960938, 0.01310586929321289, 0.014227867126464844, 0.015349864959716797, 0.01647186279296875, 0.017593860626220703, 0.018715858459472656, 0.01983785629272461, 0.020959854125976562, 0.022081851959228516, 0.02320384979248047, 0.024325847625732422, 0.025447845458984375, 0.026569843292236328, 0.02769184112548828, 0.028813838958740234, 0.029935836791992188, 0.03105783462524414, 0.032179832458496094, 0.03330183029174805, 0.034423828125]}, "gradients/decoder.transformer.h.10.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 5.0, 83.0, 533.0, 356.0, 35.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.00460321269929409, -0.004502732772380114, -0.004402253311127424, -0.004301773384213448, -0.004201293922960758, -0.0041008139960467815, -0.004000334534794092, -0.0038998546078801155, -0.0037993749137967825, -0.0036988952197134495, -0.0035984155256301165, -0.0034979358315467834, -0.0033974561374634504, -0.0032969764433801174, -0.0031964965164661407, -0.0030960168223828077, -0.0029955371282994747, -0.0028950574342161417, -0.0027945777401328087, -0.0026940980460494757, -0.0025936183519661427, -0.002493138425052166, -0.0023926589637994766, -0.0022921790368855, -0.0021916995756328106, -0.0020912198815494776, -0.0019907401874661446, -0.0018902604933828115, -0.0017897806828841567, -0.0016893009888008237, -0.0015888212947174907, -0.0014883414842188358, -0.0013878619065508246, -0.0012873822124674916, -0.0011869025183841586, -0.0010864227078855038, -0.0009859430138021708, -0.0008854633197188377, -0.0007849836256355047, -0.0006845038733445108, -0.0005840241792611778, -0.0004835444560740143, -0.00038306473288685083, -0.0002825850388035178, -0.00018210531561635435, -8.162559242919087e-05, 1.885410165414214e-05, 0.00011933385394513607, 0.00021981354802846909, 0.00032029327121563256, 0.00042077299440279603, 0.000521252688486129, 0.000621732440777123, 0.000722212134860456, 0.000822691828943789, 0.0009231715812347829, 0.0010236513335257769, 0.0011241310276091099, 0.001224610721692443, 0.001325090415775776, 0.0014255702262744308, 0.0015260499203577638, 0.0016265296144410968, 0.0017270094249397516, 0.0018274890026077628]}, "gradients/decoder.transformer.h.10.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 0.0, 2.0, 1.0, 4.0, 0.0, 3.0, 2.0, 2.0, 10.0, 8.0, 15.0, 7.0, 17.0, 11.0, 21.0, 23.0, 32.0, 26.0, 23.0, 33.0, 38.0, 38.0, 32.0, 31.0, 49.0, 47.0, 45.0, 46.0, 47.0, 31.0, 52.0, 43.0, 25.0, 33.0, 28.0, 30.0, 29.0, 18.0, 21.0, 24.0, 10.0, 12.0, 6.0, 9.0, 11.0, 8.0, 4.0, 3.0, 1.0, 4.0, 0.0, 2.0, 1.0, 1.0], "bins": [-0.0007723569869995117, -0.0007513277232646942, -0.0007302984595298767, -0.0007092691957950592, -0.0006882399320602417, -0.0006672106683254242, -0.0006461814045906067, -0.0006251521408557892, -0.0006041228771209717, -0.0005830936133861542, -0.0005620643496513367, -0.0005410350859165192, -0.0005200058221817017, -0.0004989765584468842, -0.00047794729471206665, -0.00045691803097724915, -0.00043588876724243164, -0.00041485950350761414, -0.00039383023977279663, -0.0003728009760379791, -0.0003517717123031616, -0.0003307424485683441, -0.0003097131848335266, -0.0002886839210987091, -0.0002676546573638916, -0.0002466253936290741, -0.0002255961298942566, -0.0002045668661594391, -0.00018353760242462158, -0.00016250833868980408, -0.00014147907495498657, -0.00012044981122016907, -9.942054748535156e-05, -7.839128375053406e-05, -5.736202001571655e-05, -3.633275628089905e-05, -1.5303492546081543e-05, 5.725771188735962e-06, 2.6755034923553467e-05, 4.778429865837097e-05, 6.881356239318848e-05, 8.984282612800598e-05, 0.00011087208986282349, 0.000131901353597641, 0.0001529306173324585, 0.000173959881067276, 0.0001949891448020935, 0.000216018408536911, 0.00023704767227172852, 0.000258076936006546, 0.0002791061997413635, 0.00030013546347618103, 0.00032116472721099854, 0.00034219399094581604, 0.00036322325468063354, 0.00038425251841545105, 0.00040528178215026855, 0.00042631104588508606, 0.00044734030961990356, 0.00046836957335472107, 0.0004893988370895386, 0.0005104281008243561, 0.0005314573645591736, 0.0005524866282939911, 0.0005735158920288086]}, "gradients/decoder.transformer.h.10.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 2.0, 3.0, 6.0, 4.0, 6.0, 4.0, 5.0, 6.0, 9.0, 10.0, 11.0, 9.0, 16.0, 17.0, 13.0, 29.0, 32.0, 36.0, 36.0, 45.0, 46.0, 40.0, 42.0, 35.0, 44.0, 46.0, 41.0, 41.0, 39.0, 42.0, 36.0, 25.0, 37.0, 16.0, 28.0, 27.0, 16.0, 26.0, 14.0, 17.0, 12.0, 7.0, 11.0, 12.0, 4.0, 3.0, 2.0, 2.0, 0.0, 2.0, 0.0, 2.0, 0.0, 0.0, 4.0], "bins": [-7.80859375, -7.57952880859375, -7.3504638671875, -7.12139892578125, -6.892333984375, -6.66326904296875, -6.4342041015625, -6.20513916015625, -5.97607421875, -5.74700927734375, -5.5179443359375, -5.28887939453125, -5.059814453125, -4.83074951171875, -4.6016845703125, -4.37261962890625, -4.1435546875, -3.91448974609375, -3.6854248046875, -3.45635986328125, -3.227294921875, -2.99822998046875, -2.7691650390625, -2.54010009765625, -2.31103515625, -2.08197021484375, -1.8529052734375, -1.62384033203125, -1.394775390625, -1.16571044921875, -0.9366455078125, -0.70758056640625, -0.478515625, -0.24945068359375, -0.0203857421875, 0.20867919921875, 0.437744140625, 0.66680908203125, 0.8958740234375, 1.12493896484375, 1.35400390625, 1.58306884765625, 1.8121337890625, 2.04119873046875, 2.270263671875, 2.49932861328125, 2.7283935546875, 2.95745849609375, 3.1865234375, 3.41558837890625, 3.6446533203125, 3.87371826171875, 4.102783203125, 4.33184814453125, 4.5609130859375, 4.78997802734375, 5.01904296875, 5.24810791015625, 5.4771728515625, 5.70623779296875, 5.935302734375, 6.16436767578125, 6.3934326171875, 6.62249755859375, 6.8515625]}, "gradients/decoder.transformer.h.10.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 3.0, 1.0, 6.0, 6.0, 6.0, 6.0, 7.0, 9.0, 9.0, 23.0, 30.0, 35.0, 35.0, 45.0, 65.0, 103.0, 125.0, 220.0, 306.0, 557.0, 1169.0, 3273.0, 11806.0, 47920.0, 202458.0, 558627.0, 166315.0, 40211.0, 9825.0, 2867.0, 1047.0, 444.0, 265.0, 189.0, 138.0, 103.0, 70.0, 76.0, 39.0, 34.0, 19.0, 18.0, 12.0, 17.0, 9.0, 8.0, 1.0, 3.0, 1.0, 3.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0], "bins": [-8.46875, -8.2154541015625, -7.962158203125, -7.7088623046875, -7.45556640625, -7.2022705078125, -6.948974609375, -6.6956787109375, -6.4423828125, -6.1890869140625, -5.935791015625, -5.6824951171875, -5.42919921875, -5.1759033203125, -4.922607421875, -4.6693115234375, -4.416015625, -4.1627197265625, -3.909423828125, -3.6561279296875, -3.40283203125, -3.1495361328125, -2.896240234375, -2.6429443359375, -2.3896484375, -2.1363525390625, -1.883056640625, -1.6297607421875, -1.37646484375, -1.1231689453125, -0.869873046875, -0.6165771484375, -0.36328125, -0.1099853515625, 0.143310546875, 0.3966064453125, 0.64990234375, 0.9031982421875, 1.156494140625, 1.4097900390625, 1.6630859375, 1.9163818359375, 2.169677734375, 2.4229736328125, 2.67626953125, 2.9295654296875, 3.182861328125, 3.4361572265625, 3.689453125, 3.9427490234375, 4.196044921875, 4.4493408203125, 4.70263671875, 4.9559326171875, 5.209228515625, 5.4625244140625, 5.7158203125, 5.9691162109375, 6.222412109375, 6.4757080078125, 6.72900390625, 6.9822998046875, 7.235595703125, 7.4888916015625, 7.7421875]}, "gradients/decoder.transformer.h.10.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 5.0, 3.0, 2.0, 3.0, 3.0, 4.0, 8.0, 11.0, 10.0, 15.0, 11.0, 15.0, 18.0, 23.0, 34.0, 31.0, 30.0, 32.0, 40.0, 50.0, 47.0, 60.0, 141.0, 1727.0, 235.0, 77.0, 63.0, 53.0, 36.0, 37.0, 26.0, 31.0, 33.0, 23.0, 28.0, 14.0, 16.0, 17.0, 8.0, 8.0, 8.0, 8.0, 3.0, 6.0, 3.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 2.0, 1.0], "bins": [-25.015625, -24.2724609375, -23.529296875, -22.7861328125, -22.04296875, -21.2998046875, -20.556640625, -19.8134765625, -19.0703125, -18.3271484375, -17.583984375, -16.8408203125, -16.09765625, -15.3544921875, -14.611328125, -13.8681640625, -13.125, -12.3818359375, -11.638671875, -10.8955078125, -10.15234375, -9.4091796875, -8.666015625, -7.9228515625, -7.1796875, -6.4365234375, -5.693359375, -4.9501953125, -4.20703125, -3.4638671875, -2.720703125, -1.9775390625, -1.234375, -0.4912109375, 0.251953125, 0.9951171875, 1.73828125, 2.4814453125, 3.224609375, 3.9677734375, 4.7109375, 5.4541015625, 6.197265625, 6.9404296875, 7.68359375, 8.4267578125, 9.169921875, 9.9130859375, 10.65625, 11.3994140625, 12.142578125, 12.8857421875, 13.62890625, 14.3720703125, 15.115234375, 15.8583984375, 16.6015625, 17.3447265625, 18.087890625, 18.8310546875, 19.57421875, 20.3173828125, 21.060546875, 21.8037109375, 22.546875]}, "gradients/decoder.transformer.h.10.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 3.0, 5.0, 7.0, 17.0, 28.0, 36.0, 66.0, 97.0, 161.0, 259.0, 560.0, 3860.0, 3136453.0, 2965.0, 537.0, 267.0, 165.0, 90.0, 44.0, 37.0, 27.0, 10.0, 10.0, 5.0, 4.0, 0.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-120.125, -116.8310546875, -113.537109375, -110.2431640625, -106.94921875, -103.6552734375, -100.361328125, -97.0673828125, -93.7734375, -90.4794921875, -87.185546875, -83.8916015625, -80.59765625, -77.3037109375, -74.009765625, -70.7158203125, -67.421875, -64.1279296875, -60.833984375, -57.5400390625, -54.24609375, -50.9521484375, -47.658203125, -44.3642578125, -41.0703125, -37.7763671875, -34.482421875, -31.1884765625, -27.89453125, -24.6005859375, -21.306640625, -18.0126953125, -14.71875, -11.4248046875, -8.130859375, -4.8369140625, -1.54296875, 1.7509765625, 5.044921875, 8.3388671875, 11.6328125, 14.9267578125, 18.220703125, 21.5146484375, 24.80859375, 28.1025390625, 31.396484375, 34.6904296875, 37.984375, 41.2783203125, 44.572265625, 47.8662109375, 51.16015625, 54.4541015625, 57.748046875, 61.0419921875, 64.3359375, 67.6298828125, 70.923828125, 74.2177734375, 77.51171875, 80.8056640625, 84.099609375, 87.3935546875, 90.6875]}, "gradients/decoder.transformer.h.10.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 4.0, 9.0, 56.0, 219.0, 391.0, 252.0, 71.0, 14.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-42.63893508911133, -40.90486526489258, -39.17079544067383, -37.43672561645508, -35.702659606933594, -33.968589782714844, -32.234519958496094, -30.500450134277344, -28.766380310058594, -27.032310485839844, -25.298240661621094, -23.564172744750977, -21.830102920532227, -20.096033096313477, -18.36196517944336, -16.62789535522461, -14.89382553100586, -13.15975570678711, -11.425686836242676, -9.691617965698242, -7.957548141479492, -6.223478317260742, -4.489409446716309, -2.755340576171875, -1.021270751953125, 0.7127985954284668, 2.4468679428100586, 4.18093729019165, 5.915006637573242, 7.649076461791992, 9.383145332336426, 11.11721420288086, 12.851280212402344, 14.585350036621094, 16.319419860839844, 18.05348777770996, 19.78755760192871, 21.52162742614746, 23.255695343017578, 24.989765167236328, 26.723834991455078, 28.457904815673828, 30.191974639892578, 31.926042556762695, 33.66011047363281, 35.39418029785156, 37.12825012207031, 38.86231994628906, 40.59638977050781, 42.33045959472656, 44.06452941894531, 45.79859924316406, 47.53266906738281, 49.26673889160156, 51.00080490112305, 52.7348747253418, 54.46894454956055, 56.2030143737793, 57.93708419799805, 59.6711540222168, 61.40522003173828, 63.13928985595703, 64.87335968017578, 66.60742950439453, 68.34149932861328]}, "gradients/decoder.transformer.h.10.ln_1.bias": {"_type": "histogram", "values": [2.0, 1.0, 2.0, 0.0, 2.0, 1.0, 3.0, 4.0, 3.0, 4.0, 5.0, 11.0, 9.0, 15.0, 10.0, 10.0, 7.0, 17.0, 17.0, 22.0, 20.0, 20.0, 26.0, 30.0, 39.0, 36.0, 32.0, 31.0, 46.0, 37.0, 36.0, 30.0, 40.0, 42.0, 39.0, 37.0, 34.0, 26.0, 37.0, 26.0, 34.0, 25.0, 18.0, 23.0, 21.0, 24.0, 13.0, 15.0, 9.0, 5.0, 2.0, 3.0, 4.0, 6.0, 4.0, 3.0, 0.0, 0.0, 2.0, 2.0, 0.0, 0.0, 1.0, 1.0], "bins": [-53.49555206298828, -51.79428482055664, -50.093017578125, -48.391754150390625, -46.690486907958984, -44.989219665527344, -43.28795623779297, -41.58668899536133, -39.88542175292969, -38.18415451049805, -36.482887268066406, -34.78162384033203, -33.08035659790039, -31.37908935546875, -29.677824020385742, -27.976558685302734, -26.275291442871094, -24.574024200439453, -22.872758865356445, -21.171493530273438, -19.470226287841797, -17.768959045410156, -16.06769371032715, -14.366427421569824, -12.6651611328125, -10.963894844055176, -9.262628555297852, -7.561362266540527, -5.860095977783203, -4.158829689025879, -2.4575634002685547, -0.7562971115112305, 0.9449691772460938, 2.646235466003418, 4.347501754760742, 6.048768043518066, 7.750034332275391, 9.451300621032715, 11.152566909790039, 12.853833198547363, 14.555099487304688, 16.256366729736328, 17.957632064819336, 19.658897399902344, 21.360164642333984, 23.061431884765625, 24.762697219848633, 26.46396255493164, 28.16522979736328, 29.866497039794922, 31.56776237487793, 33.26902770996094, 34.97029495239258, 36.67156219482422, 38.372825622558594, 40.074092864990234, 41.775360107421875, 43.476627349853516, 45.177894592285156, 46.87915802001953, 48.58042526245117, 50.28169250488281, 51.98295593261719, 53.68422317504883, 55.38549041748047]}, "gradients/decoder.transformer.h.9.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 3.0, 4.0, 5.0, 6.0, 3.0, 4.0, 8.0, 5.0, 7.0, 13.0, 8.0, 12.0, 15.0, 20.0, 22.0, 22.0, 30.0, 43.0, 36.0, 48.0, 47.0, 40.0, 37.0, 47.0, 45.0, 44.0, 40.0, 40.0, 40.0, 41.0, 37.0, 26.0, 28.0, 25.0, 26.0, 22.0, 23.0, 17.0, 13.0, 17.0, 11.0, 11.0, 8.0, 5.0, 5.0, 2.0, 0.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0], "bins": [-8.234375, -7.9898681640625, -7.745361328125, -7.5008544921875, -7.25634765625, -7.0118408203125, -6.767333984375, -6.5228271484375, -6.2783203125, -6.0338134765625, -5.789306640625, -5.5447998046875, -5.30029296875, -5.0557861328125, -4.811279296875, -4.5667724609375, -4.322265625, -4.0777587890625, -3.833251953125, -3.5887451171875, -3.34423828125, -3.0997314453125, -2.855224609375, -2.6107177734375, -2.3662109375, -2.1217041015625, -1.877197265625, -1.6326904296875, -1.38818359375, -1.1436767578125, -0.899169921875, -0.6546630859375, -0.41015625, -0.1656494140625, 0.078857421875, 0.3233642578125, 0.56787109375, 0.8123779296875, 1.056884765625, 1.3013916015625, 1.5458984375, 1.7904052734375, 2.034912109375, 2.2794189453125, 2.52392578125, 2.7684326171875, 3.012939453125, 3.2574462890625, 3.501953125, 3.7464599609375, 3.990966796875, 4.2354736328125, 4.47998046875, 4.7244873046875, 4.968994140625, 5.2135009765625, 5.4580078125, 5.7025146484375, 5.947021484375, 6.1915283203125, 6.43603515625, 6.6805419921875, 6.925048828125, 7.1695556640625, 7.4140625]}, "gradients/decoder.transformer.h.9.mlp.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 2.0, 3.0, 4.0, 3.0, 4.0, 4.0, 5.0, 7.0, 4.0, 6.0, 11.0, 10.0, 13.0, 21.0, 28.0, 23.0, 38.0, 52.0, 51.0, 109.0, 155.0, 205.0, 372.0, 900.0, 5564.0, 209569.0, 3619828.0, 347984.0, 7133.0, 1025.0, 360.0, 218.0, 151.0, 110.0, 71.0, 49.0, 51.0, 30.0, 26.0, 13.0, 13.0, 6.0, 17.0, 19.0, 10.0, 7.0, 5.0, 3.0, 1.0, 1.0, 2.0, 2.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0], "bins": [-25.421875, -24.6162109375, -23.810546875, -23.0048828125, -22.19921875, -21.3935546875, -20.587890625, -19.7822265625, -18.9765625, -18.1708984375, -17.365234375, -16.5595703125, -15.75390625, -14.9482421875, -14.142578125, -13.3369140625, -12.53125, -11.7255859375, -10.919921875, -10.1142578125, -9.30859375, -8.5029296875, -7.697265625, -6.8916015625, -6.0859375, -5.2802734375, -4.474609375, -3.6689453125, -2.86328125, -2.0576171875, -1.251953125, -0.4462890625, 0.359375, 1.1650390625, 1.970703125, 2.7763671875, 3.58203125, 4.3876953125, 5.193359375, 5.9990234375, 6.8046875, 7.6103515625, 8.416015625, 9.2216796875, 10.02734375, 10.8330078125, 11.638671875, 12.4443359375, 13.25, 14.0556640625, 14.861328125, 15.6669921875, 16.47265625, 17.2783203125, 18.083984375, 18.8896484375, 19.6953125, 20.5009765625, 21.306640625, 22.1123046875, 22.91796875, 23.7236328125, 24.529296875, 25.3349609375, 26.140625]}, "gradients/decoder.transformer.h.9.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 3.0, 0.0, 2.0, 3.0, 6.0, 3.0, 9.0, 12.0, 12.0, 17.0, 29.0, 35.0, 39.0, 37.0, 48.0, 75.0, 96.0, 109.0, 136.0, 201.0, 280.0, 353.0, 478.0, 420.0, 391.0, 292.0, 229.0, 179.0, 135.0, 98.0, 82.0, 53.0, 40.0, 47.0, 26.0, 15.0, 22.0, 21.0, 15.0, 10.0, 3.0, 3.0, 5.0, 2.0, 1.0, 5.0, 4.0, 5.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-11.5625, -11.2147216796875, -10.866943359375, -10.5191650390625, -10.17138671875, -9.8236083984375, -9.475830078125, -9.1280517578125, -8.7802734375, -8.4324951171875, -8.084716796875, -7.7369384765625, -7.38916015625, -7.0413818359375, -6.693603515625, -6.3458251953125, -5.998046875, -5.6502685546875, -5.302490234375, -4.9547119140625, -4.60693359375, -4.2591552734375, -3.911376953125, -3.5635986328125, -3.2158203125, -2.8680419921875, -2.520263671875, -2.1724853515625, -1.82470703125, -1.4769287109375, -1.129150390625, -0.7813720703125, -0.43359375, -0.0858154296875, 0.261962890625, 0.6097412109375, 0.95751953125, 1.3052978515625, 1.653076171875, 2.0008544921875, 2.3486328125, 2.6964111328125, 3.044189453125, 3.3919677734375, 3.73974609375, 4.0875244140625, 4.435302734375, 4.7830810546875, 5.130859375, 5.4786376953125, 5.826416015625, 6.1741943359375, 6.52197265625, 6.8697509765625, 7.217529296875, 7.5653076171875, 7.9130859375, 8.2608642578125, 8.608642578125, 8.9564208984375, 9.30419921875, 9.6519775390625, 9.999755859375, 10.3475341796875, 10.6953125]}, "gradients/decoder.transformer.h.9.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 2.0, 5.0, 5.0, 3.0, 6.0, 7.0, 10.0, 11.0, 15.0, 20.0, 17.0, 26.0, 38.0, 42.0, 52.0, 56.0, 56.0, 91.0, 114.0, 158.0, 172.0, 253.0, 341.0, 615.0, 4225.0, 4050930.0, 134213.0, 1059.0, 444.0, 282.0, 204.0, 165.0, 119.0, 103.0, 78.0, 73.0, 61.0, 49.0, 42.0, 32.0, 23.0, 22.0, 14.0, 9.0, 11.0, 5.0, 4.0, 3.0, 3.0, 1.0, 2.0, 2.0, 1.0, 1.0, 0.0, 3.0, 1.0], "bins": [-79.0, -76.587890625, -74.17578125, -71.763671875, -69.3515625, -66.939453125, -64.52734375, -62.115234375, -59.703125, -57.291015625, -54.87890625, -52.466796875, -50.0546875, -47.642578125, -45.23046875, -42.818359375, -40.40625, -37.994140625, -35.58203125, -33.169921875, -30.7578125, -28.345703125, -25.93359375, -23.521484375, -21.109375, -18.697265625, -16.28515625, -13.873046875, -11.4609375, -9.048828125, -6.63671875, -4.224609375, -1.8125, 0.599609375, 3.01171875, 5.423828125, 7.8359375, 10.248046875, 12.66015625, 15.072265625, 17.484375, 19.896484375, 22.30859375, 24.720703125, 27.1328125, 29.544921875, 31.95703125, 34.369140625, 36.78125, 39.193359375, 41.60546875, 44.017578125, 46.4296875, 48.841796875, 51.25390625, 53.666015625, 56.078125, 58.490234375, 60.90234375, 63.314453125, 65.7265625, 68.138671875, 70.55078125, 72.962890625, 75.375]}, "gradients/decoder.transformer.h.9.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 5.0, 33.0, 117.0, 306.0, 326.0, 153.0, 61.0, 14.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-42.84886169433594, -38.99525833129883, -35.14165496826172, -31.288049697875977, -27.434446334838867, -23.580842971801758, -19.727237701416016, -15.873634338378906, -12.020030975341797, -8.166427612304688, -4.312823295593262, -0.45921897888183594, 3.3943843841552734, 7.247987747192383, 11.101593017578125, 14.955196380615234, 18.808799743652344, 22.662403106689453, 26.516006469726562, 30.369611740112305, 34.22321319580078, 38.076820373535156, 41.930423736572266, 45.784027099609375, 49.637630462646484, 53.491233825683594, 57.3448371887207, 61.19844055175781, 65.05204772949219, 68.90564727783203, 72.7592544555664, 76.61285400390625, 80.46646118164062, 84.320068359375, 88.17366790771484, 92.02727508544922, 95.88087463378906, 99.73448181152344, 103.58808898925781, 107.44168853759766, 111.2952880859375, 115.14889526367188, 119.00249481201172, 122.8561019897461, 126.70970153808594, 130.5633087158203, 134.4169158935547, 138.2705078125, 142.12411499023438, 145.97772216796875, 149.83132934570312, 153.68492126464844, 157.5385284423828, 161.3921356201172, 165.24574279785156, 169.09933471679688, 172.9529571533203, 176.8065643310547, 180.66017150878906, 184.51376342773438, 188.36737060546875, 192.22097778320312, 196.0745849609375, 199.92819213867188, 203.7817840576172]}, "gradients/decoder.transformer.h.9.ln_2.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 1.0, 3.0, 5.0, 9.0, 2.0, 7.0, 10.0, 6.0, 7.0, 15.0, 20.0, 12.0, 17.0, 8.0, 19.0, 28.0, 18.0, 26.0, 22.0, 32.0, 28.0, 43.0, 29.0, 36.0, 33.0, 42.0, 40.0, 33.0, 51.0, 48.0, 33.0, 27.0, 27.0, 32.0, 29.0, 26.0, 25.0, 23.0, 19.0, 18.0, 15.0, 20.0, 12.0, 14.0, 10.0, 7.0, 8.0, 8.0, 6.0, 1.0, 0.0, 1.0, 1.0, 3.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-34.623260498046875, -33.453102111816406, -32.28294372558594, -31.112781524658203, -29.942623138427734, -28.772464752197266, -27.602304458618164, -26.432144165039062, -25.261985778808594, -24.091827392578125, -22.921667098999023, -21.751506805419922, -20.581348419189453, -19.411190032958984, -18.241029739379883, -17.07086944580078, -15.900711059570312, -14.730551719665527, -13.560392379760742, -12.390233039855957, -11.220073699951172, -10.049914360046387, -8.879755020141602, -7.709595680236816, -6.539436340332031, -5.369277000427246, -4.199117660522461, -3.028958320617676, -1.8587989807128906, -0.6886396408081055, 0.4815196990966797, 1.6516790390014648, 2.82183837890625, 3.991997718811035, 5.16215705871582, 6.3323163986206055, 7.502475738525391, 8.672635078430176, 9.842794418334961, 11.012953758239746, 12.183113098144531, 13.353272438049316, 14.523431777954102, 15.693591117858887, 16.863750457763672, 18.03390884399414, 19.204069137573242, 20.374229431152344, 21.544387817382812, 22.71454620361328, 23.884706497192383, 25.054866790771484, 26.225025177001953, 27.395183563232422, 28.565343856811523, 29.735504150390625, 30.905662536621094, 32.07582092285156, 33.24597930908203, 34.416141510009766, 35.586299896240234, 36.7564582824707, 37.92662048339844, 39.096778869628906, 40.266937255859375]}, "gradients/decoder.transformer.h.9.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 2.0, 1.0, 3.0, 3.0, 2.0, 4.0, 6.0, 6.0, 6.0, 6.0, 13.0, 13.0, 5.0, 12.0, 12.0, 24.0, 30.0, 26.0, 36.0, 34.0, 41.0, 44.0, 48.0, 34.0, 34.0, 40.0, 42.0, 57.0, 43.0, 30.0, 42.0, 37.0, 28.0, 37.0, 23.0, 25.0, 20.0, 21.0, 23.0, 27.0, 10.0, 10.0, 16.0, 6.0, 12.0, 3.0, 4.0, 5.0, 1.0, 4.0, 0.0, 3.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-7.70703125, -7.47003173828125, -7.2330322265625, -6.99603271484375, -6.759033203125, -6.52203369140625, -6.2850341796875, -6.04803466796875, -5.81103515625, -5.57403564453125, -5.3370361328125, -5.10003662109375, -4.863037109375, -4.62603759765625, -4.3890380859375, -4.15203857421875, -3.9150390625, -3.67803955078125, -3.4410400390625, -3.20404052734375, -2.967041015625, -2.73004150390625, -2.4930419921875, -2.25604248046875, -2.01904296875, -1.78204345703125, -1.5450439453125, -1.30804443359375, -1.071044921875, -0.83404541015625, -0.5970458984375, -0.36004638671875, -0.123046875, 0.11395263671875, 0.3509521484375, 0.58795166015625, 0.824951171875, 1.06195068359375, 1.2989501953125, 1.53594970703125, 1.77294921875, 2.00994873046875, 2.2469482421875, 2.48394775390625, 2.720947265625, 2.95794677734375, 3.1949462890625, 3.43194580078125, 3.6689453125, 3.90594482421875, 4.1429443359375, 4.37994384765625, 4.616943359375, 4.85394287109375, 5.0909423828125, 5.32794189453125, 5.56494140625, 5.80194091796875, 6.0389404296875, 6.27593994140625, 6.512939453125, 6.74993896484375, 6.9869384765625, 7.22393798828125, 7.4609375]}, "gradients/decoder.transformer.h.9.crossattention.c_proj.weight": {"_type": "histogram", "values": [3.0, 3.0, 3.0, 8.0, 16.0, 15.0, 25.0, 27.0, 49.0, 54.0, 91.0, 114.0, 151.0, 216.0, 276.0, 418.0, 551.0, 786.0, 1061.0, 1527.0, 2164.0, 2962.0, 4493.0, 6500.0, 9353.0, 14172.0, 21442.0, 32371.0, 50570.0, 81153.0, 141778.0, 307000.0, 140281.0, 80907.0, 50112.0, 32191.0, 20822.0, 13861.0, 9431.0, 6469.0, 4424.0, 3206.0, 2157.0, 1505.0, 1089.0, 811.0, 549.0, 390.0, 270.0, 228.0, 125.0, 119.0, 78.0, 58.0, 43.0, 32.0, 20.0, 16.0, 14.0, 6.0, 5.0, 2.0, 2.0, 1.0], "bins": [-1.6044921875, -1.5535430908203125, -1.502593994140625, -1.4516448974609375, -1.40069580078125, -1.3497467041015625, -1.298797607421875, -1.2478485107421875, -1.1968994140625, -1.1459503173828125, -1.095001220703125, -1.0440521240234375, -0.99310302734375, -0.9421539306640625, -0.891204833984375, -0.8402557373046875, -0.789306640625, -0.7383575439453125, -0.687408447265625, -0.6364593505859375, -0.58551025390625, -0.5345611572265625, -0.483612060546875, -0.4326629638671875, -0.3817138671875, -0.3307647705078125, -0.279815673828125, -0.2288665771484375, -0.17791748046875, -0.1269683837890625, -0.076019287109375, -0.0250701904296875, 0.02587890625, 0.0768280029296875, 0.127777099609375, 0.1787261962890625, 0.22967529296875, 0.2806243896484375, 0.331573486328125, 0.3825225830078125, 0.4334716796875, 0.4844207763671875, 0.535369873046875, 0.5863189697265625, 0.63726806640625, 0.6882171630859375, 0.739166259765625, 0.7901153564453125, 0.841064453125, 0.8920135498046875, 0.942962646484375, 0.9939117431640625, 1.04486083984375, 1.0958099365234375, 1.146759033203125, 1.1977081298828125, 1.2486572265625, 1.2996063232421875, 1.350555419921875, 1.4015045166015625, 1.45245361328125, 1.5034027099609375, 1.554351806640625, 1.6053009033203125, 1.65625]}, "gradients/decoder.transformer.h.9.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 2.0, 2.0, 3.0, 0.0, 3.0, 3.0, 3.0, 3.0, 4.0, 5.0, 6.0, 4.0, 10.0, 16.0, 13.0, 19.0, 22.0, 20.0, 25.0, 36.0, 22.0, 34.0, 27.0, 32.0, 27.0, 35.0, 34.0, 39.0, 40.0, 1062.0, 34.0, 47.0, 33.0, 39.0, 32.0, 27.0, 33.0, 29.0, 29.0, 19.0, 27.0, 24.0, 19.0, 13.0, 13.0, 10.0, 11.0, 14.0, 13.0, 5.0, 8.0, 5.0, 3.0, 1.0, 0.0, 3.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-4.75390625, -4.60888671875, -4.4638671875, -4.31884765625, -4.173828125, -4.02880859375, -3.8837890625, -3.73876953125, -3.59375, -3.44873046875, -3.3037109375, -3.15869140625, -3.013671875, -2.86865234375, -2.7236328125, -2.57861328125, -2.43359375, -2.28857421875, -2.1435546875, -1.99853515625, -1.853515625, -1.70849609375, -1.5634765625, -1.41845703125, -1.2734375, -1.12841796875, -0.9833984375, -0.83837890625, -0.693359375, -0.54833984375, -0.4033203125, -0.25830078125, -0.11328125, 0.03173828125, 0.1767578125, 0.32177734375, 0.466796875, 0.61181640625, 0.7568359375, 0.90185546875, 1.046875, 1.19189453125, 1.3369140625, 1.48193359375, 1.626953125, 1.77197265625, 1.9169921875, 2.06201171875, 2.20703125, 2.35205078125, 2.4970703125, 2.64208984375, 2.787109375, 2.93212890625, 3.0771484375, 3.22216796875, 3.3671875, 3.51220703125, 3.6572265625, 3.80224609375, 3.947265625, 4.09228515625, 4.2373046875, 4.38232421875, 4.52734375]}, "gradients/decoder.transformer.h.9.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 3.0, 0.0, 1.0, 3.0, 6.0, 12.0, 12.0, 19.0, 27.0, 28.0, 45.0, 76.0, 124.0, 173.0, 288.0, 475.0, 782.0, 1355.0, 2416.0, 4244.0, 7330.0, 13182.0, 24686.0, 46583.0, 91255.0, 209737.0, 1429806.0, 127748.0, 63491.0, 33048.0, 17635.0, 9713.0, 5423.0, 2971.0, 1741.0, 1086.0, 637.0, 382.0, 214.0, 128.0, 86.0, 54.0, 36.0, 22.0, 18.0, 11.0, 8.0, 6.0, 3.0, 4.0, 3.0, 3.0, 0.0, 3.0, 2.0, 2.0, 2.0, 0.0, 1.0], "bins": [-2.474609375, -2.3953857421875, -2.316162109375, -2.2369384765625, -2.15771484375, -2.0784912109375, -1.999267578125, -1.9200439453125, -1.8408203125, -1.7615966796875, -1.682373046875, -1.6031494140625, -1.52392578125, -1.4447021484375, -1.365478515625, -1.2862548828125, -1.20703125, -1.1278076171875, -1.048583984375, -0.9693603515625, -0.89013671875, -0.8109130859375, -0.731689453125, -0.6524658203125, -0.5732421875, -0.4940185546875, -0.414794921875, -0.3355712890625, -0.25634765625, -0.1771240234375, -0.097900390625, -0.0186767578125, 0.060546875, 0.1397705078125, 0.218994140625, 0.2982177734375, 0.37744140625, 0.4566650390625, 0.535888671875, 0.6151123046875, 0.6943359375, 0.7735595703125, 0.852783203125, 0.9320068359375, 1.01123046875, 1.0904541015625, 1.169677734375, 1.2489013671875, 1.328125, 1.4073486328125, 1.486572265625, 1.5657958984375, 1.64501953125, 1.7242431640625, 1.803466796875, 1.8826904296875, 1.9619140625, 2.0411376953125, 2.120361328125, 2.1995849609375, 2.27880859375, 2.3580322265625, 2.437255859375, 2.5164794921875, 2.595703125]}, "gradients/decoder.transformer.h.9.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 0.0, 1.0, 4.0, 0.0, 1.0, 3.0, 1.0, 6.0, 7.0, 12.0, 9.0, 15.0, 23.0, 25.0, 30.0, 24.0, 53.0, 71.0, 101.0, 112.0, 134.0, 102.0, 78.0, 51.0, 45.0, 19.0, 15.0, 18.0, 12.0, 8.0, 5.0, 8.0, 6.0, 2.0, 2.0, 3.0, 1.0, 3.0, 0.0, 3.0, 0.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.0023021697998046875, -0.0022348761558532715, -0.0021675825119018555, -0.0021002888679504395, -0.0020329952239990234, -0.0019657015800476074, -0.0018984079360961914, -0.0018311142921447754, -0.0017638206481933594, -0.0016965270042419434, -0.0016292333602905273, -0.0015619397163391113, -0.0014946460723876953, -0.0014273524284362793, -0.0013600587844848633, -0.0012927651405334473, -0.0012254714965820312, -0.0011581778526306152, -0.0010908842086791992, -0.0010235905647277832, -0.0009562969207763672, -0.0008890032768249512, -0.0008217096328735352, -0.0007544159889221191, -0.0006871223449707031, -0.0006198287010192871, -0.0005525350570678711, -0.0004852414131164551, -0.00041794776916503906, -0.00035065412521362305, -0.00028336048126220703, -0.00021606683731079102, -0.000148773193359375, -8.147954940795898e-05, -1.4185905456542969e-05, 5.310773849487305e-05, 0.00012040138244628906, 0.00018769502639770508, 0.0002549886703491211, 0.0003222823143005371, 0.0003895759582519531, 0.00045686960220336914, 0.0005241632461547852, 0.0005914568901062012, 0.0006587505340576172, 0.0007260441780090332, 0.0007933378219604492, 0.0008606314659118652, 0.0009279251098632812, 0.0009952187538146973, 0.0010625123977661133, 0.0011298060417175293, 0.0011970996856689453, 0.0012643933296203613, 0.0013316869735717773, 0.0013989806175231934, 0.0014662742614746094, 0.0015335679054260254, 0.0016008615493774414, 0.0016681551933288574, 0.0017354488372802734, 0.0018027424812316895, 0.0018700361251831055, 0.0019373297691345215, 0.0020046234130859375]}, "gradients/decoder.transformer.h.9.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 2.0, 3.0, 3.0, 2.0, 2.0, 6.0, 3.0, 7.0, 11.0, 8.0, 20.0, 15.0, 27.0, 20.0, 46.0, 76.0, 132.0, 254.0, 1050.0, 1039616.0, 6313.0, 410.0, 180.0, 103.0, 59.0, 42.0, 33.0, 34.0, 26.0, 8.0, 17.0, 13.0, 9.0, 2.0, 3.0, 1.0, 3.0, 3.0, 0.0, 2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.04486083984375, -0.0433506965637207, -0.041840553283691406, -0.04033041000366211, -0.03882026672363281, -0.037310123443603516, -0.03579998016357422, -0.03428983688354492, -0.032779693603515625, -0.03126955032348633, -0.02975940704345703, -0.028249263763427734, -0.026739120483398438, -0.02522897720336914, -0.023718833923339844, -0.022208690643310547, -0.02069854736328125, -0.019188404083251953, -0.017678260803222656, -0.01616811752319336, -0.014657974243164062, -0.013147830963134766, -0.011637687683105469, -0.010127544403076172, -0.008617401123046875, -0.007107257843017578, -0.005597114562988281, -0.004086971282958984, -0.0025768280029296875, -0.0010666847229003906, 0.00044345855712890625, 0.001953601837158203, 0.0034637451171875, 0.004973888397216797, 0.006484031677246094, 0.00799417495727539, 0.009504318237304688, 0.011014461517333984, 0.012524604797363281, 0.014034748077392578, 0.015544891357421875, 0.017055034637451172, 0.01856517791748047, 0.020075321197509766, 0.021585464477539062, 0.02309560775756836, 0.024605751037597656, 0.026115894317626953, 0.02762603759765625, 0.029136180877685547, 0.030646324157714844, 0.03215646743774414, 0.03366661071777344, 0.035176753997802734, 0.03668689727783203, 0.03819704055786133, 0.039707183837890625, 0.04121732711791992, 0.04272747039794922, 0.044237613677978516, 0.04574775695800781, 0.04725790023803711, 0.048768043518066406, 0.0502781867980957, 0.051788330078125]}, "gradients/decoder.transformer.h.9.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 3.0, 2.0, 19.0, 91.0, 443.0, 385.0, 68.0, 6.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0005329780397005379, -0.00044286041520535946, -0.000352742790710181, -0.000262625195318833, -0.00017250757082365453, -8.238994632847607e-05, 7.727649062871933e-06, 9.78452735580504e-05, 0.00018796289805322886, 0.0002780805225484073, 0.0003681981470435858, 0.0004583157424349338, 0.0005484333960339427, 0.0006385509623214602, 0.0007286685868166387, 0.0008187862113118172, 0.0009089038358069956, 0.000999021460302174, 0.0010891390265896916, 0.001179256709292531, 0.0012693742755800486, 0.001359491958282888, 0.0014496095245704055, 0.0015397272072732449, 0.0016298447735607624, 0.00171996233984828, 0.0018100800225511193, 0.0019001975888386369, 0.0019903152715414762, 0.002080432837828994, 0.0021705504041165113, 0.0022606682032346725, 0.0023507855366915464, 0.002440903102979064, 0.0025310206692665815, 0.0026211384683847427, 0.0027112560346722603, 0.002801373600959778, 0.0028914911672472954, 0.002981608733534813, 0.003071726532652974, 0.0031618440989404917, 0.0032519616652280092, 0.0033420794643461704, 0.003432197030633688, 0.0035223145969212055, 0.003612432163208723, 0.0037025497294962406, 0.003792667295783758, 0.0038827848620712757, 0.003972902428358793, 0.004063019994646311, 0.004153137560933828, 0.004243255592882633, 0.004333373159170151, 0.004423490725457668, 0.004513608291745186, 0.004603725858032703, 0.004693843424320221, 0.0047839609906077385, 0.004874078556895256, 0.004964196588844061, 0.0050543141551315784, 0.005144431721419096, 0.0052345492877066135]}, "gradients/decoder.transformer.h.9.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 0.0, 3.0, 0.0, 2.0, 2.0, 2.0, 5.0, 3.0, 5.0, 5.0, 6.0, 8.0, 7.0, 13.0, 6.0, 15.0, 11.0, 15.0, 26.0, 30.0, 22.0, 38.0, 35.0, 39.0, 36.0, 35.0, 41.0, 37.0, 33.0, 38.0, 40.0, 43.0, 35.0, 24.0, 28.0, 43.0, 39.0, 29.0, 28.0, 24.0, 23.0, 18.0, 19.0, 18.0, 17.0, 13.0, 12.0, 13.0, 5.0, 4.0, 7.0, 6.0, 5.0, 2.0, 0.0, 2.0, 3.0, 1.0, 0.0, 2.0], "bins": [-0.0007742047309875488, -0.0007513789460062981, -0.0007285531610250473, -0.0007057273760437965, -0.0006829015910625458, -0.000660075806081295, -0.0006372500211000443, -0.0006144242361187935, -0.0005915984511375427, -0.000568772666156292, -0.0005459468811750412, -0.0005231210961937904, -0.0005002953112125397, -0.0004774695262312889, -0.00045464374125003815, -0.0004318179562687874, -0.0004089921712875366, -0.00038616638630628586, -0.0003633406013250351, -0.00034051481634378433, -0.00031768903136253357, -0.0002948632463812828, -0.00027203746140003204, -0.0002492116764187813, -0.00022638589143753052, -0.00020356010645627975, -0.000180734321475029, -0.00015790853649377823, -0.00013508275151252747, -0.0001122569665312767, -8.943118155002594e-05, -6.660539656877518e-05, -4.3779611587524414e-05, -2.095382660627365e-05, 1.8719583749771118e-06, 2.4697743356227875e-05, 4.752352833747864e-05, 7.03493133187294e-05, 9.317509829998016e-05, 0.00011600088328123093, 0.0001388266682624817, 0.00016165245324373245, 0.00018447823822498322, 0.00020730402320623398, 0.00023012980818748474, 0.0002529555931687355, 0.00027578137814998627, 0.00029860716313123703, 0.0003214329481124878, 0.00034425873309373856, 0.0003670845180749893, 0.0003899103030562401, 0.00041273608803749084, 0.0004355618730187416, 0.00045838765799999237, 0.00048121344298124313, 0.0005040392279624939, 0.0005268650129437447, 0.0005496907979249954, 0.0005725165829062462, 0.000595342367887497, 0.0006181681528687477, 0.0006409939378499985, 0.0006638197228312492, 0.0006866455078125]}, "gradients/decoder.transformer.h.9.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 2.0, 1.0, 3.0, 3.0, 2.0, 4.0, 6.0, 6.0, 6.0, 6.0, 13.0, 13.0, 5.0, 12.0, 12.0, 24.0, 30.0, 26.0, 37.0, 33.0, 41.0, 44.0, 48.0, 34.0, 34.0, 40.0, 43.0, 56.0, 44.0, 29.0, 42.0, 37.0, 28.0, 37.0, 24.0, 24.0, 21.0, 20.0, 23.0, 27.0, 10.0, 11.0, 15.0, 6.0, 13.0, 2.0, 4.0, 5.0, 1.0, 4.0, 0.0, 3.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-7.70703125, -7.469970703125, -7.23291015625, -6.995849609375, -6.7587890625, -6.521728515625, -6.28466796875, -6.047607421875, -5.810546875, -5.573486328125, -5.33642578125, -5.099365234375, -4.8623046875, -4.625244140625, -4.38818359375, -4.151123046875, -3.9140625, -3.677001953125, -3.43994140625, -3.202880859375, -2.9658203125, -2.728759765625, -2.49169921875, -2.254638671875, -2.017578125, -1.780517578125, -1.54345703125, -1.306396484375, -1.0693359375, -0.832275390625, -0.59521484375, -0.358154296875, -0.12109375, 0.115966796875, 0.35302734375, 0.590087890625, 0.8271484375, 1.064208984375, 1.30126953125, 1.538330078125, 1.775390625, 2.012451171875, 2.24951171875, 2.486572265625, 2.7236328125, 2.960693359375, 3.19775390625, 3.434814453125, 3.671875, 3.908935546875, 4.14599609375, 4.383056640625, 4.6201171875, 4.857177734375, 5.09423828125, 5.331298828125, 5.568359375, 5.805419921875, 6.04248046875, 6.279541015625, 6.5166015625, 6.753662109375, 6.99072265625, 7.227783203125, 7.46484375]}, "gradients/decoder.transformer.h.9.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 3.0, 1.0, 3.0, 3.0, 3.0, 6.0, 8.0, 6.0, 8.0, 12.0, 16.0, 19.0, 20.0, 29.0, 30.0, 59.0, 77.0, 78.0, 140.0, 211.0, 286.0, 480.0, 758.0, 1992.0, 10139.0, 87191.0, 764250.0, 160905.0, 16462.0, 2825.0, 963.0, 499.0, 295.0, 205.0, 142.0, 101.0, 73.0, 57.0, 47.0, 51.0, 25.0, 14.0, 19.0, 13.0, 16.0, 5.0, 3.0, 9.0, 2.0, 5.0, 0.0, 3.0, 2.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-14.125, -13.6920166015625, -13.259033203125, -12.8260498046875, -12.39306640625, -11.9600830078125, -11.527099609375, -11.0941162109375, -10.6611328125, -10.2281494140625, -9.795166015625, -9.3621826171875, -8.92919921875, -8.4962158203125, -8.063232421875, -7.6302490234375, -7.197265625, -6.7642822265625, -6.331298828125, -5.8983154296875, -5.46533203125, -5.0323486328125, -4.599365234375, -4.1663818359375, -3.7333984375, -3.3004150390625, -2.867431640625, -2.4344482421875, -2.00146484375, -1.5684814453125, -1.135498046875, -0.7025146484375, -0.26953125, 0.1634521484375, 0.596435546875, 1.0294189453125, 1.46240234375, 1.8953857421875, 2.328369140625, 2.7613525390625, 3.1943359375, 3.6273193359375, 4.060302734375, 4.4932861328125, 4.92626953125, 5.3592529296875, 5.792236328125, 6.2252197265625, 6.658203125, 7.0911865234375, 7.524169921875, 7.9571533203125, 8.39013671875, 8.8231201171875, 9.256103515625, 9.6890869140625, 10.1220703125, 10.5550537109375, 10.988037109375, 11.4210205078125, 11.85400390625, 12.2869873046875, 12.719970703125, 13.1529541015625, 13.5859375]}, "gradients/decoder.transformer.h.9.attn.c_attn.bias": {"_type": "histogram", "values": [4.0, 2.0, 1.0, 3.0, 1.0, 1.0, 2.0, 3.0, 2.0, 8.0, 8.0, 11.0, 10.0, 13.0, 18.0, 18.0, 30.0, 30.0, 33.0, 32.0, 34.0, 36.0, 41.0, 49.0, 51.0, 93.0, 1600.0, 431.0, 84.0, 45.0, 48.0, 39.0, 39.0, 45.0, 29.0, 24.0, 26.0, 29.0, 24.0, 19.0, 12.0, 6.0, 7.0, 5.0, 7.0, 4.0, 1.0, 4.0, 3.0, 2.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-21.703125, -20.89794921875, -20.0927734375, -19.28759765625, -18.482421875, -17.67724609375, -16.8720703125, -16.06689453125, -15.26171875, -14.45654296875, -13.6513671875, -12.84619140625, -12.041015625, -11.23583984375, -10.4306640625, -9.62548828125, -8.8203125, -8.01513671875, -7.2099609375, -6.40478515625, -5.599609375, -4.79443359375, -3.9892578125, -3.18408203125, -2.37890625, -1.57373046875, -0.7685546875, 0.03662109375, 0.841796875, 1.64697265625, 2.4521484375, 3.25732421875, 4.0625, 4.86767578125, 5.6728515625, 6.47802734375, 7.283203125, 8.08837890625, 8.8935546875, 9.69873046875, 10.50390625, 11.30908203125, 12.1142578125, 12.91943359375, 13.724609375, 14.52978515625, 15.3349609375, 16.14013671875, 16.9453125, 17.75048828125, 18.5556640625, 19.36083984375, 20.166015625, 20.97119140625, 21.7763671875, 22.58154296875, 23.38671875, 24.19189453125, 24.9970703125, 25.80224609375, 26.607421875, 27.41259765625, 28.2177734375, 29.02294921875, 29.828125]}, "gradients/decoder.transformer.h.9.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 3.0, 4.0, 2.0, 3.0, 1.0, 7.0, 4.0, 9.0, 5.0, 15.0, 18.0, 20.0, 29.0, 38.0, 28.0, 39.0, 63.0, 83.0, 96.0, 202.0, 342.0, 1144.0, 463488.0, 2677637.0, 1321.0, 430.0, 200.0, 116.0, 68.0, 58.0, 38.0, 32.0, 35.0, 30.0, 24.0, 22.0, 12.0, 11.0, 11.0, 7.0, 8.0, 4.0, 2.0, 0.0, 3.0, 1.0, 4.0, 1.0, 2.0, 3.0], "bins": [-89.1875, -86.7802734375, -84.373046875, -81.9658203125, -79.55859375, -77.1513671875, -74.744140625, -72.3369140625, -69.9296875, -67.5224609375, -65.115234375, -62.7080078125, -60.30078125, -57.8935546875, -55.486328125, -53.0791015625, -50.671875, -48.2646484375, -45.857421875, -43.4501953125, -41.04296875, -38.6357421875, -36.228515625, -33.8212890625, -31.4140625, -29.0068359375, -26.599609375, -24.1923828125, -21.78515625, -19.3779296875, -16.970703125, -14.5634765625, -12.15625, -9.7490234375, -7.341796875, -4.9345703125, -2.52734375, -0.1201171875, 2.287109375, 4.6943359375, 7.1015625, 9.5087890625, 11.916015625, 14.3232421875, 16.73046875, 19.1376953125, 21.544921875, 23.9521484375, 26.359375, 28.7666015625, 31.173828125, 33.5810546875, 35.98828125, 38.3955078125, 40.802734375, 43.2099609375, 45.6171875, 48.0244140625, 50.431640625, 52.8388671875, 55.24609375, 57.6533203125, 60.060546875, 62.4677734375, 64.875]}, "gradients/decoder.transformer.h.9.ln_1.weight": {"_type": "histogram", "values": [2.0, 2.0, 22.0, 152.0, 444.0, 336.0, 54.0, 8.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-10.117131233215332, -8.01663589477539, -5.916140556335449, -3.815645217895508, -1.7151498794555664, 0.385345458984375, 2.4858407974243164, 4.586336135864258, 6.686831474304199, 8.78732681274414, 10.887822151184082, 12.988317489624023, 15.088812828063965, 17.189308166503906, 19.28980255126953, 21.39029884338379, 23.490795135498047, 25.591289520263672, 27.69178581237793, 29.792282104492188, 31.892776489257812, 33.99327087402344, 36.09376525878906, 38.19426345825195, 40.29475784301758, 42.3952522277832, 44.495750427246094, 46.59624481201172, 48.696739196777344, 50.79723358154297, 52.897727966308594, 54.998226165771484, 57.098724365234375, 59.19921875, 61.299713134765625, 63.400211334228516, 65.50070190429688, 67.60120391845703, 69.70169830322266, 71.80219268798828, 73.9026870727539, 76.00318145751953, 78.10367584228516, 80.20417022705078, 82.30467224121094, 84.40516662597656, 86.50566101074219, 88.60615539550781, 90.70664978027344, 92.80714416503906, 94.90763854980469, 97.00813293457031, 99.10862731933594, 101.2091293334961, 103.30962371826172, 105.41011810302734, 107.51061248779297, 109.6111068725586, 111.71160125732422, 113.81209564208984, 115.91259765625, 118.01309204101562, 120.11358642578125, 122.21408081054688, 124.3145751953125]}, "gradients/decoder.transformer.h.9.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 6.0, 4.0, 9.0, 7.0, 13.0, 13.0, 10.0, 9.0, 20.0, 26.0, 21.0, 24.0, 28.0, 31.0, 30.0, 38.0, 41.0, 43.0, 44.0, 42.0, 45.0, 47.0, 49.0, 40.0, 42.0, 21.0, 51.0, 53.0, 29.0, 25.0, 26.0, 15.0, 21.0, 20.0, 12.0, 5.0, 10.0, 7.0, 12.0, 5.0, 4.0, 2.0, 7.0, 3.0, 0.0, 2.0, 2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-69.00862121582031, -66.81301879882812, -64.6174087524414, -62.42180633544922, -60.226200103759766, -58.03059387207031, -55.834991455078125, -53.63938522338867, -51.44377899169922, -49.248172760009766, -47.05256652832031, -44.856964111328125, -42.66135787963867, -40.46575164794922, -38.27014923095703, -36.07454299926758, -33.878936767578125, -31.683330535888672, -29.48772621154785, -27.29212188720703, -25.096515655517578, -22.900909423828125, -20.705305099487305, -18.509700775146484, -16.31409454345703, -14.118489265441895, -11.922883987426758, -9.727278709411621, -7.531673431396484, -5.336068153381348, -3.140462875366211, -0.9448575973510742, 1.2507400512695312, 3.446345329284668, 5.641950607299805, 7.837555885314941, 10.033161163330078, 12.228766441345215, 14.424371719360352, 16.619976043701172, 18.815582275390625, 21.011188507080078, 23.2067928314209, 25.40239715576172, 27.598003387451172, 29.793609619140625, 31.989213943481445, 34.184818267822266, 36.38042449951172, 38.57603073120117, 40.771636962890625, 42.96723937988281, 45.162845611572266, 47.35845184326172, 49.554054260253906, 51.74966049194336, 53.94526672363281, 56.140872955322266, 58.33647918701172, 60.532081604003906, 62.72768783569336, 64.92329406738281, 67.118896484375, 69.31450653076172, 71.5101089477539]}, "gradients/decoder.transformer.h.8.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 2.0, 4.0, 2.0, 2.0, 4.0, 6.0, 8.0, 10.0, 14.0, 8.0, 7.0, 8.0, 9.0, 22.0, 30.0, 26.0, 32.0, 27.0, 39.0, 26.0, 46.0, 43.0, 43.0, 37.0, 33.0, 35.0, 55.0, 47.0, 39.0, 30.0, 38.0, 32.0, 24.0, 25.0, 27.0, 27.0, 19.0, 19.0, 23.0, 19.0, 11.0, 9.0, 8.0, 11.0, 8.0, 2.0, 7.0, 1.0, 1.0, 3.0, 2.0, 0.0, 2.0, 2.0, 1.0, 0.0, 2.0], "bins": [-7.8828125, -7.63946533203125, -7.3961181640625, -7.15277099609375, -6.909423828125, -6.66607666015625, -6.4227294921875, -6.17938232421875, -5.93603515625, -5.69268798828125, -5.4493408203125, -5.20599365234375, -4.962646484375, -4.71929931640625, -4.4759521484375, -4.23260498046875, -3.9892578125, -3.74591064453125, -3.5025634765625, -3.25921630859375, -3.015869140625, -2.77252197265625, -2.5291748046875, -2.28582763671875, -2.04248046875, -1.79913330078125, -1.5557861328125, -1.31243896484375, -1.069091796875, -0.82574462890625, -0.5823974609375, -0.33905029296875, -0.095703125, 0.14764404296875, 0.3909912109375, 0.63433837890625, 0.877685546875, 1.12103271484375, 1.3643798828125, 1.60772705078125, 1.85107421875, 2.09442138671875, 2.3377685546875, 2.58111572265625, 2.824462890625, 3.06781005859375, 3.3111572265625, 3.55450439453125, 3.7978515625, 4.04119873046875, 4.2845458984375, 4.52789306640625, 4.771240234375, 5.01458740234375, 5.2579345703125, 5.50128173828125, 5.74462890625, 5.98797607421875, 6.2313232421875, 6.47467041015625, 6.718017578125, 6.96136474609375, 7.2047119140625, 7.44805908203125, 7.69140625]}, "gradients/decoder.transformer.h.8.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 3.0, 2.0, 5.0, 8.0, 9.0, 14.0, 9.0, 18.0, 24.0, 22.0, 33.0, 56.0, 51.0, 73.0, 79.0, 139.0, 217.0, 286.0, 475.0, 972.0, 2003.0, 5211.0, 15737.0, 60859.0, 267478.0, 916277.0, 1588448.0, 964583.0, 280282.0, 64495.0, 16556.0, 5245.0, 2099.0, 991.0, 505.0, 283.0, 175.0, 132.0, 107.0, 78.0, 48.0, 57.0, 33.0, 27.0, 20.0, 22.0, 13.0, 8.0, 11.0, 8.0, 4.0, 5.0, 2.0, 0.0, 2.0, 1.0, 0.0, 1.0], "bins": [-9.015625, -8.7384033203125, -8.461181640625, -8.1839599609375, -7.90673828125, -7.6295166015625, -7.352294921875, -7.0750732421875, -6.7978515625, -6.5206298828125, -6.243408203125, -5.9661865234375, -5.68896484375, -5.4117431640625, -5.134521484375, -4.8572998046875, -4.580078125, -4.3028564453125, -4.025634765625, -3.7484130859375, -3.47119140625, -3.1939697265625, -2.916748046875, -2.6395263671875, -2.3623046875, -2.0850830078125, -1.807861328125, -1.5306396484375, -1.25341796875, -0.9761962890625, -0.698974609375, -0.4217529296875, -0.14453125, 0.1326904296875, 0.409912109375, 0.6871337890625, 0.96435546875, 1.2415771484375, 1.518798828125, 1.7960205078125, 2.0732421875, 2.3504638671875, 2.627685546875, 2.9049072265625, 3.18212890625, 3.4593505859375, 3.736572265625, 4.0137939453125, 4.291015625, 4.5682373046875, 4.845458984375, 5.1226806640625, 5.39990234375, 5.6771240234375, 5.954345703125, 6.2315673828125, 6.5087890625, 6.7860107421875, 7.063232421875, 7.3404541015625, 7.61767578125, 7.8948974609375, 8.172119140625, 8.4493408203125, 8.7265625]}, "gradients/decoder.transformer.h.8.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 4.0, 6.0, 10.0, 10.0, 11.0, 12.0, 21.0, 36.0, 35.0, 47.0, 55.0, 56.0, 109.0, 123.0, 135.0, 177.0, 241.0, 347.0, 417.0, 442.0, 401.0, 318.0, 239.0, 156.0, 132.0, 125.0, 92.0, 70.0, 66.0, 35.0, 42.0, 29.0, 24.0, 17.0, 13.0, 10.0, 9.0, 3.0, 2.0, 1.0, 2.0, 0.0, 1.0, 2.0, 0.0, 2.0, 2.0], "bins": [-15.5546875, -15.127197265625, -14.69970703125, -14.272216796875, -13.8447265625, -13.417236328125, -12.98974609375, -12.562255859375, -12.134765625, -11.707275390625, -11.27978515625, -10.852294921875, -10.4248046875, -9.997314453125, -9.56982421875, -9.142333984375, -8.71484375, -8.287353515625, -7.85986328125, -7.432373046875, -7.0048828125, -6.577392578125, -6.14990234375, -5.722412109375, -5.294921875, -4.867431640625, -4.43994140625, -4.012451171875, -3.5849609375, -3.157470703125, -2.72998046875, -2.302490234375, -1.875, -1.447509765625, -1.02001953125, -0.592529296875, -0.1650390625, 0.262451171875, 0.68994140625, 1.117431640625, 1.544921875, 1.972412109375, 2.39990234375, 2.827392578125, 3.2548828125, 3.682373046875, 4.10986328125, 4.537353515625, 4.96484375, 5.392333984375, 5.81982421875, 6.247314453125, 6.6748046875, 7.102294921875, 7.52978515625, 7.957275390625, 8.384765625, 8.812255859375, 9.23974609375, 9.667236328125, 10.0947265625, 10.522216796875, 10.94970703125, 11.377197265625, 11.8046875]}, "gradients/decoder.transformer.h.8.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 5.0, 3.0, 2.0, 5.0, 4.0, 7.0, 13.0, 14.0, 15.0, 27.0, 32.0, 33.0, 48.0, 66.0, 80.0, 92.0, 106.0, 129.0, 161.0, 170.0, 220.0, 283.0, 454.0, 1202.0, 15389.0, 4112405.0, 59593.0, 1613.0, 553.0, 295.0, 244.0, 207.0, 152.0, 126.0, 104.0, 86.0, 85.0, 66.0, 60.0, 26.0, 31.0, 24.0, 20.0, 12.0, 8.0, 7.0, 9.0, 4.0, 1.0, 3.0, 3.0, 2.0, 1.0, 0.0, 1.0], "bins": [-72.8125, -70.70849609375, -68.6044921875, -66.50048828125, -64.396484375, -62.29248046875, -60.1884765625, -58.08447265625, -55.98046875, -53.87646484375, -51.7724609375, -49.66845703125, -47.564453125, -45.46044921875, -43.3564453125, -41.25244140625, -39.1484375, -37.04443359375, -34.9404296875, -32.83642578125, -30.732421875, -28.62841796875, -26.5244140625, -24.42041015625, -22.31640625, -20.21240234375, -18.1083984375, -16.00439453125, -13.900390625, -11.79638671875, -9.6923828125, -7.58837890625, -5.484375, -3.38037109375, -1.2763671875, 0.82763671875, 2.931640625, 5.03564453125, 7.1396484375, 9.24365234375, 11.34765625, 13.45166015625, 15.5556640625, 17.65966796875, 19.763671875, 21.86767578125, 23.9716796875, 26.07568359375, 28.1796875, 30.28369140625, 32.3876953125, 34.49169921875, 36.595703125, 38.69970703125, 40.8037109375, 42.90771484375, 45.01171875, 47.11572265625, 49.2197265625, 51.32373046875, 53.427734375, 55.53173828125, 57.6357421875, 59.73974609375, 61.84375]}, "gradients/decoder.transformer.h.8.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 82.0, 581.0, 336.0, 15.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-274.33038330078125, -265.9565734863281, -257.5827941894531, -249.208984375, -240.83517456054688, -232.4613800048828, -224.08758544921875, -215.71377563476562, -207.3399658203125, -198.96617126464844, -190.5923614501953, -182.21856689453125, -173.84475708007812, -165.47096252441406, -157.09716796875, -148.72335815429688, -140.3495635986328, -131.97576904296875, -123.60195922851562, -115.22816467285156, -106.85435485839844, -98.48056030273438, -90.10675811767578, -81.73295593261719, -73.3591537475586, -64.9853515625, -56.611549377441406, -48.23775100708008, -39.863948822021484, -31.49014663696289, -23.116348266601562, -14.742546081542969, -6.368743896484375, 2.0050573348999023, 10.37885856628418, 18.75265884399414, 27.126461029052734, 35.50026321411133, 43.874061584472656, 52.24786376953125, 60.621665954589844, 68.99546813964844, 77.36927032470703, 85.74307250976562, 94.11686706542969, 102.49067687988281, 110.86447143554688, 119.23827362060547, 127.61207580566406, 135.98587036132812, 144.35968017578125, 152.7334747314453, 161.10728454589844, 169.4810791015625, 177.85488891601562, 186.2286834716797, 194.60247802734375, 202.9762725830078, 211.35008239746094, 219.723876953125, 228.09768676757812, 236.4714813232422, 244.84527587890625, 253.21908569335938, 261.5928955078125]}, "gradients/decoder.transformer.h.8.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 2.0, 1.0, 1.0, 6.0, 10.0, 7.0, 10.0, 8.0, 13.0, 13.0, 10.0, 18.0, 26.0, 20.0, 25.0, 25.0, 33.0, 22.0, 22.0, 38.0, 36.0, 37.0, 33.0, 44.0, 46.0, 38.0, 40.0, 37.0, 36.0, 35.0, 39.0, 31.0, 25.0, 22.0, 30.0, 23.0, 20.0, 20.0, 20.0, 18.0, 13.0, 13.0, 14.0, 1.0, 8.0, 5.0, 5.0, 2.0, 3.0, 3.0, 1.0, 1.0, 3.0, 3.0, 3.0, 1.0], "bins": [-51.42364501953125, -49.89433670043945, -48.36502456665039, -46.835716247558594, -45.30640411376953, -43.777095794677734, -42.24778366088867, -40.718475341796875, -39.18916320800781, -37.659854888916016, -36.13054275512695, -34.601234436035156, -33.071922302246094, -31.542612075805664, -30.013301849365234, -28.483993530273438, -26.954683303833008, -25.425373077392578, -23.89606285095215, -22.36675262451172, -20.83744239807129, -19.30813217163086, -17.778823852539062, -16.24951171875, -14.720202445983887, -13.190892219543457, -11.661581993103027, -10.132272720336914, -8.602962493896484, -7.0736517906188965, -5.544342041015625, -4.015031814575195, -2.4857215881347656, -0.9564114809036255, 0.5728986263275146, 2.1022086143493652, 3.631518840789795, 5.160829067230225, 6.690138816833496, 8.219449043273926, 9.748759269714355, 11.278069496154785, 12.807379722595215, 14.336688995361328, 15.865999221801758, 17.395309448242188, 18.924619674682617, 20.453929901123047, 21.983240127563477, 23.512550354003906, 25.041860580444336, 26.571170806884766, 28.100481033325195, 29.629791259765625, 31.159099578857422, 32.688411712646484, 34.21772003173828, 35.74702835083008, 37.27634048461914, 38.80564880371094, 40.3349609375, 41.8642692565918, 43.39358139038086, 44.922889709472656, 46.45220184326172]}, "gradients/decoder.transformer.h.8.crossattention.c_proj.bias": {"_type": "histogram", "values": [5.0, 2.0, 1.0, 4.0, 1.0, 5.0, 5.0, 3.0, 4.0, 5.0, 4.0, 4.0, 7.0, 7.0, 8.0, 14.0, 8.0, 12.0, 24.0, 18.0, 22.0, 22.0, 21.0, 34.0, 25.0, 33.0, 30.0, 38.0, 49.0, 44.0, 42.0, 26.0, 41.0, 46.0, 40.0, 38.0, 32.0, 30.0, 24.0, 31.0, 23.0, 17.0, 33.0, 17.0, 17.0, 15.0, 13.0, 15.0, 13.0, 11.0, 9.0, 1.0, 5.0, 9.0, 3.0, 2.0, 2.0, 1.0, 2.0, 1.0, 1.0, 2.0, 1.0, 2.0], "bins": [-6.63671875, -6.42572021484375, -6.2147216796875, -6.00372314453125, -5.792724609375, -5.58172607421875, -5.3707275390625, -5.15972900390625, -4.94873046875, -4.73773193359375, -4.5267333984375, -4.31573486328125, -4.104736328125, -3.89373779296875, -3.6827392578125, -3.47174072265625, -3.2607421875, -3.04974365234375, -2.8387451171875, -2.62774658203125, -2.416748046875, -2.20574951171875, -1.9947509765625, -1.78375244140625, -1.57275390625, -1.36175537109375, -1.1507568359375, -0.93975830078125, -0.728759765625, -0.51776123046875, -0.3067626953125, -0.09576416015625, 0.115234375, 0.32623291015625, 0.5372314453125, 0.74822998046875, 0.959228515625, 1.17022705078125, 1.3812255859375, 1.59222412109375, 1.80322265625, 2.01422119140625, 2.2252197265625, 2.43621826171875, 2.647216796875, 2.85821533203125, 3.0692138671875, 3.28021240234375, 3.4912109375, 3.70220947265625, 3.9132080078125, 4.12420654296875, 4.335205078125, 4.54620361328125, 4.7572021484375, 4.96820068359375, 5.17919921875, 5.39019775390625, 5.6011962890625, 5.81219482421875, 6.023193359375, 6.23419189453125, 6.4451904296875, 6.65618896484375, 6.8671875]}, "gradients/decoder.transformer.h.8.crossattention.c_proj.weight": {"_type": "histogram", "values": [5.0, 2.0, 6.0, 13.0, 5.0, 16.0, 13.0, 21.0, 30.0, 33.0, 48.0, 79.0, 127.0, 184.0, 261.0, 325.0, 519.0, 685.0, 935.0, 1343.0, 1910.0, 2746.0, 3933.0, 5871.0, 8534.0, 12790.0, 18987.0, 28832.0, 45174.0, 76976.0, 160881.0, 331865.0, 146180.0, 71991.0, 43002.0, 27339.0, 18069.0, 12004.0, 8241.0, 5630.0, 3963.0, 2680.0, 1922.0, 1244.0, 1006.0, 645.0, 457.0, 340.0, 217.0, 142.0, 109.0, 73.0, 52.0, 34.0, 23.0, 19.0, 13.0, 7.0, 8.0, 7.0, 5.0, 2.0, 1.0, 2.0], "bins": [-1.6875, -1.6338348388671875, -1.580169677734375, -1.5265045166015625, -1.47283935546875, -1.4191741943359375, -1.365509033203125, -1.3118438720703125, -1.2581787109375, -1.2045135498046875, -1.150848388671875, -1.0971832275390625, -1.04351806640625, -0.9898529052734375, -0.936187744140625, -0.8825225830078125, -0.828857421875, -0.7751922607421875, -0.721527099609375, -0.6678619384765625, -0.61419677734375, -0.5605316162109375, -0.506866455078125, -0.4532012939453125, -0.3995361328125, -0.3458709716796875, -0.292205810546875, -0.2385406494140625, -0.18487548828125, -0.1312103271484375, -0.077545166015625, -0.0238800048828125, 0.02978515625, 0.0834503173828125, 0.137115478515625, 0.1907806396484375, 0.24444580078125, 0.2981109619140625, 0.351776123046875, 0.4054412841796875, 0.4591064453125, 0.5127716064453125, 0.566436767578125, 0.6201019287109375, 0.67376708984375, 0.7274322509765625, 0.781097412109375, 0.8347625732421875, 0.888427734375, 0.9420928955078125, 0.995758056640625, 1.0494232177734375, 1.10308837890625, 1.1567535400390625, 1.210418701171875, 1.2640838623046875, 1.3177490234375, 1.3714141845703125, 1.425079345703125, 1.4787445068359375, 1.53240966796875, 1.5860748291015625, 1.639739990234375, 1.6934051513671875, 1.7470703125]}, "gradients/decoder.transformer.h.8.crossattention.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 3.0, 4.0, 3.0, 5.0, 1.0, 5.0, 8.0, 6.0, 4.0, 9.0, 12.0, 17.0, 18.0, 19.0, 18.0, 28.0, 36.0, 34.0, 28.0, 41.0, 34.0, 33.0, 47.0, 47.0, 50.0, 1067.0, 38.0, 42.0, 35.0, 36.0, 33.0, 27.0, 29.0, 28.0, 22.0, 22.0, 27.0, 18.0, 19.0, 15.0, 19.0, 8.0, 14.0, 5.0, 5.0, 4.0, 5.0, 4.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 3.0], "bins": [-4.94140625, -4.79248046875, -4.6435546875, -4.49462890625, -4.345703125, -4.19677734375, -4.0478515625, -3.89892578125, -3.75, -3.60107421875, -3.4521484375, -3.30322265625, -3.154296875, -3.00537109375, -2.8564453125, -2.70751953125, -2.55859375, -2.40966796875, -2.2607421875, -2.11181640625, -1.962890625, -1.81396484375, -1.6650390625, -1.51611328125, -1.3671875, -1.21826171875, -1.0693359375, -0.92041015625, -0.771484375, -0.62255859375, -0.4736328125, -0.32470703125, -0.17578125, -0.02685546875, 0.1220703125, 0.27099609375, 0.419921875, 0.56884765625, 0.7177734375, 0.86669921875, 1.015625, 1.16455078125, 1.3134765625, 1.46240234375, 1.611328125, 1.76025390625, 1.9091796875, 2.05810546875, 2.20703125, 2.35595703125, 2.5048828125, 2.65380859375, 2.802734375, 2.95166015625, 3.1005859375, 3.24951171875, 3.3984375, 3.54736328125, 3.6962890625, 3.84521484375, 3.994140625, 4.14306640625, 4.2919921875, 4.44091796875, 4.58984375]}, "gradients/decoder.transformer.h.8.crossattention.c_attn.weight": {"_type": "histogram", "values": [3.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 2.0, 5.0, 5.0, 13.0, 9.0, 9.0, 32.0, 30.0, 61.0, 77.0, 127.0, 174.0, 351.0, 511.0, 817.0, 1429.0, 2440.0, 4498.0, 7868.0, 14701.0, 28080.0, 56392.0, 119840.0, 1440580.0, 234057.0, 91266.0, 44079.0, 22548.0, 11882.0, 6500.0, 3573.0, 2099.0, 1198.0, 736.0, 443.0, 249.0, 150.0, 99.0, 77.0, 51.0, 19.0, 14.0, 14.0, 9.0, 7.0, 6.0, 3.0, 3.0, 4.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0], "bins": [-2.515625, -2.43402099609375, -2.3524169921875, -2.27081298828125, -2.189208984375, -2.10760498046875, -2.0260009765625, -1.94439697265625, -1.86279296875, -1.78118896484375, -1.6995849609375, -1.61798095703125, -1.536376953125, -1.45477294921875, -1.3731689453125, -1.29156494140625, -1.2099609375, -1.12835693359375, -1.0467529296875, -0.96514892578125, -0.883544921875, -0.80194091796875, -0.7203369140625, -0.63873291015625, -0.55712890625, -0.47552490234375, -0.3939208984375, -0.31231689453125, -0.230712890625, -0.14910888671875, -0.0675048828125, 0.01409912109375, 0.095703125, 0.17730712890625, 0.2589111328125, 0.34051513671875, 0.422119140625, 0.50372314453125, 0.5853271484375, 0.66693115234375, 0.74853515625, 0.83013916015625, 0.9117431640625, 0.99334716796875, 1.074951171875, 1.15655517578125, 1.2381591796875, 1.31976318359375, 1.4013671875, 1.48297119140625, 1.5645751953125, 1.64617919921875, 1.727783203125, 1.80938720703125, 1.8909912109375, 1.97259521484375, 2.05419921875, 2.13580322265625, 2.2174072265625, 2.29901123046875, 2.380615234375, 2.46221923828125, 2.5438232421875, 2.62542724609375, 2.70703125]}, "gradients/decoder.transformer.h.8.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 2.0, 3.0, 4.0, 3.0, 3.0, 6.0, 18.0, 11.0, 12.0, 15.0, 19.0, 13.0, 32.0, 30.0, 36.0, 31.0, 59.0, 52.0, 62.0, 80.0, 70.0, 84.0, 55.0, 48.0, 45.0, 40.0, 35.0, 17.0, 18.0, 17.0, 20.0, 17.0, 6.0, 16.0, 3.0, 6.0, 8.0, 3.0, 1.0, 3.0, 3.0, 2.0, 3.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.0011148452758789062, -0.0010750442743301392, -0.001035243272781372, -0.000995442271232605, -0.0009556412696838379, -0.0009158402681350708, -0.0008760392665863037, -0.0008362382650375366, -0.0007964372634887695, -0.0007566362619400024, -0.0007168352603912354, -0.0006770342588424683, -0.0006372332572937012, -0.0005974322557449341, -0.000557631254196167, -0.0005178302526473999, -0.0004780292510986328, -0.0004382282495498657, -0.00039842724800109863, -0.00035862624645233154, -0.00031882524490356445, -0.00027902424335479736, -0.00023922324180603027, -0.00019942224025726318, -0.0001596212387084961, -0.000119820237159729, -8.001923561096191e-05, -4.0218234062194824e-05, -4.172325134277344e-07, 3.9383769035339355e-05, 7.918477058410645e-05, 0.00011898577213287354, 0.00015878677368164062, 0.00019858777523040771, 0.0002383887767791748, 0.0002781897783279419, 0.000317990779876709, 0.0003577917814254761, 0.00039759278297424316, 0.00043739378452301025, 0.00047719478607177734, 0.0005169957876205444, 0.0005567967891693115, 0.0005965977907180786, 0.0006363987922668457, 0.0006761997938156128, 0.0007160007953643799, 0.000755801796913147, 0.0007956027984619141, 0.0008354038000106812, 0.0008752048015594482, 0.0009150058031082153, 0.0009548068046569824, 0.0009946078062057495, 0.0010344088077545166, 0.0010742098093032837, 0.0011140108108520508, 0.0011538118124008179, 0.001193612813949585, 0.001233413815498352, 0.0012732148170471191, 0.0013130158185958862, 0.0013528168201446533, 0.0013926178216934204, 0.0014324188232421875]}, "gradients/decoder.transformer.h.8.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 5.0, 0.0, 2.0, 5.0, 2.0, 2.0, 6.0, 7.0, 4.0, 6.0, 11.0, 20.0, 15.0, 23.0, 22.0, 24.0, 35.0, 37.0, 71.0, 86.0, 174.0, 301.0, 1018.0, 911604.0, 133594.0, 760.0, 249.0, 139.0, 84.0, 61.0, 39.0, 35.0, 26.0, 19.0, 24.0, 9.0, 10.0, 8.0, 10.0, 7.0, 3.0, 4.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0], "bins": [-0.036529541015625, -0.035539865493774414, -0.03455018997192383, -0.03356051445007324, -0.032570838928222656, -0.03158116340637207, -0.030591487884521484, -0.0296018123626709, -0.028612136840820312, -0.027622461318969727, -0.02663278579711914, -0.025643110275268555, -0.02465343475341797, -0.023663759231567383, -0.022674083709716797, -0.02168440818786621, -0.020694732666015625, -0.01970505714416504, -0.018715381622314453, -0.017725706100463867, -0.01673603057861328, -0.015746355056762695, -0.01475667953491211, -0.013767004013061523, -0.012777328491210938, -0.011787652969360352, -0.010797977447509766, -0.00980830192565918, -0.008818626403808594, -0.007828950881958008, -0.006839275360107422, -0.005849599838256836, -0.00485992431640625, -0.003870248794555664, -0.002880573272705078, -0.0018908977508544922, -0.0009012222290039062, 8.845329284667969e-05, 0.0010781288146972656, 0.0020678043365478516, 0.0030574798583984375, 0.0040471553802490234, 0.005036830902099609, 0.006026506423950195, 0.007016181945800781, 0.008005857467651367, 0.008995532989501953, 0.009985208511352539, 0.010974884033203125, 0.011964559555053711, 0.012954235076904297, 0.013943910598754883, 0.014933586120605469, 0.015923261642456055, 0.01691293716430664, 0.017902612686157227, 0.018892288208007812, 0.0198819637298584, 0.020871639251708984, 0.02186131477355957, 0.022850990295410156, 0.023840665817260742, 0.024830341339111328, 0.025820016860961914, 0.0268096923828125]}, "gradients/decoder.transformer.h.8.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 179.0, 785.0, 51.0, 2.0, 0.0, 0.0, 1.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0006613454315811396, -0.0005097120301797986, -0.00035807868698611856, -0.00020644531468860805, -5.4811942391097546e-05, 9.682145901024342e-05, 0.00024845480220392346, 0.0004000881453976035, 0.0005517215467989445, 0.0007033549482002854, 0.0008549882913939655, 0.0010066216345876455, 0.0011582550359889865, 0.0013098884373903275, 0.0014615217223763466, 0.0016131551237776875, 0.0017647885251790285, 0.0019164219265803695, 0.0020680553279817104, 0.0022196886129677296, 0.0023713218979537487, 0.0025229554157704115, 0.0026745887007564306, 0.0028262222185730934, 0.0029778555035591125, 0.0031294887885451317, 0.0032811223063617945, 0.0034327555913478136, 0.0035843891091644764, 0.0037360223941504955, 0.0038876556791365147, 0.004039288964122534, 0.00419092271476984, 0.004342555999755859, 0.0044941892847418785, 0.004645823035389185, 0.004797456320375204, 0.004949089605361223, 0.005100722890347242, 0.0052523561753332615, 0.005403989925980568, 0.005555623210966587, 0.005707256495952606, 0.005858890246599913, 0.006010523531585932, 0.006162156816571951, 0.00631379010155797, 0.006465423386543989, 0.006617056671530008, 0.0067686899565160275, 0.006920323241502047, 0.007071956992149353, 0.007223590277135372, 0.007375223562121391, 0.00752685684710741, 0.0076784901320934296, 0.007830123417079449, 0.007981756702065468, 0.008133389987051487, 0.008285023272037506, 0.008436656557023525, 0.008588289842009544, 0.008739924058318138, 0.008891557343304157, 0.009043190628290176]}, "gradients/decoder.transformer.h.8.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 4.0, 4.0, 2.0, 3.0, 4.0, 7.0, 10.0, 7.0, 13.0, 8.0, 12.0, 22.0, 15.0, 23.0, 29.0, 29.0, 34.0, 34.0, 29.0, 27.0, 31.0, 41.0, 37.0, 31.0, 38.0, 35.0, 38.0, 50.0, 41.0, 42.0, 33.0, 29.0, 43.0, 28.0, 22.0, 25.0, 23.0, 25.0, 12.0, 15.0, 12.0, 15.0, 4.0, 6.0, 10.0, 2.0, 3.0, 4.0, 2.0, 0.0, 3.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.0006281137466430664, -0.0006086360663175583, -0.0005891583859920502, -0.000569680705666542, -0.0005502030253410339, -0.0005307253450155258, -0.0005112476646900177, -0.0004917699843645096, -0.00047229230403900146, -0.00045281462371349335, -0.00043333694338798523, -0.0004138592630624771, -0.000394381582736969, -0.0003749039024114609, -0.00035542622208595276, -0.00033594854176044464, -0.0003164708614349365, -0.0002969931811094284, -0.0002775155007839203, -0.00025803782045841217, -0.00023856014013290405, -0.00021908245980739594, -0.00019960477948188782, -0.0001801270991563797, -0.00016064941883087158, -0.00014117173850536346, -0.00012169405817985535, -0.00010221637785434723, -8.273869752883911e-05, -6.3261017203331e-05, -4.3783336877822876e-05, -2.4305656552314758e-05, -4.827976226806641e-06, 1.4649704098701477e-05, 3.4127384424209595e-05, 5.360506474971771e-05, 7.308274507522583e-05, 9.256042540073395e-05, 0.00011203810572624207, 0.00013151578605175018, 0.0001509934663772583, 0.00017047114670276642, 0.00018994882702827454, 0.00020942650735378265, 0.00022890418767929077, 0.0002483818680047989, 0.000267859548330307, 0.0002873372286558151, 0.00030681490898132324, 0.00032629258930683136, 0.0003457702696323395, 0.0003652479499578476, 0.0003847256302833557, 0.00040420331060886383, 0.00042368099093437195, 0.00044315867125988007, 0.0004626363515853882, 0.0004821140319108963, 0.0005015917122364044, 0.0005210693925619125, 0.0005405470728874207, 0.0005600247532129288, 0.0005795024335384369, 0.000598980113863945, 0.0006184577941894531]}, "gradients/decoder.transformer.h.8.attn.c_proj.bias": {"_type": "histogram", "values": [5.0, 2.0, 1.0, 4.0, 1.0, 5.0, 5.0, 3.0, 4.0, 5.0, 4.0, 4.0, 7.0, 7.0, 8.0, 14.0, 8.0, 12.0, 24.0, 18.0, 22.0, 22.0, 21.0, 34.0, 25.0, 33.0, 30.0, 38.0, 49.0, 44.0, 42.0, 26.0, 41.0, 46.0, 40.0, 38.0, 32.0, 30.0, 24.0, 31.0, 23.0, 17.0, 33.0, 17.0, 17.0, 15.0, 13.0, 15.0, 13.0, 11.0, 9.0, 1.0, 5.0, 9.0, 3.0, 2.0, 2.0, 1.0, 2.0, 1.0, 1.0, 2.0, 1.0, 2.0], "bins": [-6.63671875, -6.42572021484375, -6.2147216796875, -6.00372314453125, -5.792724609375, -5.58172607421875, -5.3707275390625, -5.15972900390625, -4.94873046875, -4.73773193359375, -4.5267333984375, -4.31573486328125, -4.104736328125, -3.89373779296875, -3.6827392578125, -3.47174072265625, -3.2607421875, -3.04974365234375, -2.8387451171875, -2.62774658203125, -2.416748046875, -2.20574951171875, -1.9947509765625, -1.78375244140625, -1.57275390625, -1.36175537109375, -1.1507568359375, -0.93975830078125, -0.728759765625, -0.51776123046875, -0.3067626953125, -0.09576416015625, 0.115234375, 0.32623291015625, 0.5372314453125, 0.74822998046875, 0.959228515625, 1.17022705078125, 1.3812255859375, 1.59222412109375, 1.80322265625, 2.01422119140625, 2.2252197265625, 2.43621826171875, 2.647216796875, 2.85821533203125, 3.0692138671875, 3.28021240234375, 3.4912109375, 3.70220947265625, 3.9132080078125, 4.12420654296875, 4.335205078125, 4.54620361328125, 4.7572021484375, 4.96820068359375, 5.17919921875, 5.39019775390625, 5.6011962890625, 5.81219482421875, 6.023193359375, 6.23419189453125, 6.4451904296875, 6.65618896484375, 6.8671875]}, "gradients/decoder.transformer.h.8.attn.c_proj.weight": {"_type": "histogram", "values": [3.0, 1.0, 3.0, 5.0, 3.0, 5.0, 12.0, 6.0, 6.0, 7.0, 8.0, 8.0, 9.0, 14.0, 15.0, 24.0, 38.0, 35.0, 60.0, 86.0, 130.0, 208.0, 358.0, 634.0, 1267.0, 2478.0, 5356.0, 11712.0, 25965.0, 59415.0, 150985.0, 353505.0, 261342.0, 98953.0, 41463.0, 18258.0, 8317.0, 3722.0, 1818.0, 917.0, 519.0, 327.0, 161.0, 119.0, 68.0, 57.0, 34.0, 30.0, 20.0, 20.0, 15.0, 10.0, 9.0, 8.0, 5.0, 2.0, 5.0, 2.0, 2.0, 4.0, 3.0, 0.0, 3.0, 2.0], "bins": [-5.421875, -5.25128173828125, -5.0806884765625, -4.91009521484375, -4.739501953125, -4.56890869140625, -4.3983154296875, -4.22772216796875, -4.05712890625, -3.88653564453125, -3.7159423828125, -3.54534912109375, -3.374755859375, -3.20416259765625, -3.0335693359375, -2.86297607421875, -2.6923828125, -2.52178955078125, -2.3511962890625, -2.18060302734375, -2.010009765625, -1.83941650390625, -1.6688232421875, -1.49822998046875, -1.32763671875, -1.15704345703125, -0.9864501953125, -0.81585693359375, -0.645263671875, -0.47467041015625, -0.3040771484375, -0.13348388671875, 0.037109375, 0.20770263671875, 0.3782958984375, 0.54888916015625, 0.719482421875, 0.89007568359375, 1.0606689453125, 1.23126220703125, 1.40185546875, 1.57244873046875, 1.7430419921875, 1.91363525390625, 2.084228515625, 2.25482177734375, 2.4254150390625, 2.59600830078125, 2.7666015625, 2.93719482421875, 3.1077880859375, 3.27838134765625, 3.448974609375, 3.61956787109375, 3.7901611328125, 3.96075439453125, 4.13134765625, 4.30194091796875, 4.4725341796875, 4.64312744140625, 4.813720703125, 4.98431396484375, 5.1549072265625, 5.32550048828125, 5.49609375]}, "gradients/decoder.transformer.h.8.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 2.0, 1.0, 1.0, 2.0, 2.0, 2.0, 4.0, 2.0, 4.0, 6.0, 13.0, 6.0, 6.0, 13.0, 15.0, 18.0, 12.0, 17.0, 19.0, 21.0, 35.0, 26.0, 29.0, 40.0, 28.0, 48.0, 65.0, 92.0, 374.0, 1511.0, 140.0, 71.0, 59.0, 37.0, 31.0, 38.0, 33.0, 29.0, 23.0, 28.0, 35.0, 10.0, 15.0, 15.0, 17.0, 13.0, 14.0, 12.0, 9.0, 5.0, 2.0, 6.0, 2.0, 3.0, 3.0, 2.0, 0.0, 2.0, 0.0, 1.0, 0.0, 1.0], "bins": [-21.140625, -20.461669921875, -19.78271484375, -19.103759765625, -18.4248046875, -17.745849609375, -17.06689453125, -16.387939453125, -15.708984375, -15.030029296875, -14.35107421875, -13.672119140625, -12.9931640625, -12.314208984375, -11.63525390625, -10.956298828125, -10.27734375, -9.598388671875, -8.91943359375, -8.240478515625, -7.5615234375, -6.882568359375, -6.20361328125, -5.524658203125, -4.845703125, -4.166748046875, -3.48779296875, -2.808837890625, -2.1298828125, -1.450927734375, -0.77197265625, -0.093017578125, 0.5859375, 1.264892578125, 1.94384765625, 2.622802734375, 3.3017578125, 3.980712890625, 4.65966796875, 5.338623046875, 6.017578125, 6.696533203125, 7.37548828125, 8.054443359375, 8.7333984375, 9.412353515625, 10.09130859375, 10.770263671875, 11.44921875, 12.128173828125, 12.80712890625, 13.486083984375, 14.1650390625, 14.843994140625, 15.52294921875, 16.201904296875, 16.880859375, 17.559814453125, 18.23876953125, 18.917724609375, 19.5966796875, 20.275634765625, 20.95458984375, 21.633544921875, 22.3125]}, "gradients/decoder.transformer.h.8.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 3.0, 5.0, 5.0, 2.0, 4.0, 10.0, 13.0, 10.0, 18.0, 16.0, 27.0, 18.0, 31.0, 39.0, 51.0, 71.0, 85.0, 88.0, 114.0, 202.0, 247.0, 353.0, 629.0, 1614.0, 54785.0, 3047222.0, 36842.0, 1462.0, 568.0, 315.0, 185.0, 139.0, 114.0, 100.0, 70.0, 51.0, 43.0, 30.0, 32.0, 27.0, 17.0, 12.0, 8.0, 10.0, 8.0, 8.0, 3.0, 3.0, 3.0, 3.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 2.0], "bins": [-36.46875, -35.34521484375, -34.2216796875, -33.09814453125, -31.974609375, -30.85107421875, -29.7275390625, -28.60400390625, -27.48046875, -26.35693359375, -25.2333984375, -24.10986328125, -22.986328125, -21.86279296875, -20.7392578125, -19.61572265625, -18.4921875, -17.36865234375, -16.2451171875, -15.12158203125, -13.998046875, -12.87451171875, -11.7509765625, -10.62744140625, -9.50390625, -8.38037109375, -7.2568359375, -6.13330078125, -5.009765625, -3.88623046875, -2.7626953125, -1.63916015625, -0.515625, 0.60791015625, 1.7314453125, 2.85498046875, 3.978515625, 5.10205078125, 6.2255859375, 7.34912109375, 8.47265625, 9.59619140625, 10.7197265625, 11.84326171875, 12.966796875, 14.09033203125, 15.2138671875, 16.33740234375, 17.4609375, 18.58447265625, 19.7080078125, 20.83154296875, 21.955078125, 23.07861328125, 24.2021484375, 25.32568359375, 26.44921875, 27.57275390625, 28.6962890625, 29.81982421875, 30.943359375, 32.06689453125, 33.1904296875, 34.31396484375, 35.4375]}, "gradients/decoder.transformer.h.8.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 6.0, 501.0, 503.0, 5.0, 2.0, 0.0, 0.0, 1.0], "bins": [-325.35748291015625, -319.746826171875, -314.13616943359375, -308.5254821777344, -302.9148254394531, -297.3041687011719, -291.6935119628906, -286.08282470703125, -280.47216796875, -274.86151123046875, -269.2508544921875, -263.6401672363281, -258.0295104980469, -252.41885375976562, -246.8081817626953, -241.19752502441406, -235.58685302734375, -229.9761962890625, -224.3655242919922, -218.75486755371094, -213.14419555664062, -207.53353881835938, -201.92286682128906, -196.3122100830078, -190.70155334472656, -185.0908966064453, -179.480224609375, -173.86956787109375, -168.25889587402344, -162.6482391357422, -157.03756713867188, -151.42691040039062, -145.8162384033203, -140.20558166503906, -134.59490966796875, -128.9842529296875, -123.37358093261719, -117.7629165649414, -112.15225219726562, -106.54159545898438, -100.93092346191406, -95.32025909423828, -89.7095947265625, -84.09893035888672, -78.48826599121094, -72.87760162353516, -67.26693725585938, -61.65627670288086, -56.045616149902344, -50.43495178222656, -44.82428741455078, -39.213623046875, -33.60295867919922, -27.99229621887207, -22.381633758544922, -16.77096939086914, -11.16030502319336, -5.549641132354736, 0.06102275848388672, 5.671686172485352, 11.282350540161133, 16.893014907836914, 22.503677368164062, 28.114341735839844, 33.725006103515625]}, "gradients/decoder.transformer.h.8.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 2.0, 1.0, 1.0, 4.0, 2.0, 7.0, 7.0, 9.0, 14.0, 8.0, 15.0, 15.0, 16.0, 21.0, 20.0, 32.0, 31.0, 36.0, 40.0, 41.0, 35.0, 41.0, 34.0, 50.0, 46.0, 39.0, 40.0, 40.0, 41.0, 41.0, 38.0, 36.0, 26.0, 20.0, 25.0, 25.0, 17.0, 12.0, 16.0, 12.0, 11.0, 9.0, 8.0, 9.0, 7.0, 4.0, 6.0, 2.0, 2.0, 1.0, 1.0, 1.0, 2.0], "bins": [-68.08250427246094, -66.18128204345703, -64.2800521850586, -62.37882614135742, -60.47760009765625, -58.57637405395508, -56.675148010253906, -54.77392578125, -52.87269592285156, -50.97146987915039, -49.07024383544922, -47.16901779174805, -45.267791748046875, -43.3665657043457, -41.46533966064453, -39.564117431640625, -37.66289138793945, -35.76166534423828, -33.86043930053711, -31.959213256835938, -30.057987213134766, -28.156761169433594, -26.255537033081055, -24.354310989379883, -22.45308494567871, -20.55185890197754, -18.650632858276367, -16.749408721923828, -14.84818172454834, -12.946955680847168, -11.045730590820312, -9.14450454711914, -7.243274688720703, -5.342048645019531, -3.4408230781555176, -1.539597511291504, 0.36162853240966797, 2.26285457611084, 4.164079666137695, 6.065305709838867, 7.966531753540039, 9.867757797241211, 11.768983840942383, 13.670208930969238, 15.57143497467041, 17.472660064697266, 19.373886108398438, 21.27511215209961, 23.17633819580078, 25.077564239501953, 26.978790283203125, 28.880016326904297, 30.78124237060547, 32.68246841430664, 34.58369445800781, 36.48491668701172, 38.386146545410156, 40.28737258911133, 42.1885986328125, 44.08982467651367, 45.991050720214844, 47.892276763916016, 49.79350280761719, 51.694725036621094, 53.595951080322266]}, "gradients/decoder.transformer.h.7.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 3.0, 2.0, 3.0, 3.0, 5.0, 3.0, 3.0, 7.0, 4.0, 7.0, 6.0, 5.0, 13.0, 11.0, 15.0, 15.0, 19.0, 10.0, 27.0, 26.0, 31.0, 32.0, 36.0, 45.0, 39.0, 49.0, 47.0, 41.0, 46.0, 44.0, 43.0, 31.0, 36.0, 35.0, 35.0, 32.0, 29.0, 24.0, 29.0, 19.0, 21.0, 14.0, 20.0, 8.0, 10.0, 7.0, 7.0, 2.0, 6.0, 5.0, 2.0, 2.0, 3.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-7.140625, -6.903564453125, -6.66650390625, -6.429443359375, -6.1923828125, -5.955322265625, -5.71826171875, -5.481201171875, -5.244140625, -5.007080078125, -4.77001953125, -4.532958984375, -4.2958984375, -4.058837890625, -3.82177734375, -3.584716796875, -3.34765625, -3.110595703125, -2.87353515625, -2.636474609375, -2.3994140625, -2.162353515625, -1.92529296875, -1.688232421875, -1.451171875, -1.214111328125, -0.97705078125, -0.739990234375, -0.5029296875, -0.265869140625, -0.02880859375, 0.208251953125, 0.4453125, 0.682373046875, 0.91943359375, 1.156494140625, 1.3935546875, 1.630615234375, 1.86767578125, 2.104736328125, 2.341796875, 2.578857421875, 2.81591796875, 3.052978515625, 3.2900390625, 3.527099609375, 3.76416015625, 4.001220703125, 4.23828125, 4.475341796875, 4.71240234375, 4.949462890625, 5.1865234375, 5.423583984375, 5.66064453125, 5.897705078125, 6.134765625, 6.371826171875, 6.60888671875, 6.845947265625, 7.0830078125, 7.320068359375, 7.55712890625, 7.794189453125, 8.03125]}, "gradients/decoder.transformer.h.7.mlp.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 4.0, 2.0, 1.0, 0.0, 5.0, 6.0, 2.0, 5.0, 4.0, 5.0, 5.0, 5.0, 11.0, 12.0, 17.0, 14.0, 16.0, 24.0, 19.0, 37.0, 52.0, 56.0, 69.0, 115.0, 203.0, 459.0, 1781.0, 16903.0, 874699.0, 3198204.0, 95244.0, 4642.0, 838.0, 282.0, 140.0, 86.0, 65.0, 52.0, 39.0, 28.0, 30.0, 16.0, 16.0, 17.0, 6.0, 19.0, 8.0, 9.0, 4.0, 1.0, 6.0, 6.0, 1.0, 4.0, 2.0, 0.0, 2.0, 1.0, 0.0, 0.0, 2.0, 1.0], "bins": [-26.078125, -25.243408203125, -24.40869140625, -23.573974609375, -22.7392578125, -21.904541015625, -21.06982421875, -20.235107421875, -19.400390625, -18.565673828125, -17.73095703125, -16.896240234375, -16.0615234375, -15.226806640625, -14.39208984375, -13.557373046875, -12.72265625, -11.887939453125, -11.05322265625, -10.218505859375, -9.3837890625, -8.549072265625, -7.71435546875, -6.879638671875, -6.044921875, -5.210205078125, -4.37548828125, -3.540771484375, -2.7060546875, -1.871337890625, -1.03662109375, -0.201904296875, 0.6328125, 1.467529296875, 2.30224609375, 3.136962890625, 3.9716796875, 4.806396484375, 5.64111328125, 6.475830078125, 7.310546875, 8.145263671875, 8.97998046875, 9.814697265625, 10.6494140625, 11.484130859375, 12.31884765625, 13.153564453125, 13.98828125, 14.822998046875, 15.65771484375, 16.492431640625, 17.3271484375, 18.161865234375, 18.99658203125, 19.831298828125, 20.666015625, 21.500732421875, 22.33544921875, 23.170166015625, 24.0048828125, 24.839599609375, 25.67431640625, 26.509033203125, 27.34375]}, "gradients/decoder.transformer.h.7.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 3.0, 1.0, 1.0, 2.0, 7.0, 10.0, 9.0, 18.0, 14.0, 26.0, 43.0, 55.0, 71.0, 121.0, 155.0, 222.0, 284.0, 437.0, 597.0, 515.0, 427.0, 289.0, 246.0, 150.0, 117.0, 81.0, 53.0, 36.0, 22.0, 29.0, 16.0, 9.0, 15.0, 3.0, 2.0, 0.0, 2.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-11.875, -11.381103515625, -10.88720703125, -10.393310546875, -9.8994140625, -9.405517578125, -8.91162109375, -8.417724609375, -7.923828125, -7.429931640625, -6.93603515625, -6.442138671875, -5.9482421875, -5.454345703125, -4.96044921875, -4.466552734375, -3.97265625, -3.478759765625, -2.98486328125, -2.490966796875, -1.9970703125, -1.503173828125, -1.00927734375, -0.515380859375, -0.021484375, 0.472412109375, 0.96630859375, 1.460205078125, 1.9541015625, 2.447998046875, 2.94189453125, 3.435791015625, 3.9296875, 4.423583984375, 4.91748046875, 5.411376953125, 5.9052734375, 6.399169921875, 6.89306640625, 7.386962890625, 7.880859375, 8.374755859375, 8.86865234375, 9.362548828125, 9.8564453125, 10.350341796875, 10.84423828125, 11.338134765625, 11.83203125, 12.325927734375, 12.81982421875, 13.313720703125, 13.8076171875, 14.301513671875, 14.79541015625, 15.289306640625, 15.783203125, 16.277099609375, 16.77099609375, 17.264892578125, 17.7587890625, 18.252685546875, 18.74658203125, 19.240478515625, 19.734375]}, "gradients/decoder.transformer.h.7.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 2.0, 2.0, 4.0, 4.0, 7.0, 9.0, 5.0, 17.0, 24.0, 24.0, 29.0, 43.0, 49.0, 74.0, 93.0, 115.0, 110.0, 159.0, 233.0, 410.0, 864.0, 17791.0, 4159051.0, 12882.0, 871.0, 389.0, 249.0, 183.0, 141.0, 94.0, 91.0, 50.0, 48.0, 37.0, 30.0, 21.0, 22.0, 16.0, 13.0, 11.0, 4.0, 8.0, 3.0, 6.0, 5.0, 2.0, 0.0, 0.0, 3.0, 1.0], "bins": [-99.625, -96.8935546875, -94.162109375, -91.4306640625, -88.69921875, -85.9677734375, -83.236328125, -80.5048828125, -77.7734375, -75.0419921875, -72.310546875, -69.5791015625, -66.84765625, -64.1162109375, -61.384765625, -58.6533203125, -55.921875, -53.1904296875, -50.458984375, -47.7275390625, -44.99609375, -42.2646484375, -39.533203125, -36.8017578125, -34.0703125, -31.3388671875, -28.607421875, -25.8759765625, -23.14453125, -20.4130859375, -17.681640625, -14.9501953125, -12.21875, -9.4873046875, -6.755859375, -4.0244140625, -1.29296875, 1.4384765625, 4.169921875, 6.9013671875, 9.6328125, 12.3642578125, 15.095703125, 17.8271484375, 20.55859375, 23.2900390625, 26.021484375, 28.7529296875, 31.484375, 34.2158203125, 36.947265625, 39.6787109375, 42.41015625, 45.1416015625, 47.873046875, 50.6044921875, 53.3359375, 56.0673828125, 58.798828125, 61.5302734375, 64.26171875, 66.9931640625, 69.724609375, 72.4560546875, 75.1875]}, "gradients/decoder.transformer.h.7.ln_2.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 1.0, 3.0, 18.0, 456.0, 519.0, 21.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-73.60321807861328, -64.4305648803711, -55.257911682128906, -46.085262298583984, -36.9126091003418, -27.73995590209961, -18.567306518554688, -9.3946533203125, -0.2220001220703125, 8.950652122497559, 18.12330436706543, 27.295955657958984, 36.46860885620117, 45.64126205444336, 54.81391143798828, 63.98656463623047, 73.15921783447266, 82.33187103271484, 91.50452423095703, 100.67716979980469, 109.84982299804688, 119.02247619628906, 128.19512939453125, 137.36778259277344, 146.54043579101562, 155.7130889892578, 164.8857421875, 174.0583953857422, 183.23104858398438, 192.40370178222656, 201.57635498046875, 210.74899291992188, 219.92166137695312, 229.0943145751953, 238.2669677734375, 247.4396209716797, 256.6122741699219, 265.784912109375, 274.95758056640625, 284.1302185058594, 293.3028869628906, 302.47552490234375, 311.648193359375, 320.8208312988281, 329.9934997558594, 339.1661376953125, 348.33880615234375, 357.5114440917969, 366.68408203125, 375.8567199707031, 385.0293884277344, 394.2020263671875, 403.37469482421875, 412.5473327636719, 421.7200012207031, 430.89263916015625, 440.0653076171875, 449.2379455566406, 458.4106140136719, 467.583251953125, 476.75592041015625, 485.9285583496094, 495.1012268066406, 504.27386474609375, 513.446533203125]}, "gradients/decoder.transformer.h.7.ln_2.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 2.0, 3.0, 4.0, 4.0, 4.0, 2.0, 6.0, 3.0, 6.0, 4.0, 5.0, 11.0, 12.0, 13.0, 17.0, 21.0, 33.0, 23.0, 29.0, 27.0, 30.0, 37.0, 36.0, 36.0, 32.0, 38.0, 49.0, 38.0, 50.0, 30.0, 34.0, 39.0, 40.0, 37.0, 35.0, 31.0, 25.0, 28.0, 16.0, 19.0, 19.0, 13.0, 6.0, 15.0, 12.0, 7.0, 9.0, 4.0, 5.0, 3.0, 3.0, 3.0, 3.0, 3.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0, 1.0], "bins": [-45.55470275878906, -44.0551643371582, -42.55562973022461, -41.05609130859375, -39.55655288696289, -38.05701446533203, -36.55747985839844, -35.05794143676758, -33.55840301513672, -32.05886459350586, -30.559328079223633, -29.059791564941406, -27.560253143310547, -26.06071662902832, -24.561180114746094, -23.061641693115234, -21.56210708618164, -20.062570571899414, -18.563032150268555, -17.063495635986328, -15.563958168029785, -14.064420700073242, -12.564884185791016, -11.065346717834473, -9.56580924987793, -8.066271781921387, -6.566734790802002, -5.067197799682617, -3.567660331726074, -2.0681228637695312, -0.5685863494873047, 0.9309511184692383, 2.430492401123047, 3.9300296306610107, 5.429566860198975, 6.929103851318359, 8.428641319274902, 9.928178787231445, 11.427715301513672, 12.927252769470215, 14.426790237426758, 15.9263277053833, 17.425865173339844, 18.92540168762207, 20.424938201904297, 21.924476623535156, 23.424013137817383, 24.92354965209961, 26.42308807373047, 27.922624588012695, 29.422163009643555, 30.92169952392578, 32.42123794555664, 33.9207763671875, 35.420310974121094, 36.91984939575195, 38.41938781738281, 39.91892623901367, 41.418460845947266, 42.917999267578125, 44.417537689208984, 45.917076110839844, 47.41661071777344, 48.9161491394043, 50.41568374633789]}, "gradients/decoder.transformer.h.7.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 3.0, 2.0, 1.0, 3.0, 3.0, 2.0, 4.0, 5.0, 9.0, 8.0, 8.0, 8.0, 4.0, 17.0, 19.0, 18.0, 24.0, 20.0, 19.0, 36.0, 24.0, 31.0, 38.0, 33.0, 43.0, 35.0, 41.0, 42.0, 47.0, 44.0, 32.0, 44.0, 28.0, 29.0, 29.0, 38.0, 24.0, 21.0, 24.0, 26.0, 18.0, 20.0, 16.0, 13.0, 11.0, 9.0, 13.0, 4.0, 3.0, 6.0, 4.0, 5.0, 4.0, 2.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-7.109375, -6.88214111328125, -6.6549072265625, -6.42767333984375, -6.200439453125, -5.97320556640625, -5.7459716796875, -5.51873779296875, -5.29150390625, -5.06427001953125, -4.8370361328125, -4.60980224609375, -4.382568359375, -4.15533447265625, -3.9281005859375, -3.70086669921875, -3.4736328125, -3.24639892578125, -3.0191650390625, -2.79193115234375, -2.564697265625, -2.33746337890625, -2.1102294921875, -1.88299560546875, -1.65576171875, -1.42852783203125, -1.2012939453125, -0.97406005859375, -0.746826171875, -0.51959228515625, -0.2923583984375, -0.06512451171875, 0.162109375, 0.38934326171875, 0.6165771484375, 0.84381103515625, 1.071044921875, 1.29827880859375, 1.5255126953125, 1.75274658203125, 1.97998046875, 2.20721435546875, 2.4344482421875, 2.66168212890625, 2.888916015625, 3.11614990234375, 3.3433837890625, 3.57061767578125, 3.7978515625, 4.02508544921875, 4.2523193359375, 4.47955322265625, 4.706787109375, 4.93402099609375, 5.1612548828125, 5.38848876953125, 5.61572265625, 5.84295654296875, 6.0701904296875, 6.29742431640625, 6.524658203125, 6.75189208984375, 6.9791259765625, 7.20635986328125, 7.43359375]}, "gradients/decoder.transformer.h.7.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 2.0, 3.0, 3.0, 3.0, 8.0, 13.0, 13.0, 25.0, 38.0, 65.0, 90.0, 110.0, 159.0, 255.0, 387.0, 573.0, 802.0, 1240.0, 1780.0, 2773.0, 4100.0, 6096.0, 9318.0, 14214.0, 22046.0, 33904.0, 55003.0, 95039.0, 204974.0, 303001.0, 115584.0, 64435.0, 39293.0, 25130.0, 16329.0, 10650.0, 7071.0, 4601.0, 3108.0, 2023.0, 1444.0, 911.0, 625.0, 425.0, 280.0, 210.0, 136.0, 96.0, 59.0, 38.0, 33.0, 19.0, 16.0, 8.0, 5.0, 3.0, 2.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-1.767578125, -1.710845947265625, -1.65411376953125, -1.597381591796875, -1.5406494140625, -1.483917236328125, -1.42718505859375, -1.370452880859375, -1.313720703125, -1.256988525390625, -1.20025634765625, -1.143524169921875, -1.0867919921875, -1.030059814453125, -0.97332763671875, -0.916595458984375, -0.85986328125, -0.803131103515625, -0.74639892578125, -0.689666748046875, -0.6329345703125, -0.576202392578125, -0.51947021484375, -0.462738037109375, -0.406005859375, -0.349273681640625, -0.29254150390625, -0.235809326171875, -0.1790771484375, -0.122344970703125, -0.06561279296875, -0.008880615234375, 0.0478515625, 0.104583740234375, 0.16131591796875, 0.218048095703125, 0.2747802734375, 0.331512451171875, 0.38824462890625, 0.444976806640625, 0.501708984375, 0.558441162109375, 0.61517333984375, 0.671905517578125, 0.7286376953125, 0.785369873046875, 0.84210205078125, 0.898834228515625, 0.95556640625, 1.012298583984375, 1.06903076171875, 1.125762939453125, 1.1824951171875, 1.239227294921875, 1.29595947265625, 1.352691650390625, 1.409423828125, 1.466156005859375, 1.52288818359375, 1.579620361328125, 1.6363525390625, 1.693084716796875, 1.74981689453125, 1.806549072265625, 1.86328125]}, "gradients/decoder.transformer.h.7.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 2.0, 1.0, 2.0, 2.0, 4.0, 2.0, 8.0, 5.0, 11.0, 8.0, 16.0, 14.0, 16.0, 19.0, 19.0, 17.0, 38.0, 30.0, 31.0, 34.0, 51.0, 45.0, 48.0, 51.0, 1062.0, 35.0, 38.0, 45.0, 36.0, 34.0, 40.0, 27.0, 32.0, 25.0, 39.0, 27.0, 24.0, 13.0, 14.0, 10.0, 10.0, 14.0, 9.0, 8.0, 10.0, 5.0, 5.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-5.1875, -5.0302734375, -4.873046875, -4.7158203125, -4.55859375, -4.4013671875, -4.244140625, -4.0869140625, -3.9296875, -3.7724609375, -3.615234375, -3.4580078125, -3.30078125, -3.1435546875, -2.986328125, -2.8291015625, -2.671875, -2.5146484375, -2.357421875, -2.2001953125, -2.04296875, -1.8857421875, -1.728515625, -1.5712890625, -1.4140625, -1.2568359375, -1.099609375, -0.9423828125, -0.78515625, -0.6279296875, -0.470703125, -0.3134765625, -0.15625, 0.0009765625, 0.158203125, 0.3154296875, 0.47265625, 0.6298828125, 0.787109375, 0.9443359375, 1.1015625, 1.2587890625, 1.416015625, 1.5732421875, 1.73046875, 1.8876953125, 2.044921875, 2.2021484375, 2.359375, 2.5166015625, 2.673828125, 2.8310546875, 2.98828125, 3.1455078125, 3.302734375, 3.4599609375, 3.6171875, 3.7744140625, 3.931640625, 4.0888671875, 4.24609375, 4.4033203125, 4.560546875, 4.7177734375, 4.875]}, "gradients/decoder.transformer.h.7.crossattention.c_attn.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 2.0, 5.0, 7.0, 13.0, 12.0, 20.0, 23.0, 29.0, 47.0, 87.0, 120.0, 237.0, 371.0, 644.0, 1147.0, 2209.0, 3934.0, 6951.0, 13506.0, 26622.0, 54841.0, 121282.0, 1439948.0, 244123.0, 92226.0, 43243.0, 21223.0, 11041.0, 5925.0, 3171.0, 1758.0, 973.0, 597.0, 318.0, 181.0, 109.0, 61.0, 49.0, 31.0, 18.0, 8.0, 12.0, 2.0, 6.0, 4.0, 2.0, 1.0, 4.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.662109375, -2.575775146484375, -2.48944091796875, -2.403106689453125, -2.3167724609375, -2.230438232421875, -2.14410400390625, -2.057769775390625, -1.971435546875, -1.885101318359375, -1.79876708984375, -1.712432861328125, -1.6260986328125, -1.539764404296875, -1.45343017578125, -1.367095947265625, -1.28076171875, -1.194427490234375, -1.10809326171875, -1.021759033203125, -0.9354248046875, -0.849090576171875, -0.76275634765625, -0.676422119140625, -0.590087890625, -0.503753662109375, -0.41741943359375, -0.331085205078125, -0.2447509765625, -0.158416748046875, -0.07208251953125, 0.014251708984375, 0.1005859375, 0.186920166015625, 0.27325439453125, 0.359588623046875, 0.4459228515625, 0.532257080078125, 0.61859130859375, 0.704925537109375, 0.791259765625, 0.877593994140625, 0.96392822265625, 1.050262451171875, 1.1365966796875, 1.222930908203125, 1.30926513671875, 1.395599365234375, 1.48193359375, 1.568267822265625, 1.65460205078125, 1.740936279296875, 1.8272705078125, 1.913604736328125, 1.99993896484375, 2.086273193359375, 2.172607421875, 2.258941650390625, 2.34527587890625, 2.431610107421875, 2.5179443359375, 2.604278564453125, 2.69061279296875, 2.776947021484375, 2.86328125]}, "gradients/decoder.transformer.h.7.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 3.0, 4.0, 4.0, 4.0, 5.0, 12.0, 9.0, 18.0, 15.0, 20.0, 25.0, 30.0, 68.0, 81.0, 81.0, 125.0, 106.0, 93.0, 67.0, 53.0, 46.0, 33.0, 20.0, 24.0, 20.0, 17.0, 10.0, 7.0, 3.0, 3.0, 2.0, 0.0, 2.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.0013942718505859375, -0.001349329948425293, -0.0013043880462646484, -0.001259446144104004, -0.0012145042419433594, -0.0011695623397827148, -0.0011246204376220703, -0.0010796785354614258, -0.0010347366333007812, -0.0009897947311401367, -0.0009448528289794922, -0.0008999109268188477, -0.0008549690246582031, -0.0008100271224975586, -0.0007650852203369141, -0.0007201433181762695, -0.000675201416015625, -0.0006302595138549805, -0.0005853176116943359, -0.0005403757095336914, -0.0004954338073730469, -0.00045049190521240234, -0.0004055500030517578, -0.0003606081008911133, -0.00031566619873046875, -0.0002707242965698242, -0.0002257823944091797, -0.00018084049224853516, -0.00013589859008789062, -9.09566879272461e-05, -4.601478576660156e-05, -1.0728836059570312e-06, 4.38690185546875e-05, 8.881092071533203e-05, 0.00013375282287597656, 0.0001786947250366211, 0.00022363662719726562, 0.00026857852935791016, 0.0003135204315185547, 0.0003584623336791992, 0.00040340423583984375, 0.0004483461380004883, 0.0004932880401611328, 0.0005382299423217773, 0.0005831718444824219, 0.0006281137466430664, 0.0006730556488037109, 0.0007179975509643555, 0.000762939453125, 0.0008078813552856445, 0.0008528232574462891, 0.0008977651596069336, 0.0009427070617675781, 0.0009876489639282227, 0.0010325908660888672, 0.0010775327682495117, 0.0011224746704101562, 0.0011674165725708008, 0.0012123584747314453, 0.0012573003768920898, 0.0013022422790527344, 0.001347184181213379, 0.0013921260833740234, 0.001437067985534668, 0.0014820098876953125]}, "gradients/decoder.transformer.h.7.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 2.0, 1.0, 2.0, 2.0, 0.0, 2.0, 2.0, 3.0, 4.0, 4.0, 5.0, 12.0, 15.0, 19.0, 23.0, 30.0, 35.0, 39.0, 78.0, 97.0, 154.0, 361.0, 3256.0, 1041440.0, 2191.0, 316.0, 181.0, 86.0, 46.0, 28.0, 25.0, 22.0, 18.0, 15.0, 14.0, 8.0, 4.0, 9.0, 6.0, 4.0, 4.0, 5.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.03057861328125, -0.029540538787841797, -0.028502464294433594, -0.02746438980102539, -0.026426315307617188, -0.025388240814208984, -0.02435016632080078, -0.023312091827392578, -0.022274017333984375, -0.021235942840576172, -0.02019786834716797, -0.019159793853759766, -0.018121719360351562, -0.01708364486694336, -0.016045570373535156, -0.015007495880126953, -0.01396942138671875, -0.012931346893310547, -0.011893272399902344, -0.01085519790649414, -0.009817123413085938, -0.008779048919677734, -0.007740974426269531, -0.006702899932861328, -0.005664825439453125, -0.004626750946044922, -0.0035886764526367188, -0.0025506019592285156, -0.0015125274658203125, -0.0004744529724121094, 0.0005636215209960938, 0.0016016960144042969, 0.0026397705078125, 0.003677845001220703, 0.004715919494628906, 0.005753993988037109, 0.0067920684814453125, 0.007830142974853516, 0.008868217468261719, 0.009906291961669922, 0.010944366455078125, 0.011982440948486328, 0.013020515441894531, 0.014058589935302734, 0.015096664428710938, 0.01613473892211914, 0.017172813415527344, 0.018210887908935547, 0.01924896240234375, 0.020287036895751953, 0.021325111389160156, 0.02236318588256836, 0.023401260375976562, 0.024439334869384766, 0.02547740936279297, 0.026515483856201172, 0.027553558349609375, 0.028591632843017578, 0.02962970733642578, 0.030667781829833984, 0.03170585632324219, 0.03274393081665039, 0.033782005310058594, 0.0348200798034668, 0.035858154296875]}, "gradients/decoder.transformer.h.7.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 4.0, 13.0, 45.0, 155.0, 286.0, 294.0, 166.0, 40.0, 11.0, 5.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0011612925445660949, -0.0011128628393635154, -0.0010644332505762577, -0.0010160035453736782, -0.0009675738983787596, -0.000919144251383841, -0.0008707145461812615, -0.000822284899186343, -0.0007738552521914244, -0.0007254256051965058, -0.0006769959582015872, -0.0006285662529990077, -0.0005801366060040891, -0.0005317069590091705, -0.0004832772829104215, -0.00043484760681167245, -0.00038641795981675386, -0.0003379883128218353, -0.00028955863672308624, -0.00024112897517625242, -0.0001926993136294186, -0.0001442696520825848, -9.583999053575099e-05, -4.7410314437001944e-05, 1.0193325579166412e-06, 4.9448994104750454e-05, 9.787865565158427e-05, 0.00014630831719841808, 0.0001947379787452519, 0.0002431676402920857, 0.0002915973018389195, 0.00034002697793766856, 0.00038845662493258715, 0.00043688627192750573, 0.0004853159480262548, 0.0005337456241250038, 0.0005821752711199224, 0.000630604918114841, 0.0006790346233174205, 0.0007274642703123391, 0.0007758939173072577, 0.0008243235643021762, 0.0008727532112970948, 0.0009211829164996743, 0.0009696125634945929, 0.0010180422104895115, 0.001066471915692091, 0.0011149016208946705, 0.0011633312096819282, 0.0012117609148845077, 0.0012601905036717653, 0.0013086202088743448, 0.0013570499140769243, 0.001405479502864182, 0.0014539092080667615, 0.0015023387968540192, 0.0015507685020565987, 0.0015991982072591782, 0.0016476277960464358, 0.0016960575012490153, 0.001744487090036273, 0.0017929167952388525, 0.001841346500441432, 0.0018897762056440115, 0.0019382057944312692]}, "gradients/decoder.transformer.h.7.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 1.0, 1.0, 2.0, 2.0, 4.0, 3.0, 2.0, 7.0, 2.0, 5.0, 7.0, 11.0, 12.0, 11.0, 17.0, 21.0, 20.0, 21.0, 19.0, 23.0, 25.0, 26.0, 31.0, 31.0, 32.0, 37.0, 31.0, 43.0, 42.0, 46.0, 28.0, 37.0, 33.0, 32.0, 42.0, 33.0, 42.0, 32.0, 29.0, 27.0, 21.0, 18.0, 14.0, 19.0, 11.0, 7.0, 15.0, 8.0, 5.0, 8.0, 5.0, 3.0, 4.0, 2.0, 2.0, 0.0, 1.0, 0.0, 5.0, 1.0, 2.0], "bins": [-0.0004977583885192871, -0.00048219598829746246, -0.0004666335880756378, -0.00045107118785381317, -0.0004355087876319885, -0.0004199463874101639, -0.00040438398718833923, -0.0003888215869665146, -0.00037325918674468994, -0.0003576967865228653, -0.00034213438630104065, -0.000326571986079216, -0.00031100958585739136, -0.0002954471856355667, -0.00027988478541374207, -0.0002643223851919174, -0.0002487599849700928, -0.00023319758474826813, -0.00021763518452644348, -0.00020207278430461884, -0.0001865103840827942, -0.00017094798386096954, -0.0001553855836391449, -0.00013982318341732025, -0.0001242607831954956, -0.00010869838297367096, -9.313598275184631e-05, -7.757358253002167e-05, -6.201118230819702e-05, -4.6448782086372375e-05, -3.088638186454773e-05, -1.5323981642723083e-05, 2.384185791015625e-07, 1.580081880092621e-05, 3.1363219022750854e-05, 4.69256192445755e-05, 6.248801946640015e-05, 7.805041968822479e-05, 9.361281991004944e-05, 0.00010917522013187408, 0.00012473762035369873, 0.00014030002057552338, 0.00015586242079734802, 0.00017142482101917267, 0.00018698722124099731, 0.00020254962146282196, 0.0002181120216846466, 0.00023367442190647125, 0.0002492368221282959, 0.00026479922235012054, 0.0002803616225719452, 0.00029592402279376984, 0.0003114864230155945, 0.00032704882323741913, 0.0003426112234592438, 0.0003581736236810684, 0.00037373602390289307, 0.0003892984241247177, 0.00040486082434654236, 0.000420423224568367, 0.00043598562479019165, 0.0004515480250120163, 0.00046711042523384094, 0.0004826728254556656, 0.0004982352256774902]}, "gradients/decoder.transformer.h.7.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 3.0, 2.0, 1.0, 3.0, 3.0, 2.0, 4.0, 5.0, 9.0, 8.0, 8.0, 8.0, 4.0, 17.0, 19.0, 18.0, 24.0, 20.0, 19.0, 36.0, 24.0, 31.0, 38.0, 33.0, 43.0, 35.0, 41.0, 42.0, 47.0, 44.0, 32.0, 44.0, 28.0, 29.0, 29.0, 38.0, 24.0, 21.0, 24.0, 26.0, 18.0, 20.0, 16.0, 13.0, 11.0, 9.0, 13.0, 4.0, 3.0, 6.0, 4.0, 5.0, 4.0, 2.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-7.109375, -6.88214111328125, -6.6549072265625, -6.42767333984375, -6.200439453125, -5.97320556640625, -5.7459716796875, -5.51873779296875, -5.29150390625, -5.06427001953125, -4.8370361328125, -4.60980224609375, -4.382568359375, -4.15533447265625, -3.9281005859375, -3.70086669921875, -3.4736328125, -3.24639892578125, -3.0191650390625, -2.79193115234375, -2.564697265625, -2.33746337890625, -2.1102294921875, -1.88299560546875, -1.65576171875, -1.42852783203125, -1.2012939453125, -0.97406005859375, -0.746826171875, -0.51959228515625, -0.2923583984375, -0.06512451171875, 0.162109375, 0.38934326171875, 0.6165771484375, 0.84381103515625, 1.071044921875, 1.29827880859375, 1.5255126953125, 1.75274658203125, 1.97998046875, 2.20721435546875, 2.4344482421875, 2.66168212890625, 2.888916015625, 3.11614990234375, 3.3433837890625, 3.57061767578125, 3.7978515625, 4.02508544921875, 4.2523193359375, 4.47955322265625, 4.706787109375, 4.93402099609375, 5.1612548828125, 5.38848876953125, 5.61572265625, 5.84295654296875, 6.0701904296875, 6.29742431640625, 6.524658203125, 6.75189208984375, 6.9791259765625, 7.20635986328125, 7.43359375]}, "gradients/decoder.transformer.h.7.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 3.0, 2.0, 0.0, 4.0, 2.0, 3.0, 2.0, 7.0, 10.0, 8.0, 13.0, 9.0, 9.0, 20.0, 27.0, 34.0, 44.0, 45.0, 56.0, 97.0, 110.0, 165.0, 261.0, 409.0, 966.0, 3044.0, 16549.0, 136584.0, 759983.0, 111089.0, 14164.0, 2726.0, 829.0, 407.0, 223.0, 166.0, 118.0, 79.0, 57.0, 60.0, 32.0, 29.0, 27.0, 18.0, 11.0, 16.0, 14.0, 6.0, 6.0, 7.0, 5.0, 3.0, 6.0, 1.0, 1.0, 3.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-14.03125, -13.585205078125, -13.13916015625, -12.693115234375, -12.2470703125, -11.801025390625, -11.35498046875, -10.908935546875, -10.462890625, -10.016845703125, -9.57080078125, -9.124755859375, -8.6787109375, -8.232666015625, -7.78662109375, -7.340576171875, -6.89453125, -6.448486328125, -6.00244140625, -5.556396484375, -5.1103515625, -4.664306640625, -4.21826171875, -3.772216796875, -3.326171875, -2.880126953125, -2.43408203125, -1.988037109375, -1.5419921875, -1.095947265625, -0.64990234375, -0.203857421875, 0.2421875, 0.688232421875, 1.13427734375, 1.580322265625, 2.0263671875, 2.472412109375, 2.91845703125, 3.364501953125, 3.810546875, 4.256591796875, 4.70263671875, 5.148681640625, 5.5947265625, 6.040771484375, 6.48681640625, 6.932861328125, 7.37890625, 7.824951171875, 8.27099609375, 8.717041015625, 9.1630859375, 9.609130859375, 10.05517578125, 10.501220703125, 10.947265625, 11.393310546875, 11.83935546875, 12.285400390625, 12.7314453125, 13.177490234375, 13.62353515625, 14.069580078125, 14.515625]}, "gradients/decoder.transformer.h.7.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 3.0, 4.0, 3.0, 5.0, 7.0, 6.0, 10.0, 10.0, 20.0, 10.0, 15.0, 22.0, 19.0, 29.0, 34.0, 46.0, 29.0, 42.0, 45.0, 53.0, 61.0, 96.0, 1522.0, 386.0, 110.0, 61.0, 57.0, 54.0, 50.0, 39.0, 29.0, 42.0, 23.0, 13.0, 30.0, 16.0, 15.0, 12.0, 5.0, 5.0, 5.0, 3.0, 3.0, 6.0, 1.0, 1.0, 2.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-26.640625, -25.831298828125, -25.02197265625, -24.212646484375, -23.4033203125, -22.593994140625, -21.78466796875, -20.975341796875, -20.166015625, -19.356689453125, -18.54736328125, -17.738037109375, -16.9287109375, -16.119384765625, -15.31005859375, -14.500732421875, -13.69140625, -12.882080078125, -12.07275390625, -11.263427734375, -10.4541015625, -9.644775390625, -8.83544921875, -8.026123046875, -7.216796875, -6.407470703125, -5.59814453125, -4.788818359375, -3.9794921875, -3.170166015625, -2.36083984375, -1.551513671875, -0.7421875, 0.067138671875, 0.87646484375, 1.685791015625, 2.4951171875, 3.304443359375, 4.11376953125, 4.923095703125, 5.732421875, 6.541748046875, 7.35107421875, 8.160400390625, 8.9697265625, 9.779052734375, 10.58837890625, 11.397705078125, 12.20703125, 13.016357421875, 13.82568359375, 14.635009765625, 15.4443359375, 16.253662109375, 17.06298828125, 17.872314453125, 18.681640625, 19.490966796875, 20.30029296875, 21.109619140625, 21.9189453125, 22.728271484375, 23.53759765625, 24.346923828125, 25.15625]}, "gradients/decoder.transformer.h.7.attn.c_attn.weight": {"_type": "histogram", "values": [3.0, 0.0, 0.0, 0.0, 3.0, 5.0, 0.0, 4.0, 6.0, 3.0, 5.0, 7.0, 7.0, 15.0, 18.0, 32.0, 31.0, 31.0, 55.0, 69.0, 103.0, 180.0, 300.0, 715.0, 3793.0, 2876780.0, 259844.0, 2275.0, 602.0, 274.0, 158.0, 90.0, 82.0, 53.0, 37.0, 31.0, 21.0, 21.0, 17.0, 18.0, 9.0, 8.0, 3.0, 6.0, 5.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-58.75, -56.4814453125, -54.212890625, -51.9443359375, -49.67578125, -47.4072265625, -45.138671875, -42.8701171875, -40.6015625, -38.3330078125, -36.064453125, -33.7958984375, -31.52734375, -29.2587890625, -26.990234375, -24.7216796875, -22.453125, -20.1845703125, -17.916015625, -15.6474609375, -13.37890625, -11.1103515625, -8.841796875, -6.5732421875, -4.3046875, -2.0361328125, 0.232421875, 2.5009765625, 4.76953125, 7.0380859375, 9.306640625, 11.5751953125, 13.84375, 16.1123046875, 18.380859375, 20.6494140625, 22.91796875, 25.1865234375, 27.455078125, 29.7236328125, 31.9921875, 34.2607421875, 36.529296875, 38.7978515625, 41.06640625, 43.3349609375, 45.603515625, 47.8720703125, 50.140625, 52.4091796875, 54.677734375, 56.9462890625, 59.21484375, 61.4833984375, 63.751953125, 66.0205078125, 68.2890625, 70.5576171875, 72.826171875, 75.0947265625, 77.36328125, 79.6318359375, 81.900390625, 84.1689453125, 86.4375]}, "gradients/decoder.transformer.h.7.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 3.0, 195.0, 816.0, 4.0], "bins": [-807.1841430664062, -794.1924438476562, -781.20068359375, -768.208984375, -755.21728515625, -742.2255859375, -729.2338256835938, -716.2421264648438, -703.2504272460938, -690.2587280273438, -677.2669677734375, -664.2752685546875, -651.2835693359375, -638.2918701171875, -625.3001098632812, -612.3084106445312, -599.316650390625, -586.324951171875, -573.3331909179688, -560.3414916992188, -547.3497924804688, -534.3580932617188, -521.3663330078125, -508.3746337890625, -495.3829345703125, -482.3912048339844, -469.3995056152344, -456.40777587890625, -443.41607666015625, -430.4243469238281, -417.4326171875, -404.44091796875, -391.4491882324219, -378.45745849609375, -365.46575927734375, -352.4740295410156, -339.4823303222656, -326.4906005859375, -313.4989013671875, -300.5071716308594, -287.5154724121094, -274.52374267578125, -261.53204345703125, -248.54031372070312, -235.54861450195312, -222.556884765625, -209.56517028808594, -196.57345581054688, -183.58172607421875, -170.5900115966797, -157.59829711914062, -144.6065673828125, -131.6148681640625, -118.6231460571289, -105.63142395019531, -92.63970947265625, -79.64799499511719, -66.65628051757812, -53.6645622253418, -40.67284393310547, -27.681129455566406, -14.689414978027344, -1.69769287109375, 11.294021606445312, 24.285734176635742]}, "gradients/decoder.transformer.h.7.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 6.0, 9.0, 5.0, 5.0, 9.0, 3.0, 11.0, 16.0, 17.0, 12.0, 19.0, 29.0, 10.0, 22.0, 23.0, 24.0, 37.0, 35.0, 33.0, 39.0, 40.0, 42.0, 40.0, 47.0, 35.0, 37.0, 35.0, 46.0, 43.0, 35.0, 25.0, 28.0, 21.0, 25.0, 14.0, 23.0, 16.0, 18.0, 10.0, 10.0, 11.0, 10.0, 8.0, 6.0, 9.0, 6.0, 3.0, 4.0, 0.0, 0.0, 2.0, 2.0, 0.0, 2.0, 2.0], "bins": [-62.119842529296875, -60.12122344970703, -58.12260437011719, -56.123985290527344, -54.1253662109375, -52.126747131347656, -50.12812805175781, -48.12950897216797, -46.130889892578125, -44.13227081298828, -42.13365173339844, -40.135032653808594, -38.13641357421875, -36.137794494628906, -34.13917541503906, -32.14055633544922, -30.141935348510742, -28.1433162689209, -26.144697189331055, -24.14607810974121, -22.147459030151367, -20.14883804321289, -18.150218963623047, -16.151599884033203, -14.152981758117676, -12.154362678527832, -10.155743598937988, -8.157123565673828, -6.158504962921143, -4.159885406494141, -2.161266326904297, -0.16264724731445312, 1.8359718322753906, 3.8345909118652344, 5.833209991455078, 7.83182954788208, 9.830448150634766, 11.829068183898926, 13.82768726348877, 15.826306343078613, 17.82492446899414, 19.823543548583984, 21.822162628173828, 23.820781707763672, 25.819400787353516, 27.81801986694336, 29.816638946533203, 31.815258026123047, 33.813880920410156, 35.8125, 37.811119079589844, 39.80973815917969, 41.80835723876953, 43.806976318359375, 45.80559539794922, 47.80421447753906, 49.802833557128906, 51.80145263671875, 53.800071716308594, 55.79869079589844, 57.79730987548828, 59.795928955078125, 61.79454803466797, 63.79316711425781, 65.79178619384766]}, "gradients/decoder.transformer.h.6.mlp.c_proj.bias": {"_type": "histogram", "values": [3.0, 2.0, 0.0, 2.0, 2.0, 2.0, 3.0, 3.0, 5.0, 3.0, 4.0, 11.0, 9.0, 8.0, 15.0, 14.0, 17.0, 18.0, 19.0, 15.0, 20.0, 24.0, 32.0, 32.0, 41.0, 31.0, 41.0, 31.0, 31.0, 44.0, 43.0, 42.0, 37.0, 28.0, 48.0, 36.0, 36.0, 32.0, 32.0, 24.0, 30.0, 17.0, 23.0, 22.0, 13.0, 9.0, 12.0, 10.0, 8.0, 8.0, 4.0, 8.0, 5.0, 3.0, 2.0, 3.0, 0.0, 2.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-7.609375, -7.3603515625, -7.111328125, -6.8623046875, -6.61328125, -6.3642578125, -6.115234375, -5.8662109375, -5.6171875, -5.3681640625, -5.119140625, -4.8701171875, -4.62109375, -4.3720703125, -4.123046875, -3.8740234375, -3.625, -3.3759765625, -3.126953125, -2.8779296875, -2.62890625, -2.3798828125, -2.130859375, -1.8818359375, -1.6328125, -1.3837890625, -1.134765625, -0.8857421875, -0.63671875, -0.3876953125, -0.138671875, 0.1103515625, 0.359375, 0.6083984375, 0.857421875, 1.1064453125, 1.35546875, 1.6044921875, 1.853515625, 2.1025390625, 2.3515625, 2.6005859375, 2.849609375, 3.0986328125, 3.34765625, 3.5966796875, 3.845703125, 4.0947265625, 4.34375, 4.5927734375, 4.841796875, 5.0908203125, 5.33984375, 5.5888671875, 5.837890625, 6.0869140625, 6.3359375, 6.5849609375, 6.833984375, 7.0830078125, 7.33203125, 7.5810546875, 7.830078125, 8.0791015625, 8.328125]}, "gradients/decoder.transformer.h.6.mlp.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 1.0, 5.0, 5.0, 1.0, 6.0, 8.0, 7.0, 8.0, 16.0, 13.0, 18.0, 23.0, 27.0, 31.0, 27.0, 44.0, 59.0, 103.0, 144.0, 227.0, 464.0, 1309.0, 6632.0, 136597.0, 3115938.0, 907448.0, 20802.0, 2614.0, 771.0, 290.0, 187.0, 112.0, 67.0, 64.0, 38.0, 36.0, 31.0, 29.0, 13.0, 7.0, 9.0, 9.0, 15.0, 9.0, 4.0, 9.0, 4.0, 1.0, 0.0, 1.0, 1.0, 1.0, 6.0, 2.0, 2.0], "bins": [-27.65625, -26.83642578125, -26.0166015625, -25.19677734375, -24.376953125, -23.55712890625, -22.7373046875, -21.91748046875, -21.09765625, -20.27783203125, -19.4580078125, -18.63818359375, -17.818359375, -16.99853515625, -16.1787109375, -15.35888671875, -14.5390625, -13.71923828125, -12.8994140625, -12.07958984375, -11.259765625, -10.43994140625, -9.6201171875, -8.80029296875, -7.98046875, -7.16064453125, -6.3408203125, -5.52099609375, -4.701171875, -3.88134765625, -3.0615234375, -2.24169921875, -1.421875, -0.60205078125, 0.2177734375, 1.03759765625, 1.857421875, 2.67724609375, 3.4970703125, 4.31689453125, 5.13671875, 5.95654296875, 6.7763671875, 7.59619140625, 8.416015625, 9.23583984375, 10.0556640625, 10.87548828125, 11.6953125, 12.51513671875, 13.3349609375, 14.15478515625, 14.974609375, 15.79443359375, 16.6142578125, 17.43408203125, 18.25390625, 19.07373046875, 19.8935546875, 20.71337890625, 21.533203125, 22.35302734375, 23.1728515625, 23.99267578125, 24.8125]}, "gradients/decoder.transformer.h.6.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 2.0, 0.0, 1.0, 5.0, 6.0, 8.0, 10.0, 20.0, 24.0, 41.0, 84.0, 168.0, 274.0, 471.0, 767.0, 938.0, 620.0, 303.0, 153.0, 90.0, 42.0, 29.0, 17.0, 7.0, 3.0, 2.0, 0.0, 2.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-23.140625, -22.257568359375, -21.37451171875, -20.491455078125, -19.6083984375, -18.725341796875, -17.84228515625, -16.959228515625, -16.076171875, -15.193115234375, -14.31005859375, -13.427001953125, -12.5439453125, -11.660888671875, -10.77783203125, -9.894775390625, -9.01171875, -8.128662109375, -7.24560546875, -6.362548828125, -5.4794921875, -4.596435546875, -3.71337890625, -2.830322265625, -1.947265625, -1.064208984375, -0.18115234375, 0.701904296875, 1.5849609375, 2.468017578125, 3.35107421875, 4.234130859375, 5.1171875, 6.000244140625, 6.88330078125, 7.766357421875, 8.6494140625, 9.532470703125, 10.41552734375, 11.298583984375, 12.181640625, 13.064697265625, 13.94775390625, 14.830810546875, 15.7138671875, 16.596923828125, 17.47998046875, 18.363037109375, 19.24609375, 20.129150390625, 21.01220703125, 21.895263671875, 22.7783203125, 23.661376953125, 24.54443359375, 25.427490234375, 26.310546875, 27.193603515625, 28.07666015625, 28.959716796875, 29.8427734375, 30.725830078125, 31.60888671875, 32.491943359375, 33.375]}, "gradients/decoder.transformer.h.6.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 2.0, 1.0, 3.0, 5.0, 7.0, 6.0, 12.0, 17.0, 15.0, 16.0, 37.0, 39.0, 52.0, 64.0, 92.0, 123.0, 193.0, 274.0, 648.0, 2735.0, 369515.0, 3813176.0, 5116.0, 874.0, 448.0, 213.0, 151.0, 104.0, 98.0, 50.0, 49.0, 41.0, 33.0, 22.0, 18.0, 16.0, 12.0, 7.0, 1.0, 3.0, 1.0, 2.0, 0.0, 2.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-87.125, -84.4140625, -81.703125, -78.9921875, -76.28125, -73.5703125, -70.859375, -68.1484375, -65.4375, -62.7265625, -60.015625, -57.3046875, -54.59375, -51.8828125, -49.171875, -46.4609375, -43.75, -41.0390625, -38.328125, -35.6171875, -32.90625, -30.1953125, -27.484375, -24.7734375, -22.0625, -19.3515625, -16.640625, -13.9296875, -11.21875, -8.5078125, -5.796875, -3.0859375, -0.375, 2.3359375, 5.046875, 7.7578125, 10.46875, 13.1796875, 15.890625, 18.6015625, 21.3125, 24.0234375, 26.734375, 29.4453125, 32.15625, 34.8671875, 37.578125, 40.2890625, 43.0, 45.7109375, 48.421875, 51.1328125, 53.84375, 56.5546875, 59.265625, 61.9765625, 64.6875, 67.3984375, 70.109375, 72.8203125, 75.53125, 78.2421875, 80.953125, 83.6640625, 86.375]}, "gradients/decoder.transformer.h.6.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 9.0, 85.0, 362.0, 422.0, 125.0, 8.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-247.7996368408203, -241.9252471923828, -236.05084228515625, -230.17645263671875, -224.30206298828125, -218.42767333984375, -212.5532684326172, -206.6788787841797, -200.80447387695312, -194.93008422851562, -189.05567932128906, -183.18128967285156, -177.30690002441406, -171.4324951171875, -165.55810546875, -159.6837158203125, -153.809326171875, -147.9349365234375, -142.06053161621094, -136.18614196777344, -130.31175231933594, -124.4373550415039, -118.56295776367188, -112.68856811523438, -106.81417083740234, -100.93977355957031, -95.06538391113281, -89.19098663330078, -83.31658935546875, -77.44219970703125, -71.56780242919922, -65.69340515136719, -59.81901550292969, -53.94462203979492, -48.070228576660156, -42.195831298828125, -36.32143783569336, -30.447044372558594, -24.572647094726562, -18.698253631591797, -12.823860168457031, -6.949465751647949, -1.0750713348388672, 4.799324035644531, 10.673717498779297, 16.548110961914062, 22.422508239746094, 28.29690170288086, 34.171295166015625, 40.04568862915039, 45.920082092285156, 51.79447937011719, 57.66887283325195, 63.54326629638672, 69.41766357421875, 75.29205322265625, 81.16645050048828, 87.04084777832031, 92.91523742675781, 98.78963470458984, 104.66403198242188, 110.53842163085938, 116.4128189086914, 122.28721618652344, 128.16160583496094]}, "gradients/decoder.transformer.h.6.ln_2.bias": {"_type": "histogram", "values": [3.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 3.0, 2.0, 4.0, 12.0, 6.0, 3.0, 9.0, 11.0, 16.0, 10.0, 14.0, 23.0, 21.0, 22.0, 28.0, 33.0, 34.0, 38.0, 43.0, 45.0, 42.0, 40.0, 37.0, 32.0, 34.0, 55.0, 49.0, 34.0, 30.0, 26.0, 38.0, 27.0, 30.0, 19.0, 21.0, 20.0, 24.0, 23.0, 10.0, 13.0, 8.0, 7.0, 4.0, 4.0, 3.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-52.632354736328125, -50.9722900390625, -49.312225341796875, -47.65216064453125, -45.992095947265625, -44.33203125, -42.671966552734375, -41.01190185546875, -39.351837158203125, -37.6917724609375, -36.031707763671875, -34.37164306640625, -32.711578369140625, -31.051513671875, -29.391448974609375, -27.73138427734375, -26.071321487426758, -24.411256790161133, -22.751192092895508, -21.091127395629883, -19.431062698364258, -17.770999908447266, -16.11093521118164, -14.4508695602417, -12.790804862976074, -11.13074016571045, -9.470675468444824, -7.810611248016357, -6.150546550750732, -4.490482330322266, -2.8304176330566406, -1.1703529357910156, 0.4897117614746094, 2.1497764587402344, 3.8098409175872803, 5.469905376434326, 7.129970073699951, 8.790034294128418, 10.450098991394043, 12.110163688659668, 13.770228385925293, 15.430293083190918, 17.090356826782227, 18.75042152404785, 20.410486221313477, 22.0705509185791, 23.730615615844727, 25.39068031311035, 27.050745010375977, 28.7108097076416, 30.370874404907227, 32.03093719482422, 33.691001892089844, 35.35106658935547, 37.011131286621094, 38.67119598388672, 40.331260681152344, 41.99132537841797, 43.651390075683594, 45.31145477294922, 46.971519470214844, 48.63158416748047, 50.291648864746094, 51.95171356201172, 53.611778259277344]}, "gradients/decoder.transformer.h.6.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 4.0, 2.0, 4.0, 5.0, 6.0, 8.0, 6.0, 10.0, 7.0, 11.0, 7.0, 6.0, 10.0, 12.0, 19.0, 21.0, 22.0, 16.0, 23.0, 44.0, 30.0, 32.0, 44.0, 41.0, 37.0, 36.0, 38.0, 25.0, 34.0, 51.0, 35.0, 33.0, 31.0, 34.0, 33.0, 27.0, 27.0, 29.0, 22.0, 23.0, 14.0, 12.0, 13.0, 14.0, 15.0, 9.0, 10.0, 6.0, 3.0, 3.0, 5.0, 1.0, 5.0, 1.0, 2.0, 0.0, 1.0, 2.0], "bins": [-8.234375, -7.98736572265625, -7.7403564453125, -7.49334716796875, -7.246337890625, -6.99932861328125, -6.7523193359375, -6.50531005859375, -6.25830078125, -6.01129150390625, -5.7642822265625, -5.51727294921875, -5.270263671875, -5.02325439453125, -4.7762451171875, -4.52923583984375, -4.2822265625, -4.03521728515625, -3.7882080078125, -3.54119873046875, -3.294189453125, -3.04718017578125, -2.8001708984375, -2.55316162109375, -2.30615234375, -2.05914306640625, -1.8121337890625, -1.56512451171875, -1.318115234375, -1.07110595703125, -0.8240966796875, -0.57708740234375, -0.330078125, -0.08306884765625, 0.1639404296875, 0.41094970703125, 0.657958984375, 0.90496826171875, 1.1519775390625, 1.39898681640625, 1.64599609375, 1.89300537109375, 2.1400146484375, 2.38702392578125, 2.634033203125, 2.88104248046875, 3.1280517578125, 3.37506103515625, 3.6220703125, 3.86907958984375, 4.1160888671875, 4.36309814453125, 4.610107421875, 4.85711669921875, 5.1041259765625, 5.35113525390625, 5.59814453125, 5.84515380859375, 6.0921630859375, 6.33917236328125, 6.586181640625, 6.83319091796875, 7.0802001953125, 7.32720947265625, 7.57421875]}, "gradients/decoder.transformer.h.6.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 3.0, 0.0, 2.0, 0.0, 5.0, 8.0, 12.0, 17.0, 26.0, 44.0, 54.0, 119.0, 131.0, 219.0, 306.0, 450.0, 666.0, 947.0, 1354.0, 1867.0, 2555.0, 3444.0, 4928.0, 7212.0, 10004.0, 14393.0, 21940.0, 32906.0, 52278.0, 86694.0, 172936.0, 298384.0, 129049.0, 71699.0, 44276.0, 28025.0, 18872.0, 12712.0, 8725.0, 6104.0, 4340.0, 3137.0, 2243.0, 1648.0, 1189.0, 811.0, 615.0, 404.0, 294.0, 186.0, 123.0, 83.0, 38.0, 38.0, 23.0, 12.0, 10.0, 7.0, 2.0, 3.0, 1.0, 2.0], "bins": [-1.9638671875, -1.9049224853515625, -1.845977783203125, -1.7870330810546875, -1.72808837890625, -1.6691436767578125, -1.610198974609375, -1.5512542724609375, -1.4923095703125, -1.4333648681640625, -1.374420166015625, -1.3154754638671875, -1.25653076171875, -1.1975860595703125, -1.138641357421875, -1.0796966552734375, -1.020751953125, -0.9618072509765625, -0.902862548828125, -0.8439178466796875, -0.78497314453125, -0.7260284423828125, -0.667083740234375, -0.6081390380859375, -0.5491943359375, -0.4902496337890625, -0.431304931640625, -0.3723602294921875, -0.31341552734375, -0.2544708251953125, -0.195526123046875, -0.1365814208984375, -0.07763671875, -0.0186920166015625, 0.040252685546875, 0.0991973876953125, 0.15814208984375, 0.2170867919921875, 0.276031494140625, 0.3349761962890625, 0.3939208984375, 0.4528656005859375, 0.511810302734375, 0.5707550048828125, 0.62969970703125, 0.6886444091796875, 0.747589111328125, 0.8065338134765625, 0.865478515625, 0.9244232177734375, 0.983367919921875, 1.0423126220703125, 1.10125732421875, 1.1602020263671875, 1.219146728515625, 1.2780914306640625, 1.3370361328125, 1.3959808349609375, 1.454925537109375, 1.5138702392578125, 1.57281494140625, 1.6317596435546875, 1.690704345703125, 1.7496490478515625, 1.80859375]}, "gradients/decoder.transformer.h.6.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 3.0, 2.0, 6.0, 1.0, 2.0, 5.0, 5.0, 4.0, 8.0, 8.0, 9.0, 10.0, 14.0, 15.0, 18.0, 17.0, 20.0, 15.0, 23.0, 22.0, 31.0, 32.0, 26.0, 30.0, 35.0, 46.0, 37.0, 43.0, 1056.0, 36.0, 33.0, 27.0, 37.0, 35.0, 33.0, 43.0, 37.0, 23.0, 26.0, 16.0, 21.0, 22.0, 14.0, 22.0, 12.0, 7.0, 17.0, 13.0, 4.0, 8.0, 4.0, 3.0, 3.0, 1.0, 1.0, 3.0, 0.0, 1.0, 0.0, 1.0], "bins": [-4.91796875, -4.764892578125, -4.61181640625, -4.458740234375, -4.3056640625, -4.152587890625, -3.99951171875, -3.846435546875, -3.693359375, -3.540283203125, -3.38720703125, -3.234130859375, -3.0810546875, -2.927978515625, -2.77490234375, -2.621826171875, -2.46875, -2.315673828125, -2.16259765625, -2.009521484375, -1.8564453125, -1.703369140625, -1.55029296875, -1.397216796875, -1.244140625, -1.091064453125, -0.93798828125, -0.784912109375, -0.6318359375, -0.478759765625, -0.32568359375, -0.172607421875, -0.01953125, 0.133544921875, 0.28662109375, 0.439697265625, 0.5927734375, 0.745849609375, 0.89892578125, 1.052001953125, 1.205078125, 1.358154296875, 1.51123046875, 1.664306640625, 1.8173828125, 1.970458984375, 2.12353515625, 2.276611328125, 2.4296875, 2.582763671875, 2.73583984375, 2.888916015625, 3.0419921875, 3.195068359375, 3.34814453125, 3.501220703125, 3.654296875, 3.807373046875, 3.96044921875, 4.113525390625, 4.2666015625, 4.419677734375, 4.57275390625, 4.725830078125, 4.87890625]}, "gradients/decoder.transformer.h.6.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 3.0, 1.0, 1.0, 4.0, 1.0, 6.0, 11.0, 10.0, 12.0, 29.0, 21.0, 41.0, 59.0, 102.0, 156.0, 268.0, 369.0, 706.0, 1135.0, 2022.0, 3541.0, 6004.0, 10344.0, 18564.0, 34311.0, 64902.0, 132629.0, 1432323.0, 198833.0, 88331.0, 45971.0, 24408.0, 13560.0, 7700.0, 4421.0, 2590.0, 1496.0, 942.0, 505.0, 295.0, 195.0, 103.0, 81.0, 31.0, 34.0, 21.0, 15.0, 8.0, 7.0, 8.0, 4.0, 2.0, 1.0, 5.0, 3.0, 3.0, 0.0, 1.0, 0.0, 1.0], "bins": [-2.6640625, -2.580078125, -2.49609375, -2.412109375, -2.328125, -2.244140625, -2.16015625, -2.076171875, -1.9921875, -1.908203125, -1.82421875, -1.740234375, -1.65625, -1.572265625, -1.48828125, -1.404296875, -1.3203125, -1.236328125, -1.15234375, -1.068359375, -0.984375, -0.900390625, -0.81640625, -0.732421875, -0.6484375, -0.564453125, -0.48046875, -0.396484375, -0.3125, -0.228515625, -0.14453125, -0.060546875, 0.0234375, 0.107421875, 0.19140625, 0.275390625, 0.359375, 0.443359375, 0.52734375, 0.611328125, 0.6953125, 0.779296875, 0.86328125, 0.947265625, 1.03125, 1.115234375, 1.19921875, 1.283203125, 1.3671875, 1.451171875, 1.53515625, 1.619140625, 1.703125, 1.787109375, 1.87109375, 1.955078125, 2.0390625, 2.123046875, 2.20703125, 2.291015625, 2.375, 2.458984375, 2.54296875, 2.626953125, 2.7109375]}, "gradients/decoder.transformer.h.6.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 3.0, 5.0, 6.0, 5.0, 6.0, 3.0, 7.0, 16.0, 13.0, 14.0, 20.0, 21.0, 20.0, 30.0, 50.0, 51.0, 71.0, 97.0, 91.0, 99.0, 76.0, 56.0, 50.0, 43.0, 37.0, 28.0, 19.0, 15.0, 13.0, 8.0, 6.0, 9.0, 5.0, 8.0, 4.0, 3.0, 1.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0016326904296875, -0.0015811324119567871, -0.0015295743942260742, -0.0014780163764953613, -0.0014264583587646484, -0.0013749003410339355, -0.0013233423233032227, -0.0012717843055725098, -0.0012202262878417969, -0.001168668270111084, -0.001117110252380371, -0.0010655522346496582, -0.0010139942169189453, -0.0009624361991882324, -0.0009108781814575195, -0.0008593201637268066, -0.0008077621459960938, -0.0007562041282653809, -0.000704646110534668, -0.0006530880928039551, -0.0006015300750732422, -0.0005499720573425293, -0.0004984140396118164, -0.0004468560218811035, -0.0003952980041503906, -0.00034373998641967773, -0.00029218196868896484, -0.00024062395095825195, -0.00018906593322753906, -0.00013750791549682617, -8.594989776611328e-05, -3.439188003540039e-05, 1.71661376953125e-05, 6.872415542602539e-05, 0.00012028217315673828, 0.00017184019088745117, 0.00022339820861816406, 0.00027495622634887695, 0.00032651424407958984, 0.00037807226181030273, 0.0004296302795410156, 0.0004811882972717285, 0.0005327463150024414, 0.0005843043327331543, 0.0006358623504638672, 0.0006874203681945801, 0.000738978385925293, 0.0007905364036560059, 0.0008420944213867188, 0.0008936524391174316, 0.0009452104568481445, 0.0009967684745788574, 0.0010483264923095703, 0.0010998845100402832, 0.001151442527770996, 0.001203000545501709, 0.0012545585632324219, 0.0013061165809631348, 0.0013576745986938477, 0.0014092326164245605, 0.0014607906341552734, 0.0015123486518859863, 0.0015639066696166992, 0.0016154646873474121, 0.001667022705078125]}, "gradients/decoder.transformer.h.6.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 2.0, 2.0, 1.0, 0.0, 2.0, 1.0, 3.0, 3.0, 3.0, 3.0, 5.0, 8.0, 10.0, 12.0, 16.0, 16.0, 21.0, 41.0, 42.0, 58.0, 76.0, 143.0, 209.0, 526.0, 3518.0, 1038083.0, 4543.0, 504.0, 233.0, 137.0, 86.0, 60.0, 45.0, 22.0, 28.0, 16.0, 21.0, 13.0, 9.0, 9.0, 9.0, 9.0, 4.0, 5.0, 5.0, 4.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.037384033203125, -0.036269187927246094, -0.03515434265136719, -0.03403949737548828, -0.032924652099609375, -0.03180980682373047, -0.030694961547851562, -0.029580116271972656, -0.02846527099609375, -0.027350425720214844, -0.026235580444335938, -0.02512073516845703, -0.024005889892578125, -0.02289104461669922, -0.021776199340820312, -0.020661354064941406, -0.0195465087890625, -0.018431663513183594, -0.017316818237304688, -0.01620197296142578, -0.015087127685546875, -0.013972282409667969, -0.012857437133789062, -0.011742591857910156, -0.01062774658203125, -0.009512901306152344, -0.008398056030273438, -0.007283210754394531, -0.006168365478515625, -0.005053520202636719, -0.0039386749267578125, -0.0028238296508789062, -0.001708984375, -0.0005941390991210938, 0.0005207061767578125, 0.0016355514526367188, 0.002750396728515625, 0.0038652420043945312, 0.0049800872802734375, 0.006094932556152344, 0.00720977783203125, 0.008324623107910156, 0.009439468383789062, 0.010554313659667969, 0.011669158935546875, 0.012784004211425781, 0.013898849487304688, 0.015013694763183594, 0.0161285400390625, 0.017243385314941406, 0.018358230590820312, 0.01947307586669922, 0.020587921142578125, 0.02170276641845703, 0.022817611694335938, 0.023932456970214844, 0.02504730224609375, 0.026162147521972656, 0.027276992797851562, 0.02839183807373047, 0.029506683349609375, 0.03062152862548828, 0.03173637390136719, 0.032851219177246094, 0.033966064453125]}, "gradients/decoder.transformer.h.6.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 30.0, 632.0, 338.0, 16.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0016195472562685609, -0.001454607816413045, -0.0012896682601422071, -0.0011247288202866912, -0.0009597893222235143, -0.0007948498241603374, -0.0006299103843048215, -0.0004649708280339837, -0.00030003138817846775, -0.0001350919046672061, 2.9847578844055533e-05, 0.00019478704780340195, 0.0003597265458665788, 0.0005246660439297557, 0.0006896054837852716, 0.0008545450400561094, 0.0010194844799116254, 0.0011844239197671413, 0.0013493634760379791, 0.001514302915893495, 0.001679242355749011, 0.0018441819120198488, 0.002009121235460043, 0.0021740607917308807, 0.0023390003480017185, 0.0025039399042725563, 0.0026688792277127504, 0.0028338187839835882, 0.002998758340254426, 0.00316369766369462, 0.003328637219965458, 0.0034935767762362957, 0.003658515866845846, 0.003823455423116684, 0.003988394979387522, 0.0041533345356583595, 0.00431827362626791, 0.004483213182538748, 0.004648152738809586, 0.004813092295080423, 0.004978031851351261, 0.005142971407622099, 0.005307910963892937, 0.005472850054502487, 0.005637789610773325, 0.005802729167044163, 0.0059676687233150005, 0.006132608279585838, 0.006297547370195389, 0.006462486926466227, 0.006627426482737064, 0.006792365573346615, 0.006957305129617453, 0.00712224468588829, 0.007287184242159128, 0.007452123798429966, 0.007617063354700804, 0.0077820029109716415, 0.007946942001581192, 0.008111882023513317, 0.008276821114122868, 0.008441761136054993, 0.008606700226664543, 0.008771639317274094, 0.008936579339206219]}, "gradients/decoder.transformer.h.6.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 3.0, 2.0, 4.0, 6.0, 11.0, 3.0, 7.0, 9.0, 13.0, 8.0, 18.0, 14.0, 17.0, 28.0, 27.0, 22.0, 36.0, 33.0, 47.0, 37.0, 29.0, 32.0, 38.0, 35.0, 38.0, 41.0, 30.0, 29.0, 31.0, 31.0, 35.0, 39.0, 35.0, 26.0, 26.0, 22.0, 25.0, 19.0, 19.0, 18.0, 11.0, 13.0, 7.0, 8.0, 9.0, 8.0, 6.0, 3.0, 3.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0], "bins": [-0.0006828904151916504, -0.0006628455594182014, -0.0006428007036447525, -0.0006227558478713036, -0.0006027109920978546, -0.0005826661363244057, -0.0005626212805509567, -0.0005425764247775078, -0.0005225315690040588, -0.0005024867132306099, -0.00048244185745716095, -0.000462397001683712, -0.00044235214591026306, -0.0004223072901368141, -0.0004022624343633652, -0.00038221757858991623, -0.0003621727228164673, -0.00034212786704301834, -0.0003220830112695694, -0.00030203815549612045, -0.0002819932997226715, -0.00026194844394922256, -0.00024190358817577362, -0.00022185873240232468, -0.00020181387662887573, -0.0001817690208554268, -0.00016172416508197784, -0.0001416793093085289, -0.00012163445353507996, -0.00010158959776163101, -8.154474198818207e-05, -6.149988621473312e-05, -4.145503044128418e-05, -2.1410174667835236e-05, -1.3653188943862915e-06, 1.8679536879062653e-05, 3.87243926525116e-05, 5.876924842596054e-05, 7.881410419940948e-05, 9.885895997285843e-05, 0.00011890381574630737, 0.00013894867151975632, 0.00015899352729320526, 0.0001790383830666542, 0.00019908323884010315, 0.0002191280946135521, 0.00023917295038700104, 0.00025921780616045, 0.0002792626619338989, 0.00029930751770734787, 0.0003193523734807968, 0.00033939722925424576, 0.0003594420850276947, 0.00037948694080114365, 0.0003995317965745926, 0.00041957665234804153, 0.0004396215081214905, 0.0004596663638949394, 0.00047971121966838837, 0.0004997560754418373, 0.0005198009312152863, 0.0005398457869887352, 0.0005598906427621841, 0.0005799354985356331, 0.000599980354309082]}, "gradients/decoder.transformer.h.6.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 4.0, 2.0, 4.0, 5.0, 6.0, 8.0, 6.0, 10.0, 7.0, 11.0, 7.0, 6.0, 10.0, 12.0, 19.0, 21.0, 22.0, 16.0, 23.0, 44.0, 30.0, 32.0, 44.0, 41.0, 37.0, 36.0, 38.0, 25.0, 34.0, 51.0, 35.0, 33.0, 31.0, 34.0, 33.0, 27.0, 27.0, 29.0, 22.0, 23.0, 14.0, 12.0, 13.0, 14.0, 15.0, 9.0, 10.0, 6.0, 3.0, 3.0, 5.0, 1.0, 5.0, 1.0, 2.0, 0.0, 1.0, 2.0], "bins": [-8.234375, -7.98736572265625, -7.7403564453125, -7.49334716796875, -7.246337890625, -6.99932861328125, -6.7523193359375, -6.50531005859375, -6.25830078125, -6.01129150390625, -5.7642822265625, -5.51727294921875, -5.270263671875, -5.02325439453125, -4.7762451171875, -4.52923583984375, -4.2822265625, -4.03521728515625, -3.7882080078125, -3.54119873046875, -3.294189453125, -3.04718017578125, -2.8001708984375, -2.55316162109375, -2.30615234375, -2.05914306640625, -1.8121337890625, -1.56512451171875, -1.318115234375, -1.07110595703125, -0.8240966796875, -0.57708740234375, -0.330078125, -0.08306884765625, 0.1639404296875, 0.41094970703125, 0.657958984375, 0.90496826171875, 1.1519775390625, 1.39898681640625, 1.64599609375, 1.89300537109375, 2.1400146484375, 2.38702392578125, 2.634033203125, 2.88104248046875, 3.1280517578125, 3.37506103515625, 3.6220703125, 3.86907958984375, 4.1160888671875, 4.36309814453125, 4.610107421875, 4.85711669921875, 5.1041259765625, 5.35113525390625, 5.59814453125, 5.84515380859375, 6.0921630859375, 6.33917236328125, 6.586181640625, 6.83319091796875, 7.0802001953125, 7.32720947265625, 7.57421875]}, "gradients/decoder.transformer.h.6.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 3.0, 0.0, 1.0, 1.0, 4.0, 4.0, 8.0, 6.0, 11.0, 15.0, 14.0, 18.0, 20.0, 21.0, 22.0, 18.0, 30.0, 33.0, 77.0, 91.0, 103.0, 135.0, 206.0, 248.0, 337.0, 480.0, 593.0, 1019.0, 2604.0, 14085.0, 135739.0, 825995.0, 54401.0, 7427.0, 1791.0, 807.0, 525.0, 382.0, 268.0, 216.0, 188.0, 133.0, 111.0, 77.0, 68.0, 36.0, 48.0, 31.0, 31.0, 24.0, 18.0, 10.0, 6.0, 8.0, 8.0, 5.0, 6.0, 1.0, 4.0, 1.0, 1.0, 2.0], "bins": [-20.8125, -20.18798828125, -19.5634765625, -18.93896484375, -18.314453125, -17.68994140625, -17.0654296875, -16.44091796875, -15.81640625, -15.19189453125, -14.5673828125, -13.94287109375, -13.318359375, -12.69384765625, -12.0693359375, -11.44482421875, -10.8203125, -10.19580078125, -9.5712890625, -8.94677734375, -8.322265625, -7.69775390625, -7.0732421875, -6.44873046875, -5.82421875, -5.19970703125, -4.5751953125, -3.95068359375, -3.326171875, -2.70166015625, -2.0771484375, -1.45263671875, -0.828125, -0.20361328125, 0.4208984375, 1.04541015625, 1.669921875, 2.29443359375, 2.9189453125, 3.54345703125, 4.16796875, 4.79248046875, 5.4169921875, 6.04150390625, 6.666015625, 7.29052734375, 7.9150390625, 8.53955078125, 9.1640625, 9.78857421875, 10.4130859375, 11.03759765625, 11.662109375, 12.28662109375, 12.9111328125, 13.53564453125, 14.16015625, 14.78466796875, 15.4091796875, 16.03369140625, 16.658203125, 17.28271484375, 17.9072265625, 18.53173828125, 19.15625]}, "gradients/decoder.transformer.h.6.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 2.0, 2.0, 2.0, 3.0, 2.0, 6.0, 8.0, 5.0, 12.0, 16.0, 8.0, 11.0, 14.0, 14.0, 18.0, 22.0, 24.0, 26.0, 27.0, 30.0, 32.0, 36.0, 46.0, 37.0, 76.0, 417.0, 1611.0, 108.0, 61.0, 53.0, 50.0, 37.0, 34.0, 29.0, 21.0, 26.0, 18.0, 16.0, 16.0, 18.0, 18.0, 7.0, 6.0, 5.0, 8.0, 6.0, 5.0, 4.0, 5.0, 3.0, 0.0, 0.0, 0.0, 0.0, 5.0], "bins": [-28.28125, -27.475830078125, -26.67041015625, -25.864990234375, -25.0595703125, -24.254150390625, -23.44873046875, -22.643310546875, -21.837890625, -21.032470703125, -20.22705078125, -19.421630859375, -18.6162109375, -17.810791015625, -17.00537109375, -16.199951171875, -15.39453125, -14.589111328125, -13.78369140625, -12.978271484375, -12.1728515625, -11.367431640625, -10.56201171875, -9.756591796875, -8.951171875, -8.145751953125, -7.34033203125, -6.534912109375, -5.7294921875, -4.924072265625, -4.11865234375, -3.313232421875, -2.5078125, -1.702392578125, -0.89697265625, -0.091552734375, 0.7138671875, 1.519287109375, 2.32470703125, 3.130126953125, 3.935546875, 4.740966796875, 5.54638671875, 6.351806640625, 7.1572265625, 7.962646484375, 8.76806640625, 9.573486328125, 10.37890625, 11.184326171875, 11.98974609375, 12.795166015625, 13.6005859375, 14.406005859375, 15.21142578125, 16.016845703125, 16.822265625, 17.627685546875, 18.43310546875, 19.238525390625, 20.0439453125, 20.849365234375, 21.65478515625, 22.460205078125, 23.265625]}, "gradients/decoder.transformer.h.6.attn.c_attn.weight": {"_type": "histogram", "values": [2.0, 3.0, 0.0, 0.0, 0.0, 4.0, 4.0, 4.0, 5.0, 4.0, 8.0, 7.0, 5.0, 8.0, 13.0, 20.0, 23.0, 16.0, 20.0, 31.0, 28.0, 43.0, 56.0, 76.0, 117.0, 227.0, 511.0, 1902.0, 2962450.0, 177885.0, 1211.0, 375.0, 187.0, 113.0, 68.0, 52.0, 40.0, 27.0, 28.0, 27.0, 15.0, 14.0, 17.0, 10.0, 7.0, 12.0, 18.0, 5.0, 9.0, 4.0, 4.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-65.625, -63.353515625, -61.08203125, -58.810546875, -56.5390625, -54.267578125, -51.99609375, -49.724609375, -47.453125, -45.181640625, -42.91015625, -40.638671875, -38.3671875, -36.095703125, -33.82421875, -31.552734375, -29.28125, -27.009765625, -24.73828125, -22.466796875, -20.1953125, -17.923828125, -15.65234375, -13.380859375, -11.109375, -8.837890625, -6.56640625, -4.294921875, -2.0234375, 0.248046875, 2.51953125, 4.791015625, 7.0625, 9.333984375, 11.60546875, 13.876953125, 16.1484375, 18.419921875, 20.69140625, 22.962890625, 25.234375, 27.505859375, 29.77734375, 32.048828125, 34.3203125, 36.591796875, 38.86328125, 41.134765625, 43.40625, 45.677734375, 47.94921875, 50.220703125, 52.4921875, 54.763671875, 57.03515625, 59.306640625, 61.578125, 63.849609375, 66.12109375, 68.392578125, 70.6640625, 72.935546875, 75.20703125, 77.478515625, 79.75]}, "gradients/decoder.transformer.h.6.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 18.0, 346.0, 547.0, 99.0, 7.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-156.63739013671875, -153.6923370361328, -150.74729919433594, -147.80224609375, -144.85719299316406, -141.9121551513672, -138.96710205078125, -136.02206420898438, -133.07701110839844, -130.1319580078125, -127.1869125366211, -124.24186706542969, -121.29682159423828, -118.35177612304688, -115.40672302246094, -112.46167755126953, -109.51663208007812, -106.57158660888672, -103.62653350830078, -100.68148803710938, -97.73644256591797, -94.79139709472656, -91.84634399414062, -88.90129852294922, -85.95624542236328, -83.01119995117188, -80.06614685058594, -77.12110137939453, -74.17605590820312, -71.23101043701172, -68.28595733642578, -65.34091186523438, -62.39586639404297, -59.4508171081543, -56.50577163696289, -53.56072235107422, -50.61567687988281, -47.67062759399414, -44.72557830810547, -41.78053283691406, -38.83548355102539, -35.89043426513672, -32.94538879394531, -30.00033950805664, -27.055294036865234, -24.110244750976562, -21.165197372436523, -18.220149993896484, -15.275102615356445, -12.330055236816406, -9.385007858276367, -6.439959526062012, -3.4949121475219727, -0.5498647689819336, 2.395183563232422, 5.340230941772461, 8.2852783203125, 11.230325698852539, 14.175373077392578, 17.12042236328125, 20.065467834472656, 23.010517120361328, 25.955564498901367, 28.900611877441406, 31.845659255981445]}, "gradients/decoder.transformer.h.6.ln_1.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 0.0, 0.0, 2.0, 3.0, 4.0, 1.0, 1.0, 4.0, 4.0, 2.0, 6.0, 5.0, 7.0, 12.0, 9.0, 15.0, 12.0, 20.0, 24.0, 13.0, 19.0, 26.0, 24.0, 15.0, 40.0, 38.0, 37.0, 37.0, 52.0, 41.0, 39.0, 54.0, 39.0, 34.0, 33.0, 36.0, 34.0, 43.0, 25.0, 20.0, 20.0, 27.0, 18.0, 20.0, 14.0, 19.0, 16.0, 11.0, 5.0, 11.0, 8.0, 4.0, 5.0, 1.0, 3.0, 2.0, 3.0, 1.0, 1.0, 0.0, 2.0], "bins": [-67.83663177490234, -65.82483673095703, -63.81304168701172, -61.801246643066406, -59.789451599121094, -57.77765655517578, -55.76586151123047, -53.754066467285156, -51.742271423339844, -49.73047637939453, -47.71868133544922, -45.706886291503906, -43.695091247558594, -41.68329620361328, -39.67150115966797, -37.659706115722656, -35.64790725708008, -33.636112213134766, -31.624317169189453, -29.61252212524414, -27.600727081298828, -25.588932037353516, -23.57713508605957, -21.565340042114258, -19.553544998168945, -17.541749954223633, -15.52995491027832, -13.518158912658691, -11.506363868713379, -9.494568824768066, -7.4827728271484375, -5.470977783203125, -3.459186553955078, -1.4473912715911865, 0.5644040107727051, 2.576199531555176, 4.587994575500488, 6.599789619445801, 8.61158561706543, 10.623380661010742, 12.635175704956055, 14.646970748901367, 16.65876579284668, 18.670562744140625, 20.682357788085938, 22.69415283203125, 24.705947875976562, 26.717742919921875, 28.729537963867188, 30.7413330078125, 32.75312805175781, 34.764923095703125, 36.77671813964844, 38.78851318359375, 40.80030822753906, 42.812103271484375, 44.82389831542969, 46.835693359375, 48.84748840332031, 50.859283447265625, 52.87107849121094, 54.88287353515625, 56.89466857910156, 58.906463623046875, 60.91826248168945]}, "gradients/decoder.transformer.h.5.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 4.0, 2.0, 6.0, 7.0, 2.0, 10.0, 9.0, 8.0, 9.0, 8.0, 7.0, 15.0, 22.0, 27.0, 17.0, 16.0, 21.0, 27.0, 30.0, 36.0, 25.0, 54.0, 31.0, 40.0, 36.0, 33.0, 33.0, 45.0, 39.0, 31.0, 36.0, 32.0, 32.0, 30.0, 38.0, 26.0, 22.0, 20.0, 22.0, 13.0, 16.0, 19.0, 9.0, 9.0, 11.0, 5.0, 5.0, 7.0, 4.0, 3.0, 2.0, 1.0, 3.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-8.046875, -7.795166015625, -7.54345703125, -7.291748046875, -7.0400390625, -6.788330078125, -6.53662109375, -6.284912109375, -6.033203125, -5.781494140625, -5.52978515625, -5.278076171875, -5.0263671875, -4.774658203125, -4.52294921875, -4.271240234375, -4.01953125, -3.767822265625, -3.51611328125, -3.264404296875, -3.0126953125, -2.760986328125, -2.50927734375, -2.257568359375, -2.005859375, -1.754150390625, -1.50244140625, -1.250732421875, -0.9990234375, -0.747314453125, -0.49560546875, -0.243896484375, 0.0078125, 0.259521484375, 0.51123046875, 0.762939453125, 1.0146484375, 1.266357421875, 1.51806640625, 1.769775390625, 2.021484375, 2.273193359375, 2.52490234375, 2.776611328125, 3.0283203125, 3.280029296875, 3.53173828125, 3.783447265625, 4.03515625, 4.286865234375, 4.53857421875, 4.790283203125, 5.0419921875, 5.293701171875, 5.54541015625, 5.797119140625, 6.048828125, 6.300537109375, 6.55224609375, 6.803955078125, 7.0556640625, 7.307373046875, 7.55908203125, 7.810791015625, 8.0625]}, "gradients/decoder.transformer.h.5.mlp.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 5.0, 2.0, 6.0, 6.0, 3.0, 5.0, 6.0, 4.0, 9.0, 12.0, 12.0, 20.0, 20.0, 36.0, 40.0, 48.0, 53.0, 88.0, 111.0, 160.0, 186.0, 268.0, 401.0, 774.0, 1935.0, 6569.0, 41341.0, 983586.0, 2886087.0, 248857.0, 16770.0, 3683.0, 1257.0, 606.0, 376.0, 217.0, 181.0, 129.0, 88.0, 69.0, 64.0, 46.0, 38.0, 23.0, 22.0, 19.0, 12.0, 13.0, 8.0, 3.0, 9.0, 5.0, 6.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0], "bins": [-21.90625, -21.2060546875, -20.505859375, -19.8056640625, -19.10546875, -18.4052734375, -17.705078125, -17.0048828125, -16.3046875, -15.6044921875, -14.904296875, -14.2041015625, -13.50390625, -12.8037109375, -12.103515625, -11.4033203125, -10.703125, -10.0029296875, -9.302734375, -8.6025390625, -7.90234375, -7.2021484375, -6.501953125, -5.8017578125, -5.1015625, -4.4013671875, -3.701171875, -3.0009765625, -2.30078125, -1.6005859375, -0.900390625, -0.2001953125, 0.5, 1.2001953125, 1.900390625, 2.6005859375, 3.30078125, 4.0009765625, 4.701171875, 5.4013671875, 6.1015625, 6.8017578125, 7.501953125, 8.2021484375, 8.90234375, 9.6025390625, 10.302734375, 11.0029296875, 11.703125, 12.4033203125, 13.103515625, 13.8037109375, 14.50390625, 15.2041015625, 15.904296875, 16.6044921875, 17.3046875, 18.0048828125, 18.705078125, 19.4052734375, 20.10546875, 20.8056640625, 21.505859375, 22.2060546875, 22.90625]}, "gradients/decoder.transformer.h.5.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 1.0, 2.0, 4.0, 6.0, 10.0, 16.0, 30.0, 45.0, 85.0, 138.0, 249.0, 449.0, 715.0, 891.0, 644.0, 355.0, 183.0, 116.0, 62.0, 32.0, 21.0, 14.0, 3.0, 6.0, 5.0, 0.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-27.3125, -26.518310546875, -25.72412109375, -24.929931640625, -24.1357421875, -23.341552734375, -22.54736328125, -21.753173828125, -20.958984375, -20.164794921875, -19.37060546875, -18.576416015625, -17.7822265625, -16.988037109375, -16.19384765625, -15.399658203125, -14.60546875, -13.811279296875, -13.01708984375, -12.222900390625, -11.4287109375, -10.634521484375, -9.84033203125, -9.046142578125, -8.251953125, -7.457763671875, -6.66357421875, -5.869384765625, -5.0751953125, -4.281005859375, -3.48681640625, -2.692626953125, -1.8984375, -1.104248046875, -0.31005859375, 0.484130859375, 1.2783203125, 2.072509765625, 2.86669921875, 3.660888671875, 4.455078125, 5.249267578125, 6.04345703125, 6.837646484375, 7.6318359375, 8.426025390625, 9.22021484375, 10.014404296875, 10.80859375, 11.602783203125, 12.39697265625, 13.191162109375, 13.9853515625, 14.779541015625, 15.57373046875, 16.367919921875, 17.162109375, 17.956298828125, 18.75048828125, 19.544677734375, 20.3388671875, 21.133056640625, 21.92724609375, 22.721435546875, 23.515625]}, "gradients/decoder.transformer.h.5.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 1.0, 3.0, 3.0, 2.0, 3.0, 2.0, 9.0, 9.0, 14.0, 19.0, 26.0, 37.0, 41.0, 71.0, 78.0, 136.0, 172.0, 312.0, 664.0, 2067.0, 48471.0, 4110100.0, 28571.0, 1946.0, 595.0, 293.0, 174.0, 129.0, 100.0, 70.0, 46.0, 31.0, 24.0, 9.0, 18.0, 10.0, 9.0, 9.0, 9.0, 3.0, 3.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-70.0625, -67.68359375, -65.3046875, -62.92578125, -60.546875, -58.16796875, -55.7890625, -53.41015625, -51.03125, -48.65234375, -46.2734375, -43.89453125, -41.515625, -39.13671875, -36.7578125, -34.37890625, -32.0, -29.62109375, -27.2421875, -24.86328125, -22.484375, -20.10546875, -17.7265625, -15.34765625, -12.96875, -10.58984375, -8.2109375, -5.83203125, -3.453125, -1.07421875, 1.3046875, 3.68359375, 6.0625, 8.44140625, 10.8203125, 13.19921875, 15.578125, 17.95703125, 20.3359375, 22.71484375, 25.09375, 27.47265625, 29.8515625, 32.23046875, 34.609375, 36.98828125, 39.3671875, 41.74609375, 44.125, 46.50390625, 48.8828125, 51.26171875, 53.640625, 56.01953125, 58.3984375, 60.77734375, 63.15625, 65.53515625, 67.9140625, 70.29296875, 72.671875, 75.05078125, 77.4296875, 79.80859375, 82.1875]}, "gradients/decoder.transformer.h.5.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 414.0, 598.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-910.3716430664062, -885.8063354492188, -861.2410888671875, -836.67578125, -812.1104736328125, -787.5452270507812, -762.9799194335938, -738.4146728515625, -713.849365234375, -689.2840576171875, -664.7188110351562, -640.1535034179688, -615.5882568359375, -591.02294921875, -566.4576416015625, -541.892333984375, -517.3270874023438, -492.7618103027344, -468.196533203125, -443.6312255859375, -419.0659484863281, -394.50067138671875, -369.93536376953125, -345.3700866699219, -320.8048095703125, -296.2395324707031, -271.67425537109375, -247.10894775390625, -222.54367065429688, -197.9783935546875, -173.41310119628906, -148.84780883789062, -124.2825927734375, -99.7173080444336, -75.15202331542969, -50.58673858642578, -26.021453857421875, -1.4561691284179688, 23.109115600585938, 47.674407958984375, 72.23968505859375, 96.80496978759766, 121.37025451660156, 145.935546875, 170.50082397460938, 195.06610107421875, 219.6313934326172, 244.19668579101562, 268.761962890625, 293.3272399902344, 317.89251708984375, 342.45782470703125, 367.0231018066406, 391.58837890625, 416.1536865234375, 440.7189636230469, 465.28424072265625, 489.8495178222656, 514.414794921875, 538.9801025390625, 563.54541015625, 588.1106567382812, 612.6759643554688, 637.2412109375, 661.8065185546875]}, "gradients/decoder.transformer.h.5.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 2.0, 0.0, 8.0, 2.0, 6.0, 4.0, 7.0, 4.0, 11.0, 14.0, 22.0, 19.0, 26.0, 26.0, 23.0, 29.0, 35.0, 34.0, 32.0, 33.0, 41.0, 40.0, 46.0, 41.0, 42.0, 34.0, 43.0, 42.0, 47.0, 42.0, 48.0, 30.0, 31.0, 18.0, 19.0, 21.0, 25.0, 25.0, 13.0, 11.0, 4.0, 2.0, 7.0, 2.0, 4.0, 1.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-48.35646057128906, -46.61703109741211, -44.877601623535156, -43.1381721496582, -41.39874267578125, -39.6593132019043, -37.919883728027344, -36.180450439453125, -34.44102478027344, -32.701595306396484, -30.96216583251953, -29.222736358642578, -27.483306884765625, -25.743877410888672, -24.004446029663086, -22.265016555786133, -20.525585174560547, -18.786155700683594, -17.04672622680664, -15.307295799255371, -13.567866325378418, -11.828436851501465, -10.089006423950195, -8.349576950073242, -6.610147476196289, -4.870718002319336, -3.1312880516052246, -1.3918581008911133, 0.34757137298583984, 2.087000846862793, 3.8264312744140625, 5.565860748291016, 7.305290222167969, 9.044719696044922, 10.784149169921875, 12.523579597473145, 14.263009071350098, 16.002437591552734, 17.74186897277832, 19.481298446655273, 21.220727920532227, 22.96015739440918, 24.699586868286133, 26.43901824951172, 28.178447723388672, 29.917877197265625, 31.657306671142578, 33.39673614501953, 35.136165618896484, 36.87559509277344, 38.61502456665039, 40.354454040527344, 42.0938835144043, 43.83331298828125, 45.57274627685547, 47.312171936035156, 49.051605224609375, 50.79103469848633, 52.53046417236328, 54.269893646240234, 56.00932312011719, 57.74875259399414, 59.488182067871094, 61.22761535644531, 62.967041015625]}, "gradients/decoder.transformer.h.5.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 1.0, 3.0, 3.0, 5.0, 4.0, 5.0, 7.0, 7.0, 9.0, 17.0, 11.0, 19.0, 18.0, 26.0, 25.0, 33.0, 16.0, 27.0, 33.0, 35.0, 40.0, 34.0, 48.0, 30.0, 41.0, 33.0, 41.0, 41.0, 41.0, 47.0, 36.0, 34.0, 32.0, 32.0, 20.0, 27.0, 22.0, 15.0, 20.0, 16.0, 11.0, 10.0, 9.0, 11.0, 3.0, 4.0, 3.0, 2.0, 3.0, 2.0, 0.0, 1.0, 1.0], "bins": [-10.2265625, -9.9412841796875, -9.656005859375, -9.3707275390625, -9.08544921875, -8.8001708984375, -8.514892578125, -8.2296142578125, -7.9443359375, -7.6590576171875, -7.373779296875, -7.0885009765625, -6.80322265625, -6.5179443359375, -6.232666015625, -5.9473876953125, -5.662109375, -5.3768310546875, -5.091552734375, -4.8062744140625, -4.52099609375, -4.2357177734375, -3.950439453125, -3.6651611328125, -3.3798828125, -3.0946044921875, -2.809326171875, -2.5240478515625, -2.23876953125, -1.9534912109375, -1.668212890625, -1.3829345703125, -1.09765625, -0.8123779296875, -0.527099609375, -0.2418212890625, 0.04345703125, 0.3287353515625, 0.614013671875, 0.8992919921875, 1.1845703125, 1.4698486328125, 1.755126953125, 2.0404052734375, 2.32568359375, 2.6109619140625, 2.896240234375, 3.1815185546875, 3.466796875, 3.7520751953125, 4.037353515625, 4.3226318359375, 4.60791015625, 4.8931884765625, 5.178466796875, 5.4637451171875, 5.7490234375, 6.0343017578125, 6.319580078125, 6.6048583984375, 6.89013671875, 7.1754150390625, 7.460693359375, 7.7459716796875, 8.03125]}, "gradients/decoder.transformer.h.5.crossattention.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 2.0, 1.0, 6.0, 0.0, 10.0, 8.0, 8.0, 12.0, 16.0, 25.0, 37.0, 47.0, 100.0, 87.0, 160.0, 240.0, 304.0, 497.0, 685.0, 1111.0, 1575.0, 2588.0, 3909.0, 5974.0, 9387.0, 14563.0, 22785.0, 36158.0, 59710.0, 103222.0, 235543.0, 270962.0, 110918.0, 62424.0, 38086.0, 24250.0, 15105.0, 9676.0, 6319.0, 4106.0, 2676.0, 1741.0, 1159.0, 789.0, 494.0, 379.0, 214.0, 178.0, 110.0, 60.0, 49.0, 36.0, 23.0, 20.0, 14.0, 3.0, 1.0, 7.0, 1.0, 2.0, 0.0, 2.0], "bins": [-2.419921875, -2.346710205078125, -2.27349853515625, -2.200286865234375, -2.1270751953125, -2.053863525390625, -1.98065185546875, -1.907440185546875, -1.834228515625, -1.761016845703125, -1.68780517578125, -1.614593505859375, -1.5413818359375, -1.468170166015625, -1.39495849609375, -1.321746826171875, -1.24853515625, -1.175323486328125, -1.10211181640625, -1.028900146484375, -0.9556884765625, -0.882476806640625, -0.80926513671875, -0.736053466796875, -0.662841796875, -0.589630126953125, -0.51641845703125, -0.443206787109375, -0.3699951171875, -0.296783447265625, -0.22357177734375, -0.150360107421875, -0.0771484375, -0.003936767578125, 0.06927490234375, 0.142486572265625, 0.2156982421875, 0.288909912109375, 0.36212158203125, 0.435333251953125, 0.508544921875, 0.581756591796875, 0.65496826171875, 0.728179931640625, 0.8013916015625, 0.874603271484375, 0.94781494140625, 1.021026611328125, 1.09423828125, 1.167449951171875, 1.24066162109375, 1.313873291015625, 1.3870849609375, 1.460296630859375, 1.53350830078125, 1.606719970703125, 1.679931640625, 1.753143310546875, 1.82635498046875, 1.899566650390625, 1.9727783203125, 2.045989990234375, 2.11920166015625, 2.192413330078125, 2.265625]}, "gradients/decoder.transformer.h.5.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 4.0, 4.0, 5.0, 4.0, 11.0, 11.0, 12.0, 12.0, 20.0, 25.0, 23.0, 21.0, 23.0, 22.0, 24.0, 24.0, 31.0, 48.0, 38.0, 50.0, 47.0, 1065.0, 40.0, 47.0, 37.0, 31.0, 48.0, 37.0, 39.0, 29.0, 26.0, 26.0, 31.0, 18.0, 18.0, 19.0, 12.0, 13.0, 15.0, 10.0, 5.0, 6.0, 1.0, 5.0, 2.0, 0.0, 0.0, 2.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-5.109375, -4.91204833984375, -4.7147216796875, -4.51739501953125, -4.320068359375, -4.12274169921875, -3.9254150390625, -3.72808837890625, -3.53076171875, -3.33343505859375, -3.1361083984375, -2.93878173828125, -2.741455078125, -2.54412841796875, -2.3468017578125, -2.14947509765625, -1.9521484375, -1.75482177734375, -1.5574951171875, -1.36016845703125, -1.162841796875, -0.96551513671875, -0.7681884765625, -0.57086181640625, -0.37353515625, -0.17620849609375, 0.0211181640625, 0.21844482421875, 0.415771484375, 0.61309814453125, 0.8104248046875, 1.00775146484375, 1.205078125, 1.40240478515625, 1.5997314453125, 1.79705810546875, 1.994384765625, 2.19171142578125, 2.3890380859375, 2.58636474609375, 2.78369140625, 2.98101806640625, 3.1783447265625, 3.37567138671875, 3.572998046875, 3.77032470703125, 3.9676513671875, 4.16497802734375, 4.3623046875, 4.55963134765625, 4.7569580078125, 4.95428466796875, 5.151611328125, 5.34893798828125, 5.5462646484375, 5.74359130859375, 5.94091796875, 6.13824462890625, 6.3355712890625, 6.53289794921875, 6.730224609375, 6.92755126953125, 7.1248779296875, 7.32220458984375, 7.51953125]}, "gradients/decoder.transformer.h.5.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 3.0, 1.0, 1.0, 1.0, 5.0, 4.0, 7.0, 7.0, 19.0, 25.0, 21.0, 52.0, 65.0, 97.0, 183.0, 319.0, 514.0, 991.0, 1891.0, 3632.0, 6810.0, 12963.0, 25600.0, 51835.0, 110597.0, 1409315.0, 277959.0, 98802.0, 46844.0, 23152.0, 12060.0, 6171.0, 3255.0, 1755.0, 927.0, 498.0, 282.0, 171.0, 104.0, 74.0, 42.0, 23.0, 20.0, 18.0, 10.0, 7.0, 8.0, 3.0, 0.0, 3.0, 0.0, 2.0], "bins": [-4.125, -4.0162353515625, -3.907470703125, -3.7987060546875, -3.68994140625, -3.5811767578125, -3.472412109375, -3.3636474609375, -3.2548828125, -3.1461181640625, -3.037353515625, -2.9285888671875, -2.81982421875, -2.7110595703125, -2.602294921875, -2.4935302734375, -2.384765625, -2.2760009765625, -2.167236328125, -2.0584716796875, -1.94970703125, -1.8409423828125, -1.732177734375, -1.6234130859375, -1.5146484375, -1.4058837890625, -1.297119140625, -1.1883544921875, -1.07958984375, -0.9708251953125, -0.862060546875, -0.7532958984375, -0.64453125, -0.5357666015625, -0.427001953125, -0.3182373046875, -0.20947265625, -0.1007080078125, 0.008056640625, 0.1168212890625, 0.2255859375, 0.3343505859375, 0.443115234375, 0.5518798828125, 0.66064453125, 0.7694091796875, 0.878173828125, 0.9869384765625, 1.095703125, 1.2044677734375, 1.313232421875, 1.4219970703125, 1.53076171875, 1.6395263671875, 1.748291015625, 1.8570556640625, 1.9658203125, 2.0745849609375, 2.183349609375, 2.2921142578125, 2.40087890625, 2.5096435546875, 2.618408203125, 2.7271728515625, 2.8359375]}, "gradients/decoder.transformer.h.5.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 1.0, 4.0, 1.0, 4.0, 7.0, 3.0, 3.0, 8.0, 7.0, 5.0, 4.0, 13.0, 16.0, 21.0, 27.0, 23.0, 43.0, 47.0, 80.0, 75.0, 74.0, 95.0, 91.0, 78.0, 51.0, 47.0, 47.0, 26.0, 20.0, 18.0, 14.0, 13.0, 9.0, 7.0, 6.0, 6.0, 6.0, 0.0, 5.0, 3.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0015811920166015625, -0.0015336424112319946, -0.0014860928058624268, -0.0014385432004928589, -0.001390993595123291, -0.0013434439897537231, -0.0012958943843841553, -0.0012483447790145874, -0.0012007951736450195, -0.0011532455682754517, -0.0011056959629058838, -0.001058146357536316, -0.001010596752166748, -0.0009630471467971802, -0.0009154975414276123, -0.0008679479360580444, -0.0008203983306884766, -0.0007728487253189087, -0.0007252991199493408, -0.000677749514579773, -0.0006301999092102051, -0.0005826503038406372, -0.0005351006984710693, -0.00048755109310150146, -0.0004400014877319336, -0.0003924518823623657, -0.00034490227699279785, -0.00029735267162323, -0.0002498030662536621, -0.00020225346088409424, -0.00015470385551452637, -0.0001071542501449585, -5.9604644775390625e-05, -1.2055039405822754e-05, 3.549456596374512e-05, 8.304417133331299e-05, 0.00013059377670288086, 0.00017814338207244873, 0.0002256929874420166, 0.00027324259281158447, 0.00032079219818115234, 0.0003683418035507202, 0.0004158914089202881, 0.00046344101428985596, 0.0005109906196594238, 0.0005585402250289917, 0.0006060898303985596, 0.0006536394357681274, 0.0007011890411376953, 0.0007487386465072632, 0.0007962882518768311, 0.0008438378572463989, 0.0008913874626159668, 0.0009389370679855347, 0.0009864866733551025, 0.0010340362787246704, 0.0010815858840942383, 0.0011291354894638062, 0.001176685094833374, 0.001224234700202942, 0.0012717843055725098, 0.0013193339109420776, 0.0013668835163116455, 0.0014144331216812134, 0.0014619827270507812]}, "gradients/decoder.transformer.h.5.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 5.0, 2.0, 1.0, 3.0, 0.0, 4.0, 3.0, 6.0, 7.0, 10.0, 9.0, 15.0, 13.0, 25.0, 26.0, 36.0, 50.0, 60.0, 91.0, 143.0, 252.0, 451.0, 1794.0, 991661.0, 52133.0, 800.0, 338.0, 203.0, 111.0, 80.0, 56.0, 32.0, 36.0, 25.0, 15.0, 14.0, 9.0, 11.0, 10.0, 3.0, 6.0, 5.0, 4.0, 5.0, 2.0, 0.0, 2.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-0.0322265625, -0.03124380111694336, -0.03026103973388672, -0.029278278350830078, -0.028295516967773438, -0.027312755584716797, -0.026329994201660156, -0.025347232818603516, -0.024364471435546875, -0.023381710052490234, -0.022398948669433594, -0.021416187286376953, -0.020433425903320312, -0.019450664520263672, -0.01846790313720703, -0.01748514175415039, -0.01650238037109375, -0.01551961898803711, -0.014536857604980469, -0.013554096221923828, -0.012571334838867188, -0.011588573455810547, -0.010605812072753906, -0.009623050689697266, -0.008640289306640625, -0.007657527923583984, -0.006674766540527344, -0.005692005157470703, -0.0047092437744140625, -0.003726482391357422, -0.0027437210083007812, -0.0017609596252441406, -0.0007781982421875, 0.00020456314086914062, 0.0011873245239257812, 0.002170085906982422, 0.0031528472900390625, 0.004135608673095703, 0.005118370056152344, 0.006101131439208984, 0.007083892822265625, 0.008066654205322266, 0.009049415588378906, 0.010032176971435547, 0.011014938354492188, 0.011997699737548828, 0.012980461120605469, 0.01396322250366211, 0.01494598388671875, 0.01592874526977539, 0.01691150665283203, 0.017894268035888672, 0.018877029418945312, 0.019859790802001953, 0.020842552185058594, 0.021825313568115234, 0.022808074951171875, 0.023790836334228516, 0.024773597717285156, 0.025756359100341797, 0.026739120483398438, 0.027721881866455078, 0.02870464324951172, 0.02968740463256836, 0.030670166015625]}, "gradients/decoder.transformer.h.5.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 20.0, 102.0, 285.0, 377.0, 175.0, 38.0, 12.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0034552295692265034, -0.003380466951057315, -0.00330570456571877, -0.0032309419475495815, -0.0031561795622110367, -0.003081416944041848, -0.0030066545587033033, -0.002931891940534115, -0.00285712955519557, -0.0027823669370263815, -0.0027076045516878366, -0.002632841933518648, -0.0025580795481801033, -0.002483316930010915, -0.00240855454467237, -0.0023337919265031815, -0.002259029308333993, -0.0021842666901648045, -0.0021095043048262596, -0.002034741686657071, -0.0019599793013185263, -0.0018852166831493378, -0.001810454181395471, -0.0017356916796416044, -0.0016609291778877378, -0.001586166676133871, -0.0015114041743800044, -0.0014366416726261377, -0.0013618790544569492, -0.0012871166691184044, -0.0012123540509492159, -0.0011375915491953492, -0.0010628288146108389, -0.0009880663128569722, -0.0009133038111031055, -0.000838541251141578, -0.0007637787493877113, -0.0006890162476338446, -0.000614253687672317, -0.0005394911859184504, -0.0004647286841645837, -0.000389966182410717, -0.0003152036515530199, -0.00024044113524723798, -0.00016567861894145608, -9.091611718758941e-05, -1.6153586329892278e-05, 5.860894452780485e-05, 0.00013337144628167152, 0.00020813396258745342, 0.0002828964788932353, 0.00035765900975093246, 0.00043242151150479913, 0.0005071840132586658, 0.0005819465732201934, 0.0006567090749740601, 0.0007314715767279267, 0.0008062340784817934, 0.0008809965802356601, 0.0009557591401971877, 0.0010305217001587152, 0.00110528408549726, 0.0011800467036664486, 0.0012548092054203153, 0.001329571707174182]}, "gradients/decoder.transformer.h.5.ln_cross_attn.bias": {"_type": "histogram", "values": [4.0, 0.0, 1.0, 0.0, 0.0, 4.0, 3.0, 3.0, 3.0, 3.0, 8.0, 13.0, 10.0, 17.0, 14.0, 20.0, 13.0, 21.0, 39.0, 31.0, 23.0, 34.0, 28.0, 37.0, 31.0, 39.0, 42.0, 41.0, 40.0, 53.0, 27.0, 31.0, 42.0, 45.0, 43.0, 22.0, 25.0, 27.0, 30.0, 19.0, 22.0, 20.0, 15.0, 17.0, 13.0, 4.0, 9.0, 6.0, 5.0, 6.0, 5.0, 6.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0006074309349060059, -0.0005867434665560722, -0.0005660559982061386, -0.000545368529856205, -0.0005246810615062714, -0.0005039935931563377, -0.0004833061248064041, -0.0004626186564564705, -0.00044193118810653687, -0.00042124371975660324, -0.0004005562514066696, -0.000379868783056736, -0.00035918131470680237, -0.00033849384635686874, -0.0003178063780069351, -0.0002971189096570015, -0.00027643144130706787, -0.00025574397295713425, -0.00023505650460720062, -0.000214369036257267, -0.00019368156790733337, -0.00017299409955739975, -0.00015230663120746613, -0.0001316191628575325, -0.00011093169450759888, -9.024422615766525e-05, -6.955675780773163e-05, -4.8869289457798004e-05, -2.818182110786438e-05, -7.494352757930756e-06, 1.3193115592002869e-05, 3.388058394193649e-05, 5.456805229187012e-05, 7.525552064180374e-05, 9.594298899173737e-05, 0.00011663045734167099, 0.00013731792569160461, 0.00015800539404153824, 0.00017869286239147186, 0.0001993803307414055, 0.0002200677990913391, 0.00024075526744127274, 0.00026144273579120636, 0.00028213020414114, 0.0003028176724910736, 0.00032350514084100723, 0.00034419260919094086, 0.0003648800775408745, 0.0003855675458908081, 0.00040625501424074173, 0.00042694248259067535, 0.000447629950940609, 0.0004683174192905426, 0.0004890048876404762, 0.0005096923559904099, 0.0005303798243403435, 0.0005510672926902771, 0.0005717547610402107, 0.0005924422293901443, 0.000613129697740078, 0.0006338171660900116, 0.0006545046344399452, 0.0006751921027898788, 0.0006958795711398125, 0.0007165670394897461]}, "gradients/decoder.transformer.h.5.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 1.0, 3.0, 3.0, 5.0, 4.0, 5.0, 7.0, 7.0, 9.0, 17.0, 11.0, 19.0, 18.0, 26.0, 25.0, 33.0, 16.0, 27.0, 33.0, 35.0, 40.0, 34.0, 48.0, 30.0, 41.0, 33.0, 41.0, 41.0, 41.0, 47.0, 36.0, 34.0, 32.0, 32.0, 20.0, 27.0, 22.0, 15.0, 20.0, 16.0, 11.0, 10.0, 9.0, 11.0, 3.0, 4.0, 3.0, 2.0, 3.0, 2.0, 0.0, 1.0, 1.0], "bins": [-10.2265625, -9.9412841796875, -9.656005859375, -9.3707275390625, -9.08544921875, -8.8001708984375, -8.514892578125, -8.2296142578125, -7.9443359375, -7.6590576171875, -7.373779296875, -7.0885009765625, -6.80322265625, -6.5179443359375, -6.232666015625, -5.9473876953125, -5.662109375, -5.3768310546875, -5.091552734375, -4.8062744140625, -4.52099609375, -4.2357177734375, -3.950439453125, -3.6651611328125, -3.3798828125, -3.0946044921875, -2.809326171875, -2.5240478515625, -2.23876953125, -1.9534912109375, -1.668212890625, -1.3829345703125, -1.09765625, -0.8123779296875, -0.527099609375, -0.2418212890625, 0.04345703125, 0.3287353515625, 0.614013671875, 0.8992919921875, 1.1845703125, 1.4698486328125, 1.755126953125, 2.0404052734375, 2.32568359375, 2.6109619140625, 2.896240234375, 3.1815185546875, 3.466796875, 3.7520751953125, 4.037353515625, 4.3226318359375, 4.60791015625, 4.8931884765625, 5.178466796875, 5.4637451171875, 5.7490234375, 6.0343017578125, 6.319580078125, 6.6048583984375, 6.89013671875, 7.1754150390625, 7.460693359375, 7.7459716796875, 8.03125]}, "gradients/decoder.transformer.h.5.attn.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 2.0, 1.0, 2.0, 2.0, 5.0, 2.0, 6.0, 6.0, 7.0, 7.0, 11.0, 19.0, 19.0, 31.0, 32.0, 35.0, 56.0, 72.0, 106.0, 131.0, 174.0, 290.0, 495.0, 1040.0, 2671.0, 7652.0, 23067.0, 76403.0, 328410.0, 455880.0, 105672.0, 29887.0, 9798.0, 3525.0, 1375.0, 612.0, 315.0, 185.0, 134.0, 97.0, 77.0, 54.0, 53.0, 43.0, 27.0, 22.0, 18.0, 16.0, 6.0, 7.0, 5.0, 3.0, 3.0, 1.0, 0.0, 1.0, 1.0], "bins": [-13.6328125, -13.25537109375, -12.8779296875, -12.50048828125, -12.123046875, -11.74560546875, -11.3681640625, -10.99072265625, -10.61328125, -10.23583984375, -9.8583984375, -9.48095703125, -9.103515625, -8.72607421875, -8.3486328125, -7.97119140625, -7.59375, -7.21630859375, -6.8388671875, -6.46142578125, -6.083984375, -5.70654296875, -5.3291015625, -4.95166015625, -4.57421875, -4.19677734375, -3.8193359375, -3.44189453125, -3.064453125, -2.68701171875, -2.3095703125, -1.93212890625, -1.5546875, -1.17724609375, -0.7998046875, -0.42236328125, -0.044921875, 0.33251953125, 0.7099609375, 1.08740234375, 1.46484375, 1.84228515625, 2.2197265625, 2.59716796875, 2.974609375, 3.35205078125, 3.7294921875, 4.10693359375, 4.484375, 4.86181640625, 5.2392578125, 5.61669921875, 5.994140625, 6.37158203125, 6.7490234375, 7.12646484375, 7.50390625, 7.88134765625, 8.2587890625, 8.63623046875, 9.013671875, 9.39111328125, 9.7685546875, 10.14599609375, 10.5234375]}, "gradients/decoder.transformer.h.5.attn.c_attn.bias": {"_type": "histogram", "values": [4.0, 1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 5.0, 5.0, 6.0, 9.0, 14.0, 8.0, 7.0, 13.0, 10.0, 11.0, 21.0, 13.0, 22.0, 18.0, 26.0, 32.0, 37.0, 45.0, 48.0, 60.0, 60.0, 118.0, 294.0, 1540.0, 128.0, 83.0, 46.0, 45.0, 52.0, 34.0, 33.0, 26.0, 35.0, 23.0, 26.0, 19.0, 10.0, 17.0, 9.0, 13.0, 12.0, 7.0, 5.0, 3.0, 2.0, 1.0, 1.0, 4.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 1.0], "bins": [-25.875, -25.072021484375, -24.26904296875, -23.466064453125, -22.6630859375, -21.860107421875, -21.05712890625, -20.254150390625, -19.451171875, -18.648193359375, -17.84521484375, -17.042236328125, -16.2392578125, -15.436279296875, -14.63330078125, -13.830322265625, -13.02734375, -12.224365234375, -11.42138671875, -10.618408203125, -9.8154296875, -9.012451171875, -8.20947265625, -7.406494140625, -6.603515625, -5.800537109375, -4.99755859375, -4.194580078125, -3.3916015625, -2.588623046875, -1.78564453125, -0.982666015625, -0.1796875, 0.623291015625, 1.42626953125, 2.229248046875, 3.0322265625, 3.835205078125, 4.63818359375, 5.441162109375, 6.244140625, 7.047119140625, 7.85009765625, 8.653076171875, 9.4560546875, 10.259033203125, 11.06201171875, 11.864990234375, 12.66796875, 13.470947265625, 14.27392578125, 15.076904296875, 15.8798828125, 16.682861328125, 17.48583984375, 18.288818359375, 19.091796875, 19.894775390625, 20.69775390625, 21.500732421875, 22.3037109375, 23.106689453125, 23.90966796875, 24.712646484375, 25.515625]}, "gradients/decoder.transformer.h.5.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 3.0, 4.0, 4.0, 3.0, 15.0, 7.0, 8.0, 15.0, 19.0, 22.0, 36.0, 39.0, 49.0, 56.0, 84.0, 106.0, 151.0, 216.0, 317.0, 552.0, 1416.0, 16373.0, 1054084.0, 2047775.0, 20917.0, 1649.0, 570.0, 332.0, 224.0, 173.0, 105.0, 81.0, 75.0, 41.0, 40.0, 33.0, 23.0, 31.0, 13.0, 20.0, 12.0, 6.0, 5.0, 5.0, 2.0, 2.0, 4.0, 3.0, 4.0], "bins": [-41.96875, -40.865478515625, -39.76220703125, -38.658935546875, -37.5556640625, -36.452392578125, -35.34912109375, -34.245849609375, -33.142578125, -32.039306640625, -30.93603515625, -29.832763671875, -28.7294921875, -27.626220703125, -26.52294921875, -25.419677734375, -24.31640625, -23.213134765625, -22.10986328125, -21.006591796875, -19.9033203125, -18.800048828125, -17.69677734375, -16.593505859375, -15.490234375, -14.386962890625, -13.28369140625, -12.180419921875, -11.0771484375, -9.973876953125, -8.87060546875, -7.767333984375, -6.6640625, -5.560791015625, -4.45751953125, -3.354248046875, -2.2509765625, -1.147705078125, -0.04443359375, 1.058837890625, 2.162109375, 3.265380859375, 4.36865234375, 5.471923828125, 6.5751953125, 7.678466796875, 8.78173828125, 9.885009765625, 10.98828125, 12.091552734375, 13.19482421875, 14.298095703125, 15.4013671875, 16.504638671875, 17.60791015625, 18.711181640625, 19.814453125, 20.917724609375, 22.02099609375, 23.124267578125, 24.2275390625, 25.330810546875, 26.43408203125, 27.537353515625, 28.640625]}, "gradients/decoder.transformer.h.5.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 12.0, 666.0, 336.0, 5.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-29.333749771118164, -21.752384185791016, -14.1710205078125, -6.589654922485352, 0.9917087554931641, 8.57307243347168, 16.15443992614746, 23.735803604125977, 31.317167282104492, 38.89853286743164, 46.479896545410156, 54.06126403808594, 61.64262390136719, 69.22399139404297, 76.80535888671875, 84.38671875, 91.96808624267578, 99.54945373535156, 107.13081359863281, 114.7121810913086, 122.29354858398438, 129.87490844726562, 137.45626831054688, 145.0376434326172, 152.61900329589844, 160.2003631591797, 167.78173828125, 175.36309814453125, 182.9444580078125, 190.52581787109375, 198.10719299316406, 205.6885528564453, 213.26992797851562, 220.85128784179688, 228.4326629638672, 236.01402282714844, 243.5953826904297, 251.1767578125, 258.75811767578125, 266.3394775390625, 273.92083740234375, 281.502197265625, 289.08355712890625, 296.6649169921875, 304.2463073730469, 311.8276672363281, 319.4090270996094, 326.9903869628906, 334.57177734375, 342.15313720703125, 349.7344970703125, 357.31585693359375, 364.8972473144531, 372.4786071777344, 380.0599670410156, 387.6413269042969, 395.2226867675781, 402.8040466308594, 410.3854064941406, 417.966796875, 425.54815673828125, 433.1295166015625, 440.71087646484375, 448.292236328125, 455.87359619140625]}, "gradients/decoder.transformer.h.5.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 3.0, 1.0, 10.0, 5.0, 4.0, 8.0, 6.0, 10.0, 12.0, 10.0, 18.0, 16.0, 14.0, 24.0, 28.0, 22.0, 25.0, 23.0, 38.0, 33.0, 33.0, 34.0, 33.0, 38.0, 36.0, 45.0, 51.0, 41.0, 37.0, 28.0, 41.0, 40.0, 28.0, 29.0, 25.0, 26.0, 16.0, 19.0, 11.0, 21.0, 10.0, 14.0, 10.0, 6.0, 8.0, 5.0, 5.0, 4.0, 3.0, 5.0, 2.0, 3.0], "bins": [-70.1112060546875, -68.2155532836914, -66.31989288330078, -64.42424011230469, -62.528587341308594, -60.632930755615234, -58.737274169921875, -56.84162139892578, -54.94596481323242, -53.05030822753906, -51.15465545654297, -49.25899887084961, -47.36334228515625, -45.467689514160156, -43.5720329284668, -41.67637634277344, -39.780723571777344, -37.885066986083984, -35.98941421508789, -34.09375762939453, -32.19810485839844, -30.302448272705078, -28.40679168701172, -26.511137008666992, -24.615482330322266, -22.71982765197754, -20.824172973632812, -18.928516387939453, -17.032861709594727, -15.13720703125, -13.241551399230957, -11.345895767211914, -9.450241088867188, -7.554585933685303, -5.658930778503418, -3.763275623321533, -1.8676204681396484, 0.028034210205078125, 1.923689842224121, 3.819345474243164, 5.715000152587891, 7.610655307769775, 9.50631046295166, 11.401966094970703, 13.29762077331543, 15.193275451660156, 17.088932037353516, 18.984586715698242, 20.88024139404297, 22.775896072387695, 24.671550750732422, 26.56720733642578, 28.462862014770508, 30.358516693115234, 32.254173278808594, 34.14982604980469, 36.04548263549805, 37.941139221191406, 39.8367919921875, 41.73244857788086, 43.62810516357422, 45.52375793457031, 47.41941452026367, 49.31507110595703, 51.210723876953125]}, "gradients/decoder.transformer.h.4.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 4.0, 0.0, 0.0, 0.0, 1.0, 3.0, 0.0, 1.0, 6.0, 3.0, 3.0, 4.0, 8.0, 7.0, 5.0, 11.0, 14.0, 18.0, 24.0, 19.0, 24.0, 25.0, 31.0, 21.0, 39.0, 37.0, 36.0, 44.0, 29.0, 60.0, 42.0, 43.0, 44.0, 40.0, 34.0, 37.0, 36.0, 41.0, 31.0, 27.0, 26.0, 19.0, 21.0, 21.0, 17.0, 14.0, 15.0, 7.0, 11.0, 5.0, 6.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0], "bins": [-11.1953125, -10.8909912109375, -10.586669921875, -10.2823486328125, -9.97802734375, -9.6737060546875, -9.369384765625, -9.0650634765625, -8.7607421875, -8.4564208984375, -8.152099609375, -7.8477783203125, -7.54345703125, -7.2391357421875, -6.934814453125, -6.6304931640625, -6.326171875, -6.0218505859375, -5.717529296875, -5.4132080078125, -5.10888671875, -4.8045654296875, -4.500244140625, -4.1959228515625, -3.8916015625, -3.5872802734375, -3.282958984375, -2.9786376953125, -2.67431640625, -2.3699951171875, -2.065673828125, -1.7613525390625, -1.45703125, -1.1527099609375, -0.848388671875, -0.5440673828125, -0.23974609375, 0.0645751953125, 0.368896484375, 0.6732177734375, 0.9775390625, 1.2818603515625, 1.586181640625, 1.8905029296875, 2.19482421875, 2.4991455078125, 2.803466796875, 3.1077880859375, 3.412109375, 3.7164306640625, 4.020751953125, 4.3250732421875, 4.62939453125, 4.9337158203125, 5.238037109375, 5.5423583984375, 5.8466796875, 6.1510009765625, 6.455322265625, 6.7596435546875, 7.06396484375, 7.3682861328125, 7.672607421875, 7.9769287109375, 8.28125]}, "gradients/decoder.transformer.h.4.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 2.0, 4.0, 6.0, 0.0, 0.0, 0.0, 5.0, 1.0, 3.0, 5.0, 5.0, 12.0, 7.0, 9.0, 14.0, 16.0, 28.0, 27.0, 35.0, 38.0, 60.0, 96.0, 173.0, 465.0, 1255.0, 6177.0, 224604.0, 3811362.0, 142767.0, 5147.0, 1067.0, 342.0, 202.0, 96.0, 45.0, 49.0, 28.0, 24.0, 25.0, 19.0, 16.0, 17.0, 8.0, 11.0, 5.0, 7.0, 4.0, 0.0, 2.0, 6.0, 2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-38.59375, -37.36669921875, -36.1396484375, -34.91259765625, -33.685546875, -32.45849609375, -31.2314453125, -30.00439453125, -28.77734375, -27.55029296875, -26.3232421875, -25.09619140625, -23.869140625, -22.64208984375, -21.4150390625, -20.18798828125, -18.9609375, -17.73388671875, -16.5068359375, -15.27978515625, -14.052734375, -12.82568359375, -11.5986328125, -10.37158203125, -9.14453125, -7.91748046875, -6.6904296875, -5.46337890625, -4.236328125, -3.00927734375, -1.7822265625, -0.55517578125, 0.671875, 1.89892578125, 3.1259765625, 4.35302734375, 5.580078125, 6.80712890625, 8.0341796875, 9.26123046875, 10.48828125, 11.71533203125, 12.9423828125, 14.16943359375, 15.396484375, 16.62353515625, 17.8505859375, 19.07763671875, 20.3046875, 21.53173828125, 22.7587890625, 23.98583984375, 25.212890625, 26.43994140625, 27.6669921875, 28.89404296875, 30.12109375, 31.34814453125, 32.5751953125, 33.80224609375, 35.029296875, 36.25634765625, 37.4833984375, 38.71044921875, 39.9375]}, "gradients/decoder.transformer.h.4.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 5.0, 7.0, 5.0, 10.0, 13.0, 21.0, 26.0, 55.0, 75.0, 111.0, 214.0, 430.0, 659.0, 744.0, 687.0, 386.0, 233.0, 147.0, 85.0, 57.0, 32.0, 21.0, 22.0, 10.0, 10.0, 2.0, 6.0, 5.0, 3.0, 1.0, 0.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-17.46875, -16.876708984375, -16.28466796875, -15.692626953125, -15.1005859375, -14.508544921875, -13.91650390625, -13.324462890625, -12.732421875, -12.140380859375, -11.54833984375, -10.956298828125, -10.3642578125, -9.772216796875, -9.18017578125, -8.588134765625, -7.99609375, -7.404052734375, -6.81201171875, -6.219970703125, -5.6279296875, -5.035888671875, -4.44384765625, -3.851806640625, -3.259765625, -2.667724609375, -2.07568359375, -1.483642578125, -0.8916015625, -0.299560546875, 0.29248046875, 0.884521484375, 1.4765625, 2.068603515625, 2.66064453125, 3.252685546875, 3.8447265625, 4.436767578125, 5.02880859375, 5.620849609375, 6.212890625, 6.804931640625, 7.39697265625, 7.989013671875, 8.5810546875, 9.173095703125, 9.76513671875, 10.357177734375, 10.94921875, 11.541259765625, 12.13330078125, 12.725341796875, 13.3173828125, 13.909423828125, 14.50146484375, 15.093505859375, 15.685546875, 16.277587890625, 16.86962890625, 17.461669921875, 18.0537109375, 18.645751953125, 19.23779296875, 19.829833984375, 20.421875]}, "gradients/decoder.transformer.h.4.mlp.c_fc.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 5.0, 6.0, 9.0, 6.0, 6.0, 4.0, 3.0, 15.0, 7.0, 7.0, 27.0, 32.0, 48.0, 48.0, 83.0, 138.0, 260.0, 615.0, 1459.0, 5736.0, 52516.0, 3052603.0, 1046531.0, 28006.0, 3905.0, 1149.0, 471.0, 223.0, 125.0, 77.0, 48.0, 27.0, 22.0, 15.0, 10.0, 2.0, 13.0, 12.0, 7.0, 8.0, 1.0, 2.0, 0.0, 2.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-41.9375, -40.7685546875, -39.599609375, -38.4306640625, -37.26171875, -36.0927734375, -34.923828125, -33.7548828125, -32.5859375, -31.4169921875, -30.248046875, -29.0791015625, -27.91015625, -26.7412109375, -25.572265625, -24.4033203125, -23.234375, -22.0654296875, -20.896484375, -19.7275390625, -18.55859375, -17.3896484375, -16.220703125, -15.0517578125, -13.8828125, -12.7138671875, -11.544921875, -10.3759765625, -9.20703125, -8.0380859375, -6.869140625, -5.7001953125, -4.53125, -3.3623046875, -2.193359375, -1.0244140625, 0.14453125, 1.3134765625, 2.482421875, 3.6513671875, 4.8203125, 5.9892578125, 7.158203125, 8.3271484375, 9.49609375, 10.6650390625, 11.833984375, 13.0029296875, 14.171875, 15.3408203125, 16.509765625, 17.6787109375, 18.84765625, 20.0166015625, 21.185546875, 22.3544921875, 23.5234375, 24.6923828125, 25.861328125, 27.0302734375, 28.19921875, 29.3681640625, 30.537109375, 31.7060546875, 32.875]}, "gradients/decoder.transformer.h.4.ln_2.weight": {"_type": "histogram", "values": [2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 5.0, 15.0, 47.0, 98.0, 262.0, 257.0, 190.0, 101.0, 32.0, 8.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-58.606117248535156, -55.20945739746094, -51.81279754638672, -48.4161376953125, -45.01947784423828, -41.62281799316406, -38.22616195678711, -34.82950210571289, -31.432842254638672, -28.036182403564453, -24.639522552490234, -21.24286460876465, -17.84620475769043, -14.449544906616211, -11.052886962890625, -7.656227111816406, -4.2595672607421875, -0.862907886505127, 2.5337514877319336, 5.930410385131836, 9.327070236206055, 12.723730087280273, 16.12038803100586, 19.517047882080078, 22.913707733154297, 26.310367584228516, 29.707027435302734, 33.10368347167969, 36.500343322753906, 39.897003173828125, 43.293663024902344, 46.69032287597656, 50.08697509765625, 53.48363494873047, 56.88029479980469, 60.276954650878906, 63.673614501953125, 67.07027435302734, 70.46693420410156, 73.86358642578125, 77.26025390625, 80.65691375732422, 84.05357360839844, 87.45023345947266, 90.84689331054688, 94.2435531616211, 97.64021301269531, 101.036865234375, 104.43352508544922, 107.83018493652344, 111.22684478759766, 114.62350463867188, 118.0201644897461, 121.41682434082031, 124.8134765625, 128.21014404296875, 131.60679626464844, 135.00344848632812, 138.40011596679688, 141.79676818847656, 145.1934356689453, 148.590087890625, 151.98675537109375, 155.38340759277344, 158.7800750732422]}, "gradients/decoder.transformer.h.4.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 3.0, 1.0, 2.0, 4.0, 3.0, 4.0, 8.0, 14.0, 9.0, 11.0, 18.0, 19.0, 21.0, 24.0, 29.0, 27.0, 28.0, 30.0, 37.0, 34.0, 36.0, 29.0, 41.0, 39.0, 42.0, 47.0, 42.0, 24.0, 42.0, 33.0, 38.0, 30.0, 35.0, 46.0, 19.0, 21.0, 20.0, 18.0, 20.0, 11.0, 14.0, 9.0, 6.0, 4.0, 7.0, 2.0, 4.0, 5.0, 5.0, 0.0, 3.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-38.5040168762207, -37.20600509643555, -35.907989501953125, -34.60997772216797, -33.31196594238281, -32.01395034790039, -30.715938568115234, -29.417924880981445, -28.119911193847656, -26.821897506713867, -25.523883819580078, -24.225872039794922, -22.927858352661133, -21.629844665527344, -20.331832885742188, -19.0338191986084, -17.73580551147461, -16.43779182434082, -15.139779090881348, -13.841766357421875, -12.543752670288086, -11.245738983154297, -9.947726249694824, -8.649713516235352, -7.3516998291015625, -6.053686618804932, -4.755673408508301, -3.45766019821167, -2.159646987915039, -0.8616337776184082, 0.43637943267822266, 1.7343921661376953, 3.0324020385742188, 4.33041524887085, 5.6284284591674805, 6.926441669464111, 8.224454879760742, 9.522468566894531, 10.820481300354004, 12.118494033813477, 13.416507720947266, 14.714521408081055, 16.012535095214844, 17.310546875, 18.60856056213379, 19.906574249267578, 21.204586029052734, 22.502599716186523, 23.800613403320312, 25.0986270904541, 26.39664077758789, 27.694652557373047, 28.992666244506836, 30.290679931640625, 31.58869171142578, 32.88670349121094, 34.18471908569336, 35.482730865478516, 36.78074645996094, 38.078758239746094, 39.37677001953125, 40.67478561401367, 41.97279739379883, 43.27081298828125, 44.568824768066406]}, "gradients/decoder.transformer.h.4.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 2.0, 0.0, 6.0, 6.0, 5.0, 5.0, 8.0, 16.0, 9.0, 17.0, 8.0, 14.0, 17.0, 25.0, 27.0, 23.0, 30.0, 37.0, 37.0, 31.0, 31.0, 45.0, 40.0, 42.0, 27.0, 50.0, 48.0, 45.0, 39.0, 38.0, 30.0, 40.0, 28.0, 31.0, 24.0, 23.0, 25.0, 13.0, 9.0, 12.0, 11.0, 13.0, 10.0, 2.0, 4.0, 3.0, 2.0, 3.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-9.8359375, -9.5316162109375, -9.227294921875, -8.9229736328125, -8.61865234375, -8.3143310546875, -8.010009765625, -7.7056884765625, -7.4013671875, -7.0970458984375, -6.792724609375, -6.4884033203125, -6.18408203125, -5.8797607421875, -5.575439453125, -5.2711181640625, -4.966796875, -4.6624755859375, -4.358154296875, -4.0538330078125, -3.74951171875, -3.4451904296875, -3.140869140625, -2.8365478515625, -2.5322265625, -2.2279052734375, -1.923583984375, -1.6192626953125, -1.31494140625, -1.0106201171875, -0.706298828125, -0.4019775390625, -0.09765625, 0.2066650390625, 0.510986328125, 0.8153076171875, 1.11962890625, 1.4239501953125, 1.728271484375, 2.0325927734375, 2.3369140625, 2.6412353515625, 2.945556640625, 3.2498779296875, 3.55419921875, 3.8585205078125, 4.162841796875, 4.4671630859375, 4.771484375, 5.0758056640625, 5.380126953125, 5.6844482421875, 5.98876953125, 6.2930908203125, 6.597412109375, 6.9017333984375, 7.2060546875, 7.5103759765625, 7.814697265625, 8.1190185546875, 8.42333984375, 8.7276611328125, 9.031982421875, 9.3363037109375, 9.640625]}, "gradients/decoder.transformer.h.4.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 4.0, 9.0, 12.0, 12.0, 19.0, 35.0, 39.0, 50.0, 99.0, 152.0, 178.0, 299.0, 477.0, 667.0, 1016.0, 1445.0, 2150.0, 3108.0, 4860.0, 7359.0, 10811.0, 16930.0, 26660.0, 44070.0, 77035.0, 150086.0, 330953.0, 161908.0, 81482.0, 46456.0, 28378.0, 17676.0, 11327.0, 7493.0, 5087.0, 3214.0, 2271.0, 1494.0, 1054.0, 708.0, 514.0, 309.0, 210.0, 148.0, 94.0, 76.0, 41.0, 32.0, 21.0, 14.0, 9.0, 12.0, 5.0, 2.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-2.326171875, -2.25244140625, -2.1787109375, -2.10498046875, -2.03125, -1.95751953125, -1.8837890625, -1.81005859375, -1.736328125, -1.66259765625, -1.5888671875, -1.51513671875, -1.44140625, -1.36767578125, -1.2939453125, -1.22021484375, -1.146484375, -1.07275390625, -0.9990234375, -0.92529296875, -0.8515625, -0.77783203125, -0.7041015625, -0.63037109375, -0.556640625, -0.48291015625, -0.4091796875, -0.33544921875, -0.26171875, -0.18798828125, -0.1142578125, -0.04052734375, 0.033203125, 0.10693359375, 0.1806640625, 0.25439453125, 0.328125, 0.40185546875, 0.4755859375, 0.54931640625, 0.623046875, 0.69677734375, 0.7705078125, 0.84423828125, 0.91796875, 0.99169921875, 1.0654296875, 1.13916015625, 1.212890625, 1.28662109375, 1.3603515625, 1.43408203125, 1.5078125, 1.58154296875, 1.6552734375, 1.72900390625, 1.802734375, 1.87646484375, 1.9501953125, 2.02392578125, 2.09765625, 2.17138671875, 2.2451171875, 2.31884765625, 2.392578125]}, "gradients/decoder.transformer.h.4.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 5.0, 2.0, 4.0, 6.0, 1.0, 4.0, 5.0, 6.0, 5.0, 11.0, 17.0, 16.0, 22.0, 31.0, 18.0, 30.0, 23.0, 24.0, 27.0, 42.0, 39.0, 47.0, 28.0, 32.0, 37.0, 1070.0, 37.0, 43.0, 39.0, 36.0, 27.0, 36.0, 33.0, 37.0, 25.0, 29.0, 27.0, 16.0, 14.0, 17.0, 19.0, 11.0, 11.0, 6.0, 6.0, 6.0, 5.0, 4.0, 3.0, 2.0, 2.0, 0.0, 1.0, 0.0, 1.0], "bins": [-6.609375, -6.41741943359375, -6.2254638671875, -6.03350830078125, -5.841552734375, -5.64959716796875, -5.4576416015625, -5.26568603515625, -5.07373046875, -4.88177490234375, -4.6898193359375, -4.49786376953125, -4.305908203125, -4.11395263671875, -3.9219970703125, -3.73004150390625, -3.5380859375, -3.34613037109375, -3.1541748046875, -2.96221923828125, -2.770263671875, -2.57830810546875, -2.3863525390625, -2.19439697265625, -2.00244140625, -1.81048583984375, -1.6185302734375, -1.42657470703125, -1.234619140625, -1.04266357421875, -0.8507080078125, -0.65875244140625, -0.466796875, -0.27484130859375, -0.0828857421875, 0.10906982421875, 0.301025390625, 0.49298095703125, 0.6849365234375, 0.87689208984375, 1.06884765625, 1.26080322265625, 1.4527587890625, 1.64471435546875, 1.836669921875, 2.02862548828125, 2.2205810546875, 2.41253662109375, 2.6044921875, 2.79644775390625, 2.9884033203125, 3.18035888671875, 3.372314453125, 3.56427001953125, 3.7562255859375, 3.94818115234375, 4.14013671875, 4.33209228515625, 4.5240478515625, 4.71600341796875, 4.907958984375, 5.09991455078125, 5.2918701171875, 5.48382568359375, 5.67578125]}, "gradients/decoder.transformer.h.4.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 2.0, 3.0, 3.0, 4.0, 6.0, 6.0, 10.0, 10.0, 21.0, 34.0, 44.0, 53.0, 113.0, 163.0, 292.0, 488.0, 827.0, 1548.0, 2739.0, 4918.0, 8893.0, 17053.0, 32994.0, 66359.0, 141909.0, 1452355.0, 193493.0, 85160.0, 41427.0, 21302.0, 11304.0, 5983.0, 3254.0, 1849.0, 1025.0, 612.0, 331.0, 228.0, 120.0, 65.0, 50.0, 36.0, 16.0, 10.0, 9.0, 4.0, 3.0, 6.0, 6.0, 2.0, 5.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.12109375, -3.015899658203125, -2.91070556640625, -2.805511474609375, -2.7003173828125, -2.595123291015625, -2.48992919921875, -2.384735107421875, -2.279541015625, -2.174346923828125, -2.06915283203125, -1.963958740234375, -1.8587646484375, -1.753570556640625, -1.64837646484375, -1.543182373046875, -1.43798828125, -1.332794189453125, -1.22760009765625, -1.122406005859375, -1.0172119140625, -0.912017822265625, -0.80682373046875, -0.701629638671875, -0.596435546875, -0.491241455078125, -0.38604736328125, -0.280853271484375, -0.1756591796875, -0.070465087890625, 0.03472900390625, 0.139923095703125, 0.2451171875, 0.350311279296875, 0.45550537109375, 0.560699462890625, 0.6658935546875, 0.771087646484375, 0.87628173828125, 0.981475830078125, 1.086669921875, 1.191864013671875, 1.29705810546875, 1.402252197265625, 1.5074462890625, 1.612640380859375, 1.71783447265625, 1.823028564453125, 1.92822265625, 2.033416748046875, 2.13861083984375, 2.243804931640625, 2.3489990234375, 2.454193115234375, 2.55938720703125, 2.664581298828125, 2.769775390625, 2.874969482421875, 2.98016357421875, 3.085357666015625, 3.1905517578125, 3.295745849609375, 3.40093994140625, 3.506134033203125, 3.611328125]}, "gradients/decoder.transformer.h.4.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 8.0, 2.0, 7.0, 4.0, 6.0, 8.0, 6.0, 10.0, 15.0, 17.0, 21.0, 25.0, 38.0, 37.0, 45.0, 63.0, 71.0, 75.0, 76.0, 88.0, 63.0, 58.0, 51.0, 47.0, 32.0, 30.0, 22.0, 20.0, 12.0, 14.0, 9.0, 7.0, 9.0, 3.0, 4.0, 1.0, 2.0, 0.0, 3.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0013780593872070312, -0.0013279765844345093, -0.0012778937816619873, -0.0012278109788894653, -0.0011777281761169434, -0.0011276453733444214, -0.0010775625705718994, -0.0010274797677993774, -0.0009773969650268555, -0.0009273141622543335, -0.0008772313594818115, -0.0008271485567092896, -0.0007770657539367676, -0.0007269829511642456, -0.0006769001483917236, -0.0006268173456192017, -0.0005767345428466797, -0.0005266517400741577, -0.00047656893730163574, -0.00042648613452911377, -0.0003764033317565918, -0.0003263205289840698, -0.00027623772621154785, -0.00022615492343902588, -0.0001760721206665039, -0.00012598931789398193, -7.590651512145996e-05, -2.5823712348937988e-05, 2.4259090423583984e-05, 7.434189319610596e-05, 0.00012442469596862793, 0.0001745074987411499, 0.00022459030151367188, 0.00027467310428619385, 0.0003247559070587158, 0.0003748387098312378, 0.00042492151260375977, 0.00047500431537628174, 0.0005250871181488037, 0.0005751699209213257, 0.0006252527236938477, 0.0006753355264663696, 0.0007254183292388916, 0.0007755011320114136, 0.0008255839347839355, 0.0008756667375564575, 0.0009257495403289795, 0.0009758323431015015, 0.0010259151458740234, 0.0010759979486465454, 0.0011260807514190674, 0.0011761635541915894, 0.0012262463569641113, 0.0012763291597366333, 0.0013264119625091553, 0.0013764947652816772, 0.0014265775680541992, 0.0014766603708267212, 0.0015267431735992432, 0.0015768259763717651, 0.0016269087791442871, 0.001676991581916809, 0.001727074384689331, 0.001777157187461853, 0.001827239990234375]}, "gradients/decoder.transformer.h.4.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 4.0, 3.0, 1.0, 2.0, 2.0, 6.0, 4.0, 10.0, 5.0, 10.0, 13.0, 20.0, 21.0, 38.0, 32.0, 53.0, 75.0, 96.0, 152.0, 231.0, 481.0, 2133.0, 992821.0, 50492.0, 877.0, 363.0, 177.0, 126.0, 76.0, 57.0, 40.0, 34.0, 18.0, 15.0, 17.0, 15.0, 7.0, 8.0, 5.0, 5.0, 4.0, 9.0, 2.0, 3.0, 2.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.0367431640625, -0.03571605682373047, -0.03468894958496094, -0.033661842346191406, -0.032634735107421875, -0.031607627868652344, -0.030580520629882812, -0.02955341339111328, -0.02852630615234375, -0.02749919891357422, -0.026472091674804688, -0.025444984436035156, -0.024417877197265625, -0.023390769958496094, -0.022363662719726562, -0.02133655548095703, -0.0203094482421875, -0.01928234100341797, -0.018255233764648438, -0.017228126525878906, -0.016201019287109375, -0.015173912048339844, -0.014146804809570312, -0.013119697570800781, -0.01209259033203125, -0.011065483093261719, -0.010038375854492188, -0.009011268615722656, -0.007984161376953125, -0.006957054138183594, -0.0059299468994140625, -0.004902839660644531, -0.003875732421875, -0.0028486251831054688, -0.0018215179443359375, -0.0007944107055664062, 0.000232696533203125, 0.0012598037719726562, 0.0022869110107421875, 0.0033140182495117188, 0.00434112548828125, 0.005368232727050781, 0.0063953399658203125, 0.007422447204589844, 0.008449554443359375, 0.009476661682128906, 0.010503768920898438, 0.011530876159667969, 0.0125579833984375, 0.013585090637207031, 0.014612197875976562, 0.015639305114746094, 0.016666412353515625, 0.017693519592285156, 0.018720626831054688, 0.01974773406982422, 0.02077484130859375, 0.02180194854736328, 0.022829055786132812, 0.023856163024902344, 0.024883270263671875, 0.025910377502441406, 0.026937484741210938, 0.02796459197998047, 0.02899169921875]}, "gradients/decoder.transformer.h.4.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 18.0, 142.0, 588.0, 229.0, 36.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0024743997491896152, -0.002341821091249585, -0.0022092426661401987, -0.0020766640082001686, -0.0019440855830907822, -0.001811506925150752, -0.0016789283836260438, -0.0015463498421013355, -0.0014137713005766273, -0.001281192759051919, -0.0011486142175272107, -0.0010160356760025024, -0.0008834570762701333, -0.000750878534745425, -0.0006182999350130558, -0.00048572139348834753, -0.00035314285196363926, -0.00022056429588701576, -8.798573981039226e-05, 4.459283081814647e-05, 0.00017717137234285474, 0.000309749913867563, 0.0004423285135999322, 0.0005749070551246405, 0.0007074855966493487, 0.000840064138174057, 0.0009726426796987653, 0.0011052212212234735, 0.0012377998791635036, 0.00137037830427289, 0.0015029569622129202, 0.0016355355037376285, 0.0017681140452623367, 0.001900692586787045, 0.0020332711283117533, 0.0021658497862517834, 0.00229842821136117, 0.0024310068693012, 0.00256358552724123, 0.0026961639523506165, 0.002828742377460003, 0.002961321035400033, 0.0030938994605094194, 0.0032264781184494495, 0.003359056543558836, 0.003491635201498866, 0.003624213859438896, 0.0037567922845482826, 0.0038893709424883127, 0.004021949600428343, 0.004154528025537729, 0.004287106450647116, 0.0044196853414177895, 0.004552263766527176, 0.004684842191636562, 0.004817420616745949, 0.0049499995075166225, 0.005082577932626009, 0.005215156823396683, 0.005347735248506069, 0.005480313673615456, 0.005612892098724842, 0.005745470989495516, 0.005878049414604902, 0.006010627839714289]}, "gradients/decoder.transformer.h.4.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 2.0, 2.0, 3.0, 1.0, 3.0, 2.0, 3.0, 5.0, 4.0, 1.0, 4.0, 5.0, 7.0, 9.0, 15.0, 19.0, 15.0, 19.0, 22.0, 23.0, 31.0, 28.0, 37.0, 42.0, 37.0, 39.0, 36.0, 37.0, 51.0, 36.0, 40.0, 34.0, 34.0, 36.0, 36.0, 31.0, 43.0, 34.0, 29.0, 26.0, 17.0, 19.0, 20.0, 16.0, 14.0, 10.0, 7.0, 8.0, 3.0, 9.0, 5.0, 2.0, 2.0, 3.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0007639527320861816, -0.0007404424250125885, -0.0007169321179389954, -0.0006934218108654022, -0.0006699115037918091, -0.0006464011967182159, -0.0006228908896446228, -0.0005993805825710297, -0.0005758702754974365, -0.0005523599684238434, -0.0005288496613502502, -0.0005053393542766571, -0.00048182904720306396, -0.0004583187401294708, -0.0004348084330558777, -0.00041129812598228455, -0.0003877878189086914, -0.00036427751183509827, -0.0003407672047615051, -0.000317256897687912, -0.00029374659061431885, -0.0002702362835407257, -0.00024672597646713257, -0.00022321566939353943, -0.0001997053623199463, -0.00017619505524635315, -0.00015268474817276, -0.00012917444109916687, -0.00010566413402557373, -8.215382695198059e-05, -5.864351987838745e-05, -3.513321280479431e-05, -1.1622905731201172e-05, 1.1887401342391968e-05, 3.539770841598511e-05, 5.890801548957825e-05, 8.241832256317139e-05, 0.00010592862963676453, 0.00012943893671035767, 0.0001529492437839508, 0.00017645955085754395, 0.00019996985793113708, 0.00022348016500473022, 0.00024699047207832336, 0.0002705007791519165, 0.00029401108622550964, 0.0003175213932991028, 0.0003410317003726959, 0.00036454200744628906, 0.0003880523145198822, 0.00041156262159347534, 0.0004350729286670685, 0.0004585832357406616, 0.00048209354281425476, 0.0005056038498878479, 0.000529114156961441, 0.0005526244640350342, 0.0005761347711086273, 0.0005996450781822205, 0.0006231553852558136, 0.0006466656923294067, 0.0006701759994029999, 0.000693686306476593, 0.0007171966135501862, 0.0007407069206237793]}, "gradients/decoder.transformer.h.4.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 2.0, 0.0, 6.0, 6.0, 5.0, 5.0, 8.0, 16.0, 9.0, 17.0, 8.0, 14.0, 17.0, 25.0, 27.0, 23.0, 30.0, 37.0, 37.0, 31.0, 31.0, 45.0, 40.0, 42.0, 27.0, 50.0, 48.0, 45.0, 39.0, 38.0, 30.0, 40.0, 28.0, 31.0, 24.0, 23.0, 25.0, 13.0, 9.0, 12.0, 11.0, 13.0, 10.0, 2.0, 4.0, 3.0, 2.0, 3.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-9.8359375, -9.5316162109375, -9.227294921875, -8.9229736328125, -8.61865234375, -8.3143310546875, -8.010009765625, -7.7056884765625, -7.4013671875, -7.0970458984375, -6.792724609375, -6.4884033203125, -6.18408203125, -5.8797607421875, -5.575439453125, -5.2711181640625, -4.966796875, -4.6624755859375, -4.358154296875, -4.0538330078125, -3.74951171875, -3.4451904296875, -3.140869140625, -2.8365478515625, -2.5322265625, -2.2279052734375, -1.923583984375, -1.6192626953125, -1.31494140625, -1.0106201171875, -0.706298828125, -0.4019775390625, -0.09765625, 0.2066650390625, 0.510986328125, 0.8153076171875, 1.11962890625, 1.4239501953125, 1.728271484375, 2.0325927734375, 2.3369140625, 2.6412353515625, 2.945556640625, 3.2498779296875, 3.55419921875, 3.8585205078125, 4.162841796875, 4.4671630859375, 4.771484375, 5.0758056640625, 5.380126953125, 5.6844482421875, 5.98876953125, 6.2930908203125, 6.597412109375, 6.9017333984375, 7.2060546875, 7.5103759765625, 7.814697265625, 8.1190185546875, 8.42333984375, 8.7276611328125, 9.031982421875, 9.3363037109375, 9.640625]}, "gradients/decoder.transformer.h.4.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 4.0, 1.0, 1.0, 3.0, 7.0, 8.0, 8.0, 11.0, 17.0, 28.0, 23.0, 55.0, 58.0, 76.0, 102.0, 140.0, 193.0, 232.0, 378.0, 521.0, 822.0, 1397.0, 2672.0, 5911.0, 16024.0, 54415.0, 204414.0, 457085.0, 216044.0, 57868.0, 16821.0, 6303.0, 2685.0, 1515.0, 827.0, 506.0, 354.0, 271.0, 210.0, 124.0, 119.0, 79.0, 68.0, 51.0, 35.0, 21.0, 14.0, 13.0, 10.0, 10.0, 8.0, 1.0, 1.0, 3.0, 3.0, 1.0, 0.0, 0.0, 3.0], "bins": [-10.203125, -9.889404296875, -9.57568359375, -9.261962890625, -8.9482421875, -8.634521484375, -8.32080078125, -8.007080078125, -7.693359375, -7.379638671875, -7.06591796875, -6.752197265625, -6.4384765625, -6.124755859375, -5.81103515625, -5.497314453125, -5.18359375, -4.869873046875, -4.55615234375, -4.242431640625, -3.9287109375, -3.614990234375, -3.30126953125, -2.987548828125, -2.673828125, -2.360107421875, -2.04638671875, -1.732666015625, -1.4189453125, -1.105224609375, -0.79150390625, -0.477783203125, -0.1640625, 0.149658203125, 0.46337890625, 0.777099609375, 1.0908203125, 1.404541015625, 1.71826171875, 2.031982421875, 2.345703125, 2.659423828125, 2.97314453125, 3.286865234375, 3.6005859375, 3.914306640625, 4.22802734375, 4.541748046875, 4.85546875, 5.169189453125, 5.48291015625, 5.796630859375, 6.1103515625, 6.424072265625, 6.73779296875, 7.051513671875, 7.365234375, 7.678955078125, 7.99267578125, 8.306396484375, 8.6201171875, 8.933837890625, 9.24755859375, 9.561279296875, 9.875]}, "gradients/decoder.transformer.h.4.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 2.0, 0.0, 2.0, 2.0, 0.0, 2.0, 5.0, 3.0, 2.0, 5.0, 4.0, 5.0, 9.0, 3.0, 8.0, 10.0, 19.0, 19.0, 26.0, 29.0, 35.0, 48.0, 35.0, 56.0, 61.0, 60.0, 154.0, 1612.0, 331.0, 104.0, 70.0, 53.0, 52.0, 36.0, 41.0, 25.0, 17.0, 26.0, 14.0, 18.0, 17.0, 13.0, 8.0, 7.0, 3.0, 3.0, 2.0, 2.0, 5.0, 3.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-29.765625, -28.798583984375, -27.83154296875, -26.864501953125, -25.8974609375, -24.930419921875, -23.96337890625, -22.996337890625, -22.029296875, -21.062255859375, -20.09521484375, -19.128173828125, -18.1611328125, -17.194091796875, -16.22705078125, -15.260009765625, -14.29296875, -13.325927734375, -12.35888671875, -11.391845703125, -10.4248046875, -9.457763671875, -8.49072265625, -7.523681640625, -6.556640625, -5.589599609375, -4.62255859375, -3.655517578125, -2.6884765625, -1.721435546875, -0.75439453125, 0.212646484375, 1.1796875, 2.146728515625, 3.11376953125, 4.080810546875, 5.0478515625, 6.014892578125, 6.98193359375, 7.948974609375, 8.916015625, 9.883056640625, 10.85009765625, 11.817138671875, 12.7841796875, 13.751220703125, 14.71826171875, 15.685302734375, 16.65234375, 17.619384765625, 18.58642578125, 19.553466796875, 20.5205078125, 21.487548828125, 22.45458984375, 23.421630859375, 24.388671875, 25.355712890625, 26.32275390625, 27.289794921875, 28.2568359375, 29.223876953125, 30.19091796875, 31.157958984375, 32.125]}, "gradients/decoder.transformer.h.4.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 7.0, 2.0, 4.0, 3.0, 2.0, 5.0, 14.0, 15.0, 20.0, 20.0, 37.0, 48.0, 48.0, 56.0, 82.0, 110.0, 163.0, 230.0, 364.0, 653.0, 2025.0, 64995.0, 3036044.0, 37373.0, 1604.0, 600.0, 322.0, 220.0, 152.0, 126.0, 87.0, 70.0, 41.0, 43.0, 32.0, 19.0, 11.0, 18.0, 12.0, 3.0, 11.0, 3.0, 3.0, 9.0, 3.0, 7.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-42.15625, -40.85693359375, -39.5576171875, -38.25830078125, -36.958984375, -35.65966796875, -34.3603515625, -33.06103515625, -31.76171875, -30.46240234375, -29.1630859375, -27.86376953125, -26.564453125, -25.26513671875, -23.9658203125, -22.66650390625, -21.3671875, -20.06787109375, -18.7685546875, -17.46923828125, -16.169921875, -14.87060546875, -13.5712890625, -12.27197265625, -10.97265625, -9.67333984375, -8.3740234375, -7.07470703125, -5.775390625, -4.47607421875, -3.1767578125, -1.87744140625, -0.578125, 0.72119140625, 2.0205078125, 3.31982421875, 4.619140625, 5.91845703125, 7.2177734375, 8.51708984375, 9.81640625, 11.11572265625, 12.4150390625, 13.71435546875, 15.013671875, 16.31298828125, 17.6123046875, 18.91162109375, 20.2109375, 21.51025390625, 22.8095703125, 24.10888671875, 25.408203125, 26.70751953125, 28.0068359375, 29.30615234375, 30.60546875, 31.90478515625, 33.2041015625, 34.50341796875, 35.802734375, 37.10205078125, 38.4013671875, 39.70068359375, 41.0]}, "gradients/decoder.transformer.h.4.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 8.0, 519.0, 487.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-365.8355712890625, -359.1766662597656, -352.5177307128906, -345.85882568359375, -339.19989013671875, -332.5409851074219, -325.882080078125, -319.22314453125, -312.5642395019531, -305.90533447265625, -299.24639892578125, -292.5874938964844, -285.9285583496094, -279.2696533203125, -272.6107482910156, -265.9518127441406, -259.29290771484375, -252.6339874267578, -245.97506713867188, -239.316162109375, -232.65724182128906, -225.99832153320312, -219.3394012451172, -212.68048095703125, -206.0215606689453, -199.36264038085938, -192.70372009277344, -186.04481506347656, -179.38589477539062, -172.7269744873047, -166.06805419921875, -159.40914916992188, -152.750244140625, -146.09132385253906, -139.43240356445312, -132.77349853515625, -126.11457824707031, -119.45565795898438, -112.79673767089844, -106.13782501220703, -99.47889709472656, -92.81997680664062, -86.16106414794922, -79.50214385986328, -72.84323120117188, -66.18431091308594, -59.525394439697266, -52.866477966308594, -46.20756530761719, -39.548648834228516, -32.889732360839844, -26.23081398010254, -19.571897506713867, -12.912979125976562, -6.254062652587891, 0.40485382080078125, 7.063770294189453, 13.722686767578125, 20.381603240966797, 27.0405216217041, 33.699440002441406, 40.35835647583008, 47.01727294921875, 53.67618942260742, 60.335105895996094]}, "gradients/decoder.transformer.h.4.ln_1.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 4.0, 7.0, 5.0, 5.0, 3.0, 4.0, 4.0, 7.0, 7.0, 14.0, 14.0, 19.0, 14.0, 21.0, 14.0, 26.0, 23.0, 16.0, 24.0, 16.0, 28.0, 34.0, 31.0, 38.0, 36.0, 38.0, 34.0, 43.0, 32.0, 36.0, 43.0, 38.0, 39.0, 28.0, 33.0, 28.0, 27.0, 21.0, 18.0, 13.0, 22.0, 18.0, 14.0, 10.0, 5.0, 14.0, 8.0, 9.0, 9.0, 6.0, 7.0, 3.0, 7.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-49.79507827758789, -48.12015151977539, -46.44522476196289, -44.770301818847656, -43.095375061035156, -41.420448303222656, -39.745521545410156, -38.070594787597656, -36.395668029785156, -34.720741271972656, -33.045814514160156, -31.37088966369629, -29.695964813232422, -28.021038055419922, -26.346111297607422, -24.671184539794922, -22.996261596679688, -21.321334838867188, -19.64640998840332, -17.97148323059082, -16.296558380126953, -14.621631622314453, -12.946704864501953, -11.27177906036377, -9.596853256225586, -7.921927452087402, -6.2470011711120605, -4.572074890136719, -2.897149085998535, -1.2222232818603516, 0.45270347595214844, 2.127629280090332, 3.8025588989257812, 5.477484703063965, 7.152410984039307, 8.827337265014648, 10.502263069152832, 12.177188873291016, 13.852115631103516, 15.5270414352417, 17.201967239379883, 18.876893997192383, 20.55181884765625, 22.22674560546875, 23.90167236328125, 25.576597213745117, 27.251523971557617, 28.926448822021484, 30.601375579833984, 32.276302337646484, 33.951229095458984, 35.62615203857422, 37.30107879638672, 38.97600555419922, 40.65093231201172, 42.32585906982422, 44.00078582763672, 45.67571258544922, 47.35063934326172, 49.02556610107422, 50.70048904418945, 52.37541580200195, 54.05034255981445, 55.72526931762695, 57.40019226074219]}, "gradients/decoder.transformer.h.3.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 2.0, 1.0, 2.0, 5.0, 4.0, 4.0, 1.0, 11.0, 17.0, 9.0, 5.0, 17.0, 18.0, 18.0, 26.0, 19.0, 28.0, 28.0, 28.0, 33.0, 33.0, 41.0, 34.0, 41.0, 48.0, 39.0, 39.0, 43.0, 30.0, 41.0, 40.0, 43.0, 36.0, 39.0, 23.0, 24.0, 26.0, 23.0, 18.0, 14.0, 9.0, 12.0, 11.0, 7.0, 6.0, 5.0, 4.0, 3.0, 2.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0], "bins": [-10.328125, -10.021240234375, -9.71435546875, -9.407470703125, -9.1005859375, -8.793701171875, -8.48681640625, -8.179931640625, -7.873046875, -7.566162109375, -7.25927734375, -6.952392578125, -6.6455078125, -6.338623046875, -6.03173828125, -5.724853515625, -5.41796875, -5.111083984375, -4.80419921875, -4.497314453125, -4.1904296875, -3.883544921875, -3.57666015625, -3.269775390625, -2.962890625, -2.656005859375, -2.34912109375, -2.042236328125, -1.7353515625, -1.428466796875, -1.12158203125, -0.814697265625, -0.5078125, -0.200927734375, 0.10595703125, 0.412841796875, 0.7197265625, 1.026611328125, 1.33349609375, 1.640380859375, 1.947265625, 2.254150390625, 2.56103515625, 2.867919921875, 3.1748046875, 3.481689453125, 3.78857421875, 4.095458984375, 4.40234375, 4.709228515625, 5.01611328125, 5.322998046875, 5.6298828125, 5.936767578125, 6.24365234375, 6.550537109375, 6.857421875, 7.164306640625, 7.47119140625, 7.778076171875, 8.0849609375, 8.391845703125, 8.69873046875, 9.005615234375, 9.3125]}, "gradients/decoder.transformer.h.3.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 5.0, 3.0, 0.0, 2.0, 3.0, 1.0, 4.0, 4.0, 13.0, 9.0, 13.0, 11.0, 24.0, 17.0, 29.0, 55.0, 55.0, 72.0, 99.0, 132.0, 169.0, 276.0, 365.0, 485.0, 1323.0, 4187741.0, 1500.0, 461.0, 375.0, 283.0, 207.0, 141.0, 84.0, 91.0, 49.0, 40.0, 31.0, 25.0, 31.0, 17.0, 13.0, 4.0, 9.0, 7.0, 5.0, 2.0, 1.0, 7.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-324.25, -313.96484375, -303.6796875, -293.39453125, -283.109375, -272.82421875, -262.5390625, -252.25390625, -241.96875, -231.68359375, -221.3984375, -211.11328125, -200.828125, -190.54296875, -180.2578125, -169.97265625, -159.6875, -149.40234375, -139.1171875, -128.83203125, -118.546875, -108.26171875, -97.9765625, -87.69140625, -77.40625, -67.12109375, -56.8359375, -46.55078125, -36.265625, -25.98046875, -15.6953125, -5.41015625, 4.875, 15.16015625, 25.4453125, 35.73046875, 46.015625, 56.30078125, 66.5859375, 76.87109375, 87.15625, 97.44140625, 107.7265625, 118.01171875, 128.296875, 138.58203125, 148.8671875, 159.15234375, 169.4375, 179.72265625, 190.0078125, 200.29296875, 210.578125, 220.86328125, 231.1484375, 241.43359375, 251.71875, 262.00390625, 272.2890625, 282.57421875, 292.859375, 303.14453125, 313.4296875, 323.71484375, 334.0]}, "gradients/decoder.transformer.h.3.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 5.0, 1.0, 4.0, 5.0, 5.0, 9.0, 20.0, 29.0, 37.0, 51.0, 75.0, 114.0, 222.0, 378.0, 678.0, 902.0, 665.0, 316.0, 190.0, 128.0, 86.0, 43.0, 31.0, 24.0, 18.0, 19.0, 10.0, 2.0, 8.0, 3.0, 3.0, 1.0, 1.0, 2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-22.5625, -21.888671875, -21.21484375, -20.541015625, -19.8671875, -19.193359375, -18.51953125, -17.845703125, -17.171875, -16.498046875, -15.82421875, -15.150390625, -14.4765625, -13.802734375, -13.12890625, -12.455078125, -11.78125, -11.107421875, -10.43359375, -9.759765625, -9.0859375, -8.412109375, -7.73828125, -7.064453125, -6.390625, -5.716796875, -5.04296875, -4.369140625, -3.6953125, -3.021484375, -2.34765625, -1.673828125, -1.0, -0.326171875, 0.34765625, 1.021484375, 1.6953125, 2.369140625, 3.04296875, 3.716796875, 4.390625, 5.064453125, 5.73828125, 6.412109375, 7.0859375, 7.759765625, 8.43359375, 9.107421875, 9.78125, 10.455078125, 11.12890625, 11.802734375, 12.4765625, 13.150390625, 13.82421875, 14.498046875, 15.171875, 15.845703125, 16.51953125, 17.193359375, 17.8671875, 18.541015625, 19.21484375, 19.888671875, 20.5625]}, "gradients/decoder.transformer.h.3.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 1.0, 2.0, 0.0, 4.0, 6.0, 5.0, 7.0, 21.0, 17.0, 21.0, 51.0, 57.0, 82.0, 124.0, 205.0, 53537.0, 4139447.0, 323.0, 120.0, 82.0, 52.0, 64.0, 19.0, 13.0, 15.0, 10.0, 6.0, 4.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-158.25, -149.00390625, -139.7578125, -130.51171875, -121.265625, -112.01953125, -102.7734375, -93.52734375, -84.28125, -75.03515625, -65.7890625, -56.54296875, -47.296875, -38.05078125, -28.8046875, -19.55859375, -10.3125, -1.06640625, 8.1796875, 17.42578125, 26.671875, 35.91796875, 45.1640625, 54.41015625, 63.65625, 72.90234375, 82.1484375, 91.39453125, 100.640625, 109.88671875, 119.1328125, 128.37890625, 137.625, 146.87109375, 156.1171875, 165.36328125, 174.609375, 183.85546875, 193.1015625, 202.34765625, 211.59375, 220.83984375, 230.0859375, 239.33203125, 248.578125, 257.82421875, 267.0703125, 276.31640625, 285.5625, 294.80859375, 304.0546875, 313.30078125, 322.546875, 331.79296875, 341.0390625, 350.28515625, 359.53125, 368.77734375, 378.0234375, 387.26953125, 396.515625, 405.76171875, 415.0078125, 424.25390625, 433.5]}, "gradients/decoder.transformer.h.3.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 8.0, 177.0, 596.0, 218.0, 15.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-428.4667053222656, -420.61639404296875, -412.7660827636719, -404.9158020019531, -397.06549072265625, -389.2151794433594, -381.3648681640625, -373.51458740234375, -365.6642761230469, -357.81396484375, -349.9636535644531, -342.1133728027344, -334.2630615234375, -326.4127502441406, -318.56243896484375, -310.712158203125, -302.86181640625, -295.0115051269531, -287.16119384765625, -279.3109130859375, -271.4606018066406, -263.61029052734375, -255.75997924804688, -247.90968322753906, -240.05938720703125, -232.20907592773438, -224.35877990722656, -216.5084686279297, -208.65817260742188, -200.807861328125, -192.95755004882812, -185.1072540283203, -177.2569580078125, -169.40664672851562, -161.5563507080078, -153.70603942871094, -145.85574340820312, -138.00543212890625, -130.15512084960938, -122.30482482910156, -114.45452117919922, -106.60421752929688, -98.75391387939453, -90.90361022949219, -83.05329895019531, -75.2030029296875, -67.35269165039062, -59.50238800048828, -51.65208435058594, -43.801780700683594, -35.95147705078125, -28.10116958618164, -20.250865936279297, -12.400562286376953, -4.550254821777344, 3.300048828125, 11.150352478027344, 19.000656127929688, 26.850961685180664, 34.70126724243164, 42.551570892333984, 50.40187454223633, 58.25218200683594, 66.10248565673828, 73.95278930664062]}, "gradients/decoder.transformer.h.3.ln_2.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 1.0, 5.0, 4.0, 6.0, 8.0, 13.0, 9.0, 14.0, 9.0, 16.0, 23.0, 22.0, 27.0, 33.0, 33.0, 32.0, 34.0, 32.0, 39.0, 39.0, 55.0, 41.0, 36.0, 45.0, 38.0, 37.0, 41.0, 36.0, 26.0, 30.0, 39.0, 24.0, 24.0, 26.0, 22.0, 23.0, 14.0, 15.0, 11.0, 8.0, 4.0, 6.0, 3.0, 3.0, 7.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-55.30072021484375, -53.498348236083984, -51.695980072021484, -49.89360809326172, -48.09123992919922, -46.28886795043945, -44.48649597167969, -42.68412780761719, -40.88175964355469, -39.07938766479492, -37.27701950073242, -35.474647521972656, -33.672279357910156, -31.86990737915039, -30.067537307739258, -28.265167236328125, -26.46279525756836, -24.660425186157227, -22.858055114746094, -21.055683135986328, -19.253314971923828, -17.450942993164062, -15.64857292175293, -13.846202850341797, -12.043832778930664, -10.241462707519531, -8.439092636108398, -6.636721611022949, -4.834351539611816, -3.0319814682006836, -1.2296104431152344, 0.5727596282958984, 2.3751296997070312, 4.177499771118164, 5.979870319366455, 7.782240867614746, 9.584610939025879, 11.386981010437012, 13.189352035522461, 14.991722106933594, 16.794092178344727, 18.59646224975586, 20.398832321166992, 22.201202392578125, 24.00357437133789, 25.80594253540039, 27.608314514160156, 29.41068458557129, 31.213054656982422, 33.01542663574219, 34.81779479980469, 36.62016677856445, 38.42253494262695, 40.22490692138672, 42.02727508544922, 43.829647064208984, 45.63201904296875, 47.434391021728516, 49.236759185791016, 51.03913116455078, 52.84149932861328, 54.64387130737305, 56.44624328613281, 58.24861145019531, 60.05097961425781]}, "gradients/decoder.transformer.h.3.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 4.0, 3.0, 4.0, 5.0, 5.0, 7.0, 4.0, 10.0, 8.0, 18.0, 6.0, 6.0, 9.0, 20.0, 25.0, 18.0, 36.0, 27.0, 34.0, 23.0, 28.0, 40.0, 33.0, 43.0, 41.0, 36.0, 40.0, 44.0, 52.0, 38.0, 31.0, 23.0, 40.0, 31.0, 26.0, 30.0, 30.0, 17.0, 23.0, 12.0, 12.0, 15.0, 17.0, 9.0, 11.0, 7.0, 4.0, 5.0, 4.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0], "bins": [-9.453125, -9.1593017578125, -8.865478515625, -8.5716552734375, -8.27783203125, -7.9840087890625, -7.690185546875, -7.3963623046875, -7.1025390625, -6.8087158203125, -6.514892578125, -6.2210693359375, -5.92724609375, -5.6334228515625, -5.339599609375, -5.0457763671875, -4.751953125, -4.4581298828125, -4.164306640625, -3.8704833984375, -3.57666015625, -3.2828369140625, -2.989013671875, -2.6951904296875, -2.4013671875, -2.1075439453125, -1.813720703125, -1.5198974609375, -1.22607421875, -0.9322509765625, -0.638427734375, -0.3446044921875, -0.05078125, 0.2430419921875, 0.536865234375, 0.8306884765625, 1.12451171875, 1.4183349609375, 1.712158203125, 2.0059814453125, 2.2998046875, 2.5936279296875, 2.887451171875, 3.1812744140625, 3.47509765625, 3.7689208984375, 4.062744140625, 4.3565673828125, 4.650390625, 4.9442138671875, 5.238037109375, 5.5318603515625, 5.82568359375, 6.1195068359375, 6.413330078125, 6.7071533203125, 7.0009765625, 7.2947998046875, 7.588623046875, 7.8824462890625, 8.17626953125, 8.4700927734375, 8.763916015625, 9.0577392578125, 9.3515625]}, "gradients/decoder.transformer.h.3.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 2.0, 3.0, 7.0, 7.0, 13.0, 17.0, 30.0, 44.0, 63.0, 101.0, 134.0, 175.0, 239.0, 405.0, 553.0, 867.0, 1209.0, 1793.0, 2887.0, 4136.0, 6066.0, 9138.0, 13876.0, 21583.0, 33830.0, 53851.0, 91331.0, 186123.0, 304730.0, 125368.0, 69038.0, 42705.0, 26648.0, 17311.0, 11456.0, 7746.0, 4979.0, 3295.0, 2203.0, 1494.0, 1022.0, 681.0, 467.0, 330.0, 208.0, 138.0, 100.0, 52.0, 30.0, 27.0, 21.0, 14.0, 13.0, 3.0, 5.0, 1.0, 1.0, 3.0, 1.0, 1.0], "bins": [-2.4140625, -2.339202880859375, -2.26434326171875, -2.189483642578125, -2.1146240234375, -2.039764404296875, -1.96490478515625, -1.890045166015625, -1.815185546875, -1.740325927734375, -1.66546630859375, -1.590606689453125, -1.5157470703125, -1.440887451171875, -1.36602783203125, -1.291168212890625, -1.21630859375, -1.141448974609375, -1.06658935546875, -0.991729736328125, -0.9168701171875, -0.842010498046875, -0.76715087890625, -0.692291259765625, -0.617431640625, -0.542572021484375, -0.46771240234375, -0.392852783203125, -0.3179931640625, -0.243133544921875, -0.16827392578125, -0.093414306640625, -0.0185546875, 0.056304931640625, 0.13116455078125, 0.206024169921875, 0.2808837890625, 0.355743408203125, 0.43060302734375, 0.505462646484375, 0.580322265625, 0.655181884765625, 0.73004150390625, 0.804901123046875, 0.8797607421875, 0.954620361328125, 1.02947998046875, 1.104339599609375, 1.17919921875, 1.254058837890625, 1.32891845703125, 1.403778076171875, 1.4786376953125, 1.553497314453125, 1.62835693359375, 1.703216552734375, 1.778076171875, 1.852935791015625, 1.92779541015625, 2.002655029296875, 2.0775146484375, 2.152374267578125, 2.22723388671875, 2.302093505859375, 2.376953125]}, "gradients/decoder.transformer.h.3.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 1.0, 0.0, 6.0, 6.0, 6.0, 4.0, 9.0, 8.0, 4.0, 10.0, 17.0, 18.0, 21.0, 19.0, 26.0, 23.0, 24.0, 39.0, 50.0, 31.0, 48.0, 45.0, 45.0, 1075.0, 43.0, 36.0, 40.0, 57.0, 51.0, 33.0, 40.0, 38.0, 28.0, 22.0, 15.0, 22.0, 19.0, 7.0, 10.0, 6.0, 7.0, 11.0, 9.0, 3.0, 3.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-7.4375, -7.21832275390625, -6.9991455078125, -6.77996826171875, -6.560791015625, -6.34161376953125, -6.1224365234375, -5.90325927734375, -5.68408203125, -5.46490478515625, -5.2457275390625, -5.02655029296875, -4.807373046875, -4.58819580078125, -4.3690185546875, -4.14984130859375, -3.9306640625, -3.71148681640625, -3.4923095703125, -3.27313232421875, -3.053955078125, -2.83477783203125, -2.6156005859375, -2.39642333984375, -2.17724609375, -1.95806884765625, -1.7388916015625, -1.51971435546875, -1.300537109375, -1.08135986328125, -0.8621826171875, -0.64300537109375, -0.423828125, -0.20465087890625, 0.0145263671875, 0.23370361328125, 0.452880859375, 0.67205810546875, 0.8912353515625, 1.11041259765625, 1.32958984375, 1.54876708984375, 1.7679443359375, 1.98712158203125, 2.206298828125, 2.42547607421875, 2.6446533203125, 2.86383056640625, 3.0830078125, 3.30218505859375, 3.5213623046875, 3.74053955078125, 3.959716796875, 4.17889404296875, 4.3980712890625, 4.61724853515625, 4.83642578125, 5.05560302734375, 5.2747802734375, 5.49395751953125, 5.713134765625, 5.93231201171875, 6.1514892578125, 6.37066650390625, 6.58984375]}, "gradients/decoder.transformer.h.3.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 4.0, 6.0, 14.0, 21.0, 18.0, 24.0, 54.0, 58.0, 111.0, 198.0, 281.0, 508.0, 837.0, 1453.0, 2848.0, 4935.0, 9420.0, 18480.0, 38706.0, 87849.0, 240725.0, 1465572.0, 123203.0, 51904.0, 24023.0, 12046.0, 6061.0, 3412.0, 1792.0, 1066.0, 624.0, 310.0, 205.0, 130.0, 90.0, 53.0, 23.0, 23.0, 10.0, 11.0, 8.0, 10.0, 8.0, 2.0, 0.0, 2.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-3.63671875, -3.5162353515625, -3.395751953125, -3.2752685546875, -3.15478515625, -3.0343017578125, -2.913818359375, -2.7933349609375, -2.6728515625, -2.5523681640625, -2.431884765625, -2.3114013671875, -2.19091796875, -2.0704345703125, -1.949951171875, -1.8294677734375, -1.708984375, -1.5885009765625, -1.468017578125, -1.3475341796875, -1.22705078125, -1.1065673828125, -0.986083984375, -0.8656005859375, -0.7451171875, -0.6246337890625, -0.504150390625, -0.3836669921875, -0.26318359375, -0.1427001953125, -0.022216796875, 0.0982666015625, 0.21875, 0.3392333984375, 0.459716796875, 0.5802001953125, 0.70068359375, 0.8211669921875, 0.941650390625, 1.0621337890625, 1.1826171875, 1.3031005859375, 1.423583984375, 1.5440673828125, 1.66455078125, 1.7850341796875, 1.905517578125, 2.0260009765625, 2.146484375, 2.2669677734375, 2.387451171875, 2.5079345703125, 2.62841796875, 2.7489013671875, 2.869384765625, 2.9898681640625, 3.1103515625, 3.2308349609375, 3.351318359375, 3.4718017578125, 3.59228515625, 3.7127685546875, 3.833251953125, 3.9537353515625, 4.07421875]}, "gradients/decoder.transformer.h.3.crossattention.q_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 3.0, 1.0, 0.0, 3.0, 0.0, 4.0, 0.0, 0.0, 2.0, 6.0, 6.0, 8.0, 9.0, 13.0, 12.0, 16.0, 15.0, 21.0, 27.0, 28.0, 43.0, 42.0, 65.0, 72.0, 88.0, 86.0, 70.0, 69.0, 56.0, 42.0, 39.0, 32.0, 23.0, 13.0, 17.0, 15.0, 18.0, 17.0, 10.0, 5.0, 1.0, 5.0, 5.0, 0.0, 0.0, 3.0, 4.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0014801025390625, -0.0014332085847854614, -0.0013863146305084229, -0.0013394206762313843, -0.0012925267219543457, -0.0012456327676773071, -0.0011987388134002686, -0.00115184485912323, -0.0011049509048461914, -0.0010580569505691528, -0.0010111629962921143, -0.0009642690420150757, -0.0009173750877380371, -0.0008704811334609985, -0.00082358717918396, -0.0007766932249069214, -0.0007297992706298828, -0.0006829053163528442, -0.0006360113620758057, -0.0005891174077987671, -0.0005422234535217285, -0.0004953294992446899, -0.00044843554496765137, -0.0004015415906906128, -0.0003546476364135742, -0.00030775368213653564, -0.00026085972785949707, -0.0002139657735824585, -0.00016707181930541992, -0.00012017786502838135, -7.328391075134277e-05, -2.63899564743042e-05, 2.0503997802734375e-05, 6.739795207977295e-05, 0.00011429190635681152, 0.0001611858606338501, 0.00020807981491088867, 0.00025497376918792725, 0.0003018677234649658, 0.0003487616777420044, 0.00039565563201904297, 0.00044254958629608154, 0.0004894435405731201, 0.0005363374948501587, 0.0005832314491271973, 0.0006301254034042358, 0.0006770193576812744, 0.000723913311958313, 0.0007708072662353516, 0.0008177012205123901, 0.0008645951747894287, 0.0009114891290664673, 0.0009583830833435059, 0.0010052770376205444, 0.001052170991897583, 0.0010990649461746216, 0.0011459589004516602, 0.0011928528547286987, 0.0012397468090057373, 0.0012866407632827759, 0.0013335347175598145, 0.001380428671836853, 0.0014273226261138916, 0.0014742165803909302, 0.0015211105346679688]}, "gradients/decoder.transformer.h.3.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 0.0, 3.0, 6.0, 2.0, 9.0, 7.0, 5.0, 10.0, 11.0, 15.0, 19.0, 35.0, 30.0, 49.0, 58.0, 64.0, 104.0, 152.0, 220.0, 350.0, 564.0, 2374.0, 977931.0, 64017.0, 1057.0, 447.0, 298.0, 203.0, 121.0, 89.0, 71.0, 45.0, 40.0, 39.0, 31.0, 25.0, 17.0, 10.0, 6.0, 6.0, 2.0, 2.0, 6.0, 1.0, 6.0, 2.0, 1.0, 3.0, 0.0, 1.0, 0.0, 1.0, 2.0], "bins": [-0.029144287109375, -0.028281688690185547, -0.027419090270996094, -0.02655649185180664, -0.025693893432617188, -0.024831295013427734, -0.02396869659423828, -0.023106098175048828, -0.022243499755859375, -0.021380901336669922, -0.02051830291748047, -0.019655704498291016, -0.018793106079101562, -0.01793050765991211, -0.017067909240722656, -0.016205310821533203, -0.01534271240234375, -0.014480113983154297, -0.013617515563964844, -0.01275491714477539, -0.011892318725585938, -0.011029720306396484, -0.010167121887207031, -0.009304523468017578, -0.008441925048828125, -0.007579326629638672, -0.006716728210449219, -0.005854129791259766, -0.0049915313720703125, -0.004128932952880859, -0.0032663345336914062, -0.002403736114501953, -0.0015411376953125, -0.0006785392761230469, 0.00018405914306640625, 0.0010466575622558594, 0.0019092559814453125, 0.0027718544006347656, 0.0036344528198242188, 0.004497051239013672, 0.005359649658203125, 0.006222248077392578, 0.007084846496582031, 0.007947444915771484, 0.008810043334960938, 0.00967264175415039, 0.010535240173339844, 0.011397838592529297, 0.01226043701171875, 0.013123035430908203, 0.013985633850097656, 0.01484823226928711, 0.015710830688476562, 0.016573429107666016, 0.01743602752685547, 0.018298625946044922, 0.019161224365234375, 0.020023822784423828, 0.02088642120361328, 0.021749019622802734, 0.022611618041992188, 0.02347421646118164, 0.024336814880371094, 0.025199413299560547, 0.02606201171875]}, "gradients/decoder.transformer.h.3.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0, 7.0, 6.0, 19.0, 52.0, 105.0, 190.0, 229.0, 194.0, 122.0, 56.0, 21.0, 8.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0007859938777983189, -0.0007435671868734062, -0.0007011404377408326, -0.0006587137468159199, -0.0006162870558910072, -0.0005738603649660945, -0.0005314336158335209, -0.0004890069249086082, -0.00044658020487986505, -0.0004041534848511219, -0.0003617267939262092, -0.00031930007389746606, -0.0002768733538687229, -0.00023444666294381022, -0.00019201994291506708, -0.00014959325199015439, -0.00010716653196141124, -6.473982648458332e-05, -2.2313113731797785e-05, 2.011359902098775e-05, 6.254030449781567e-05, 0.00010496700997464359, 0.00014739373000338674, 0.00018982042092829943, 0.00023224714095704257, 0.0002746738609857857, 0.0003171005519106984, 0.00035952727193944156, 0.0004019539919681847, 0.0004443806828930974, 0.00048680740292184055, 0.0005292340647429228, 0.0005716608138754964, 0.0006140875048004091, 0.0006565142539329827, 0.0006989409448578954, 0.0007413676357828081, 0.0007837943267077208, 0.0008262210758402944, 0.000868647766765207, 0.0009110744576901197, 0.0009535011486150324, 0.0009959278395399451, 0.0010383545886725187, 0.0010807813378050923, 0.0011232079705223441, 0.0011656347196549177, 0.0012080613523721695, 0.001250488217920065, 0.0012929149670526385, 0.0013353415997698903, 0.001377768348902464, 0.0014201950980350375, 0.0014626217307522893, 0.001505048479884863, 0.0015474751126021147, 0.0015899018617346883, 0.0016323286108672619, 0.0016747552435845137, 0.0017171819927170873, 0.0017596087418496609, 0.0018020353745669127, 0.0018444621236994863, 0.0018868888728320599, 0.0019293155055493116]}, "gradients/decoder.transformer.h.3.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 2.0, 1.0, 5.0, 3.0, 3.0, 3.0, 4.0, 12.0, 8.0, 5.0, 10.0, 15.0, 20.0, 12.0, 15.0, 13.0, 30.0, 30.0, 26.0, 35.0, 49.0, 41.0, 40.0, 48.0, 40.0, 32.0, 47.0, 44.0, 48.0, 44.0, 50.0, 32.0, 34.0, 28.0, 28.0, 29.0, 23.0, 18.0, 19.0, 12.0, 12.0, 10.0, 5.0, 6.0, 8.0, 7.0, 2.0, 3.0, 2.0, 4.0, 3.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0006769299507141113, -0.0006539030000567436, -0.0006308760493993759, -0.0006078490987420082, -0.0005848221480846405, -0.0005617951974272728, -0.0005387682467699051, -0.0005157412961125374, -0.0004927143454551697, -0.00046968739479780197, -0.00044666044414043427, -0.00042363349348306656, -0.00040060654282569885, -0.00037757959216833115, -0.00035455264151096344, -0.00033152569085359573, -0.00030849874019622803, -0.0002854717895388603, -0.0002624448388814926, -0.0002394178882241249, -0.0002163909375667572, -0.0001933639869093895, -0.0001703370362520218, -0.00014731008559465408, -0.00012428313493728638, -0.00010125618427991867, -7.822923362255096e-05, -5.520228296518326e-05, -3.217533230781555e-05, -9.148381650447845e-06, 1.387856900691986e-05, 3.690551966428757e-05, 5.9932470321655273e-05, 8.295942097902298e-05, 0.00010598637163639069, 0.0001290133222937584, 0.0001520402729511261, 0.0001750672236084938, 0.0001980941742658615, 0.00022112112492322922, 0.0002441480755805969, 0.00026717502623796463, 0.00029020197689533234, 0.00031322892755270004, 0.00033625587821006775, 0.00035928282886743546, 0.00038230977952480316, 0.00040533673018217087, 0.0004283636808395386, 0.0004513906314969063, 0.000474417582154274, 0.0004974445328116417, 0.0005204714834690094, 0.0005434984341263771, 0.0005665253847837448, 0.0005895523354411125, 0.0006125792860984802, 0.0006356062367558479, 0.0006586331874132156, 0.0006816601380705833, 0.000704687088727951, 0.0007277140393853188, 0.0007507409900426865, 0.0007737679407000542, 0.0007967948913574219]}, "gradients/decoder.transformer.h.3.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 4.0, 3.0, 4.0, 5.0, 5.0, 7.0, 4.0, 10.0, 8.0, 18.0, 6.0, 6.0, 9.0, 20.0, 25.0, 18.0, 36.0, 27.0, 34.0, 23.0, 28.0, 40.0, 33.0, 43.0, 41.0, 36.0, 40.0, 44.0, 52.0, 38.0, 31.0, 23.0, 40.0, 31.0, 26.0, 30.0, 30.0, 17.0, 23.0, 12.0, 12.0, 15.0, 17.0, 9.0, 11.0, 7.0, 4.0, 5.0, 4.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0], "bins": [-9.453125, -9.1593017578125, -8.865478515625, -8.5716552734375, -8.27783203125, -7.9840087890625, -7.690185546875, -7.3963623046875, -7.1025390625, -6.8087158203125, -6.514892578125, -6.2210693359375, -5.92724609375, -5.6334228515625, -5.339599609375, -5.0457763671875, -4.751953125, -4.4581298828125, -4.164306640625, -3.8704833984375, -3.57666015625, -3.2828369140625, -2.989013671875, -2.6951904296875, -2.4013671875, -2.1075439453125, -1.813720703125, -1.5198974609375, -1.22607421875, -0.9322509765625, -0.638427734375, -0.3446044921875, -0.05078125, 0.2430419921875, 0.536865234375, 0.8306884765625, 1.12451171875, 1.4183349609375, 1.712158203125, 2.0059814453125, 2.2998046875, 2.5936279296875, 2.887451171875, 3.1812744140625, 3.47509765625, 3.7689208984375, 4.062744140625, 4.3565673828125, 4.650390625, 4.9442138671875, 5.238037109375, 5.5318603515625, 5.82568359375, 6.1195068359375, 6.413330078125, 6.7071533203125, 7.0009765625, 7.2947998046875, 7.588623046875, 7.8824462890625, 8.17626953125, 8.4700927734375, 8.763916015625, 9.0577392578125, 9.3515625]}, "gradients/decoder.transformer.h.3.attn.c_proj.weight": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 5.0, 3.0, 6.0, 6.0, 10.0, 10.0, 19.0, 20.0, 26.0, 40.0, 52.0, 75.0, 86.0, 96.0, 152.0, 174.0, 202.0, 259.0, 348.0, 477.0, 581.0, 783.0, 1044.0, 1381.0, 2123.0, 3897.0, 10864.0, 47063.0, 253745.0, 536405.0, 144157.0, 27160.0, 7165.0, 3031.0, 1816.0, 1227.0, 877.0, 738.0, 539.0, 426.0, 300.0, 260.0, 199.0, 177.0, 135.0, 101.0, 77.0, 67.0, 44.0, 38.0, 25.0, 14.0, 8.0, 10.0, 9.0, 8.0, 5.0, 4.0, 1.0, 1.0, 2.0], "bins": [-13.1484375, -12.74169921875, -12.3349609375, -11.92822265625, -11.521484375, -11.11474609375, -10.7080078125, -10.30126953125, -9.89453125, -9.48779296875, -9.0810546875, -8.67431640625, -8.267578125, -7.86083984375, -7.4541015625, -7.04736328125, -6.640625, -6.23388671875, -5.8271484375, -5.42041015625, -5.013671875, -4.60693359375, -4.2001953125, -3.79345703125, -3.38671875, -2.97998046875, -2.5732421875, -2.16650390625, -1.759765625, -1.35302734375, -0.9462890625, -0.53955078125, -0.1328125, 0.27392578125, 0.6806640625, 1.08740234375, 1.494140625, 1.90087890625, 2.3076171875, 2.71435546875, 3.12109375, 3.52783203125, 3.9345703125, 4.34130859375, 4.748046875, 5.15478515625, 5.5615234375, 5.96826171875, 6.375, 6.78173828125, 7.1884765625, 7.59521484375, 8.001953125, 8.40869140625, 8.8154296875, 9.22216796875, 9.62890625, 10.03564453125, 10.4423828125, 10.84912109375, 11.255859375, 11.66259765625, 12.0693359375, 12.47607421875, 12.8828125]}, "gradients/decoder.transformer.h.3.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 4.0, 4.0, 11.0, 13.0, 32.0, 22.0, 45.0, 33.0, 67.0, 79.0, 143.0, 337.0, 1770.0, 140.0, 89.0, 73.0, 49.0, 52.0, 36.0, 19.0, 17.0, 5.0, 9.0, 4.0, 4.0, 3.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-46.84375, -45.23876953125, -43.6337890625, -42.02880859375, -40.423828125, -38.81884765625, -37.2138671875, -35.60888671875, -34.00390625, -32.39892578125, -30.7939453125, -29.18896484375, -27.583984375, -25.97900390625, -24.3740234375, -22.76904296875, -21.1640625, -19.55908203125, -17.9541015625, -16.34912109375, -14.744140625, -13.13916015625, -11.5341796875, -9.92919921875, -8.32421875, -6.71923828125, -5.1142578125, -3.50927734375, -1.904296875, -0.29931640625, 1.3056640625, 2.91064453125, 4.515625, 6.12060546875, 7.7255859375, 9.33056640625, 10.935546875, 12.54052734375, 14.1455078125, 15.75048828125, 17.35546875, 18.96044921875, 20.5654296875, 22.17041015625, 23.775390625, 25.38037109375, 26.9853515625, 28.59033203125, 30.1953125, 31.80029296875, 33.4052734375, 35.01025390625, 36.615234375, 38.22021484375, 39.8251953125, 41.43017578125, 43.03515625, 44.64013671875, 46.2451171875, 47.85009765625, 49.455078125, 51.06005859375, 52.6650390625, 54.27001953125, 55.875]}, "gradients/decoder.transformer.h.3.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 4.0, 2.0, 4.0, 10.0, 4.0, 3.0, 10.0, 19.0, 25.0, 33.0, 56.0, 107.0, 143.0, 266.0, 664.0, 3033.0, 3136535.0, 3411.0, 704.0, 260.0, 152.0, 96.0, 53.0, 50.0, 25.0, 21.0, 13.0, 3.0, 7.0, 3.0, 1.0, 4.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-141.25, -137.158203125, -133.06640625, -128.974609375, -124.8828125, -120.791015625, -116.69921875, -112.607421875, -108.515625, -104.423828125, -100.33203125, -96.240234375, -92.1484375, -88.056640625, -83.96484375, -79.873046875, -75.78125, -71.689453125, -67.59765625, -63.505859375, -59.4140625, -55.322265625, -51.23046875, -47.138671875, -43.046875, -38.955078125, -34.86328125, -30.771484375, -26.6796875, -22.587890625, -18.49609375, -14.404296875, -10.3125, -6.220703125, -2.12890625, 1.962890625, 6.0546875, 10.146484375, 14.23828125, 18.330078125, 22.421875, 26.513671875, 30.60546875, 34.697265625, 38.7890625, 42.880859375, 46.97265625, 51.064453125, 55.15625, 59.248046875, 63.33984375, 67.431640625, 71.5234375, 75.615234375, 79.70703125, 83.798828125, 87.890625, 91.982421875, 96.07421875, 100.166015625, 104.2578125, 108.349609375, 112.44140625, 116.533203125, 120.625]}, "gradients/decoder.transformer.h.3.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 30.0, 367.0, 518.0, 90.0, 11.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-157.63394165039062, -153.01654052734375, -148.39913940429688, -143.78173828125, -139.16433715820312, -134.54693603515625, -129.92953491210938, -125.31212615966797, -120.6947250366211, -116.07732391357422, -111.45992279052734, -106.84252166748047, -102.22511291503906, -97.60771179199219, -92.99031066894531, -88.37290954589844, -83.75550842285156, -79.13810729980469, -74.52070617675781, -69.90330505371094, -65.28590393066406, -60.66849899291992, -56.05109405517578, -51.433692932128906, -46.81629180908203, -42.198890686035156, -37.58148956298828, -32.96408462524414, -28.346683502197266, -23.72928237915039, -19.111879348754883, -14.494476318359375, -9.8770751953125, -5.259673118591309, -0.6422710418701172, 3.975131034851074, 8.592533111572266, 13.20993423461914, 17.82733726501465, 22.444740295410156, 27.06214141845703, 31.679542541503906, 36.29694366455078, 40.91434860229492, 45.5317497253418, 50.14915084838867, 54.76655578613281, 59.38395690917969, 64.00135803222656, 68.61875915527344, 73.23616027832031, 77.85356140136719, 82.47096252441406, 87.08836364746094, 91.70577239990234, 96.32317352294922, 100.9405746459961, 105.55797576904297, 110.17537689208984, 114.79277801513672, 119.41018676757812, 124.027587890625, 128.64498901367188, 133.26239013671875, 137.87979125976562]}, "gradients/decoder.transformer.h.3.ln_1.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 3.0, 4.0, 4.0, 6.0, 5.0, 6.0, 3.0, 6.0, 6.0, 11.0, 13.0, 16.0, 25.0, 34.0, 37.0, 33.0, 29.0, 40.0, 38.0, 43.0, 38.0, 61.0, 47.0, 46.0, 39.0, 50.0, 32.0, 34.0, 40.0, 30.0, 33.0, 37.0, 22.0, 28.0, 23.0, 18.0, 13.0, 13.0, 8.0, 16.0, 5.0, 6.0, 5.0, 3.0, 2.0, 0.0, 2.0, 2.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-75.12521362304688, -72.54951477050781, -69.97382354736328, -67.39812469482422, -64.82242584228516, -62.246734619140625, -59.67103576660156, -57.095340728759766, -54.51964569091797, -51.94395065307617, -49.36825180053711, -46.79255676269531, -44.216861724853516, -41.64116668701172, -39.065467834472656, -36.48977279663086, -33.9140739440918, -31.338376998901367, -28.76268196105957, -26.18698501586914, -23.611289978027344, -21.035593032836914, -18.459896087646484, -15.884201049804688, -13.308504104614258, -10.732808113098145, -8.157112121582031, -5.581415176391602, -3.0057191848754883, -0.430023193359375, 2.1456737518310547, 4.721368789672852, 7.297065734863281, 9.872761726379395, 12.448457717895508, 15.024154663085938, 17.599849700927734, 20.175546646118164, 22.751243591308594, 25.32693862915039, 27.90263557434082, 30.47833251953125, 33.05402755737305, 35.629722595214844, 38.205421447753906, 40.7811164855957, 43.3568115234375, 45.93251037597656, 48.50820541381836, 51.083900451660156, 53.65959930419922, 56.235294342041016, 58.81098937988281, 61.386688232421875, 63.96238327026367, 66.53807830810547, 69.11377716064453, 71.6894760131836, 74.26516723632812, 76.84086608886719, 79.41656494140625, 81.99225616455078, 84.56795501708984, 87.14364624023438, 89.71934509277344]}, "gradients/decoder.transformer.h.2.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 3.0, 0.0, 1.0, 1.0, 1.0, 2.0, 6.0, 5.0, 4.0, 5.0, 10.0, 6.0, 8.0, 10.0, 13.0, 12.0, 7.0, 13.0, 19.0, 22.0, 18.0, 21.0, 22.0, 26.0, 30.0, 33.0, 33.0, 30.0, 42.0, 37.0, 35.0, 38.0, 47.0, 38.0, 40.0, 29.0, 44.0, 31.0, 32.0, 19.0, 27.0, 26.0, 26.0, 19.0, 23.0, 14.0, 17.0, 11.0, 14.0, 8.0, 9.0, 7.0, 4.0, 5.0, 4.0, 3.0, 1.0, 2.0, 2.0, 1.0, 3.0, 2.0, 2.0], "bins": [-8.9375, -8.6636962890625, -8.389892578125, -8.1160888671875, -7.84228515625, -7.5684814453125, -7.294677734375, -7.0208740234375, -6.7470703125, -6.4732666015625, -6.199462890625, -5.9256591796875, -5.65185546875, -5.3780517578125, -5.104248046875, -4.8304443359375, -4.556640625, -4.2828369140625, -4.009033203125, -3.7352294921875, -3.46142578125, -3.1876220703125, -2.913818359375, -2.6400146484375, -2.3662109375, -2.0924072265625, -1.818603515625, -1.5447998046875, -1.27099609375, -0.9971923828125, -0.723388671875, -0.4495849609375, -0.17578125, 0.0980224609375, 0.371826171875, 0.6456298828125, 0.91943359375, 1.1932373046875, 1.467041015625, 1.7408447265625, 2.0146484375, 2.2884521484375, 2.562255859375, 2.8360595703125, 3.10986328125, 3.3836669921875, 3.657470703125, 3.9312744140625, 4.205078125, 4.4788818359375, 4.752685546875, 5.0264892578125, 5.30029296875, 5.5740966796875, 5.847900390625, 6.1217041015625, 6.3955078125, 6.6693115234375, 6.943115234375, 7.2169189453125, 7.49072265625, 7.7645263671875, 8.038330078125, 8.3121337890625, 8.5859375]}, "gradients/decoder.transformer.h.2.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 0.0, 1.0, 3.0, 0.0, 1.0, 0.0, 13.0, 5.0, 3.0, 9.0, 14.0, 13.0, 18.0, 30.0, 39.0, 36.0, 55.0, 65.0, 105.0, 92.0, 141.0, 196.0, 263.0, 536.0, 1517.0, 8056.0, 403298.0, 3608197.0, 163609.0, 5424.0, 1098.0, 419.0, 248.0, 179.0, 124.0, 105.0, 79.0, 73.0, 50.0, 49.0, 40.0, 20.0, 25.0, 19.0, 6.0, 7.0, 8.0, 4.0, 1.0, 2.0, 2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-37.875, -36.6689453125, -35.462890625, -34.2568359375, -33.05078125, -31.8447265625, -30.638671875, -29.4326171875, -28.2265625, -27.0205078125, -25.814453125, -24.6083984375, -23.40234375, -22.1962890625, -20.990234375, -19.7841796875, -18.578125, -17.3720703125, -16.166015625, -14.9599609375, -13.75390625, -12.5478515625, -11.341796875, -10.1357421875, -8.9296875, -7.7236328125, -6.517578125, -5.3115234375, -4.10546875, -2.8994140625, -1.693359375, -0.4873046875, 0.71875, 1.9248046875, 3.130859375, 4.3369140625, 5.54296875, 6.7490234375, 7.955078125, 9.1611328125, 10.3671875, 11.5732421875, 12.779296875, 13.9853515625, 15.19140625, 16.3974609375, 17.603515625, 18.8095703125, 20.015625, 21.2216796875, 22.427734375, 23.6337890625, 24.83984375, 26.0458984375, 27.251953125, 28.4580078125, 29.6640625, 30.8701171875, 32.076171875, 33.2822265625, 34.48828125, 35.6943359375, 36.900390625, 38.1064453125, 39.3125]}, "gradients/decoder.transformer.h.2.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 7.0, 4.0, 1.0, 3.0, 12.0, 7.0, 10.0, 24.0, 36.0, 55.0, 58.0, 93.0, 152.0, 203.0, 285.0, 457.0, 671.0, 703.0, 433.0, 281.0, 210.0, 123.0, 81.0, 61.0, 37.0, 24.0, 14.0, 7.0, 7.0, 3.0, 6.0, 7.0, 4.0, 4.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-19.296875, -18.632080078125, -17.96728515625, -17.302490234375, -16.6376953125, -15.972900390625, -15.30810546875, -14.643310546875, -13.978515625, -13.313720703125, -12.64892578125, -11.984130859375, -11.3193359375, -10.654541015625, -9.98974609375, -9.324951171875, -8.66015625, -7.995361328125, -7.33056640625, -6.665771484375, -6.0009765625, -5.336181640625, -4.67138671875, -4.006591796875, -3.341796875, -2.677001953125, -2.01220703125, -1.347412109375, -0.6826171875, -0.017822265625, 0.64697265625, 1.311767578125, 1.9765625, 2.641357421875, 3.30615234375, 3.970947265625, 4.6357421875, 5.300537109375, 5.96533203125, 6.630126953125, 7.294921875, 7.959716796875, 8.62451171875, 9.289306640625, 9.9541015625, 10.618896484375, 11.28369140625, 11.948486328125, 12.61328125, 13.278076171875, 13.94287109375, 14.607666015625, 15.2724609375, 15.937255859375, 16.60205078125, 17.266845703125, 17.931640625, 18.596435546875, 19.26123046875, 19.926025390625, 20.5908203125, 21.255615234375, 21.92041015625, 22.585205078125, 23.25]}, "gradients/decoder.transformer.h.2.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0, 2.0, 9.0, 12.0, 25.0, 44.0, 66.0, 128.0, 293.0, 1103.0, 26923.0, 4155552.0, 8756.0, 846.0, 271.0, 121.0, 63.0, 34.0, 19.0, 7.0, 3.0, 6.0, 1.0, 1.0, 2.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-129.5, -126.126953125, -122.75390625, -119.380859375, -116.0078125, -112.634765625, -109.26171875, -105.888671875, -102.515625, -99.142578125, -95.76953125, -92.396484375, -89.0234375, -85.650390625, -82.27734375, -78.904296875, -75.53125, -72.158203125, -68.78515625, -65.412109375, -62.0390625, -58.666015625, -55.29296875, -51.919921875, -48.546875, -45.173828125, -41.80078125, -38.427734375, -35.0546875, -31.681640625, -28.30859375, -24.935546875, -21.5625, -18.189453125, -14.81640625, -11.443359375, -8.0703125, -4.697265625, -1.32421875, 2.048828125, 5.421875, 8.794921875, 12.16796875, 15.541015625, 18.9140625, 22.287109375, 25.66015625, 29.033203125, 32.40625, 35.779296875, 39.15234375, 42.525390625, 45.8984375, 49.271484375, 52.64453125, 56.017578125, 59.390625, 62.763671875, 66.13671875, 69.509765625, 72.8828125, 76.255859375, 79.62890625, 83.001953125, 86.375]}, "gradients/decoder.transformer.h.2.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 5.0, 180.0, 786.0, 43.0, 4.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-678.7581176757812, -662.726318359375, -646.6944580078125, -630.66259765625, -614.6307983398438, -598.5989990234375, -582.567138671875, -566.5352783203125, -550.5034790039062, -534.4716796875, -518.4398193359375, -502.4079895019531, -486.37615966796875, -470.3443298339844, -454.3125, -438.2806701660156, -422.24884033203125, -406.2170104980469, -390.1851806640625, -374.1533508300781, -358.12152099609375, -342.0896911621094, -326.057861328125, -310.0260314941406, -293.99420166015625, -277.9623718261719, -261.9305419921875, -245.89871215820312, -229.86688232421875, -213.83505249023438, -197.80322265625, -181.77139282226562, -165.7396240234375, -149.70779418945312, -133.67596435546875, -117.64413452148438, -101.6123046875, -85.58047485351562, -69.54864501953125, -53.516815185546875, -37.4849853515625, -21.453155517578125, -5.42132568359375, 10.610504150390625, 26.642333984375, 42.674163818359375, 58.70599365234375, 74.73782348632812, 90.7696533203125, 106.80148315429688, 122.83331298828125, 138.86514282226562, 154.89697265625, 170.92880249023438, 186.96063232421875, 202.99246215820312, 219.0242919921875, 235.05612182617188, 251.08795166015625, 267.1197814941406, 283.151611328125, 299.1834411621094, 315.21527099609375, 331.2471008300781, 347.2789306640625]}, "gradients/decoder.transformer.h.2.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 3.0, 4.0, 6.0, 6.0, 15.0, 14.0, 8.0, 21.0, 15.0, 23.0, 27.0, 21.0, 31.0, 39.0, 34.0, 49.0, 44.0, 54.0, 54.0, 54.0, 54.0, 41.0, 37.0, 52.0, 44.0, 36.0, 37.0, 31.0, 41.0, 22.0, 24.0, 17.0, 21.0, 5.0, 11.0, 6.0, 4.0, 2.0, 4.0, 2.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-74.73199462890625, -72.3093490600586, -69.88670349121094, -67.46405792236328, -65.04141235351562, -62.6187629699707, -60.19611740112305, -57.773468017578125, -55.35082244873047, -52.92817687988281, -50.505531311035156, -48.0828857421875, -45.66023635864258, -43.23759078979492, -40.814945220947266, -38.392295837402344, -35.96965408325195, -33.5470085144043, -31.124361038208008, -28.70171546936035, -26.279067993164062, -23.856422424316406, -21.43377685546875, -19.01112937927246, -16.588483810424805, -14.165837287902832, -11.74319076538086, -9.320545196533203, -6.8978986740112305, -4.475252151489258, -2.0526065826416016, 0.3700408935546875, 2.7926864624023438, 5.215332984924316, 7.637979030609131, 10.060625076293945, 12.483271598815918, 14.90591812133789, 17.328563690185547, 19.751211166381836, 22.173856735229492, 24.59650230407715, 27.019149780273438, 29.441795349121094, 31.86444091796875, 34.287086486816406, 36.70973205566406, 39.132381439208984, 41.55502700805664, 43.9776725769043, 46.40031814575195, 48.822967529296875, 51.24561309814453, 53.66825866699219, 56.090904235839844, 58.5135498046875, 60.936195373535156, 63.35884094238281, 65.78148651123047, 68.20413208007812, 70.62677764892578, 73.04942321777344, 75.47207641601562, 77.89472198486328, 80.31736755371094]}, "gradients/decoder.transformer.h.2.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 3.0, 4.0, 4.0, 0.0, 0.0, 2.0, 1.0, 10.0, 6.0, 9.0, 10.0, 15.0, 18.0, 26.0, 25.0, 28.0, 25.0, 25.0, 36.0, 23.0, 26.0, 42.0, 46.0, 35.0, 42.0, 44.0, 52.0, 42.0, 43.0, 40.0, 35.0, 33.0, 37.0, 33.0, 29.0, 22.0, 15.0, 23.0, 24.0, 13.0, 14.0, 10.0, 10.0, 10.0, 7.0, 6.0, 3.0, 4.0, 2.0, 1.0, 2.0, 2.0, 2.0], "bins": [-10.609375, -10.321533203125, -10.03369140625, -9.745849609375, -9.4580078125, -9.170166015625, -8.88232421875, -8.594482421875, -8.306640625, -8.018798828125, -7.73095703125, -7.443115234375, -7.1552734375, -6.867431640625, -6.57958984375, -6.291748046875, -6.00390625, -5.716064453125, -5.42822265625, -5.140380859375, -4.8525390625, -4.564697265625, -4.27685546875, -3.989013671875, -3.701171875, -3.413330078125, -3.12548828125, -2.837646484375, -2.5498046875, -2.261962890625, -1.97412109375, -1.686279296875, -1.3984375, -1.110595703125, -0.82275390625, -0.534912109375, -0.2470703125, 0.040771484375, 0.32861328125, 0.616455078125, 0.904296875, 1.192138671875, 1.47998046875, 1.767822265625, 2.0556640625, 2.343505859375, 2.63134765625, 2.919189453125, 3.20703125, 3.494873046875, 3.78271484375, 4.070556640625, 4.3583984375, 4.646240234375, 4.93408203125, 5.221923828125, 5.509765625, 5.797607421875, 6.08544921875, 6.373291015625, 6.6611328125, 6.948974609375, 7.23681640625, 7.524658203125, 7.8125]}, "gradients/decoder.transformer.h.2.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 2.0, 3.0, 8.0, 6.0, 5.0, 8.0, 11.0, 28.0, 44.0, 72.0, 96.0, 129.0, 217.0, 370.0, 545.0, 877.0, 1376.0, 2239.0, 3561.0, 5697.0, 9398.0, 15514.0, 26684.0, 46110.0, 83038.0, 178442.0, 371520.0, 136986.0, 69368.0, 38381.0, 22731.0, 13782.0, 8212.0, 4965.0, 3002.0, 1835.0, 1082.0, 806.0, 511.0, 293.0, 183.0, 141.0, 106.0, 64.0, 41.0, 30.0, 17.0, 6.0, 9.0, 8.0, 4.0, 1.0, 4.0, 2.0], "bins": [-2.994140625, -2.911773681640625, -2.82940673828125, -2.747039794921875, -2.6646728515625, -2.582305908203125, -2.49993896484375, -2.417572021484375, -2.335205078125, -2.252838134765625, -2.17047119140625, -2.088104248046875, -2.0057373046875, -1.923370361328125, -1.84100341796875, -1.758636474609375, -1.67626953125, -1.593902587890625, -1.51153564453125, -1.429168701171875, -1.3468017578125, -1.264434814453125, -1.18206787109375, -1.099700927734375, -1.017333984375, -0.934967041015625, -0.85260009765625, -0.770233154296875, -0.6878662109375, -0.605499267578125, -0.52313232421875, -0.440765380859375, -0.3583984375, -0.276031494140625, -0.19366455078125, -0.111297607421875, -0.0289306640625, 0.053436279296875, 0.13580322265625, 0.218170166015625, 0.300537109375, 0.382904052734375, 0.46527099609375, 0.547637939453125, 0.6300048828125, 0.712371826171875, 0.79473876953125, 0.877105712890625, 0.95947265625, 1.041839599609375, 1.12420654296875, 1.206573486328125, 1.2889404296875, 1.371307373046875, 1.45367431640625, 1.536041259765625, 1.618408203125, 1.700775146484375, 1.78314208984375, 1.865509033203125, 1.9478759765625, 2.030242919921875, 2.11260986328125, 2.194976806640625, 2.27734375]}, "gradients/decoder.transformer.h.2.crossattention.c_attn.bias": {"_type": "histogram", "values": [3.0, 0.0, 5.0, 2.0, 2.0, 4.0, 6.0, 9.0, 3.0, 7.0, 11.0, 10.0, 14.0, 15.0, 22.0, 13.0, 21.0, 23.0, 20.0, 27.0, 24.0, 27.0, 40.0, 35.0, 34.0, 37.0, 21.0, 44.0, 36.0, 1065.0, 28.0, 36.0, 41.0, 28.0, 42.0, 28.0, 29.0, 26.0, 23.0, 15.0, 18.0, 26.0, 23.0, 19.0, 10.0, 15.0, 11.0, 8.0, 8.0, 1.0, 5.0, 9.0, 5.0, 5.0, 3.0, 3.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.609375, -4.45343017578125, -4.2974853515625, -4.14154052734375, -3.985595703125, -3.82965087890625, -3.6737060546875, -3.51776123046875, -3.36181640625, -3.20587158203125, -3.0499267578125, -2.89398193359375, -2.738037109375, -2.58209228515625, -2.4261474609375, -2.27020263671875, -2.1142578125, -1.95831298828125, -1.8023681640625, -1.64642333984375, -1.490478515625, -1.33453369140625, -1.1785888671875, -1.02264404296875, -0.86669921875, -0.71075439453125, -0.5548095703125, -0.39886474609375, -0.242919921875, -0.08697509765625, 0.0689697265625, 0.22491455078125, 0.380859375, 0.53680419921875, 0.6927490234375, 0.84869384765625, 1.004638671875, 1.16058349609375, 1.3165283203125, 1.47247314453125, 1.62841796875, 1.78436279296875, 1.9403076171875, 2.09625244140625, 2.252197265625, 2.40814208984375, 2.5640869140625, 2.72003173828125, 2.8759765625, 3.03192138671875, 3.1878662109375, 3.34381103515625, 3.499755859375, 3.65570068359375, 3.8116455078125, 3.96759033203125, 4.12353515625, 4.27947998046875, 4.4354248046875, 4.59136962890625, 4.747314453125, 4.90325927734375, 5.0592041015625, 5.21514892578125, 5.37109375]}, "gradients/decoder.transformer.h.2.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 2.0, 6.0, 5.0, 12.0, 5.0, 5.0, 11.0, 18.0, 23.0, 51.0, 62.0, 115.0, 174.0, 278.0, 468.0, 784.0, 1363.0, 2372.0, 3853.0, 6739.0, 11821.0, 20996.0, 38074.0, 72108.0, 146911.0, 1440736.0, 172831.0, 80885.0, 42326.0, 23258.0, 12818.0, 7540.0, 4222.0, 2587.0, 1470.0, 841.0, 536.0, 323.0, 161.0, 109.0, 72.0, 56.0, 30.0, 26.0, 14.0, 13.0, 3.0, 8.0, 9.0, 5.0, 1.0, 3.0, 4.0, 1.0, 3.0], "bins": [-2.970703125, -2.88482666015625, -2.7989501953125, -2.71307373046875, -2.627197265625, -2.54132080078125, -2.4554443359375, -2.36956787109375, -2.28369140625, -2.19781494140625, -2.1119384765625, -2.02606201171875, -1.940185546875, -1.85430908203125, -1.7684326171875, -1.68255615234375, -1.5966796875, -1.51080322265625, -1.4249267578125, -1.33905029296875, -1.253173828125, -1.16729736328125, -1.0814208984375, -0.99554443359375, -0.90966796875, -0.82379150390625, -0.7379150390625, -0.65203857421875, -0.566162109375, -0.48028564453125, -0.3944091796875, -0.30853271484375, -0.22265625, -0.13677978515625, -0.0509033203125, 0.03497314453125, 0.120849609375, 0.20672607421875, 0.2926025390625, 0.37847900390625, 0.46435546875, 0.55023193359375, 0.6361083984375, 0.72198486328125, 0.807861328125, 0.89373779296875, 0.9796142578125, 1.06549072265625, 1.1513671875, 1.23724365234375, 1.3231201171875, 1.40899658203125, 1.494873046875, 1.58074951171875, 1.6666259765625, 1.75250244140625, 1.83837890625, 1.92425537109375, 2.0101318359375, 2.09600830078125, 2.181884765625, 2.26776123046875, 2.3536376953125, 2.43951416015625, 2.525390625]}, "gradients/decoder.transformer.h.2.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 2.0, 4.0, 6.0, 7.0, 11.0, 8.0, 11.0, 10.0, 23.0, 15.0, 26.0, 29.0, 52.0, 75.0, 92.0, 129.0, 133.0, 100.0, 68.0, 53.0, 45.0, 27.0, 23.0, 16.0, 8.0, 7.0, 8.0, 5.0, 4.0, 4.0, 1.0, 3.0, 1.0, 2.0, 2.0, 0.0, 1.0, 2.0, 3.0, 1.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.0022792816162109375, -0.0022161900997161865, -0.0021530985832214355, -0.0020900070667266846, -0.0020269155502319336, -0.0019638240337371826, -0.0019007325172424316, -0.0018376410007476807, -0.0017745494842529297, -0.0017114579677581787, -0.0016483664512634277, -0.0015852749347686768, -0.0015221834182739258, -0.0014590919017791748, -0.0013960003852844238, -0.0013329088687896729, -0.0012698173522949219, -0.001206725835800171, -0.00114363431930542, -0.001080542802810669, -0.001017451286315918, -0.000954359769821167, -0.000891268253326416, -0.000828176736831665, -0.0007650852203369141, -0.0007019937038421631, -0.0006389021873474121, -0.0005758106708526611, -0.0005127191543579102, -0.0004496276378631592, -0.0003865361213684082, -0.0003234446048736572, -0.00026035308837890625, -0.00019726157188415527, -0.0001341700553894043, -7.107853889465332e-05, -7.987022399902344e-06, 5.510449409484863e-05, 0.00011819601058959961, 0.00018128752708435059, 0.00024437904357910156, 0.00030747056007385254, 0.0003705620765686035, 0.0004336535930633545, 0.0004967451095581055, 0.0005598366260528564, 0.0006229281425476074, 0.0006860196590423584, 0.0007491111755371094, 0.0008122026920318604, 0.0008752942085266113, 0.0009383857250213623, 0.0010014772415161133, 0.0010645687580108643, 0.0011276602745056152, 0.0011907517910003662, 0.0012538433074951172, 0.0013169348239898682, 0.0013800263404846191, 0.0014431178569793701, 0.001506209373474121, 0.001569300889968872, 0.001632392406463623, 0.001695483922958374, 0.001758575439453125]}, "gradients/decoder.transformer.h.2.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 2.0, 3.0, 3.0, 5.0, 3.0, 5.0, 3.0, 6.0, 4.0, 6.0, 4.0, 7.0, 13.0, 13.0, 26.0, 22.0, 38.0, 55.0, 84.0, 105.0, 193.0, 280.0, 487.0, 1126.0, 523166.0, 520333.0, 1149.0, 520.0, 275.0, 194.0, 133.0, 69.0, 68.0, 32.0, 36.0, 21.0, 29.0, 15.0, 9.0, 11.0, 7.0, 2.0, 2.0, 4.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.028839111328125, -0.02780914306640625, -0.0267791748046875, -0.02574920654296875, -0.02471923828125, -0.02368927001953125, -0.0226593017578125, -0.02162933349609375, -0.020599365234375, -0.01956939697265625, -0.0185394287109375, -0.01750946044921875, -0.0164794921875, -0.01544952392578125, -0.0144195556640625, -0.01338958740234375, -0.012359619140625, -0.01132965087890625, -0.0102996826171875, -0.00926971435546875, -0.00823974609375, -0.00720977783203125, -0.0061798095703125, -0.00514984130859375, -0.004119873046875, -0.00308990478515625, -0.0020599365234375, -0.00102996826171875, 0.0, 0.00102996826171875, 0.0020599365234375, 0.00308990478515625, 0.004119873046875, 0.00514984130859375, 0.0061798095703125, 0.00720977783203125, 0.00823974609375, 0.00926971435546875, 0.0102996826171875, 0.01132965087890625, 0.012359619140625, 0.01338958740234375, 0.0144195556640625, 0.01544952392578125, 0.0164794921875, 0.01750946044921875, 0.0185394287109375, 0.01956939697265625, 0.020599365234375, 0.02162933349609375, 0.0226593017578125, 0.02368927001953125, 0.02471923828125, 0.02574920654296875, 0.0267791748046875, 0.02780914306640625, 0.028839111328125, 0.02986907958984375, 0.0308990478515625, 0.03192901611328125, 0.032958984375, 0.03398895263671875, 0.0350189208984375, 0.03604888916015625, 0.037078857421875]}, "gradients/decoder.transformer.h.2.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 5.0, 106.0, 619.0, 262.0, 22.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.005850363057106733, -0.005727712530642748, -0.00560506246984005, -0.005482411943376064, -0.005359761882573366, -0.005237111356109381, -0.005114461295306683, -0.004991810768842697, -0.004869160708039999, -0.0047465101815760136, -0.004623860120773315, -0.00450120959430933, -0.004378559533506632, -0.004255909007042646, -0.004133258946239948, -0.004010608419775963, -0.003887958126142621, -0.0037653078325092793, -0.0036426575388759375, -0.0035200072452425957, -0.003397356951609254, -0.003274706657975912, -0.0031520561315119267, -0.0030294060707092285, -0.002906755544245243, -0.0027841052506119013, -0.0026614549569785595, -0.0025388046633452177, -0.002416154369711876, -0.002293504076078534, -0.0021708537824451923, -0.002048203255981207, -0.0019255531951785088, -0.001802902901545167, -0.0016802526079118252, -0.0015576023142784834, -0.0014349520206451416, -0.0013123017270117998, -0.0011896513169631362, -0.0010670010233297944, -0.0009443507296964526, -0.0008217004360631108, -0.000699050142429769, -0.0005763997905887663, -0.00045374949695542455, -0.00033109920332208276, -0.00020844885148108006, -8.579855784773827e-05, 3.685173578560352e-05, 0.00015950204397086054, 0.00028215235215611756, 0.0004048026748932898, 0.0005274529685266316, 0.0006501032621599734, 0.0007727536140009761, 0.0008954039076343179, 0.0010180542012676597, 0.0011407044949010015, 0.0012633547885343432, 0.0013860051985830069, 0.0015086554922163486, 0.0016313057858496904, 0.0017539560794830322, 0.001876606373116374, 0.001999256666749716]}, "gradients/decoder.transformer.h.2.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 2.0, 4.0, 3.0, 8.0, 8.0, 10.0, 17.0, 9.0, 19.0, 21.0, 19.0, 17.0, 17.0, 20.0, 23.0, 20.0, 35.0, 37.0, 40.0, 27.0, 55.0, 49.0, 45.0, 43.0, 41.0, 38.0, 44.0, 32.0, 27.0, 33.0, 38.0, 31.0, 19.0, 25.0, 19.0, 15.0, 16.0, 13.0, 19.0, 16.0, 5.0, 7.0, 8.0, 6.0, 2.0, 5.0, 2.0, 2.0, 4.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.0007446408271789551, -0.0007225163280963898, -0.0007003918290138245, -0.0006782673299312592, -0.0006561428308486938, -0.0006340183317661285, -0.0006118938326835632, -0.0005897693336009979, -0.0005676448345184326, -0.0005455203354358673, -0.000523395836353302, -0.0005012713372707367, -0.0004791468381881714, -0.0004570223391056061, -0.00043489784002304077, -0.00041277334094047546, -0.00039064884185791016, -0.00036852434277534485, -0.00034639984369277954, -0.00032427534461021423, -0.0003021508455276489, -0.0002800263464450836, -0.0002579018473625183, -0.000235777348279953, -0.0002136528491973877, -0.0001915283501148224, -0.00016940385103225708, -0.00014727935194969177, -0.00012515485286712646, -0.00010303035378456116, -8.090585470199585e-05, -5.878135561943054e-05, -3.6656856536865234e-05, -1.4532357454299927e-05, 7.592141628265381e-06, 2.971664071083069e-05, 5.1841139793395996e-05, 7.39656388759613e-05, 9.609013795852661e-05, 0.00011821463704109192, 0.00014033913612365723, 0.00016246363520622253, 0.00018458813428878784, 0.00020671263337135315, 0.00022883713245391846, 0.00025096163153648376, 0.00027308613061904907, 0.0002952106297016144, 0.0003173351287841797, 0.000339459627866745, 0.0003615841269493103, 0.0003837086260318756, 0.0004058331251144409, 0.0004279576241970062, 0.00045008212327957153, 0.00047220662236213684, 0.0004943311214447021, 0.0005164556205272675, 0.0005385801196098328, 0.0005607046186923981, 0.0005828291177749634, 0.0006049536168575287, 0.000627078115940094, 0.0006492026150226593, 0.0006713271141052246]}, "gradients/decoder.transformer.h.2.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 3.0, 4.0, 4.0, 0.0, 0.0, 2.0, 1.0, 10.0, 6.0, 9.0, 10.0, 15.0, 18.0, 26.0, 25.0, 28.0, 25.0, 25.0, 36.0, 23.0, 26.0, 42.0, 46.0, 35.0, 42.0, 44.0, 52.0, 42.0, 43.0, 40.0, 35.0, 33.0, 37.0, 33.0, 29.0, 22.0, 15.0, 23.0, 24.0, 13.0, 14.0, 10.0, 10.0, 10.0, 7.0, 6.0, 3.0, 4.0, 2.0, 1.0, 2.0, 2.0, 2.0], "bins": [-10.609375, -10.321533203125, -10.03369140625, -9.745849609375, -9.4580078125, -9.170166015625, -8.88232421875, -8.594482421875, -8.306640625, -8.018798828125, -7.73095703125, -7.443115234375, -7.1552734375, -6.867431640625, -6.57958984375, -6.291748046875, -6.00390625, -5.716064453125, -5.42822265625, -5.140380859375, -4.8525390625, -4.564697265625, -4.27685546875, -3.989013671875, -3.701171875, -3.413330078125, -3.12548828125, -2.837646484375, -2.5498046875, -2.261962890625, -1.97412109375, -1.686279296875, -1.3984375, -1.110595703125, -0.82275390625, -0.534912109375, -0.2470703125, 0.040771484375, 0.32861328125, 0.616455078125, 0.904296875, 1.192138671875, 1.47998046875, 1.767822265625, 2.0556640625, 2.343505859375, 2.63134765625, 2.919189453125, 3.20703125, 3.494873046875, 3.78271484375, 4.070556640625, 4.3583984375, 4.646240234375, 4.93408203125, 5.221923828125, 5.509765625, 5.797607421875, 6.08544921875, 6.373291015625, 6.6611328125, 6.948974609375, 7.23681640625, 7.524658203125, 7.8125]}, "gradients/decoder.transformer.h.2.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 2.0, 2.0, 1.0, 1.0, 3.0, 6.0, 13.0, 7.0, 14.0, 19.0, 24.0, 36.0, 45.0, 62.0, 109.0, 162.0, 229.0, 357.0, 531.0, 764.0, 1190.0, 1882.0, 3457.0, 8451.0, 51917.0, 697923.0, 248455.0, 20488.0, 5368.0, 2559.0, 1519.0, 982.0, 677.0, 431.0, 256.0, 171.0, 138.0, 97.0, 63.0, 41.0, 36.0, 26.0, 22.0, 8.0, 6.0, 6.0, 2.0, 4.0, 4.0, 0.0, 2.0, 2.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-21.71875, -20.93701171875, -20.1552734375, -19.37353515625, -18.591796875, -17.81005859375, -17.0283203125, -16.24658203125, -15.46484375, -14.68310546875, -13.9013671875, -13.11962890625, -12.337890625, -11.55615234375, -10.7744140625, -9.99267578125, -9.2109375, -8.42919921875, -7.6474609375, -6.86572265625, -6.083984375, -5.30224609375, -4.5205078125, -3.73876953125, -2.95703125, -2.17529296875, -1.3935546875, -0.61181640625, 0.169921875, 0.95166015625, 1.7333984375, 2.51513671875, 3.296875, 4.07861328125, 4.8603515625, 5.64208984375, 6.423828125, 7.20556640625, 7.9873046875, 8.76904296875, 9.55078125, 10.33251953125, 11.1142578125, 11.89599609375, 12.677734375, 13.45947265625, 14.2412109375, 15.02294921875, 15.8046875, 16.58642578125, 17.3681640625, 18.14990234375, 18.931640625, 19.71337890625, 20.4951171875, 21.27685546875, 22.05859375, 22.84033203125, 23.6220703125, 24.40380859375, 25.185546875, 25.96728515625, 26.7490234375, 27.53076171875, 28.3125]}, "gradients/decoder.transformer.h.2.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 2.0, 2.0, 6.0, 7.0, 18.0, 15.0, 21.0, 47.0, 67.0, 79.0, 98.0, 146.0, 2069.0, 134.0, 82.0, 77.0, 55.0, 37.0, 35.0, 20.0, 20.0, 7.0, 7.0, 4.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-62.375, -60.61572265625, -58.8564453125, -57.09716796875, -55.337890625, -53.57861328125, -51.8193359375, -50.06005859375, -48.30078125, -46.54150390625, -44.7822265625, -43.02294921875, -41.263671875, -39.50439453125, -37.7451171875, -35.98583984375, -34.2265625, -32.46728515625, -30.7080078125, -28.94873046875, -27.189453125, -25.43017578125, -23.6708984375, -21.91162109375, -20.15234375, -18.39306640625, -16.6337890625, -14.87451171875, -13.115234375, -11.35595703125, -9.5966796875, -7.83740234375, -6.078125, -4.31884765625, -2.5595703125, -0.80029296875, 0.958984375, 2.71826171875, 4.4775390625, 6.23681640625, 7.99609375, 9.75537109375, 11.5146484375, 13.27392578125, 15.033203125, 16.79248046875, 18.5517578125, 20.31103515625, 22.0703125, 23.82958984375, 25.5888671875, 27.34814453125, 29.107421875, 30.86669921875, 32.6259765625, 34.38525390625, 36.14453125, 37.90380859375, 39.6630859375, 41.42236328125, 43.181640625, 44.94091796875, 46.7001953125, 48.45947265625, 50.21875]}, "gradients/decoder.transformer.h.2.attn.c_attn.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 4.0, 3.0, 11.0, 14.0, 24.0, 44.0, 51.0, 96.0, 209.0, 403.0, 2138.0, 3139139.0, 2660.0, 458.0, 204.0, 119.0, 50.0, 27.0, 30.0, 16.0, 6.0, 3.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-159.375, -153.791015625, -148.20703125, -142.623046875, -137.0390625, -131.455078125, -125.87109375, -120.287109375, -114.703125, -109.119140625, -103.53515625, -97.951171875, -92.3671875, -86.783203125, -81.19921875, -75.615234375, -70.03125, -64.447265625, -58.86328125, -53.279296875, -47.6953125, -42.111328125, -36.52734375, -30.943359375, -25.359375, -19.775390625, -14.19140625, -8.607421875, -3.0234375, 2.560546875, 8.14453125, 13.728515625, 19.3125, 24.896484375, 30.48046875, 36.064453125, 41.6484375, 47.232421875, 52.81640625, 58.400390625, 63.984375, 69.568359375, 75.15234375, 80.736328125, 86.3203125, 91.904296875, 97.48828125, 103.072265625, 108.65625, 114.240234375, 119.82421875, 125.408203125, 130.9921875, 136.576171875, 142.16015625, 147.744140625, 153.328125, 158.912109375, 164.49609375, 170.080078125, 175.6640625, 181.248046875, 186.83203125, 192.416015625, 198.0]}, "gradients/decoder.transformer.h.2.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 10.0, 41.0, 163.0, 423.0, 293.0, 61.0, 15.0, 7.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-59.99898147583008, -55.90753936767578, -51.816097259521484, -47.72465515136719, -43.633216857910156, -39.541770935058594, -35.45033264160156, -31.358890533447266, -27.26744842529297, -23.176006317138672, -19.084564208984375, -14.993124008178711, -10.901681900024414, -6.810239791870117, -2.718799591064453, 1.3726425170898438, 5.464084625244141, 9.555526733398438, 13.646967887878418, 17.7384090423584, 21.829851150512695, 25.921293258666992, 30.012733459472656, 34.10417556762695, 38.19561767578125, 42.28705978393555, 46.378501892089844, 50.469940185546875, 54.56138610839844, 58.65282440185547, 62.744266510009766, 66.83570861816406, 70.92715454101562, 75.01859283447266, 79.11003875732422, 83.20147705078125, 87.29292297363281, 91.38436126708984, 95.47579956054688, 99.56724548339844, 103.65869140625, 107.75012969970703, 111.8415756225586, 115.93301391601562, 120.02445983886719, 124.11589813232422, 128.20733642578125, 132.2987823486328, 136.39022827148438, 140.48167419433594, 144.57310485839844, 148.66455078125, 152.75599670410156, 156.84744262695312, 160.93887329101562, 165.0303192138672, 169.1217498779297, 173.21319580078125, 177.30462646484375, 181.3960723876953, 185.48751831054688, 189.57896423339844, 193.67039489746094, 197.7618408203125, 201.85328674316406]}, "gradients/decoder.transformer.h.2.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 7.0, 2.0, 2.0, 7.0, 11.0, 8.0, 12.0, 3.0, 17.0, 16.0, 19.0, 21.0, 17.0, 23.0, 30.0, 24.0, 37.0, 46.0, 43.0, 42.0, 36.0, 42.0, 47.0, 34.0, 52.0, 52.0, 52.0, 47.0, 41.0, 30.0, 29.0, 25.0, 20.0, 15.0, 24.0, 18.0, 11.0, 9.0, 12.0, 7.0, 5.0, 3.0, 3.0, 2.0, 3.0, 4.0, 2.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-82.27857971191406, -79.4348373413086, -76.59109497070312, -73.74734497070312, -70.90360260009766, -68.05986022949219, -65.21611785888672, -62.37237548828125, -59.52863311767578, -56.68489074707031, -53.84114456176758, -50.99740219116211, -48.15365982055664, -45.309913635253906, -42.46617126464844, -39.62242889404297, -36.778682708740234, -33.934940338134766, -31.091196060180664, -28.247451782226562, -25.403709411621094, -22.559965133666992, -19.71622085571289, -16.872478485107422, -14.02873420715332, -11.184990882873535, -8.34124755859375, -5.497503280639648, -2.6537599563598633, 0.18998336791992188, 3.0337276458740234, 5.877470016479492, 8.721214294433594, 11.564957618713379, 14.408700942993164, 17.252445220947266, 20.096187591552734, 22.939931869506836, 25.783676147460938, 28.627418518066406, 31.471162796020508, 34.31490707397461, 37.15864944458008, 40.00239562988281, 42.84613800048828, 45.68988037109375, 48.53362274169922, 51.37736511230469, 54.22111129760742, 57.06485366821289, 59.908599853515625, 62.752342224121094, 65.59608459472656, 68.43982696533203, 71.2835693359375, 74.1273193359375, 76.97106170654297, 79.81480407714844, 82.6585464477539, 85.50228881835938, 88.34603881835938, 91.18978118896484, 94.03352355957031, 96.87726593017578, 99.72100830078125]}, "gradients/decoder.transformer.h.1.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 1.0, 0.0, 1.0, 4.0, 4.0, 5.0, 5.0, 8.0, 8.0, 8.0, 22.0, 16.0, 12.0, 26.0, 28.0, 15.0, 32.0, 38.0, 35.0, 34.0, 44.0, 41.0, 35.0, 36.0, 47.0, 44.0, 42.0, 50.0, 42.0, 40.0, 40.0, 35.0, 17.0, 25.0, 16.0, 20.0, 21.0, 20.0, 12.0, 12.0, 15.0, 11.0, 10.0, 9.0, 6.0, 6.0, 5.0, 3.0, 3.0, 3.0, 1.0, 0.0, 1.0, 2.0], "bins": [-10.3984375, -10.1014404296875, -9.804443359375, -9.5074462890625, -9.21044921875, -8.9134521484375, -8.616455078125, -8.3194580078125, -8.0224609375, -7.7254638671875, -7.428466796875, -7.1314697265625, -6.83447265625, -6.5374755859375, -6.240478515625, -5.9434814453125, -5.646484375, -5.3494873046875, -5.052490234375, -4.7554931640625, -4.45849609375, -4.1614990234375, -3.864501953125, -3.5675048828125, -3.2705078125, -2.9735107421875, -2.676513671875, -2.3795166015625, -2.08251953125, -1.7855224609375, -1.488525390625, -1.1915283203125, -0.89453125, -0.5975341796875, -0.300537109375, -0.0035400390625, 0.29345703125, 0.5904541015625, 0.887451171875, 1.1844482421875, 1.4814453125, 1.7784423828125, 2.075439453125, 2.3724365234375, 2.66943359375, 2.9664306640625, 3.263427734375, 3.5604248046875, 3.857421875, 4.1544189453125, 4.451416015625, 4.7484130859375, 5.04541015625, 5.3424072265625, 5.639404296875, 5.9364013671875, 6.2333984375, 6.5303955078125, 6.827392578125, 7.1243896484375, 7.42138671875, 7.7183837890625, 8.015380859375, 8.3123779296875, 8.609375]}, "gradients/decoder.transformer.h.1.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 4.0, 0.0, 1.0, 4.0, 2.0, 8.0, 4.0, 10.0, 9.0, 9.0, 19.0, 22.0, 36.0, 55.0, 65.0, 100.0, 130.0, 182.0, 260.0, 373.0, 580.0, 837.0, 1510.0, 2701.0, 5712.0, 13599.0, 40985.0, 188425.0, 868882.0, 1925738.0, 882824.0, 189501.0, 44176.0, 14526.0, 5859.0, 2897.0, 1533.0, 924.0, 551.0, 379.0, 255.0, 182.0, 111.0, 83.0, 64.0, 45.0, 36.0, 25.0, 18.0, 12.0, 12.0, 7.0, 4.0, 5.0, 1.0, 3.0, 2.0, 2.0, 1.0, 1.0, 2.0], "bins": [-13.53125, -13.114990234375, -12.69873046875, -12.282470703125, -11.8662109375, -11.449951171875, -11.03369140625, -10.617431640625, -10.201171875, -9.784912109375, -9.36865234375, -8.952392578125, -8.5361328125, -8.119873046875, -7.70361328125, -7.287353515625, -6.87109375, -6.454833984375, -6.03857421875, -5.622314453125, -5.2060546875, -4.789794921875, -4.37353515625, -3.957275390625, -3.541015625, -3.124755859375, -2.70849609375, -2.292236328125, -1.8759765625, -1.459716796875, -1.04345703125, -0.627197265625, -0.2109375, 0.205322265625, 0.62158203125, 1.037841796875, 1.4541015625, 1.870361328125, 2.28662109375, 2.702880859375, 3.119140625, 3.535400390625, 3.95166015625, 4.367919921875, 4.7841796875, 5.200439453125, 5.61669921875, 6.032958984375, 6.44921875, 6.865478515625, 7.28173828125, 7.697998046875, 8.1142578125, 8.530517578125, 8.94677734375, 9.363037109375, 9.779296875, 10.195556640625, 10.61181640625, 11.028076171875, 11.4443359375, 11.860595703125, 12.27685546875, 12.693115234375, 13.109375]}, "gradients/decoder.transformer.h.1.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 4.0, 0.0, 5.0, 8.0, 7.0, 8.0, 10.0, 16.0, 17.0, 34.0, 55.0, 87.0, 135.0, 165.0, 230.0, 308.0, 479.0, 625.0, 609.0, 424.0, 247.0, 190.0, 141.0, 72.0, 61.0, 32.0, 24.0, 20.0, 18.0, 16.0, 5.0, 6.0, 6.0, 4.0, 5.0, 3.0, 3.0, 2.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-17.328125, -16.6865234375, -16.044921875, -15.4033203125, -14.76171875, -14.1201171875, -13.478515625, -12.8369140625, -12.1953125, -11.5537109375, -10.912109375, -10.2705078125, -9.62890625, -8.9873046875, -8.345703125, -7.7041015625, -7.0625, -6.4208984375, -5.779296875, -5.1376953125, -4.49609375, -3.8544921875, -3.212890625, -2.5712890625, -1.9296875, -1.2880859375, -0.646484375, -0.0048828125, 0.63671875, 1.2783203125, 1.919921875, 2.5615234375, 3.203125, 3.8447265625, 4.486328125, 5.1279296875, 5.76953125, 6.4111328125, 7.052734375, 7.6943359375, 8.3359375, 8.9775390625, 9.619140625, 10.2607421875, 10.90234375, 11.5439453125, 12.185546875, 12.8271484375, 13.46875, 14.1103515625, 14.751953125, 15.3935546875, 16.03515625, 16.6767578125, 17.318359375, 17.9599609375, 18.6015625, 19.2431640625, 19.884765625, 20.5263671875, 21.16796875, 21.8095703125, 22.451171875, 23.0927734375, 23.734375]}, "gradients/decoder.transformer.h.1.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 6.0, 0.0, 4.0, 1.0, 2.0, 1.0, 1.0, 3.0, 8.0, 2.0, 10.0, 12.0, 28.0, 32.0, 54.0, 94.0, 175.0, 385.0, 1039.0, 4075.0, 3819862.0, 364066.0, 2782.0, 834.0, 349.0, 189.0, 104.0, 50.0, 44.0, 26.0, 19.0, 8.0, 5.0, 6.0, 8.0, 5.0, 3.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-98.375, -95.6337890625, -92.892578125, -90.1513671875, -87.41015625, -84.6689453125, -81.927734375, -79.1865234375, -76.4453125, -73.7041015625, -70.962890625, -68.2216796875, -65.48046875, -62.7392578125, -59.998046875, -57.2568359375, -54.515625, -51.7744140625, -49.033203125, -46.2919921875, -43.55078125, -40.8095703125, -38.068359375, -35.3271484375, -32.5859375, -29.8447265625, -27.103515625, -24.3623046875, -21.62109375, -18.8798828125, -16.138671875, -13.3974609375, -10.65625, -7.9150390625, -5.173828125, -2.4326171875, 0.30859375, 3.0498046875, 5.791015625, 8.5322265625, 11.2734375, 14.0146484375, 16.755859375, 19.4970703125, 22.23828125, 24.9794921875, 27.720703125, 30.4619140625, 33.203125, 35.9443359375, 38.685546875, 41.4267578125, 44.16796875, 46.9091796875, 49.650390625, 52.3916015625, 55.1328125, 57.8740234375, 60.615234375, 63.3564453125, 66.09765625, 68.8388671875, 71.580078125, 74.3212890625, 77.0625]}, "gradients/decoder.transformer.h.1.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 13.0, 300.0, 608.0, 92.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-264.47296142578125, -254.75198364257812, -245.031005859375, -235.3100128173828, -225.5890350341797, -215.86805725097656, -206.14706420898438, -196.42608642578125, -186.70510864257812, -176.984130859375, -167.26315307617188, -157.5421600341797, -147.82118225097656, -138.10020446777344, -128.37921142578125, -118.65823364257812, -108.937255859375, -99.21627807617188, -89.49529266357422, -79.77430725097656, -70.05332946777344, -60.33234786987305, -50.611366271972656, -40.890380859375, -31.169403076171875, -21.448421478271484, -11.727439880371094, -2.006458282470703, 7.7145233154296875, 17.435504913330078, 27.15648651123047, 36.877471923828125, 46.598480224609375, 56.319461822509766, 66.04044342041016, 75.76142883300781, 85.48240661621094, 95.20338439941406, 104.92436981201172, 114.64535522460938, 124.3663330078125, 134.08731079101562, 143.80828857421875, 153.52928161621094, 163.25025939941406, 172.9712371826172, 182.69223022460938, 192.4132080078125, 202.13418579101562, 211.85516357421875, 221.57614135742188, 231.29713439941406, 241.0181121826172, 250.7390899658203, 260.4600830078125, 270.1810607910156, 279.90203857421875, 289.6230163574219, 299.343994140625, 309.0649719238281, 318.78594970703125, 328.5069580078125, 338.2279357910156, 347.94891357421875, 357.6698913574219]}, "gradients/decoder.transformer.h.1.ln_2.bias": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 5.0, 6.0, 2.0, 1.0, 1.0, 6.0, 10.0, 5.0, 13.0, 13.0, 12.0, 18.0, 17.0, 20.0, 24.0, 20.0, 27.0, 33.0, 36.0, 36.0, 36.0, 31.0, 45.0, 48.0, 56.0, 43.0, 36.0, 39.0, 49.0, 34.0, 49.0, 35.0, 16.0, 28.0, 28.0, 19.0, 14.0, 20.0, 12.0, 16.0, 10.0, 12.0, 7.0, 8.0, 3.0, 3.0, 5.0, 1.0, 0.0, 0.0, 3.0, 1.0, 1.0, 2.0, 0.0, 0.0, 2.0], "bins": [-64.44195556640625, -62.45415115356445, -60.46634292602539, -58.478538513183594, -56.4907341003418, -54.5029296875, -52.51512145996094, -50.52731704711914, -48.539512634277344, -46.55170822143555, -44.563899993896484, -42.57609558105469, -40.58829116821289, -38.600486755371094, -36.61267852783203, -34.624874114990234, -32.63706588745117, -30.649259567260742, -28.661455154418945, -26.673648834228516, -24.68584442138672, -22.69803810119629, -20.71023178100586, -18.722427368164062, -16.734621047973633, -14.74681568145752, -12.759010314941406, -10.771203994750977, -8.783398628234863, -6.79559326171875, -4.80778694152832, -2.819981575012207, -0.8321762084960938, 1.1556293964385986, 3.143435001373291, 5.1312408447265625, 7.119046211242676, 9.106851577758789, 11.094657897949219, 13.082463264465332, 15.070268630981445, 17.058074951171875, 19.045879364013672, 21.0336856842041, 23.02149200439453, 25.009296417236328, 26.997102737426758, 28.984909057617188, 30.972713470458984, 32.96051788330078, 34.948326110839844, 36.93613052368164, 38.92393493652344, 40.9117431640625, 42.8995475769043, 44.887351989746094, 46.875160217285156, 48.86296463012695, 50.850772857666016, 52.83857727050781, 54.82638168334961, 56.814186096191406, 58.80199432373047, 60.789798736572266, 62.77760314941406]}, "gradients/decoder.transformer.h.1.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 3.0, 3.0, 2.0, 3.0, 8.0, 8.0, 2.0, 2.0, 11.0, 11.0, 13.0, 11.0, 12.0, 17.0, 16.0, 24.0, 38.0, 34.0, 32.0, 33.0, 48.0, 35.0, 57.0, 42.0, 44.0, 41.0, 45.0, 36.0, 37.0, 38.0, 37.0, 31.0, 38.0, 24.0, 28.0, 31.0, 17.0, 13.0, 14.0, 13.0, 5.0, 10.0, 10.0, 9.0, 6.0, 7.0, 3.0, 2.0, 4.0, 1.0, 1.0, 3.0, 2.0, 0.0, 0.0, 2.0, 1.0], "bins": [-8.7890625, -8.5164794921875, -8.243896484375, -7.9713134765625, -7.69873046875, -7.4261474609375, -7.153564453125, -6.8809814453125, -6.6083984375, -6.3358154296875, -6.063232421875, -5.7906494140625, -5.51806640625, -5.2454833984375, -4.972900390625, -4.7003173828125, -4.427734375, -4.1551513671875, -3.882568359375, -3.6099853515625, -3.33740234375, -3.0648193359375, -2.792236328125, -2.5196533203125, -2.2470703125, -1.9744873046875, -1.701904296875, -1.4293212890625, -1.15673828125, -0.8841552734375, -0.611572265625, -0.3389892578125, -0.06640625, 0.2061767578125, 0.478759765625, 0.7513427734375, 1.02392578125, 1.2965087890625, 1.569091796875, 1.8416748046875, 2.1142578125, 2.3868408203125, 2.659423828125, 2.9320068359375, 3.20458984375, 3.4771728515625, 3.749755859375, 4.0223388671875, 4.294921875, 4.5675048828125, 4.840087890625, 5.1126708984375, 5.38525390625, 5.6578369140625, 5.930419921875, 6.2030029296875, 6.4755859375, 6.7481689453125, 7.020751953125, 7.2933349609375, 7.56591796875, 7.8385009765625, 8.111083984375, 8.3836669921875, 8.65625]}, "gradients/decoder.transformer.h.1.crossattention.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 3.0, 5.0, 3.0, 5.0, 14.0, 17.0, 25.0, 47.0, 55.0, 71.0, 120.0, 174.0, 240.0, 327.0, 525.0, 750.0, 981.0, 1489.0, 2020.0, 3014.0, 4233.0, 6227.0, 8739.0, 13588.0, 20327.0, 32056.0, 52173.0, 90075.0, 192675.0, 314154.0, 122881.0, 66152.0, 39748.0, 25036.0, 16036.0, 10772.0, 7291.0, 4865.0, 3557.0, 2360.0, 1787.0, 1204.0, 810.0, 606.0, 401.0, 282.0, 208.0, 146.0, 101.0, 60.0, 41.0, 38.0, 20.0, 12.0, 6.0, 10.0, 4.0, 2.0, 3.0, 1.0, 1.0], "bins": [-2.076171875, -2.011749267578125, -1.94732666015625, -1.882904052734375, -1.8184814453125, -1.754058837890625, -1.68963623046875, -1.625213623046875, -1.560791015625, -1.496368408203125, -1.43194580078125, -1.367523193359375, -1.3031005859375, -1.238677978515625, -1.17425537109375, -1.109832763671875, -1.04541015625, -0.980987548828125, -0.91656494140625, -0.852142333984375, -0.7877197265625, -0.723297119140625, -0.65887451171875, -0.594451904296875, -0.530029296875, -0.465606689453125, -0.40118408203125, -0.336761474609375, -0.2723388671875, -0.207916259765625, -0.14349365234375, -0.079071044921875, -0.0146484375, 0.049774169921875, 0.11419677734375, 0.178619384765625, 0.2430419921875, 0.307464599609375, 0.37188720703125, 0.436309814453125, 0.500732421875, 0.565155029296875, 0.62957763671875, 0.694000244140625, 0.7584228515625, 0.822845458984375, 0.88726806640625, 0.951690673828125, 1.01611328125, 1.080535888671875, 1.14495849609375, 1.209381103515625, 1.2738037109375, 1.338226318359375, 1.40264892578125, 1.467071533203125, 1.531494140625, 1.595916748046875, 1.66033935546875, 1.724761962890625, 1.7891845703125, 1.853607177734375, 1.91802978515625, 1.982452392578125, 2.046875]}, "gradients/decoder.transformer.h.1.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 2.0, 6.0, 6.0, 7.0, 10.0, 12.0, 17.0, 17.0, 19.0, 19.0, 17.0, 20.0, 25.0, 40.0, 22.0, 34.0, 44.0, 42.0, 39.0, 47.0, 35.0, 49.0, 1057.0, 45.0, 29.0, 40.0, 37.0, 30.0, 35.0, 34.0, 29.0, 30.0, 18.0, 19.0, 28.0, 16.0, 14.0, 8.0, 11.0, 6.0, 3.0, 5.0, 4.0, 3.0, 4.0, 1.0, 5.0], "bins": [-7.08984375, -6.90850830078125, -6.7271728515625, -6.54583740234375, -6.364501953125, -6.18316650390625, -6.0018310546875, -5.82049560546875, -5.63916015625, -5.45782470703125, -5.2764892578125, -5.09515380859375, -4.913818359375, -4.73248291015625, -4.5511474609375, -4.36981201171875, -4.1884765625, -4.00714111328125, -3.8258056640625, -3.64447021484375, -3.463134765625, -3.28179931640625, -3.1004638671875, -2.91912841796875, -2.73779296875, -2.55645751953125, -2.3751220703125, -2.19378662109375, -2.012451171875, -1.83111572265625, -1.6497802734375, -1.46844482421875, -1.287109375, -1.10577392578125, -0.9244384765625, -0.74310302734375, -0.561767578125, -0.38043212890625, -0.1990966796875, -0.01776123046875, 0.16357421875, 0.34490966796875, 0.5262451171875, 0.70758056640625, 0.888916015625, 1.07025146484375, 1.2515869140625, 1.43292236328125, 1.6142578125, 1.79559326171875, 1.9769287109375, 2.15826416015625, 2.339599609375, 2.52093505859375, 2.7022705078125, 2.88360595703125, 3.06494140625, 3.24627685546875, 3.4276123046875, 3.60894775390625, 3.790283203125, 3.97161865234375, 4.1529541015625, 4.33428955078125, 4.515625]}, "gradients/decoder.transformer.h.1.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 6.0, 0.0, 5.0, 4.0, 8.0, 7.0, 11.0, 12.0, 22.0, 35.0, 50.0, 94.0, 131.0, 255.0, 441.0, 835.0, 1553.0, 2873.0, 5477.0, 10800.0, 21132.0, 43990.0, 94520.0, 241702.0, 1440655.0, 122565.0, 55625.0, 26664.0, 13124.0, 6908.0, 3563.0, 1866.0, 960.0, 490.0, 292.0, 172.0, 103.0, 58.0, 50.0, 26.0, 20.0, 15.0, 10.0, 6.0, 6.0, 1.0, 3.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.708984375, -2.605438232421875, -2.50189208984375, -2.398345947265625, -2.2947998046875, -2.191253662109375, -2.08770751953125, -1.984161376953125, -1.880615234375, -1.777069091796875, -1.67352294921875, -1.569976806640625, -1.4664306640625, -1.362884521484375, -1.25933837890625, -1.155792236328125, -1.05224609375, -0.948699951171875, -0.84515380859375, -0.741607666015625, -0.6380615234375, -0.534515380859375, -0.43096923828125, -0.327423095703125, -0.223876953125, -0.120330810546875, -0.01678466796875, 0.086761474609375, 0.1903076171875, 0.293853759765625, 0.39739990234375, 0.500946044921875, 0.6044921875, 0.708038330078125, 0.81158447265625, 0.915130615234375, 1.0186767578125, 1.122222900390625, 1.22576904296875, 1.329315185546875, 1.432861328125, 1.536407470703125, 1.63995361328125, 1.743499755859375, 1.8470458984375, 1.950592041015625, 2.05413818359375, 2.157684326171875, 2.26123046875, 2.364776611328125, 2.46832275390625, 2.571868896484375, 2.6754150390625, 2.778961181640625, 2.88250732421875, 2.986053466796875, 3.089599609375, 3.193145751953125, 3.29669189453125, 3.400238037109375, 3.5037841796875, 3.607330322265625, 3.71087646484375, 3.814422607421875, 3.91796875]}, "gradients/decoder.transformer.h.1.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 4.0, 0.0, 3.0, 5.0, 5.0, 8.0, 9.0, 15.0, 19.0, 36.0, 43.0, 61.0, 111.0, 162.0, 173.0, 124.0, 81.0, 38.0, 30.0, 27.0, 16.0, 7.0, 9.0, 8.0, 3.0, 2.0, 1.0, 5.0, 2.0, 3.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.002452850341796875, -0.0023684799671173096, -0.002284109592437744, -0.0021997392177581787, -0.0021153688430786133, -0.002030998468399048, -0.0019466280937194824, -0.001862257719039917, -0.0017778873443603516, -0.0016935169696807861, -0.0016091465950012207, -0.0015247762203216553, -0.0014404058456420898, -0.0013560354709625244, -0.001271665096282959, -0.0011872947216033936, -0.0011029243469238281, -0.0010185539722442627, -0.0009341835975646973, -0.0008498132228851318, -0.0007654428482055664, -0.000681072473526001, -0.0005967020988464355, -0.0005123317241668701, -0.0004279613494873047, -0.00034359097480773926, -0.00025922060012817383, -0.0001748502254486084, -9.047985076904297e-05, -6.109476089477539e-06, 7.826089859008789e-05, 0.00016263127326965332, 0.00024700164794921875, 0.0003313720226287842, 0.0004157423973083496, 0.000500112771987915, 0.0005844831466674805, 0.0006688535213470459, 0.0007532238960266113, 0.0008375942707061768, 0.0009219646453857422, 0.0010063350200653076, 0.001090705394744873, 0.0011750757694244385, 0.001259446144104004, 0.0013438165187835693, 0.0014281868934631348, 0.0015125572681427002, 0.0015969276428222656, 0.001681298017501831, 0.0017656683921813965, 0.001850038766860962, 0.0019344091415405273, 0.0020187795162200928, 0.002103149890899658, 0.0021875202655792236, 0.002271890640258789, 0.0023562610149383545, 0.00244063138961792, 0.0025250017642974854, 0.0026093721389770508, 0.002693742513656616, 0.0027781128883361816, 0.002862483263015747, 0.0029468536376953125]}, "gradients/decoder.transformer.h.1.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 0.0, 2.0, 5.0, 0.0, 3.0, 6.0, 3.0, 8.0, 9.0, 13.0, 12.0, 19.0, 28.0, 31.0, 46.0, 78.0, 121.0, 228.0, 412.0, 1120.0, 1026064.0, 18773.0, 766.0, 315.0, 191.0, 91.0, 68.0, 39.0, 32.0, 19.0, 17.0, 10.0, 7.0, 10.0, 3.0, 3.0, 5.0, 2.0, 2.0, 2.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.04351806640625, -0.04226875305175781, -0.041019439697265625, -0.03977012634277344, -0.03852081298828125, -0.03727149963378906, -0.036022186279296875, -0.03477287292480469, -0.0335235595703125, -0.03227424621582031, -0.031024932861328125, -0.029775619506835938, -0.02852630615234375, -0.027276992797851562, -0.026027679443359375, -0.024778366088867188, -0.023529052734375, -0.022279739379882812, -0.021030426025390625, -0.019781112670898438, -0.01853179931640625, -0.017282485961914062, -0.016033172607421875, -0.014783859252929688, -0.0135345458984375, -0.012285232543945312, -0.011035919189453125, -0.009786605834960938, -0.00853729248046875, -0.0072879791259765625, -0.006038665771484375, -0.0047893524169921875, -0.0035400390625, -0.0022907257080078125, -0.001041412353515625, 0.0002079010009765625, 0.00145721435546875, 0.0027065277099609375, 0.003955841064453125, 0.0052051544189453125, 0.0064544677734375, 0.0077037811279296875, 0.008953094482421875, 0.010202407836914062, 0.01145172119140625, 0.012701034545898438, 0.013950347900390625, 0.015199661254882812, 0.016448974609375, 0.017698287963867188, 0.018947601318359375, 0.020196914672851562, 0.02144622802734375, 0.022695541381835938, 0.023944854736328125, 0.025194168090820312, 0.0264434814453125, 0.027692794799804688, 0.028942108154296875, 0.030191421508789062, 0.03144073486328125, 0.03269004821777344, 0.033939361572265625, 0.03518867492675781, 0.03643798828125]}, "gradients/decoder.transformer.h.1.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 1.0, 0.0, 2.0, 2.0, 3.0, 26.0, 140.0, 406.0, 334.0, 78.0, 22.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0005947514437139034, -0.0005399063811637461, -0.0004850612604059279, -0.0004302161978557706, -0.0003753711062017828, -0.00032052601454779506, -0.00026568095199763775, -0.00021083586034364998, -0.00015599076868966222, -0.00010114568431163207, -4.6300599933601916e-05, 8.544477168470621e-06, 6.338956882245839e-05, 0.00011823466047644615, 0.00017307972302660346, 0.00022792481468059123, 0.000282769906334579, 0.00033761499798856676, 0.0003924600896425545, 0.00044730515219271183, 0.00050215027295053, 0.0005569953355006874, 0.0006118403980508447, 0.0006666855188086629, 0.0007215305813588202, 0.0007763756439089775, 0.0008312207646667957, 0.000886065827216953, 0.0009409108897671103, 0.0009957560105249286, 0.001050601014867425, 0.0011054461356252432, 0.0011602912563830614, 0.0012151363771408796, 0.001269981381483376, 0.0013248265022411942, 0.0013796716229990125, 0.0014345166273415089, 0.001489361748099327, 0.0015442068688571453, 0.0015990519896149635, 0.0016538971103727818, 0.0017087421147152781, 0.0017635872354730964, 0.0018184323562309146, 0.001873277360573411, 0.0019281224813312292, 0.0019829676020890474, 0.002037812490016222, 0.0020926576107740402, 0.0021475027315318584, 0.0022023478522896767, 0.0022571927402168512, 0.0023120378609746695, 0.0023668829817324877, 0.002421728102490306, 0.002476573223248124, 0.0025314183440059423, 0.0025862634647637606, 0.002641108352690935, 0.0026959534734487534, 0.0027507985942065716, 0.00280564371496439, 0.002860488835722208, 0.0029153339564800262]}, "gradients/decoder.transformer.h.1.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 2.0, 1.0, 0.0, 1.0, 2.0, 1.0, 4.0, 3.0, 6.0, 5.0, 5.0, 9.0, 10.0, 4.0, 13.0, 16.0, 10.0, 16.0, 16.0, 20.0, 29.0, 23.0, 26.0, 26.0, 24.0, 28.0, 35.0, 29.0, 37.0, 32.0, 35.0, 37.0, 40.0, 42.0, 40.0, 41.0, 27.0, 25.0, 31.0, 33.0, 18.0, 25.0, 14.0, 26.0, 20.0, 22.0, 19.0, 9.0, 6.0, 17.0, 9.0, 11.0, 9.0, 9.0, 8.0, 3.0, 6.0, 1.0, 1.0, 1.0, 0.0, 3.0], "bins": [-0.0007615089416503906, -0.0007392885163426399, -0.0007170680910348892, -0.0006948476657271385, -0.0006726272404193878, -0.0006504068151116371, -0.0006281863898038864, -0.0006059659644961357, -0.000583745539188385, -0.0005615251138806343, -0.0005393046885728836, -0.0005170842632651329, -0.0004948638379573822, -0.0004726434126496315, -0.0004504229873418808, -0.0004282025620341301, -0.0004059821367263794, -0.0003837617114186287, -0.000361541286110878, -0.0003393208608031273, -0.0003171004354953766, -0.0002948800101876259, -0.0002726595848798752, -0.0002504391595721245, -0.00022821873426437378, -0.00020599830895662308, -0.00018377788364887238, -0.00016155745834112167, -0.00013933703303337097, -0.00011711660772562027, -9.489618241786957e-05, -7.267575711011887e-05, -5.0455331802368164e-05, -2.8234906494617462e-05, -6.01448118686676e-06, 1.620594412088394e-05, 3.8426369428634644e-05, 6.0646794736385345e-05, 8.286722004413605e-05, 0.00010508764535188675, 0.00012730807065963745, 0.00014952849596738815, 0.00017174892127513885, 0.00019396934658288956, 0.00021618977189064026, 0.00023841019719839096, 0.00026063062250614166, 0.00028285104781389236, 0.00030507147312164307, 0.00032729189842939377, 0.00034951232373714447, 0.00037173274904489517, 0.0003939531743526459, 0.0004161735996603966, 0.0004383940249681473, 0.000460614450275898, 0.0004828348755836487, 0.0005050553008913994, 0.0005272757261991501, 0.0005494961515069008, 0.0005717165768146515, 0.0005939370021224022, 0.0006161574274301529, 0.0006383778527379036, 0.0006605982780456543]}, "gradients/decoder.transformer.h.1.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 3.0, 3.0, 2.0, 3.0, 8.0, 8.0, 2.0, 2.0, 11.0, 11.0, 13.0, 11.0, 12.0, 17.0, 16.0, 24.0, 38.0, 34.0, 32.0, 33.0, 48.0, 35.0, 57.0, 42.0, 44.0, 41.0, 45.0, 36.0, 37.0, 38.0, 37.0, 31.0, 38.0, 24.0, 28.0, 31.0, 17.0, 13.0, 14.0, 13.0, 5.0, 10.0, 10.0, 9.0, 6.0, 7.0, 3.0, 2.0, 4.0, 1.0, 1.0, 3.0, 2.0, 0.0, 0.0, 2.0, 1.0], "bins": [-8.7890625, -8.5164794921875, -8.243896484375, -7.9713134765625, -7.69873046875, -7.4261474609375, -7.153564453125, -6.8809814453125, -6.6083984375, -6.3358154296875, -6.063232421875, -5.7906494140625, -5.51806640625, -5.2454833984375, -4.972900390625, -4.7003173828125, -4.427734375, -4.1551513671875, -3.882568359375, -3.6099853515625, -3.33740234375, -3.0648193359375, -2.792236328125, -2.5196533203125, -2.2470703125, -1.9744873046875, -1.701904296875, -1.4293212890625, -1.15673828125, -0.8841552734375, -0.611572265625, -0.3389892578125, -0.06640625, 0.2061767578125, 0.478759765625, 0.7513427734375, 1.02392578125, 1.2965087890625, 1.569091796875, 1.8416748046875, 2.1142578125, 2.3868408203125, 2.659423828125, 2.9320068359375, 3.20458984375, 3.4771728515625, 3.749755859375, 4.0223388671875, 4.294921875, 4.5675048828125, 4.840087890625, 5.1126708984375, 5.38525390625, 5.6578369140625, 5.930419921875, 6.2030029296875, 6.4755859375, 6.7481689453125, 7.020751953125, 7.2933349609375, 7.56591796875, 7.8385009765625, 8.111083984375, 8.3836669921875, 8.65625]}, "gradients/decoder.transformer.h.1.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 1.0, 0.0, 1.0, 4.0, 3.0, 2.0, 4.0, 3.0, 8.0, 9.0, 13.0, 14.0, 14.0, 22.0, 37.0, 35.0, 61.0, 83.0, 106.0, 161.0, 240.0, 351.0, 523.0, 939.0, 1654.0, 3657.0, 8639.0, 27392.0, 137541.0, 695045.0, 129227.0, 26572.0, 8483.0, 3536.0, 1599.0, 881.0, 565.0, 330.0, 224.0, 162.0, 116.0, 77.0, 57.0, 43.0, 27.0, 28.0, 19.0, 15.0, 8.0, 10.0, 8.0, 3.0, 6.0, 3.0, 2.0, 4.0, 3.0, 1.0, 0.0, 0.0, 2.0], "bins": [-18.296875, -17.733642578125, -17.17041015625, -16.607177734375, -16.0439453125, -15.480712890625, -14.91748046875, -14.354248046875, -13.791015625, -13.227783203125, -12.66455078125, -12.101318359375, -11.5380859375, -10.974853515625, -10.41162109375, -9.848388671875, -9.28515625, -8.721923828125, -8.15869140625, -7.595458984375, -7.0322265625, -6.468994140625, -5.90576171875, -5.342529296875, -4.779296875, -4.216064453125, -3.65283203125, -3.089599609375, -2.5263671875, -1.963134765625, -1.39990234375, -0.836669921875, -0.2734375, 0.289794921875, 0.85302734375, 1.416259765625, 1.9794921875, 2.542724609375, 3.10595703125, 3.669189453125, 4.232421875, 4.795654296875, 5.35888671875, 5.922119140625, 6.4853515625, 7.048583984375, 7.61181640625, 8.175048828125, 8.73828125, 9.301513671875, 9.86474609375, 10.427978515625, 10.9912109375, 11.554443359375, 12.11767578125, 12.680908203125, 13.244140625, 13.807373046875, 14.37060546875, 14.933837890625, 15.4970703125, 16.060302734375, 16.62353515625, 17.186767578125, 17.75]}, "gradients/decoder.transformer.h.1.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 2.0, 5.0, 5.0, 12.0, 8.0, 4.0, 11.0, 8.0, 16.0, 31.0, 28.0, 26.0, 45.0, 51.0, 45.0, 57.0, 69.0, 161.0, 1980.0, 101.0, 58.0, 60.0, 63.0, 35.0, 40.0, 29.0, 24.0, 11.0, 16.0, 11.0, 14.0, 5.0, 8.0, 3.0, 2.0, 4.0, 2.0, 3.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-32.28125, -31.34326171875, -30.4052734375, -29.46728515625, -28.529296875, -27.59130859375, -26.6533203125, -25.71533203125, -24.77734375, -23.83935546875, -22.9013671875, -21.96337890625, -21.025390625, -20.08740234375, -19.1494140625, -18.21142578125, -17.2734375, -16.33544921875, -15.3974609375, -14.45947265625, -13.521484375, -12.58349609375, -11.6455078125, -10.70751953125, -9.76953125, -8.83154296875, -7.8935546875, -6.95556640625, -6.017578125, -5.07958984375, -4.1416015625, -3.20361328125, -2.265625, -1.32763671875, -0.3896484375, 0.54833984375, 1.486328125, 2.42431640625, 3.3623046875, 4.30029296875, 5.23828125, 6.17626953125, 7.1142578125, 8.05224609375, 8.990234375, 9.92822265625, 10.8662109375, 11.80419921875, 12.7421875, 13.68017578125, 14.6181640625, 15.55615234375, 16.494140625, 17.43212890625, 18.3701171875, 19.30810546875, 20.24609375, 21.18408203125, 22.1220703125, 23.06005859375, 23.998046875, 24.93603515625, 25.8740234375, 26.81201171875, 27.75]}, "gradients/decoder.transformer.h.1.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 2.0, 0.0, 1.0, 5.0, 2.0, 4.0, 5.0, 4.0, 11.0, 10.0, 7.0, 16.0, 22.0, 24.0, 47.0, 67.0, 119.0, 187.0, 376.0, 954.0, 45461.0, 3095662.0, 1564.0, 515.0, 244.0, 130.0, 70.0, 53.0, 35.0, 23.0, 26.0, 18.0, 12.0, 10.0, 7.0, 4.0, 6.0, 6.0, 2.0, 1.0, 3.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-87.625, -84.7197265625, -81.814453125, -78.9091796875, -76.00390625, -73.0986328125, -70.193359375, -67.2880859375, -64.3828125, -61.4775390625, -58.572265625, -55.6669921875, -52.76171875, -49.8564453125, -46.951171875, -44.0458984375, -41.140625, -38.2353515625, -35.330078125, -32.4248046875, -29.51953125, -26.6142578125, -23.708984375, -20.8037109375, -17.8984375, -14.9931640625, -12.087890625, -9.1826171875, -6.27734375, -3.3720703125, -0.466796875, 2.4384765625, 5.34375, 8.2490234375, 11.154296875, 14.0595703125, 16.96484375, 19.8701171875, 22.775390625, 25.6806640625, 28.5859375, 31.4912109375, 34.396484375, 37.3017578125, 40.20703125, 43.1123046875, 46.017578125, 48.9228515625, 51.828125, 54.7333984375, 57.638671875, 60.5439453125, 63.44921875, 66.3544921875, 69.259765625, 72.1650390625, 75.0703125, 77.9755859375, 80.880859375, 83.7861328125, 86.69140625, 89.5966796875, 92.501953125, 95.4072265625, 98.3125]}, "gradients/decoder.transformer.h.1.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 3.0, 6.0, 79.0, 815.0, 107.0, 8.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-38.51272201538086, -33.41378402709961, -28.314842224121094, -23.215904235839844, -18.11696434020996, -13.018024444580078, -7.919086456298828, -2.8201446533203125, 2.2787933349609375, 7.377732753753662, 12.476672172546387, 17.575611114501953, 22.674551010131836, 27.77349090576172, 32.87242889404297, 37.971370697021484, 43.070308685302734, 48.169246673583984, 53.2681884765625, 58.36712646484375, 63.466064453125, 68.56500244140625, 73.6639404296875, 78.76288604736328, 83.86182403564453, 88.96076202392578, 94.05970001220703, 99.15864562988281, 104.25758361816406, 109.35652160644531, 114.45545959472656, 119.55439758300781, 124.65333557128906, 129.7522735595703, 134.85121154785156, 139.9501495361328, 145.04908752441406, 150.14804077148438, 155.24697875976562, 160.34591674804688, 165.44485473632812, 170.54379272460938, 175.64273071289062, 180.74166870117188, 185.84060668945312, 190.93954467773438, 196.03848266601562, 201.13743591308594, 206.23635864257812, 211.33529663085938, 216.43423461914062, 221.53317260742188, 226.63211059570312, 231.73104858398438, 236.82998657226562, 241.92893981933594, 247.0278778076172, 252.12681579589844, 257.22576904296875, 262.32470703125, 267.42364501953125, 272.5225830078125, 277.62152099609375, 282.720458984375, 287.81939697265625]}, "gradients/decoder.transformer.h.1.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 2.0, 1.0, 3.0, 4.0, 7.0, 5.0, 5.0, 7.0, 9.0, 10.0, 12.0, 20.0, 14.0, 19.0, 26.0, 34.0, 37.0, 37.0, 41.0, 46.0, 56.0, 54.0, 35.0, 55.0, 46.0, 49.0, 40.0, 47.0, 45.0, 36.0, 25.0, 26.0, 26.0, 21.0, 20.0, 21.0, 19.0, 12.0, 17.0, 6.0, 3.0, 3.0, 2.0, 3.0, 5.0, 2.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-57.683353424072266, -55.71160125732422, -53.73984909057617, -51.768096923828125, -49.79634475708008, -47.82459259033203, -45.85283660888672, -43.88108825683594, -41.909332275390625, -39.93758010864258, -37.96582794189453, -35.994075775146484, -34.02232360839844, -32.05057144165039, -30.07881736755371, -28.107065200805664, -26.13531494140625, -24.163562774658203, -22.191810607910156, -20.22005844116211, -18.248306274414062, -16.276554107666016, -14.304800033569336, -12.333047866821289, -10.361295700073242, -8.389543533325195, -6.41779088973999, -4.446038246154785, -2.4742860794067383, -0.5025339126586914, 1.4692192077636719, 3.4409713745117188, 5.412727355957031, 7.384479522705078, 9.356231689453125, 11.327984809875488, 13.299736976623535, 15.271489143371582, 17.243242263793945, 19.214994430541992, 21.18674659729004, 23.158498764038086, 25.130250930786133, 27.102005004882812, 29.07375717163086, 31.045509338378906, 33.01726150512695, 34.989013671875, 36.96076583862305, 38.932518005371094, 40.90427017211914, 42.87602233886719, 44.847774505615234, 46.81952667236328, 48.791282653808594, 50.763031005859375, 52.73478698730469, 54.706539154052734, 56.67829132080078, 58.65004348754883, 60.621795654296875, 62.59354782104492, 64.56529998779297, 66.53705596923828, 68.50880432128906]}, "gradients/decoder.transformer.h.0.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 2.0, 8.0, 7.0, 3.0, 8.0, 2.0, 4.0, 13.0, 12.0, 10.0, 17.0, 11.0, 12.0, 22.0, 28.0, 27.0, 34.0, 25.0, 35.0, 42.0, 42.0, 36.0, 42.0, 53.0, 47.0, 49.0, 45.0, 31.0, 40.0, 42.0, 27.0, 32.0, 31.0, 26.0, 16.0, 20.0, 20.0, 14.0, 6.0, 16.0, 8.0, 8.0, 6.0, 7.0, 6.0, 6.0, 5.0, 3.0, 1.0, 3.0, 0.0, 2.0, 2.0, 0.0, 0.0, 2.0], "bins": [-10.34375, -10.0228271484375, -9.701904296875, -9.3809814453125, -9.06005859375, -8.7391357421875, -8.418212890625, -8.0972900390625, -7.7763671875, -7.4554443359375, -7.134521484375, -6.8135986328125, -6.49267578125, -6.1717529296875, -5.850830078125, -5.5299072265625, -5.208984375, -4.8880615234375, -4.567138671875, -4.2462158203125, -3.92529296875, -3.6043701171875, -3.283447265625, -2.9625244140625, -2.6416015625, -2.3206787109375, -1.999755859375, -1.6788330078125, -1.35791015625, -1.0369873046875, -0.716064453125, -0.3951416015625, -0.07421875, 0.2467041015625, 0.567626953125, 0.8885498046875, 1.20947265625, 1.5303955078125, 1.851318359375, 2.1722412109375, 2.4931640625, 2.8140869140625, 3.135009765625, 3.4559326171875, 3.77685546875, 4.0977783203125, 4.418701171875, 4.7396240234375, 5.060546875, 5.3814697265625, 5.702392578125, 6.0233154296875, 6.34423828125, 6.6651611328125, 6.986083984375, 7.3070068359375, 7.6279296875, 7.9488525390625, 8.269775390625, 8.5906982421875, 8.91162109375, 9.2325439453125, 9.553466796875, 9.8743896484375, 10.1953125]}, "gradients/decoder.transformer.h.0.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 1.0, 3.0, 3.0, 2.0, 8.0, 3.0, 6.0, 10.0, 8.0, 12.0, 17.0, 16.0, 29.0, 26.0, 52.0, 42.0, 62.0, 76.0, 75.0, 89.0, 117.0, 168.0, 241.0, 428.0, 634.0, 1268.0, 4259.0, 776877.0, 3397063.0, 8525.0, 1761.0, 776.0, 434.0, 309.0, 197.0, 141.0, 100.0, 94.0, 75.0, 39.0, 42.0, 42.0, 31.0, 20.0, 26.0, 18.0, 18.0, 12.0, 10.0, 8.0, 9.0, 5.0, 4.0, 2.0, 3.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-73.0, -70.6572265625, -68.314453125, -65.9716796875, -63.62890625, -61.2861328125, -58.943359375, -56.6005859375, -54.2578125, -51.9150390625, -49.572265625, -47.2294921875, -44.88671875, -42.5439453125, -40.201171875, -37.8583984375, -35.515625, -33.1728515625, -30.830078125, -28.4873046875, -26.14453125, -23.8017578125, -21.458984375, -19.1162109375, -16.7734375, -14.4306640625, -12.087890625, -9.7451171875, -7.40234375, -5.0595703125, -2.716796875, -0.3740234375, 1.96875, 4.3115234375, 6.654296875, 8.9970703125, 11.33984375, 13.6826171875, 16.025390625, 18.3681640625, 20.7109375, 23.0537109375, 25.396484375, 27.7392578125, 30.08203125, 32.4248046875, 34.767578125, 37.1103515625, 39.453125, 41.7958984375, 44.138671875, 46.4814453125, 48.82421875, 51.1669921875, 53.509765625, 55.8525390625, 58.1953125, 60.5380859375, 62.880859375, 65.2236328125, 67.56640625, 69.9091796875, 72.251953125, 74.5947265625, 76.9375]}, "gradients/decoder.transformer.h.0.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 3.0, 1.0, 4.0, 5.0, 6.0, 31.0, 68.0, 232.0, 953.0, 1853.0, 711.0, 130.0, 56.0, 16.0, 5.0, 4.0, 3.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-33.5, -31.861328125, -30.22265625, -28.583984375, -26.9453125, -25.306640625, -23.66796875, -22.029296875, -20.390625, -18.751953125, -17.11328125, -15.474609375, -13.8359375, -12.197265625, -10.55859375, -8.919921875, -7.28125, -5.642578125, -4.00390625, -2.365234375, -0.7265625, 0.912109375, 2.55078125, 4.189453125, 5.828125, 7.466796875, 9.10546875, 10.744140625, 12.3828125, 14.021484375, 15.66015625, 17.298828125, 18.9375, 20.576171875, 22.21484375, 23.853515625, 25.4921875, 27.130859375, 28.76953125, 30.408203125, 32.046875, 33.685546875, 35.32421875, 36.962890625, 38.6015625, 40.240234375, 41.87890625, 43.517578125, 45.15625, 46.794921875, 48.43359375, 50.072265625, 51.7109375, 53.349609375, 54.98828125, 56.626953125, 58.265625, 59.904296875, 61.54296875, 63.181640625, 64.8203125, 66.458984375, 68.09765625, 69.736328125, 71.375]}, "gradients/decoder.transformer.h.0.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 3.0, 2.0, 1.0, 7.0, 6.0, 12.0, 23.0, 48.0, 103.0, 220.0, 703.0, 4801.0, 4095550.0, 90486.0, 1611.0, 394.0, 158.0, 72.0, 43.0, 27.0, 7.0, 5.0, 2.0, 1.0, 2.0, 4.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-79.75, -76.9775390625, -74.205078125, -71.4326171875, -68.66015625, -65.8876953125, -63.115234375, -60.3427734375, -57.5703125, -54.7978515625, -52.025390625, -49.2529296875, -46.48046875, -43.7080078125, -40.935546875, -38.1630859375, -35.390625, -32.6181640625, -29.845703125, -27.0732421875, -24.30078125, -21.5283203125, -18.755859375, -15.9833984375, -13.2109375, -10.4384765625, -7.666015625, -4.8935546875, -2.12109375, 0.6513671875, 3.423828125, 6.1962890625, 8.96875, 11.7412109375, 14.513671875, 17.2861328125, 20.05859375, 22.8310546875, 25.603515625, 28.3759765625, 31.1484375, 33.9208984375, 36.693359375, 39.4658203125, 42.23828125, 45.0107421875, 47.783203125, 50.5556640625, 53.328125, 56.1005859375, 58.873046875, 61.6455078125, 64.41796875, 67.1904296875, 69.962890625, 72.7353515625, 75.5078125, 78.2802734375, 81.052734375, 83.8251953125, 86.59765625, 89.3701171875, 92.142578125, 94.9150390625, 97.6875]}, "gradients/decoder.transformer.h.0.ln_2.weight": {"_type": "histogram", "values": [3.0, 2.0, 6.0, 7.0, 39.0, 110.0, 322.0, 377.0, 117.0, 25.0, 10.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-79.99321746826172, -68.5245132446289, -57.05580520629883, -45.58709716796875, -34.11839294433594, -22.649688720703125, -11.180976867675781, 0.28772735595703125, 11.756431579589844, 23.22513771057129, 34.693843841552734, 46.16255187988281, 57.631256103515625, 69.09996032714844, 80.56867218017578, 92.0373764038086, 103.5060806274414, 114.97478485107422, 126.44349670410156, 137.91220092773438, 149.3809051513672, 160.849609375, 172.31832885742188, 183.78701782226562, 195.2557373046875, 206.7244415283203, 218.19314575195312, 229.661865234375, 241.13055419921875, 252.59927368164062, 264.0679931640625, 275.53668212890625, 287.00537109375, 298.4740905761719, 309.9427795410156, 321.4114990234375, 332.88018798828125, 344.3489074707031, 355.817626953125, 367.28631591796875, 378.7550048828125, 390.2237243652344, 401.6924133300781, 413.1611328125, 424.62982177734375, 436.0985412597656, 447.5672607421875, 459.03594970703125, 470.5046691894531, 481.973388671875, 493.44207763671875, 504.9107971191406, 516.3795166015625, 527.8482055664062, 539.31689453125, 550.78564453125, 562.2543334960938, 573.7230224609375, 585.1917724609375, 596.6604614257812, 608.129150390625, 619.5978393554688, 631.0665893554688, 642.5352783203125, 654.0039672851562]}, "gradients/decoder.transformer.h.0.ln_2.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 0.0, 3.0, 9.0, 9.0, 10.0, 10.0, 12.0, 14.0, 11.0, 17.0, 29.0, 24.0, 28.0, 35.0, 36.0, 32.0, 40.0, 46.0, 49.0, 46.0, 44.0, 39.0, 43.0, 46.0, 52.0, 57.0, 44.0, 32.0, 30.0, 26.0, 26.0, 16.0, 18.0, 12.0, 13.0, 6.0, 8.0, 8.0, 11.0, 5.0, 4.0, 3.0, 3.0, 1.0, 1.0, 2.0, 1.0, 3.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-70.95883178710938, -68.74300384521484, -66.52718353271484, -64.31135559082031, -62.09553146362305, -59.87970733642578, -57.66387939453125, -55.448055267333984, -53.23223114013672, -51.01640701293945, -48.80058288574219, -46.584754943847656, -44.36893081665039, -42.153106689453125, -39.937278747558594, -37.72145462036133, -35.50563049316406, -33.2898063659668, -31.0739803314209, -28.858154296875, -26.642330169677734, -24.42650604248047, -22.21068000793457, -19.994853973388672, -17.779029846191406, -15.563204765319824, -13.347379684448242, -11.13155460357666, -8.915729522705078, -6.699904441833496, -4.484079360961914, -2.268254280090332, -0.05242919921875, 2.163395881652832, 4.379220962524414, 6.595046043395996, 8.810871124267578, 11.02669620513916, 13.242521286010742, 15.458346366882324, 17.674171447753906, 19.889995574951172, 22.10582160949707, 24.32164764404297, 26.537471771240234, 28.7532958984375, 30.9691219329834, 33.1849479675293, 35.40077209472656, 37.61659622192383, 39.832420349121094, 42.048248291015625, 44.26407241821289, 46.479896545410156, 48.69572448730469, 50.91154861450195, 53.12737274169922, 55.343196868896484, 57.55902099609375, 59.77484893798828, 61.99067306518555, 64.20649719238281, 66.42232513427734, 68.63814544677734, 70.85397338867188]}, "gradients/decoder.transformer.h.0.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 3.0, 2.0, 1.0, 6.0, 12.0, 7.0, 15.0, 12.0, 12.0, 18.0, 17.0, 28.0, 27.0, 26.0, 32.0, 38.0, 42.0, 37.0, 38.0, 41.0, 48.0, 45.0, 46.0, 44.0, 43.0, 49.0, 53.0, 38.0, 45.0, 25.0, 23.0, 30.0, 22.0, 15.0, 13.0, 9.0, 12.0, 5.0, 6.0, 8.0, 9.0, 3.0, 4.0, 0.0, 2.0, 2.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-90.5625, -87.5234375, -84.484375, -81.4453125, -78.40625, -75.3671875, -72.328125, -69.2890625, -66.25, -63.2109375, -60.171875, -57.1328125, -54.09375, -51.0546875, -48.015625, -44.9765625, -41.9375, -38.8984375, -35.859375, -32.8203125, -29.78125, -26.7421875, -23.703125, -20.6640625, -17.625, -14.5859375, -11.546875, -8.5078125, -5.46875, -2.4296875, 0.609375, 3.6484375, 6.6875, 9.7265625, 12.765625, 15.8046875, 18.84375, 21.8828125, 24.921875, 27.9609375, 31.0, 34.0390625, 37.078125, 40.1171875, 43.15625, 46.1953125, 49.234375, 52.2734375, 55.3125, 58.3515625, 61.390625, 64.4296875, 67.46875, 70.5078125, 73.546875, 76.5859375, 79.625, 82.6640625, 85.703125, 88.7421875, 91.78125, 94.8203125, 97.859375, 100.8984375, 103.9375]}, "gradients/decoder.transformer.h.0.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 3.0, 1.0, 5.0, 4.0, 6.0, 5.0, 6.0, 16.0, 30.0, 36.0, 56.0, 91.0, 119.0, 169.0, 284.0, 426.0, 622.0, 972.0, 1591.0, 2381.0, 3692.0, 6063.0, 9886.0, 16349.0, 27946.0, 50019.0, 94867.0, 205586.0, 336765.0, 133163.0, 67148.0, 36428.0, 20844.0, 12421.0, 7593.0, 4726.0, 2912.0, 1794.0, 1203.0, 800.0, 509.0, 344.0, 231.0, 143.0, 108.0, 69.0, 42.0, 29.0, 19.0, 17.0, 13.0, 7.0, 7.0, 4.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0], "bins": [-24.640625, -23.876220703125, -23.11181640625, -22.347412109375, -21.5830078125, -20.818603515625, -20.05419921875, -19.289794921875, -18.525390625, -17.760986328125, -16.99658203125, -16.232177734375, -15.4677734375, -14.703369140625, -13.93896484375, -13.174560546875, -12.41015625, -11.645751953125, -10.88134765625, -10.116943359375, -9.3525390625, -8.588134765625, -7.82373046875, -7.059326171875, -6.294921875, -5.530517578125, -4.76611328125, -4.001708984375, -3.2373046875, -2.472900390625, -1.70849609375, -0.944091796875, -0.1796875, 0.584716796875, 1.34912109375, 2.113525390625, 2.8779296875, 3.642333984375, 4.40673828125, 5.171142578125, 5.935546875, 6.699951171875, 7.46435546875, 8.228759765625, 8.9931640625, 9.757568359375, 10.52197265625, 11.286376953125, 12.05078125, 12.815185546875, 13.57958984375, 14.343994140625, 15.1083984375, 15.872802734375, 16.63720703125, 17.401611328125, 18.166015625, 18.930419921875, 19.69482421875, 20.459228515625, 21.2236328125, 21.988037109375, 22.75244140625, 23.516845703125, 24.28125]}, "gradients/decoder.transformer.h.0.crossattention.c_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 2.0, 2.0, 6.0, 4.0, 3.0, 4.0, 8.0, 6.0, 10.0, 7.0, 9.0, 19.0, 10.0, 19.0, 30.0, 15.0, 19.0, 22.0, 22.0, 29.0, 29.0, 34.0, 34.0, 36.0, 30.0, 31.0, 33.0, 1051.0, 39.0, 29.0, 38.0, 34.0, 37.0, 40.0, 31.0, 34.0, 33.0, 23.0, 34.0, 22.0, 15.0, 21.0, 16.0, 10.0, 7.0, 9.0, 6.0, 8.0, 9.0, 4.0, 2.0, 3.0, 4.0, 6.0, 3.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-45.25, -43.716796875, -42.18359375, -40.650390625, -39.1171875, -37.583984375, -36.05078125, -34.517578125, -32.984375, -31.451171875, -29.91796875, -28.384765625, -26.8515625, -25.318359375, -23.78515625, -22.251953125, -20.71875, -19.185546875, -17.65234375, -16.119140625, -14.5859375, -13.052734375, -11.51953125, -9.986328125, -8.453125, -6.919921875, -5.38671875, -3.853515625, -2.3203125, -0.787109375, 0.74609375, 2.279296875, 3.8125, 5.345703125, 6.87890625, 8.412109375, 9.9453125, 11.478515625, 13.01171875, 14.544921875, 16.078125, 17.611328125, 19.14453125, 20.677734375, 22.2109375, 23.744140625, 25.27734375, 26.810546875, 28.34375, 29.876953125, 31.41015625, 32.943359375, 34.4765625, 36.009765625, 37.54296875, 39.076171875, 40.609375, 42.142578125, 43.67578125, 45.208984375, 46.7421875, 48.275390625, 49.80859375, 51.341796875, 52.875]}, "gradients/decoder.transformer.h.0.crossattention.c_attn.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 3.0, 1.0, 6.0, 4.0, 3.0, 3.0, 8.0, 12.0, 10.0, 17.0, 18.0, 40.0, 53.0, 66.0, 146.0, 222.0, 364.0, 529.0, 1004.0, 1575.0, 2660.0, 4393.0, 7623.0, 13362.0, 24134.0, 44868.0, 86549.0, 186122.0, 1421294.0, 145610.0, 71409.0, 37204.0, 20311.0, 11420.0, 6494.0, 3826.0, 2248.0, 1351.0, 840.0, 483.0, 295.0, 207.0, 113.0, 64.0, 64.0, 32.0, 25.0, 18.0, 9.0, 9.0, 4.0, 4.0, 6.0, 6.0, 2.0, 2.0, 1.0, 2.0], "bins": [-28.734375, -27.89794921875, -27.0615234375, -26.22509765625, -25.388671875, -24.55224609375, -23.7158203125, -22.87939453125, -22.04296875, -21.20654296875, -20.3701171875, -19.53369140625, -18.697265625, -17.86083984375, -17.0244140625, -16.18798828125, -15.3515625, -14.51513671875, -13.6787109375, -12.84228515625, -12.005859375, -11.16943359375, -10.3330078125, -9.49658203125, -8.66015625, -7.82373046875, -6.9873046875, -6.15087890625, -5.314453125, -4.47802734375, -3.6416015625, -2.80517578125, -1.96875, -1.13232421875, -0.2958984375, 0.54052734375, 1.376953125, 2.21337890625, 3.0498046875, 3.88623046875, 4.72265625, 5.55908203125, 6.3955078125, 7.23193359375, 8.068359375, 8.90478515625, 9.7412109375, 10.57763671875, 11.4140625, 12.25048828125, 13.0869140625, 13.92333984375, 14.759765625, 15.59619140625, 16.4326171875, 17.26904296875, 18.10546875, 18.94189453125, 19.7783203125, 20.61474609375, 21.451171875, 22.28759765625, 23.1240234375, 23.96044921875, 24.796875]}, "gradients/decoder.transformer.h.0.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 3.0, 1.0, 0.0, 2.0, 4.0, 5.0, 3.0, 7.0, 11.0, 10.0, 20.0, 15.0, 10.0, 13.0, 26.0, 43.0, 53.0, 61.0, 84.0, 95.0, 94.0, 110.0, 80.0, 53.0, 37.0, 32.0, 35.0, 27.0, 15.0, 19.0, 11.0, 11.0, 4.0, 8.0, 4.0, 4.0, 2.0, 1.0, 0.0, 0.0, 2.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0196533203125, -0.01908278465270996, -0.018512248992919922, -0.017941713333129883, -0.017371177673339844, -0.016800642013549805, -0.016230106353759766, -0.015659570693969727, -0.015089035034179688, -0.014518499374389648, -0.01394796371459961, -0.01337742805480957, -0.012806892395019531, -0.012236356735229492, -0.011665821075439453, -0.011095285415649414, -0.010524749755859375, -0.009954214096069336, -0.009383678436279297, -0.008813142776489258, -0.008242607116699219, -0.00767207145690918, -0.007101535797119141, -0.0065310001373291016, -0.0059604644775390625, -0.0053899288177490234, -0.004819393157958984, -0.004248857498168945, -0.0036783218383789062, -0.003107786178588867, -0.002537250518798828, -0.001966714859008789, -0.00139617919921875, -0.0008256435394287109, -0.0002551078796386719, 0.0003154277801513672, 0.0008859634399414062, 0.0014564990997314453, 0.0020270347595214844, 0.0025975704193115234, 0.0031681060791015625, 0.0037386417388916016, 0.004309177398681641, 0.00487971305847168, 0.005450248718261719, 0.006020784378051758, 0.006591320037841797, 0.007161855697631836, 0.007732391357421875, 0.008302927017211914, 0.008873462677001953, 0.009443998336791992, 0.010014533996582031, 0.01058506965637207, 0.01115560531616211, 0.011726140975952148, 0.012296676635742188, 0.012867212295532227, 0.013437747955322266, 0.014008283615112305, 0.014578819274902344, 0.015149354934692383, 0.015719890594482422, 0.01629042625427246, 0.0168609619140625]}, "gradients/decoder.transformer.h.0.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 3.0, 3.0, 2.0, 4.0, 4.0, 6.0, 8.0, 8.0, 14.0, 13.0, 23.0, 30.0, 53.0, 70.0, 87.0, 136.0, 241.0, 452.0, 1132.0, 6794.0, 295753.0, 729150.0, 11816.0, 1487.0, 508.0, 262.0, 156.0, 107.0, 61.0, 45.0, 45.0, 21.0, 19.0, 12.0, 9.0, 9.0, 9.0, 2.0, 6.0, 2.0, 1.0, 3.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.28759765625, -0.2791633605957031, -0.27072906494140625, -0.2622947692871094, -0.2538604736328125, -0.24542617797851562, -0.23699188232421875, -0.22855758666992188, -0.220123291015625, -0.21168899536132812, -0.20325469970703125, -0.19482040405273438, -0.1863861083984375, -0.17795181274414062, -0.16951751708984375, -0.16108322143554688, -0.15264892578125, -0.14421463012695312, -0.13578033447265625, -0.12734603881835938, -0.1189117431640625, -0.11047744750976562, -0.10204315185546875, -0.09360885620117188, -0.085174560546875, -0.07674026489257812, -0.06830596923828125, -0.059871673583984375, -0.0514373779296875, -0.043003082275390625, -0.03456878662109375, -0.026134490966796875, -0.0177001953125, -0.009265899658203125, -0.00083160400390625, 0.007602691650390625, 0.0160369873046875, 0.024471282958984375, 0.03290557861328125, 0.041339874267578125, 0.049774169921875, 0.058208465576171875, 0.06664276123046875, 0.07507705688476562, 0.0835113525390625, 0.09194564819335938, 0.10037994384765625, 0.10881423950195312, 0.11724853515625, 0.12568283081054688, 0.13411712646484375, 0.14255142211914062, 0.1509857177734375, 0.15942001342773438, 0.16785430908203125, 0.17628860473632812, 0.184722900390625, 0.19315719604492188, 0.20159149169921875, 0.21002578735351562, 0.2184600830078125, 0.22689437866210938, 0.23532867431640625, 0.24376296997070312, 0.252197265625]}, "gradients/decoder.transformer.h.0.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 3.0, 4.0, 10.0, 16.0, 149.0, 429.0, 315.0, 68.0, 16.0, 5.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.07982425391674042, -0.0781906321644783, -0.07655701786279678, -0.07492339611053467, -0.07328978180885315, -0.07165616005659103, -0.07002254575490952, -0.0683889240026474, -0.06675530970096588, -0.06512168794870377, -0.06348807364702225, -0.06185445562005043, -0.06022083759307861, -0.058587219566106796, -0.05695360153913498, -0.05531998351216316, -0.053686365485191345, -0.05205274745821953, -0.05041912943124771, -0.048785511404275894, -0.04715189337730408, -0.04551827535033226, -0.04388465732336044, -0.042251039296388626, -0.04061741754412651, -0.038983799517154694, -0.03735018149018288, -0.03571656346321106, -0.03408294543623924, -0.032449327409267426, -0.03081570938229561, -0.02918209135532379, -0.027548471465706825, -0.025914853438735008, -0.02428123541176319, -0.022647617384791374, -0.021013999357819557, -0.01938037946820259, -0.017746761441230774, -0.016113143414258957, -0.014479526318609715, -0.012845908291637897, -0.01121229026466608, -0.009578671306371689, -0.007945053279399872, -0.006311435252428055, -0.004677817225456238, -0.0030441991984844208, -0.0014105811715126038, 0.00022303697187453508, 0.001856655115261674, 0.0034902733750641346, 0.005123891402035952, 0.006757509894669056, 0.008391127921640873, 0.01002474594861269, 0.011658363975584507, 0.013291982002556324, 0.014925600029528141, 0.016559218987822533, 0.01819283701479435, 0.019826455041766167, 0.021460073068737984, 0.0230936910957098, 0.024727309122681618]}, "gradients/decoder.transformer.h.0.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 3.0, 3.0, 1.0, 9.0, 7.0, 6.0, 18.0, 15.0, 19.0, 28.0, 22.0, 27.0, 36.0, 37.0, 39.0, 49.0, 65.0, 43.0, 44.0, 59.0, 68.0, 58.0, 47.0, 46.0, 34.0, 43.0, 43.0, 24.0, 22.0, 20.0, 14.0, 15.0, 18.0, 10.0, 4.0, 4.0, 3.0, 5.0, 4.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.008314907550811768, -0.008013147860765457, -0.007711388170719147, -0.007409628480672836, -0.007107868790626526, -0.0068061091005802155, -0.006504349410533905, -0.006202589720487595, -0.005900830030441284, -0.005599070340394974, -0.005297310650348663, -0.004995550960302353, -0.0046937912702560425, -0.004392031580209732, -0.004090271890163422, -0.003788512200117111, -0.0034867525100708008, -0.0031849928200244904, -0.00288323312997818, -0.0025814734399318695, -0.002279713749885559, -0.0019779540598392487, -0.0016761943697929382, -0.0013744346797466278, -0.0010726749897003174, -0.000770915299654007, -0.00046915560960769653, -0.0001673959195613861, 0.00013436377048492432, 0.00043612346053123474, 0.0007378831505775452, 0.0010396428406238556, 0.001341402530670166, 0.0016431622207164764, 0.0019449219107627869, 0.0022466816008090973, 0.0025484412908554077, 0.002850200980901718, 0.0031519606709480286, 0.003453720360994339, 0.0037554800510406494, 0.00405723974108696, 0.00435899943113327, 0.004660759121179581, 0.004962518811225891, 0.0052642785012722015, 0.005566038191318512, 0.005867797881364822, 0.006169557571411133, 0.006471317261457443, 0.006773076951503754, 0.007074836641550064, 0.0073765963315963745, 0.007678356021642685, 0.007980115711688995, 0.008281875401735306, 0.008583635091781616, 0.008885394781827927, 0.009187154471874237, 0.009488914161920547, 0.009790673851966858, 0.010092433542013168, 0.010394193232059479, 0.01069595292210579, 0.0109977126121521]}, "gradients/decoder.transformer.h.0.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 3.0, 2.0, 1.0, 6.0, 12.0, 7.0, 16.0, 11.0, 12.0, 18.0, 20.0, 25.0, 27.0, 27.0, 32.0, 38.0, 41.0, 38.0, 37.0, 41.0, 48.0, 46.0, 45.0, 44.0, 43.0, 51.0, 51.0, 39.0, 44.0, 25.0, 26.0, 27.0, 22.0, 15.0, 13.0, 9.0, 12.0, 6.0, 5.0, 8.0, 8.0, 4.0, 4.0, 0.0, 2.0, 2.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-90.5, -87.4619140625, -84.423828125, -81.3857421875, -78.34765625, -75.3095703125, -72.271484375, -69.2333984375, -66.1953125, -63.1572265625, -60.119140625, -57.0810546875, -54.04296875, -51.0048828125, -47.966796875, -44.9287109375, -41.890625, -38.8525390625, -35.814453125, -32.7763671875, -29.73828125, -26.7001953125, -23.662109375, -20.6240234375, -17.5859375, -14.5478515625, -11.509765625, -8.4716796875, -5.43359375, -2.3955078125, 0.642578125, 3.6806640625, 6.71875, 9.7568359375, 12.794921875, 15.8330078125, 18.87109375, 21.9091796875, 24.947265625, 27.9853515625, 31.0234375, 34.0615234375, 37.099609375, 40.1376953125, 43.17578125, 46.2138671875, 49.251953125, 52.2900390625, 55.328125, 58.3662109375, 61.404296875, 64.4423828125, 67.48046875, 70.5185546875, 73.556640625, 76.5947265625, 79.6328125, 82.6708984375, 85.708984375, 88.7470703125, 91.78515625, 94.8232421875, 97.861328125, 100.8994140625, 103.9375]}, "gradients/decoder.transformer.h.0.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 1.0, 3.0, 2.0, 5.0, 12.0, 4.0, 23.0, 9.0, 30.0, 33.0, 46.0, 82.0, 83.0, 146.0, 202.0, 296.0, 496.0, 916.0, 1449.0, 2578.0, 5565.0, 12980.0, 44497.0, 278002.0, 577639.0, 87306.0, 20610.0, 7307.0, 3617.0, 1899.0, 1016.0, 592.0, 374.0, 238.0, 155.0, 94.0, 68.0, 47.0, 29.0, 23.0, 29.0, 20.0, 9.0, 11.0, 4.0, 7.0, 7.0, 3.0, 3.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-24.21875, -23.360107421875, -22.50146484375, -21.642822265625, -20.7841796875, -19.925537109375, -19.06689453125, -18.208251953125, -17.349609375, -16.490966796875, -15.63232421875, -14.773681640625, -13.9150390625, -13.056396484375, -12.19775390625, -11.339111328125, -10.48046875, -9.621826171875, -8.76318359375, -7.904541015625, -7.0458984375, -6.187255859375, -5.32861328125, -4.469970703125, -3.611328125, -2.752685546875, -1.89404296875, -1.035400390625, -0.1767578125, 0.681884765625, 1.54052734375, 2.399169921875, 3.2578125, 4.116455078125, 4.97509765625, 5.833740234375, 6.6923828125, 7.551025390625, 8.40966796875, 9.268310546875, 10.126953125, 10.985595703125, 11.84423828125, 12.702880859375, 13.5615234375, 14.420166015625, 15.27880859375, 16.137451171875, 16.99609375, 17.854736328125, 18.71337890625, 19.572021484375, 20.4306640625, 21.289306640625, 22.14794921875, 23.006591796875, 23.865234375, 24.723876953125, 25.58251953125, 26.441162109375, 27.2998046875, 28.158447265625, 29.01708984375, 29.875732421875, 30.734375]}, "gradients/decoder.transformer.h.0.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 3.0, 6.0, 4.0, 13.0, 9.0, 12.0, 26.0, 43.0, 42.0, 66.0, 72.0, 67.0, 99.0, 2135.0, 96.0, 78.0, 78.0, 40.0, 42.0, 42.0, 26.0, 12.0, 17.0, 8.0, 5.0, 2.0, 4.0, 5.0, 1.0, 2.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-257.75, -249.8046875, -241.859375, -233.9140625, -225.96875, -218.0234375, -210.078125, -202.1328125, -194.1875, -186.2421875, -178.296875, -170.3515625, -162.40625, -154.4609375, -146.515625, -138.5703125, -130.625, -122.6796875, -114.734375, -106.7890625, -98.84375, -90.8984375, -82.953125, -75.0078125, -67.0625, -59.1171875, -51.171875, -43.2265625, -35.28125, -27.3359375, -19.390625, -11.4453125, -3.5, 4.4453125, 12.390625, 20.3359375, 28.28125, 36.2265625, 44.171875, 52.1171875, 60.0625, 68.0078125, 75.953125, 83.8984375, 91.84375, 99.7890625, 107.734375, 115.6796875, 123.625, 131.5703125, 139.515625, 147.4609375, 155.40625, 163.3515625, 171.296875, 179.2421875, 187.1875, 195.1328125, 203.078125, 211.0234375, 218.96875, 226.9140625, 234.859375, 242.8046875, 250.75]}, "gradients/decoder.transformer.h.0.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 3.0, 3.0, 11.0, 7.0, 15.0, 24.0, 11.0, 36.0, 52.0, 65.0, 132.0, 195.0, 342.0, 615.0, 1828.0, 10886.0, 2399638.0, 717892.0, 10693.0, 1787.0, 613.0, 323.0, 207.0, 115.0, 76.0, 32.0, 38.0, 26.0, 13.0, 8.0, 10.0, 6.0, 2.0, 6.0, 3.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-62.125, -60.0537109375, -57.982421875, -55.9111328125, -53.83984375, -51.7685546875, -49.697265625, -47.6259765625, -45.5546875, -43.4833984375, -41.412109375, -39.3408203125, -37.26953125, -35.1982421875, -33.126953125, -31.0556640625, -28.984375, -26.9130859375, -24.841796875, -22.7705078125, -20.69921875, -18.6279296875, -16.556640625, -14.4853515625, -12.4140625, -10.3427734375, -8.271484375, -6.2001953125, -4.12890625, -2.0576171875, 0.013671875, 2.0849609375, 4.15625, 6.2275390625, 8.298828125, 10.3701171875, 12.44140625, 14.5126953125, 16.583984375, 18.6552734375, 20.7265625, 22.7978515625, 24.869140625, 26.9404296875, 29.01171875, 31.0830078125, 33.154296875, 35.2255859375, 37.296875, 39.3681640625, 41.439453125, 43.5107421875, 45.58203125, 47.6533203125, 49.724609375, 51.7958984375, 53.8671875, 55.9384765625, 58.009765625, 60.0810546875, 62.15234375, 64.2236328125, 66.294921875, 68.3662109375, 70.4375]}, "gradients/decoder.transformer.h.0.ln_1.weight": {"_type": "histogram", "values": [2.0, 1.0, 2.0, 1.0, 15.0, 51.0, 491.0, 375.0, 64.0, 8.0, 3.0, 7.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-200.3923797607422, -171.44140625, -142.4904327392578, -113.53946685791016, -84.58849334716797, -55.63752746582031, -26.686553955078125, 2.2644195556640625, 31.21539306640625, 60.16636657714844, 89.11734008789062, 118.06830596923828, 147.019287109375, 175.97024536132812, 204.9212188720703, 233.8721923828125, 262.82318115234375, 291.7741394042969, 320.7251281738281, 349.67608642578125, 378.6270751953125, 407.5780334472656, 436.52899169921875, 465.47998046875, 494.4309387207031, 523.3818969726562, 552.3328857421875, 581.2838745117188, 610.2348022460938, 639.185791015625, 668.1367797851562, 697.0877685546875, 726.0387573242188, 754.98974609375, 783.940673828125, 812.8916625976562, 841.8426513671875, 870.7935791015625, 899.7445678710938, 928.695556640625, 957.6465454101562, 986.5975341796875, 1015.5484619140625, 1044.49951171875, 1073.450439453125, 1102.4013671875, 1131.3524169921875, 1160.3033447265625, 1189.2542724609375, 1218.2052001953125, 1247.15625, 1276.107177734375, 1305.05810546875, 1334.0091552734375, 1362.9600830078125, 1391.9111328125, 1420.862060546875, 1449.81298828125, 1478.7640380859375, 1507.7149658203125, 1536.6658935546875, 1565.616943359375, 1594.56787109375, 1623.518798828125, 1652.4698486328125]}, "gradients/decoder.transformer.h.0.ln_1.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 4.0, 3.0, 1.0, 0.0, 7.0, 3.0, 4.0, 13.0, 9.0, 11.0, 8.0, 16.0, 9.0, 19.0, 22.0, 26.0, 26.0, 29.0, 34.0, 31.0, 47.0, 44.0, 53.0, 47.0, 48.0, 45.0, 46.0, 52.0, 35.0, 41.0, 37.0, 36.0, 29.0, 30.0, 16.0, 22.0, 23.0, 11.0, 13.0, 11.0, 11.0, 4.0, 10.0, 6.0, 7.0, 3.0, 3.0, 7.0, 1.0, 4.0, 1.0, 0.0, 1.0, 1.0, 1.0], "bins": [-189.0828094482422, -183.53890991210938, -177.9949951171875, -172.4510955810547, -166.90719604492188, -161.36329650878906, -155.8193817138672, -150.27548217773438, -144.73158264160156, -139.18768310546875, -133.64376831054688, -128.09986877441406, -122.55596923828125, -117.0120620727539, -111.46815490722656, -105.92425537109375, -100.3803482055664, -94.83644104003906, -89.29254150390625, -83.7486343383789, -78.2047348022461, -72.66082763671875, -67.11692810058594, -61.573020935058594, -56.029117584228516, -50.48521423339844, -44.94131088256836, -39.39740753173828, -33.85350036621094, -28.309598922729492, -22.76569366455078, -17.221790313720703, -11.677886962890625, -6.133983135223389, -0.5900793075561523, 4.953824996948242, 10.49772834777832, 16.0416316986084, 21.58553695678711, 27.129440307617188, 32.673343658447266, 38.217247009277344, 43.76115036010742, 49.3050537109375, 54.848960876464844, 60.392860412597656, 65.936767578125, 71.48066711425781, 77.02457427978516, 82.5684814453125, 88.11238098144531, 93.65628814697266, 99.20018768310547, 104.74409484863281, 110.28799438476562, 115.83190155029297, 121.37580871582031, 126.91971588134766, 132.463623046875, 138.0075225830078, 143.55142211914062, 149.09532165527344, 154.6392364501953, 160.18313598632812, 165.72703552246094]}, "gradients/decoder.transformer.wpe.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 4.0, 3.0, 0.0, 5.0, 2.0, 7.0, 7.0, 9.0, 12.0, 20.0, 21.0, 31.0, 33.0, 51.0, 59.0, 91.0, 124.0, 203.0, 263.0, 411.0, 615.0, 828.0, 987.0, 1041438.0, 893.0, 747.0, 544.0, 354.0, 240.0, 157.0, 107.0, 86.0, 53.0, 39.0, 18.0, 15.0, 17.0, 20.0, 7.0, 9.0, 7.0, 8.0, 4.0, 3.0, 6.0, 4.0, 3.0, 0.0, 2.0, 2.0, 0.0, 3.0], "bins": [-76.24835205078125, -74.08812713623047, -71.92790222167969, -69.76766967773438, -67.6074447631836, -65.44721984863281, -63.286991119384766, -61.12676239013672, -58.96653747558594, -56.806312561035156, -54.64608383178711, -52.48585510253906, -50.32563018798828, -48.1654052734375, -46.00517654418945, -43.844947814941406, -41.684722900390625, -39.524497985839844, -37.3642692565918, -35.20404052734375, -33.04381561279297, -30.883588790893555, -28.72336196899414, -26.563135147094727, -24.402908325195312, -22.2426815032959, -20.082454681396484, -17.92222785949707, -15.762001037597656, -13.601774215698242, -11.441547393798828, -9.281320571899414, -7.121086120605469, -4.960859298706055, -2.8006324768066406, -0.6404056549072266, 1.5198211669921875, 3.6800479888916016, 5.840274810791016, 8.00050163269043, 10.160728454589844, 12.320955276489258, 14.481182098388672, 16.641408920288086, 18.8016357421875, 20.961862564086914, 23.122089385986328, 25.282316207885742, 27.442543029785156, 29.60276985168457, 31.762996673583984, 33.92322540283203, 36.08345031738281, 38.243675231933594, 40.40390396118164, 42.56413269042969, 44.72435760498047, 46.88458251953125, 49.0448112487793, 51.205039978027344, 53.365264892578125, 55.525489807128906, 57.68571853637695, 59.845947265625, 62.00617218017578]}, "gradients/decoder.transformer.wte.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 6.0, 5.0, 7.0, 4.0, 12.0, 9.0, 16.0, 18.0, 23.0, 27.0, 103.0, 543.0, 51462052.0, 164.0, 49.0, 29.0, 26.0, 10.0, 3.0, 4.0, 0.0, 4.0, 5.0, 2.0, 4.0, 7.0, 4.0, 8.0, 4.0, 8.0, 4.0, 3.0, 2.0, 4.0, 2.0, 3.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-6036.0, -5777.8173828125, -5519.63427734375, -5261.45166015625, -5003.2685546875, -4745.0859375, -4486.9033203125, -4228.72021484375, -3970.537353515625, -3712.3544921875, -3454.171630859375, -3195.98876953125, -2937.80615234375, -2679.623046875, -2421.4404296875, -2163.257568359375, -1905.07470703125, -1646.891845703125, -1388.708984375, -1130.5262451171875, -872.3433837890625, -614.1605224609375, -355.977783203125, -97.794921875, 160.387939453125, 418.5707702636719, 676.7536010742188, 934.9364013671875, 1193.1192626953125, 1451.3021240234375, 1709.48486328125, 1967.667724609375, 2225.8505859375, 2484.033447265625, 2742.21630859375, 3000.39892578125, 3258.58203125, 3516.7646484375, 3774.947509765625, 4033.13037109375, 4291.3134765625, 4549.49609375, 4807.67919921875, 5065.86181640625, 5324.044921875, 5582.2275390625, 5840.41015625, 6098.59326171875, 6356.77587890625, 6614.95849609375, 6873.1416015625, 7131.32421875, 7389.50732421875, 7647.68994140625, 7905.873046875, 8164.0556640625, 8422.23828125, 8680.4208984375, 8938.603515625, 9196.787109375, 9454.9697265625, 9713.15234375, 9971.3349609375, 10229.517578125, 10487.701171875]}, "gradients/encoder.adapter.layers.2.conv.weight": {"_type": "histogram", "values": [2.0, 6.0, 6.0, 3.0, 7.0, 6.0, 16.0, 16.0, 18.0, 51.0, 58.0, 83.0, 129.0, 194.0, 284.0, 388.0, 519.0, 848.0, 1355.0, 2005.0, 2934.0, 4700.0, 6876.0, 10834.0, 16956.0, 27609.0, 44119.0, 73617.0, 127699.0, 232614.0, 507470.0, 3991997.0, 626517.0, 260131.0, 140669.0, 80718.0, 48035.0, 29454.0, 18509.0, 11969.0, 7701.0, 4892.0, 3048.0, 2071.0, 1412.0, 938.0, 624.0, 430.0, 290.0, 175.0, 162.0, 91.0, 66.0, 42.0, 28.0, 19.0, 14.0, 6.0, 8.0, 6.0, 6.0, 3.0, 2.0, 1.0], "bins": [-9.78125, -9.4715576171875, -9.161865234375, -8.8521728515625, -8.54248046875, -8.2327880859375, -7.923095703125, -7.6134033203125, -7.3037109375, -6.9940185546875, -6.684326171875, -6.3746337890625, -6.06494140625, -5.7552490234375, -5.445556640625, -5.1358642578125, -4.826171875, -4.5164794921875, -4.206787109375, -3.8970947265625, -3.58740234375, -3.2777099609375, -2.968017578125, -2.6583251953125, -2.3486328125, -2.0389404296875, -1.729248046875, -1.4195556640625, -1.10986328125, -0.8001708984375, -0.490478515625, -0.1807861328125, 0.12890625, 0.4385986328125, 0.748291015625, 1.0579833984375, 1.36767578125, 1.6773681640625, 1.987060546875, 2.2967529296875, 2.6064453125, 2.9161376953125, 3.225830078125, 3.5355224609375, 3.84521484375, 4.1549072265625, 4.464599609375, 4.7742919921875, 5.083984375, 5.3936767578125, 5.703369140625, 6.0130615234375, 6.32275390625, 6.6324462890625, 6.942138671875, 7.2518310546875, 7.5615234375, 7.8712158203125, 8.180908203125, 8.4906005859375, 8.80029296875, 9.1099853515625, 9.419677734375, 9.7293701171875, 10.0390625]}, "gradients/encoder.adapter.layers.2.conv.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 2.0, 6.0, 3.0, 5.0, 4.0, 5.0, 11.0, 8.0, 6.0, 14.0, 12.0, 24.0, 14.0, 32.0, 21.0, 32.0, 37.0, 52.0, 37.0, 43.0, 55.0, 71.0, 191.0, 646.0, 213.0, 80.0, 49.0, 46.0, 44.0, 38.0, 41.0, 27.0, 28.0, 22.0, 27.0, 15.0, 12.0, 15.0, 14.0, 8.0, 9.0, 6.0, 5.0, 1.0, 4.0, 0.0, 3.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-20.265625, -19.645263671875, -19.02490234375, -18.404541015625, -17.7841796875, -17.163818359375, -16.54345703125, -15.923095703125, -15.302734375, -14.682373046875, -14.06201171875, -13.441650390625, -12.8212890625, -12.200927734375, -11.58056640625, -10.960205078125, -10.33984375, -9.719482421875, -9.09912109375, -8.478759765625, -7.8583984375, -7.238037109375, -6.61767578125, -5.997314453125, -5.376953125, -4.756591796875, -4.13623046875, -3.515869140625, -2.8955078125, -2.275146484375, -1.65478515625, -1.034423828125, -0.4140625, 0.206298828125, 0.82666015625, 1.447021484375, 2.0673828125, 2.687744140625, 3.30810546875, 3.928466796875, 4.548828125, 5.169189453125, 5.78955078125, 6.409912109375, 7.0302734375, 7.650634765625, 8.27099609375, 8.891357421875, 9.51171875, 10.132080078125, 10.75244140625, 11.372802734375, 11.9931640625, 12.613525390625, 13.23388671875, 13.854248046875, 14.474609375, 15.094970703125, 15.71533203125, 16.335693359375, 16.9560546875, 17.576416015625, 18.19677734375, 18.817138671875, 19.4375]}, "gradients/encoder.adapter.layers.1.conv.weight": {"_type": "histogram", "values": [3.0, 3.0, 6.0, 9.0, 3.0, 6.0, 8.0, 24.0, 34.0, 63.0, 78.0, 75.0, 134.0, 156.0, 222.0, 316.0, 469.0, 674.0, 952.0, 1501.0, 2205.0, 3290.0, 5176.0, 7886.0, 12432.0, 19511.0, 31707.0, 52588.0, 89597.0, 162112.0, 342199.0, 2764134.0, 2072645.0, 335145.0, 159920.0, 88327.0, 51990.0, 31540.0, 19313.0, 12145.0, 7853.0, 4808.0, 3235.0, 2347.0, 1457.0, 996.0, 634.0, 509.0, 271.0, 238.0, 163.0, 91.0, 67.0, 62.0, 27.0, 32.0, 10.0, 17.0, 6.0, 13.0, 8.0, 8.0, 3.0, 3.0], "bins": [-9.7265625, -9.4224853515625, -9.118408203125, -8.8143310546875, -8.51025390625, -8.2061767578125, -7.902099609375, -7.5980224609375, -7.2939453125, -6.9898681640625, -6.685791015625, -6.3817138671875, -6.07763671875, -5.7735595703125, -5.469482421875, -5.1654052734375, -4.861328125, -4.5572509765625, -4.253173828125, -3.9490966796875, -3.64501953125, -3.3409423828125, -3.036865234375, -2.7327880859375, -2.4287109375, -2.1246337890625, -1.820556640625, -1.5164794921875, -1.21240234375, -0.9083251953125, -0.604248046875, -0.3001708984375, 0.00390625, 0.3079833984375, 0.612060546875, 0.9161376953125, 1.22021484375, 1.5242919921875, 1.828369140625, 2.1324462890625, 2.4365234375, 2.7406005859375, 3.044677734375, 3.3487548828125, 3.65283203125, 3.9569091796875, 4.260986328125, 4.5650634765625, 4.869140625, 5.1732177734375, 5.477294921875, 5.7813720703125, 6.08544921875, 6.3895263671875, 6.693603515625, 6.9976806640625, 7.3017578125, 7.6058349609375, 7.909912109375, 8.2139892578125, 8.51806640625, 8.8221435546875, 9.126220703125, 9.4302978515625, 9.734375]}, "gradients/encoder.adapter.layers.1.conv.bias": {"_type": "histogram", "values": [2.0, 2.0, 0.0, 1.0, 1.0, 3.0, 2.0, 2.0, 6.0, 3.0, 9.0, 8.0, 8.0, 12.0, 20.0, 17.0, 18.0, 19.0, 30.0, 32.0, 34.0, 47.0, 40.0, 41.0, 64.0, 90.0, 178.0, 520.0, 309.0, 95.0, 66.0, 44.0, 30.0, 40.0, 36.0, 30.0, 31.0, 24.0, 21.0, 15.0, 16.0, 17.0, 14.0, 14.0, 8.0, 6.0, 3.0, 4.0, 1.0, 3.0, 2.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-11.8046875, -11.380615234375, -10.95654296875, -10.532470703125, -10.1083984375, -9.684326171875, -9.26025390625, -8.836181640625, -8.412109375, -7.988037109375, -7.56396484375, -7.139892578125, -6.7158203125, -6.291748046875, -5.86767578125, -5.443603515625, -5.01953125, -4.595458984375, -4.17138671875, -3.747314453125, -3.3232421875, -2.899169921875, -2.47509765625, -2.051025390625, -1.626953125, -1.202880859375, -0.77880859375, -0.354736328125, 0.0693359375, 0.493408203125, 0.91748046875, 1.341552734375, 1.765625, 2.189697265625, 2.61376953125, 3.037841796875, 3.4619140625, 3.885986328125, 4.31005859375, 4.734130859375, 5.158203125, 5.582275390625, 6.00634765625, 6.430419921875, 6.8544921875, 7.278564453125, 7.70263671875, 8.126708984375, 8.55078125, 8.974853515625, 9.39892578125, 9.822998046875, 10.2470703125, 10.671142578125, 11.09521484375, 11.519287109375, 11.943359375, 12.367431640625, 12.79150390625, 13.215576171875, 13.6396484375, 14.063720703125, 14.48779296875, 14.911865234375, 15.3359375]}, "gradients/encoder.adapter.layers.0.conv.weight": {"_type": "histogram", "values": [5.0, 4.0, 3.0, 7.0, 5.0, 0.0, 6.0, 1.0, 10.0, 20.0, 19.0, 32.0, 26.0, 43.0, 69.0, 79.0, 103.0, 157.0, 188.0, 248.0, 289.0, 464.0, 655.0, 832.0, 1157.0, 1852.0, 2832.0, 4815.0, 9100.0, 20385.0, 56520.0, 538635.0, 5549320.0, 59237.0, 20954.0, 9308.0, 4911.0, 2855.0, 1863.0, 1170.0, 889.0, 629.0, 435.0, 301.0, 259.0, 184.0, 118.0, 132.0, 47.0, 65.0, 52.0, 61.0, 25.0, 4.0, 18.0, 14.0, 5.0, 13.0, 8.0, 9.0, 6.0, 0.0, 0.0, 3.0], "bins": [-28.328125, -27.443359375, -26.55859375, -25.673828125, -24.7890625, -23.904296875, -23.01953125, -22.134765625, -21.25, -20.365234375, -19.48046875, -18.595703125, -17.7109375, -16.826171875, -15.94140625, -15.056640625, -14.171875, -13.287109375, -12.40234375, -11.517578125, -10.6328125, -9.748046875, -8.86328125, -7.978515625, -7.09375, -6.208984375, -5.32421875, -4.439453125, -3.5546875, -2.669921875, -1.78515625, -0.900390625, -0.015625, 0.869140625, 1.75390625, 2.638671875, 3.5234375, 4.408203125, 5.29296875, 6.177734375, 7.0625, 7.947265625, 8.83203125, 9.716796875, 10.6015625, 11.486328125, 12.37109375, 13.255859375, 14.140625, 15.025390625, 15.91015625, 16.794921875, 17.6796875, 18.564453125, 19.44921875, 20.333984375, 21.21875, 22.103515625, 22.98828125, 23.873046875, 24.7578125, 25.642578125, 26.52734375, 27.412109375, 28.296875]}, "gradients/encoder.adapter.layers.0.conv.bias": {"_type": "histogram", "values": [2.0, 0.0, 3.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 6.0, 3.0, 4.0, 3.0, 2.0, 8.0, 10.0, 10.0, 14.0, 12.0, 22.0, 14.0, 30.0, 27.0, 31.0, 32.0, 36.0, 39.0, 49.0, 56.0, 96.0, 160.0, 266.0, 438.0, 152.0, 93.0, 63.0, 46.0, 34.0, 37.0, 30.0, 37.0, 28.0, 28.0, 23.0, 14.0, 17.0, 9.0, 16.0, 8.0, 11.0, 3.0, 6.0, 4.0, 2.0, 2.0, 4.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-9.9375, -9.6290283203125, -9.320556640625, -9.0120849609375, -8.70361328125, -8.3951416015625, -8.086669921875, -7.7781982421875, -7.4697265625, -7.1612548828125, -6.852783203125, -6.5443115234375, -6.23583984375, -5.9273681640625, -5.618896484375, -5.3104248046875, -5.001953125, -4.6934814453125, -4.385009765625, -4.0765380859375, -3.76806640625, -3.4595947265625, -3.151123046875, -2.8426513671875, -2.5341796875, -2.2257080078125, -1.917236328125, -1.6087646484375, -1.30029296875, -0.9918212890625, -0.683349609375, -0.3748779296875, -0.06640625, 0.2420654296875, 0.550537109375, 0.8590087890625, 1.16748046875, 1.4759521484375, 1.784423828125, 2.0928955078125, 2.4013671875, 2.7098388671875, 3.018310546875, 3.3267822265625, 3.63525390625, 3.9437255859375, 4.252197265625, 4.5606689453125, 4.869140625, 5.1776123046875, 5.486083984375, 5.7945556640625, 6.10302734375, 6.4114990234375, 6.719970703125, 7.0284423828125, 7.3369140625, 7.6453857421875, 7.953857421875, 8.2623291015625, 8.57080078125, 8.8792724609375, 9.187744140625, 9.4962158203125, 9.8046875]}, "gradients/encoder.encoder.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 4.0, 3.0, 4.0, 12.0, 10.0, 12.0, 52.0, 109.0, 408.0, 281.0, 61.0, 32.0, 9.0, 6.0, 4.0, 2.0, 1.0, 1.0, 2.0], "bins": [-97.36585998535156, -95.55780792236328, -93.749755859375, -91.94171142578125, -90.13365936279297, -88.32560729980469, -86.5175552368164, -84.70951080322266, -82.90145874023438, -81.0934066772461, -79.28535461425781, -77.47731018066406, -75.66925811767578, -73.8612060546875, -72.05315399169922, -70.24510955810547, -68.43705749511719, -66.6290054321289, -64.82095336914062, -63.01290512084961, -61.204856872558594, -59.39680480957031, -57.5887565612793, -55.780704498291016, -53.972652435302734, -52.16460037231445, -50.35655212402344, -48.548500061035156, -46.74045181274414, -44.93239974975586, -43.124351501464844, -41.31629943847656, -39.50825500488281, -37.70020294189453, -35.892154693603516, -34.084102630615234, -32.27605438232422, -30.468002319335938, -28.659954071044922, -26.85190200805664, -25.04384994506836, -23.23579978942871, -21.427749633789062, -19.619699478149414, -17.811649322509766, -16.003597259521484, -14.195548057556152, -12.387497901916504, -10.579448699951172, -8.771398544311523, -6.963348388671875, -5.155297756195068, -3.34724760055542, -1.5391969680786133, 0.26885318756103516, 2.0769033432006836, 3.884953498840332, 5.6930036544799805, 7.501053810119629, 9.309104919433594, 11.117155075073242, 12.92520523071289, 14.733255386352539, 16.541305541992188, 18.349355697631836]}, "gradients/encoder.encoder.layer_norm.bias": {"_type": "histogram", "values": [1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 6.0, 8.0, 7.0, 4.0, 6.0, 2.0, 8.0, 10.0, 10.0, 16.0, 20.0, 19.0, 18.0, 26.0, 27.0, 21.0, 31.0, 32.0, 28.0, 34.0, 42.0, 38.0, 33.0, 37.0, 39.0, 36.0, 25.0, 37.0, 35.0, 30.0, 25.0, 31.0, 36.0, 27.0, 25.0, 35.0, 18.0, 22.0, 16.0, 13.0, 13.0, 9.0, 12.0, 7.0, 8.0, 8.0, 7.0, 5.0, 7.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-12.808023452758789, -12.405165672302246, -12.002306938171387, -11.599449157714844, -11.196590423583984, -10.793732643127441, -10.390874862670898, -9.988016128540039, -9.585158348083496, -9.182300567626953, -8.779441833496094, -8.37658405303955, -7.97372579574585, -7.570867538452148, -7.1680097579956055, -6.765151500701904, -6.362293243408203, -5.959434986114502, -5.556576728820801, -5.153718948364258, -4.750860691070557, -4.3480024337768555, -3.9451444149017334, -3.5422863960266113, -3.13942813873291, -2.736569881439209, -2.333711862564087, -1.9308537244796753, -1.5279955863952637, -1.1251373291015625, -0.7222793102264404, -0.31942129135131836, 0.08343791961669922, 0.48629605770111084, 0.8891541957855225, 1.292012333869934, 1.6948704719543457, 2.097728729248047, 2.500586748123169, 2.903444766998291, 3.306303024291992, 3.7091612815856934, 4.1120195388793945, 4.5148773193359375, 4.917735576629639, 5.32059383392334, 5.723451614379883, 6.126309871673584, 6.529168128967285, 6.932026386260986, 7.3348846435546875, 7.7377424240112305, 8.140600204467773, 8.543458938598633, 8.946316719055176, 9.349174499511719, 9.752033233642578, 10.154891014099121, 10.55774974822998, 10.960607528686523, 11.363466262817383, 11.766324043273926, 12.169181823730469, 12.572040557861328, 12.974898338317871]}, "gradients/encoder.encoder.layers.23.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 1.0, 1.0, 4.0, 3.0, 2.0, 5.0, 0.0, 9.0, 7.0, 8.0, 10.0, 8.0, 10.0, 17.0, 21.0, 27.0, 45.0, 48.0, 86.0, 86.0, 119.0, 189.0, 250.0, 334.0, 531.0, 755.0, 1150.0, 1970.0, 3235.0, 6076.0, 14326.0, 72974.0, 4032292.0, 36278.0, 10578.0, 4884.0, 2810.0, 1676.0, 1063.0, 710.0, 516.0, 310.0, 284.0, 146.0, 118.0, 79.0, 61.0, 38.0, 37.0, 22.0, 19.0, 12.0, 12.0, 16.0, 10.0, 4.0, 5.0, 4.0, 3.0, 4.0, 3.0], "bins": [-0.0377197265625, -0.036652565002441406, -0.03558540344238281, -0.03451824188232422, -0.033451080322265625, -0.03238391876220703, -0.03131675720214844, -0.030249595642089844, -0.02918243408203125, -0.028115272521972656, -0.027048110961914062, -0.02598094940185547, -0.024913787841796875, -0.02384662628173828, -0.022779464721679688, -0.021712303161621094, -0.0206451416015625, -0.019577980041503906, -0.018510818481445312, -0.01744365692138672, -0.016376495361328125, -0.015309333801269531, -0.014242172241210938, -0.013175010681152344, -0.01210784912109375, -0.011040687561035156, -0.009973526000976562, -0.008906364440917969, -0.007839202880859375, -0.006772041320800781, -0.0057048797607421875, -0.004637718200683594, -0.003570556640625, -0.0025033950805664062, -0.0014362335205078125, -0.00036907196044921875, 0.000698089599609375, 0.0017652511596679688, 0.0028324127197265625, 0.0038995742797851562, 0.00496673583984375, 0.006033897399902344, 0.0071010589599609375, 0.008168220520019531, 0.009235382080078125, 0.010302543640136719, 0.011369705200195312, 0.012436866760253906, 0.0135040283203125, 0.014571189880371094, 0.015638351440429688, 0.01670551300048828, 0.017772674560546875, 0.01883983612060547, 0.019906997680664062, 0.020974159240722656, 0.02204132080078125, 0.023108482360839844, 0.024175643920898438, 0.02524280548095703, 0.026309967041015625, 0.02737712860107422, 0.028444290161132812, 0.029511451721191406, 0.03057861328125]}, "gradients/encoder.encoder.layers.23.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 3.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 3.0, 2.0, 4.0, 0.0, 4.0, 3.0, 9.0, 4.0, 7.0, 2.0, 5.0, 3.0, 13.0, 16.0, 26.0, 290.0, 457.0, 66.0, 16.0, 9.0, 12.0, 9.0, 7.0, 9.0, 4.0, 5.0, 4.0, 4.0, 1.0, 3.0, 1.0, 4.0, 1.0, 1.0, 2.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.002956390380859375, -0.0028515756130218506, -0.002746760845184326, -0.0026419460773468018, -0.0025371313095092773, -0.002432316541671753, -0.0023275017738342285, -0.002222687005996704, -0.0021178722381591797, -0.0020130574703216553, -0.0019082427024841309, -0.0018034279346466064, -0.001698613166809082, -0.0015937983989715576, -0.0014889836311340332, -0.0013841688632965088, -0.0012793540954589844, -0.00117453932762146, -0.0010697245597839355, -0.0009649097919464111, -0.0008600950241088867, -0.0007552802562713623, -0.0006504654884338379, -0.0005456507205963135, -0.00044083595275878906, -0.00033602118492126465, -0.00023120641708374023, -0.00012639164924621582, -2.1576881408691406e-05, 8.323788642883301e-05, 0.00018805265426635742, 0.00029286742210388184, 0.00039768218994140625, 0.0005024969577789307, 0.0006073117256164551, 0.0007121264934539795, 0.0008169412612915039, 0.0009217560291290283, 0.0010265707969665527, 0.0011313855648040771, 0.0012362003326416016, 0.001341015100479126, 0.0014458298683166504, 0.0015506446361541748, 0.0016554594039916992, 0.0017602741718292236, 0.001865088939666748, 0.0019699037075042725, 0.002074718475341797, 0.0021795332431793213, 0.0022843480110168457, 0.00238916277885437, 0.0024939775466918945, 0.002598792314529419, 0.0027036070823669434, 0.0028084218502044678, 0.002913236618041992, 0.0030180513858795166, 0.003122866153717041, 0.0032276809215545654, 0.00333249568939209, 0.0034373104572296143, 0.0035421252250671387, 0.003646939992904663, 0.0037517547607421875]}, "gradients/encoder.encoder.layers.23.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 2.0, 1.0, 0.0, 2.0, 1.0, 1.0, 3.0, 2.0, 4.0, 7.0, 1.0, 5.0, 9.0, 13.0, 23.0, 23.0, 42.0, 62.0, 99.0, 152.0, 224.0, 377.0, 708.0, 1522.0, 4200.0, 19221.0, 978924.0, 3159763.0, 21178.0, 4285.0, 1647.0, 705.0, 393.0, 219.0, 151.0, 110.0, 74.0, 46.0, 35.0, 21.0, 14.0, 4.0, 3.0, 5.0, 2.0, 6.0, 3.0, 2.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0638427734375, -0.06178474426269531, -0.059726715087890625, -0.05766868591308594, -0.05561065673828125, -0.05355262756347656, -0.051494598388671875, -0.04943656921386719, -0.0473785400390625, -0.04532051086425781, -0.043262481689453125, -0.04120445251464844, -0.03914642333984375, -0.03708839416503906, -0.035030364990234375, -0.03297233581542969, -0.030914306640625, -0.028856277465820312, -0.026798248291015625, -0.024740219116210938, -0.02268218994140625, -0.020624160766601562, -0.018566131591796875, -0.016508102416992188, -0.0144500732421875, -0.012392044067382812, -0.010334014892578125, -0.008275985717773438, -0.00621795654296875, -0.0041599273681640625, -0.002101898193359375, -4.38690185546875e-05, 0.00201416015625, 0.0040721893310546875, 0.006130218505859375, 0.008188247680664062, 0.01024627685546875, 0.012304306030273438, 0.014362335205078125, 0.016420364379882812, 0.0184783935546875, 0.020536422729492188, 0.022594451904296875, 0.024652481079101562, 0.02671051025390625, 0.028768539428710938, 0.030826568603515625, 0.03288459777832031, 0.034942626953125, 0.03700065612792969, 0.039058685302734375, 0.04111671447753906, 0.04317474365234375, 0.04523277282714844, 0.047290802001953125, 0.04934883117675781, 0.0514068603515625, 0.05346488952636719, 0.055522918701171875, 0.05758094787597656, 0.05963897705078125, 0.06169700622558594, 0.06375503540039062, 0.06581306457519531, 0.06787109375]}, "gradients/encoder.encoder.layers.23.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 7.0, 3.0, 11.0, 8.0, 13.0, 9.0, 22.0, 27.0, 26.0, 42.0, 55.0, 58.0, 86.0, 119.0, 170.0, 429.0, 2025.0, 387.0, 178.0, 91.0, 62.0, 57.0, 44.0, 35.0, 23.0, 21.0, 25.0, 9.0, 10.0, 8.0, 7.0, 6.0, 4.0, 4.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.006290435791015625, -0.0060964226722717285, -0.005902409553527832, -0.0057083964347839355, -0.005514383316040039, -0.005320370197296143, -0.005126357078552246, -0.00493234395980835, -0.004738330841064453, -0.004544317722320557, -0.00435030460357666, -0.004156291484832764, -0.003962278366088867, -0.0037682652473449707, -0.0035742521286010742, -0.0033802390098571777, -0.0031862258911132812, -0.0029922127723693848, -0.0027981996536254883, -0.002604186534881592, -0.0024101734161376953, -0.002216160297393799, -0.0020221471786499023, -0.0018281340599060059, -0.0016341209411621094, -0.0014401078224182129, -0.0012460947036743164, -0.00105208158493042, -0.0008580684661865234, -0.000664055347442627, -0.00047004222869873047, -0.000276029109954834, -8.20159912109375e-05, 0.00011199712753295898, 0.00030601024627685547, 0.000500023365020752, 0.0006940364837646484, 0.0008880496025085449, 0.0010820627212524414, 0.0012760758399963379, 0.0014700889587402344, 0.0016641020774841309, 0.0018581151962280273, 0.002052128314971924, 0.0022461414337158203, 0.002440154552459717, 0.0026341676712036133, 0.0028281807899475098, 0.0030221939086914062, 0.0032162070274353027, 0.0034102201461791992, 0.0036042332649230957, 0.003798246383666992, 0.003992259502410889, 0.004186272621154785, 0.004380285739898682, 0.004574298858642578, 0.004768311977386475, 0.004962325096130371, 0.005156338214874268, 0.005350351333618164, 0.0055443644523620605, 0.005738377571105957, 0.0059323906898498535, 0.00612640380859375]}, "gradients/encoder.encoder.layers.23.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 3.0, 14.0, 21.0, 93.0, 662.0, 170.0, 25.0, 12.0, 6.0, 3.0, 2.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.21538019180297852, -0.21094518899917603, -0.20651017129421234, -0.20207516849040985, -0.19764015078544617, -0.19320514798164368, -0.1887701451778412, -0.1843351274728775, -0.179900124669075, -0.17546512186527252, -0.17103010416030884, -0.16659510135650635, -0.16216008365154266, -0.15772508084774017, -0.1532900631427765, -0.148855060338974, -0.1444200575351715, -0.13998505473136902, -0.13555003702640533, -0.13111503422260284, -0.12668001651763916, -0.12224501371383667, -0.11781000345945358, -0.1133749932050705, -0.10893997550010681, -0.10450496524572372, -0.10006995499134064, -0.09563495218753815, -0.09119994193315506, -0.08676493167877197, -0.08232992142438889, -0.0778949111700058, -0.07345990091562271, -0.06902489066123962, -0.06458988040685654, -0.06015487387776375, -0.05571986734867096, -0.05128485709428787, -0.046849846839904785, -0.0424148365855217, -0.03797983005642891, -0.03354481980204582, -0.029109813272953033, -0.024674803018569946, -0.02023979462683201, -0.01580478623509407, -0.011369775980710983, -0.006934767588973045, -0.0024997591972351074, 0.0019352496601641178, 0.006370258517563343, 0.010805267840623856, 0.015240276232361794, 0.01967528462409973, 0.02411029487848282, 0.028545303270220757, 0.032980311661958694, 0.03741532191634178, 0.04185032844543457, 0.04628533869981766, 0.050720348954200745, 0.05515535548329353, 0.05959036573767662, 0.06402537226676941, 0.0684603825211525]}, "gradients/encoder.encoder.layers.23.final_layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 6.0, 1.0, 3.0, 2.0, 5.0, 3.0, 7.0, 11.0, 4.0, 14.0, 13.0, 14.0, 14.0, 26.0, 26.0, 38.0, 41.0, 29.0, 44.0, 42.0, 45.0, 62.0, 68.0, 57.0, 65.0, 58.0, 47.0, 40.0, 48.0, 29.0, 30.0, 22.0, 23.0, 14.0, 9.0, 10.0, 13.0, 11.0, 2.0, 3.0, 0.0, 4.0, 1.0, 1.0, 2.0, 3.0, 3.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.020285427570343018, -0.0196320042014122, -0.018978580832481384, -0.018325157463550568, -0.01767173409461975, -0.017018310725688934, -0.016364887356758118, -0.0157114639878273, -0.015058040618896484, -0.014404617249965668, -0.013751193881034851, -0.013097770512104034, -0.012444347143173218, -0.011790923774242401, -0.011137500405311584, -0.010484077036380768, -0.009830653667449951, -0.009177230298519135, -0.008523806929588318, -0.007870383560657501, -0.007216960191726685, -0.006563536822795868, -0.005910113453865051, -0.005256690084934235, -0.004603266716003418, -0.003949843347072601, -0.0032964199781417847, -0.002642996609210968, -0.0019895732402801514, -0.0013361498713493347, -0.0006827265024185181, -2.9303133487701416e-05, 0.0006241202354431152, 0.0012775436043739319, 0.0019309669733047485, 0.002584390342235565, 0.003237813711166382, 0.0038912370800971985, 0.004544660449028015, 0.005198083817958832, 0.0058515071868896484, 0.006504930555820465, 0.007158353924751282, 0.007811777293682098, 0.008465200662612915, 0.009118624031543732, 0.009772047400474548, 0.010425470769405365, 0.011078894138336182, 0.011732317507266998, 0.012385740876197815, 0.013039164245128632, 0.013692587614059448, 0.014346010982990265, 0.014999434351921082, 0.015652857720851898, 0.016306281089782715, 0.01695970445871353, 0.017613127827644348, 0.018266551196575165, 0.01891997456550598, 0.019573397934436798, 0.020226821303367615, 0.02088024467229843, 0.021533668041229248]}, "gradients/encoder.encoder.layers.23.attention.out_proj.weight": {"_type": "histogram", "values": [3.0, 0.0, 0.0, 6.0, 4.0, 4.0, 7.0, 16.0, 15.0, 12.0, 23.0, 35.0, 33.0, 43.0, 70.0, 56.0, 113.0, 139.0, 158.0, 212.0, 322.0, 470.0, 669.0, 847.0, 1203.0, 1719.0, 2547.0, 3927.0, 6316.0, 10899.0, 24390.0, 730179.0, 216044.0, 20462.0, 9979.0, 5828.0, 3690.0, 2404.0, 1570.0, 1112.0, 802.0, 612.0, 401.0, 304.0, 230.0, 140.0, 124.0, 110.0, 63.0, 59.0, 49.0, 33.0, 32.0, 27.0, 14.0, 17.0, 12.0, 7.0, 7.0, 1.0, 3.0, 2.0, 0.0, 1.0], "bins": [-0.033294677734375, -0.03225135803222656, -0.031208038330078125, -0.030164718627929688, -0.02912139892578125, -0.028078079223632812, -0.027034759521484375, -0.025991439819335938, -0.0249481201171875, -0.023904800415039062, -0.022861480712890625, -0.021818161010742188, -0.02077484130859375, -0.019731521606445312, -0.018688201904296875, -0.017644882202148438, -0.0166015625, -0.015558242797851562, -0.014514923095703125, -0.013471603393554688, -0.01242828369140625, -0.011384963989257812, -0.010341644287109375, -0.009298324584960938, -0.0082550048828125, -0.0072116851806640625, -0.006168365478515625, -0.0051250457763671875, -0.00408172607421875, -0.0030384063720703125, -0.001995086669921875, -0.0009517669677734375, 9.1552734375e-05, 0.0011348724365234375, 0.002178192138671875, 0.0032215118408203125, 0.00426483154296875, 0.0053081512451171875, 0.006351470947265625, 0.0073947906494140625, 0.0084381103515625, 0.009481430053710938, 0.010524749755859375, 0.011568069458007812, 0.01261138916015625, 0.013654708862304688, 0.014698028564453125, 0.015741348266601562, 0.01678466796875, 0.017827987670898438, 0.018871307373046875, 0.019914627075195312, 0.02095794677734375, 0.022001266479492188, 0.023044586181640625, 0.024087905883789062, 0.0251312255859375, 0.026174545288085938, 0.027217864990234375, 0.028261184692382812, 0.02930450439453125, 0.030347824096679688, 0.031391143798828125, 0.03243446350097656, 0.033477783203125]}, "gradients/encoder.encoder.layers.23.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 1.0, 5.0, 3.0, 0.0, 2.0, 3.0, 6.0, 5.0, 5.0, 5.0, 5.0, 3.0, 6.0, 9.0, 18.0, 36.0, 299.0, 404.0, 92.0, 24.0, 6.0, 11.0, 10.0, 4.0, 8.0, 8.0, 5.0, 3.0, 5.0, 4.0, 1.0, 2.0, 2.0, 2.0, 1.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0029125213623046875, -0.0028128623962402344, -0.0027132034301757812, -0.002613544464111328, -0.002513885498046875, -0.002414226531982422, -0.0023145675659179688, -0.0022149085998535156, -0.0021152496337890625, -0.0020155906677246094, -0.0019159317016601562, -0.0018162727355957031, -0.00171661376953125, -0.0016169548034667969, -0.0015172958374023438, -0.0014176368713378906, -0.0013179779052734375, -0.0012183189392089844, -0.0011186599731445312, -0.0010190010070800781, -0.000919342041015625, -0.0008196830749511719, -0.0007200241088867188, -0.0006203651428222656, -0.0005207061767578125, -0.0004210472106933594, -0.00032138824462890625, -0.00022172927856445312, -0.0001220703125, -2.2411346435546875e-05, 7.724761962890625e-05, 0.00017690658569335938, 0.0002765655517578125, 0.0003762245178222656, 0.00047588348388671875, 0.0005755424499511719, 0.000675201416015625, 0.0007748603820800781, 0.0008745193481445312, 0.0009741783142089844, 0.0010738372802734375, 0.0011734962463378906, 0.0012731552124023438, 0.0013728141784667969, 0.00147247314453125, 0.0015721321105957031, 0.0016717910766601562, 0.0017714500427246094, 0.0018711090087890625, 0.0019707679748535156, 0.0020704269409179688, 0.002170085906982422, 0.002269744873046875, 0.002369403839111328, 0.0024690628051757812, 0.0025687217712402344, 0.0026683807373046875, 0.0027680397033691406, 0.0028676986694335938, 0.002967357635498047, 0.0030670166015625, 0.003166675567626953, 0.0032663345336914062, 0.0033659934997558594, 0.0034656524658203125]}, "gradients/encoder.encoder.layers.23.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 3.0, 4.0, 6.0, 19.0, 14.0, 25.0, 27.0, 40.0, 42.0, 55.0, 86.0, 126.0, 167.0, 284.0, 360.0, 487.0, 783.0, 1251.0, 2173.0, 4344.0, 9579.0, 34829.0, 756174.0, 201702.0, 20440.0, 7062.0, 3245.0, 1861.0, 1129.0, 697.0, 443.0, 329.0, 230.0, 148.0, 104.0, 72.0, 64.0, 31.0, 35.0, 26.0, 17.0, 15.0, 11.0, 6.0, 3.0, 2.0, 2.0, 3.0, 1.0, 2.0, 4.0, 2.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.0511474609375, -0.04948854446411133, -0.047829627990722656, -0.046170711517333984, -0.04451179504394531, -0.04285287857055664, -0.04119396209716797, -0.0395350456237793, -0.037876129150390625, -0.03621721267700195, -0.03455829620361328, -0.03289937973022461, -0.031240463256835938, -0.029581546783447266, -0.027922630310058594, -0.026263713836669922, -0.02460479736328125, -0.022945880889892578, -0.021286964416503906, -0.019628047943115234, -0.017969131469726562, -0.01631021499633789, -0.014651298522949219, -0.012992382049560547, -0.011333465576171875, -0.009674549102783203, -0.008015632629394531, -0.006356716156005859, -0.0046977996826171875, -0.0030388832092285156, -0.0013799667358398438, 0.0002789497375488281, 0.0019378662109375, 0.003596782684326172, 0.005255699157714844, 0.006914615631103516, 0.008573532104492188, 0.01023244857788086, 0.011891365051269531, 0.013550281524658203, 0.015209197998046875, 0.016868114471435547, 0.01852703094482422, 0.02018594741821289, 0.021844863891601562, 0.023503780364990234, 0.025162696838378906, 0.026821613311767578, 0.02848052978515625, 0.030139446258544922, 0.031798362731933594, 0.033457279205322266, 0.03511619567871094, 0.03677511215209961, 0.03843402862548828, 0.04009294509887695, 0.041751861572265625, 0.0434107780456543, 0.04506969451904297, 0.04672861099243164, 0.04838752746582031, 0.050046443939208984, 0.051705360412597656, 0.05336427688598633, 0.055023193359375]}, "gradients/encoder.encoder.layers.23.attention.v_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 2.0, 1.0, 1.0, 1.0, 3.0, 2.0, 1.0, 3.0, 6.0, 12.0, 6.0, 14.0, 9.0, 13.0, 21.0, 12.0, 20.0, 22.0, 28.0, 41.0, 29.0, 31.0, 23.0, 25.0, 26.0, 23.0, 29.0, 37.0, 26.0, 56.0, 32.0, 35.0, 23.0, 54.0, 48.0, 29.0, 34.0, 19.0, 29.0, 25.0, 20.0, 17.0, 20.0, 19.0, 17.0, 15.0, 8.0, 8.0, 9.0, 6.0, 5.0, 8.0, 3.0, 1.0, 2.0, 1.0, 3.0, 2.0, 3.0, 2.0, 1.0, 1.0], "bins": [-0.0107421875, -0.010399460792541504, -0.010056734085083008, -0.009714007377624512, -0.009371280670166016, -0.00902855396270752, -0.008685827255249023, -0.008343100547790527, -0.008000373840332031, -0.007657647132873535, -0.007314920425415039, -0.006972193717956543, -0.006629467010498047, -0.006286740303039551, -0.005944013595581055, -0.005601286888122559, -0.0052585601806640625, -0.004915833473205566, -0.00457310676574707, -0.004230380058288574, -0.003887653350830078, -0.003544926643371582, -0.003202199935913086, -0.00285947322845459, -0.0025167465209960938, -0.0021740198135375977, -0.0018312931060791016, -0.0014885663986206055, -0.0011458396911621094, -0.0008031129837036133, -0.0004603862762451172, -0.0001176595687866211, 0.000225067138671875, 0.0005677938461303711, 0.0009105205535888672, 0.0012532472610473633, 0.0015959739685058594, 0.0019387006759643555, 0.0022814273834228516, 0.0026241540908813477, 0.0029668807983398438, 0.00330960750579834, 0.003652334213256836, 0.003995060920715332, 0.004337787628173828, 0.004680514335632324, 0.00502324104309082, 0.005365967750549316, 0.0057086944580078125, 0.006051421165466309, 0.006394147872924805, 0.006736874580383301, 0.007079601287841797, 0.007422327995300293, 0.007765054702758789, 0.008107781410217285, 0.008450508117675781, 0.008793234825134277, 0.009135961532592773, 0.00947868824005127, 0.009821414947509766, 0.010164141654968262, 0.010506868362426758, 0.010849595069885254, 0.01119232177734375]}, "gradients/encoder.encoder.layers.23.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 2.0, 1.0, 3.0, 4.0, 5.0, 1.0, 1.0, 1.0, 5.0, 13.0, 16.0, 23.0, 31.0, 50.0, 110.0, 276.0, 581.0, 2483.0, 64058.0, 974817.0, 4633.0, 874.0, 265.0, 139.0, 66.0, 43.0, 16.0, 14.0, 13.0, 6.0, 5.0, 3.0, 4.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.060791015625, -0.059061527252197266, -0.05733203887939453, -0.0556025505065918, -0.05387306213378906, -0.05214357376098633, -0.050414085388183594, -0.04868459701538086, -0.046955108642578125, -0.04522562026977539, -0.043496131896972656, -0.04176664352416992, -0.04003715515136719, -0.03830766677856445, -0.03657817840576172, -0.034848690032958984, -0.03311920166015625, -0.031389713287353516, -0.02966022491455078, -0.027930736541748047, -0.026201248168945312, -0.024471759796142578, -0.022742271423339844, -0.02101278305053711, -0.019283294677734375, -0.01755380630493164, -0.015824317932128906, -0.014094829559326172, -0.012365341186523438, -0.010635852813720703, -0.008906364440917969, -0.007176876068115234, -0.0054473876953125, -0.0037178993225097656, -0.0019884109497070312, -0.0002589225769042969, 0.0014705657958984375, 0.003200054168701172, 0.004929542541503906, 0.006659030914306641, 0.008388519287109375, 0.01011800765991211, 0.011847496032714844, 0.013576984405517578, 0.015306472778320312, 0.017035961151123047, 0.01876544952392578, 0.020494937896728516, 0.02222442626953125, 0.023953914642333984, 0.02568340301513672, 0.027412891387939453, 0.029142379760742188, 0.030871868133544922, 0.032601356506347656, 0.03433084487915039, 0.036060333251953125, 0.03778982162475586, 0.039519309997558594, 0.04124879837036133, 0.04297828674316406, 0.0447077751159668, 0.04643726348876953, 0.048166751861572266, 0.049896240234375]}, "gradients/encoder.encoder.layers.23.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 3.0, 5.0, 3.0, 4.0, 7.0, 5.0, 8.0, 10.0, 14.0, 19.0, 28.0, 50.0, 85.0, 146.0, 177.0, 157.0, 102.0, 70.0, 36.0, 19.0, 18.0, 7.0, 8.0, 7.0, 5.0, 7.0, 5.0, 2.0, 5.0, 1.0, 1.0, 0.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.4139881134033203e-05, -2.3443251848220825e-05, -2.2746622562408447e-05, -2.204999327659607e-05, -2.135336399078369e-05, -2.0656734704971313e-05, -1.9960105419158936e-05, -1.9263476133346558e-05, -1.856684684753418e-05, -1.7870217561721802e-05, -1.7173588275909424e-05, -1.6476958990097046e-05, -1.5780329704284668e-05, -1.508370041847229e-05, -1.4387071132659912e-05, -1.3690441846847534e-05, -1.2993812561035156e-05, -1.2297183275222778e-05, -1.16005539894104e-05, -1.0903924703598022e-05, -1.0207295417785645e-05, -9.510666131973267e-06, -8.814036846160889e-06, -8.11740756034851e-06, -7.420778274536133e-06, -6.724148988723755e-06, -6.027519702911377e-06, -5.330890417098999e-06, -4.634261131286621e-06, -3.937631845474243e-06, -3.2410025596618652e-06, -2.5443732738494873e-06, -1.8477439880371094e-06, -1.1511147022247314e-06, -4.544854164123535e-07, 2.421438694000244e-07, 9.387731552124023e-07, 1.6354024410247803e-06, 2.332031726837158e-06, 3.028661012649536e-06, 3.725290298461914e-06, 4.421919584274292e-06, 5.11854887008667e-06, 5.815178155899048e-06, 6.511807441711426e-06, 7.208436727523804e-06, 7.905066013336182e-06, 8.60169529914856e-06, 9.298324584960938e-06, 9.994953870773315e-06, 1.0691583156585693e-05, 1.1388212442398071e-05, 1.208484172821045e-05, 1.2781471014022827e-05, 1.3478100299835205e-05, 1.4174729585647583e-05, 1.4871358871459961e-05, 1.556798815727234e-05, 1.6264617443084717e-05, 1.6961246728897095e-05, 1.7657876014709473e-05, 1.835450530052185e-05, 1.905113458633423e-05, 1.9747763872146606e-05, 2.0444393157958984e-05]}, "gradients/encoder.encoder.layers.23.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 4.0, 3.0, 4.0, 7.0, 9.0, 8.0, 20.0, 12.0, 18.0, 24.0, 34.0, 68.0, 96.0, 169.0, 395.0, 986.0, 4488.0, 312368.0, 723187.0, 4671.0, 1048.0, 379.0, 214.0, 104.0, 82.0, 53.0, 40.0, 20.0, 8.0, 10.0, 11.0, 7.0, 3.0, 7.0, 1.0, 1.0, 1.0, 3.0, 2.0, 0.0, 0.0, 3.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0859375, -0.08263397216796875, -0.0793304443359375, -0.07602691650390625, -0.072723388671875, -0.06941986083984375, -0.0661163330078125, -0.06281280517578125, -0.05950927734375, -0.05620574951171875, -0.0529022216796875, -0.04959869384765625, -0.046295166015625, -0.04299163818359375, -0.0396881103515625, -0.03638458251953125, -0.0330810546875, -0.02977752685546875, -0.0264739990234375, -0.02317047119140625, -0.019866943359375, -0.01656341552734375, -0.0132598876953125, -0.00995635986328125, -0.00665283203125, -0.00334930419921875, -4.57763671875e-05, 0.00325775146484375, 0.006561279296875, 0.00986480712890625, 0.0131683349609375, 0.01647186279296875, 0.019775390625, 0.02307891845703125, 0.0263824462890625, 0.02968597412109375, 0.032989501953125, 0.03629302978515625, 0.0395965576171875, 0.04290008544921875, 0.04620361328125, 0.04950714111328125, 0.0528106689453125, 0.05611419677734375, 0.059417724609375, 0.06272125244140625, 0.0660247802734375, 0.06932830810546875, 0.0726318359375, 0.07593536376953125, 0.0792388916015625, 0.08254241943359375, 0.085845947265625, 0.08914947509765625, 0.0924530029296875, 0.09575653076171875, 0.09906005859375, 0.10236358642578125, 0.1056671142578125, 0.10897064208984375, 0.112274169921875, 0.11557769775390625, 0.1188812255859375, 0.12218475341796875, 0.12548828125]}, "gradients/encoder.encoder.layers.23.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 2.0, 1.0, 8.0, 5.0, 7.0, 6.0, 7.0, 9.0, 9.0, 10.0, 27.0, 35.0, 39.0, 82.0, 342.0, 184.0, 81.0, 55.0, 24.0, 25.0, 10.0, 7.0, 5.0, 4.0, 3.0, 4.0, 2.0, 1.0, 3.0, 4.0, 1.0, 1.0, 4.0, 1.0, 0.0, 2.0, 3.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.015411376953125, -0.01473546028137207, -0.01405954360961914, -0.013383626937866211, -0.012707710266113281, -0.012031793594360352, -0.011355876922607422, -0.010679960250854492, -0.010004043579101562, -0.009328126907348633, -0.008652210235595703, -0.007976293563842773, -0.007300376892089844, -0.006624460220336914, -0.005948543548583984, -0.005272626876831055, -0.004596710205078125, -0.003920793533325195, -0.0032448768615722656, -0.002568960189819336, -0.0018930435180664062, -0.0012171268463134766, -0.0005412101745605469, 0.0001347064971923828, 0.0008106231689453125, 0.0014865398406982422, 0.002162456512451172, 0.0028383731842041016, 0.0035142898559570312, 0.004190206527709961, 0.004866123199462891, 0.00554203987121582, 0.00621795654296875, 0.00689387321472168, 0.007569789886474609, 0.008245706558227539, 0.008921623229980469, 0.009597539901733398, 0.010273456573486328, 0.010949373245239258, 0.011625289916992188, 0.012301206588745117, 0.012977123260498047, 0.013653039932250977, 0.014328956604003906, 0.015004873275756836, 0.015680789947509766, 0.016356706619262695, 0.017032623291015625, 0.017708539962768555, 0.018384456634521484, 0.019060373306274414, 0.019736289978027344, 0.020412206649780273, 0.021088123321533203, 0.021764039993286133, 0.022439956665039062, 0.023115873336791992, 0.023791790008544922, 0.02446770668029785, 0.02514362335205078, 0.02581954002380371, 0.02649545669555664, 0.02717137336730957, 0.0278472900390625]}, "gradients/encoder.encoder.layers.23.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 5.0, 3.0, 6.0, 3.0, 8.0, 10.0, 17.0, 66.0, 364.0, 425.0, 49.0, 13.0, 10.0, 7.0, 6.0, 2.0, 1.0, 4.0, 5.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.40105634927749634, -0.3836613595485687, -0.3662663698196411, -0.3488713800907135, -0.3314763903617859, -0.3140813708305359, -0.2966863811016083, -0.27929139137268066, -0.26189640164375305, -0.24450141191482544, -0.22710642218589783, -0.20971141755580902, -0.1923164278268814, -0.1749214380979538, -0.157526433467865, -0.14013144373893738, -0.12273645401000977, -0.10534146428108215, -0.08794646710157394, -0.07055146992206573, -0.05315648019313812, -0.03576149046421051, -0.0183664932847023, -0.0009714961051940918, 0.01642349362373352, 0.03381848707795143, 0.05121348053216934, 0.06860847771167755, 0.08600346744060516, 0.10339845716953278, 0.12079345434904099, 0.1381884515285492, 0.15558350086212158, 0.1729784905910492, 0.1903734803199768, 0.2077684849500656, 0.22516347467899323, 0.24255846440792084, 0.25995346903800964, 0.27734845876693726, 0.29474344849586487, 0.3121384382247925, 0.3295334279537201, 0.3469284176826477, 0.3643234372138977, 0.38171839714050293, 0.39911341667175293, 0.41650840640068054, 0.43390339612960815, 0.45129838585853577, 0.4686933755874634, 0.486088365316391, 0.5034833550453186, 0.5208783745765686, 0.5382733345031738, 0.5556683540344238, 0.5730633735656738, 0.5904583930969238, 0.607853353023529, 0.625248372554779, 0.6426433324813843, 0.6600383520126343, 0.6774333119392395, 0.6948283314704895, 0.7122232913970947]}, "gradients/encoder.encoder.layers.23.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 3.0, 0.0, 3.0, 4.0, 7.0, 9.0, 3.0, 6.0, 11.0, 9.0, 17.0, 35.0, 45.0, 76.0, 111.0, 127.0, 130.0, 99.0, 106.0, 66.0, 33.0, 27.0, 9.0, 20.0, 8.0, 6.0, 6.0, 4.0, 8.0, 2.0, 3.0, 2.0, 1.0, 1.0, 5.0, 2.0, 0.0, 3.0, 1.0, 1.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.2344350814819336, -0.22719219326972961, -0.21994929015636444, -0.21270640194416046, -0.20546351373195648, -0.1982206106185913, -0.19097772240638733, -0.18373483419418335, -0.17649194598197937, -0.1692490577697754, -0.16200615465641022, -0.15476326644420624, -0.14752037823200226, -0.14027747511863708, -0.1330345869064331, -0.12579169869422913, -0.11854879558086395, -0.11130589991807938, -0.1040630117058754, -0.09682011604309082, -0.08957722783088684, -0.08233433216810226, -0.07509143650531769, -0.06784854829311371, -0.06060565263032913, -0.053362760692834854, -0.046119868755340576, -0.038876973092556, -0.03163408115506172, -0.024391189217567444, -0.017148293554782867, -0.00990540161728859, -0.0026625096797943115, 0.004580383189022541, 0.011823276057839394, 0.01906616985797882, 0.0263090617954731, 0.03355195373296738, 0.04079484939575195, 0.04803774133324623, 0.05528063327074051, 0.06252352893352509, 0.06976641714572906, 0.07700931280851364, 0.08425220847129822, 0.0914950966835022, 0.09873799234628677, 0.10598088800907135, 0.11322377622127533, 0.1204666718840599, 0.12770956754684448, 0.13495245575904846, 0.14219534397125244, 0.14943823218345642, 0.1566811352968216, 0.16392402350902557, 0.17116692662239075, 0.17840981483459473, 0.1856527179479599, 0.19289560616016388, 0.20013849437236786, 0.20738139748573303, 0.214624285697937, 0.221867173910141, 0.22911006212234497]}, "gradients/encoder.encoder.layers.22.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 3.0, 3.0, 4.0, 6.0, 5.0, 2.0, 9.0, 6.0, 9.0, 11.0, 13.0, 18.0, 22.0, 29.0, 75.0, 242.0, 2997.0, 4111681.0, 77355.0, 1472.0, 139.0, 49.0, 30.0, 18.0, 14.0, 15.0, 8.0, 9.0, 6.0, 12.0, 5.0, 6.0, 1.0, 2.0, 3.0, 1.0, 0.0, 1.0, 3.0, 3.0, 2.0, 1.0, 1.0, 0.0, 1.0, 2.0], "bins": [-4.375, -4.253082275390625, -4.13116455078125, -4.009246826171875, -3.8873291015625, -3.765411376953125, -3.64349365234375, -3.521575927734375, -3.399658203125, -3.277740478515625, -3.15582275390625, -3.033905029296875, -2.9119873046875, -2.790069580078125, -2.66815185546875, -2.546234130859375, -2.42431640625, -2.302398681640625, -2.18048095703125, -2.058563232421875, -1.9366455078125, -1.814727783203125, -1.69281005859375, -1.570892333984375, -1.448974609375, -1.327056884765625, -1.20513916015625, -1.083221435546875, -0.9613037109375, -0.839385986328125, -0.71746826171875, -0.595550537109375, -0.4736328125, -0.351715087890625, -0.22979736328125, -0.107879638671875, 0.0140380859375, 0.135955810546875, 0.25787353515625, 0.379791259765625, 0.501708984375, 0.623626708984375, 0.74554443359375, 0.867462158203125, 0.9893798828125, 1.111297607421875, 1.23321533203125, 1.355133056640625, 1.47705078125, 1.598968505859375, 1.72088623046875, 1.842803955078125, 1.9647216796875, 2.086639404296875, 2.20855712890625, 2.330474853515625, 2.452392578125, 2.574310302734375, 2.69622802734375, 2.818145751953125, 2.9400634765625, 3.061981201171875, 3.18389892578125, 3.305816650390625, 3.427734375]}, "gradients/encoder.encoder.layers.22.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 4.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 1.0, 0.0, 2.0, 4.0, 1.0, 4.0, 7.0, 2.0, 5.0, 0.0, 1.0, 2.0, 6.0, 10.0, 9.0, 7.0, 8.0, 13.0, 75.0, 297.0, 333.0, 112.0, 27.0, 9.0, 13.0, 7.0, 9.0, 0.0, 5.0, 7.0, 6.0, 7.0, 3.0, 3.0, 4.0, 0.0, 1.0, 0.0, 4.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.0027332305908203125, -0.0026482045650482178, -0.002563178539276123, -0.0024781525135040283, -0.0023931264877319336, -0.002308100461959839, -0.002223074436187744, -0.0021380484104156494, -0.0020530223846435547, -0.00196799635887146, -0.0018829703330993652, -0.0017979443073272705, -0.0017129182815551758, -0.001627892255783081, -0.0015428662300109863, -0.0014578402042388916, -0.0013728141784667969, -0.0012877881526947021, -0.0012027621269226074, -0.0011177361011505127, -0.001032710075378418, -0.0009476840496063232, -0.0008626580238342285, -0.0007776319980621338, -0.0006926059722900391, -0.0006075799465179443, -0.0005225539207458496, -0.0004375278949737549, -0.00035250186920166016, -0.00026747584342956543, -0.0001824498176574707, -9.742379188537598e-05, -1.239776611328125e-05, 7.262825965881348e-05, 0.0001576542854309082, 0.00024268031120300293, 0.00032770633697509766, 0.0004127323627471924, 0.0004977583885192871, 0.0005827844142913818, 0.0006678104400634766, 0.0007528364658355713, 0.000837862491607666, 0.0009228885173797607, 0.0010079145431518555, 0.0010929405689239502, 0.001177966594696045, 0.0012629926204681396, 0.0013480186462402344, 0.001433044672012329, 0.0015180706977844238, 0.0016030967235565186, 0.0016881227493286133, 0.001773148775100708, 0.0018581748008728027, 0.0019432008266448975, 0.002028226852416992, 0.002113252878189087, 0.0021982789039611816, 0.0022833049297332764, 0.002368330955505371, 0.002453356981277466, 0.0025383830070495605, 0.0026234090328216553, 0.00270843505859375]}, "gradients/encoder.encoder.layers.22.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 6.0, 5.0, 1.0, 7.0, 9.0, 18.0, 12.0, 20.0, 20.0, 19.0, 23.0, 41.0, 43.0, 54.0, 67.0, 73.0, 89.0, 100.0, 137.0, 380.0, 5378.0, 4171286.0, 15190.0, 497.0, 153.0, 138.0, 93.0, 93.0, 60.0, 62.0, 50.0, 37.0, 37.0, 19.0, 19.0, 12.0, 9.0, 11.0, 6.0, 2.0, 6.0, 1.0, 1.0, 1.0, 2.0, 0.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.69189453125, -0.6706771850585938, -0.6494598388671875, -0.6282424926757812, -0.607025146484375, -0.5858078002929688, -0.5645904541015625, -0.5433731079101562, -0.52215576171875, -0.5009384155273438, -0.4797210693359375, -0.45850372314453125, -0.437286376953125, -0.41606903076171875, -0.3948516845703125, -0.37363433837890625, -0.3524169921875, -0.33119964599609375, -0.3099822998046875, -0.28876495361328125, -0.267547607421875, -0.24633026123046875, -0.2251129150390625, -0.20389556884765625, -0.18267822265625, -0.16146087646484375, -0.1402435302734375, -0.11902618408203125, -0.097808837890625, -0.07659149169921875, -0.0553741455078125, -0.03415679931640625, -0.012939453125, 0.00827789306640625, 0.0294952392578125, 0.05071258544921875, 0.071929931640625, 0.09314727783203125, 0.1143646240234375, 0.13558197021484375, 0.15679931640625, 0.17801666259765625, 0.1992340087890625, 0.22045135498046875, 0.241668701171875, 0.26288604736328125, 0.2841033935546875, 0.30532073974609375, 0.3265380859375, 0.34775543212890625, 0.3689727783203125, 0.39019012451171875, 0.411407470703125, 0.43262481689453125, 0.4538421630859375, 0.47505950927734375, 0.49627685546875, 0.5174942016601562, 0.5387115478515625, 0.5599288940429688, 0.581146240234375, 0.6023635864257812, 0.6235809326171875, 0.6447982788085938, 0.666015625]}, "gradients/encoder.encoder.layers.22.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0, 0.0, 3.0, 5.0, 5.0, 2.0, 5.0, 13.0, 15.0, 19.0, 16.0, 24.0, 17.0, 27.0, 36.0, 45.0, 76.0, 76.0, 93.0, 93.0, 126.0, 126.0, 219.0, 286.0, 1114.0, 414.0, 228.0, 171.0, 158.0, 144.0, 96.0, 75.0, 72.0, 55.0, 45.0, 51.0, 34.0, 19.0, 20.0, 9.0, 12.0, 12.0, 5.0, 3.0, 10.0, 1.0, 2.0, 1.0, 4.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.005359649658203125, -0.005196750164031982, -0.00503385066986084, -0.004870951175689697, -0.004708051681518555, -0.004545152187347412, -0.0043822526931762695, -0.004219353199005127, -0.004056453704833984, -0.003893554210662842, -0.0037306547164916992, -0.0035677552223205566, -0.003404855728149414, -0.0032419562339782715, -0.003079056739807129, -0.0029161572456359863, -0.0027532577514648438, -0.002590358257293701, -0.0024274587631225586, -0.002264559268951416, -0.0021016597747802734, -0.0019387602806091309, -0.0017758607864379883, -0.0016129612922668457, -0.0014500617980957031, -0.0012871623039245605, -0.001124262809753418, -0.0009613633155822754, -0.0007984638214111328, -0.0006355643272399902, -0.00047266483306884766, -0.0003097653388977051, -0.0001468658447265625, 1.6033649444580078e-05, 0.00017893314361572266, 0.00034183263778686523, 0.0005047321319580078, 0.0006676316261291504, 0.000830531120300293, 0.0009934306144714355, 0.0011563301086425781, 0.0013192296028137207, 0.0014821290969848633, 0.0016450285911560059, 0.0018079280853271484, 0.001970827579498291, 0.0021337270736694336, 0.002296626567840576, 0.0024595260620117188, 0.0026224255561828613, 0.002785325050354004, 0.0029482245445251465, 0.003111124038696289, 0.0032740235328674316, 0.0034369230270385742, 0.003599822521209717, 0.0037627220153808594, 0.003925621509552002, 0.0040885210037231445, 0.004251420497894287, 0.00441431999206543, 0.004577219486236572, 0.004740118980407715, 0.004903018474578857, 0.00506591796875]}, "gradients/encoder.encoder.layers.22.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 4.0, 16.0, 32.0, 706.0, 227.0, 21.0, 9.0, 2.0, 0.0, 1.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.13565918803215027, -0.12024404853582382, -0.10482890903949738, -0.08941377699375153, -0.07399863749742508, -0.05858349800109863, -0.04316836595535278, -0.027753226459026337, -0.01233808696269989, 0.003077050670981407, 0.018492188304662704, 0.03390732407569885, 0.0493224635720253, 0.06473760306835175, 0.0801527351140976, 0.09556787461042404, 0.11098301410675049, 0.12639814615249634, 0.14181329309940338, 0.15722842514514923, 0.17264357209205627, 0.18805870413780212, 0.20347383618354797, 0.21888896822929382, 0.23430411517620087, 0.24971924722194672, 0.26513439416885376, 0.2805495262145996, 0.29596465826034546, 0.3113797903060913, 0.32679492235183716, 0.3422100841999054, 0.35762524604797363, 0.3730403780937195, 0.38845551013946533, 0.4038706421852112, 0.4192858040332794, 0.43470093607902527, 0.4501160681247711, 0.46553120017051697, 0.4809463620185852, 0.49636149406433105, 0.5117766261100769, 0.5271917581558228, 0.5426068902015686, 0.5580220222473145, 0.5734372138977051, 0.5888523459434509, 0.6042674779891968, 0.6196826100349426, 0.6350977420806885, 0.6505128741264343, 0.6659280061721802, 0.6813431978225708, 0.6967582702636719, 0.7121734619140625, 0.7275885343551636, 0.7430036664009094, 0.7584187984466553, 0.7738339304924011, 0.789249062538147, 0.8046642541885376, 0.8200793266296387, 0.8354945182800293, 0.8509096503257751]}, "gradients/encoder.encoder.layers.22.final_layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 4.0, 5.0, 4.0, 6.0, 4.0, 5.0, 10.0, 12.0, 13.0, 19.0, 17.0, 28.0, 27.0, 35.0, 43.0, 56.0, 49.0, 52.0, 50.0, 63.0, 69.0, 69.0, 56.0, 53.0, 46.0, 41.0, 35.0, 28.0, 28.0, 17.0, 20.0, 9.0, 5.0, 9.0, 3.0, 6.0, 4.0, 2.0, 4.0, 5.0, 4.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.03824424743652344, -0.036715056747198105, -0.03518586605787277, -0.03365667909383774, -0.032127488404512405, -0.030598297715187073, -0.02906910888850689, -0.027539920061826706, -0.026010729372501373, -0.02448153868317604, -0.022952349856495857, -0.021423161029815674, -0.01989397034049034, -0.01836477965116501, -0.016835590824484825, -0.015306401066482067, -0.013777211308479309, -0.012248021550476551, -0.010718831792473793, -0.009189642034471035, -0.007660452276468277, -0.006131262518465519, -0.004602072760462761, -0.003072883002460003, -0.0015436932444572449, -1.4503486454486847e-05, 0.0015146862715482712, 0.003043876029551029, 0.004573065787553787, 0.006102255545556545, 0.007631445303559303, 0.009160635061562061, 0.01068982481956482, 0.012219014577567577, 0.013748204335570335, 0.015277394093573093, 0.01680658385157585, 0.018335774540901184, 0.019864963367581367, 0.02139415219426155, 0.022923342883586884, 0.024452533572912216, 0.0259817223995924, 0.027510911226272583, 0.029040101915597916, 0.03056929260492325, 0.03209847956895828, 0.033627670258283615, 0.03515686094760895, 0.03668605163693428, 0.03821524232625961, 0.03974442929029465, 0.04127361997961998, 0.04280281066894531, 0.04433199763298035, 0.04586118832230568, 0.04739037901163101, 0.048919569700956345, 0.05044876039028168, 0.05197794735431671, 0.053507138043642044, 0.05503632873296738, 0.05656551569700241, 0.058094706386327744, 0.059623897075653076]}, "gradients/encoder.encoder.layers.22.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 5.0, 6.0, 12.0, 7.0, 20.0, 18.0, 42.0, 62.0, 75.0, 117.0, 197.0, 267.0, 391.0, 617.0, 976.0, 1590.0, 2589.0, 4635.0, 8533.0, 18540.0, 142369.0, 812738.0, 29142.0, 11224.0, 5895.0, 3252.0, 1874.0, 1173.0, 722.0, 510.0, 306.0, 221.0, 148.0, 86.0, 77.0, 51.0, 31.0, 19.0, 16.0, 3.0, 6.0, 4.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.05938720703125, -0.05735301971435547, -0.05531883239746094, -0.053284645080566406, -0.051250457763671875, -0.049216270446777344, -0.04718208312988281, -0.04514789581298828, -0.04311370849609375, -0.04107952117919922, -0.03904533386230469, -0.037011146545410156, -0.034976959228515625, -0.032942771911621094, -0.030908584594726562, -0.02887439727783203, -0.0268402099609375, -0.02480602264404297, -0.022771835327148438, -0.020737648010253906, -0.018703460693359375, -0.016669273376464844, -0.014635086059570312, -0.012600898742675781, -0.01056671142578125, -0.008532524108886719, -0.0064983367919921875, -0.004464149475097656, -0.002429962158203125, -0.00039577484130859375, 0.0016384124755859375, 0.0036725997924804688, 0.005706787109375, 0.007740974426269531, 0.009775161743164062, 0.011809349060058594, 0.013843536376953125, 0.015877723693847656, 0.017911911010742188, 0.01994609832763672, 0.02198028564453125, 0.02401447296142578, 0.026048660278320312, 0.028082847595214844, 0.030117034912109375, 0.032151222229003906, 0.03418540954589844, 0.03621959686279297, 0.0382537841796875, 0.04028797149658203, 0.04232215881347656, 0.044356346130371094, 0.046390533447265625, 0.048424720764160156, 0.05045890808105469, 0.05249309539794922, 0.05452728271484375, 0.05656147003173828, 0.05859565734863281, 0.060629844665527344, 0.06266403198242188, 0.0646982192993164, 0.06673240661621094, 0.06876659393310547, 0.07080078125]}, "gradients/encoder.encoder.layers.22.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 2.0, 1.0, 1.0, 1.0, 3.0, 3.0, 1.0, 4.0, 2.0, 10.0, 7.0, 8.0, 9.0, 17.0, 23.0, 128.0, 323.0, 271.0, 93.0, 25.0, 18.0, 9.0, 8.0, 9.0, 9.0, 5.0, 5.0, 5.0, 2.0, 2.0, 3.0, 2.0, 0.0, 0.0, 2.0, 0.0, 2.0, 2.0], "bins": [-0.004489898681640625, -0.004385203123092651, -0.004280507564544678, -0.004175812005996704, -0.0040711164474487305, -0.003966420888900757, -0.003861725330352783, -0.0037570297718048096, -0.003652334213256836, -0.0035476386547088623, -0.0034429430961608887, -0.003338247537612915, -0.0032335519790649414, -0.0031288564205169678, -0.003024160861968994, -0.0029194653034210205, -0.002814769744873047, -0.0027100741863250732, -0.0026053786277770996, -0.002500683069229126, -0.0023959875106811523, -0.0022912919521331787, -0.002186596393585205, -0.0020819008350372314, -0.001977205276489258, -0.0018725097179412842, -0.0017678141593933105, -0.001663118600845337, -0.0015584230422973633, -0.0014537274837493896, -0.001349031925201416, -0.0012443363666534424, -0.0011396408081054688, -0.0010349452495574951, -0.0009302496910095215, -0.0008255541324615479, -0.0007208585739135742, -0.0006161630153656006, -0.000511467456817627, -0.0004067718982696533, -0.0003020763397216797, -0.00019738078117370605, -9.268522262573242e-05, 1.2010335922241211e-05, 0.00011670589447021484, 0.00022140145301818848, 0.0003260970115661621, 0.00043079257011413574, 0.0005354881286621094, 0.000640183687210083, 0.0007448792457580566, 0.0008495748043060303, 0.0009542703628540039, 0.0010589659214019775, 0.0011636614799499512, 0.0012683570384979248, 0.0013730525970458984, 0.001477748155593872, 0.0015824437141418457, 0.0016871392726898193, 0.001791834831237793, 0.0018965303897857666, 0.0020012259483337402, 0.002105921506881714, 0.0022106170654296875]}, "gradients/encoder.encoder.layers.22.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 2.0, 5.0, 4.0, 4.0, 7.0, 7.0, 10.0, 8.0, 13.0, 16.0, 23.0, 12.0, 25.0, 32.0, 26.0, 38.0, 44.0, 39.0, 75.0, 253.0, 1805.0, 38056.0, 996901.0, 9828.0, 801.0, 158.0, 60.0, 54.0, 36.0, 32.0, 33.0, 26.0, 22.0, 16.0, 18.0, 15.0, 14.0, 6.0, 15.0, 10.0, 4.0, 1.0, 3.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 2.0, 1.0, 2.0], "bins": [-0.17333984375, -0.1682891845703125, -0.163238525390625, -0.1581878662109375, -0.15313720703125, -0.1480865478515625, -0.143035888671875, -0.1379852294921875, -0.1329345703125, -0.1278839111328125, -0.122833251953125, -0.1177825927734375, -0.11273193359375, -0.1076812744140625, -0.102630615234375, -0.0975799560546875, -0.092529296875, -0.0874786376953125, -0.082427978515625, -0.0773773193359375, -0.07232666015625, -0.0672760009765625, -0.062225341796875, -0.0571746826171875, -0.0521240234375, -0.0470733642578125, -0.042022705078125, -0.0369720458984375, -0.03192138671875, -0.0268707275390625, -0.021820068359375, -0.0167694091796875, -0.01171875, -0.0066680908203125, -0.001617431640625, 0.0034332275390625, 0.00848388671875, 0.0135345458984375, 0.018585205078125, 0.0236358642578125, 0.0286865234375, 0.0337371826171875, 0.038787841796875, 0.0438385009765625, 0.04888916015625, 0.0539398193359375, 0.058990478515625, 0.0640411376953125, 0.069091796875, 0.0741424560546875, 0.079193115234375, 0.0842437744140625, 0.08929443359375, 0.0943450927734375, 0.099395751953125, 0.1044464111328125, 0.1094970703125, 0.1145477294921875, 0.119598388671875, 0.1246490478515625, 0.12969970703125, 0.1347503662109375, 0.139801025390625, 0.1448516845703125, 0.14990234375]}, "gradients/encoder.encoder.layers.22.attention.v_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 1.0, 3.0, 7.0, 6.0, 3.0, 9.0, 6.0, 6.0, 11.0, 14.0, 13.0, 22.0, 19.0, 16.0, 33.0, 23.0, 36.0, 25.0, 29.0, 34.0, 24.0, 30.0, 39.0, 49.0, 27.0, 55.0, 42.0, 38.0, 43.0, 44.0, 37.0, 36.0, 26.0, 35.0, 31.0, 12.0, 25.0, 19.0, 13.0, 7.0, 14.0, 15.0, 5.0, 8.0, 4.0, 5.0, 2.0, 2.0, 1.0, 1.0, 2.0, 2.0, 2.0, 6.0], "bins": [-0.0087127685546875, -0.00846177339553833, -0.00821077823638916, -0.00795978307723999, -0.00770878791809082, -0.00745779275894165, -0.0072067975997924805, -0.0069558024406433105, -0.006704807281494141, -0.006453812122344971, -0.006202816963195801, -0.005951821804046631, -0.005700826644897461, -0.005449831485748291, -0.005198836326599121, -0.004947841167449951, -0.004696846008300781, -0.004445850849151611, -0.004194855690002441, -0.0039438605308532715, -0.0036928653717041016, -0.0034418702125549316, -0.0031908750534057617, -0.002939879894256592, -0.002688884735107422, -0.002437889575958252, -0.002186894416809082, -0.0019358992576599121, -0.0016849040985107422, -0.0014339089393615723, -0.0011829137802124023, -0.0009319186210632324, -0.0006809234619140625, -0.0004299283027648926, -0.00017893314361572266, 7.206201553344727e-05, 0.0003230571746826172, 0.0005740523338317871, 0.000825047492980957, 0.001076042652130127, 0.0013270378112792969, 0.0015780329704284668, 0.0018290281295776367, 0.0020800232887268066, 0.0023310184478759766, 0.0025820136070251465, 0.0028330087661743164, 0.0030840039253234863, 0.0033349990844726562, 0.003585994243621826, 0.003836989402770996, 0.004087984561920166, 0.004338979721069336, 0.004589974880218506, 0.004840970039367676, 0.005091965198516846, 0.005342960357666016, 0.0055939555168151855, 0.0058449506759643555, 0.006095945835113525, 0.006346940994262695, 0.006597936153411865, 0.006848931312561035, 0.007099926471710205, 0.007350921630859375]}, "gradients/encoder.encoder.layers.22.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 2.0, 0.0, 1.0, 2.0, 3.0, 6.0, 0.0, 5.0, 4.0, 3.0, 1.0, 7.0, 8.0, 6.0, 13.0, 17.0, 42.0, 83.0, 334.0, 3488.0, 1035439.0, 8233.0, 580.0, 141.0, 50.0, 24.0, 14.0, 10.0, 7.0, 3.0, 7.0, 6.0, 2.0, 4.0, 7.0, 1.0, 3.0, 3.0, 3.0, 1.0, 1.0, 0.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0855712890625, -0.08194732666015625, -0.0783233642578125, -0.07469940185546875, -0.071075439453125, -0.06745147705078125, -0.0638275146484375, -0.06020355224609375, -0.05657958984375, -0.05295562744140625, -0.0493316650390625, -0.04570770263671875, -0.042083740234375, -0.03845977783203125, -0.0348358154296875, -0.03121185302734375, -0.027587890625, -0.02396392822265625, -0.0203399658203125, -0.01671600341796875, -0.013092041015625, -0.00946807861328125, -0.0058441162109375, -0.00222015380859375, 0.00140380859375, 0.00502777099609375, 0.0086517333984375, 0.01227569580078125, 0.015899658203125, 0.01952362060546875, 0.0231475830078125, 0.02677154541015625, 0.0303955078125, 0.03401947021484375, 0.0376434326171875, 0.04126739501953125, 0.044891357421875, 0.04851531982421875, 0.0521392822265625, 0.05576324462890625, 0.05938720703125, 0.06301116943359375, 0.0666351318359375, 0.07025909423828125, 0.073883056640625, 0.07750701904296875, 0.0811309814453125, 0.08475494384765625, 0.08837890625, 0.09200286865234375, 0.0956268310546875, 0.09925079345703125, 0.102874755859375, 0.10649871826171875, 0.1101226806640625, 0.11374664306640625, 0.11737060546875, 0.12099456787109375, 0.1246185302734375, 0.12824249267578125, 0.131866455078125, 0.13549041748046875, 0.1391143798828125, 0.14273834228515625, 0.1463623046875]}, "gradients/encoder.encoder.layers.22.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 4.0, 6.0, 1.0, 5.0, 6.0, 9.0, 4.0, 3.0, 22.0, 18.0, 34.0, 41.0, 120.0, 237.0, 235.0, 121.0, 63.0, 27.0, 15.0, 9.0, 5.0, 2.0, 4.0, 4.0, 4.0, 5.0, 2.0, 3.0, 3.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.0003268718719482422, -0.0003190934658050537, -0.00031131505966186523, -0.00030353665351867676, -0.0002957582473754883, -0.0002879798412322998, -0.00028020143508911133, -0.00027242302894592285, -0.0002646446228027344, -0.0002568662166595459, -0.0002490878105163574, -0.00024130940437316895, -0.00023353099822998047, -0.000225752592086792, -0.00021797418594360352, -0.00021019577980041504, -0.00020241737365722656, -0.00019463896751403809, -0.0001868605613708496, -0.00017908215522766113, -0.00017130374908447266, -0.00016352534294128418, -0.0001557469367980957, -0.00014796853065490723, -0.00014019012451171875, -0.00013241171836853027, -0.0001246333122253418, -0.00011685490608215332, -0.00010907649993896484, -0.00010129809379577637, -9.351968765258789e-05, -8.574128150939941e-05, -7.796287536621094e-05, -7.018446922302246e-05, -6.240606307983398e-05, -5.462765693664551e-05, -4.684925079345703e-05, -3.9070844650268555e-05, -3.129243850708008e-05, -2.35140323638916e-05, -1.5735626220703125e-05, -7.957220077514648e-06, -1.7881393432617188e-07, 7.599592208862305e-06, 1.537799835205078e-05, 2.3156404495239258e-05, 3.0934810638427734e-05, 3.871321678161621e-05, 4.649162292480469e-05, 5.4270029067993164e-05, 6.204843521118164e-05, 6.982684135437012e-05, 7.76052474975586e-05, 8.538365364074707e-05, 9.316205978393555e-05, 0.00010094046592712402, 0.0001087188720703125, 0.00011649727821350098, 0.00012427568435668945, 0.00013205409049987793, 0.0001398324966430664, 0.00014761090278625488, 0.00015538930892944336, 0.00016316771507263184, 0.0001709461212158203]}, "gradients/encoder.encoder.layers.22.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 5.0, 3.0, 2.0, 8.0, 5.0, 15.0, 18.0, 27.0, 48.0, 74.0, 219.0, 691.0, 4162.0, 955525.0, 84403.0, 2613.0, 441.0, 145.0, 62.0, 29.0, 12.0, 12.0, 7.0, 11.0, 5.0, 3.0, 4.0, 4.0, 1.0, 1.0, 2.0, 2.0, 2.0, 1.0, 2.0], "bins": [-0.1651611328125, -0.16131114959716797, -0.15746116638183594, -0.1536111831665039, -0.14976119995117188, -0.14591121673583984, -0.1420612335205078, -0.13821125030517578, -0.13436126708984375, -0.13051128387451172, -0.1266613006591797, -0.12281131744384766, -0.11896133422851562, -0.1151113510131836, -0.11126136779785156, -0.10741138458251953, -0.1035614013671875, -0.09971141815185547, -0.09586143493652344, -0.0920114517211914, -0.08816146850585938, -0.08431148529052734, -0.08046150207519531, -0.07661151885986328, -0.07276153564453125, -0.06891155242919922, -0.06506156921386719, -0.061211585998535156, -0.057361602783203125, -0.053511619567871094, -0.04966163635253906, -0.04581165313720703, -0.041961669921875, -0.03811168670654297, -0.03426170349121094, -0.030411720275878906, -0.026561737060546875, -0.022711753845214844, -0.018861770629882812, -0.015011787414550781, -0.01116180419921875, -0.007311820983886719, -0.0034618377685546875, 0.00038814544677734375, 0.004238128662109375, 0.008088111877441406, 0.011938095092773438, 0.01578807830810547, 0.0196380615234375, 0.02348804473876953, 0.027338027954101562, 0.031188011169433594, 0.035037994384765625, 0.038887977600097656, 0.04273796081542969, 0.04658794403076172, 0.05043792724609375, 0.05428791046142578, 0.05813789367675781, 0.061987876892089844, 0.06583786010742188, 0.0696878433227539, 0.07353782653808594, 0.07738780975341797, 0.08123779296875]}, "gradients/encoder.encoder.layers.22.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 4.0, 2.0, 0.0, 3.0, 3.0, 11.0, 7.0, 6.0, 14.0, 16.0, 18.0, 49.0, 68.0, 473.0, 182.0, 51.0, 38.0, 16.0, 12.0, 7.0, 6.0, 1.0, 5.0, 4.0, 3.0, 1.0, 3.0, 4.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 4.0], "bins": [-0.038604736328125, -0.0377042293548584, -0.0368037223815918, -0.035903215408325195, -0.035002708435058594, -0.03410220146179199, -0.03320169448852539, -0.03230118751525879, -0.03140068054199219, -0.030500173568725586, -0.029599666595458984, -0.028699159622192383, -0.02779865264892578, -0.02689814567565918, -0.025997638702392578, -0.025097131729125977, -0.024196624755859375, -0.023296117782592773, -0.022395610809326172, -0.02149510383605957, -0.02059459686279297, -0.019694089889526367, -0.018793582916259766, -0.017893075942993164, -0.016992568969726562, -0.01609206199645996, -0.01519155502319336, -0.014291048049926758, -0.013390541076660156, -0.012490034103393555, -0.011589527130126953, -0.010689020156860352, -0.00978851318359375, -0.008888006210327148, -0.007987499237060547, -0.007086992263793945, -0.006186485290527344, -0.005285978317260742, -0.004385471343994141, -0.003484964370727539, -0.0025844573974609375, -0.001683950424194336, -0.0007834434509277344, 0.00011706352233886719, 0.0010175704956054688, 0.0019180774688720703, 0.002818584442138672, 0.0037190914154052734, 0.004619598388671875, 0.0055201053619384766, 0.006420612335205078, 0.00732111930847168, 0.008221626281738281, 0.009122133255004883, 0.010022640228271484, 0.010923147201538086, 0.011823654174804688, 0.012724161148071289, 0.01362466812133789, 0.014525175094604492, 0.015425682067871094, 0.016326189041137695, 0.017226696014404297, 0.0181272029876709, 0.0190277099609375]}, "gradients/encoder.encoder.layers.22.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 3.0, 4.0, 11.0, 20.0, 86.0, 703.0, 147.0, 21.0, 10.0, 4.0, 4.0, 2.0, 1.0, 1.0], "bins": [-0.9651861786842346, -0.947821855545044, -0.9304575324058533, -0.9130932092666626, -0.8957288265228271, -0.8783645033836365, -0.8610001802444458, -0.8436358571052551, -0.8262715339660645, -0.8089072108268738, -0.7915428876876831, -0.7741785049438477, -0.756814181804657, -0.7394498586654663, -0.7220855355262756, -0.704721212387085, -0.6873568296432495, -0.6699925065040588, -0.6526281833648682, -0.6352638006210327, -0.617899477481842, -0.6005351543426514, -0.5831708312034607, -0.56580650806427, -0.5484421849250793, -0.5310778617858887, -0.513713538646698, -0.49634918570518494, -0.47898486256599426, -0.4616205096244812, -0.4442561864852905, -0.42689186334609985, -0.4095275104045868, -0.3921631872653961, -0.37479883432388306, -0.3574345111846924, -0.3400701880455017, -0.32270586490631104, -0.305341511964798, -0.2879771888256073, -0.27061283588409424, -0.25324851274490356, -0.2358841747045517, -0.21851983666419983, -0.20115551352500916, -0.1837911754846573, -0.16642683744430542, -0.14906251430511475, -0.13169819116592407, -0.1143338605761528, -0.09696952998638153, -0.07960519194602966, -0.06224086135625839, -0.04487653076648712, -0.027512192726135254, -0.010147862136363983, 0.007216468453407288, 0.024580800905823708, 0.04194513335824013, 0.0593094676733017, 0.07667379826307297, 0.09403812885284424, 0.1114024668931961, 0.12876680493354797, 0.14613112807273865]}, "gradients/encoder.encoder.layers.22.layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 3.0, 5.0, 4.0, 4.0, 6.0, 5.0, 8.0, 12.0, 11.0, 18.0, 18.0, 28.0, 32.0, 51.0, 52.0, 62.0, 71.0, 69.0, 74.0, 73.0, 78.0, 61.0, 55.0, 45.0, 38.0, 21.0, 21.0, 11.0, 13.0, 10.0, 8.0, 9.0, 4.0, 2.0, 4.0, 5.0, 6.0, 4.0, 1.0, 4.0, 1.0, 0.0, 3.0, 1.0, 2.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.0941152572631836, -0.09138787537813187, -0.08866048604249954, -0.08593310415744781, -0.08320571482181549, -0.08047833293676376, -0.07775095105171204, -0.07502356171607971, -0.07229617983102798, -0.06956879794597626, -0.06684140861034393, -0.0641140267252922, -0.06138664111495018, -0.058659255504608154, -0.05593187361955643, -0.0532044880092144, -0.050477102398872375, -0.04774971678853035, -0.045022331178188324, -0.0422949492931366, -0.03956756368279457, -0.036840178072452545, -0.03411279618740082, -0.03138541057705879, -0.028658024966716766, -0.02593063935637474, -0.023203255608677864, -0.020475871860980988, -0.017748486250638962, -0.01502110157161951, -0.01229371689260006, -0.009566333144903183, -0.006838947534561157, -0.004111562855541706, -0.001384178176522255, 0.0013432065024971962, 0.004070591181516647, 0.0067979758605360985, 0.00952536053955555, 0.012252744287252426, 0.014980129897594452, 0.017707515507936478, 0.020434899255633354, 0.02316228300333023, 0.025889668613672256, 0.028617054224014282, 0.03134443610906601, 0.034071821719408035, 0.03679920732975006, 0.03952659294009209, 0.04225397855043411, 0.04498136043548584, 0.047708746045827866, 0.05043613165616989, 0.05316351354122162, 0.055890899151563644, 0.05861828476190567, 0.061345670372247696, 0.06407305598258972, 0.06680043786764145, 0.06952781975269318, 0.0722552090883255, 0.07498259097337723, 0.07770997285842896, 0.08043736219406128]}, "gradients/encoder.encoder.layers.21.feed_forward.output_dense.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 2.0, 2.0, 0.0, 2.0, 0.0, 4.0, 0.0, 0.0, 6.0, 2.0, 2.0, 0.0, 10.0, 8.0, 8.0, 8.0, 6.0, 10.0, 4.0, 18.0, 16.0, 10.0, 30.0, 38.0, 66.0, 416.0, 4192941.0, 398.0, 97.0, 30.0, 26.0, 28.0, 18.0, 2.0, 8.0, 8.0, 10.0, 8.0, 8.0, 6.0, 4.0, 0.0, 0.0, 6.0, 2.0, 2.0, 8.0, 4.0, 2.0, 2.0, 4.0, 0.0, 2.0, 4.0], "bins": [-3.16796875, -3.08123779296875, -2.9945068359375, -2.90777587890625, -2.821044921875, -2.73431396484375, -2.6475830078125, -2.56085205078125, -2.47412109375, -2.38739013671875, -2.3006591796875, -2.21392822265625, -2.127197265625, -2.04046630859375, -1.9537353515625, -1.86700439453125, -1.7802734375, -1.69354248046875, -1.6068115234375, -1.52008056640625, -1.433349609375, -1.34661865234375, -1.2598876953125, -1.17315673828125, -1.08642578125, -0.99969482421875, -0.9129638671875, -0.82623291015625, -0.739501953125, -0.65277099609375, -0.5660400390625, -0.47930908203125, -0.392578125, -0.30584716796875, -0.2191162109375, -0.13238525390625, -0.045654296875, 0.04107666015625, 0.1278076171875, 0.21453857421875, 0.30126953125, 0.38800048828125, 0.4747314453125, 0.56146240234375, 0.648193359375, 0.73492431640625, 0.8216552734375, 0.90838623046875, 0.9951171875, 1.08184814453125, 1.1685791015625, 1.25531005859375, 1.342041015625, 1.42877197265625, 1.5155029296875, 1.60223388671875, 1.68896484375, 1.77569580078125, 1.8624267578125, 1.94915771484375, 2.035888671875, 2.12261962890625, 2.2093505859375, 2.29608154296875, 2.3828125]}, "gradients/encoder.encoder.layers.21.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 1.0, 1.0, 4.0, 1.0, 1.0, 0.0, 4.0, 3.0, 4.0, 3.0, 2.0, 2.0, 12.0, 5.0, 7.0, 12.0, 12.0, 24.0, 81.0, 170.0, 257.0, 193.0, 81.0, 42.0, 24.0, 4.0, 5.0, 9.0, 8.0, 10.0, 7.0, 2.0, 6.0, 4.0, 4.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 2.0, 0.0, 2.0], "bins": [-0.00286102294921875, -0.002782970666885376, -0.002704918384552002, -0.002626866102218628, -0.002548813819885254, -0.00247076153755188, -0.002392709255218506, -0.002314656972885132, -0.002236604690551758, -0.002158552408218384, -0.0020805001258850098, -0.0020024478435516357, -0.0019243955612182617, -0.0018463432788848877, -0.0017682909965515137, -0.0016902387142181396, -0.0016121864318847656, -0.0015341341495513916, -0.0014560818672180176, -0.0013780295848846436, -0.0012999773025512695, -0.0012219250202178955, -0.0011438727378845215, -0.0010658204555511475, -0.0009877681732177734, -0.0009097158908843994, -0.0008316636085510254, -0.0007536113262176514, -0.0006755590438842773, -0.0005975067615509033, -0.0005194544792175293, -0.0004414021968841553, -0.00036334991455078125, -0.0002852976322174072, -0.0002072453498840332, -0.00012919306755065918, -5.1140785217285156e-05, 2.6911497116088867e-05, 0.00010496377944946289, 0.00018301606178283691, 0.00026106834411621094, 0.00033912062644958496, 0.000417172908782959, 0.000495225191116333, 0.000573277473449707, 0.0006513297557830811, 0.0007293820381164551, 0.0008074343204498291, 0.0008854866027832031, 0.0009635388851165771, 0.0010415911674499512, 0.0011196434497833252, 0.0011976957321166992, 0.0012757480144500732, 0.0013538002967834473, 0.0014318525791168213, 0.0015099048614501953, 0.0015879571437835693, 0.0016660094261169434, 0.0017440617084503174, 0.0018221139907836914, 0.0019001662731170654, 0.0019782185554504395, 0.0020562708377838135, 0.0021343231201171875]}, "gradients/encoder.encoder.layers.21.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 2.0, 4.0, 4.0, 4.0, 4.0, 4.0, 9.0, 9.0, 11.0, 22.0, 38.0, 51.0, 69.0, 106.0, 277.0, 1360.0, 4191067.0, 766.0, 182.0, 80.0, 71.0, 38.0, 31.0, 28.0, 16.0, 13.0, 7.0, 6.0, 5.0, 2.0, 0.0, 1.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.69873046875, -0.674102783203125, -0.64947509765625, -0.624847412109375, -0.6002197265625, -0.575592041015625, -0.55096435546875, -0.526336669921875, -0.501708984375, -0.477081298828125, -0.45245361328125, -0.427825927734375, -0.4031982421875, -0.378570556640625, -0.35394287109375, -0.329315185546875, -0.3046875, -0.280059814453125, -0.25543212890625, -0.230804443359375, -0.2061767578125, -0.181549072265625, -0.15692138671875, -0.132293701171875, -0.107666015625, -0.083038330078125, -0.05841064453125, -0.033782958984375, -0.0091552734375, 0.015472412109375, 0.04010009765625, 0.064727783203125, 0.08935546875, 0.113983154296875, 0.13861083984375, 0.163238525390625, 0.1878662109375, 0.212493896484375, 0.23712158203125, 0.261749267578125, 0.286376953125, 0.311004638671875, 0.33563232421875, 0.360260009765625, 0.3848876953125, 0.409515380859375, 0.43414306640625, 0.458770751953125, 0.4833984375, 0.508026123046875, 0.53265380859375, 0.557281494140625, 0.5819091796875, 0.606536865234375, 0.63116455078125, 0.655792236328125, 0.680419921875, 0.705047607421875, 0.72967529296875, 0.754302978515625, 0.7789306640625, 0.803558349609375, 0.82818603515625, 0.852813720703125, 0.87744140625]}, "gradients/encoder.encoder.layers.21.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 5.0, 7.0, 10.0, 26.0, 77.0, 247.0, 1631.0, 1713.0, 242.0, 77.0, 34.0, 10.0, 3.0, 3.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.0275421142578125, -0.02701246738433838, -0.026482820510864258, -0.025953173637390137, -0.025423526763916016, -0.024893879890441895, -0.024364233016967773, -0.023834586143493652, -0.02330493927001953, -0.02277529239654541, -0.02224564552307129, -0.021715998649597168, -0.021186351776123047, -0.020656704902648926, -0.020127058029174805, -0.019597411155700684, -0.019067764282226562, -0.01853811740875244, -0.01800847053527832, -0.0174788236618042, -0.016949176788330078, -0.016419529914855957, -0.015889883041381836, -0.015360236167907715, -0.014830589294433594, -0.014300942420959473, -0.013771295547485352, -0.01324164867401123, -0.01271200180053711, -0.012182354927062988, -0.011652708053588867, -0.011123061180114746, -0.010593414306640625, -0.010063767433166504, -0.009534120559692383, -0.009004473686218262, -0.00847482681274414, -0.00794517993927002, -0.0074155330657958984, -0.006885886192321777, -0.006356239318847656, -0.005826592445373535, -0.005296945571899414, -0.004767298698425293, -0.004237651824951172, -0.0037080049514770508, -0.0031783580780029297, -0.0026487112045288086, -0.0021190643310546875, -0.0015894174575805664, -0.0010597705841064453, -0.0005301237106323242, -4.76837158203125e-07, 0.000529170036315918, 0.001058816909790039, 0.0015884637832641602, 0.0021181106567382812, 0.0026477575302124023, 0.0031774044036865234, 0.0037070512771606445, 0.004236698150634766, 0.004766345024108887, 0.005295991897583008, 0.005825638771057129, 0.00635528564453125]}, "gradients/encoder.encoder.layers.21.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 153.0, 847.0, 16.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.3899803161621094, -1.3622641563415527, -1.3345481157302856, -1.306831955909729, -1.279115915298462, -1.2513997554779053, -1.2236835956573486, -1.1959675550460815, -1.168251395225525, -1.1405352354049683, -1.1128191947937012, -1.0851030349731445, -1.0573869943618774, -1.0296708345413208, -1.0019547939300537, -0.9742386341094971, -0.9465225338935852, -0.9188064336776733, -0.8910903334617615, -0.8633742332458496, -0.835658073425293, -0.8079419732093811, -0.7802258729934692, -0.7525097727775574, -0.7247936725616455, -0.6970775723457336, -0.6693614721298218, -0.6416453123092651, -0.6139292120933533, -0.5862131118774414, -0.5584970116615295, -0.5307809114456177, -0.503064751625061, -0.47534865140914917, -0.4476325213909149, -0.41991642117500305, -0.3922002911567688, -0.36448419094085693, -0.33676809072494507, -0.3090519905090332, -0.28133586049079895, -0.2536197602748871, -0.22590363025665283, -0.19818753004074097, -0.1704714149236679, -0.14275529980659485, -0.11503919959068298, -0.08732308447360992, -0.059606969356536865, -0.031890857964754105, -0.004174746572971344, 0.023541361093521118, 0.05125747621059418, 0.07897359132766724, 0.1066896915435791, 0.13440580666065216, 0.16212192177772522, 0.18983803689479828, 0.21755415201187134, 0.2452702522277832, 0.27298635244369507, 0.3007024824619293, 0.3284185826778412, 0.35613471269607544, 0.3838508129119873]}, "gradients/encoder.encoder.layers.21.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 6.0, 16.0, 40.0, 113.0, 184.0, 252.0, 213.0, 116.0, 47.0, 20.0, 7.0, 1.0, 2.0, 0.0, 0.0, 2.0], "bins": [-0.57295823097229, -0.5622075796127319, -0.5514569282531738, -0.540706217288971, -0.5299555659294128, -0.5192049145698547, -0.5084542632102966, -0.49770358204841614, -0.48695290088653564, -0.47620224952697754, -0.46545156836509705, -0.45470091700553894, -0.44395023584365845, -0.43319958448410034, -0.42244890332221985, -0.41169825196266174, -0.40094757080078125, -0.39019691944122314, -0.37944623827934265, -0.36869558691978455, -0.35794490575790405, -0.34719425439834595, -0.33644357323646545, -0.32569292187690735, -0.31494227051734924, -0.30419161915779114, -0.29344093799591064, -0.28269028663635254, -0.27193960547447205, -0.26118895411491394, -0.25043827295303345, -0.23968762159347534, -0.22893694043159485, -0.21818627417087555, -0.20743560791015625, -0.19668494164943695, -0.18593427538871765, -0.17518360912799835, -0.16443294286727905, -0.15368229150772095, -0.14293161034584045, -0.13218094408512115, -0.12143027782440186, -0.11067961156368256, -0.09992894530296326, -0.08917827904224396, -0.07842762023210526, -0.06767695397138596, -0.05692629516124725, -0.046175628900527954, -0.035424962639808655, -0.024674300104379654, -0.013923633843660355, -0.0031729675829410553, 0.0075776949524879456, 0.018328361213207245, 0.029079027473926544, 0.039829693734645844, 0.05058035999536514, 0.061331022530794144, 0.07208168506622314, 0.08283235132694244, 0.09358301758766174, 0.10433368384838104, 0.11508435010910034]}, "gradients/encoder.encoder.layers.21.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 3.0, 0.0, 2.0, 3.0, 0.0, 0.0, 2.0, 2.0, 6.0, 10.0, 10.0, 11.0, 20.0, 31.0, 16.0, 20.0, 38.0, 48.0, 49.0, 41.0, 66.0, 59.0, 4660.0, 1043013.0, 74.0, 57.0, 65.0, 52.0, 47.0, 37.0, 20.0, 21.0, 18.0, 13.0, 10.0, 6.0, 6.0, 7.0, 8.0, 2.0, 1.0, 2.0, 2.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 5.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.6826171875, -0.6620712280273438, -0.6415252685546875, -0.6209793090820312, -0.600433349609375, -0.5798873901367188, -0.5593414306640625, -0.5387954711914062, -0.51824951171875, -0.49770355224609375, -0.4771575927734375, -0.45661163330078125, -0.436065673828125, -0.41551971435546875, -0.3949737548828125, -0.37442779541015625, -0.3538818359375, -0.33333587646484375, -0.3127899169921875, -0.29224395751953125, -0.271697998046875, -0.25115203857421875, -0.2306060791015625, -0.21006011962890625, -0.18951416015625, -0.16896820068359375, -0.1484222412109375, -0.12787628173828125, -0.107330322265625, -0.08678436279296875, -0.0662384033203125, -0.04569244384765625, -0.025146484375, -0.00460052490234375, 0.0159454345703125, 0.03649139404296875, 0.057037353515625, 0.07758331298828125, 0.0981292724609375, 0.11867523193359375, 0.13922119140625, 0.15976715087890625, 0.1803131103515625, 0.20085906982421875, 0.221405029296875, 0.24195098876953125, 0.2624969482421875, 0.28304290771484375, 0.3035888671875, 0.32413482666015625, 0.3446807861328125, 0.36522674560546875, 0.385772705078125, 0.40631866455078125, 0.4268646240234375, 0.44741058349609375, 0.46795654296875, 0.48850250244140625, 0.5090484619140625, 0.5295944213867188, 0.550140380859375, 0.5706863403320312, 0.5912322998046875, 0.6117782592773438, 0.63232421875]}, "gradients/encoder.encoder.layers.21.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 14.0, 395.0, 581.0, 33.0], "bins": [-0.136474609375, -0.13427501916885376, -0.13207542896270752, -0.12987583875656128, -0.12767624855041504, -0.1254766583442688, -0.12327706813812256, -0.12107747793197632, -0.11887788772583008, -0.11667829751968384, -0.1144787073135376, -0.11227911710739136, -0.11007952690124512, -0.10787993669509888, -0.10568034648895264, -0.1034807562828064, -0.10128116607666016, -0.09908157587051392, -0.09688198566436768, -0.09468239545822144, -0.0924828052520752, -0.09028321504592896, -0.08808362483978271, -0.08588403463363647, -0.08368444442749023, -0.081484854221344, -0.07928526401519775, -0.07708567380905151, -0.07488608360290527, -0.07268649339675903, -0.07048690319061279, -0.06828731298446655, -0.06608772277832031, -0.06388813257217407, -0.06168854236602783, -0.05948895215988159, -0.05728936195373535, -0.05508977174758911, -0.05289018154144287, -0.05069059133529663, -0.04849100112915039, -0.04629141092300415, -0.04409182071685791, -0.04189223051071167, -0.03969264030456543, -0.03749305009841919, -0.03529345989227295, -0.03309386968612671, -0.03089427947998047, -0.02869468927383423, -0.02649509906768799, -0.024295508861541748, -0.022095918655395508, -0.019896328449249268, -0.017696738243103027, -0.015497148036956787, -0.013297557830810547, -0.011097967624664307, -0.008898377418518066, -0.006698787212371826, -0.004499197006225586, -0.0022996068000793457, -0.00010001659393310547, 0.0020995736122131348, 0.004299163818359375]}, "gradients/encoder.encoder.layers.21.attention.v_proj.weight": {"_type": "histogram", "values": [2.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 3.0, 3.0, 6.0, 3.0, 3.0, 7.0, 12.0, 8.0, 10.0, 16.0, 27.0, 40.0, 49.0, 71.0, 120.0, 208.0, 582.0, 4103.0, 176144.0, 857135.0, 8426.0, 909.0, 275.0, 136.0, 81.0, 47.0, 35.0, 28.0, 16.0, 14.0, 14.0, 7.0, 5.0, 5.0, 5.0, 4.0, 0.0, 3.0, 2.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.264892578125, -0.2547569274902344, -0.24462127685546875, -0.23448562622070312, -0.2243499755859375, -0.21421432495117188, -0.20407867431640625, -0.19394302368164062, -0.183807373046875, -0.17367172241210938, -0.16353607177734375, -0.15340042114257812, -0.1432647705078125, -0.13312911987304688, -0.12299346923828125, -0.11285781860351562, -0.10272216796875, -0.09258651733398438, -0.08245086669921875, -0.07231521606445312, -0.0621795654296875, -0.052043914794921875, -0.04190826416015625, -0.031772613525390625, -0.021636962890625, -0.011501312255859375, -0.00136566162109375, 0.008769989013671875, 0.0189056396484375, 0.029041290283203125, 0.03917694091796875, 0.049312591552734375, 0.0594482421875, 0.06958389282226562, 0.07971954345703125, 0.08985519409179688, 0.0999908447265625, 0.11012649536132812, 0.12026214599609375, 0.13039779663085938, 0.140533447265625, 0.15066909790039062, 0.16080474853515625, 0.17094039916992188, 0.1810760498046875, 0.19121170043945312, 0.20134735107421875, 0.21148300170898438, 0.22161865234375, 0.23175430297851562, 0.24188995361328125, 0.2520256042480469, 0.2621612548828125, 0.2722969055175781, 0.28243255615234375, 0.2925682067871094, 0.302703857421875, 0.3128395080566406, 0.32297515869140625, 0.3331108093261719, 0.3432464599609375, 0.3533821105957031, 0.36351776123046875, 0.3736534118652344, 0.3837890625]}, "gradients/encoder.encoder.layers.21.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 3.0, 1.0, 2.0, 1.0, 0.0, 0.0, 2.0, 3.0, 3.0, 7.0, 1.0, 7.0, 12.0, 11.0, 14.0, 13.0, 15.0, 28.0, 40.0, 34.0, 56.0, 60.0, 47.0, 79.0, 71.0, 68.0, 72.0, 62.0, 66.0, 53.0, 31.0, 34.0, 27.0, 21.0, 20.0, 17.0, 4.0, 9.0, 6.0, 2.0, 7.0, 2.0, 2.0, 1.0, 3.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09210205078125, -0.08881282806396484, -0.08552360534667969, -0.08223438262939453, -0.07894515991210938, -0.07565593719482422, -0.07236671447753906, -0.0690774917602539, -0.06578826904296875, -0.062499046325683594, -0.05920982360839844, -0.05592060089111328, -0.052631378173828125, -0.04934215545654297, -0.04605293273925781, -0.042763710021972656, -0.0394744873046875, -0.036185264587402344, -0.03289604187011719, -0.02960681915283203, -0.026317596435546875, -0.02302837371826172, -0.019739151000976562, -0.016449928283691406, -0.01316070556640625, -0.009871482849121094, -0.0065822601318359375, -0.0032930374145507812, -3.814697265625e-06, 0.0032854080200195312, 0.0065746307373046875, 0.009863853454589844, 0.013153076171875, 0.016442298889160156, 0.019731521606445312, 0.02302074432373047, 0.026309967041015625, 0.02959918975830078, 0.03288841247558594, 0.036177635192871094, 0.03946685791015625, 0.042756080627441406, 0.04604530334472656, 0.04933452606201172, 0.052623748779296875, 0.05591297149658203, 0.05920219421386719, 0.062491416931152344, 0.0657806396484375, 0.06906986236572266, 0.07235908508300781, 0.07564830780029297, 0.07893753051757812, 0.08222675323486328, 0.08551597595214844, 0.0888051986694336, 0.09209442138671875, 0.0953836441040039, 0.09867286682128906, 0.10196208953857422, 0.10525131225585938, 0.10854053497314453, 0.11182975769042969, 0.11511898040771484, 0.118408203125]}, "gradients/encoder.encoder.layers.21.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 2.0, 4.0, 2.0, 1.0, 4.0, 6.0, 8.0, 13.0, 9.0, 8.0, 20.0, 17.0, 21.0, 41.0, 41.0, 80.0, 120.0, 241.0, 502.0, 1982.0, 11072.0, 920945.0, 105607.0, 5768.0, 1219.0, 338.0, 146.0, 96.0, 67.0, 39.0, 28.0, 28.0, 15.0, 17.0, 10.0, 10.0, 7.0, 8.0, 0.0, 5.0, 4.0, 6.0, 2.0, 3.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.304443359375, -0.2954444885253906, -0.28644561767578125, -0.2774467468261719, -0.2684478759765625, -0.2594490051269531, -0.25045013427734375, -0.24145126342773438, -0.232452392578125, -0.22345352172851562, -0.21445465087890625, -0.20545578002929688, -0.1964569091796875, -0.18745803833007812, -0.17845916748046875, -0.16946029663085938, -0.16046142578125, -0.15146255493164062, -0.14246368408203125, -0.13346481323242188, -0.1244659423828125, -0.11546707153320312, -0.10646820068359375, -0.09746932983398438, -0.088470458984375, -0.07947158813476562, -0.07047271728515625, -0.061473846435546875, -0.0524749755859375, -0.043476104736328125, -0.03447723388671875, -0.025478363037109375, -0.0164794921875, -0.007480621337890625, 0.00151824951171875, 0.010517120361328125, 0.0195159912109375, 0.028514862060546875, 0.03751373291015625, 0.046512603759765625, 0.055511474609375, 0.06451034545898438, 0.07350921630859375, 0.08250808715820312, 0.0915069580078125, 0.10050582885742188, 0.10950469970703125, 0.11850357055664062, 0.12750244140625, 0.13650131225585938, 0.14550018310546875, 0.15449905395507812, 0.1634979248046875, 0.17249679565429688, 0.18149566650390625, 0.19049453735351562, 0.199493408203125, 0.20849227905273438, 0.21749114990234375, 0.22649002075195312, 0.2354888916015625, 0.24448776245117188, 0.25348663330078125, 0.2624855041503906, 0.271484375]}, "gradients/encoder.encoder.layers.21.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 5.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 3.0, 2.0, 2.0, 8.0, 0.0, 6.0, 4.0, 7.0, 9.0, 10.0, 19.0, 31.0, 71.0, 86.0, 130.0, 158.0, 158.0, 109.0, 65.0, 29.0, 16.0, 15.0, 10.0, 8.0, 13.0, 9.0, 7.0, 4.0, 2.0, 2.0, 1.0, 3.0, 3.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-2.1636486053466797e-05, -2.0976178348064423e-05, -2.031587064266205e-05, -1.9655562937259674e-05, -1.89952552318573e-05, -1.8334947526454926e-05, -1.767463982105255e-05, -1.7014332115650177e-05, -1.6354024410247803e-05, -1.569371670484543e-05, -1.5033408999443054e-05, -1.437310129404068e-05, -1.3712793588638306e-05, -1.3052485883235931e-05, -1.2392178177833557e-05, -1.1731870472431183e-05, -1.1071562767028809e-05, -1.0411255061626434e-05, -9.75094735622406e-06, -9.090639650821686e-06, -8.430331945419312e-06, -7.770024240016937e-06, -7.109716534614563e-06, -6.449408829212189e-06, -5.7891011238098145e-06, -5.12879341840744e-06, -4.468485713005066e-06, -3.8081780076026917e-06, -3.1478703022003174e-06, -2.487562596797943e-06, -1.8272548913955688e-06, -1.1669471859931946e-06, -5.066394805908203e-07, 1.5366822481155396e-07, 8.139759302139282e-07, 1.4742836356163025e-06, 2.1345913410186768e-06, 2.794899046421051e-06, 3.4552067518234253e-06, 4.1155144572257996e-06, 4.775822162628174e-06, 5.436129868030548e-06, 6.096437573432922e-06, 6.756745278835297e-06, 7.417052984237671e-06, 8.077360689640045e-06, 8.73766839504242e-06, 9.397976100444794e-06, 1.0058283805847168e-05, 1.0718591511249542e-05, 1.1378899216651917e-05, 1.203920692205429e-05, 1.2699514627456665e-05, 1.335982233285904e-05, 1.4020130038261414e-05, 1.4680437743663788e-05, 1.5340745449066162e-05, 1.6001053154468536e-05, 1.666136085987091e-05, 1.7321668565273285e-05, 1.798197627067566e-05, 1.8642283976078033e-05, 1.9302591681480408e-05, 1.9962899386882782e-05, 2.0623207092285156e-05]}, "gradients/encoder.encoder.layers.21.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 5.0, 2.0, 0.0, 6.0, 11.0, 5.0, 9.0, 17.0, 19.0, 33.0, 80.0, 117.0, 320.0, 1234.0, 5705.0, 954041.0, 81197.0, 4239.0, 957.0, 273.0, 109.0, 77.0, 31.0, 17.0, 20.0, 10.0, 12.0, 3.0, 3.0, 5.0, 3.0, 4.0, 3.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.9013671875, -0.87310791015625, -0.8448486328125, -0.81658935546875, -0.788330078125, -0.76007080078125, -0.7318115234375, -0.70355224609375, -0.67529296875, -0.64703369140625, -0.6187744140625, -0.59051513671875, -0.562255859375, -0.53399658203125, -0.5057373046875, -0.47747802734375, -0.44921875, -0.42095947265625, -0.3927001953125, -0.36444091796875, -0.336181640625, -0.30792236328125, -0.2796630859375, -0.25140380859375, -0.22314453125, -0.19488525390625, -0.1666259765625, -0.13836669921875, -0.110107421875, -0.08184814453125, -0.0535888671875, -0.02532958984375, 0.0029296875, 0.03118896484375, 0.0594482421875, 0.08770751953125, 0.115966796875, 0.14422607421875, 0.1724853515625, 0.20074462890625, 0.22900390625, 0.25726318359375, 0.2855224609375, 0.31378173828125, 0.342041015625, 0.37030029296875, 0.3985595703125, 0.42681884765625, 0.455078125, 0.48333740234375, 0.5115966796875, 0.53985595703125, 0.568115234375, 0.59637451171875, 0.6246337890625, 0.65289306640625, 0.68115234375, 0.70941162109375, 0.7376708984375, 0.76593017578125, 0.794189453125, 0.82244873046875, 0.8507080078125, 0.87896728515625, 0.9072265625]}, "gradients/encoder.encoder.layers.21.attention.q_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 5.0, 1.0, 0.0, 0.0, 3.0, 2.0, 3.0, 3.0, 3.0, 8.0, 11.0, 19.0, 57.0, 533.0, 261.0, 44.0, 22.0, 13.0, 5.0, 5.0, 0.0, 2.0, 3.0, 2.0, 2.0, 3.0, 0.0, 0.0, 3.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.260498046875, -0.24856948852539062, -0.23664093017578125, -0.22471237182617188, -0.2127838134765625, -0.20085525512695312, -0.18892669677734375, -0.17699813842773438, -0.165069580078125, -0.15314102172851562, -0.14121246337890625, -0.12928390502929688, -0.1173553466796875, -0.10542678833007812, -0.09349822998046875, -0.08156967163085938, -0.06964111328125, -0.057712554931640625, -0.04578399658203125, -0.033855438232421875, -0.0219268798828125, -0.009998321533203125, 0.00193023681640625, 0.013858795166015625, 0.025787353515625, 0.037715911865234375, 0.04964447021484375, 0.061573028564453125, 0.0735015869140625, 0.08543014526367188, 0.09735870361328125, 0.10928726196289062, 0.1212158203125, 0.13314437866210938, 0.14507293701171875, 0.15700149536132812, 0.1689300537109375, 0.18085861206054688, 0.19278717041015625, 0.20471572875976562, 0.216644287109375, 0.22857284545898438, 0.24050140380859375, 0.2524299621582031, 0.2643585205078125, 0.2762870788574219, 0.28821563720703125, 0.3001441955566406, 0.31207275390625, 0.3240013122558594, 0.33592987060546875, 0.3478584289550781, 0.3597869873046875, 0.3717155456542969, 0.38364410400390625, 0.3955726623535156, 0.407501220703125, 0.4194297790527344, 0.43135833740234375, 0.4432868957519531, 0.4552154541015625, 0.4671440124511719, 0.47907257080078125, 0.4910011291503906, 0.5029296875]}, "gradients/encoder.encoder.layers.21.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 5.0, 61.0, 871.0, 67.0, 9.0, 4.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.73531723022461, -8.561802864074707, -8.388288497924805, -8.214775085449219, -8.041260719299316, -7.867746353149414, -7.694231986999512, -7.520717620849609, -7.347203254699707, -7.173688888549805, -7.0001749992370605, -6.826660633087158, -6.653146266937256, -6.479632377624512, -6.306118011474609, -6.132603645324707, -5.959089756011963, -5.7855753898620605, -5.612061500549316, -5.438547134399414, -5.265032768249512, -5.091518402099609, -4.918004512786865, -4.744490146636963, -4.570976257324219, -4.397461891174316, -4.223948001861572, -4.05043363571167, -3.8769192695617676, -3.7034051418304443, -3.529891014099121, -3.3563766479492188, -3.1828627586364746, -3.0093486309051514, -2.835834264755249, -2.662320137023926, -2.4888057708740234, -2.3152916431427, -2.141777515411377, -1.9682632684707642, -1.7947490215301514, -1.6212347745895386, -1.4477205276489258, -1.2742063999176025, -1.1006921529769897, -0.927177906036377, -0.7536637783050537, -0.5801495313644409, -0.4066352844238281, -0.23312106728553772, -0.059606850147247314, 0.1139073371887207, 0.2874215841293335, 0.4609358310699463, 0.6344499588012695, 0.8079642057418823, 0.9814784526824951, 1.154992699623108, 1.3285069465637207, 1.502021074295044, 1.6755353212356567, 1.8490495681762695, 2.0225636959075928, 2.196077823638916, 2.3695921897888184]}, "gradients/encoder.encoder.layers.21.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 0.0, 3.0, 8.0, 9.0, 16.0, 54.0, 59.0, 94.0, 119.0, 138.0, 142.0, 116.0, 99.0, 73.0, 44.0, 20.0, 14.0, 6.0, 2.0, 0.0, 3.0], "bins": [-2.9110710620880127, -2.858119487762451, -2.8051681518554688, -2.7522165775299072, -2.699265241622925, -2.6463136672973633, -2.593362331390381, -2.5404107570648193, -2.487459421157837, -2.4345078468322754, -2.381556510925293, -2.3286049365997314, -2.275653600692749, -2.2227020263671875, -2.169750690460205, -2.1167991161346436, -2.063847541809082, -2.0108959674835205, -1.957944631576538, -1.9049931764602661, -1.8520417213439941, -1.7990902662277222, -1.7461388111114502, -1.6931872367858887, -1.6402359008789062, -1.5872844457626343, -1.5343329906463623, -1.4813815355300903, -1.4284300804138184, -1.3754786252975464, -1.3225271701812744, -1.269575595855713, -1.216624140739441, -1.163672685623169, -1.110721230506897, -1.057769775390625, -1.004818320274353, -0.951866865158081, -0.8989153504371643, -0.8459638953208923, -0.7930124402046204, -0.7400609850883484, -0.6871095299720764, -0.6341580152511597, -0.5812065601348877, -0.5282551050186157, -0.47530364990234375, -0.4223521947860718, -0.3694007396697998, -0.31644928455352783, -0.26349782943725586, -0.2105463445186615, -0.15759488940238953, -0.10464343428611755, -0.05169194936752319, 0.0012595057487487793, 0.05421096086502075, 0.10716242343187332, 0.1601138859987259, 0.21306535601615906, 0.26601681113243103, 0.318968266248703, 0.37191975116729736, 0.42487120628356934, 0.4778226613998413]}, "gradients/encoder.encoder.layers.20.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 13.0, 7.0, 11.0, 19.0, 26.0, 48.0, 67.0, 120.0, 239.0, 811.0, 4192047.0, 746.0, 122.0, 19.0, 5.0], "bins": [-4.48046875, -4.405242919921875, -4.33001708984375, -4.254791259765625, -4.1795654296875, -4.104339599609375, -4.02911376953125, -3.953887939453125, -3.878662109375, -3.803436279296875, -3.72821044921875, -3.652984619140625, -3.5777587890625, -3.502532958984375, -3.42730712890625, -3.352081298828125, -3.27685546875, -3.201629638671875, -3.12640380859375, -3.051177978515625, -2.9759521484375, -2.900726318359375, -2.82550048828125, -2.750274658203125, -2.675048828125, -2.599822998046875, -2.52459716796875, -2.449371337890625, -2.3741455078125, -2.298919677734375, -2.22369384765625, -2.148468017578125, -2.0732421875, -1.998016357421875, -1.92279052734375, -1.847564697265625, -1.7723388671875, -1.697113037109375, -1.62188720703125, -1.546661376953125, -1.471435546875, -1.396209716796875, -1.32098388671875, -1.245758056640625, -1.1705322265625, -1.095306396484375, -1.02008056640625, -0.944854736328125, -0.86962890625, -0.794403076171875, -0.71917724609375, -0.643951416015625, -0.5687255859375, -0.493499755859375, -0.41827392578125, -0.343048095703125, -0.267822265625, -0.192596435546875, -0.11737060546875, -0.042144775390625, 0.0330810546875, 0.108306884765625, 0.18353271484375, 0.258758544921875, 0.333984375]}, "gradients/encoder.encoder.layers.20.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 4.0, 33.0, 121.0, 328.0, 342.0, 155.0, 32.0, 7.0], "bins": [-0.1578369140625, -0.15520596504211426, -0.15257501602172852, -0.14994406700134277, -0.14731311798095703, -0.1446821689605713, -0.14205121994018555, -0.1394202709197998, -0.13678932189941406, -0.13415837287902832, -0.13152742385864258, -0.12889647483825684, -0.1262655258178711, -0.12363457679748535, -0.12100362777709961, -0.11837267875671387, -0.11574172973632812, -0.11311078071594238, -0.11047983169555664, -0.1078488826751709, -0.10521793365478516, -0.10258698463439941, -0.09995603561401367, -0.09732508659362793, -0.09469413757324219, -0.09206318855285645, -0.0894322395324707, -0.08680129051208496, -0.08417034149169922, -0.08153939247131348, -0.07890844345092773, -0.07627749443054199, -0.07364654541015625, -0.07101559638977051, -0.06838464736938477, -0.06575369834899902, -0.06312274932861328, -0.06049180030822754, -0.0578608512878418, -0.055229902267456055, -0.05259895324707031, -0.04996800422668457, -0.04733705520629883, -0.044706106185913086, -0.042075157165527344, -0.0394442081451416, -0.03681325912475586, -0.03418231010437012, -0.031551361083984375, -0.028920412063598633, -0.02628946304321289, -0.02365851402282715, -0.021027565002441406, -0.018396615982055664, -0.015765666961669922, -0.01313471794128418, -0.010503768920898438, -0.007872819900512695, -0.005241870880126953, -0.002610921859741211, 2.002716064453125e-05, 0.0026509761810302734, 0.005281925201416016, 0.007912874221801758, 0.0105438232421875]}, "gradients/encoder.encoder.layers.20.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 9.0, 30.0, 53.0, 100.0, 317.0, 1557.0, 4184059.0, 7489.0, 480.0, 133.0, 39.0, 12.0, 7.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.69921875, -2.62353515625, -2.5478515625, -2.47216796875, -2.396484375, -2.32080078125, -2.2451171875, -2.16943359375, -2.09375, -2.01806640625, -1.9423828125, -1.86669921875, -1.791015625, -1.71533203125, -1.6396484375, -1.56396484375, -1.48828125, -1.41259765625, -1.3369140625, -1.26123046875, -1.185546875, -1.10986328125, -1.0341796875, -0.95849609375, -0.8828125, -0.80712890625, -0.7314453125, -0.65576171875, -0.580078125, -0.50439453125, -0.4287109375, -0.35302734375, -0.27734375, -0.20166015625, -0.1259765625, -0.05029296875, 0.025390625, 0.10107421875, 0.1767578125, 0.25244140625, 0.328125, 0.40380859375, 0.4794921875, 0.55517578125, 0.630859375, 0.70654296875, 0.7822265625, 0.85791015625, 0.93359375, 1.00927734375, 1.0849609375, 1.16064453125, 1.236328125, 1.31201171875, 1.3876953125, 1.46337890625, 1.5390625, 1.61474609375, 1.6904296875, 1.76611328125, 1.841796875, 1.91748046875, 1.9931640625, 2.06884765625, 2.14453125]}, "gradients/encoder.encoder.layers.20.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 7.0, 30.0, 789.0, 3138.0, 92.0, 22.0, 5.0, 4.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.44970703125, -0.4395008087158203, -0.4292945861816406, -0.41908836364746094, -0.40888214111328125, -0.39867591857910156, -0.3884696960449219, -0.3782634735107422, -0.3680572509765625, -0.3578510284423828, -0.3476448059082031, -0.33743858337402344, -0.32723236083984375, -0.31702613830566406, -0.3068199157714844, -0.2966136932373047, -0.286407470703125, -0.2762012481689453, -0.2659950256347656, -0.25578880310058594, -0.24558258056640625, -0.23537635803222656, -0.22517013549804688, -0.2149639129638672, -0.2047576904296875, -0.1945514678955078, -0.18434524536132812, -0.17413902282714844, -0.16393280029296875, -0.15372657775878906, -0.14352035522460938, -0.1333141326904297, -0.12310791015625, -0.11290168762207031, -0.10269546508789062, -0.09248924255371094, -0.08228302001953125, -0.07207679748535156, -0.061870574951171875, -0.05166435241699219, -0.0414581298828125, -0.03125190734863281, -0.021045684814453125, -0.010839462280273438, -0.00063323974609375, 0.009572982788085938, 0.019779205322265625, 0.029985427856445312, 0.040191650390625, 0.05039787292480469, 0.060604095458984375, 0.07081031799316406, 0.08101654052734375, 0.09122276306152344, 0.10142898559570312, 0.11163520812988281, 0.1218414306640625, 0.1320476531982422, 0.14225387573242188, 0.15246009826660156, 0.16266632080078125, 0.17287254333496094, 0.18307876586914062, 0.1932849884033203, 0.2034912109375]}, "gradients/encoder.encoder.layers.20.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 5.0, 107.0, 878.0, 22.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-6.116090297698975, -5.97467565536499, -5.833261013031006, -5.691845893859863, -5.550431251525879, -5.4090166091918945, -5.26760196685791, -5.126187324523926, -4.984772682189941, -4.843358039855957, -4.701943397521973, -4.56052827835083, -4.419113636016846, -4.277698993682861, -4.136284351348877, -3.9948697090148926, -3.85345458984375, -3.7120399475097656, -3.570625066757202, -3.4292104244232178, -3.2877955436706543, -3.14638090133667, -3.0049662590026855, -2.863551616668701, -2.7221367359161377, -2.5807220935821533, -2.43930721282959, -2.2978925704956055, -2.156477928161621, -2.0150630474090576, -1.8736484050750732, -1.7322336435317993, -1.5908193588256836, -1.4494045972824097, -1.3079898357391357, -1.1665751934051514, -1.0251604318618774, -0.8837456703186035, -0.7423309683799744, -0.6009162664413452, -0.4595015048980713, -0.31808677315711975, -0.1766720414161682, -0.035257309675216675, 0.10615742206573486, 0.2475721836090088, 0.38898688554763794, 0.5304015874862671, 0.671816349029541, 0.8132311105728149, 0.9546458125114441, 1.0960605144500732, 1.2374752759933472, 1.378890037536621, 1.5203046798706055, 1.6617194414138794, 1.8031342029571533, 1.9445489645004272, 2.085963726043701, 2.2273783683776855, 2.36879301071167, 2.5102078914642334, 2.6516225337982178, 2.7930374145507812, 2.9344520568847656]}, "gradients/encoder.encoder.layers.20.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 3.0, 5.0, 3.0, 13.0, 18.0, 24.0, 60.0, 84.0, 99.0, 128.0, 134.0, 119.0, 106.0, 91.0, 52.0, 36.0, 19.0, 9.0, 4.0, 2.0, 4.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.1245499849319458, -1.0871564149856567, -1.0497628450393677, -1.012369155883789, -0.9749756455421448, -0.9375820159912109, -0.9001884460449219, -0.8627948760986328, -0.8254013061523438, -0.7880077362060547, -0.7506141066551208, -0.7132205367088318, -0.6758269667625427, -0.6384333372116089, -0.6010397672653198, -0.5636461973190308, -0.5262525677680969, -0.4888589680194855, -0.4514653980731964, -0.41407179832458496, -0.3766782283782959, -0.33928462862968445, -0.301891028881073, -0.26449745893478394, -0.22710385918617249, -0.18971027433872223, -0.15231668949127197, -0.11492308974266052, -0.07752950489521027, -0.04013592004776001, -0.0027423202991485596, 0.0346512496471405, 0.07204484939575195, 0.10943843424320221, 0.14683201909065247, 0.18422561883926392, 0.22161920368671417, 0.25901278853416443, 0.2964063882827759, 0.33379995822906494, 0.3711935579776764, 0.40858715772628784, 0.4459807276725769, 0.48337432742118835, 0.5207679271697998, 0.5581614971160889, 0.5955550670623779, 0.632948637008667, 0.6703422665596008, 0.7077358365058899, 0.7451294660568237, 0.7825230360031128, 0.8199166059494019, 0.8573101758956909, 0.8947038054466248, 0.9320973753929138, 0.9694910049438477, 1.0068845748901367, 1.0442781448364258, 1.0816717147827148, 1.1190654039382935, 1.1564589738845825, 1.1938525438308716, 1.2312461137771606, 1.2686396837234497]}, "gradients/encoder.encoder.layers.20.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 4.0, 2.0, 2.0, 11.0, 12.0, 11.0, 18.0, 22.0, 41.0, 52.0, 78.0, 188.0, 1200.0, 142323.0, 901911.0, 2154.0, 263.0, 91.0, 62.0, 31.0, 27.0, 20.0, 17.0, 9.0, 2.0, 2.0, 2.0, 4.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.962890625, -0.9362258911132812, -0.9095611572265625, -0.8828964233398438, -0.856231689453125, -0.8295669555664062, -0.8029022216796875, -0.7762374877929688, -0.74957275390625, -0.7229080200195312, -0.6962432861328125, -0.6695785522460938, -0.642913818359375, -0.6162490844726562, -0.5895843505859375, -0.5629196166992188, -0.5362548828125, -0.5095901489257812, -0.4829254150390625, -0.45626068115234375, -0.429595947265625, -0.40293121337890625, -0.3762664794921875, -0.34960174560546875, -0.32293701171875, -0.29627227783203125, -0.2696075439453125, -0.24294281005859375, -0.216278076171875, -0.18961334228515625, -0.1629486083984375, -0.13628387451171875, -0.109619140625, -0.08295440673828125, -0.0562896728515625, -0.02962493896484375, -0.002960205078125, 0.02370452880859375, 0.0503692626953125, 0.07703399658203125, 0.10369873046875, 0.13036346435546875, 0.1570281982421875, 0.18369293212890625, 0.210357666015625, 0.23702239990234375, 0.2636871337890625, 0.29035186767578125, 0.3170166015625, 0.34368133544921875, 0.3703460693359375, 0.39701080322265625, 0.423675537109375, 0.45034027099609375, 0.4770050048828125, 0.5036697387695312, 0.53033447265625, 0.5569992065429688, 0.5836639404296875, 0.6103286743164062, 0.636993408203125, 0.6636581420898438, 0.6903228759765625, 0.7169876098632812, 0.74365234375]}, "gradients/encoder.encoder.layers.20.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 1.0, 4.0, 17.0, 73.0, 147.0, 252.0, 287.0, 149.0, 54.0, 22.0, 5.0, 1.0, 2.0, 1.0, 1.0, 1.0], "bins": [-0.17431640625, -0.17108488082885742, -0.16785335540771484, -0.16462182998657227, -0.1613903045654297, -0.1581587791442871, -0.15492725372314453, -0.15169572830200195, -0.14846420288085938, -0.1452326774597168, -0.14200115203857422, -0.13876962661743164, -0.13553810119628906, -0.13230657577514648, -0.1290750503540039, -0.12584352493286133, -0.12261199951171875, -0.11938047409057617, -0.1161489486694336, -0.11291742324829102, -0.10968589782714844, -0.10645437240600586, -0.10322284698486328, -0.0999913215637207, -0.09675979614257812, -0.09352827072143555, -0.09029674530029297, -0.08706521987915039, -0.08383369445800781, -0.08060216903686523, -0.07737064361572266, -0.07413911819458008, -0.0709075927734375, -0.06767606735229492, -0.06444454193115234, -0.061213016510009766, -0.05798149108886719, -0.05474996566772461, -0.05151844024658203, -0.04828691482543945, -0.045055389404296875, -0.0418238639831543, -0.03859233856201172, -0.03536081314086914, -0.03212928771972656, -0.028897762298583984, -0.025666236877441406, -0.022434711456298828, -0.01920318603515625, -0.015971660614013672, -0.012740135192871094, -0.009508609771728516, -0.0062770843505859375, -0.0030455589294433594, 0.00018596649169921875, 0.003417491912841797, 0.006649017333984375, 0.009880542755126953, 0.013112068176269531, 0.01634359359741211, 0.019575119018554688, 0.022806644439697266, 0.026038169860839844, 0.029269695281982422, 0.032501220703125]}, "gradients/encoder.encoder.layers.20.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 5.0, 2.0, 7.0, 9.0, 12.0, 25.0, 29.0, 43.0, 54.0, 79.0, 191.0, 457.0, 2102.0, 84070.0, 952133.0, 7928.0, 814.0, 267.0, 128.0, 63.0, 40.0, 33.0, 23.0, 14.0, 9.0, 4.0, 11.0, 7.0, 2.0, 0.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.50390625, -0.4896087646484375, -0.475311279296875, -0.4610137939453125, -0.44671630859375, -0.4324188232421875, -0.418121337890625, -0.4038238525390625, -0.3895263671875, -0.3752288818359375, -0.360931396484375, -0.3466339111328125, -0.33233642578125, -0.3180389404296875, -0.303741455078125, -0.2894439697265625, -0.275146484375, -0.2608489990234375, -0.246551513671875, -0.2322540283203125, -0.21795654296875, -0.2036590576171875, -0.189361572265625, -0.1750640869140625, -0.1607666015625, -0.1464691162109375, -0.132171630859375, -0.1178741455078125, -0.10357666015625, -0.0892791748046875, -0.074981689453125, -0.0606842041015625, -0.04638671875, -0.0320892333984375, -0.017791748046875, -0.0034942626953125, 0.01080322265625, 0.0251007080078125, 0.039398193359375, 0.0536956787109375, 0.0679931640625, 0.0822906494140625, 0.096588134765625, 0.1108856201171875, 0.12518310546875, 0.1394805908203125, 0.153778076171875, 0.1680755615234375, 0.182373046875, 0.1966705322265625, 0.210968017578125, 0.2252655029296875, 0.23956298828125, 0.2538604736328125, 0.268157958984375, 0.2824554443359375, 0.2967529296875, 0.3110504150390625, 0.325347900390625, 0.3396453857421875, 0.35394287109375, 0.3682403564453125, 0.382537841796875, 0.3968353271484375, 0.4111328125]}, "gradients/encoder.encoder.layers.20.attention.v_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 3.0, 3.0, 3.0, 2.0, 13.0, 12.0, 11.0, 21.0, 25.0, 25.0, 50.0, 65.0, 39.0, 70.0, 72.0, 88.0, 76.0, 79.0, 79.0, 62.0, 47.0, 39.0, 27.0, 23.0, 22.0, 12.0, 13.0, 10.0, 8.0, 8.0, 4.0, 1.0, 1.0, 0.0, 2.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.182373046875, -0.17718505859375, -0.1719970703125, -0.16680908203125, -0.16162109375, -0.15643310546875, -0.1512451171875, -0.14605712890625, -0.140869140625, -0.13568115234375, -0.1304931640625, -0.12530517578125, -0.1201171875, -0.11492919921875, -0.1097412109375, -0.10455322265625, -0.099365234375, -0.09417724609375, -0.0889892578125, -0.08380126953125, -0.07861328125, -0.07342529296875, -0.0682373046875, -0.06304931640625, -0.057861328125, -0.05267333984375, -0.0474853515625, -0.04229736328125, -0.037109375, -0.03192138671875, -0.0267333984375, -0.02154541015625, -0.016357421875, -0.01116943359375, -0.0059814453125, -0.00079345703125, 0.00439453125, 0.00958251953125, 0.0147705078125, 0.01995849609375, 0.025146484375, 0.03033447265625, 0.0355224609375, 0.04071044921875, 0.0458984375, 0.05108642578125, 0.0562744140625, 0.06146240234375, 0.066650390625, 0.07183837890625, 0.0770263671875, 0.08221435546875, 0.08740234375, 0.09259033203125, 0.0977783203125, 0.10296630859375, 0.108154296875, 0.11334228515625, 0.1185302734375, 0.12371826171875, 0.12890625, 0.13409423828125, 0.1392822265625, 0.14447021484375, 0.149658203125]}, "gradients/encoder.encoder.layers.20.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 2.0, 2.0, 0.0, 3.0, 2.0, 7.0, 4.0, 6.0, 7.0, 8.0, 15.0, 16.0, 20.0, 40.0, 43.0, 91.0, 178.0, 307.0, 652.0, 1810.0, 8664.0, 167548.0, 838782.0, 25079.0, 3353.0, 954.0, 413.0, 210.0, 116.0, 76.0, 47.0, 29.0, 13.0, 13.0, 12.0, 12.0, 4.0, 8.0, 3.0, 6.0, 6.0, 2.0, 3.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.129150390625, -0.12514305114746094, -0.12113571166992188, -0.11712837219238281, -0.11312103271484375, -0.10911369323730469, -0.10510635375976562, -0.10109901428222656, -0.0970916748046875, -0.09308433532714844, -0.08907699584960938, -0.08506965637207031, -0.08106231689453125, -0.07705497741699219, -0.07304763793945312, -0.06904029846191406, -0.065032958984375, -0.06102561950683594, -0.057018280029296875, -0.05301094055175781, -0.04900360107421875, -0.04499626159667969, -0.040988922119140625, -0.03698158264160156, -0.0329742431640625, -0.028966903686523438, -0.024959564208984375, -0.020952224731445312, -0.01694488525390625, -0.012937545776367188, -0.008930206298828125, -0.0049228668212890625, -0.00091552734375, 0.0030918121337890625, 0.007099151611328125, 0.011106491088867188, 0.01511383056640625, 0.019121170043945312, 0.023128509521484375, 0.027135848999023438, 0.0311431884765625, 0.03515052795410156, 0.039157867431640625, 0.04316520690917969, 0.04717254638671875, 0.05117988586425781, 0.055187225341796875, 0.05919456481933594, 0.063201904296875, 0.06720924377441406, 0.07121658325195312, 0.07522392272949219, 0.07923126220703125, 0.08323860168457031, 0.08724594116210938, 0.09125328063964844, 0.0952606201171875, 0.09926795959472656, 0.10327529907226562, 0.10728263854980469, 0.11128997802734375, 0.11529731750488281, 0.11930465698242188, 0.12331199645996094, 0.1273193359375]}, "gradients/encoder.encoder.layers.20.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 5.0, 1.0, 0.0, 1.0, 1.0, 4.0, 6.0, 8.0, 5.0, 6.0, 22.0, 13.0, 47.0, 48.0, 56.0, 81.0, 99.0, 112.0, 106.0, 115.0, 83.0, 57.0, 58.0, 35.0, 11.0, 13.0, 9.0, 4.0, 2.0, 3.0, 0.0, 0.0, 3.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.424551010131836e-05, -1.3824552297592163e-05, -1.3403594493865967e-05, -1.298263669013977e-05, -1.2561678886413574e-05, -1.2140721082687378e-05, -1.1719763278961182e-05, -1.1298805475234985e-05, -1.0877847671508789e-05, -1.0456889867782593e-05, -1.0035932064056396e-05, -9.6149742603302e-06, -9.194016456604004e-06, -8.773058652877808e-06, -8.352100849151611e-06, -7.931143045425415e-06, -7.510185241699219e-06, -7.0892274379730225e-06, -6.668269634246826e-06, -6.24731183052063e-06, -5.826354026794434e-06, -5.405396223068237e-06, -4.984438419342041e-06, -4.563480615615845e-06, -4.1425228118896484e-06, -3.721565008163452e-06, -3.300607204437256e-06, -2.8796494007110596e-06, -2.4586915969848633e-06, -2.037733793258667e-06, -1.6167759895324707e-06, -1.1958181858062744e-06, -7.748603820800781e-07, -3.5390257835388184e-07, 6.705522537231445e-08, 4.880130290985107e-07, 9.08970832824707e-07, 1.3299286365509033e-06, 1.7508864402770996e-06, 2.171844244003296e-06, 2.592802047729492e-06, 3.0137598514556885e-06, 3.4347176551818848e-06, 3.855675458908081e-06, 4.276633262634277e-06, 4.697591066360474e-06, 5.11854887008667e-06, 5.539506673812866e-06, 5.9604644775390625e-06, 6.381422281265259e-06, 6.802380084991455e-06, 7.223337888717651e-06, 7.644295692443848e-06, 8.065253496170044e-06, 8.48621129989624e-06, 8.907169103622437e-06, 9.328126907348633e-06, 9.749084711074829e-06, 1.0170042514801025e-05, 1.0591000318527222e-05, 1.1011958122253418e-05, 1.1432915925979614e-05, 1.185387372970581e-05, 1.2274831533432007e-05, 1.2695789337158203e-05]}, "gradients/encoder.encoder.layers.20.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 2.0, 1.0, 3.0, 5.0, 17.0, 38.0, 76.0, 294.0, 1247.0, 20280.0, 1018394.0, 7148.0, 786.0, 173.0, 52.0, 25.0, 11.0, 6.0, 4.0, 2.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.42626953125, -0.41268157958984375, -0.3990936279296875, -0.38550567626953125, -0.371917724609375, -0.35832977294921875, -0.3447418212890625, -0.33115386962890625, -0.31756591796875, -0.30397796630859375, -0.2903900146484375, -0.27680206298828125, -0.263214111328125, -0.24962615966796875, -0.2360382080078125, -0.22245025634765625, -0.2088623046875, -0.19527435302734375, -0.1816864013671875, -0.16809844970703125, -0.154510498046875, -0.14092254638671875, -0.1273345947265625, -0.11374664306640625, -0.10015869140625, -0.08657073974609375, -0.0729827880859375, -0.05939483642578125, -0.045806884765625, -0.03221893310546875, -0.0186309814453125, -0.00504302978515625, 0.008544921875, 0.02213287353515625, 0.0357208251953125, 0.04930877685546875, 0.062896728515625, 0.07648468017578125, 0.0900726318359375, 0.10366058349609375, 0.11724853515625, 0.13083648681640625, 0.1444244384765625, 0.15801239013671875, 0.171600341796875, 0.18518829345703125, 0.1987762451171875, 0.21236419677734375, 0.2259521484375, 0.23954010009765625, 0.2531280517578125, 0.26671600341796875, 0.280303955078125, 0.29389190673828125, 0.3074798583984375, 0.32106781005859375, 0.33465576171875, 0.34824371337890625, 0.3618316650390625, 0.37541961669921875, 0.389007568359375, 0.40259552001953125, 0.4161834716796875, 0.42977142333984375, 0.443359375]}, "gradients/encoder.encoder.layers.20.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 3.0, 4.0, 10.0, 13.0, 12.0, 29.0, 41.0, 90.0, 151.0, 213.0, 209.0, 115.0, 50.0, 34.0, 18.0, 4.0, 9.0, 2.0, 2.0, 0.0, 2.0, 0.0, 1.0, 1.0, 2.0, 2.0], "bins": [-0.188232421875, -0.18428802490234375, -0.1803436279296875, -0.17639923095703125, -0.172454833984375, -0.16851043701171875, -0.1645660400390625, -0.16062164306640625, -0.15667724609375, -0.15273284912109375, -0.1487884521484375, -0.14484405517578125, -0.140899658203125, -0.13695526123046875, -0.1330108642578125, -0.12906646728515625, -0.1251220703125, -0.12117767333984375, -0.1172332763671875, -0.11328887939453125, -0.109344482421875, -0.10540008544921875, -0.1014556884765625, -0.09751129150390625, -0.09356689453125, -0.08962249755859375, -0.0856781005859375, -0.08173370361328125, -0.077789306640625, -0.07384490966796875, -0.0699005126953125, -0.06595611572265625, -0.06201171875, -0.05806732177734375, -0.0541229248046875, -0.05017852783203125, -0.046234130859375, -0.04228973388671875, -0.0383453369140625, -0.03440093994140625, -0.03045654296875, -0.02651214599609375, -0.0225677490234375, -0.01862335205078125, -0.014678955078125, -0.01073455810546875, -0.0067901611328125, -0.00284576416015625, 0.0010986328125, 0.00504302978515625, 0.0089874267578125, 0.01293182373046875, 0.016876220703125, 0.02082061767578125, 0.0247650146484375, 0.02870941162109375, 0.03265380859375, 0.03659820556640625, 0.0405426025390625, 0.04448699951171875, 0.048431396484375, 0.05237579345703125, 0.0563201904296875, 0.06026458740234375, 0.064208984375]}, "gradients/encoder.encoder.layers.20.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 4.0, 72.0, 894.0, 43.0, 3.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.2823108434677124, -1.1305437088012695, -0.9787766933441162, -0.8270095586776733, -0.6752424836158752, -0.5234754085540771, -0.3717082738876343, -0.21994125843048096, -0.06817412376403809, 0.0835929661989212, 0.2353600561618805, 0.387127161026001, 0.5388942360877991, 0.6906613111495972, 0.84242844581604, 0.9941954612731934, 1.1459625959396362, 1.297729730606079, 1.4494967460632324, 1.6012638807296753, 1.7530310153961182, 1.9047980308532715, 2.056565284729004, 2.208332061767578, 2.3600993156433105, 2.511866331100464, 2.6636335849761963, 2.8154006004333496, 2.967167615890503, 3.1189346313476562, 3.2707018852233887, 3.422468900680542, 3.5742363929748535, 3.726003408432007, 3.8777706623077393, 4.029537677764893, 4.181304931640625, 4.333071708679199, 4.484838962554932, 4.636606216430664, 4.788372993469238, 4.940140247344971, 5.091907024383545, 5.243674278259277, 5.39544153213501, 5.547208309173584, 5.698975563049316, 5.850742340087891, 6.002510070800781, 6.154277324676514, 6.306044101715088, 6.45781135559082, 6.609578609466553, 6.761345386505127, 6.913112640380859, 7.064879417419434, 7.216646671295166, 7.368413925170898, 7.520180702209473, 7.671947956085205, 7.8237152099609375, 7.975481986999512, 8.127248764038086, 8.279016494750977, 8.43078327178955]}, "gradients/encoder.encoder.layers.20.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 4.0, 2.0, 4.0, 4.0, 6.0, 7.0, 11.0, 15.0, 14.0, 24.0, 39.0, 26.0, 41.0, 42.0, 51.0, 71.0, 63.0, 69.0, 65.0, 75.0, 57.0, 58.0, 52.0, 51.0, 37.0, 23.0, 25.0, 22.0, 13.0, 22.0, 8.0, 2.0, 2.0, 3.0, 3.0, 3.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6482570767402649, -0.6215471625328064, -0.5948372483253479, -0.5681272745132446, -0.5414173603057861, -0.5147074460983276, -0.48799753189086914, -0.46128761768341064, -0.43457767367362976, -0.40786775946617126, -0.3811578154563904, -0.3544479012489319, -0.3277379870414734, -0.3010280430316925, -0.274318128824234, -0.24760819971561432, -0.22089827060699463, -0.19418834149837494, -0.16747841238975525, -0.14076849818229675, -0.11405856907367706, -0.08734863996505737, -0.06063872575759888, -0.03392879664897919, -0.007218867540359497, 0.019491057842969894, 0.046200983226299286, 0.07291090488433838, 0.09962083399295807, 0.12633076310157776, 0.15304067730903625, 0.17975060641765594, 0.20646047592163086, 0.23317040503025055, 0.25988033413887024, 0.28659024834632874, 0.3133001923561096, 0.3400101065635681, 0.3667200207710266, 0.3934299349784851, 0.420139878988266, 0.4468497931957245, 0.47355973720550537, 0.5002696514129639, 0.5269795656204224, 0.5536894798278809, 0.5803993940353394, 0.6071093678474426, 0.6338192820549011, 0.6605291962623596, 0.6872391104698181, 0.7139490842819214, 0.7406589984893799, 0.7673689126968384, 0.7940788269042969, 0.8207887411117554, 0.8474986553192139, 0.8742085695266724, 0.9009184837341309, 0.9276283979415894, 0.9543383717536926, 0.9810482859611511, 1.0077581405639648, 1.034468173980713, 1.0611780881881714]}, "gradients/encoder.encoder.layers.19.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 2.0, 3.0, 2.0, 2.0, 4.0, 1.0, 5.0, 4.0, 4.0, 3.0, 6.0, 5.0, 5.0, 6.0, 8.0, 17.0, 15.0, 11.0, 23.0, 26.0, 28.0, 25.0, 51.0, 55.0, 62.0, 114.0, 236.0, 946.0, 11983.0, 4080819.0, 96755.0, 2589.0, 337.0, 88.0, 36.0, 14.0, 6.0, 0.0, 1.0, 0.0, 3.0], "bins": [-1.0390625, -1.0193595886230469, -0.9996566772460938, -0.9799537658691406, -0.9602508544921875, -0.9405479431152344, -0.9208450317382812, -0.9011421203613281, -0.881439208984375, -0.8617362976074219, -0.8420333862304688, -0.8223304748535156, -0.8026275634765625, -0.7829246520996094, -0.7632217407226562, -0.7435188293457031, -0.72381591796875, -0.7041130065917969, -0.6844100952148438, -0.6647071838378906, -0.6450042724609375, -0.6253013610839844, -0.6055984497070312, -0.5858955383300781, -0.566192626953125, -0.5464897155761719, -0.5267868041992188, -0.5070838928222656, -0.4873809814453125, -0.4676780700683594, -0.44797515869140625, -0.4282722473144531, -0.4085693359375, -0.3888664245605469, -0.36916351318359375, -0.3494606018066406, -0.3297576904296875, -0.3100547790527344, -0.29035186767578125, -0.2706489562988281, -0.250946044921875, -0.23124313354492188, -0.21154022216796875, -0.19183731079101562, -0.1721343994140625, -0.15243148803710938, -0.13272857666015625, -0.11302566528320312, -0.09332275390625, -0.07361984252929688, -0.05391693115234375, -0.034214019775390625, -0.0145111083984375, 0.005191802978515625, 0.02489471435546875, 0.044597625732421875, 0.064300537109375, 0.08400344848632812, 0.10370635986328125, 0.12340927124023438, 0.1431121826171875, 0.16281509399414062, 0.18251800537109375, 0.20222091674804688, 0.221923828125]}, "gradients/encoder.encoder.layers.19.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 2.0, 2.0, 13.0, 34.0, 91.0, 175.0, 254.0, 226.0, 134.0, 55.0, 23.0, 8.0, 0.0, 1.0, 1.0], "bins": [-0.165771484375, -0.1627941131591797, -0.15981674194335938, -0.15683937072753906, -0.15386199951171875, -0.15088462829589844, -0.14790725708007812, -0.1449298858642578, -0.1419525146484375, -0.1389751434326172, -0.13599777221679688, -0.13302040100097656, -0.13004302978515625, -0.12706565856933594, -0.12408828735351562, -0.12111091613769531, -0.118133544921875, -0.11515617370605469, -0.11217880249023438, -0.10920143127441406, -0.10622406005859375, -0.10324668884277344, -0.10026931762695312, -0.09729194641113281, -0.0943145751953125, -0.09133720397949219, -0.08835983276367188, -0.08538246154785156, -0.08240509033203125, -0.07942771911621094, -0.07645034790039062, -0.07347297668457031, -0.07049560546875, -0.06751823425292969, -0.06454086303710938, -0.06156349182128906, -0.05858612060546875, -0.05560874938964844, -0.052631378173828125, -0.04965400695800781, -0.0466766357421875, -0.04369926452636719, -0.040721893310546875, -0.03774452209472656, -0.03476715087890625, -0.03178977966308594, -0.028812408447265625, -0.025835037231445312, -0.022857666015625, -0.019880294799804688, -0.016902923583984375, -0.013925552368164062, -0.01094818115234375, -0.007970809936523438, -0.004993438720703125, -0.0020160675048828125, 0.0009613037109375, 0.0039386749267578125, 0.006916046142578125, 0.009893417358398438, 0.01287078857421875, 0.015848159790039062, 0.018825531005859375, 0.021802902221679688, 0.0247802734375]}, "gradients/encoder.encoder.layers.19.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 2.0, 3.0, 2.0, 6.0, 12.0, 15.0, 42.0, 95.0, 175.0, 409.0, 1315.0, 326442.0, 3864130.0, 1101.0, 296.0, 145.0, 58.0, 26.0, 10.0, 7.0, 3.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-2.322265625, -2.2758560180664062, -2.2294464111328125, -2.1830368041992188, -2.136627197265625, -2.0902175903320312, -2.0438079833984375, -1.9973983764648438, -1.95098876953125, -1.9045791625976562, -1.8581695556640625, -1.8117599487304688, -1.765350341796875, -1.7189407348632812, -1.6725311279296875, -1.6261215209960938, -1.5797119140625, -1.5333023071289062, -1.4868927001953125, -1.4404830932617188, -1.394073486328125, -1.3476638793945312, -1.3012542724609375, -1.2548446655273438, -1.20843505859375, -1.1620254516601562, -1.1156158447265625, -1.0692062377929688, -1.022796630859375, -0.9763870239257812, -0.9299774169921875, -0.8835678100585938, -0.837158203125, -0.7907485961914062, -0.7443389892578125, -0.6979293823242188, -0.651519775390625, -0.6051101684570312, -0.5587005615234375, -0.5122909545898438, -0.46588134765625, -0.41947174072265625, -0.3730621337890625, -0.32665252685546875, -0.280242919921875, -0.23383331298828125, -0.1874237060546875, -0.14101409912109375, -0.0946044921875, -0.04819488525390625, -0.0017852783203125, 0.04462432861328125, 0.091033935546875, 0.13744354248046875, 0.1838531494140625, 0.23026275634765625, 0.27667236328125, 0.32308197021484375, 0.3694915771484375, 0.41590118408203125, 0.462310791015625, 0.5087203979492188, 0.5551300048828125, 0.6015396118164062, 0.64794921875]}, "gradients/encoder.encoder.layers.19.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 3.0, 1.0, 5.0, 7.0, 15.0, 30.0, 89.0, 255.0, 3033.0, 434.0, 96.0, 60.0, 34.0, 9.0, 9.0, 4.0, 1.0, 2.0, 1.0], "bins": [-0.235595703125, -0.23120403289794922, -0.22681236267089844, -0.22242069244384766, -0.21802902221679688, -0.2136373519897461, -0.2092456817626953, -0.20485401153564453, -0.20046234130859375, -0.19607067108154297, -0.1916790008544922, -0.1872873306274414, -0.18289566040039062, -0.17850399017333984, -0.17411231994628906, -0.16972064971923828, -0.1653289794921875, -0.16093730926513672, -0.15654563903808594, -0.15215396881103516, -0.14776229858398438, -0.1433706283569336, -0.1389789581298828, -0.13458728790283203, -0.13019561767578125, -0.12580394744873047, -0.12141227722167969, -0.1170206069946289, -0.11262893676757812, -0.10823726654052734, -0.10384559631347656, -0.09945392608642578, -0.095062255859375, -0.09067058563232422, -0.08627891540527344, -0.08188724517822266, -0.07749557495117188, -0.0731039047241211, -0.06871223449707031, -0.06432056427001953, -0.05992889404296875, -0.05553722381591797, -0.05114555358886719, -0.046753883361816406, -0.042362213134765625, -0.037970542907714844, -0.03357887268066406, -0.02918720245361328, -0.0247955322265625, -0.02040386199951172, -0.016012191772460938, -0.011620521545410156, -0.007228851318359375, -0.0028371810913085938, 0.0015544891357421875, 0.005946159362792969, 0.01033782958984375, 0.014729499816894531, 0.019121170043945312, 0.023512840270996094, 0.027904510498046875, 0.032296180725097656, 0.03668785095214844, 0.04107952117919922, 0.04547119140625]}, "gradients/encoder.encoder.layers.19.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 3.0, 3.0, 2.0, 28.0, 350.0, 589.0, 34.0, 3.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.0609214305877686, -0.995305061340332, -0.9296887516975403, -0.8640724420547485, -0.798456072807312, -0.7328397035598755, -0.6672233939170837, -0.601607084274292, -0.5359907150268555, -0.47037437558174133, -0.4047580361366272, -0.33914169669151306, -0.2735253572463989, -0.2079090178012848, -0.14229267835617065, -0.07667633891105652, -0.011059999465942383, 0.05455633997917175, 0.12017267942428589, 0.18578901886940002, 0.25140535831451416, 0.3170216977596283, 0.38263803720474243, 0.44825437664985657, 0.5138707160949707, 0.5794870853424072, 0.645103394985199, 0.7107197046279907, 0.7763360738754272, 0.8419524431228638, 0.9075687527656555, 0.9731850624084473, 1.0388011932373047, 1.1044175624847412, 1.1700339317321777, 1.2356501817703247, 1.3012665510177612, 1.3668829202651978, 1.4324991703033447, 1.4981155395507812, 1.5637319087982178, 1.6293482780456543, 1.6949646472930908, 1.7605808973312378, 1.8261972665786743, 1.8918136358261108, 1.9574298858642578, 2.0230462551116943, 2.088662624359131, 2.1542789936065674, 2.219895362854004, 2.2855117321014404, 2.351128101348877, 2.4167442321777344, 2.482360601425171, 2.5479769706726074, 2.613593339920044, 2.6792097091674805, 2.744826078414917, 2.8104424476623535, 2.876058578491211, 2.9416749477386475, 3.007291316986084, 3.0729076862335205, 3.138524055480957]}, "gradients/encoder.encoder.layers.19.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 1.0, 1.0, 4.0, 2.0, 4.0, 7.0, 11.0, 14.0, 29.0, 40.0, 68.0, 98.0, 86.0, 85.0, 96.0, 101.0, 95.0, 89.0, 58.0, 38.0, 27.0, 28.0, 13.0, 6.0, 2.0, 3.0, 1.0, 3.0, 3.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6604729294776917, -0.6416794657707214, -0.622886061668396, -0.6040925979614258, -0.5852991938591003, -0.5665057301521301, -0.5477123260498047, -0.5289188623428345, -0.5101253986358643, -0.49133196473121643, -0.4725385308265686, -0.4537450969219208, -0.43495166301727295, -0.41615819931030273, -0.3973647654056549, -0.3785713315010071, -0.35977792739868164, -0.3409844934940338, -0.322191059589386, -0.30339762568473816, -0.28460419178009033, -0.2658107280731201, -0.2470172941684723, -0.22822386026382446, -0.20943042635917664, -0.1906369924545288, -0.17184355854988098, -0.15305010974407196, -0.13425667583942413, -0.1154632419347763, -0.09666980057954788, -0.07787635922431946, -0.059082865715026855, -0.04028942808508873, -0.021495990455150604, -0.0027025528252124786, 0.016090884804725647, 0.034884318709373474, 0.0536777600646019, 0.07247120141983032, 0.09126463532447815, 0.11005806922912598, 0.1288515031337738, 0.14764495193958282, 0.16643838584423065, 0.18523181974887848, 0.2040252685546875, 0.22281870245933533, 0.24161213636398315, 0.260405570268631, 0.2791990041732788, 0.29799243807792664, 0.31678587198257446, 0.3355793356895447, 0.3543727695941925, 0.37316620349884033, 0.39195963740348816, 0.410753071308136, 0.4295465052127838, 0.44833993911743164, 0.46713340282440186, 0.4859268069267273, 0.5047202706336975, 0.523513674736023, 0.5423071384429932]}, "gradients/encoder.encoder.layers.19.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 3.0, 0.0, 3.0, 2.0, 3.0, 4.0, 3.0, 5.0, 4.0, 12.0, 15.0, 13.0, 12.0, 18.0, 22.0, 29.0, 37.0, 48.0, 56.0, 97.0, 208.0, 595.0, 2331.0, 20825.0, 638686.0, 368628.0, 13944.0, 1907.0, 499.0, 194.0, 95.0, 50.0, 41.0, 41.0, 28.0, 19.0, 21.0, 14.0, 15.0, 8.0, 5.0, 7.0, 5.0, 4.0, 1.0, 5.0, 0.0, 0.0, 1.0, 3.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-0.356689453125, -0.345855712890625, -0.33502197265625, -0.324188232421875, -0.3133544921875, -0.302520751953125, -0.29168701171875, -0.280853271484375, -0.27001953125, -0.259185791015625, -0.24835205078125, -0.237518310546875, -0.2266845703125, -0.215850830078125, -0.20501708984375, -0.194183349609375, -0.183349609375, -0.172515869140625, -0.16168212890625, -0.150848388671875, -0.1400146484375, -0.129180908203125, -0.11834716796875, -0.107513427734375, -0.0966796875, -0.085845947265625, -0.07501220703125, -0.064178466796875, -0.0533447265625, -0.042510986328125, -0.03167724609375, -0.020843505859375, -0.010009765625, 0.000823974609375, 0.01165771484375, 0.022491455078125, 0.0333251953125, 0.044158935546875, 0.05499267578125, 0.065826416015625, 0.07666015625, 0.087493896484375, 0.09832763671875, 0.109161376953125, 0.1199951171875, 0.130828857421875, 0.14166259765625, 0.152496337890625, 0.163330078125, 0.174163818359375, 0.18499755859375, 0.195831298828125, 0.2066650390625, 0.217498779296875, 0.22833251953125, 0.239166259765625, 0.25, 0.260833740234375, 0.27166748046875, 0.282501220703125, 0.2933349609375, 0.304168701171875, 0.31500244140625, 0.325836181640625, 0.336669921875]}, "gradients/encoder.encoder.layers.19.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 2.0, 1.0, 2.0, 1.0, 0.0, 1.0, 15.0, 14.0, 55.0, 110.0, 147.0, 201.0, 191.0, 120.0, 80.0, 42.0, 22.0, 9.0, 4.0, 1.0, 2.0, 0.0, 1.0], "bins": [-0.1663818359375, -0.16322898864746094, -0.16007614135742188, -0.1569232940673828, -0.15377044677734375, -0.1506175994873047, -0.14746475219726562, -0.14431190490722656, -0.1411590576171875, -0.13800621032714844, -0.13485336303710938, -0.1317005157470703, -0.12854766845703125, -0.1253948211669922, -0.12224197387695312, -0.11908912658691406, -0.115936279296875, -0.11278343200683594, -0.10963058471679688, -0.10647773742675781, -0.10332489013671875, -0.10017204284667969, -0.09701919555664062, -0.09386634826660156, -0.0907135009765625, -0.08756065368652344, -0.08440780639648438, -0.08125495910644531, -0.07810211181640625, -0.07494926452636719, -0.07179641723632812, -0.06864356994628906, -0.06549072265625, -0.06233787536621094, -0.059185028076171875, -0.05603218078613281, -0.05287933349609375, -0.04972648620605469, -0.046573638916015625, -0.04342079162597656, -0.0402679443359375, -0.03711509704589844, -0.033962249755859375, -0.030809402465820312, -0.02765655517578125, -0.024503707885742188, -0.021350860595703125, -0.018198013305664062, -0.015045166015625, -0.011892318725585938, -0.008739471435546875, -0.0055866241455078125, -0.00243377685546875, 0.0007190704345703125, 0.003871917724609375, 0.0070247650146484375, 0.0101776123046875, 0.013330459594726562, 0.016483306884765625, 0.019636154174804688, 0.02278900146484375, 0.025941848754882812, 0.029094696044921875, 0.03224754333496094, 0.035400390625]}, "gradients/encoder.encoder.layers.19.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 2.0, 2.0, 4.0, 8.0, 7.0, 15.0, 11.0, 19.0, 19.0, 35.0, 43.0, 66.0, 90.0, 103.0, 188.0, 358.0, 824.0, 2456.0, 11586.0, 115874.0, 832645.0, 71975.0, 8714.0, 1961.0, 660.0, 307.0, 166.0, 97.0, 74.0, 48.0, 43.0, 37.0, 28.0, 19.0, 18.0, 16.0, 10.0, 7.0, 8.0, 2.0, 2.0, 2.0, 5.0, 3.0, 2.0, 2.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.233642578125, -0.22620582580566406, -0.21876907348632812, -0.2113323211669922, -0.20389556884765625, -0.1964588165283203, -0.18902206420898438, -0.18158531188964844, -0.1741485595703125, -0.16671180725097656, -0.15927505493164062, -0.1518383026123047, -0.14440155029296875, -0.1369647979736328, -0.12952804565429688, -0.12209129333496094, -0.114654541015625, -0.10721778869628906, -0.09978103637695312, -0.09234428405761719, -0.08490753173828125, -0.07747077941894531, -0.07003402709960938, -0.06259727478027344, -0.0551605224609375, -0.04772377014160156, -0.040287017822265625, -0.03285026550292969, -0.02541351318359375, -0.017976760864257812, -0.010540008544921875, -0.0031032562255859375, 0.00433349609375, 0.011770248413085938, 0.019207000732421875, 0.026643753051757812, 0.03408050537109375, 0.04151725769042969, 0.048954010009765625, 0.05639076232910156, 0.0638275146484375, 0.07126426696777344, 0.07870101928710938, 0.08613777160644531, 0.09357452392578125, 0.10101127624511719, 0.10844802856445312, 0.11588478088378906, 0.123321533203125, 0.13075828552246094, 0.13819503784179688, 0.1456317901611328, 0.15306854248046875, 0.1605052947998047, 0.16794204711914062, 0.17537879943847656, 0.1828155517578125, 0.19025230407714844, 0.19768905639648438, 0.2051258087158203, 0.21256256103515625, 0.2199993133544922, 0.22743606567382812, 0.23487281799316406, 0.2423095703125]}, "gradients/encoder.encoder.layers.19.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 4.0, 1.0, 3.0, 4.0, 5.0, 9.0, 13.0, 13.0, 10.0, 23.0, 25.0, 26.0, 30.0, 37.0, 31.0, 39.0, 43.0, 44.0, 50.0, 62.0, 58.0, 66.0, 51.0, 43.0, 43.0, 44.0, 44.0, 28.0, 24.0, 29.0, 22.0, 12.0, 16.0, 13.0, 9.0, 9.0, 3.0, 8.0, 5.0, 3.0, 6.0, 1.0, 2.0, 0.0, 1.0, 0.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.1475830078125, -0.14284706115722656, -0.13811111450195312, -0.1333751678466797, -0.12863922119140625, -0.12390327453613281, -0.11916732788085938, -0.11443138122558594, -0.1096954345703125, -0.10495948791503906, -0.10022354125976562, -0.09548759460449219, -0.09075164794921875, -0.08601570129394531, -0.08127975463867188, -0.07654380798339844, -0.071807861328125, -0.06707191467285156, -0.062335968017578125, -0.05760002136230469, -0.05286407470703125, -0.04812812805175781, -0.043392181396484375, -0.03865623474121094, -0.0339202880859375, -0.029184341430664062, -0.024448394775390625, -0.019712448120117188, -0.01497650146484375, -0.010240554809570312, -0.005504608154296875, -0.0007686614990234375, 0.00396728515625, 0.008703231811523438, 0.013439178466796875, 0.018175125122070312, 0.02291107177734375, 0.027647018432617188, 0.032382965087890625, 0.03711891174316406, 0.0418548583984375, 0.04659080505371094, 0.051326751708984375, 0.05606269836425781, 0.06079864501953125, 0.06553459167480469, 0.07027053833007812, 0.07500648498535156, 0.079742431640625, 0.08447837829589844, 0.08921432495117188, 0.09395027160644531, 0.09868621826171875, 0.10342216491699219, 0.10815811157226562, 0.11289405822753906, 0.1176300048828125, 0.12236595153808594, 0.12710189819335938, 0.1318378448486328, 0.13657379150390625, 0.1413097381591797, 0.14604568481445312, 0.15078163146972656, 0.155517578125]}, "gradients/encoder.encoder.layers.19.attention.k_proj.weight": {"_type": "histogram", "values": [3.0, 0.0, 0.0, 1.0, 1.0, 3.0, 1.0, 2.0, 1.0, 2.0, 1.0, 4.0, 3.0, 6.0, 3.0, 4.0, 8.0, 13.0, 31.0, 24.0, 52.0, 65.0, 109.0, 166.0, 240.0, 418.0, 806.0, 1667.0, 3936.0, 11868.0, 66263.0, 757063.0, 175169.0, 20412.0, 5577.0, 2212.0, 1071.0, 539.0, 302.0, 169.0, 109.0, 66.0, 48.0, 45.0, 22.0, 22.0, 15.0, 5.0, 7.0, 3.0, 3.0, 2.0, 2.0, 2.0, 2.0, 0.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.0914306640625, -0.08854484558105469, -0.08565902709960938, -0.08277320861816406, -0.07988739013671875, -0.07700157165527344, -0.07411575317382812, -0.07122993469238281, -0.0683441162109375, -0.06545829772949219, -0.06257247924804688, -0.05968666076660156, -0.05680084228515625, -0.05391502380371094, -0.051029205322265625, -0.04814338684082031, -0.045257568359375, -0.04237174987792969, -0.039485931396484375, -0.03660011291503906, -0.03371429443359375, -0.030828475952148438, -0.027942657470703125, -0.025056838989257812, -0.0221710205078125, -0.019285202026367188, -0.016399383544921875, -0.013513565063476562, -0.01062774658203125, -0.0077419281005859375, -0.004856109619140625, -0.0019702911376953125, 0.00091552734375, 0.0038013458251953125, 0.006687164306640625, 0.009572982788085938, 0.01245880126953125, 0.015344619750976562, 0.018230438232421875, 0.021116256713867188, 0.0240020751953125, 0.026887893676757812, 0.029773712158203125, 0.03265953063964844, 0.03554534912109375, 0.03843116760253906, 0.041316986083984375, 0.04420280456542969, 0.047088623046875, 0.04997444152832031, 0.052860260009765625, 0.05574607849121094, 0.05863189697265625, 0.06151771545410156, 0.06440353393554688, 0.06728935241699219, 0.0701751708984375, 0.07306098937988281, 0.07594680786132812, 0.07883262634277344, 0.08171844482421875, 0.08460426330566406, 0.08749008178710938, 0.09037590026855469, 0.09326171875]}, "gradients/encoder.encoder.layers.19.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 4.0, 0.0, 1.0, 2.0, 5.0, 6.0, 12.0, 7.0, 24.0, 50.0, 79.0, 129.0, 176.0, 175.0, 116.0, 80.0, 60.0, 30.0, 23.0, 11.0, 13.0, 5.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.6047229766845703e-05, -2.530217170715332e-05, -2.4557113647460938e-05, -2.3812055587768555e-05, -2.3066997528076172e-05, -2.232193946838379e-05, -2.1576881408691406e-05, -2.0831823348999023e-05, -2.008676528930664e-05, -1.9341707229614258e-05, -1.8596649169921875e-05, -1.7851591110229492e-05, -1.710653305053711e-05, -1.6361474990844727e-05, -1.5616416931152344e-05, -1.4871358871459961e-05, -1.4126300811767578e-05, -1.3381242752075195e-05, -1.2636184692382812e-05, -1.189112663269043e-05, -1.1146068572998047e-05, -1.0401010513305664e-05, -9.655952453613281e-06, -8.910894393920898e-06, -8.165836334228516e-06, -7.420778274536133e-06, -6.67572021484375e-06, -5.930662155151367e-06, -5.185604095458984e-06, -4.4405460357666016e-06, -3.6954879760742188e-06, -2.950429916381836e-06, -2.205371856689453e-06, -1.4603137969970703e-06, -7.152557373046875e-07, 2.9802322387695312e-08, 7.748603820800781e-07, 1.519918441772461e-06, 2.2649765014648438e-06, 3.0100345611572266e-06, 3.7550926208496094e-06, 4.500150680541992e-06, 5.245208740234375e-06, 5.990266799926758e-06, 6.735324859619141e-06, 7.4803829193115234e-06, 8.225440979003906e-06, 8.970499038696289e-06, 9.715557098388672e-06, 1.0460615158081055e-05, 1.1205673217773438e-05, 1.195073127746582e-05, 1.2695789337158203e-05, 1.3440847396850586e-05, 1.4185905456542969e-05, 1.4930963516235352e-05, 1.5676021575927734e-05, 1.6421079635620117e-05, 1.71661376953125e-05, 1.7911195755004883e-05, 1.8656253814697266e-05, 1.940131187438965e-05, 2.014636993408203e-05, 2.0891427993774414e-05, 2.1636486053466797e-05]}, "gradients/encoder.encoder.layers.19.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 2.0, 3.0, 2.0, 7.0, 7.0, 4.0, 15.0, 19.0, 31.0, 54.0, 117.0, 188.0, 501.0, 1291.0, 4567.0, 27680.0, 714172.0, 278869.0, 16027.0, 3287.0, 957.0, 405.0, 158.0, 86.0, 52.0, 19.0, 13.0, 10.0, 9.0, 3.0, 5.0, 3.0, 0.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.17236328125, -0.16768836975097656, -0.16301345825195312, -0.1583385467529297, -0.15366363525390625, -0.1489887237548828, -0.14431381225585938, -0.13963890075683594, -0.1349639892578125, -0.13028907775878906, -0.12561416625976562, -0.12093925476074219, -0.11626434326171875, -0.11158943176269531, -0.10691452026367188, -0.10223960876464844, -0.097564697265625, -0.09288978576660156, -0.08821487426757812, -0.08353996276855469, -0.07886505126953125, -0.07419013977050781, -0.06951522827148438, -0.06484031677246094, -0.0601654052734375, -0.05549049377441406, -0.050815582275390625, -0.04614067077636719, -0.04146575927734375, -0.03679084777832031, -0.032115936279296875, -0.027441024780273438, -0.02276611328125, -0.018091201782226562, -0.013416290283203125, -0.008741378784179688, -0.00406646728515625, 0.0006084442138671875, 0.005283355712890625, 0.009958267211914062, 0.0146331787109375, 0.019308090209960938, 0.023983001708984375, 0.028657913208007812, 0.03333282470703125, 0.03800773620605469, 0.042682647705078125, 0.04735755920410156, 0.052032470703125, 0.05670738220214844, 0.061382293701171875, 0.06605720520019531, 0.07073211669921875, 0.07540702819824219, 0.08008193969726562, 0.08475685119628906, 0.0894317626953125, 0.09410667419433594, 0.09878158569335938, 0.10345649719238281, 0.10813140869140625, 0.11280632019042969, 0.11748123168945312, 0.12215614318847656, 0.1268310546875]}, "gradients/encoder.encoder.layers.19.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 2.0, 0.0, 1.0, 1.0, 6.0, 3.0, 7.0, 13.0, 15.0, 12.0, 25.0, 26.0, 44.0, 70.0, 78.0, 108.0, 140.0, 98.0, 100.0, 87.0, 52.0, 41.0, 17.0, 24.0, 10.0, 8.0, 1.0, 7.0, 5.0, 8.0, 2.0, 3.0, 0.0, 0.0, 1.0, 0.0, 3.0], "bins": [-0.0982666015625, -0.09602117538452148, -0.09377574920654297, -0.09153032302856445, -0.08928489685058594, -0.08703947067260742, -0.0847940444946289, -0.08254861831665039, -0.08030319213867188, -0.07805776596069336, -0.07581233978271484, -0.07356691360473633, -0.07132148742675781, -0.0690760612487793, -0.06683063507080078, -0.06458520889282227, -0.06233978271484375, -0.060094356536865234, -0.05784893035888672, -0.0556035041809082, -0.05335807800292969, -0.05111265182495117, -0.048867225646972656, -0.04662179946899414, -0.044376373291015625, -0.04213094711303711, -0.039885520935058594, -0.03764009475708008, -0.03539466857910156, -0.03314924240112305, -0.03090381622314453, -0.028658390045166016, -0.0264129638671875, -0.024167537689208984, -0.02192211151123047, -0.019676685333251953, -0.017431259155273438, -0.015185832977294922, -0.012940406799316406, -0.01069498062133789, -0.008449554443359375, -0.006204128265380859, -0.003958702087402344, -0.0017132759094238281, 0.0005321502685546875, 0.002777576446533203, 0.005023002624511719, 0.007268428802490234, 0.00951385498046875, 0.011759281158447266, 0.014004707336425781, 0.016250133514404297, 0.018495559692382812, 0.020740985870361328, 0.022986412048339844, 0.02523183822631836, 0.027477264404296875, 0.02972269058227539, 0.031968116760253906, 0.03421354293823242, 0.03645896911621094, 0.03870439529418945, 0.04094982147216797, 0.043195247650146484, 0.045440673828125]}, "gradients/encoder.encoder.layers.19.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 5.0, 40.0, 802.0, 162.0, 7.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.3697757720947266, -3.1988203525543213, -3.027864694595337, -2.8569092750549316, -2.6859536170959473, -2.514998197555542, -2.3440427780151367, -2.1730871200561523, -2.002131462097168, -1.8311759233474731, -1.6602203845977783, -1.489264965057373, -1.3183093070983887, -1.1473538875579834, -0.9763983488082886, -0.8054428100585938, -0.6344873905181885, -0.46353185176849365, -0.2925763428211212, -0.12162083387374878, 0.049334704875946045, 0.22029024362564087, 0.3912457227706909, 0.5622012615203857, 0.7331568002700806, 0.9041123390197754, 1.0750678777694702, 1.246023416519165, 1.4169788360595703, 1.5879344940185547, 1.75888991355896, 1.9298454523086548, 2.1008009910583496, 2.271756410598755, 2.4427120685577393, 2.6136674880981445, 2.784623146057129, 2.955578565597534, 3.1265339851379395, 3.297489643096924, 3.468445301055908, 3.6394007205963135, 3.810356378555298, 3.981311798095703, 4.1522674560546875, 4.323223114013672, 4.494178295135498, 4.665133953094482, 4.836089134216309, 5.007044792175293, 5.177999973297119, 5.3489556312561035, 5.519911289215088, 5.690866947174072, 5.861822128295898, 6.032777786254883, 6.203733444213867, 6.374689102172852, 6.545644283294678, 6.716599941253662, 6.8875555992126465, 7.058511257171631, 7.229466438293457, 7.400422096252441, 7.571377754211426]}, "gradients/encoder.encoder.layers.19.layer_norm.bias": {"_type": "histogram", "values": [2.0, 2.0, 0.0, 1.0, 2.0, 3.0, 2.0, 5.0, 2.0, 10.0, 7.0, 16.0, 13.0, 26.0, 22.0, 21.0, 22.0, 35.0, 46.0, 47.0, 46.0, 53.0, 50.0, 61.0, 51.0, 50.0, 60.0, 57.0, 45.0, 43.0, 31.0, 33.0, 32.0, 18.0, 18.0, 23.0, 18.0, 20.0, 4.0, 4.0, 4.0, 7.0, 1.0, 4.0, 0.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7007409930229187, -0.6725161075592041, -0.6442911624908447, -0.6160662770271301, -0.5878413319587708, -0.5596164464950562, -0.5313915014266968, -0.5031666159629822, -0.4749417006969452, -0.4467167854309082, -0.4184918701648712, -0.39026695489883423, -0.36204206943511963, -0.33381712436676025, -0.30559223890304565, -0.27736732363700867, -0.24914240837097168, -0.2209174931049347, -0.1926925778388977, -0.1644676774740219, -0.13624276220798492, -0.10801784694194794, -0.07979294657707214, -0.051568031311035156, -0.02334311604499817, 0.00488179549574852, 0.03310670703649521, 0.0613316148519516, 0.08955653011798859, 0.11778144538402557, 0.14600634574890137, 0.17423126101493835, 0.20245611667633057, 0.23068103194236755, 0.25890594720840454, 0.28713083267211914, 0.3153557777404785, 0.3435806632041931, 0.3718055784702301, 0.4000304937362671, 0.4282554090023041, 0.45648032426834106, 0.48470523953437805, 0.512930154800415, 0.5411550402641296, 0.569379985332489, 0.5976048707962036, 0.625829815864563, 0.6540547013282776, 0.6822795867919922, 0.7105045318603516, 0.7387294173240662, 0.7669543623924255, 0.7951792478561401, 0.8234041929244995, 0.8516290783882141, 0.8798539638519287, 0.9080788493156433, 0.9363037943840027, 0.9645286798477173, 0.9927536249160767, 1.020978569984436, 1.0492033958435059, 1.0774283409118652, 1.1056532859802246]}, "gradients/encoder.encoder.layers.18.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 4.0, 2.0, 6.0, 2.0, 7.0, 4.0, 3.0, 7.0, 6.0, 10.0, 5.0, 7.0, 9.0, 16.0, 15.0, 13.0, 32.0, 32.0, 53.0, 75.0, 144.0, 242.0, 751.0, 2559.0, 16687.0, 3857683.0, 301126.0, 11875.0, 2156.0, 507.0, 161.0, 57.0, 16.0, 9.0, 5.0, 1.0, 2.0, 1.0, 1.0], "bins": [-0.7353515625, -0.7209014892578125, -0.706451416015625, -0.6920013427734375, -0.67755126953125, -0.6631011962890625, -0.648651123046875, -0.6342010498046875, -0.6197509765625, -0.6053009033203125, -0.590850830078125, -0.5764007568359375, -0.56195068359375, -0.5475006103515625, -0.533050537109375, -0.5186004638671875, -0.504150390625, -0.4897003173828125, -0.475250244140625, -0.4608001708984375, -0.44635009765625, -0.4319000244140625, -0.417449951171875, -0.4029998779296875, -0.3885498046875, -0.3740997314453125, -0.359649658203125, -0.3451995849609375, -0.33074951171875, -0.3162994384765625, -0.301849365234375, -0.2873992919921875, -0.27294921875, -0.2584991455078125, -0.244049072265625, -0.2295989990234375, -0.21514892578125, -0.2006988525390625, -0.186248779296875, -0.1717987060546875, -0.1573486328125, -0.1428985595703125, -0.128448486328125, -0.1139984130859375, -0.09954833984375, -0.0850982666015625, -0.070648193359375, -0.0561981201171875, -0.041748046875, -0.0272979736328125, -0.012847900390625, 0.0016021728515625, 0.01605224609375, 0.0305023193359375, 0.044952392578125, 0.0594024658203125, 0.0738525390625, 0.0883026123046875, 0.102752685546875, 0.1172027587890625, 0.13165283203125, 0.1461029052734375, 0.160552978515625, 0.1750030517578125, 0.189453125]}, "gradients/encoder.encoder.layers.18.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 1.0, 2.0, 17.0, 25.0, 57.0, 112.0, 184.0, 163.0, 176.0, 117.0, 82.0, 40.0, 20.0, 8.0, 3.0, 4.0, 3.0, 0.0, 1.0], "bins": [-0.1654052734375, -0.1622624397277832, -0.1591196060180664, -0.1559767723083496, -0.1528339385986328, -0.14969110488891602, -0.14654827117919922, -0.14340543746948242, -0.14026260375976562, -0.13711977005004883, -0.13397693634033203, -0.13083410263061523, -0.12769126892089844, -0.12454843521118164, -0.12140560150146484, -0.11826276779174805, -0.11511993408203125, -0.11197710037231445, -0.10883426666259766, -0.10569143295288086, -0.10254859924316406, -0.09940576553344727, -0.09626293182373047, -0.09312009811401367, -0.08997726440429688, -0.08683443069458008, -0.08369159698486328, -0.08054876327514648, -0.07740592956542969, -0.07426309585571289, -0.0711202621459961, -0.0679774284362793, -0.0648345947265625, -0.0616917610168457, -0.058548927307128906, -0.05540609359741211, -0.05226325988769531, -0.049120426177978516, -0.04597759246826172, -0.04283475875854492, -0.039691925048828125, -0.03654909133911133, -0.03340625762939453, -0.030263423919677734, -0.027120590209960938, -0.02397775650024414, -0.020834922790527344, -0.017692089080810547, -0.01454925537109375, -0.011406421661376953, -0.008263587951660156, -0.005120754241943359, -0.0019779205322265625, 0.0011649131774902344, 0.004307746887207031, 0.007450580596923828, 0.010593414306640625, 0.013736248016357422, 0.01687908172607422, 0.020021915435791016, 0.023164749145507812, 0.02630758285522461, 0.029450416564941406, 0.0325932502746582, 0.035736083984375]}, "gradients/encoder.encoder.layers.18.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 1.0, 6.0, 11.0, 15.0, 28.0, 60.0, 151.0, 420.0, 1389.0, 16149.0, 4160205.0, 13446.0, 1558.0, 432.0, 205.0, 92.0, 51.0, 33.0, 19.0, 7.0, 4.0, 5.0, 0.0, 1.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.70849609375, -0.6795425415039062, -0.6505889892578125, -0.6216354370117188, -0.592681884765625, -0.5637283325195312, -0.5347747802734375, -0.5058212280273438, -0.47686767578125, -0.44791412353515625, -0.4189605712890625, -0.39000701904296875, -0.361053466796875, -0.33209991455078125, -0.3031463623046875, -0.27419281005859375, -0.2452392578125, -0.21628570556640625, -0.1873321533203125, -0.15837860107421875, -0.129425048828125, -0.10047149658203125, -0.0715179443359375, -0.04256439208984375, -0.01361083984375, 0.01534271240234375, 0.0442962646484375, 0.07324981689453125, 0.102203369140625, 0.13115692138671875, 0.1601104736328125, 0.18906402587890625, 0.218017578125, 0.24697113037109375, 0.2759246826171875, 0.30487823486328125, 0.333831787109375, 0.36278533935546875, 0.3917388916015625, 0.42069244384765625, 0.44964599609375, 0.47859954833984375, 0.5075531005859375, 0.5365066528320312, 0.565460205078125, 0.5944137573242188, 0.6233673095703125, 0.6523208618164062, 0.6812744140625, 0.7102279663085938, 0.7391815185546875, 0.7681350708007812, 0.797088623046875, 0.8260421752929688, 0.8549957275390625, 0.8839492797851562, 0.91290283203125, 0.9418563842773438, 0.9708099365234375, 0.9997634887695312, 1.028717041015625, 1.0576705932617188, 1.0866241455078125, 1.1155776977539062, 1.14453125]}, "gradients/encoder.encoder.layers.18.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 3.0, 11.0, 22.0, 47.0, 175.0, 3084.0, 598.0, 86.0, 28.0, 13.0, 6.0, 7.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.108154296875, -0.10207366943359375, -0.0959930419921875, -0.08991241455078125, -0.083831787109375, -0.07775115966796875, -0.0716705322265625, -0.06558990478515625, -0.05950927734375, -0.05342864990234375, -0.0473480224609375, -0.04126739501953125, -0.035186767578125, -0.02910614013671875, -0.0230255126953125, -0.01694488525390625, -0.0108642578125, -0.00478363037109375, 0.0012969970703125, 0.00737762451171875, 0.013458251953125, 0.01953887939453125, 0.0256195068359375, 0.03170013427734375, 0.03778076171875, 0.04386138916015625, 0.0499420166015625, 0.05602264404296875, 0.062103271484375, 0.06818389892578125, 0.0742645263671875, 0.08034515380859375, 0.08642578125, 0.09250640869140625, 0.0985870361328125, 0.10466766357421875, 0.110748291015625, 0.11682891845703125, 0.1229095458984375, 0.12899017333984375, 0.13507080078125, 0.14115142822265625, 0.1472320556640625, 0.15331268310546875, 0.159393310546875, 0.16547393798828125, 0.1715545654296875, 0.17763519287109375, 0.1837158203125, 0.18979644775390625, 0.1958770751953125, 0.20195770263671875, 0.208038330078125, 0.21411895751953125, 0.2201995849609375, 0.22628021240234375, 0.23236083984375, 0.23844146728515625, 0.2445220947265625, 0.25060272216796875, 0.256683349609375, 0.26276397705078125, 0.2688446044921875, 0.27492523193359375, 0.281005859375]}, "gradients/encoder.encoder.layers.18.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 23.0, 978.0, 12.0, 2.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.666889190673828, -8.507675170898438, -8.34846019744873, -8.18924617767334, -8.030031204223633, -7.870817184448242, -7.711602687835693, -7.5523881912231445, -7.393174171447754, -7.233959674835205, -7.074745178222656, -6.915531158447266, -6.756316661834717, -6.597102165222168, -6.437887668609619, -6.27867317199707, -6.1194586753845215, -5.960244178771973, -5.801029682159424, -5.641815662384033, -5.482601165771484, -5.3233866691589355, -5.164172172546387, -5.004957675933838, -4.845743179321289, -4.68652868270874, -4.527314186096191, -4.368100166320801, -4.208885669708252, -4.049671173095703, -3.8904566764831543, -3.7312421798706055, -3.572028636932373, -3.412814140319824, -3.2535998821258545, -3.0943853855133057, -2.935171127319336, -2.775956630706787, -2.6167421340942383, -2.4575276374816895, -2.2983131408691406, -2.139098644256592, -1.979884386062622, -1.8206698894500732, -1.661455512046814, -1.5022411346435547, -1.3430266380310059, -1.1838122606277466, -1.0245980024337769, -0.8653836250305176, -0.7061691880226135, -0.5469547510147095, -0.3877403736114502, -0.22852599620819092, -0.06931155920028687, 0.08990287780761719, 0.24911725521087646, 0.40833166241645813, 0.5675460696220398, 0.7267605066299438, 0.8859748840332031, 1.0451892614364624, 1.2044036388397217, 1.3636181354522705, 1.5228325128555298]}, "gradients/encoder.encoder.layers.18.final_layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 5.0, 8.0, 17.0, 17.0, 45.0, 58.0, 69.0, 101.0, 116.0, 92.0, 130.0, 109.0, 79.0, 61.0, 39.0, 23.0, 13.0, 7.0, 9.0, 6.0, 2.0, 1.0, 4.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.4563528299331665, -0.4409220218658447, -0.42549121379852295, -0.41006040573120117, -0.3946295976638794, -0.3791987895965576, -0.36376798152923584, -0.34833720326423645, -0.3329063951969147, -0.3174755871295929, -0.3020447790622711, -0.28661397099494934, -0.27118316292762756, -0.2557523846626282, -0.2403215616941452, -0.22489076852798462, -0.20945994555950165, -0.19402913749217987, -0.1785983294248581, -0.1631675362586975, -0.14773672819137573, -0.13230592012405396, -0.11687511205673218, -0.101444311439991, -0.08601350337266922, -0.07058269530534744, -0.05515189468860626, -0.039721086621284485, -0.024290282279253006, -0.008859477937221527, 0.00657133013010025, 0.02200213074684143, 0.03743293881416321, 0.05286374315619469, 0.06829454749822617, 0.08372535556554794, 0.09915615618228912, 0.1145869642496109, 0.13001777231693268, 0.14544856548309326, 0.16087937355041504, 0.17631018161773682, 0.1917409896850586, 0.20717179775238037, 0.22260259091854095, 0.23803339898586273, 0.2534642219543457, 0.2688950002193451, 0.28432583808898926, 0.29975664615631104, 0.3151874542236328, 0.3306182622909546, 0.34604907035827637, 0.36147987842559814, 0.3769106864929199, 0.3923414647579193, 0.4077722728252411, 0.42320308089256287, 0.43863388895988464, 0.4540646970272064, 0.4694955050945282, 0.4849262833595276, 0.5003570914268494, 0.5157878994941711, 0.5312187075614929]}, "gradients/encoder.encoder.layers.18.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 4.0, 6.0, 4.0, 4.0, 6.0, 10.0, 21.0, 26.0, 26.0, 42.0, 57.0, 88.0, 204.0, 664.0, 3186.0, 56962.0, 939518.0, 43774.0, 2831.0, 614.0, 222.0, 90.0, 51.0, 47.0, 32.0, 18.0, 17.0, 13.0, 8.0, 5.0, 5.0, 3.0, 3.0, 2.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.415771484375, -0.4046745300292969, -0.39357757568359375, -0.3824806213378906, -0.3713836669921875, -0.3602867126464844, -0.34918975830078125, -0.3380928039550781, -0.326995849609375, -0.3158988952636719, -0.30480194091796875, -0.2937049865722656, -0.2826080322265625, -0.2715110778808594, -0.26041412353515625, -0.24931716918945312, -0.23822021484375, -0.22712326049804688, -0.21602630615234375, -0.20492935180664062, -0.1938323974609375, -0.18273544311523438, -0.17163848876953125, -0.16054153442382812, -0.149444580078125, -0.13834762573242188, -0.12725067138671875, -0.11615371704101562, -0.1050567626953125, -0.09395980834960938, -0.08286285400390625, -0.07176589965820312, -0.0606689453125, -0.049571990966796875, -0.03847503662109375, -0.027378082275390625, -0.0162811279296875, -0.005184173583984375, 0.00591278076171875, 0.017009735107421875, 0.028106689453125, 0.039203643798828125, 0.05030059814453125, 0.061397552490234375, 0.0724945068359375, 0.08359146118164062, 0.09468841552734375, 0.10578536987304688, 0.11688232421875, 0.12797927856445312, 0.13907623291015625, 0.15017318725585938, 0.1612701416015625, 0.17236709594726562, 0.18346405029296875, 0.19456100463867188, 0.205657958984375, 0.21675491333007812, 0.22785186767578125, 0.23894882202148438, 0.2500457763671875, 0.2611427307128906, 0.27223968505859375, 0.2833366394042969, 0.29443359375]}, "gradients/encoder.encoder.layers.18.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 3.0, 7.0, 26.0, 56.0, 95.0, 155.0, 198.0, 169.0, 141.0, 79.0, 45.0, 17.0, 8.0, 2.0, 4.0, 5.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.1605224609375, -0.1573619842529297, -0.15420150756835938, -0.15104103088378906, -0.14788055419921875, -0.14472007751464844, -0.14155960083007812, -0.1383991241455078, -0.1352386474609375, -0.1320781707763672, -0.12891769409179688, -0.12575721740722656, -0.12259674072265625, -0.11943626403808594, -0.11627578735351562, -0.11311531066894531, -0.109954833984375, -0.10679435729980469, -0.10363388061523438, -0.10047340393066406, -0.09731292724609375, -0.09415245056152344, -0.09099197387695312, -0.08783149719238281, -0.0846710205078125, -0.08151054382324219, -0.07835006713867188, -0.07518959045410156, -0.07202911376953125, -0.06886863708496094, -0.06570816040039062, -0.06254768371582031, -0.05938720703125, -0.05622673034667969, -0.053066253662109375, -0.04990577697753906, -0.04674530029296875, -0.04358482360839844, -0.040424346923828125, -0.03726387023925781, -0.0341033935546875, -0.030942916870117188, -0.027782440185546875, -0.024621963500976562, -0.02146148681640625, -0.018301010131835938, -0.015140533447265625, -0.011980056762695312, -0.008819580078125, -0.0056591033935546875, -0.002498626708984375, 0.0006618499755859375, 0.00382232666015625, 0.0069828033447265625, 0.010143280029296875, 0.013303756713867188, 0.0164642333984375, 0.019624710083007812, 0.022785186767578125, 0.025945663452148438, 0.02910614013671875, 0.03226661682128906, 0.035427093505859375, 0.03858757019042969, 0.041748046875]}, "gradients/encoder.encoder.layers.18.attention.v_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 5.0, 5.0, 4.0, 2.0, 9.0, 6.0, 7.0, 17.0, 15.0, 29.0, 34.0, 73.0, 83.0, 155.0, 331.0, 918.0, 3846.0, 33546.0, 841253.0, 156466.0, 8981.0, 1644.0, 521.0, 237.0, 128.0, 87.0, 56.0, 22.0, 26.0, 19.0, 13.0, 9.0, 6.0, 4.0, 3.0, 4.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.3046875, -0.2959136962890625, -0.287139892578125, -0.2783660888671875, -0.26959228515625, -0.2608184814453125, -0.252044677734375, -0.2432708740234375, -0.2344970703125, -0.2257232666015625, -0.216949462890625, -0.2081756591796875, -0.19940185546875, -0.1906280517578125, -0.181854248046875, -0.1730804443359375, -0.164306640625, -0.1555328369140625, -0.146759033203125, -0.1379852294921875, -0.12921142578125, -0.1204376220703125, -0.111663818359375, -0.1028900146484375, -0.0941162109375, -0.0853424072265625, -0.076568603515625, -0.0677947998046875, -0.05902099609375, -0.0502471923828125, -0.041473388671875, -0.0326995849609375, -0.02392578125, -0.0151519775390625, -0.006378173828125, 0.0023956298828125, 0.01116943359375, 0.0199432373046875, 0.028717041015625, 0.0374908447265625, 0.0462646484375, 0.0550384521484375, 0.063812255859375, 0.0725860595703125, 0.08135986328125, 0.0901336669921875, 0.098907470703125, 0.1076812744140625, 0.116455078125, 0.1252288818359375, 0.134002685546875, 0.1427764892578125, 0.15155029296875, 0.1603240966796875, 0.169097900390625, 0.1778717041015625, 0.1866455078125, 0.1954193115234375, 0.204193115234375, 0.2129669189453125, 0.22174072265625, 0.2305145263671875, 0.239288330078125, 0.2480621337890625, 0.2568359375]}, "gradients/encoder.encoder.layers.18.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 1.0, 1.0, 3.0, 4.0, 1.0, 5.0, 6.0, 7.0, 18.0, 11.0, 20.0, 18.0, 28.0, 34.0, 38.0, 54.0, 64.0, 86.0, 70.0, 82.0, 63.0, 78.0, 59.0, 50.0, 49.0, 39.0, 37.0, 22.0, 15.0, 10.0, 12.0, 11.0, 2.0, 2.0, 2.0, 8.0, 2.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.246337890625, -0.23956680297851562, -0.23279571533203125, -0.22602462768554688, -0.2192535400390625, -0.21248245239257812, -0.20571136474609375, -0.19894027709960938, -0.192169189453125, -0.18539810180664062, -0.17862701416015625, -0.17185592651367188, -0.1650848388671875, -0.15831375122070312, -0.15154266357421875, -0.14477157592773438, -0.13800048828125, -0.13122940063476562, -0.12445831298828125, -0.11768722534179688, -0.1109161376953125, -0.10414505004882812, -0.09737396240234375, -0.09060287475585938, -0.083831787109375, -0.07706069946289062, -0.07028961181640625, -0.06351852416992188, -0.0567474365234375, -0.049976348876953125, -0.04320526123046875, -0.036434173583984375, -0.0296630859375, -0.022891998291015625, -0.01612091064453125, -0.009349822998046875, -0.0025787353515625, 0.004192352294921875, 0.01096343994140625, 0.017734527587890625, 0.024505615234375, 0.031276702880859375, 0.03804779052734375, 0.044818878173828125, 0.0515899658203125, 0.058361053466796875, 0.06513214111328125, 0.07190322875976562, 0.07867431640625, 0.08544540405273438, 0.09221649169921875, 0.09898757934570312, 0.1057586669921875, 0.11252975463867188, 0.11930084228515625, 0.12607192993164062, 0.132843017578125, 0.13961410522460938, 0.14638519287109375, 0.15315628051757812, 0.1599273681640625, 0.16669845581054688, 0.17346954345703125, 0.18024063110351562, 0.18701171875]}, "gradients/encoder.encoder.layers.18.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 3.0, 4.0, 4.0, 6.0, 9.0, 7.0, 8.0, 8.0, 16.0, 23.0, 38.0, 48.0, 89.0, 125.0, 194.0, 371.0, 670.0, 1534.0, 3955.0, 14475.0, 87959.0, 660417.0, 239910.0, 27873.0, 6519.0, 2162.0, 939.0, 493.0, 268.0, 147.0, 88.0, 51.0, 30.0, 42.0, 17.0, 11.0, 11.0, 9.0, 13.0, 3.0, 6.0, 2.0, 1.0, 3.0, 0.0, 0.0, 3.0, 1.0, 0.0, 0.0, 3.0, 1.0, 1.0], "bins": [-0.057037353515625, -0.05529356002807617, -0.053549766540527344, -0.051805973052978516, -0.05006217956542969, -0.04831838607788086, -0.04657459259033203, -0.0448307991027832, -0.043087005615234375, -0.04134321212768555, -0.03959941864013672, -0.03785562515258789, -0.03611183166503906, -0.034368038177490234, -0.032624244689941406, -0.030880451202392578, -0.02913665771484375, -0.027392864227294922, -0.025649070739746094, -0.023905277252197266, -0.022161483764648438, -0.02041769027709961, -0.01867389678955078, -0.016930103302001953, -0.015186309814453125, -0.013442516326904297, -0.011698722839355469, -0.00995492935180664, -0.008211135864257812, -0.006467342376708984, -0.004723548889160156, -0.002979755401611328, -0.0012359619140625, 0.0005078315734863281, 0.0022516250610351562, 0.003995418548583984, 0.0057392120361328125, 0.007483005523681641, 0.009226799011230469, 0.010970592498779297, 0.012714385986328125, 0.014458179473876953, 0.01620197296142578, 0.01794576644897461, 0.019689559936523438, 0.021433353424072266, 0.023177146911621094, 0.024920940399169922, 0.02666473388671875, 0.028408527374267578, 0.030152320861816406, 0.031896114349365234, 0.03363990783691406, 0.03538370132446289, 0.03712749481201172, 0.03887128829956055, 0.040615081787109375, 0.0423588752746582, 0.04410266876220703, 0.04584646224975586, 0.04759025573730469, 0.049334049224853516, 0.051077842712402344, 0.05282163619995117, 0.0545654296875]}, "gradients/encoder.encoder.layers.18.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 3.0, 3.0, 0.0, 1.0, 2.0, 4.0, 11.0, 12.0, 20.0, 18.0, 44.0, 39.0, 57.0, 50.0, 93.0, 95.0, 82.0, 67.0, 78.0, 81.0, 53.0, 50.0, 56.0, 34.0, 11.0, 18.0, 14.0, 5.0, 5.0, 3.0, 0.0, 2.0, 1.0, 2.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-1.3172626495361328e-05, -1.2831762433052063e-05, -1.2490898370742798e-05, -1.2150034308433533e-05, -1.1809170246124268e-05, -1.1468306183815002e-05, -1.1127442121505737e-05, -1.0786578059196472e-05, -1.0445713996887207e-05, -1.0104849934577942e-05, -9.763985872268677e-06, -9.423121809959412e-06, -9.082257747650146e-06, -8.741393685340881e-06, -8.400529623031616e-06, -8.059665560722351e-06, -7.718801498413086e-06, -7.377937436103821e-06, -7.037073373794556e-06, -6.6962093114852905e-06, -6.355345249176025e-06, -6.01448118686676e-06, -5.673617124557495e-06, -5.33275306224823e-06, -4.991888999938965e-06, -4.6510249376297e-06, -4.3101608753204346e-06, -3.9692968130111694e-06, -3.6284327507019043e-06, -3.287568688392639e-06, -2.946704626083374e-06, -2.605840563774109e-06, -2.2649765014648438e-06, -1.9241124391555786e-06, -1.5832483768463135e-06, -1.2423843145370483e-06, -9.015202522277832e-07, -5.606561899185181e-07, -2.1979212760925293e-07, 1.210719347000122e-07, 4.6193599700927734e-07, 8.028000593185425e-07, 1.1436641216278076e-06, 1.4845281839370728e-06, 1.8253922462463379e-06, 2.166256308555603e-06, 2.507120370864868e-06, 2.8479844331741333e-06, 3.1888484954833984e-06, 3.5297125577926636e-06, 3.870576620101929e-06, 4.211440682411194e-06, 4.552304744720459e-06, 4.893168807029724e-06, 5.234032869338989e-06, 5.574896931648254e-06, 5.9157609939575195e-06, 6.256625056266785e-06, 6.59748911857605e-06, 6.938353180885315e-06, 7.27921724319458e-06, 7.620081305503845e-06, 7.96094536781311e-06, 8.301809430122375e-06, 8.64267349243164e-06]}, "gradients/encoder.encoder.layers.18.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 3.0, 1.0, 3.0, 5.0, 3.0, 5.0, 12.0, 15.0, 21.0, 34.0, 57.0, 125.0, 219.0, 544.0, 1259.0, 4696.0, 31743.0, 716349.0, 273958.0, 14719.0, 3006.0, 983.0, 391.0, 172.0, 114.0, 38.0, 34.0, 19.0, 12.0, 8.0, 6.0, 4.0, 1.0, 3.0, 3.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0941162109375, -0.09096145629882812, -0.08780670166015625, -0.08465194702148438, -0.0814971923828125, -0.07834243774414062, -0.07518768310546875, -0.07203292846679688, -0.068878173828125, -0.06572341918945312, -0.06256866455078125, -0.059413909912109375, -0.0562591552734375, -0.053104400634765625, -0.04994964599609375, -0.046794891357421875, -0.04364013671875, -0.040485382080078125, -0.03733062744140625, -0.034175872802734375, -0.0310211181640625, -0.027866363525390625, -0.02471160888671875, -0.021556854248046875, -0.018402099609375, -0.015247344970703125, -0.01209259033203125, -0.008937835693359375, -0.0057830810546875, -0.002628326416015625, 0.00052642822265625, 0.003681182861328125, 0.0068359375, 0.009990692138671875, 0.01314544677734375, 0.016300201416015625, 0.0194549560546875, 0.022609710693359375, 0.02576446533203125, 0.028919219970703125, 0.032073974609375, 0.035228729248046875, 0.03838348388671875, 0.041538238525390625, 0.0446929931640625, 0.047847747802734375, 0.05100250244140625, 0.054157257080078125, 0.05731201171875, 0.060466766357421875, 0.06362152099609375, 0.06677627563476562, 0.0699310302734375, 0.07308578491210938, 0.07624053955078125, 0.07939529418945312, 0.082550048828125, 0.08570480346679688, 0.08885955810546875, 0.09201431274414062, 0.0951690673828125, 0.09832382202148438, 0.10147857666015625, 0.10463333129882812, 0.1077880859375]}, "gradients/encoder.encoder.layers.18.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 9.0, 12.0, 11.0, 28.0, 39.0, 71.0, 122.0, 133.0, 159.0, 153.0, 102.0, 55.0, 45.0, 27.0, 11.0, 12.0, 9.0, 2.0, 2.0, 4.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09808349609375, -0.09552478790283203, -0.09296607971191406, -0.0904073715209961, -0.08784866333007812, -0.08528995513916016, -0.08273124694824219, -0.08017253875732422, -0.07761383056640625, -0.07505512237548828, -0.07249641418457031, -0.06993770599365234, -0.06737899780273438, -0.0648202896118164, -0.06226158142089844, -0.05970287322998047, -0.0571441650390625, -0.05458545684814453, -0.05202674865722656, -0.049468040466308594, -0.046909332275390625, -0.044350624084472656, -0.04179191589355469, -0.03923320770263672, -0.03667449951171875, -0.03411579132080078, -0.03155708312988281, -0.028998374938964844, -0.026439666748046875, -0.023880958557128906, -0.021322250366210938, -0.01876354217529297, -0.016204833984375, -0.013646125793457031, -0.011087417602539062, -0.008528709411621094, -0.005970001220703125, -0.0034112930297851562, -0.0008525848388671875, 0.0017061233520507812, 0.00426483154296875, 0.006823539733886719, 0.009382247924804688, 0.011940956115722656, 0.014499664306640625, 0.017058372497558594, 0.019617080688476562, 0.02217578887939453, 0.0247344970703125, 0.02729320526123047, 0.029851913452148438, 0.032410621643066406, 0.034969329833984375, 0.037528038024902344, 0.04008674621582031, 0.04264545440673828, 0.04520416259765625, 0.04776287078857422, 0.05032157897949219, 0.052880287170410156, 0.055438995361328125, 0.057997703552246094, 0.06055641174316406, 0.06311511993408203, 0.065673828125]}, "gradients/encoder.encoder.layers.18.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 3.0, 8.0, 26.0, 186.0, 593.0, 159.0, 34.0, 5.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-4.482228755950928, -4.398406982421875, -4.3145856857299805, -4.230763912200928, -4.146942138671875, -4.063120365142822, -3.9792988300323486, -3.895477294921875, -3.8116555213928223, -3.7278337478637695, -3.644012212753296, -3.5601906776428223, -3.4763689041137695, -3.392547130584717, -3.308725595474243, -3.2249040603637695, -3.141082286834717, -3.057260513305664, -2.9734389781951904, -2.889617443084717, -2.805795669555664, -2.7219738960266113, -2.6381523609161377, -2.554330825805664, -2.4705090522766113, -2.3866872787475586, -2.302865743637085, -2.2190442085266113, -2.1352224349975586, -2.051400661468506, -1.9675791263580322, -1.883757472038269, -1.7999355792999268, -1.7161139249801636, -1.6322922706604004, -1.5484706163406372, -1.464648962020874, -1.3808273077011108, -1.2970056533813477, -1.2131839990615845, -1.1293623447418213, -1.045540690422058, -0.9617190361022949, -0.8778973817825317, -0.7940757274627686, -0.7102540731430054, -0.6264324188232422, -0.542610764503479, -0.4587891101837158, -0.37496745586395264, -0.29114580154418945, -0.20732414722442627, -0.12350249290466309, -0.0396808385848999, 0.04414081573486328, 0.12796247005462646, 0.21178412437438965, 0.29560577869415283, 0.379427433013916, 0.4632490873336792, 0.5470707416534424, 0.6308923959732056, 0.7147140502929688, 0.7985357046127319, 0.8823573589324951]}, "gradients/encoder.encoder.layers.18.layer_norm.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 3.0, 1.0, 3.0, 2.0, 0.0, 3.0, 6.0, 3.0, 6.0, 13.0, 15.0, 10.0, 16.0, 19.0, 30.0, 23.0, 30.0, 34.0, 42.0, 50.0, 36.0, 48.0, 54.0, 42.0, 50.0, 46.0, 51.0, 54.0, 49.0, 33.0, 36.0, 29.0, 26.0, 27.0, 26.0, 25.0, 13.0, 5.0, 9.0, 9.0, 5.0, 3.0, 4.0, 8.0, 8.0, 2.0, 1.0, 3.0, 1.0, 0.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.8090323209762573, -0.7833037376403809, -0.7575752139091492, -0.7318466305732727, -0.706118106842041, -0.6803895235061646, -0.6546609997749329, -0.6289324164390564, -0.6032038927078247, -0.5774753093719482, -0.5517467856407166, -0.5260182023048401, -0.5002896785736084, -0.47456109523773193, -0.44883257150650024, -0.4231039881706238, -0.3973754346370697, -0.3716468811035156, -0.34591832756996155, -0.32018977403640747, -0.2944612205028534, -0.2687326669692993, -0.24300409853458405, -0.21727554500102997, -0.1915469914674759, -0.16581843793392181, -0.14008988440036774, -0.11436132341623306, -0.08863276988267899, -0.06290420889854431, -0.037175655364990234, -0.011447101831436157, 0.01428145170211792, 0.040010005235672, 0.06573855876922607, 0.09146711975336075, 0.11719567328691483, 0.1429242342710495, 0.16865278780460358, 0.19438134133815765, 0.22010989487171173, 0.2458384484052658, 0.2715670168399811, 0.29729557037353516, 0.32302412390708923, 0.3487526774406433, 0.3744812309741974, 0.40020978450775146, 0.42593833804130554, 0.4516668915748596, 0.4773954451084137, 0.5031239986419678, 0.5288525819778442, 0.5545811057090759, 0.5803096890449524, 0.6060382127761841, 0.6317667961120605, 0.657495379447937, 0.6832239031791687, 0.7089524865150452, 0.7346810102462769, 0.7604095935821533, 0.786138117313385, 0.8118667006492615, 0.8375952243804932]}, "gradients/encoder.encoder.layers.17.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 2.0, 0.0, 0.0, 3.0, 3.0, 6.0, 3.0, 5.0, 5.0, 5.0, 8.0, 10.0, 12.0, 15.0, 13.0, 22.0, 16.0, 32.0, 33.0, 71.0, 110.0, 172.0, 419.0, 1057.0, 4405.0, 41249.0, 4062647.0, 75647.0, 6343.0, 1355.0, 374.0, 148.0, 61.0, 15.0, 16.0, 7.0, 0.0, 0.0, 4.0, 1.0, 1.0], "bins": [-0.7412109375, -0.7262611389160156, -0.7113113403320312, -0.6963615417480469, -0.6814117431640625, -0.6664619445800781, -0.6515121459960938, -0.6365623474121094, -0.621612548828125, -0.6066627502441406, -0.5917129516601562, -0.5767631530761719, -0.5618133544921875, -0.5468635559082031, -0.5319137573242188, -0.5169639587402344, -0.50201416015625, -0.4870643615722656, -0.47211456298828125, -0.4571647644042969, -0.4422149658203125, -0.4272651672363281, -0.41231536865234375, -0.3973655700683594, -0.382415771484375, -0.3674659729003906, -0.35251617431640625, -0.3375663757324219, -0.3226165771484375, -0.3076667785644531, -0.29271697998046875, -0.2777671813964844, -0.2628173828125, -0.24786758422851562, -0.23291778564453125, -0.21796798706054688, -0.2030181884765625, -0.18806838989257812, -0.17311859130859375, -0.15816879272460938, -0.143218994140625, -0.12826919555664062, -0.11331939697265625, -0.09836959838867188, -0.0834197998046875, -0.06847000122070312, -0.05352020263671875, -0.038570404052734375, -0.02362060546875, -0.008670806884765625, 0.00627899169921875, 0.021228790283203125, 0.0361785888671875, 0.051128387451171875, 0.06607818603515625, 0.08102798461914062, 0.095977783203125, 0.11092758178710938, 0.12587738037109375, 0.14082717895507812, 0.1557769775390625, 0.17072677612304688, 0.18567657470703125, 0.20062637329101562, 0.215576171875]}, "gradients/encoder.encoder.layers.17.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 3.0, 6.0, 14.0, 30.0, 66.0, 94.0, 141.0, 177.0, 186.0, 121.0, 85.0, 38.0, 27.0, 22.0, 0.0, 5.0, 3.0], "bins": [-0.1629638671875, -0.1599884033203125, -0.157012939453125, -0.1540374755859375, -0.15106201171875, -0.1480865478515625, -0.145111083984375, -0.1421356201171875, -0.13916015625, -0.1361846923828125, -0.133209228515625, -0.1302337646484375, -0.12725830078125, -0.1242828369140625, -0.121307373046875, -0.1183319091796875, -0.1153564453125, -0.1123809814453125, -0.109405517578125, -0.1064300537109375, -0.10345458984375, -0.1004791259765625, -0.097503662109375, -0.0945281982421875, -0.091552734375, -0.0885772705078125, -0.085601806640625, -0.0826263427734375, -0.07965087890625, -0.0766754150390625, -0.073699951171875, -0.0707244873046875, -0.0677490234375, -0.0647735595703125, -0.061798095703125, -0.0588226318359375, -0.05584716796875, -0.0528717041015625, -0.049896240234375, -0.0469207763671875, -0.0439453125, -0.0409698486328125, -0.037994384765625, -0.0350189208984375, -0.03204345703125, -0.0290679931640625, -0.026092529296875, -0.0231170654296875, -0.0201416015625, -0.0171661376953125, -0.014190673828125, -0.0112152099609375, -0.00823974609375, -0.0052642822265625, -0.002288818359375, 0.0006866455078125, 0.003662109375, 0.0066375732421875, 0.009613037109375, 0.0125885009765625, 0.01556396484375, 0.0185394287109375, 0.021514892578125, 0.0244903564453125, 0.0274658203125]}, "gradients/encoder.encoder.layers.17.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 3.0, 3.0, 17.0, 16.0, 29.0, 56.0, 119.0, 237.0, 904.0, 16117.0, 4169253.0, 6611.0, 571.0, 197.0, 81.0, 41.0, 22.0, 11.0, 9.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.72265625, -1.6753387451171875, -1.628021240234375, -1.5807037353515625, -1.53338623046875, -1.4860687255859375, -1.438751220703125, -1.3914337158203125, -1.3441162109375, -1.2967987060546875, -1.249481201171875, -1.2021636962890625, -1.15484619140625, -1.1075286865234375, -1.060211181640625, -1.0128936767578125, -0.965576171875, -0.9182586669921875, -0.870941162109375, -0.8236236572265625, -0.77630615234375, -0.7289886474609375, -0.681671142578125, -0.6343536376953125, -0.5870361328125, -0.5397186279296875, -0.492401123046875, -0.4450836181640625, -0.39776611328125, -0.3504486083984375, -0.303131103515625, -0.2558135986328125, -0.20849609375, -0.1611785888671875, -0.113861083984375, -0.0665435791015625, -0.01922607421875, 0.0280914306640625, 0.075408935546875, 0.1227264404296875, 0.1700439453125, 0.2173614501953125, 0.264678955078125, 0.3119964599609375, 0.35931396484375, 0.4066314697265625, 0.453948974609375, 0.5012664794921875, 0.548583984375, 0.5959014892578125, 0.643218994140625, 0.6905364990234375, 0.73785400390625, 0.7851715087890625, 0.832489013671875, 0.8798065185546875, 0.9271240234375, 0.9744415283203125, 1.021759033203125, 1.0690765380859375, 1.11639404296875, 1.1637115478515625, 1.211029052734375, 1.2583465576171875, 1.3056640625]}, "gradients/encoder.encoder.layers.17.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 2.0, 0.0, 3.0, 1.0, 11.0, 4.0, 5.0, 4.0, 17.0, 18.0, 88.0, 454.0, 3173.0, 223.0, 46.0, 21.0, 10.0, 3.0, 2.0, 5.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.400390625, -0.3899497985839844, -0.37950897216796875, -0.3690681457519531, -0.3586273193359375, -0.3481864929199219, -0.33774566650390625, -0.3273048400878906, -0.316864013671875, -0.3064231872558594, -0.29598236083984375, -0.2855415344238281, -0.2751007080078125, -0.2646598815917969, -0.25421905517578125, -0.24377822875976562, -0.23333740234375, -0.22289657592773438, -0.21245574951171875, -0.20201492309570312, -0.1915740966796875, -0.18113327026367188, -0.17069244384765625, -0.16025161743164062, -0.149810791015625, -0.13936996459960938, -0.12892913818359375, -0.11848831176757812, -0.1080474853515625, -0.09760665893554688, -0.08716583251953125, -0.07672500610351562, -0.0662841796875, -0.055843353271484375, -0.04540252685546875, -0.034961700439453125, -0.0245208740234375, -0.014080047607421875, -0.00363922119140625, 0.006801605224609375, 0.017242431640625, 0.027683258056640625, 0.03812408447265625, 0.048564910888671875, 0.0590057373046875, 0.06944656372070312, 0.07988739013671875, 0.09032821655273438, 0.10076904296875, 0.11120986938476562, 0.12165069580078125, 0.13209152221679688, 0.1425323486328125, 0.15297317504882812, 0.16341400146484375, 0.17385482788085938, 0.184295654296875, 0.19473648071289062, 0.20517730712890625, 0.21561813354492188, 0.2260589599609375, 0.23649978637695312, 0.24694061279296875, 0.2573814392089844, 0.267822265625]}, "gradients/encoder.encoder.layers.17.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 11.0, 113.0, 862.0, 24.0, 3.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.687887191772461, -8.524981498718262, -8.362075805664062, -8.199170112609863, -8.036263465881348, -7.873357772827148, -7.710452079772949, -7.54754638671875, -7.384640693664551, -7.221735000610352, -7.058828830718994, -6.895923137664795, -6.733017444610596, -6.570111274719238, -6.407205581665039, -6.24429988861084, -6.081393718719482, -5.918488025665283, -5.755581855773926, -5.592676162719727, -5.429770469665527, -5.266864776611328, -5.103958606719971, -4.9410529136657715, -4.778146743774414, -4.615241050720215, -4.452334880828857, -4.289429187774658, -4.126523494720459, -3.9636175632476807, -3.8007116317749023, -3.637805938720703, -3.474900245666504, -3.3119943141937256, -3.1490886211395264, -2.986182689666748, -2.823276996612549, -2.6603710651397705, -2.497465133666992, -2.334559440612793, -2.1716535091400146, -2.0087475776672363, -1.845841884613037, -1.6829359531402588, -1.52003014087677, -1.3571243286132812, -1.194218397140503, -1.0313125848770142, -0.8684067726135254, -0.7055009603500366, -0.5425950884819031, -0.3796892464160919, -0.21678340435028076, -0.05387759208679199, 0.10902827978134155, 0.2719341516494751, 0.43483996391296387, 0.5977457761764526, 0.7606516480445862, 0.9235575199127197, 1.0864633321762085, 1.2493691444396973, 1.4122750759124756, 1.5751808881759644, 1.7380867004394531]}, "gradients/encoder.encoder.layers.17.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 6.0, 15.0, 28.0, 32.0, 58.0, 84.0, 92.0, 160.0, 136.0, 131.0, 108.0, 64.0, 46.0, 27.0, 13.0, 5.0, 3.0, 4.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.0474042892456055, -1.012473464012146, -0.9775426387786865, -0.9426117539405823, -0.9076809287071228, -0.8727501034736633, -0.8378192186355591, -0.8028883934020996, -0.7679575681686401, -0.7330267429351807, -0.6980959177017212, -0.6631650328636169, -0.6282342076301575, -0.593303382396698, -0.5583724975585938, -0.5234416723251343, -0.4885108470916748, -0.45358002185821533, -0.41864916682243347, -0.3837183117866516, -0.34878748655319214, -0.31385666131973267, -0.2789258062839508, -0.24399496614933014, -0.20906412601470947, -0.1741332858800888, -0.13920244574546814, -0.10427160561084747, -0.0693407654762268, -0.03440992534160614, 0.0005209147930145264, 0.03545175492763519, 0.07038271427154541, 0.10531355440616608, 0.14024439454078674, 0.1751752346754074, 0.21010607481002808, 0.24503691494464874, 0.2799677550792694, 0.31489861011505127, 0.34982943534851074, 0.3847602605819702, 0.4196911156177521, 0.45462197065353394, 0.4895527958869934, 0.5244836211204529, 0.5594145059585571, 0.5943453311920166, 0.6292761564254761, 0.6642069816589355, 0.699137806892395, 0.7340686917304993, 0.7689995169639587, 0.8039303421974182, 0.8388612270355225, 0.8737920522689819, 0.9087228775024414, 0.9436537027359009, 0.9785845279693604, 1.0135153532028198, 1.0484461784362793, 1.0833771228790283, 1.1183079481124878, 1.1532387733459473, 1.1881695985794067]}, "gradients/encoder.encoder.layers.17.attention.out_proj.weight": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 0.0, 0.0, 5.0, 2.0, 2.0, 2.0, 5.0, 5.0, 6.0, 10.0, 16.0, 20.0, 16.0, 22.0, 27.0, 52.0, 60.0, 79.0, 118.0, 161.0, 268.0, 507.0, 1148.0, 3181.0, 15170.0, 148908.0, 760570.0, 101258.0, 11938.0, 2806.0, 954.0, 462.0, 235.0, 152.0, 99.0, 72.0, 50.0, 48.0, 27.0, 20.0, 19.0, 12.0, 8.0, 10.0, 7.0, 8.0, 5.0, 5.0, 4.0, 3.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0], "bins": [-0.2054443359375, -0.19846343994140625, -0.1914825439453125, -0.18450164794921875, -0.177520751953125, -0.17053985595703125, -0.1635589599609375, -0.15657806396484375, -0.14959716796875, -0.14261627197265625, -0.1356353759765625, -0.12865447998046875, -0.121673583984375, -0.11469268798828125, -0.1077117919921875, -0.10073089599609375, -0.09375, -0.08676910400390625, -0.0797882080078125, -0.07280731201171875, -0.065826416015625, -0.05884552001953125, -0.0518646240234375, -0.04488372802734375, -0.03790283203125, -0.03092193603515625, -0.0239410400390625, -0.01696014404296875, -0.009979248046875, -0.00299835205078125, 0.0039825439453125, 0.01096343994140625, 0.0179443359375, 0.02492523193359375, 0.0319061279296875, 0.03888702392578125, 0.045867919921875, 0.05284881591796875, 0.0598297119140625, 0.06681060791015625, 0.07379150390625, 0.08077239990234375, 0.0877532958984375, 0.09473419189453125, 0.101715087890625, 0.10869598388671875, 0.1156768798828125, 0.12265777587890625, 0.129638671875, 0.13661956787109375, 0.1436004638671875, 0.15058135986328125, 0.157562255859375, 0.16454315185546875, 0.1715240478515625, 0.17850494384765625, 0.18548583984375, 0.19246673583984375, 0.1994476318359375, 0.20642852783203125, 0.213409423828125, 0.22039031982421875, 0.2273712158203125, 0.23435211181640625, 0.2413330078125]}, "gradients/encoder.encoder.layers.17.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 4.0, 3.0, 6.0, 19.0, 37.0, 62.0, 95.0, 173.0, 145.0, 169.0, 132.0, 78.0, 44.0, 24.0, 9.0, 6.0, 5.0, 3.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.134765625, -0.13181447982788086, -0.12886333465576172, -0.12591218948364258, -0.12296104431152344, -0.1200098991394043, -0.11705875396728516, -0.11410760879516602, -0.11115646362304688, -0.10820531845092773, -0.1052541732788086, -0.10230302810668945, -0.09935188293457031, -0.09640073776245117, -0.09344959259033203, -0.09049844741821289, -0.08754730224609375, -0.08459615707397461, -0.08164501190185547, -0.07869386672973633, -0.07574272155761719, -0.07279157638549805, -0.0698404312133789, -0.06688928604125977, -0.06393814086914062, -0.060986995697021484, -0.058035850524902344, -0.0550847053527832, -0.05213356018066406, -0.04918241500854492, -0.04623126983642578, -0.04328012466430664, -0.0403289794921875, -0.03737783432006836, -0.03442668914794922, -0.03147554397583008, -0.028524398803710938, -0.025573253631591797, -0.022622108459472656, -0.019670963287353516, -0.016719818115234375, -0.013768672943115234, -0.010817527770996094, -0.007866382598876953, -0.0049152374267578125, -0.001964092254638672, 0.0009870529174804688, 0.003938198089599609, 0.00688934326171875, 0.00984048843383789, 0.012791633605957031, 0.015742778778076172, 0.018693923950195312, 0.021645069122314453, 0.024596214294433594, 0.027547359466552734, 0.030498504638671875, 0.033449649810791016, 0.036400794982910156, 0.0393519401550293, 0.04230308532714844, 0.04525423049926758, 0.04820537567138672, 0.05115652084350586, 0.054107666015625]}, "gradients/encoder.encoder.layers.17.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 5.0, 3.0, 3.0, 1.0, 1.0, 3.0, 11.0, 6.0, 5.0, 10.0, 14.0, 10.0, 15.0, 31.0, 49.0, 59.0, 96.0, 223.0, 474.0, 1280.0, 3927.0, 15323.0, 76988.0, 483765.0, 389842.0, 59194.0, 12111.0, 3120.0, 1100.0, 411.0, 184.0, 91.0, 63.0, 31.0, 29.0, 24.0, 15.0, 9.0, 8.0, 4.0, 5.0, 5.0, 6.0, 4.0, 4.0, 4.0, 2.0, 1.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1229248046875, -0.11894989013671875, -0.1149749755859375, -0.11100006103515625, -0.107025146484375, -0.10305023193359375, -0.0990753173828125, -0.09510040283203125, -0.09112548828125, -0.08715057373046875, -0.0831756591796875, -0.07920074462890625, -0.075225830078125, -0.07125091552734375, -0.0672760009765625, -0.06330108642578125, -0.059326171875, -0.05535125732421875, -0.0513763427734375, -0.04740142822265625, -0.043426513671875, -0.03945159912109375, -0.0354766845703125, -0.03150177001953125, -0.02752685546875, -0.02355194091796875, -0.0195770263671875, -0.01560211181640625, -0.011627197265625, -0.00765228271484375, -0.0036773681640625, 0.00029754638671875, 0.0042724609375, 0.00824737548828125, 0.0122222900390625, 0.01619720458984375, 0.020172119140625, 0.02414703369140625, 0.0281219482421875, 0.03209686279296875, 0.03607177734375, 0.04004669189453125, 0.0440216064453125, 0.04799652099609375, 0.051971435546875, 0.05594635009765625, 0.0599212646484375, 0.06389617919921875, 0.06787109375, 0.07184600830078125, 0.0758209228515625, 0.07979583740234375, 0.083770751953125, 0.08774566650390625, 0.0917205810546875, 0.09569549560546875, 0.09967041015625, 0.10364532470703125, 0.1076202392578125, 0.11159515380859375, 0.115570068359375, 0.11954498291015625, 0.1235198974609375, 0.12749481201171875, 0.1314697265625]}, "gradients/encoder.encoder.layers.17.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 1.0, 3.0, 4.0, 4.0, 3.0, 7.0, 7.0, 7.0, 13.0, 19.0, 12.0, 17.0, 33.0, 27.0, 30.0, 26.0, 42.0, 36.0, 43.0, 35.0, 45.0, 44.0, 49.0, 42.0, 52.0, 55.0, 28.0, 40.0, 32.0, 43.0, 35.0, 26.0, 27.0, 25.0, 19.0, 18.0, 7.0, 12.0, 11.0, 8.0, 5.0, 7.0, 4.0, 4.0, 2.0, 1.0, 3.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.11383056640625, -0.1100454330444336, -0.10626029968261719, -0.10247516632080078, -0.09869003295898438, -0.09490489959716797, -0.09111976623535156, -0.08733463287353516, -0.08354949951171875, -0.07976436614990234, -0.07597923278808594, -0.07219409942626953, -0.06840896606445312, -0.06462383270263672, -0.06083869934082031, -0.057053565979003906, -0.0532684326171875, -0.049483299255371094, -0.04569816589355469, -0.04191303253173828, -0.038127899169921875, -0.03434276580810547, -0.030557632446289062, -0.026772499084472656, -0.02298736572265625, -0.019202232360839844, -0.015417098999023438, -0.011631965637207031, -0.007846832275390625, -0.004061698913574219, -0.0002765655517578125, 0.0035085678100585938, 0.007293701171875, 0.011078834533691406, 0.014863967895507812, 0.01864910125732422, 0.022434234619140625, 0.02621936798095703, 0.030004501342773438, 0.033789634704589844, 0.03757476806640625, 0.041359901428222656, 0.04514503479003906, 0.04893016815185547, 0.052715301513671875, 0.05650043487548828, 0.06028556823730469, 0.0640707015991211, 0.0678558349609375, 0.0716409683227539, 0.07542610168457031, 0.07921123504638672, 0.08299636840820312, 0.08678150177001953, 0.09056663513183594, 0.09435176849365234, 0.09813690185546875, 0.10192203521728516, 0.10570716857910156, 0.10949230194091797, 0.11327743530273438, 0.11706256866455078, 0.12084770202636719, 0.1246328353881836, 0.12841796875]}, "gradients/encoder.encoder.layers.17.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 2.0, 6.0, 6.0, 7.0, 4.0, 13.0, 27.0, 59.0, 98.0, 201.0, 673.0, 2945.0, 28424.0, 795015.0, 210088.0, 8889.0, 1405.0, 383.0, 157.0, 69.0, 34.0, 21.0, 11.0, 6.0, 7.0, 2.0, 7.0, 1.0, 3.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.1217041015625, -0.1183023452758789, -0.11490058898925781, -0.11149883270263672, -0.10809707641601562, -0.10469532012939453, -0.10129356384277344, -0.09789180755615234, -0.09449005126953125, -0.09108829498291016, -0.08768653869628906, -0.08428478240966797, -0.08088302612304688, -0.07748126983642578, -0.07407951354980469, -0.0706777572631836, -0.0672760009765625, -0.0638742446899414, -0.06047248840332031, -0.05707073211669922, -0.053668975830078125, -0.05026721954345703, -0.04686546325683594, -0.043463706970214844, -0.04006195068359375, -0.036660194396972656, -0.03325843811035156, -0.02985668182373047, -0.026454925537109375, -0.02305316925048828, -0.019651412963867188, -0.016249656677246094, -0.012847900390625, -0.009446144104003906, -0.0060443878173828125, -0.0026426315307617188, 0.000759124755859375, 0.004160881042480469, 0.0075626373291015625, 0.010964393615722656, 0.01436614990234375, 0.017767906188964844, 0.021169662475585938, 0.02457141876220703, 0.027973175048828125, 0.03137493133544922, 0.03477668762207031, 0.038178443908691406, 0.0415802001953125, 0.044981956481933594, 0.04838371276855469, 0.05178546905517578, 0.055187225341796875, 0.05858898162841797, 0.06199073791503906, 0.06539249420166016, 0.06879425048828125, 0.07219600677490234, 0.07559776306152344, 0.07899951934814453, 0.08240127563476562, 0.08580303192138672, 0.08920478820800781, 0.0926065444946289, 0.09600830078125]}, "gradients/encoder.encoder.layers.17.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 5.0, 6.0, 2.0, 6.0, 2.0, 11.0, 7.0, 8.0, 20.0, 13.0, 23.0, 31.0, 56.0, 47.0, 58.0, 65.0, 90.0, 77.0, 66.0, 82.0, 56.0, 71.0, 43.0, 37.0, 33.0, 21.0, 22.0, 12.0, 8.0, 8.0, 4.0, 3.0, 5.0, 3.0, 3.0, 1.0, 1.0, 1.0, 4.0, 3.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-1.0073184967041016e-05, -9.7593292593956e-06, -9.445473551750183e-06, -9.131617844104767e-06, -8.81776213645935e-06, -8.503906428813934e-06, -8.190050721168518e-06, -7.876195013523102e-06, -7.5623393058776855e-06, -7.248483598232269e-06, -6.934627890586853e-06, -6.620772182941437e-06, -6.3069164752960205e-06, -5.993060767650604e-06, -5.679205060005188e-06, -5.365349352359772e-06, -5.0514936447143555e-06, -4.737637937068939e-06, -4.423782229423523e-06, -4.109926521778107e-06, -3.7960708141326904e-06, -3.482215106487274e-06, -3.168359398841858e-06, -2.8545036911964417e-06, -2.5406479835510254e-06, -2.226792275905609e-06, -1.912936568260193e-06, -1.5990808606147766e-06, -1.2852251529693604e-06, -9.71369445323944e-07, -6.575137376785278e-07, -3.4365803003311157e-07, -2.9802322387695312e-08, 2.8405338525772095e-07, 5.979090929031372e-07, 9.117648005485535e-07, 1.2256205081939697e-06, 1.539476215839386e-06, 1.8533319234848022e-06, 2.1671876311302185e-06, 2.4810433387756348e-06, 2.794899046421051e-06, 3.1087547540664673e-06, 3.4226104617118835e-06, 3.7364661693573e-06, 4.050321877002716e-06, 4.364177584648132e-06, 4.678033292293549e-06, 4.991888999938965e-06, 5.305744707584381e-06, 5.619600415229797e-06, 5.933456122875214e-06, 6.24731183052063e-06, 6.561167538166046e-06, 6.875023245811462e-06, 7.188878953456879e-06, 7.502734661102295e-06, 7.816590368747711e-06, 8.130446076393127e-06, 8.444301784038544e-06, 8.75815749168396e-06, 9.072013199329376e-06, 9.385868906974792e-06, 9.699724614620209e-06, 1.0013580322265625e-05]}, "gradients/encoder.encoder.layers.17.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 1.0, 2.0, 2.0, 4.0, 3.0, 1.0, 5.0, 9.0, 9.0, 15.0, 39.0, 56.0, 87.0, 177.0, 392.0, 951.0, 2411.0, 8221.0, 36957.0, 315443.0, 592320.0, 72155.0, 13100.0, 3752.0, 1300.0, 586.0, 246.0, 131.0, 64.0, 42.0, 28.0, 17.0, 10.0, 2.0, 7.0, 10.0, 3.0, 2.0, 2.0, 4.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.08172607421875, -0.07963991165161133, -0.07755374908447266, -0.07546758651733398, -0.07338142395019531, -0.07129526138305664, -0.06920909881591797, -0.0671229362487793, -0.06503677368164062, -0.06295061111450195, -0.06086444854736328, -0.05877828598022461, -0.05669212341308594, -0.054605960845947266, -0.052519798278808594, -0.05043363571166992, -0.04834747314453125, -0.04626131057739258, -0.044175148010253906, -0.042088985443115234, -0.04000282287597656, -0.03791666030883789, -0.03583049774169922, -0.03374433517456055, -0.031658172607421875, -0.029572010040283203, -0.02748584747314453, -0.02539968490600586, -0.023313522338867188, -0.021227359771728516, -0.019141197204589844, -0.017055034637451172, -0.0149688720703125, -0.012882709503173828, -0.010796546936035156, -0.008710384368896484, -0.0066242218017578125, -0.004538059234619141, -0.0024518966674804688, -0.0003657341003417969, 0.001720428466796875, 0.003806591033935547, 0.005892753601074219, 0.00797891616821289, 0.010065078735351562, 0.012151241302490234, 0.014237403869628906, 0.016323566436767578, 0.01840972900390625, 0.020495891571044922, 0.022582054138183594, 0.024668216705322266, 0.026754379272460938, 0.02884054183959961, 0.03092670440673828, 0.03301286697387695, 0.035099029541015625, 0.0371851921081543, 0.03927135467529297, 0.04135751724243164, 0.04344367980957031, 0.045529842376708984, 0.047616004943847656, 0.04970216751098633, 0.051788330078125]}, "gradients/encoder.encoder.layers.17.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 3.0, 1.0, 1.0, 1.0, 5.0, 3.0, 6.0, 13.0, 14.0, 21.0, 18.0, 31.0, 41.0, 56.0, 75.0, 73.0, 102.0, 111.0, 92.0, 74.0, 64.0, 57.0, 31.0, 35.0, 22.0, 16.0, 13.0, 4.0, 8.0, 8.0, 2.0, 2.0, 5.0, 3.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.05865478515625, -0.05712747573852539, -0.05560016632080078, -0.05407285690307617, -0.05254554748535156, -0.05101823806762695, -0.049490928649902344, -0.047963619232177734, -0.046436309814453125, -0.044909000396728516, -0.043381690979003906, -0.0418543815612793, -0.04032707214355469, -0.03879976272583008, -0.03727245330810547, -0.03574514389038086, -0.03421783447265625, -0.03269052505493164, -0.03116321563720703, -0.029635906219482422, -0.028108596801757812, -0.026581287384033203, -0.025053977966308594, -0.023526668548583984, -0.021999359130859375, -0.020472049713134766, -0.018944740295410156, -0.017417430877685547, -0.015890121459960938, -0.014362812042236328, -0.012835502624511719, -0.01130819320678711, -0.0097808837890625, -0.00825357437133789, -0.006726264953613281, -0.005198955535888672, -0.0036716461181640625, -0.002144336700439453, -0.0006170272827148438, 0.0009102821350097656, 0.002437591552734375, 0.003964900970458984, 0.005492210388183594, 0.007019519805908203, 0.008546829223632812, 0.010074138641357422, 0.011601448059082031, 0.01312875747680664, 0.01465606689453125, 0.01618337631225586, 0.01771068572998047, 0.019237995147705078, 0.020765304565429688, 0.022292613983154297, 0.023819923400878906, 0.025347232818603516, 0.026874542236328125, 0.028401851654052734, 0.029929161071777344, 0.03145647048950195, 0.03298377990722656, 0.03451108932495117, 0.03603839874267578, 0.03756570816040039, 0.039093017578125]}, "gradients/encoder.encoder.layers.17.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 8.0, 54.0, 407.0, 452.0, 78.0, 15.0, 4.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.123210906982422, -4.035541534423828, -3.9478721618652344, -3.8602027893066406, -3.7725331783294678, -3.684863805770874, -3.5971944332122803, -3.5095250606536865, -3.4218554496765137, -3.33418607711792, -3.246516704559326, -3.1588473320007324, -3.0711777210235596, -2.983508348464966, -2.895838975906372, -2.8081696033477783, -2.7205002307891846, -2.632830858230591, -2.545161485671997, -2.457491874694824, -2.3698225021362305, -2.2821531295776367, -2.194483757019043, -2.106814384460449, -2.0191450119018555, -1.9314756393432617, -1.8438061475753784, -1.7561367750167847, -1.6684672832489014, -1.5807979106903076, -1.4931285381317139, -1.4054591655731201, -1.3177893161773682, -1.2301199436187744, -1.1424504518508911, -1.0547810792922974, -0.9671116471290588, -0.8794422149658203, -0.7917728424072266, -0.704103410243988, -0.6164339780807495, -0.528764545917511, -0.44109514355659485, -0.3534257411956787, -0.2657563090324402, -0.17808687686920166, -0.09041750431060791, -0.0027480721473693848, 0.08492136001586914, 0.17259077727794647, 0.2602601945400238, 0.34792959690093994, 0.43559902906417847, 0.523268461227417, 0.6109378337860107, 0.6986072659492493, 0.7862766981124878, 0.8739461302757263, 0.9616155624389648, 1.0492849349975586, 1.1369543075561523, 1.2246237993240356, 1.3122931718826294, 1.3999626636505127, 1.4876320362091064]}, "gradients/encoder.encoder.layers.17.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 2.0, 5.0, 11.0, 8.0, 13.0, 24.0, 26.0, 44.0, 37.0, 54.0, 62.0, 73.0, 84.0, 80.0, 98.0, 65.0, 73.0, 57.0, 57.0, 43.0, 24.0, 17.0, 23.0, 14.0, 10.0, 5.0, 5.0, 4.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.0030444860458374, -0.9652947187423706, -0.927544891834259, -0.8897950649261475, -0.8520452976226807, -0.8142955303192139, -0.7765457034111023, -0.7387958765029907, -0.7010461091995239, -0.6632963418960571, -0.6255465149879456, -0.587796688079834, -0.5500469207763672, -0.5122971534729004, -0.4745473265647888, -0.43679752945899963, -0.39904773235321045, -0.36129793524742126, -0.3235481381416321, -0.2857983410358429, -0.2480485439300537, -0.21029874682426453, -0.17254894971847534, -0.13479915261268616, -0.09704935550689697, -0.05929955840110779, -0.021549761295318604, 0.01620003581047058, 0.053949832916259766, 0.09169963002204895, 0.12944942712783813, 0.16719922423362732, 0.20494914054870605, 0.24269893765449524, 0.2804487347602844, 0.3181985318660736, 0.3559483289718628, 0.393698126077652, 0.43144792318344116, 0.46919772028923035, 0.5069475173950195, 0.5446972846984863, 0.5824471116065979, 0.6201969385147095, 0.6579467058181763, 0.6956964731216431, 0.7334463000297546, 0.7711961269378662, 0.808945894241333, 0.8466956615447998, 0.8844454884529114, 0.922195315361023, 0.9599450826644897, 0.9976948499679565, 1.035444736480713, 1.0731945037841797, 1.1109442710876465, 1.1486940383911133, 1.18644380569458, 1.2241936922073364, 1.2619434595108032, 1.29969322681427, 1.3374431133270264, 1.3751928806304932, 1.41294264793396]}, "gradients/encoder.encoder.layers.16.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 3.0, 4.0, 3.0, 1.0, 4.0, 3.0, 3.0, 1.0, 3.0, 3.0, 2.0, 3.0, 10.0, 7.0, 7.0, 17.0, 17.0, 27.0, 14.0, 27.0, 33.0, 44.0, 86.0, 111.0, 175.0, 256.0, 530.0, 1156.0, 3026.0, 10634.0, 104576.0, 4018401.0, 43734.0, 7308.0, 2196.0, 863.0, 415.0, 213.0, 136.0, 80.0, 59.0, 27.0, 29.0, 12.0, 2.0, 11.0, 7.0, 11.0, 3.0, 5.0, 0.0, 2.0], "bins": [-0.35693359375, -0.34869956970214844, -0.3404655456542969, -0.3322315216064453, -0.32399749755859375, -0.3157634735107422, -0.3075294494628906, -0.29929542541503906, -0.2910614013671875, -0.28282737731933594, -0.2745933532714844, -0.2663593292236328, -0.25812530517578125, -0.2498912811279297, -0.24165725708007812, -0.23342323303222656, -0.225189208984375, -0.21695518493652344, -0.20872116088867188, -0.2004871368408203, -0.19225311279296875, -0.1840190887451172, -0.17578506469726562, -0.16755104064941406, -0.1593170166015625, -0.15108299255371094, -0.14284896850585938, -0.1346149444580078, -0.12638092041015625, -0.11814689636230469, -0.10991287231445312, -0.10167884826660156, -0.09344482421875, -0.08521080017089844, -0.07697677612304688, -0.06874275207519531, -0.06050872802734375, -0.05227470397949219, -0.044040679931640625, -0.03580665588378906, -0.0275726318359375, -0.019338607788085938, -0.011104583740234375, -0.0028705596923828125, 0.00536346435546875, 0.013597488403320312, 0.021831512451171875, 0.030065536499023438, 0.038299560546875, 0.04653358459472656, 0.054767608642578125, 0.06300163269042969, 0.07123565673828125, 0.07946968078613281, 0.08770370483398438, 0.09593772888183594, 0.1041717529296875, 0.11240577697753906, 0.12063980102539062, 0.1288738250732422, 0.13710784912109375, 0.1453418731689453, 0.15357589721679688, 0.16180992126464844, 0.1700439453125]}, "gradients/encoder.encoder.layers.16.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 3.0, 1.0, 6.0, 7.0, 33.0, 27.0, 64.0, 81.0, 120.0, 128.0, 134.0, 131.0, 84.0, 73.0, 64.0, 27.0, 9.0, 5.0, 5.0, 6.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.10137939453125, -0.09891510009765625, -0.0964508056640625, -0.09398651123046875, -0.091522216796875, -0.08905792236328125, -0.0865936279296875, -0.08412933349609375, -0.0816650390625, -0.07920074462890625, -0.0767364501953125, -0.07427215576171875, -0.071807861328125, -0.06934356689453125, -0.0668792724609375, -0.06441497802734375, -0.06195068359375, -0.05948638916015625, -0.0570220947265625, -0.05455780029296875, -0.052093505859375, -0.04962921142578125, -0.0471649169921875, -0.04470062255859375, -0.042236328125, -0.03977203369140625, -0.0373077392578125, -0.03484344482421875, -0.032379150390625, -0.02991485595703125, -0.0274505615234375, -0.02498626708984375, -0.02252197265625, -0.02005767822265625, -0.0175933837890625, -0.01512908935546875, -0.012664794921875, -0.01020050048828125, -0.0077362060546875, -0.00527191162109375, -0.0028076171875, -0.00034332275390625, 0.0021209716796875, 0.00458526611328125, 0.007049560546875, 0.00951385498046875, 0.0119781494140625, 0.01444244384765625, 0.01690673828125, 0.01937103271484375, 0.0218353271484375, 0.02429962158203125, 0.026763916015625, 0.02922821044921875, 0.0316925048828125, 0.03415679931640625, 0.03662109375, 0.03908538818359375, 0.0415496826171875, 0.04401397705078125, 0.046478271484375, 0.04894256591796875, 0.0514068603515625, 0.05387115478515625, 0.05633544921875]}, "gradients/encoder.encoder.layers.16.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 3.0, 1.0, 1.0, 3.0, 7.0, 11.0, 10.0, 24.0, 35.0, 45.0, 52.0, 91.0, 142.0, 1502.0, 3781819.0, 408919.0, 1173.0, 150.0, 89.0, 53.0, 58.0, 40.0, 32.0, 15.0, 8.0, 8.0, 3.0, 6.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.57958984375, -0.5490188598632812, -0.5184478759765625, -0.48787689208984375, -0.457305908203125, -0.42673492431640625, -0.3961639404296875, -0.36559295654296875, -0.33502197265625, -0.30445098876953125, -0.2738800048828125, -0.24330902099609375, -0.212738037109375, -0.18216705322265625, -0.1515960693359375, -0.12102508544921875, -0.0904541015625, -0.05988311767578125, -0.0293121337890625, 0.00125885009765625, 0.031829833984375, 0.06240081787109375, 0.0929718017578125, 0.12354278564453125, 0.15411376953125, 0.18468475341796875, 0.2152557373046875, 0.24582672119140625, 0.276397705078125, 0.30696868896484375, 0.3375396728515625, 0.36811065673828125, 0.398681640625, 0.42925262451171875, 0.4598236083984375, 0.49039459228515625, 0.520965576171875, 0.5515365600585938, 0.5821075439453125, 0.6126785278320312, 0.64324951171875, 0.6738204956054688, 0.7043914794921875, 0.7349624633789062, 0.765533447265625, 0.7961044311523438, 0.8266754150390625, 0.8572463989257812, 0.8878173828125, 0.9183883666992188, 0.9489593505859375, 0.9795303344726562, 1.010101318359375, 1.0406723022460938, 1.0712432861328125, 1.1018142700195312, 1.13238525390625, 1.1629562377929688, 1.1935272216796875, 1.2240982055664062, 1.254669189453125, 1.2852401733398438, 1.3158111572265625, 1.3463821411132812, 1.376953125]}, "gradients/encoder.encoder.layers.16.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [2.0, 3.0, 2.0, 6.0, 8.0, 18.0, 38.0, 97.0, 423.0, 3093.0, 259.0, 75.0, 40.0, 13.0, 9.0, 5.0, 3.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.050384521484375, -0.04501962661743164, -0.03965473175048828, -0.03428983688354492, -0.028924942016601562, -0.023560047149658203, -0.018195152282714844, -0.012830257415771484, -0.007465362548828125, -0.0021004676818847656, 0.0032644271850585938, 0.008629322052001953, 0.013994216918945312, 0.019359111785888672, 0.02472400665283203, 0.03008890151977539, 0.03545379638671875, 0.04081869125366211, 0.04618358612060547, 0.05154848098754883, 0.05691337585449219, 0.06227827072143555, 0.0676431655883789, 0.07300806045532227, 0.07837295532226562, 0.08373785018920898, 0.08910274505615234, 0.0944676399230957, 0.09983253479003906, 0.10519742965698242, 0.11056232452392578, 0.11592721939086914, 0.1212921142578125, 0.12665700912475586, 0.13202190399169922, 0.13738679885864258, 0.14275169372558594, 0.1481165885925293, 0.15348148345947266, 0.15884637832641602, 0.16421127319335938, 0.16957616806030273, 0.1749410629272461, 0.18030595779418945, 0.1856708526611328, 0.19103574752807617, 0.19640064239501953, 0.2017655372619629, 0.20713043212890625, 0.2124953269958496, 0.21786022186279297, 0.22322511672973633, 0.2285900115966797, 0.23395490646362305, 0.2393198013305664, 0.24468469619750977, 0.2500495910644531, 0.2554144859313965, 0.26077938079833984, 0.2661442756652832, 0.27150917053222656, 0.2768740653991699, 0.2822389602661133, 0.28760385513305664, 0.29296875]}, "gradients/encoder.encoder.layers.16.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 3.0, 3.0, 6.0, 8.0, 12.0, 57.0, 183.0, 384.0, 232.0, 77.0, 21.0, 9.0, 6.0, 2.0, 1.0, 2.0, 0.0, 2.0, 0.0, 4.0, 1.0, 0.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.8283349275588989, -0.7994133234024048, -0.7704916596412659, -0.7415700554847717, -0.7126483917236328, -0.6837267875671387, -0.6548051834106445, -0.6258835196495056, -0.5969619154930115, -0.5680403113365173, -0.5391186475753784, -0.5101970434188843, -0.48127540946006775, -0.4523537755012512, -0.4234321415424347, -0.39451050758361816, -0.36558887362480164, -0.3366672396659851, -0.3077456057071686, -0.27882397174835205, -0.2499023675918579, -0.22098073363304138, -0.19205909967422485, -0.16313748061656952, -0.134215846657753, -0.10529422014951706, -0.07637259364128113, -0.0474509596824646, -0.018529333174228668, 0.010392293334007263, 0.03931392729282379, 0.06823554635047913, 0.09715718030929565, 0.12607881426811218, 0.15500043332576752, 0.18392206728458405, 0.21284368634223938, 0.2417653203010559, 0.27068695425987244, 0.29960858821868896, 0.3285301923751831, 0.35745182633399963, 0.38637346029281616, 0.4152950644493103, 0.44421669840812683, 0.47313833236694336, 0.5020599365234375, 0.5309816002845764, 0.5599032640457153, 0.5888248682022095, 0.6177465319633484, 0.6466681361198425, 0.6755897998809814, 0.7045114040374756, 0.7334330081939697, 0.7623546719551086, 0.7912762761116028, 0.8201978802680969, 0.8491195440292358, 0.87804114818573, 0.9069628119468689, 0.935884416103363, 0.964806079864502, 0.9937276840209961, 1.0226492881774902]}, "gradients/encoder.encoder.layers.16.final_layer_norm.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 4.0, 3.0, 9.0, 14.0, 22.0, 23.0, 71.0, 66.0, 92.0, 105.0, 121.0, 125.0, 104.0, 89.0, 53.0, 34.0, 30.0, 16.0, 11.0, 7.0, 4.0, 5.0, 2.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.31133735179901123, -0.29565608501434326, -0.2799748182296753, -0.26429352164268494, -0.24861225485801697, -0.232930988073349, -0.21724970638751984, -0.20156842470169067, -0.1858871579170227, -0.17020589113235474, -0.15452460944652557, -0.1388433277606964, -0.12316206097602844, -0.10748078674077988, -0.09179951250553131, -0.07611823827028275, -0.06043696403503418, -0.044755689799785614, -0.02907441556453705, -0.013393141329288483, 0.002288132905960083, 0.01796940714120865, 0.033650681376457214, 0.04933195561170578, 0.06501322984695435, 0.08069450408220291, 0.09637577831745148, 0.11205705255270004, 0.1277383267879486, 0.14341959357261658, 0.15910087525844574, 0.1747821569442749, 0.19046348333358765, 0.20614475011825562, 0.22182603180408478, 0.23750731348991394, 0.2531885802745819, 0.2688698470592499, 0.28455114364624023, 0.3002324104309082, 0.31591367721557617, 0.33159494400024414, 0.3472762107849121, 0.36295750737190247, 0.37863877415657043, 0.3943200409412384, 0.41000133752822876, 0.42568260431289673, 0.4413638710975647, 0.45704513788223267, 0.47272640466690063, 0.488407701253891, 0.5040889978408813, 0.5197702646255493, 0.5354515314102173, 0.5511327981948853, 0.5668140649795532, 0.5824953317642212, 0.5981765985488892, 0.6138578653335571, 0.6295391321182251, 0.6452204585075378, 0.6609017252922058, 0.6765829920768738, 0.6922642588615417]}, "gradients/encoder.encoder.layers.16.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 4.0, 4.0, 8.0, 7.0, 6.0, 9.0, 9.0, 21.0, 18.0, 37.0, 49.0, 53.0, 88.0, 113.0, 181.0, 299.0, 499.0, 984.0, 2718.0, 10997.0, 86169.0, 719441.0, 200331.0, 19668.0, 3910.0, 1325.0, 598.0, 341.0, 212.0, 158.0, 77.0, 74.0, 45.0, 33.0, 23.0, 18.0, 12.0, 11.0, 4.0, 10.0, 4.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.2259521484375, -0.21961593627929688, -0.21327972412109375, -0.20694351196289062, -0.2006072998046875, -0.19427108764648438, -0.18793487548828125, -0.18159866333007812, -0.175262451171875, -0.16892623901367188, -0.16259002685546875, -0.15625381469726562, -0.1499176025390625, -0.14358139038085938, -0.13724517822265625, -0.13090896606445312, -0.12457275390625, -0.11823654174804688, -0.11190032958984375, -0.10556411743164062, -0.0992279052734375, -0.09289169311523438, -0.08655548095703125, -0.08021926879882812, -0.073883056640625, -0.06754684448242188, -0.06121063232421875, -0.054874420166015625, -0.0485382080078125, -0.042201995849609375, -0.03586578369140625, -0.029529571533203125, -0.023193359375, -0.016857147216796875, -0.01052093505859375, -0.004184722900390625, 0.0021514892578125, 0.008487701416015625, 0.01482391357421875, 0.021160125732421875, 0.027496337890625, 0.033832550048828125, 0.04016876220703125, 0.046504974365234375, 0.0528411865234375, 0.059177398681640625, 0.06551361083984375, 0.07184982299804688, 0.07818603515625, 0.08452224731445312, 0.09085845947265625, 0.09719467163085938, 0.1035308837890625, 0.10986709594726562, 0.11620330810546875, 0.12253952026367188, 0.128875732421875, 0.13521194458007812, 0.14154815673828125, 0.14788436889648438, 0.1542205810546875, 0.16055679321289062, 0.16689300537109375, 0.17322921752929688, 0.1795654296875]}, "gradients/encoder.encoder.layers.16.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 3.0, 9.0, 16.0, 26.0, 38.0, 65.0, 87.0, 134.0, 149.0, 149.0, 99.0, 95.0, 62.0, 30.0, 28.0, 6.0, 6.0, 3.0, 1.0, 0.0, 3.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.104248046875, -0.10175132751464844, -0.09925460815429688, -0.09675788879394531, -0.09426116943359375, -0.09176445007324219, -0.08926773071289062, -0.08677101135253906, -0.0842742919921875, -0.08177757263183594, -0.07928085327148438, -0.07678413391113281, -0.07428741455078125, -0.07179069519042969, -0.06929397583007812, -0.06679725646972656, -0.064300537109375, -0.06180381774902344, -0.059307098388671875, -0.05681037902832031, -0.05431365966796875, -0.05181694030761719, -0.049320220947265625, -0.04682350158691406, -0.0443267822265625, -0.04183006286621094, -0.039333343505859375, -0.03683662414550781, -0.03433990478515625, -0.03184318542480469, -0.029346466064453125, -0.026849746704101562, -0.02435302734375, -0.021856307983398438, -0.019359588623046875, -0.016862869262695312, -0.01436614990234375, -0.011869430541992188, -0.009372711181640625, -0.0068759918212890625, -0.0043792724609375, -0.0018825531005859375, 0.000614166259765625, 0.0031108856201171875, 0.00560760498046875, 0.008104324340820312, 0.010601043701171875, 0.013097763061523438, 0.015594482421875, 0.018091201782226562, 0.020587921142578125, 0.023084640502929688, 0.02558135986328125, 0.028078079223632812, 0.030574798583984375, 0.03307151794433594, 0.0355682373046875, 0.03806495666503906, 0.040561676025390625, 0.04305839538574219, 0.04555511474609375, 0.04805183410644531, 0.050548553466796875, 0.05304527282714844, 0.0555419921875]}, "gradients/encoder.encoder.layers.16.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 2.0, 1.0, 1.0, 2.0, 1.0, 10.0, 6.0, 10.0, 14.0, 25.0, 18.0, 34.0, 65.0, 117.0, 259.0, 469.0, 1228.0, 3634.0, 14661.0, 89644.0, 618623.0, 275065.0, 33975.0, 7056.0, 2085.0, 821.0, 347.0, 165.0, 78.0, 41.0, 34.0, 24.0, 13.0, 8.0, 5.0, 6.0, 5.0, 5.0, 5.0, 1.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.151123046875, -0.1467742919921875, -0.142425537109375, -0.1380767822265625, -0.13372802734375, -0.1293792724609375, -0.125030517578125, -0.1206817626953125, -0.1163330078125, -0.1119842529296875, -0.107635498046875, -0.1032867431640625, -0.09893798828125, -0.0945892333984375, -0.090240478515625, -0.0858917236328125, -0.08154296875, -0.0771942138671875, -0.072845458984375, -0.0684967041015625, -0.06414794921875, -0.0597991943359375, -0.055450439453125, -0.0511016845703125, -0.0467529296875, -0.0424041748046875, -0.038055419921875, -0.0337066650390625, -0.02935791015625, -0.0250091552734375, -0.020660400390625, -0.0163116455078125, -0.011962890625, -0.0076141357421875, -0.003265380859375, 0.0010833740234375, 0.00543212890625, 0.0097808837890625, 0.014129638671875, 0.0184783935546875, 0.0228271484375, 0.0271759033203125, 0.031524658203125, 0.0358734130859375, 0.04022216796875, 0.0445709228515625, 0.048919677734375, 0.0532684326171875, 0.0576171875, 0.0619659423828125, 0.066314697265625, 0.0706634521484375, 0.07501220703125, 0.0793609619140625, 0.083709716796875, 0.0880584716796875, 0.0924072265625, 0.0967559814453125, 0.101104736328125, 0.1054534912109375, 0.10980224609375, 0.1141510009765625, 0.118499755859375, 0.1228485107421875, 0.127197265625]}, "gradients/encoder.encoder.layers.16.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 1.0, 3.0, 1.0, 1.0, 3.0, 4.0, 5.0, 10.0, 7.0, 7.0, 11.0, 20.0, 24.0, 18.0, 23.0, 26.0, 38.0, 31.0, 33.0, 34.0, 36.0, 40.0, 27.0, 53.0, 42.0, 50.0, 38.0, 46.0, 29.0, 48.0, 39.0, 35.0, 35.0, 28.0, 31.0, 30.0, 19.0, 16.0, 16.0, 10.0, 7.0, 5.0, 9.0, 4.0, 5.0, 4.0, 2.0, 2.0, 3.0, 4.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.11407470703125, -0.11045169830322266, -0.10682868957519531, -0.10320568084716797, -0.09958267211914062, -0.09595966339111328, -0.09233665466308594, -0.0887136459350586, -0.08509063720703125, -0.0814676284790039, -0.07784461975097656, -0.07422161102294922, -0.07059860229492188, -0.06697559356689453, -0.06335258483886719, -0.059729576110839844, -0.0561065673828125, -0.052483558654785156, -0.04886054992675781, -0.04523754119873047, -0.041614532470703125, -0.03799152374267578, -0.03436851501464844, -0.030745506286621094, -0.02712249755859375, -0.023499488830566406, -0.019876480102539062, -0.01625347137451172, -0.012630462646484375, -0.009007453918457031, -0.0053844451904296875, -0.0017614364624023438, 0.001861572265625, 0.005484580993652344, 0.009107589721679688, 0.012730598449707031, 0.016353607177734375, 0.01997661590576172, 0.023599624633789062, 0.027222633361816406, 0.03084564208984375, 0.034468650817871094, 0.03809165954589844, 0.04171466827392578, 0.045337677001953125, 0.04896068572998047, 0.05258369445800781, 0.056206703186035156, 0.0598297119140625, 0.06345272064208984, 0.06707572937011719, 0.07069873809814453, 0.07432174682617188, 0.07794475555419922, 0.08156776428222656, 0.0851907730102539, 0.08881378173828125, 0.0924367904663086, 0.09605979919433594, 0.09968280792236328, 0.10330581665039062, 0.10692882537841797, 0.11055183410644531, 0.11417484283447266, 0.1177978515625]}, "gradients/encoder.encoder.layers.16.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 1.0, 0.0, 3.0, 4.0, 5.0, 4.0, 9.0, 9.0, 14.0, 12.0, 26.0, 34.0, 43.0, 66.0, 98.0, 171.0, 291.0, 526.0, 1002.0, 2137.0, 4783.0, 12362.0, 42265.0, 211249.0, 552128.0, 166751.0, 35382.0, 10868.0, 4298.0, 1872.0, 935.0, 467.0, 280.0, 160.0, 105.0, 62.0, 39.0, 25.0, 17.0, 20.0, 10.0, 10.0, 5.0, 4.0, 2.0, 5.0, 2.0, 3.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0], "bins": [-0.041656494140625, -0.040410518646240234, -0.03916454315185547, -0.0379185676574707, -0.03667259216308594, -0.03542661666870117, -0.034180641174316406, -0.03293466567993164, -0.031688690185546875, -0.03044271469116211, -0.029196739196777344, -0.027950763702392578, -0.026704788208007812, -0.025458812713623047, -0.02421283721923828, -0.022966861724853516, -0.02172088623046875, -0.020474910736083984, -0.01922893524169922, -0.017982959747314453, -0.016736984252929688, -0.015491008758544922, -0.014245033264160156, -0.01299905776977539, -0.011753082275390625, -0.01050710678100586, -0.009261131286621094, -0.008015155792236328, -0.0067691802978515625, -0.005523204803466797, -0.004277229309082031, -0.0030312538146972656, -0.0017852783203125, -0.0005393028259277344, 0.0007066726684570312, 0.0019526481628417969, 0.0031986236572265625, 0.004444599151611328, 0.005690574645996094, 0.006936550140380859, 0.008182525634765625, 0.00942850112915039, 0.010674476623535156, 0.011920452117919922, 0.013166427612304688, 0.014412403106689453, 0.01565837860107422, 0.016904354095458984, 0.01815032958984375, 0.019396305084228516, 0.02064228057861328, 0.021888256072998047, 0.023134231567382812, 0.024380207061767578, 0.025626182556152344, 0.02687215805053711, 0.028118133544921875, 0.02936410903930664, 0.030610084533691406, 0.03185606002807617, 0.03310203552246094, 0.0343480110168457, 0.03559398651123047, 0.036839962005615234, 0.0380859375]}, "gradients/encoder.encoder.layers.16.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 2.0, 0.0, 1.0, 2.0, 2.0, 2.0, 4.0, 4.0, 7.0, 10.0, 12.0, 17.0, 25.0, 39.0, 64.0, 84.0, 113.0, 122.0, 135.0, 114.0, 79.0, 62.0, 36.0, 26.0, 17.0, 12.0, 8.0, 1.0, 5.0, 2.0, 3.0, 0.0, 4.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-2.199411392211914e-05, -2.145674079656601e-05, -2.091936767101288e-05, -2.0381994545459747e-05, -1.9844621419906616e-05, -1.9307248294353485e-05, -1.8769875168800354e-05, -1.8232502043247223e-05, -1.7695128917694092e-05, -1.715775579214096e-05, -1.662038266658783e-05, -1.60830095410347e-05, -1.5545636415481567e-05, -1.5008263289928436e-05, -1.4470890164375305e-05, -1.3933517038822174e-05, -1.3396143913269043e-05, -1.2858770787715912e-05, -1.232139766216278e-05, -1.178402453660965e-05, -1.1246651411056519e-05, -1.0709278285503387e-05, -1.0171905159950256e-05, -9.634532034397125e-06, -9.097158908843994e-06, -8.559785783290863e-06, -8.022412657737732e-06, -7.485039532184601e-06, -6.94766640663147e-06, -6.410293281078339e-06, -5.8729201555252075e-06, -5.335547029972076e-06, -4.798173904418945e-06, -4.260800778865814e-06, -3.723427653312683e-06, -3.186054527759552e-06, -2.648681402206421e-06, -2.11130827665329e-06, -1.5739351511001587e-06, -1.0365620255470276e-06, -4.991888999938965e-07, 3.818422555923462e-08, 5.755573511123657e-07, 1.1129304766654968e-06, 1.650303602218628e-06, 2.187676727771759e-06, 2.72504985332489e-06, 3.2624229788780212e-06, 3.7997961044311523e-06, 4.3371692299842834e-06, 4.8745423555374146e-06, 5.411915481090546e-06, 5.949288606643677e-06, 6.486661732196808e-06, 7.024034857749939e-06, 7.56140798330307e-06, 8.098781108856201e-06, 8.636154234409332e-06, 9.173527359962463e-06, 9.710900485515594e-06, 1.0248273611068726e-05, 1.0785646736621857e-05, 1.1323019862174988e-05, 1.1860392987728119e-05, 1.239776611328125e-05]}, "gradients/encoder.encoder.layers.16.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 4.0, 2.0, 6.0, 10.0, 5.0, 16.0, 25.0, 30.0, 48.0, 96.0, 133.0, 219.0, 491.0, 1077.0, 2484.0, 7420.0, 28082.0, 182186.0, 668629.0, 125866.0, 21499.0, 6177.0, 2217.0, 912.0, 433.0, 194.0, 117.0, 70.0, 37.0, 30.0, 17.0, 5.0, 5.0, 9.0, 4.0, 2.0, 2.0, 1.0, 2.0, 3.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.053955078125, -0.052056312561035156, -0.05015754699707031, -0.04825878143310547, -0.046360015869140625, -0.04446125030517578, -0.04256248474121094, -0.040663719177246094, -0.03876495361328125, -0.036866188049316406, -0.03496742248535156, -0.03306865692138672, -0.031169891357421875, -0.02927112579345703, -0.027372360229492188, -0.025473594665527344, -0.0235748291015625, -0.021676063537597656, -0.019777297973632812, -0.01787853240966797, -0.015979766845703125, -0.014081001281738281, -0.012182235717773438, -0.010283470153808594, -0.00838470458984375, -0.006485939025878906, -0.0045871734619140625, -0.0026884078979492188, -0.000789642333984375, 0.0011091232299804688, 0.0030078887939453125, 0.004906654357910156, 0.006805419921875, 0.008704185485839844, 0.010602951049804688, 0.012501716613769531, 0.014400482177734375, 0.01629924774169922, 0.018198013305664062, 0.020096778869628906, 0.02199554443359375, 0.023894309997558594, 0.025793075561523438, 0.02769184112548828, 0.029590606689453125, 0.03148937225341797, 0.03338813781738281, 0.035286903381347656, 0.0371856689453125, 0.039084434509277344, 0.04098320007324219, 0.04288196563720703, 0.044780731201171875, 0.04667949676513672, 0.04857826232910156, 0.050477027893066406, 0.05237579345703125, 0.054274559020996094, 0.05617332458496094, 0.05807209014892578, 0.059970855712890625, 0.06186962127685547, 0.06376838684082031, 0.06566715240478516, 0.06756591796875]}, "gradients/encoder.encoder.layers.16.attention.q_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 1.0, 0.0, 1.0, 4.0, 2.0, 0.0, 5.0, 6.0, 4.0, 2.0, 7.0, 8.0, 9.0, 11.0, 7.0, 18.0, 17.0, 25.0, 29.0, 33.0, 52.0, 56.0, 66.0, 70.0, 60.0, 67.0, 61.0, 63.0, 56.0, 41.0, 35.0, 45.0, 26.0, 23.0, 19.0, 14.0, 15.0, 10.0, 9.0, 6.0, 3.0, 5.0, 2.0, 5.0, 1.0, 3.0, 4.0, 2.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 2.0, 0.0, 1.0], "bins": [-0.0296783447265625, -0.02862238883972168, -0.02756643295288086, -0.02651047706604004, -0.02545452117919922, -0.0243985652923584, -0.023342609405517578, -0.022286653518676758, -0.021230697631835938, -0.020174741744995117, -0.019118785858154297, -0.018062829971313477, -0.017006874084472656, -0.015950918197631836, -0.014894962310791016, -0.013839006423950195, -0.012783050537109375, -0.011727094650268555, -0.010671138763427734, -0.009615182876586914, -0.008559226989746094, -0.0075032711029052734, -0.006447315216064453, -0.005391359329223633, -0.0043354034423828125, -0.003279447555541992, -0.002223491668701172, -0.0011675357818603516, -0.00011157989501953125, 0.0009443759918212891, 0.0020003318786621094, 0.0030562877655029297, 0.00411224365234375, 0.00516819953918457, 0.006224155426025391, 0.007280111312866211, 0.008336067199707031, 0.009392023086547852, 0.010447978973388672, 0.011503934860229492, 0.012559890747070312, 0.013615846633911133, 0.014671802520751953, 0.015727758407592773, 0.016783714294433594, 0.017839670181274414, 0.018895626068115234, 0.019951581954956055, 0.021007537841796875, 0.022063493728637695, 0.023119449615478516, 0.024175405502319336, 0.025231361389160156, 0.026287317276000977, 0.027343273162841797, 0.028399229049682617, 0.029455184936523438, 0.030511140823364258, 0.03156709671020508, 0.0326230525970459, 0.03367900848388672, 0.03473496437072754, 0.03579092025756836, 0.03684687614440918, 0.03790283203125]}, "gradients/encoder.encoder.layers.16.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 10.0, 27.0, 87.0, 208.0, 298.0, 232.0, 101.0, 27.0, 13.0, 2.0, 2.0, 0.0, 2.0, 1.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.8476474285125732, -1.8043205738067627, -1.7609935998916626, -1.7176666259765625, -1.674339771270752, -1.6310129165649414, -1.5876859426498413, -1.5443589687347412, -1.5010321140289307, -1.4577052593231201, -1.41437828540802, -1.37105131149292, -1.3277244567871094, -1.2843976020812988, -1.2410706281661987, -1.1977436542510986, -1.154416799545288, -1.1110899448394775, -1.0677629709243774, -1.0244359970092773, -0.9811091423034668, -0.9377822279930115, -0.8944553136825562, -0.8511283993721008, -0.8078014850616455, -0.7644745707511902, -0.7211476564407349, -0.6778207421302795, -0.6344938278198242, -0.5911669135093689, -0.5478399991989136, -0.5045130848884583, -0.4611862897872925, -0.41785937547683716, -0.37453246116638184, -0.3312055468559265, -0.2878786325454712, -0.24455171823501587, -0.20122480392456055, -0.15789788961410522, -0.1145709753036499, -0.07124406099319458, -0.027917146682739258, 0.015409767627716064, 0.05873668193817139, 0.10206359624862671, 0.14539051055908203, 0.18871742486953735, 0.23204433917999268, 0.275371253490448, 0.3186981678009033, 0.36202508211135864, 0.40535199642181396, 0.4486789107322693, 0.4920058250427246, 0.5353327393531799, 0.5786596536636353, 0.6219865679740906, 0.6653134822845459, 0.7086403965950012, 0.7519673109054565, 0.7952942252159119, 0.8386211395263672, 0.8819480538368225, 0.9252749681472778]}, "gradients/encoder.encoder.layers.16.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 2.0, 3.0, 6.0, 5.0, 15.0, 13.0, 10.0, 21.0, 36.0, 34.0, 36.0, 43.0, 59.0, 52.0, 70.0, 72.0, 80.0, 67.0, 55.0, 44.0, 47.0, 45.0, 34.0, 40.0, 21.0, 35.0, 24.0, 11.0, 9.0, 9.0, 6.0, 7.0, 5.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.8110131025314331, -0.7816190719604492, -0.7522250413894653, -0.7228310704231262, -0.6934370398521423, -0.6640430092811584, -0.6346489787101746, -0.6052550077438354, -0.5758609771728516, -0.5464669466018677, -0.5170729160308838, -0.4876789152622223, -0.4582849144935608, -0.4288908839225769, -0.399496853351593, -0.3701028525829315, -0.34070882201194763, -0.31131479144096375, -0.28192079067230225, -0.25252676010131836, -0.22313275933265686, -0.19373872876167297, -0.16434471309185028, -0.1349506974220276, -0.1055566817522049, -0.0761626660823822, -0.04676864668726921, -0.01737462729215622, 0.012019388377666473, 0.04141341149806976, 0.07080742716789246, 0.10020144283771515, 0.12959545850753784, 0.15898947417736053, 0.18838348984718323, 0.21777752041816711, 0.2471715211868286, 0.2765655517578125, 0.3059595823287964, 0.3353535830974579, 0.3647475838661194, 0.39414161443710327, 0.42353561520576477, 0.45292964577674866, 0.48232364654541016, 0.511717677116394, 0.5411117076873779, 0.5705057382583618, 0.5998997688293457, 0.6292937994003296, 0.6586878299713135, 0.6880818009376526, 0.7174758315086365, 0.7468698620796204, 0.7762638926506042, 0.8056578636169434, 0.8350518941879272, 0.8644459247589111, 0.893839955329895, 0.9232339262962341, 0.952627956867218, 0.9820219874382019, 1.011415958404541, 1.040809988975525, 1.0702040195465088]}, "gradients/encoder.encoder.layers.15.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 3.0, 1.0, 0.0, 4.0, 4.0, 8.0, 22.0, 65.0, 166.0, 586.0, 9580.0, 4179122.0, 4145.0, 416.0, 120.0, 27.0, 10.0, 6.0, 5.0, 4.0, 1.0, 0.0, 1.0, 1.0], "bins": [-1.5615234375, -1.5311431884765625, -1.500762939453125, -1.4703826904296875, -1.44000244140625, -1.4096221923828125, -1.379241943359375, -1.3488616943359375, -1.3184814453125, -1.2881011962890625, -1.257720947265625, -1.2273406982421875, -1.19696044921875, -1.1665802001953125, -1.136199951171875, -1.1058197021484375, -1.075439453125, -1.0450592041015625, -1.014678955078125, -0.9842987060546875, -0.95391845703125, -0.9235382080078125, -0.893157958984375, -0.8627777099609375, -0.8323974609375, -0.8020172119140625, -0.771636962890625, -0.7412567138671875, -0.71087646484375, -0.6804962158203125, -0.650115966796875, -0.6197357177734375, -0.58935546875, -0.5589752197265625, -0.528594970703125, -0.4982147216796875, -0.46783447265625, -0.4374542236328125, -0.407073974609375, -0.3766937255859375, -0.3463134765625, -0.3159332275390625, -0.285552978515625, -0.2551727294921875, -0.22479248046875, -0.1944122314453125, -0.164031982421875, -0.1336517333984375, -0.103271484375, -0.0728912353515625, -0.042510986328125, -0.0121307373046875, 0.01824951171875, 0.0486297607421875, 0.079010009765625, 0.1093902587890625, 0.1397705078125, 0.1701507568359375, 0.200531005859375, 0.2309112548828125, 0.26129150390625, 0.2916717529296875, 0.322052001953125, 0.3524322509765625, 0.3828125]}, "gradients/encoder.encoder.layers.15.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 1.0, 2.0, 2.0, 15.0, 13.0, 20.0, 46.0, 85.0, 82.0, 128.0, 126.0, 140.0, 118.0, 89.0, 54.0, 47.0, 20.0, 9.0, 5.0, 3.0, 4.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.10101318359375, -0.09853029251098633, -0.09604740142822266, -0.09356451034545898, -0.09108161926269531, -0.08859872817993164, -0.08611583709716797, -0.0836329460144043, -0.08115005493164062, -0.07866716384887695, -0.07618427276611328, -0.07370138168334961, -0.07121849060058594, -0.06873559951782227, -0.0662527084350586, -0.06376981735229492, -0.06128692626953125, -0.05880403518676758, -0.056321144104003906, -0.053838253021240234, -0.05135536193847656, -0.04887247085571289, -0.04638957977294922, -0.04390668869018555, -0.041423797607421875, -0.0389409065246582, -0.03645801544189453, -0.03397512435913086, -0.03149223327636719, -0.029009342193603516, -0.026526451110839844, -0.024043560028076172, -0.0215606689453125, -0.019077777862548828, -0.016594886779785156, -0.014111995697021484, -0.011629104614257812, -0.00914621353149414, -0.006663322448730469, -0.004180431365966797, -0.001697540283203125, 0.0007853507995605469, 0.0032682418823242188, 0.005751132965087891, 0.008234024047851562, 0.010716915130615234, 0.013199806213378906, 0.015682697296142578, 0.01816558837890625, 0.020648479461669922, 0.023131370544433594, 0.025614261627197266, 0.028097152709960938, 0.03058004379272461, 0.03306293487548828, 0.03554582595825195, 0.038028717041015625, 0.0405116081237793, 0.04299449920654297, 0.04547739028930664, 0.04796028137207031, 0.050443172454833984, 0.052926063537597656, 0.05540895462036133, 0.057891845703125]}, "gradients/encoder.encoder.layers.15.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 4.0, 3.0, 7.0, 6.0, 8.0, 15.0, 23.0, 43.0, 60.0, 83.0, 213.0, 1144.0, 9877.0, 2835204.0, 1336989.0, 9052.0, 1054.0, 224.0, 107.0, 68.0, 37.0, 27.0, 19.0, 13.0, 9.0, 6.0, 4.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.289306640625, -0.2748298645019531, -0.26035308837890625, -0.24587631225585938, -0.2313995361328125, -0.21692276000976562, -0.20244598388671875, -0.18796920776367188, -0.173492431640625, -0.15901565551757812, -0.14453887939453125, -0.13006210327148438, -0.1155853271484375, -0.10110855102539062, -0.08663177490234375, -0.07215499877929688, -0.05767822265625, -0.043201446533203125, -0.02872467041015625, -0.014247894287109375, 0.0002288818359375, 0.014705657958984375, 0.02918243408203125, 0.043659210205078125, 0.058135986328125, 0.07261276245117188, 0.08708953857421875, 0.10156631469726562, 0.1160430908203125, 0.13051986694335938, 0.14499664306640625, 0.15947341918945312, 0.1739501953125, 0.18842697143554688, 0.20290374755859375, 0.21738052368164062, 0.2318572998046875, 0.24633407592773438, 0.26081085205078125, 0.2752876281738281, 0.289764404296875, 0.3042411804199219, 0.31871795654296875, 0.3331947326660156, 0.3476715087890625, 0.3621482849121094, 0.37662506103515625, 0.3911018371582031, 0.40557861328125, 0.4200553894042969, 0.43453216552734375, 0.4490089416503906, 0.4634857177734375, 0.4779624938964844, 0.49243927001953125, 0.5069160461425781, 0.521392822265625, 0.5358695983886719, 0.5503463745117188, 0.5648231506347656, 0.5792999267578125, 0.5937767028808594, 0.6082534790039062, 0.6227302551269531, 0.63720703125]}, "gradients/encoder.encoder.layers.15.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 1.0, 2.0, 4.0, 4.0, 8.0, 10.0, 13.0, 27.0, 40.0, 94.0, 278.0, 2359.0, 906.0, 174.0, 76.0, 46.0, 20.0, 8.0, 5.0, 5.0, 5.0, 2.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0633544921875, -0.059078216552734375, -0.05480194091796875, -0.050525665283203125, -0.0462493896484375, -0.041973114013671875, -0.03769683837890625, -0.033420562744140625, -0.029144287109375, -0.024868011474609375, -0.02059173583984375, -0.016315460205078125, -0.0120391845703125, -0.007762908935546875, -0.00348663330078125, 0.000789642333984375, 0.00506591796875, 0.009342193603515625, 0.01361846923828125, 0.017894744873046875, 0.0221710205078125, 0.026447296142578125, 0.03072357177734375, 0.034999847412109375, 0.039276123046875, 0.043552398681640625, 0.04782867431640625, 0.052104949951171875, 0.0563812255859375, 0.060657501220703125, 0.06493377685546875, 0.06921005249023438, 0.073486328125, 0.07776260375976562, 0.08203887939453125, 0.08631515502929688, 0.0905914306640625, 0.09486770629882812, 0.09914398193359375, 0.10342025756835938, 0.107696533203125, 0.11197280883789062, 0.11624908447265625, 0.12052536010742188, 0.1248016357421875, 0.12907791137695312, 0.13335418701171875, 0.13763046264648438, 0.14190673828125, 0.14618301391601562, 0.15045928955078125, 0.15473556518554688, 0.1590118408203125, 0.16328811645507812, 0.16756439208984375, 0.17184066772460938, 0.176116943359375, 0.18039321899414062, 0.18466949462890625, 0.18894577026367188, 0.1932220458984375, 0.19749832153320312, 0.20177459716796875, 0.20605087280273438, 0.2103271484375]}, "gradients/encoder.encoder.layers.15.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 115.0, 891.0, 11.0, 1.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.4221875667572021, -1.2685809135437012, -1.1149742603302002, -0.9613674879074097, -0.8077608346939087, -0.6541541814804077, -0.500547468662262, -0.3469407558441162, -0.19333410263061523, -0.03972741961479187, 0.1138792634010315, 0.26748594641685486, 0.4210926294326782, 0.5746992826461792, 0.728305995464325, 0.8819127082824707, 1.0355193614959717, 1.1891260147094727, 1.3427326679229736, 1.4963394403457642, 1.6499460935592651, 1.8035527467727661, 1.9571595191955566, 2.1107661724090576, 2.2643728256225586, 2.4179794788360596, 2.5715861320495605, 2.7251927852630615, 2.8787994384765625, 3.0324063301086426, 3.1860129833221436, 3.3396196365356445, 3.4932260513305664, 3.6468327045440674, 3.8004393577575684, 3.9540460109710693, 4.10765266418457, 4.26125955581665, 4.414865970611572, 4.568472862243652, 4.722079277038574, 4.875686168670654, 5.029292583465576, 5.182899475097656, 5.336505889892578, 5.490112781524658, 5.64371919631958, 5.79732608795166, 5.95093297958374, 6.10453987121582, 6.258146286010742, 6.411753177642822, 6.565359592437744, 6.718966484069824, 6.872572898864746, 7.026179790496826, 7.179786682128906, 7.333393573760986, 7.486999988555908, 7.640606880187988, 7.79421329498291, 7.94782018661499, 8.10142707824707, 8.255033493041992, 8.408639907836914]}, "gradients/encoder.encoder.layers.15.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 1.0, 3.0, 7.0, 6.0, 10.0, 12.0, 18.0, 29.0, 42.0, 43.0, 60.0, 65.0, 64.0, 79.0, 66.0, 98.0, 66.0, 79.0, 52.0, 52.0, 45.0, 33.0, 23.0, 20.0, 12.0, 10.0, 8.0, 4.0, 4.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.3687801957130432, -0.35645145177841187, -0.34412267804145813, -0.3317939341068268, -0.31946519017219543, -0.3071364164352417, -0.29480767250061035, -0.282478928565979, -0.27015018463134766, -0.2578214406967163, -0.24549268186092377, -0.23316392302513123, -0.22083517909049988, -0.20850642025470734, -0.1961776614189148, -0.18384891748428345, -0.1715201437473297, -0.15919138491153717, -0.14686264097690582, -0.13453388214111328, -0.12220513075590134, -0.10987637937068939, -0.09754762053489685, -0.0852188691496849, -0.07289011776447296, -0.06056136637926102, -0.048232611268758774, -0.03590385615825653, -0.023575104773044586, -0.011246353387832642, 0.0010824054479599, 0.013411156833171844, 0.02573990821838379, 0.038068659603595734, 0.05039741471409798, 0.06272616982460022, 0.07505492120981216, 0.08738367259502411, 0.09971243143081665, 0.1120411828160286, 0.12436993420124054, 0.13669869303703308, 0.14902743697166443, 0.16135619580745697, 0.1736849546432495, 0.18601369857788086, 0.1983424574136734, 0.21067121624946594, 0.2229999601840973, 0.23532871901988983, 0.24765746295452118, 0.2599862217903137, 0.27231496572494507, 0.2846437096595764, 0.29697248339653015, 0.3093012273311615, 0.32163000106811523, 0.3339587450027466, 0.3462875187397003, 0.35861626267433167, 0.370945006608963, 0.38327378034591675, 0.3956025242805481, 0.40793126821517944, 0.4202600121498108]}, "gradients/encoder.encoder.layers.15.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 4.0, 2.0, 4.0, 1.0, 3.0, 1.0, 9.0, 5.0, 7.0, 13.0, 26.0, 34.0, 47.0, 68.0, 107.0, 202.0, 348.0, 748.0, 1795.0, 6234.0, 43119.0, 635473.0, 330491.0, 22889.0, 4269.0, 1356.0, 547.0, 302.0, 150.0, 100.0, 59.0, 41.0, 36.0, 24.0, 19.0, 3.0, 6.0, 5.0, 8.0, 3.0, 4.0, 3.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0], "bins": [-0.21435546875, -0.2080249786376953, -0.20169448852539062, -0.19536399841308594, -0.18903350830078125, -0.18270301818847656, -0.17637252807617188, -0.1700420379638672, -0.1637115478515625, -0.1573810577392578, -0.15105056762695312, -0.14472007751464844, -0.13838958740234375, -0.13205909729003906, -0.12572860717773438, -0.11939811706542969, -0.113067626953125, -0.10673713684082031, -0.10040664672851562, -0.09407615661621094, -0.08774566650390625, -0.08141517639160156, -0.07508468627929688, -0.06875419616699219, -0.0624237060546875, -0.05609321594238281, -0.049762725830078125, -0.04343223571777344, -0.03710174560546875, -0.030771255493164062, -0.024440765380859375, -0.018110275268554688, -0.01177978515625, -0.0054492950439453125, 0.000881195068359375, 0.0072116851806640625, 0.01354217529296875, 0.019872665405273438, 0.026203155517578125, 0.03253364562988281, 0.0388641357421875, 0.04519462585449219, 0.051525115966796875, 0.05785560607910156, 0.06418609619140625, 0.07051658630371094, 0.07684707641601562, 0.08317756652832031, 0.089508056640625, 0.09583854675292969, 0.10216903686523438, 0.10849952697753906, 0.11483001708984375, 0.12116050720214844, 0.12749099731445312, 0.1338214874267578, 0.1401519775390625, 0.1464824676513672, 0.15281295776367188, 0.15914344787597656, 0.16547393798828125, 0.17180442810058594, 0.17813491821289062, 0.1844654083251953, 0.1907958984375]}, "gradients/encoder.encoder.layers.15.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 3.0, 6.0, 8.0, 19.0, 33.0, 43.0, 61.0, 96.0, 119.0, 110.0, 123.0, 117.0, 89.0, 71.0, 46.0, 26.0, 15.0, 11.0, 5.0, 4.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09368896484375, -0.09134244918823242, -0.08899593353271484, -0.08664941787719727, -0.08430290222167969, -0.08195638656616211, -0.07960987091064453, -0.07726335525512695, -0.07491683959960938, -0.0725703239440918, -0.07022380828857422, -0.06787729263305664, -0.06553077697753906, -0.06318426132202148, -0.060837745666503906, -0.05849123001098633, -0.05614471435546875, -0.05379819869995117, -0.051451683044433594, -0.049105167388916016, -0.04675865173339844, -0.04441213607788086, -0.04206562042236328, -0.0397191047668457, -0.037372589111328125, -0.03502607345581055, -0.03267955780029297, -0.03033304214477539, -0.027986526489257812, -0.025640010833740234, -0.023293495178222656, -0.020946979522705078, -0.0186004638671875, -0.016253948211669922, -0.013907432556152344, -0.011560916900634766, -0.009214401245117188, -0.006867885589599609, -0.004521369934082031, -0.002174854278564453, 0.000171661376953125, 0.002518177032470703, 0.004864692687988281, 0.007211208343505859, 0.009557723999023438, 0.011904239654541016, 0.014250755310058594, 0.016597270965576172, 0.01894378662109375, 0.021290302276611328, 0.023636817932128906, 0.025983333587646484, 0.028329849243164062, 0.03067636489868164, 0.03302288055419922, 0.0353693962097168, 0.037715911865234375, 0.04006242752075195, 0.04240894317626953, 0.04475545883178711, 0.04710197448730469, 0.049448490142822266, 0.051795005798339844, 0.05414152145385742, 0.056488037109375]}, "gradients/encoder.encoder.layers.15.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 2.0, 1.0, 2.0, 5.0, 5.0, 5.0, 6.0, 7.0, 12.0, 21.0, 18.0, 36.0, 42.0, 79.0, 121.0, 210.0, 437.0, 960.0, 2647.0, 11056.0, 73564.0, 588381.0, 326316.0, 35064.0, 6181.0, 1869.0, 692.0, 345.0, 190.0, 95.0, 47.0, 48.0, 25.0, 24.0, 6.0, 11.0, 7.0, 6.0, 3.0, 4.0, 4.0, 2.0, 1.0, 2.0, 1.0, 1.0, 0.0, 2.0, 2.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.1246337890625, -0.1205902099609375, -0.116546630859375, -0.1125030517578125, -0.10845947265625, -0.1044158935546875, -0.100372314453125, -0.0963287353515625, -0.09228515625, -0.0882415771484375, -0.084197998046875, -0.0801544189453125, -0.07611083984375, -0.0720672607421875, -0.068023681640625, -0.0639801025390625, -0.0599365234375, -0.0558929443359375, -0.051849365234375, -0.0478057861328125, -0.04376220703125, -0.0397186279296875, -0.035675048828125, -0.0316314697265625, -0.027587890625, -0.0235443115234375, -0.019500732421875, -0.0154571533203125, -0.01141357421875, -0.0073699951171875, -0.003326416015625, 0.0007171630859375, 0.0047607421875, 0.0088043212890625, 0.012847900390625, 0.0168914794921875, 0.02093505859375, 0.0249786376953125, 0.029022216796875, 0.0330657958984375, 0.037109375, 0.0411529541015625, 0.045196533203125, 0.0492401123046875, 0.05328369140625, 0.0573272705078125, 0.061370849609375, 0.0654144287109375, 0.0694580078125, 0.0735015869140625, 0.077545166015625, 0.0815887451171875, 0.08563232421875, 0.0896759033203125, 0.093719482421875, 0.0977630615234375, 0.101806640625, 0.1058502197265625, 0.109893798828125, 0.1139373779296875, 0.11798095703125, 0.1220245361328125, 0.126068115234375, 0.1301116943359375, 0.1341552734375]}, "gradients/encoder.encoder.layers.15.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 2.0, 7.0, 6.0, 7.0, 7.0, 14.0, 7.0, 13.0, 15.0, 22.0, 22.0, 26.0, 30.0, 26.0, 34.0, 33.0, 39.0, 35.0, 52.0, 46.0, 37.0, 44.0, 42.0, 41.0, 53.0, 44.0, 29.0, 31.0, 27.0, 22.0, 28.0, 22.0, 39.0, 22.0, 15.0, 15.0, 10.0, 13.0, 4.0, 7.0, 5.0, 3.0, 6.0, 4.0, 1.0, 1.0, 0.0, 0.0, 4.0], "bins": [-0.1319580078125, -0.12833118438720703, -0.12470436096191406, -0.1210775375366211, -0.11745071411132812, -0.11382389068603516, -0.11019706726074219, -0.10657024383544922, -0.10294342041015625, -0.09931659698486328, -0.09568977355957031, -0.09206295013427734, -0.08843612670898438, -0.0848093032836914, -0.08118247985839844, -0.07755565643310547, -0.0739288330078125, -0.07030200958251953, -0.06667518615722656, -0.0630483627319336, -0.059421539306640625, -0.055794715881347656, -0.05216789245605469, -0.04854106903076172, -0.04491424560546875, -0.04128742218017578, -0.03766059875488281, -0.034033775329589844, -0.030406951904296875, -0.026780128479003906, -0.023153305053710938, -0.01952648162841797, -0.015899658203125, -0.012272834777832031, -0.008646011352539062, -0.005019187927246094, -0.001392364501953125, 0.0022344589233398438, 0.0058612823486328125, 0.009488105773925781, 0.01311492919921875, 0.01674175262451172, 0.020368576049804688, 0.023995399475097656, 0.027622222900390625, 0.031249046325683594, 0.03487586975097656, 0.03850269317626953, 0.0421295166015625, 0.04575634002685547, 0.04938316345214844, 0.053009986877441406, 0.056636810302734375, 0.060263633728027344, 0.06389045715332031, 0.06751728057861328, 0.07114410400390625, 0.07477092742919922, 0.07839775085449219, 0.08202457427978516, 0.08565139770507812, 0.0892782211303711, 0.09290504455566406, 0.09653186798095703, 0.10015869140625]}, "gradients/encoder.encoder.layers.15.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 2.0, 4.0, 1.0, 7.0, 10.0, 14.0, 22.0, 31.0, 74.0, 193.0, 571.0, 2425.0, 18064.0, 798628.0, 219036.0, 7552.0, 1354.0, 314.0, 138.0, 52.0, 19.0, 20.0, 11.0, 3.0, 6.0, 3.0, 2.0, 5.0, 2.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.06561279296875, -0.06285762786865234, -0.06010246276855469, -0.05734729766845703, -0.054592132568359375, -0.05183696746826172, -0.04908180236816406, -0.046326637268066406, -0.04357147216796875, -0.040816307067871094, -0.03806114196777344, -0.03530597686767578, -0.032550811767578125, -0.02979564666748047, -0.027040481567382812, -0.024285316467285156, -0.0215301513671875, -0.018774986267089844, -0.016019821166992188, -0.013264656066894531, -0.010509490966796875, -0.007754325866699219, -0.0049991607666015625, -0.0022439956665039062, 0.00051116943359375, 0.0032663345336914062, 0.0060214996337890625, 0.008776664733886719, 0.011531829833984375, 0.014286994934082031, 0.017042160034179688, 0.019797325134277344, 0.022552490234375, 0.025307655334472656, 0.028062820434570312, 0.03081798553466797, 0.033573150634765625, 0.03632831573486328, 0.03908348083496094, 0.041838645935058594, 0.04459381103515625, 0.047348976135253906, 0.05010414123535156, 0.05285930633544922, 0.055614471435546875, 0.05836963653564453, 0.06112480163574219, 0.06387996673583984, 0.0666351318359375, 0.06939029693603516, 0.07214546203613281, 0.07490062713623047, 0.07765579223632812, 0.08041095733642578, 0.08316612243652344, 0.0859212875366211, 0.08867645263671875, 0.0914316177368164, 0.09418678283691406, 0.09694194793701172, 0.09969711303710938, 0.10245227813720703, 0.10520744323730469, 0.10796260833740234, 0.1107177734375]}, "gradients/encoder.encoder.layers.15.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 3.0, 6.0, 8.0, 4.0, 10.0, 15.0, 16.0, 27.0, 52.0, 49.0, 64.0, 81.0, 88.0, 111.0, 98.0, 75.0, 73.0, 67.0, 43.0, 38.0, 32.0, 15.0, 16.0, 10.0, 3.0, 5.0, 3.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0], "bins": [-1.5556812286376953e-05, -1.520942896604538e-05, -1.4862045645713806e-05, -1.4514662325382233e-05, -1.416727900505066e-05, -1.3819895684719086e-05, -1.3472512364387512e-05, -1.3125129044055939e-05, -1.2777745723724365e-05, -1.2430362403392792e-05, -1.2082979083061218e-05, -1.1735595762729645e-05, -1.1388212442398071e-05, -1.1040829122066498e-05, -1.0693445801734924e-05, -1.034606248140335e-05, -9.998679161071777e-06, -9.651295840740204e-06, -9.30391252040863e-06, -8.956529200077057e-06, -8.609145879745483e-06, -8.26176255941391e-06, -7.914379239082336e-06, -7.566995918750763e-06, -7.2196125984191895e-06, -6.872229278087616e-06, -6.5248459577560425e-06, -6.177462637424469e-06, -5.8300793170928955e-06, -5.482695996761322e-06, -5.1353126764297485e-06, -4.787929356098175e-06, -4.4405460357666016e-06, -4.093162715435028e-06, -3.7457793951034546e-06, -3.398396074771881e-06, -3.0510127544403076e-06, -2.703629434108734e-06, -2.3562461137771606e-06, -2.008862793445587e-06, -1.6614794731140137e-06, -1.3140961527824402e-06, -9.667128324508667e-07, -6.193295121192932e-07, -2.7194619178771973e-07, 7.543712854385376e-08, 4.2282044887542725e-07, 7.702037692070007e-07, 1.1175870895385742e-06, 1.4649704098701477e-06, 1.8123537302017212e-06, 2.1597370505332947e-06, 2.507120370864868e-06, 2.8545036911964417e-06, 3.201887011528015e-06, 3.5492703318595886e-06, 3.896653652191162e-06, 4.244036972522736e-06, 4.591420292854309e-06, 4.9388036131858826e-06, 5.286186933517456e-06, 5.6335702538490295e-06, 5.980953574180603e-06, 6.3283368945121765e-06, 6.67572021484375e-06]}, "gradients/encoder.encoder.layers.15.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 5.0, 4.0, 5.0, 6.0, 22.0, 51.0, 116.0, 244.0, 730.0, 2698.0, 17264.0, 791486.0, 224950.0, 8450.0, 1699.0, 494.0, 180.0, 81.0, 40.0, 19.0, 4.0, 8.0, 5.0, 2.0, 2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1192626953125, -0.1159353256225586, -0.11260795593261719, -0.10928058624267578, -0.10595321655273438, -0.10262584686279297, -0.09929847717285156, -0.09597110748291016, -0.09264373779296875, -0.08931636810302734, -0.08598899841308594, -0.08266162872314453, -0.07933425903320312, -0.07600688934326172, -0.07267951965332031, -0.0693521499633789, -0.0660247802734375, -0.0626974105834961, -0.05937004089355469, -0.05604267120361328, -0.052715301513671875, -0.04938793182373047, -0.04606056213378906, -0.042733192443847656, -0.03940582275390625, -0.036078453063964844, -0.03275108337402344, -0.02942371368408203, -0.026096343994140625, -0.02276897430419922, -0.019441604614257812, -0.016114234924316406, -0.012786865234375, -0.009459495544433594, -0.0061321258544921875, -0.0028047561645507812, 0.000522613525390625, 0.0038499832153320312, 0.0071773529052734375, 0.010504722595214844, 0.01383209228515625, 0.017159461975097656, 0.020486831665039062, 0.02381420135498047, 0.027141571044921875, 0.03046894073486328, 0.03379631042480469, 0.037123680114746094, 0.0404510498046875, 0.043778419494628906, 0.04710578918457031, 0.05043315887451172, 0.053760528564453125, 0.05708789825439453, 0.06041526794433594, 0.06374263763427734, 0.06707000732421875, 0.07039737701416016, 0.07372474670410156, 0.07705211639404297, 0.08037948608398438, 0.08370685577392578, 0.08703422546386719, 0.0903615951538086, 0.09368896484375]}, "gradients/encoder.encoder.layers.15.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 4.0, 3.0, 0.0, 1.0, 4.0, 6.0, 11.0, 17.0, 18.0, 29.0, 39.0, 72.0, 100.0, 143.0, 155.0, 127.0, 93.0, 60.0, 55.0, 22.0, 22.0, 17.0, 7.0, 2.0, 0.0, 4.0, 0.0, 5.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.057281494140625, -0.055567264556884766, -0.05385303497314453, -0.0521388053894043, -0.05042457580566406, -0.04871034622192383, -0.046996116638183594, -0.04528188705444336, -0.043567657470703125, -0.04185342788696289, -0.040139198303222656, -0.03842496871948242, -0.03671073913574219, -0.03499650955200195, -0.03328227996826172, -0.031568050384521484, -0.02985382080078125, -0.028139591217041016, -0.02642536163330078, -0.024711132049560547, -0.022996902465820312, -0.021282672882080078, -0.019568443298339844, -0.01785421371459961, -0.016139984130859375, -0.01442575454711914, -0.012711524963378906, -0.010997295379638672, -0.009283065795898438, -0.007568836212158203, -0.005854606628417969, -0.004140377044677734, -0.0024261474609375, -0.0007119178771972656, 0.0010023117065429688, 0.002716541290283203, 0.0044307708740234375, 0.006145000457763672, 0.007859230041503906, 0.00957345962524414, 0.011287689208984375, 0.01300191879272461, 0.014716148376464844, 0.016430377960205078, 0.018144607543945312, 0.019858837127685547, 0.02157306671142578, 0.023287296295166016, 0.02500152587890625, 0.026715755462646484, 0.02842998504638672, 0.030144214630126953, 0.03185844421386719, 0.03357267379760742, 0.035286903381347656, 0.03700113296508789, 0.038715362548828125, 0.04042959213256836, 0.042143821716308594, 0.04385805130004883, 0.04557228088378906, 0.0472865104675293, 0.04900074005126953, 0.050714969635009766, 0.05242919921875]}, "gradients/encoder.encoder.layers.15.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 3.0, 3.0, 14.0, 43.0, 178.0, 390.0, 257.0, 89.0, 27.0, 7.0, 3.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.8725656270980835, -1.8241504430770874, -1.7757352590560913, -1.7273200750350952, -1.6789050102233887, -1.6304898262023926, -1.5820746421813965, -1.5336594581604004, -1.4852442741394043, -1.4368290901184082, -1.388413906097412, -1.339998722076416, -1.29158353805542, -1.2431684732437134, -1.1947532892227173, -1.1463381052017212, -1.097922921180725, -1.049507737159729, -1.001092553138733, -0.9526774287223816, -0.9042622447013855, -0.8558470606803894, -0.8074319362640381, -0.759016752243042, -0.7106015682220459, -0.6621863842010498, -0.6137712001800537, -0.5653560757637024, -0.5169408917427063, -0.4685257077217102, -0.4201105535030365, -0.3716953992843628, -0.32328009605407715, -0.27486491203308105, -0.22644975781440735, -0.17803458869457245, -0.12961941957473755, -0.08120425045490265, -0.03278908133506775, 0.015626072883605957, 0.06404125690460205, 0.11245642602443695, 0.16087159514427185, 0.20928676426410675, 0.25770193338394165, 0.30611711740493774, 0.35453227162361145, 0.40294742584228516, 0.45136260986328125, 0.49977779388427734, 0.5481929779052734, 0.5966081023216248, 0.6450232863426208, 0.6934384703636169, 0.7418535947799683, 0.7902687788009644, 0.8386839628219604, 0.8870991468429565, 0.9355143308639526, 0.983929455280304, 1.0323445796966553, 1.0807597637176514, 1.1291749477386475, 1.1775901317596436, 1.2260053157806396]}, "gradients/encoder.encoder.layers.15.layer_norm.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 5.0, 0.0, 5.0, 2.0, 6.0, 6.0, 10.0, 16.0, 17.0, 18.0, 17.0, 24.0, 29.0, 36.0, 32.0, 37.0, 40.0, 44.0, 48.0, 37.0, 38.0, 44.0, 52.0, 44.0, 57.0, 37.0, 40.0, 46.0, 32.0, 38.0, 30.0, 18.0, 18.0, 12.0, 17.0, 16.0, 10.0, 10.0, 8.0, 5.0, 4.0, 4.0, 1.0, 1.0, 0.0, 5.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.44711625576019287, -0.4291492700576782, -0.4111822545528412, -0.39321526885032654, -0.3752482533454895, -0.35728126764297485, -0.3393142819404602, -0.32134726643562317, -0.30338025093078613, -0.2854132652282715, -0.26744624972343445, -0.2494792640209198, -0.23151224851608276, -0.21354526281356812, -0.19557826220989227, -0.17761126160621643, -0.15964427590370178, -0.14167727530002594, -0.1237102746963501, -0.10574328154325485, -0.08777628093957901, -0.06980928033590317, -0.05184228718280792, -0.03387528657913208, -0.015908285975456238, 0.0020587127655744553, 0.02002571150660515, 0.03799270838499069, 0.055959708988666534, 0.07392670959234238, 0.09189370274543762, 0.10986070334911346, 0.1278277039527893, 0.14579470455646515, 0.163761705160141, 0.18172869086265564, 0.19969570636749268, 0.21766269207000732, 0.23562969267368317, 0.253596693277359, 0.27156370878219604, 0.2895306944847107, 0.30749770998954773, 0.3254646956920624, 0.3434317111968994, 0.36139869689941406, 0.3793656826019287, 0.39733269810676575, 0.4152996838092804, 0.43326666951179504, 0.4512336850166321, 0.46920067071914673, 0.48716768622398376, 0.5051347017288208, 0.5231016874313354, 0.5410686731338501, 0.5590356588363647, 0.5770026445388794, 0.594969630241394, 0.6129366755485535, 0.6309036612510681, 0.6488706469535828, 0.6668376326560974, 0.6848046779632568, 0.7027716636657715]}, "gradients/encoder.encoder.layers.14.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 1.0, 0.0, 1.0, 0.0, 4.0, 1.0, 7.0, 7.0, 3.0, 9.0, 7.0, 11.0, 13.0, 20.0, 24.0, 39.0, 35.0, 96.0, 195.0, 426.0, 1109.0, 3609.0, 27341.0, 4129954.0, 25741.0, 3638.0, 1084.0, 449.0, 181.0, 110.0, 53.0, 34.0, 36.0, 19.0, 12.0, 9.0, 3.0, 4.0, 1.0, 4.0, 1.0, 2.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.4345703125, -0.42409515380859375, -0.4136199951171875, -0.40314483642578125, -0.392669677734375, -0.38219451904296875, -0.3717193603515625, -0.36124420166015625, -0.35076904296875, -0.34029388427734375, -0.3298187255859375, -0.31934356689453125, -0.308868408203125, -0.29839324951171875, -0.2879180908203125, -0.27744293212890625, -0.2669677734375, -0.25649261474609375, -0.2460174560546875, -0.23554229736328125, -0.225067138671875, -0.21459197998046875, -0.2041168212890625, -0.19364166259765625, -0.18316650390625, -0.17269134521484375, -0.1622161865234375, -0.15174102783203125, -0.141265869140625, -0.13079071044921875, -0.1203155517578125, -0.10984039306640625, -0.099365234375, -0.08889007568359375, -0.0784149169921875, -0.06793975830078125, -0.057464599609375, -0.04698944091796875, -0.0365142822265625, -0.02603912353515625, -0.01556396484375, -0.00508880615234375, 0.0053863525390625, 0.01586151123046875, 0.026336669921875, 0.03681182861328125, 0.0472869873046875, 0.05776214599609375, 0.0682373046875, 0.07871246337890625, 0.0891876220703125, 0.09966278076171875, 0.110137939453125, 0.12061309814453125, 0.1310882568359375, 0.14156341552734375, 0.15203857421875, 0.16251373291015625, 0.1729888916015625, 0.18346405029296875, 0.193939208984375, 0.20441436767578125, 0.2148895263671875, 0.22536468505859375, 0.23583984375]}, "gradients/encoder.encoder.layers.14.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 3.0, 7.0, 10.0, 17.0, 41.0, 44.0, 68.0, 92.0, 125.0, 113.0, 122.0, 122.0, 87.0, 61.0, 39.0, 29.0, 10.0, 11.0, 3.0, 3.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0987548828125, -0.0963296890258789, -0.09390449523925781, -0.09147930145263672, -0.08905410766601562, -0.08662891387939453, -0.08420372009277344, -0.08177852630615234, -0.07935333251953125, -0.07692813873291016, -0.07450294494628906, -0.07207775115966797, -0.06965255737304688, -0.06722736358642578, -0.06480216979980469, -0.062376976013183594, -0.0599517822265625, -0.057526588439941406, -0.05510139465332031, -0.05267620086669922, -0.050251007080078125, -0.04782581329345703, -0.04540061950683594, -0.042975425720214844, -0.04055023193359375, -0.038125038146972656, -0.03569984436035156, -0.03327465057373047, -0.030849456787109375, -0.02842426300048828, -0.025999069213867188, -0.023573875427246094, -0.021148681640625, -0.018723487854003906, -0.016298294067382812, -0.013873100280761719, -0.011447906494140625, -0.009022712707519531, -0.0065975189208984375, -0.004172325134277344, -0.00174713134765625, 0.0006780624389648438, 0.0031032562255859375, 0.005528450012207031, 0.007953643798828125, 0.010378837585449219, 0.012804031372070312, 0.015229225158691406, 0.0176544189453125, 0.020079612731933594, 0.022504806518554688, 0.02493000030517578, 0.027355194091796875, 0.02978038787841797, 0.03220558166503906, 0.034630775451660156, 0.03705596923828125, 0.039481163024902344, 0.04190635681152344, 0.04433155059814453, 0.046756744384765625, 0.04918193817138672, 0.05160713195800781, 0.054032325744628906, 0.05645751953125]}, "gradients/encoder.encoder.layers.14.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 2.0, 2.0, 2.0, 3.0, 5.0, 5.0, 9.0, 18.0, 24.0, 33.0, 49.0, 70.0, 88.0, 139.0, 252.0, 564.0, 4460.0, 4161607.0, 24992.0, 1031.0, 356.0, 179.0, 113.0, 99.0, 60.0, 44.0, 38.0, 19.0, 11.0, 10.0, 4.0, 6.0, 4.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.53125, -0.502838134765625, -0.47442626953125, -0.446014404296875, -0.4176025390625, -0.389190673828125, -0.36077880859375, -0.332366943359375, -0.303955078125, -0.275543212890625, -0.24713134765625, -0.218719482421875, -0.1903076171875, -0.161895751953125, -0.13348388671875, -0.105072021484375, -0.07666015625, -0.048248291015625, -0.01983642578125, 0.008575439453125, 0.0369873046875, 0.065399169921875, 0.09381103515625, 0.122222900390625, 0.150634765625, 0.179046630859375, 0.20745849609375, 0.235870361328125, 0.2642822265625, 0.292694091796875, 0.32110595703125, 0.349517822265625, 0.3779296875, 0.406341552734375, 0.43475341796875, 0.463165283203125, 0.4915771484375, 0.519989013671875, 0.54840087890625, 0.576812744140625, 0.605224609375, 0.633636474609375, 0.66204833984375, 0.690460205078125, 0.7188720703125, 0.747283935546875, 0.77569580078125, 0.804107666015625, 0.83251953125, 0.860931396484375, 0.88934326171875, 0.917755126953125, 0.9461669921875, 0.974578857421875, 1.00299072265625, 1.031402587890625, 1.059814453125, 1.088226318359375, 1.11663818359375, 1.145050048828125, 1.1734619140625, 1.201873779296875, 1.23028564453125, 1.258697509765625, 1.287109375]}, "gradients/encoder.encoder.layers.14.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 5.0, 11.0, 33.0, 97.0, 3053.0, 777.0, 68.0, 20.0, 10.0, 6.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.10101318359375, -0.09421253204345703, -0.08741188049316406, -0.0806112289428711, -0.07381057739257812, -0.06700992584228516, -0.06020927429199219, -0.05340862274169922, -0.04660797119140625, -0.03980731964111328, -0.03300666809082031, -0.026206016540527344, -0.019405364990234375, -0.012604713439941406, -0.0058040618896484375, 0.0009965896606445312, 0.0077972412109375, 0.014597892761230469, 0.021398544311523438, 0.028199195861816406, 0.034999847412109375, 0.041800498962402344, 0.04860115051269531, 0.05540180206298828, 0.06220245361328125, 0.06900310516357422, 0.07580375671386719, 0.08260440826416016, 0.08940505981445312, 0.0962057113647461, 0.10300636291503906, 0.10980701446533203, 0.116607666015625, 0.12340831756591797, 0.13020896911621094, 0.1370096206665039, 0.14381027221679688, 0.15061092376708984, 0.1574115753173828, 0.16421222686767578, 0.17101287841796875, 0.17781352996826172, 0.1846141815185547, 0.19141483306884766, 0.19821548461914062, 0.2050161361694336, 0.21181678771972656, 0.21861743927001953, 0.2254180908203125, 0.23221874237060547, 0.23901939392089844, 0.2458200454711914, 0.2526206970214844, 0.25942134857177734, 0.2662220001220703, 0.2730226516723633, 0.27982330322265625, 0.2866239547729492, 0.2934246063232422, 0.30022525787353516, 0.3070259094238281, 0.3138265609741211, 0.32062721252441406, 0.32742786407470703, 0.334228515625]}, "gradients/encoder.encoder.layers.14.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 2.0, 9.0, 18.0, 70.0, 290.0, 439.0, 117.0, 29.0, 9.0, 13.0, 5.0, 3.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7024028897285461, -0.6638566255569458, -0.6253104209899902, -0.5867641568183899, -0.5482178926467896, -0.509671688079834, -0.47112542390823364, -0.4325791597366333, -0.39403292536735535, -0.3554866909980774, -0.31694042682647705, -0.2783941924571991, -0.23984794318675995, -0.2013016939163208, -0.16275545954704285, -0.1242091953754425, -0.08566296100616455, -0.0471167154610157, -0.008570469915866852, 0.0299757719039917, 0.06852202117443085, 0.10706827044487, 0.14561450481414795, 0.1841607689857483, 0.22270700335502625, 0.2612532377243042, 0.29979950189590454, 0.3383457362651825, 0.37689197063446045, 0.4154382348060608, 0.45398446917533875, 0.4925307333469391, 0.5310769081115723, 0.5696231722831726, 0.6081693768501282, 0.6467156410217285, 0.6852619051933289, 0.7238081693649292, 0.7623543739318848, 0.8009006381034851, 0.8394469022750854, 0.8779931664466858, 0.9165393710136414, 0.9550856351852417, 0.993631899356842, 1.0321781635284424, 1.070724368095398, 1.1092705726623535, 1.1478168964385986, 1.1863631010055542, 1.2249094247817993, 1.2634556293487549, 1.3020018339157104, 1.3405481576919556, 1.3790943622589111, 1.4176406860351562, 1.4561867713928223, 1.4947329759597778, 1.533279299736023, 1.5718255043029785, 1.610371708869934, 1.6489180326461792, 1.6874642372131348, 1.7260105609893799, 1.7645567655563354]}, "gradients/encoder.encoder.layers.14.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 3.0, 4.0, 3.0, 3.0, 3.0, 15.0, 33.0, 56.0, 65.0, 108.0, 123.0, 150.0, 127.0, 107.0, 86.0, 45.0, 36.0, 13.0, 13.0, 3.0, 3.0, 5.0, 4.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 5.0], "bins": [-0.9069772362709045, -0.8871030807495117, -0.8672289252281189, -0.8473547697067261, -0.8274805545806885, -0.8076063990592957, -0.7877322435379028, -0.76785808801651, -0.7479839324951172, -0.7281097769737244, -0.7082356214523315, -0.6883614659309387, -0.6684873104095459, -0.6486130952835083, -0.6287389397621155, -0.6088647842407227, -0.5889906287193298, -0.569116473197937, -0.5492423176765442, -0.5293681621551514, -0.5094939470291138, -0.48961982131004333, -0.4697456359863281, -0.4498714804649353, -0.4299973249435425, -0.41012316942214966, -0.39024901390075684, -0.3703748285770416, -0.3505006730556488, -0.330626517534256, -0.31075233221054077, -0.29087817668914795, -0.2710040211677551, -0.2511298656463623, -0.2312556952238083, -0.21138152480125427, -0.19150736927986145, -0.17163321375846863, -0.1517590433359146, -0.1318848729133606, -0.11201071739196777, -0.09213655441999435, -0.07226239144802094, -0.052388228476047516, -0.0325140655040741, -0.012639902532100677, 0.007234260439872742, 0.027108430862426758, 0.04698258638381958, 0.066856749355793, 0.08673091232776642, 0.10660507529973984, 0.12647923827171326, 0.14635339379310608, 0.1662275642156601, 0.1861017346382141, 0.20597589015960693, 0.22585004568099976, 0.24572421610355377, 0.2655983865261078, 0.2854725420475006, 0.30534669756889343, 0.32522088289260864, 0.34509503841400146, 0.3649691939353943]}, "gradients/encoder.encoder.layers.14.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 3.0, 3.0, 3.0, 7.0, 5.0, 5.0, 5.0, 9.0, 21.0, 20.0, 39.0, 45.0, 54.0, 72.0, 122.0, 191.0, 265.0, 445.0, 781.0, 1480.0, 3692.0, 12103.0, 62781.0, 425486.0, 452147.0, 68457.0, 12816.0, 3739.0, 1587.0, 837.0, 462.0, 286.0, 182.0, 114.0, 97.0, 47.0, 35.0, 41.0, 18.0, 15.0, 13.0, 16.0, 4.0, 6.0, 1.0, 4.0, 1.0, 5.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.133544921875, -0.1292400360107422, -0.12493515014648438, -0.12063026428222656, -0.11632537841796875, -0.11202049255371094, -0.10771560668945312, -0.10341072082519531, -0.0991058349609375, -0.09480094909667969, -0.09049606323242188, -0.08619117736816406, -0.08188629150390625, -0.07758140563964844, -0.07327651977539062, -0.06897163391113281, -0.064666748046875, -0.06036186218261719, -0.056056976318359375, -0.05175209045410156, -0.04744720458984375, -0.04314231872558594, -0.038837432861328125, -0.03453254699707031, -0.0302276611328125, -0.025922775268554688, -0.021617889404296875, -0.017313003540039062, -0.01300811767578125, -0.008703231811523438, -0.004398345947265625, -9.34600830078125e-05, 0.00421142578125, 0.008516311645507812, 0.012821197509765625, 0.017126083374023438, 0.02143096923828125, 0.025735855102539062, 0.030040740966796875, 0.03434562683105469, 0.0386505126953125, 0.04295539855957031, 0.047260284423828125, 0.05156517028808594, 0.05587005615234375, 0.06017494201660156, 0.06447982788085938, 0.06878471374511719, 0.073089599609375, 0.07739448547363281, 0.08169937133789062, 0.08600425720214844, 0.09030914306640625, 0.09461402893066406, 0.09891891479492188, 0.10322380065917969, 0.1075286865234375, 0.11183357238769531, 0.11613845825195312, 0.12044334411621094, 0.12474822998046875, 0.12905311584472656, 0.13335800170898438, 0.1376628875732422, 0.1419677734375]}, "gradients/encoder.encoder.layers.14.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 4.0, 3.0, 1.0, 5.0, 8.0, 17.0, 25.0, 47.0, 58.0, 94.0, 97.0, 105.0, 104.0, 120.0, 93.0, 76.0, 61.0, 33.0, 24.0, 13.0, 7.0, 7.0, 2.0, 2.0, 3.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0], "bins": [-0.0997314453125, -0.09737300872802734, -0.09501457214355469, -0.09265613555908203, -0.09029769897460938, -0.08793926239013672, -0.08558082580566406, -0.0832223892211914, -0.08086395263671875, -0.0785055160522461, -0.07614707946777344, -0.07378864288330078, -0.07143020629882812, -0.06907176971435547, -0.06671333312988281, -0.06435489654541016, -0.0619964599609375, -0.059638023376464844, -0.05727958679199219, -0.05492115020751953, -0.052562713623046875, -0.05020427703857422, -0.04784584045410156, -0.045487403869628906, -0.04312896728515625, -0.040770530700683594, -0.03841209411621094, -0.03605365753173828, -0.033695220947265625, -0.03133678436279297, -0.028978347778320312, -0.026619911193847656, -0.024261474609375, -0.021903038024902344, -0.019544601440429688, -0.01718616485595703, -0.014827728271484375, -0.012469291687011719, -0.010110855102539062, -0.007752418518066406, -0.00539398193359375, -0.0030355453491210938, -0.0006771087646484375, 0.0016813278198242188, 0.004039764404296875, 0.006398200988769531, 0.008756637573242188, 0.011115074157714844, 0.0134735107421875, 0.015831947326660156, 0.018190383911132812, 0.02054882049560547, 0.022907257080078125, 0.02526569366455078, 0.027624130249023438, 0.029982566833496094, 0.03234100341796875, 0.034699440002441406, 0.03705787658691406, 0.03941631317138672, 0.041774749755859375, 0.04413318634033203, 0.04649162292480469, 0.048850059509277344, 0.05120849609375]}, "gradients/encoder.encoder.layers.14.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 0.0, 1.0, 0.0, 3.0, 4.0, 1.0, 4.0, 3.0, 7.0, 7.0, 8.0, 9.0, 19.0, 14.0, 31.0, 33.0, 46.0, 70.0, 146.0, 299.0, 690.0, 2181.0, 9294.0, 59891.0, 468716.0, 440144.0, 54854.0, 8740.0, 1973.0, 659.0, 294.0, 145.0, 80.0, 54.0, 38.0, 22.0, 17.0, 8.0, 18.0, 11.0, 8.0, 5.0, 5.0, 3.0, 1.0, 3.0, 2.0, 2.0, 0.0, 3.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0], "bins": [-0.1304931640625, -0.12653636932373047, -0.12257957458496094, -0.1186227798461914, -0.11466598510742188, -0.11070919036865234, -0.10675239562988281, -0.10279560089111328, -0.09883880615234375, -0.09488201141357422, -0.09092521667480469, -0.08696842193603516, -0.08301162719726562, -0.0790548324584961, -0.07509803771972656, -0.07114124298095703, -0.0671844482421875, -0.06322765350341797, -0.05927085876464844, -0.055314064025878906, -0.051357269287109375, -0.047400474548339844, -0.04344367980957031, -0.03948688507080078, -0.03553009033203125, -0.03157329559326172, -0.027616500854492188, -0.023659706115722656, -0.019702911376953125, -0.015746116638183594, -0.011789321899414062, -0.007832527160644531, -0.003875732421875, 8.106231689453125e-05, 0.0040378570556640625, 0.007994651794433594, 0.011951446533203125, 0.015908241271972656, 0.019865036010742188, 0.02382183074951172, 0.02777862548828125, 0.03173542022705078, 0.03569221496582031, 0.039649009704589844, 0.043605804443359375, 0.047562599182128906, 0.05151939392089844, 0.05547618865966797, 0.0594329833984375, 0.06338977813720703, 0.06734657287597656, 0.0713033676147461, 0.07526016235351562, 0.07921695709228516, 0.08317375183105469, 0.08713054656982422, 0.09108734130859375, 0.09504413604736328, 0.09900093078613281, 0.10295772552490234, 0.10691452026367188, 0.1108713150024414, 0.11482810974121094, 0.11878490447998047, 0.12274169921875]}, "gradients/encoder.encoder.layers.14.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 0.0, 3.0, 1.0, 3.0, 3.0, 7.0, 5.0, 6.0, 5.0, 4.0, 8.0, 7.0, 16.0, 17.0, 17.0, 17.0, 27.0, 22.0, 29.0, 24.0, 34.0, 35.0, 29.0, 37.0, 45.0, 42.0, 43.0, 31.0, 42.0, 35.0, 49.0, 39.0, 35.0, 25.0, 24.0, 28.0, 30.0, 18.0, 16.0, 28.0, 14.0, 15.0, 12.0, 18.0, 17.0, 12.0, 9.0, 6.0, 6.0, 6.0, 5.0, 4.0, 0.0, 1.0, 4.0, 0.0, 2.0, 1.0], "bins": [-0.10675048828125, -0.10358428955078125, -0.1004180908203125, -0.09725189208984375, -0.094085693359375, -0.09091949462890625, -0.0877532958984375, -0.08458709716796875, -0.0814208984375, -0.07825469970703125, -0.0750885009765625, -0.07192230224609375, -0.068756103515625, -0.06558990478515625, -0.0624237060546875, -0.05925750732421875, -0.05609130859375, -0.05292510986328125, -0.0497589111328125, -0.04659271240234375, -0.043426513671875, -0.04026031494140625, -0.0370941162109375, -0.03392791748046875, -0.03076171875, -0.02759552001953125, -0.0244293212890625, -0.02126312255859375, -0.018096923828125, -0.01493072509765625, -0.0117645263671875, -0.00859832763671875, -0.00543212890625, -0.00226593017578125, 0.0009002685546875, 0.00406646728515625, 0.007232666015625, 0.01039886474609375, 0.0135650634765625, 0.01673126220703125, 0.0198974609375, 0.02306365966796875, 0.0262298583984375, 0.02939605712890625, 0.032562255859375, 0.03572845458984375, 0.0388946533203125, 0.04206085205078125, 0.04522705078125, 0.04839324951171875, 0.0515594482421875, 0.05472564697265625, 0.057891845703125, 0.06105804443359375, 0.0642242431640625, 0.06739044189453125, 0.070556640625, 0.07372283935546875, 0.0768890380859375, 0.08005523681640625, 0.083221435546875, 0.08638763427734375, 0.0895538330078125, 0.09272003173828125, 0.09588623046875]}, "gradients/encoder.encoder.layers.14.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 5.0, 2.0, 9.0, 4.0, 7.0, 11.0, 19.0, 27.0, 53.0, 76.0, 95.0, 223.0, 418.0, 935.0, 2735.0, 11028.0, 93259.0, 730031.0, 186029.0, 17532.0, 3698.0, 1237.0, 524.0, 245.0, 138.0, 73.0, 49.0, 34.0, 21.0, 17.0, 11.0, 8.0, 2.0, 3.0, 2.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.049835205078125, -0.04803133010864258, -0.046227455139160156, -0.044423580169677734, -0.04261970520019531, -0.04081583023071289, -0.03901195526123047, -0.03720808029174805, -0.035404205322265625, -0.0336003303527832, -0.03179645538330078, -0.02999258041381836, -0.028188705444335938, -0.026384830474853516, -0.024580955505371094, -0.022777080535888672, -0.02097320556640625, -0.019169330596923828, -0.017365455627441406, -0.015561580657958984, -0.013757705688476562, -0.01195383071899414, -0.010149955749511719, -0.008346080780029297, -0.006542205810546875, -0.004738330841064453, -0.0029344558715820312, -0.0011305809020996094, 0.0006732940673828125, 0.0024771690368652344, 0.004281044006347656, 0.006084918975830078, 0.0078887939453125, 0.009692668914794922, 0.011496543884277344, 0.013300418853759766, 0.015104293823242188, 0.01690816879272461, 0.01871204376220703, 0.020515918731689453, 0.022319793701171875, 0.024123668670654297, 0.02592754364013672, 0.02773141860961914, 0.029535293579101562, 0.031339168548583984, 0.033143043518066406, 0.03494691848754883, 0.03675079345703125, 0.03855466842651367, 0.040358543395996094, 0.042162418365478516, 0.04396629333496094, 0.04577016830444336, 0.04757404327392578, 0.0493779182434082, 0.051181793212890625, 0.05298566818237305, 0.05478954315185547, 0.05659341812133789, 0.05839729309082031, 0.060201168060302734, 0.062005043029785156, 0.06380891799926758, 0.06561279296875]}, "gradients/encoder.encoder.layers.14.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 3.0, 2.0, 2.0, 5.0, 4.0, 7.0, 11.0, 8.0, 6.0, 5.0, 27.0, 18.0, 15.0, 26.0, 25.0, 30.0, 59.0, 40.0, 53.0, 45.0, 52.0, 68.0, 45.0, 49.0, 36.0, 51.0, 48.0, 38.0, 34.0, 34.0, 24.0, 26.0, 26.0, 19.0, 17.0, 10.0, 8.0, 3.0, 6.0, 10.0, 5.0, 5.0, 3.0, 2.0, 1.0, 3.0, 1.0, 2.0], "bins": [-7.212162017822266e-06, -7.022172212600708e-06, -6.83218240737915e-06, -6.642192602157593e-06, -6.452202796936035e-06, -6.2622129917144775e-06, -6.07222318649292e-06, -5.882233381271362e-06, -5.692243576049805e-06, -5.502253770828247e-06, -5.3122639656066895e-06, -5.122274160385132e-06, -4.932284355163574e-06, -4.742294549942017e-06, -4.552304744720459e-06, -4.362314939498901e-06, -4.172325134277344e-06, -3.982335329055786e-06, -3.7923455238342285e-06, -3.602355718612671e-06, -3.4123659133911133e-06, -3.2223761081695557e-06, -3.032386302947998e-06, -2.8423964977264404e-06, -2.652406692504883e-06, -2.462416887283325e-06, -2.2724270820617676e-06, -2.08243727684021e-06, -1.8924474716186523e-06, -1.7024576663970947e-06, -1.5124678611755371e-06, -1.3224780559539795e-06, -1.1324882507324219e-06, -9.424984455108643e-07, -7.525086402893066e-07, -5.62518835067749e-07, -3.725290298461914e-07, -1.825392246246338e-07, 7.450580596923828e-09, 1.9744038581848145e-07, 3.8743019104003906e-07, 5.774199962615967e-07, 7.674098014831543e-07, 9.57399606704712e-07, 1.1473894119262695e-06, 1.3373792171478271e-06, 1.5273690223693848e-06, 1.7173588275909424e-06, 1.9073486328125e-06, 2.0973384380340576e-06, 2.2873282432556152e-06, 2.477318048477173e-06, 2.6673078536987305e-06, 2.857297658920288e-06, 3.0472874641418457e-06, 3.2372772693634033e-06, 3.427267074584961e-06, 3.6172568798065186e-06, 3.807246685028076e-06, 3.997236490249634e-06, 4.187226295471191e-06, 4.377216100692749e-06, 4.567205905914307e-06, 4.757195711135864e-06, 4.947185516357422e-06]}, "gradients/encoder.encoder.layers.14.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 2.0, 2.0, 2.0, 2.0, 3.0, 2.0, 3.0, 7.0, 13.0, 14.0, 28.0, 42.0, 55.0, 121.0, 255.0, 437.0, 995.0, 2721.0, 11601.0, 96792.0, 755483.0, 158756.0, 15631.0, 3309.0, 1155.0, 537.0, 243.0, 131.0, 88.0, 43.0, 30.0, 25.0, 1.0, 9.0, 5.0, 5.0, 4.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.055267333984375, -0.05326414108276367, -0.051260948181152344, -0.049257755279541016, -0.04725456237792969, -0.04525136947631836, -0.04324817657470703, -0.0412449836730957, -0.039241790771484375, -0.03723859786987305, -0.03523540496826172, -0.03323221206665039, -0.031229019165039062, -0.029225826263427734, -0.027222633361816406, -0.025219440460205078, -0.02321624755859375, -0.021213054656982422, -0.019209861755371094, -0.017206668853759766, -0.015203475952148438, -0.01320028305053711, -0.011197090148925781, -0.009193897247314453, -0.007190704345703125, -0.005187511444091797, -0.0031843185424804688, -0.0011811256408691406, 0.0008220672607421875, 0.0028252601623535156, 0.004828453063964844, 0.006831645965576172, 0.0088348388671875, 0.010838031768798828, 0.012841224670410156, 0.014844417572021484, 0.016847610473632812, 0.01885080337524414, 0.02085399627685547, 0.022857189178466797, 0.024860382080078125, 0.026863574981689453, 0.02886676788330078, 0.03086996078491211, 0.03287315368652344, 0.034876346588134766, 0.036879539489746094, 0.03888273239135742, 0.04088592529296875, 0.04288911819458008, 0.044892311096191406, 0.046895503997802734, 0.04889869689941406, 0.05090188980102539, 0.05290508270263672, 0.05490827560424805, 0.056911468505859375, 0.0589146614074707, 0.06091785430908203, 0.06292104721069336, 0.06492424011230469, 0.06692743301391602, 0.06893062591552734, 0.07093381881713867, 0.07293701171875]}, "gradients/encoder.encoder.layers.14.attention.q_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 7.0, 2.0, 7.0, 6.0, 6.0, 12.0, 15.0, 20.0, 26.0, 41.0, 43.0, 71.0, 82.0, 78.0, 95.0, 108.0, 83.0, 68.0, 48.0, 39.0, 38.0, 33.0, 18.0, 21.0, 8.0, 9.0, 4.0, 7.0, 1.0, 2.0, 4.0, 0.0, 3.0, 0.0, 1.0, 3.0, 2.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.04656982421875, -0.04525136947631836, -0.04393291473388672, -0.04261445999145508, -0.04129600524902344, -0.0399775505065918, -0.038659095764160156, -0.037340641021728516, -0.036022186279296875, -0.034703731536865234, -0.033385276794433594, -0.03206682205200195, -0.030748367309570312, -0.029429912567138672, -0.02811145782470703, -0.02679300308227539, -0.02547454833984375, -0.02415609359741211, -0.02283763885498047, -0.021519184112548828, -0.020200729370117188, -0.018882274627685547, -0.017563819885253906, -0.016245365142822266, -0.014926910400390625, -0.013608455657958984, -0.012290000915527344, -0.010971546173095703, -0.009653091430664062, -0.008334636688232422, -0.007016181945800781, -0.005697727203369141, -0.0043792724609375, -0.0030608177185058594, -0.0017423629760742188, -0.0004239082336425781, 0.0008945465087890625, 0.002213001251220703, 0.0035314559936523438, 0.004849910736083984, 0.006168365478515625, 0.007486820220947266, 0.008805274963378906, 0.010123729705810547, 0.011442184448242188, 0.012760639190673828, 0.014079093933105469, 0.01539754867553711, 0.01671600341796875, 0.01803445816040039, 0.01935291290283203, 0.020671367645263672, 0.021989822387695312, 0.023308277130126953, 0.024626731872558594, 0.025945186614990234, 0.027263641357421875, 0.028582096099853516, 0.029900550842285156, 0.031219005584716797, 0.03253746032714844, 0.03385591506958008, 0.03517436981201172, 0.03649282455444336, 0.037811279296875]}, "gradients/encoder.encoder.layers.14.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 4.0, 15.0, 39.0, 154.0, 301.0, 336.0, 126.0, 25.0, 8.0, 5.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-1.8469196557998657, -1.7996481657028198, -1.752376675605774, -1.705105185508728, -1.6578335762023926, -1.6105620861053467, -1.5632905960083008, -1.5160191059112549, -1.468747615814209, -1.421476125717163, -1.3742046356201172, -1.3269331455230713, -1.2796616554260254, -1.23239004611969, -1.185118556022644, -1.1378470659255981, -1.0905755758285522, -1.0433040857315063, -0.9960325956344604, -0.9487610459327698, -0.9014895558357239, -0.854218065738678, -0.8069465160369873, -0.7596750259399414, -0.7124035358428955, -0.6651320457458496, -0.6178605556488037, -0.570589005947113, -0.5233175158500671, -0.47604602575302124, -0.42877450585365295, -0.38150298595428467, -0.3342313766479492, -0.2869598865509033, -0.23968836665153503, -0.19241686165332794, -0.14514535665512085, -0.09787385165691376, -0.050602346658706665, -0.003330826759338379, 0.04394066333770752, 0.09121216833591461, 0.1384836733341217, 0.1857551783323288, 0.2330266833305359, 0.2802981734275818, 0.3275696933269501, 0.37484121322631836, 0.42211270332336426, 0.46938419342041016, 0.516655683517456, 0.5639272332191467, 0.6111987233161926, 0.6584702134132385, 0.7057417631149292, 0.7530132532119751, 0.800284743309021, 0.8475562334060669, 0.8948277235031128, 0.9420992732048035, 0.9893707633018494, 1.03664231300354, 1.083913803100586, 1.1311852931976318, 1.1784567832946777]}, "gradients/encoder.encoder.layers.14.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 2.0, 4.0, 0.0, 3.0, 6.0, 6.0, 7.0, 7.0, 7.0, 8.0, 13.0, 21.0, 16.0, 22.0, 32.0, 33.0, 41.0, 40.0, 46.0, 37.0, 49.0, 34.0, 58.0, 47.0, 39.0, 40.0, 45.0, 37.0, 30.0, 33.0, 33.0, 35.0, 33.0, 23.0, 20.0, 18.0, 9.0, 16.0, 9.0, 11.0, 11.0, 7.0, 3.0, 10.0, 3.0, 2.0, 0.0, 3.0, 3.0, 0.0, 1.0, 4.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.5267287492752075, -0.5095049738883972, -0.4922812581062317, -0.4750574827194214, -0.45783373713493347, -0.44060999155044556, -0.42338621616363525, -0.40616247057914734, -0.3889387249946594, -0.3717149794101715, -0.3544912338256836, -0.3372674584388733, -0.3200437128543854, -0.30281996726989746, -0.28559619188308716, -0.26837244629859924, -0.25114870071411133, -0.2339249551296234, -0.2167011946439743, -0.1994774341583252, -0.18225368857383728, -0.16502994298934937, -0.14780618250370026, -0.13058242201805115, -0.11335867643356323, -0.09613492339849472, -0.07891117036342621, -0.0616874173283577, -0.044463664293289185, -0.027239911258220673, -0.01001615822315216, 0.007207594811916351, 0.024431288242340088, 0.0416550412774086, 0.05887879431247711, 0.07610254734754562, 0.09332630038261414, 0.11055005341768265, 0.12777380645275116, 0.14499756693840027, 0.16222131252288818, 0.1794450581073761, 0.1966688185930252, 0.21389257907867432, 0.23111632466316223, 0.24834007024765015, 0.26556384563446045, 0.28278759121894836, 0.3000113368034363, 0.3172350823879242, 0.3344588279724121, 0.3516826033592224, 0.3689063489437103, 0.38613009452819824, 0.40335386991500854, 0.42057761549949646, 0.4378013610839844, 0.4550251066684723, 0.4722488522529602, 0.4894726276397705, 0.506696343421936, 0.5239201188087463, 0.5411438941955566, 0.5583676099777222, 0.5755913853645325]}, "gradients/encoder.encoder.layers.13.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 2.0, 1.0, 1.0, 2.0, 3.0, 6.0, 3.0, 1.0, 7.0, 13.0, 11.0, 13.0, 19.0, 29.0, 36.0, 47.0, 61.0, 96.0, 205.0, 391.0, 1039.0, 3865.0, 33127.0, 4124014.0, 25837.0, 3602.0, 1008.0, 412.0, 188.0, 111.0, 38.0, 37.0, 16.0, 22.0, 10.0, 9.0, 5.0, 2.0, 0.0, 2.0, 0.0, 3.0, 1.0, 0.0, 2.0], "bins": [-0.5849609375, -0.5711822509765625, -0.557403564453125, -0.5436248779296875, -0.52984619140625, -0.5160675048828125, -0.502288818359375, -0.4885101318359375, -0.4747314453125, -0.4609527587890625, -0.447174072265625, -0.4333953857421875, -0.41961669921875, -0.4058380126953125, -0.392059326171875, -0.3782806396484375, -0.364501953125, -0.3507232666015625, -0.336944580078125, -0.3231658935546875, -0.30938720703125, -0.2956085205078125, -0.281829833984375, -0.2680511474609375, -0.2542724609375, -0.2404937744140625, -0.226715087890625, -0.2129364013671875, -0.19915771484375, -0.1853790283203125, -0.171600341796875, -0.1578216552734375, -0.14404296875, -0.1302642822265625, -0.116485595703125, -0.1027069091796875, -0.08892822265625, -0.0751495361328125, -0.061370849609375, -0.0475921630859375, -0.0338134765625, -0.0200347900390625, -0.006256103515625, 0.0075225830078125, 0.02130126953125, 0.0350799560546875, 0.048858642578125, 0.0626373291015625, 0.076416015625, 0.0901947021484375, 0.103973388671875, 0.1177520751953125, 0.13153076171875, 0.1453094482421875, 0.159088134765625, 0.1728668212890625, 0.1866455078125, 0.2004241943359375, 0.214202880859375, 0.2279815673828125, 0.24176025390625, 0.2555389404296875, 0.269317626953125, 0.2830963134765625, 0.296875]}, "gradients/encoder.encoder.layers.13.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 2.0, 8.0, 12.0, 9.0, 15.0, 26.0, 52.0, 53.0, 74.0, 85.0, 102.0, 97.0, 117.0, 94.0, 72.0, 63.0, 34.0, 39.0, 24.0, 8.0, 7.0, 4.0, 3.0, 4.0, 2.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.09161376953125, -0.08940792083740234, -0.08720207214355469, -0.08499622344970703, -0.08279037475585938, -0.08058452606201172, -0.07837867736816406, -0.0761728286743164, -0.07396697998046875, -0.0717611312866211, -0.06955528259277344, -0.06734943389892578, -0.06514358520507812, -0.06293773651123047, -0.06073188781738281, -0.058526039123535156, -0.0563201904296875, -0.054114341735839844, -0.05190849304199219, -0.04970264434814453, -0.047496795654296875, -0.04529094696044922, -0.04308509826660156, -0.040879249572753906, -0.03867340087890625, -0.036467552185058594, -0.03426170349121094, -0.03205585479736328, -0.029850006103515625, -0.02764415740966797, -0.025438308715820312, -0.023232460021972656, -0.021026611328125, -0.018820762634277344, -0.016614913940429688, -0.014409065246582031, -0.012203216552734375, -0.009997367858886719, -0.0077915191650390625, -0.005585670471191406, -0.00337982177734375, -0.0011739730834960938, 0.0010318756103515625, 0.0032377243041992188, 0.005443572998046875, 0.007649421691894531, 0.009855270385742188, 0.012061119079589844, 0.0142669677734375, 0.016472816467285156, 0.018678665161132812, 0.02088451385498047, 0.023090362548828125, 0.02529621124267578, 0.027502059936523438, 0.029707908630371094, 0.03191375732421875, 0.034119606018066406, 0.03632545471191406, 0.03853130340576172, 0.040737152099609375, 0.04294300079345703, 0.04514884948730469, 0.047354698181152344, 0.049560546875]}, "gradients/encoder.encoder.layers.13.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 3.0, 13.0, 8.0, 14.0, 33.0, 36.0, 42.0, 88.0, 131.0, 314.0, 824.0, 3268.0, 21349.0, 4071321.0, 87886.0, 6502.0, 1475.0, 485.0, 203.0, 109.0, 68.0, 41.0, 33.0, 20.0, 13.0, 10.0, 7.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.4462890625, -0.4317588806152344, -0.41722869873046875, -0.4026985168457031, -0.3881683349609375, -0.3736381530761719, -0.35910797119140625, -0.3445777893066406, -0.330047607421875, -0.3155174255371094, -0.30098724365234375, -0.2864570617675781, -0.2719268798828125, -0.2573966979980469, -0.24286651611328125, -0.22833633422851562, -0.21380615234375, -0.19927597045898438, -0.18474578857421875, -0.17021560668945312, -0.1556854248046875, -0.14115524291992188, -0.12662506103515625, -0.11209487915039062, -0.097564697265625, -0.08303451538085938, -0.06850433349609375, -0.053974151611328125, -0.0394439697265625, -0.024913787841796875, -0.01038360595703125, 0.004146575927734375, 0.0186767578125, 0.033206939697265625, 0.04773712158203125, 0.062267303466796875, 0.0767974853515625, 0.09132766723632812, 0.10585784912109375, 0.12038803100585938, 0.134918212890625, 0.14944839477539062, 0.16397857666015625, 0.17850875854492188, 0.1930389404296875, 0.20756912231445312, 0.22209930419921875, 0.23662948608398438, 0.25115966796875, 0.2656898498535156, 0.28022003173828125, 0.2947502136230469, 0.3092803955078125, 0.3238105773925781, 0.33834075927734375, 0.3528709411621094, 0.367401123046875, 0.3819313049316406, 0.39646148681640625, 0.4109916687011719, 0.4255218505859375, 0.4400520324707031, 0.45458221435546875, 0.4691123962402344, 0.483642578125]}, "gradients/encoder.encoder.layers.13.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 5.0, 4.0, 5.0, 2.0, 11.0, 8.0, 14.0, 35.0, 70.0, 206.0, 2841.0, 646.0, 94.0, 53.0, 26.0, 19.0, 16.0, 8.0, 7.0, 2.0, 4.0, 5.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09722900390625, -0.09385013580322266, -0.09047126770019531, -0.08709239959716797, -0.08371353149414062, -0.08033466339111328, -0.07695579528808594, -0.0735769271850586, -0.07019805908203125, -0.0668191909790039, -0.06344032287597656, -0.06006145477294922, -0.056682586669921875, -0.05330371856689453, -0.04992485046386719, -0.046545982360839844, -0.0431671142578125, -0.039788246154785156, -0.03640937805175781, -0.03303050994873047, -0.029651641845703125, -0.02627277374267578, -0.022893905639648438, -0.019515037536621094, -0.01613616943359375, -0.012757301330566406, -0.009378433227539062, -0.005999565124511719, -0.002620697021484375, 0.0007581710815429688, 0.0041370391845703125, 0.007515907287597656, 0.010894775390625, 0.014273643493652344, 0.017652511596679688, 0.02103137969970703, 0.024410247802734375, 0.02778911590576172, 0.031167984008789062, 0.034546852111816406, 0.03792572021484375, 0.041304588317871094, 0.04468345642089844, 0.04806232452392578, 0.051441192626953125, 0.05482006072998047, 0.05819892883300781, 0.061577796936035156, 0.0649566650390625, 0.06833553314208984, 0.07171440124511719, 0.07509326934814453, 0.07847213745117188, 0.08185100555419922, 0.08522987365722656, 0.0886087417602539, 0.09198760986328125, 0.0953664779663086, 0.09874534606933594, 0.10212421417236328, 0.10550308227539062, 0.10888195037841797, 0.11226081848144531, 0.11563968658447266, 0.1190185546875]}, "gradients/encoder.encoder.layers.13.final_layer_norm.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 2.0, 2.0, 0.0, 1.0, 3.0, 3.0, 13.0, 35.0, 112.0, 349.0, 324.0, 105.0, 38.0, 12.0, 3.0, 4.0, 3.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.31402260065078735, -0.28955355286598206, -0.26508450508117676, -0.24061548709869385, -0.21614643931388855, -0.19167739152908325, -0.16720835864543915, -0.14273932576179504, -0.11827027797698975, -0.09380123764276505, -0.06933219730854034, -0.04486315697431564, -0.020394116640090942, 0.0040749236941337585, 0.02854396402835846, 0.053012996912002563, 0.07748204469680786, 0.10195108503103256, 0.12642012536525726, 0.15088915824890137, 0.17535820603370667, 0.19982725381851196, 0.22429628670215607, 0.24876531958580017, 0.27323436737060547, 0.29770341515541077, 0.32217246294021606, 0.346641480922699, 0.3711105287075043, 0.39557957649230957, 0.4200485944747925, 0.4445176422595978, 0.4689866304397583, 0.4934556782245636, 0.5179247260093689, 0.5423937439918518, 0.5668628215789795, 0.5913318395614624, 0.6158008575439453, 0.6402698755264282, 0.6647389531135559, 0.6892079710960388, 0.7136770486831665, 0.7381460666656494, 0.7626150846481323, 0.78708416223526, 0.8115531802177429, 0.8360222578048706, 0.8604912757873535, 0.8849602937698364, 0.9094293713569641, 0.933898389339447, 0.9583674669265747, 0.9828364849090576, 1.0073055028915405, 1.0317745208740234, 1.056243658065796, 1.0807126760482788, 1.1051816940307617, 1.1296508312225342, 1.154119849205017, 1.1785888671875, 1.203057885169983, 1.2275269031524658, 1.2519959211349487]}, "gradients/encoder.encoder.layers.13.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 3.0, 0.0, 1.0, 2.0, 0.0, 2.0, 3.0, 6.0, 12.0, 14.0, 16.0, 20.0, 22.0, 31.0, 50.0, 49.0, 82.0, 68.0, 67.0, 92.0, 91.0, 76.0, 61.0, 49.0, 55.0, 39.0, 30.0, 24.0, 20.0, 9.0, 10.0, 7.0, 2.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.1919897198677063, -0.18325240910053253, -0.17451509833335876, -0.165777787566185, -0.15704047679901123, -0.14830316603183746, -0.1395658552646637, -0.13082855939865112, -0.12209124118089676, -0.11335393041372299, -0.10461661964654922, -0.09587931632995605, -0.08714200556278229, -0.07840469479560852, -0.06966738402843475, -0.060930073261260986, -0.05219276249408722, -0.04345545172691345, -0.034718140959739685, -0.025980833917856216, -0.01724352315068245, -0.008506212383508682, 0.00023109465837478638, 0.008968405425548553, 0.01770571619272232, 0.026443026959896088, 0.035180337727069855, 0.04391764476895332, 0.05265495553612709, 0.06139226630330086, 0.07012957334518433, 0.0788668841123581, 0.08760419487953186, 0.09634150564670563, 0.1050788164138794, 0.11381612718105316, 0.12255343794822693, 0.1312907487154007, 0.14002805948257446, 0.14876535534858704, 0.157502681016922, 0.16623999178409576, 0.17497730255126953, 0.1837146133184433, 0.19245192408561707, 0.20118923485279083, 0.2099265456199646, 0.21866384148597717, 0.22740115225315094, 0.2361384630203247, 0.24487577378749847, 0.25361308455467224, 0.2623503804206848, 0.2710877060890198, 0.27982500195503235, 0.2885623276233673, 0.2972996234893799, 0.30603691935539246, 0.3147742450237274, 0.32351154088974, 0.33224886655807495, 0.3409861624240875, 0.3497234880924225, 0.35846078395843506, 0.36719810962677]}, "gradients/encoder.encoder.layers.13.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 3.0, 2.0, 0.0, 2.0, 6.0, 4.0, 4.0, 9.0, 10.0, 12.0, 11.0, 24.0, 27.0, 37.0, 57.0, 95.0, 124.0, 191.0, 323.0, 516.0, 1101.0, 2515.0, 8301.0, 41675.0, 322785.0, 567290.0, 82327.0, 14111.0, 3828.0, 1405.0, 670.0, 363.0, 231.0, 168.0, 98.0, 67.0, 48.0, 35.0, 30.0, 14.0, 9.0, 12.0, 15.0, 1.0, 6.0, 2.0, 4.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.182861328125, -0.17751121520996094, -0.17216110229492188, -0.1668109893798828, -0.16146087646484375, -0.1561107635498047, -0.15076065063476562, -0.14541053771972656, -0.1400604248046875, -0.13471031188964844, -0.12936019897460938, -0.12401008605957031, -0.11865997314453125, -0.11330986022949219, -0.10795974731445312, -0.10260963439941406, -0.097259521484375, -0.09190940856933594, -0.08655929565429688, -0.08120918273925781, -0.07585906982421875, -0.07050895690917969, -0.06515884399414062, -0.05980873107910156, -0.0544586181640625, -0.04910850524902344, -0.043758392333984375, -0.03840827941894531, -0.03305816650390625, -0.027708053588867188, -0.022357940673828125, -0.017007827758789062, -0.01165771484375, -0.0063076019287109375, -0.000957489013671875, 0.0043926239013671875, 0.00974273681640625, 0.015092849731445312, 0.020442962646484375, 0.025793075561523438, 0.0311431884765625, 0.03649330139160156, 0.041843414306640625, 0.04719352722167969, 0.05254364013671875, 0.05789375305175781, 0.06324386596679688, 0.06859397888183594, 0.073944091796875, 0.07929420471191406, 0.08464431762695312, 0.08999443054199219, 0.09534454345703125, 0.10069465637207031, 0.10604476928710938, 0.11139488220214844, 0.1167449951171875, 0.12209510803222656, 0.12744522094726562, 0.1327953338623047, 0.13814544677734375, 0.1434955596923828, 0.14884567260742188, 0.15419578552246094, 0.1595458984375]}, "gradients/encoder.encoder.layers.13.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 2.0, 1.0, 7.0, 5.0, 12.0, 25.0, 30.0, 49.0, 56.0, 75.0, 80.0, 102.0, 111.0, 109.0, 96.0, 79.0, 53.0, 47.0, 23.0, 19.0, 9.0, 8.0, 1.0, 6.0, 0.0, 5.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.0955810546875, -0.09327125549316406, -0.09096145629882812, -0.08865165710449219, -0.08634185791015625, -0.08403205871582031, -0.08172225952148438, -0.07941246032714844, -0.0771026611328125, -0.07479286193847656, -0.07248306274414062, -0.07017326354980469, -0.06786346435546875, -0.06555366516113281, -0.06324386596679688, -0.06093406677246094, -0.058624267578125, -0.05631446838378906, -0.054004669189453125, -0.05169486999511719, -0.04938507080078125, -0.04707527160644531, -0.044765472412109375, -0.04245567321777344, -0.0401458740234375, -0.03783607482910156, -0.035526275634765625, -0.03321647644042969, -0.03090667724609375, -0.028596878051757812, -0.026287078857421875, -0.023977279663085938, -0.02166748046875, -0.019357681274414062, -0.017047882080078125, -0.014738082885742188, -0.01242828369140625, -0.010118484497070312, -0.007808685302734375, -0.0054988861083984375, -0.0031890869140625, -0.0008792877197265625, 0.001430511474609375, 0.0037403106689453125, 0.00605010986328125, 0.008359909057617188, 0.010669708251953125, 0.012979507446289062, 0.015289306640625, 0.017599105834960938, 0.019908905029296875, 0.022218704223632812, 0.02452850341796875, 0.026838302612304688, 0.029148101806640625, 0.03145790100097656, 0.0337677001953125, 0.03607749938964844, 0.038387298583984375, 0.04069709777832031, 0.04300689697265625, 0.04531669616699219, 0.047626495361328125, 0.04993629455566406, 0.05224609375]}, "gradients/encoder.encoder.layers.13.attention.v_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 2.0, 2.0, 6.0, 8.0, 9.0, 8.0, 10.0, 21.0, 21.0, 41.0, 57.0, 72.0, 117.0, 220.0, 441.0, 1037.0, 2908.0, 11424.0, 62530.0, 455906.0, 438195.0, 59639.0, 11125.0, 2783.0, 970.0, 414.0, 211.0, 122.0, 76.0, 53.0, 41.0, 21.0, 11.0, 15.0, 13.0, 7.0, 3.0, 5.0, 3.0, 3.0, 4.0, 3.0, 3.0, 0.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.150634765625, -0.14644908905029297, -0.14226341247558594, -0.1380777359008789, -0.13389205932617188, -0.12970638275146484, -0.1255207061767578, -0.12133502960205078, -0.11714935302734375, -0.11296367645263672, -0.10877799987792969, -0.10459232330322266, -0.10040664672851562, -0.0962209701538086, -0.09203529357910156, -0.08784961700439453, -0.0836639404296875, -0.07947826385498047, -0.07529258728027344, -0.0711069107055664, -0.06692123413085938, -0.06273555755615234, -0.05854988098144531, -0.05436420440673828, -0.05017852783203125, -0.04599285125732422, -0.04180717468261719, -0.037621498107910156, -0.033435821533203125, -0.029250144958496094, -0.025064468383789062, -0.02087879180908203, -0.016693115234375, -0.012507438659667969, -0.008321762084960938, -0.004136085510253906, 4.9591064453125e-05, 0.004235267639160156, 0.008420944213867188, 0.012606620788574219, 0.01679229736328125, 0.02097797393798828, 0.025163650512695312, 0.029349327087402344, 0.033535003662109375, 0.037720680236816406, 0.04190635681152344, 0.04609203338623047, 0.0502777099609375, 0.05446338653564453, 0.05864906311035156, 0.0628347396850586, 0.06702041625976562, 0.07120609283447266, 0.07539176940917969, 0.07957744598388672, 0.08376312255859375, 0.08794879913330078, 0.09213447570800781, 0.09632015228271484, 0.10050582885742188, 0.1046915054321289, 0.10887718200683594, 0.11306285858154297, 0.11724853515625]}, "gradients/encoder.encoder.layers.13.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 5.0, 5.0, 2.0, 0.0, 3.0, 6.0, 9.0, 11.0, 11.0, 7.0, 14.0, 14.0, 11.0, 18.0, 19.0, 20.0, 28.0, 28.0, 34.0, 36.0, 38.0, 38.0, 29.0, 31.0, 42.0, 42.0, 55.0, 38.0, 39.0, 49.0, 26.0, 31.0, 41.0, 23.0, 36.0, 24.0, 22.0, 16.0, 19.0, 16.0, 16.0, 12.0, 14.0, 7.0, 7.0, 3.0, 6.0, 2.0, 5.0, 2.0, 2.0, 3.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-0.1060791015625, -0.10232353210449219, -0.09856796264648438, -0.09481239318847656, -0.09105682373046875, -0.08730125427246094, -0.08354568481445312, -0.07979011535644531, -0.0760345458984375, -0.07227897644042969, -0.06852340698242188, -0.06476783752441406, -0.06101226806640625, -0.05725669860839844, -0.053501129150390625, -0.04974555969238281, -0.045989990234375, -0.04223442077636719, -0.038478851318359375, -0.03472328186035156, -0.03096771240234375, -0.027212142944335938, -0.023456573486328125, -0.019701004028320312, -0.0159454345703125, -0.012189865112304688, -0.008434295654296875, -0.0046787261962890625, -0.00092315673828125, 0.0028324127197265625, 0.006587982177734375, 0.010343551635742188, 0.01409912109375, 0.017854690551757812, 0.021610260009765625, 0.025365829467773438, 0.02912139892578125, 0.03287696838378906, 0.036632537841796875, 0.04038810729980469, 0.0441436767578125, 0.04789924621582031, 0.051654815673828125, 0.05541038513183594, 0.05916595458984375, 0.06292152404785156, 0.06667709350585938, 0.07043266296386719, 0.074188232421875, 0.07794380187988281, 0.08169937133789062, 0.08545494079589844, 0.08921051025390625, 0.09296607971191406, 0.09672164916992188, 0.10047721862792969, 0.1042327880859375, 0.10798835754394531, 0.11174392700195312, 0.11549949645996094, 0.11925506591796875, 0.12301063537597656, 0.12676620483398438, 0.1305217742919922, 0.13427734375]}, "gradients/encoder.encoder.layers.13.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 3.0, 5.0, 4.0, 12.0, 16.0, 22.0, 39.0, 80.0, 98.0, 229.0, 563.0, 1557.0, 7882.0, 103805.0, 822123.0, 101557.0, 7943.0, 1567.0, 507.0, 252.0, 134.0, 71.0, 41.0, 20.0, 13.0, 7.0, 5.0, 3.0, 3.0, 0.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.051361083984375, -0.04885530471801758, -0.046349525451660156, -0.043843746185302734, -0.04133796691894531, -0.03883218765258789, -0.03632640838623047, -0.03382062911987305, -0.031314849853515625, -0.028809070587158203, -0.02630329132080078, -0.02379751205444336, -0.021291732788085938, -0.018785953521728516, -0.016280174255371094, -0.013774394989013672, -0.01126861572265625, -0.008762836456298828, -0.006257057189941406, -0.0037512779235839844, -0.0012454986572265625, 0.0012602806091308594, 0.0037660598754882812, 0.006271839141845703, 0.008777618408203125, 0.011283397674560547, 0.013789176940917969, 0.01629495620727539, 0.018800735473632812, 0.021306514739990234, 0.023812294006347656, 0.026318073272705078, 0.0288238525390625, 0.03132963180541992, 0.033835411071777344, 0.036341190338134766, 0.03884696960449219, 0.04135274887084961, 0.04385852813720703, 0.04636430740356445, 0.048870086669921875, 0.0513758659362793, 0.05388164520263672, 0.05638742446899414, 0.05889320373535156, 0.061398983001708984, 0.0639047622680664, 0.06641054153442383, 0.06891632080078125, 0.07142210006713867, 0.0739278793334961, 0.07643365859985352, 0.07893943786621094, 0.08144521713256836, 0.08395099639892578, 0.0864567756652832, 0.08896255493164062, 0.09146833419799805, 0.09397411346435547, 0.09647989273071289, 0.09898567199707031, 0.10149145126342773, 0.10399723052978516, 0.10650300979614258, 0.1090087890625]}, "gradients/encoder.encoder.layers.13.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 3.0, 2.0, 7.0, 5.0, 5.0, 9.0, 12.0, 20.0, 24.0, 56.0, 62.0, 62.0, 65.0, 94.0, 101.0, 114.0, 93.0, 72.0, 58.0, 44.0, 38.0, 20.0, 11.0, 18.0, 4.0, 4.0, 4.0, 3.0, 3.0, 1.0, 0.0, 1.0, 3.0], "bins": [-1.9073486328125e-05, -1.8654391169548035e-05, -1.823529601097107e-05, -1.7816200852394104e-05, -1.739710569381714e-05, -1.6978010535240173e-05, -1.6558915376663208e-05, -1.6139820218086243e-05, -1.5720725059509277e-05, -1.5301629900932312e-05, -1.4882534742355347e-05, -1.4463439583778381e-05, -1.4044344425201416e-05, -1.362524926662445e-05, -1.3206154108047485e-05, -1.278705894947052e-05, -1.2367963790893555e-05, -1.194886863231659e-05, -1.1529773473739624e-05, -1.1110678315162659e-05, -1.0691583156585693e-05, -1.0272487998008728e-05, -9.853392839431763e-06, -9.434297680854797e-06, -9.015202522277832e-06, -8.596107363700867e-06, -8.177012205123901e-06, -7.757917046546936e-06, -7.338821887969971e-06, -6.919726729393005e-06, -6.50063157081604e-06, -6.081536412239075e-06, -5.662441253662109e-06, -5.243346095085144e-06, -4.824250936508179e-06, -4.405155777931213e-06, -3.986060619354248e-06, -3.5669654607772827e-06, -3.1478703022003174e-06, -2.728775143623352e-06, -2.3096799850463867e-06, -1.8905848264694214e-06, -1.471489667892456e-06, -1.0523945093154907e-06, -6.332993507385254e-07, -2.1420419216156006e-07, 2.0489096641540527e-07, 6.239861249923706e-07, 1.043081283569336e-06, 1.4621764421463013e-06, 1.8812716007232666e-06, 2.300366759300232e-06, 2.7194619178771973e-06, 3.1385570764541626e-06, 3.557652235031128e-06, 3.976747393608093e-06, 4.395842552185059e-06, 4.814937710762024e-06, 5.234032869338989e-06, 5.653128027915955e-06, 6.07222318649292e-06, 6.491318345069885e-06, 6.910413503646851e-06, 7.329508662223816e-06, 7.748603820800781e-06]}, "gradients/encoder.encoder.layers.13.attention.q_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 3.0, 3.0, 2.0, 5.0, 3.0, 3.0, 5.0, 14.0, 21.0, 18.0, 28.0, 28.0, 44.0, 51.0, 78.0, 118.0, 269.0, 501.0, 1710.0, 8812.0, 99867.0, 788279.0, 134739.0, 10762.0, 2022.0, 562.0, 210.0, 112.0, 61.0, 51.0, 33.0, 30.0, 38.0, 23.0, 17.0, 8.0, 7.0, 7.0, 6.0, 3.0, 5.0, 3.0, 2.0, 1.0, 0.0, 2.0, 1.0, 1.0, 0.0, 2.0], "bins": [-0.09625244140625, -0.09354496002197266, -0.09083747863769531, -0.08812999725341797, -0.08542251586914062, -0.08271503448486328, -0.08000755310058594, -0.0773000717163086, -0.07459259033203125, -0.0718851089477539, -0.06917762756347656, -0.06647014617919922, -0.06376266479492188, -0.06105518341064453, -0.05834770202636719, -0.055640220642089844, -0.0529327392578125, -0.050225257873535156, -0.04751777648925781, -0.04481029510498047, -0.042102813720703125, -0.03939533233642578, -0.03668785095214844, -0.033980369567871094, -0.03127288818359375, -0.028565406799316406, -0.025857925415039062, -0.02315044403076172, -0.020442962646484375, -0.01773548126220703, -0.015027999877929688, -0.012320518493652344, -0.009613037109375, -0.006905555725097656, -0.0041980743408203125, -0.0014905929565429688, 0.001216888427734375, 0.003924369812011719, 0.0066318511962890625, 0.009339332580566406, 0.01204681396484375, 0.014754295349121094, 0.017461776733398438, 0.02016925811767578, 0.022876739501953125, 0.02558422088623047, 0.028291702270507812, 0.030999183654785156, 0.0337066650390625, 0.036414146423339844, 0.03912162780761719, 0.04182910919189453, 0.044536590576171875, 0.04724407196044922, 0.04995155334472656, 0.052659034729003906, 0.05536651611328125, 0.058073997497558594, 0.06078147888183594, 0.06348896026611328, 0.06619644165039062, 0.06890392303466797, 0.07161140441894531, 0.07431888580322266, 0.0770263671875]}, "gradients/encoder.encoder.layers.13.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 3.0, 1.0, 6.0, 8.0, 10.0, 15.0, 16.0, 22.0, 24.0, 28.0, 40.0, 55.0, 73.0, 69.0, 98.0, 96.0, 77.0, 70.0, 72.0, 52.0, 33.0, 41.0, 22.0, 18.0, 22.0, 7.0, 8.0, 9.0, 8.0, 4.0, 0.0, 2.0, 1.0, 4.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.053497314453125, -0.051906585693359375, -0.05031585693359375, -0.048725128173828125, -0.0471343994140625, -0.045543670654296875, -0.04395294189453125, -0.042362213134765625, -0.040771484375, -0.039180755615234375, -0.03759002685546875, -0.035999298095703125, -0.0344085693359375, -0.032817840576171875, -0.03122711181640625, -0.029636383056640625, -0.028045654296875, -0.026454925537109375, -0.02486419677734375, -0.023273468017578125, -0.0216827392578125, -0.020092010498046875, -0.01850128173828125, -0.016910552978515625, -0.01531982421875, -0.013729095458984375, -0.01213836669921875, -0.010547637939453125, -0.0089569091796875, -0.007366180419921875, -0.00577545166015625, -0.004184722900390625, -0.002593994140625, -0.001003265380859375, 0.00058746337890625, 0.002178192138671875, 0.0037689208984375, 0.005359649658203125, 0.00695037841796875, 0.008541107177734375, 0.0101318359375, 0.011722564697265625, 0.01331329345703125, 0.014904022216796875, 0.0164947509765625, 0.018085479736328125, 0.01967620849609375, 0.021266937255859375, 0.022857666015625, 0.024448394775390625, 0.02603912353515625, 0.027629852294921875, 0.0292205810546875, 0.030811309814453125, 0.03240203857421875, 0.033992767333984375, 0.03558349609375, 0.037174224853515625, 0.03876495361328125, 0.040355682373046875, 0.0419464111328125, 0.043537139892578125, 0.04512786865234375, 0.046718597412109375, 0.048309326171875]}, "gradients/encoder.encoder.layers.13.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 6.0, 8.0, 29.0, 26.0, 96.0, 175.0, 257.0, 232.0, 104.0, 51.0, 16.0, 8.0, 5.0, 3.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.8727773427963257, -1.8301141262054443, -1.7874507904052734, -1.744787573814392, -1.7021243572235107, -1.6594610214233398, -1.6167978048324585, -1.5741345882415771, -1.5314712524414062, -1.488808035850525, -1.446144700050354, -1.4034814834594727, -1.3608182668685913, -1.31815505027771, -1.275491714477539, -1.2328284978866577, -1.1901652812957764, -1.147502064704895, -1.1048387289047241, -1.0621755123138428, -1.0195122957229614, -0.9768490195274353, -0.9341857433319092, -0.8915225267410278, -0.8488592505455017, -0.8061959743499756, -0.7635327577590942, -0.7208694815635681, -0.678206205368042, -0.6355429887771606, -0.5928797125816345, -0.5502164363861084, -0.5075533390045166, -0.46489009261131287, -0.42222684621810913, -0.379563570022583, -0.3369003236293793, -0.29423707723617554, -0.2515738010406494, -0.20891055464744568, -0.16624730825424194, -0.12358405441045761, -0.08092080056667328, -0.03825753927230835, 0.004405707120895386, 0.04706895351409912, 0.08973222970962524, 0.13239547610282898, 0.17505872249603271, 0.21772196888923645, 0.2603852152824402, 0.3030484914779663, 0.34571173787117004, 0.3883749842643738, 0.4310382604598999, 0.47370150685310364, 0.5163647532463074, 0.5590280294418335, 0.6016912460327148, 0.644354522228241, 0.6870177984237671, 0.7296810150146484, 0.7723442912101746, 0.8150075674057007, 0.857670783996582]}, "gradients/encoder.encoder.layers.13.layer_norm.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0, 6.0, 3.0, 6.0, 3.0, 7.0, 11.0, 10.0, 12.0, 20.0, 21.0, 26.0, 33.0, 23.0, 34.0, 39.0, 35.0, 43.0, 44.0, 46.0, 49.0, 52.0, 41.0, 48.0, 40.0, 36.0, 27.0, 34.0, 24.0, 36.0, 27.0, 23.0, 29.0, 20.0, 15.0, 11.0, 18.0, 10.0, 6.0, 7.0, 7.0, 8.0, 5.0, 2.0, 3.0, 3.0, 7.0, 3.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6110549569129944, -0.5900056958198547, -0.5689563751220703, -0.5479071140289307, -0.526857852935791, -0.5058085918426514, -0.48475927114486694, -0.4637100100517273, -0.44266071915626526, -0.4216114282608032, -0.4005621671676636, -0.37951287627220154, -0.3584635853767395, -0.33741432428359985, -0.3163650333881378, -0.2953157424926758, -0.27426648139953613, -0.2532171905040741, -0.23216792941093445, -0.2111186385154724, -0.19006936252117157, -0.16902008652687073, -0.1479707956314087, -0.12692151963710785, -0.105872243642807, -0.08482296764850616, -0.06377368420362473, -0.042724400758743286, -0.021675124764442444, -0.0006258487701416016, 0.020423442125320435, 0.04147271811962128, 0.06252199411392212, 0.08357127010822296, 0.1046205535531044, 0.12566983699798584, 0.14671911299228668, 0.16776838898658752, 0.18881767988204956, 0.2098669558763504, 0.23091623187065125, 0.2519655227661133, 0.27301478385925293, 0.29406407475471497, 0.315113365650177, 0.33616262674331665, 0.3572119176387787, 0.3782612085342407, 0.39931046962738037, 0.4203597605228424, 0.44140902161598206, 0.4624583125114441, 0.48350757360458374, 0.5045568943023682, 0.5256061553955078, 0.5466554164886475, 0.5677046775817871, 0.5887539386749268, 0.6098032593727112, 0.6308525204658508, 0.6519017815589905, 0.6729511022567749, 0.6940003633499146, 0.7150496244430542, 0.7360989451408386]}, "gradients/encoder.encoder.layers.12.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 4.0, 2.0, 5.0, 11.0, 12.0, 20.0, 23.0, 50.0, 79.0, 128.0, 238.0, 563.0, 3044.0, 53623.0, 4123969.0, 10378.0, 1403.0, 400.0, 164.0, 85.0, 40.0, 20.0, 7.0, 6.0, 8.0, 3.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.72607421875, -0.708892822265625, -0.69171142578125, -0.674530029296875, -0.6573486328125, -0.640167236328125, -0.62298583984375, -0.605804443359375, -0.588623046875, -0.571441650390625, -0.55426025390625, -0.537078857421875, -0.5198974609375, -0.502716064453125, -0.48553466796875, -0.468353271484375, -0.451171875, -0.433990478515625, -0.41680908203125, -0.399627685546875, -0.3824462890625, -0.365264892578125, -0.34808349609375, -0.330902099609375, -0.313720703125, -0.296539306640625, -0.27935791015625, -0.262176513671875, -0.2449951171875, -0.227813720703125, -0.21063232421875, -0.193450927734375, -0.17626953125, -0.159088134765625, -0.14190673828125, -0.124725341796875, -0.1075439453125, -0.090362548828125, -0.07318115234375, -0.055999755859375, -0.038818359375, -0.021636962890625, -0.00445556640625, 0.012725830078125, 0.0299072265625, 0.047088623046875, 0.06427001953125, 0.081451416015625, 0.0986328125, 0.115814208984375, 0.13299560546875, 0.150177001953125, 0.1673583984375, 0.184539794921875, 0.20172119140625, 0.218902587890625, 0.236083984375, 0.253265380859375, 0.27044677734375, 0.287628173828125, 0.3048095703125, 0.321990966796875, 0.33917236328125, 0.356353759765625, 0.37353515625]}, "gradients/encoder.encoder.layers.12.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 3.0, 3.0, 4.0, 18.0, 23.0, 35.0, 56.0, 75.0, 75.0, 88.0, 95.0, 125.0, 92.0, 90.0, 63.0, 56.0, 35.0, 24.0, 19.0, 11.0, 5.0, 4.0, 6.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.0968017578125, -0.09450149536132812, -0.09220123291015625, -0.08990097045898438, -0.0876007080078125, -0.08530044555664062, -0.08300018310546875, -0.08069992065429688, -0.078399658203125, -0.07609939575195312, -0.07379913330078125, -0.07149887084960938, -0.0691986083984375, -0.06689834594726562, -0.06459808349609375, -0.062297821044921875, -0.05999755859375, -0.057697296142578125, -0.05539703369140625, -0.053096771240234375, -0.0507965087890625, -0.048496246337890625, -0.04619598388671875, -0.043895721435546875, -0.041595458984375, -0.039295196533203125, -0.03699493408203125, -0.034694671630859375, -0.0323944091796875, -0.030094146728515625, -0.02779388427734375, -0.025493621826171875, -0.023193359375, -0.020893096923828125, -0.01859283447265625, -0.016292572021484375, -0.0139923095703125, -0.011692047119140625, -0.00939178466796875, -0.007091522216796875, -0.004791259765625, -0.002490997314453125, -0.00019073486328125, 0.002109527587890625, 0.0044097900390625, 0.006710052490234375, 0.00901031494140625, 0.011310577392578125, 0.01361083984375, 0.015911102294921875, 0.01821136474609375, 0.020511627197265625, 0.0228118896484375, 0.025112152099609375, 0.02741241455078125, 0.029712677001953125, 0.032012939453125, 0.034313201904296875, 0.03661346435546875, 0.038913726806640625, 0.0412139892578125, 0.043514251708984375, 0.04581451416015625, 0.048114776611328125, 0.0504150390625]}, "gradients/encoder.encoder.layers.12.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 2.0, 2.0, 5.0, 6.0, 10.0, 8.0, 15.0, 25.0, 37.0, 67.0, 117.0, 166.0, 288.0, 471.0, 936.0, 1905.0, 4515.0, 15938.0, 312389.0, 3821712.0, 24947.0, 5844.0, 2327.0, 1107.0, 561.0, 348.0, 216.0, 111.0, 81.0, 54.0, 30.0, 25.0, 12.0, 5.0, 6.0, 5.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.313232421875, -0.3034820556640625, -0.293731689453125, -0.2839813232421875, -0.27423095703125, -0.2644805908203125, -0.254730224609375, -0.2449798583984375, -0.2352294921875, -0.2254791259765625, -0.215728759765625, -0.2059783935546875, -0.19622802734375, -0.1864776611328125, -0.176727294921875, -0.1669769287109375, -0.1572265625, -0.1474761962890625, -0.137725830078125, -0.1279754638671875, -0.11822509765625, -0.1084747314453125, -0.098724365234375, -0.0889739990234375, -0.0792236328125, -0.0694732666015625, -0.059722900390625, -0.0499725341796875, -0.04022216796875, -0.0304718017578125, -0.020721435546875, -0.0109710693359375, -0.001220703125, 0.0085296630859375, 0.018280029296875, 0.0280303955078125, 0.03778076171875, 0.0475311279296875, 0.057281494140625, 0.0670318603515625, 0.0767822265625, 0.0865325927734375, 0.096282958984375, 0.1060333251953125, 0.11578369140625, 0.1255340576171875, 0.135284423828125, 0.1450347900390625, 0.15478515625, 0.1645355224609375, 0.174285888671875, 0.1840362548828125, 0.19378662109375, 0.2035369873046875, 0.213287353515625, 0.2230377197265625, 0.2327880859375, 0.2425384521484375, 0.252288818359375, 0.2620391845703125, 0.27178955078125, 0.2815399169921875, 0.291290283203125, 0.3010406494140625, 0.310791015625]}, "gradients/encoder.encoder.layers.12.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 2.0, 3.0, 4.0, 2.0, 2.0, 4.0, 2.0, 7.0, 11.0, 9.0, 23.0, 40.0, 69.0, 227.0, 2899.0, 544.0, 103.0, 52.0, 27.0, 14.0, 7.0, 10.0, 3.0, 3.0, 4.0, 6.0, 3.0, 2.0, 0.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.11248779296875, -0.10882568359375, -0.10516357421875, -0.10150146484375, -0.09783935546875, -0.09417724609375, -0.09051513671875, -0.08685302734375, -0.08319091796875, -0.07952880859375, -0.07586669921875, -0.07220458984375, -0.06854248046875, -0.06488037109375, -0.06121826171875, -0.05755615234375, -0.05389404296875, -0.05023193359375, -0.04656982421875, -0.04290771484375, -0.03924560546875, -0.03558349609375, -0.03192138671875, -0.02825927734375, -0.02459716796875, -0.02093505859375, -0.01727294921875, -0.01361083984375, -0.00994873046875, -0.00628662109375, -0.00262451171875, 0.00103759765625, 0.00469970703125, 0.00836181640625, 0.01202392578125, 0.01568603515625, 0.01934814453125, 0.02301025390625, 0.02667236328125, 0.03033447265625, 0.03399658203125, 0.03765869140625, 0.04132080078125, 0.04498291015625, 0.04864501953125, 0.05230712890625, 0.05596923828125, 0.05963134765625, 0.06329345703125, 0.06695556640625, 0.07061767578125, 0.07427978515625, 0.07794189453125, 0.08160400390625, 0.08526611328125, 0.08892822265625, 0.09259033203125, 0.09625244140625, 0.09991455078125, 0.10357666015625, 0.10723876953125, 0.11090087890625, 0.11456298828125, 0.11822509765625, 0.12188720703125]}, "gradients/encoder.encoder.layers.12.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 2.0, 3.0, 2.0, 11.0, 33.0, 82.0, 264.0, 384.0, 164.0, 36.0, 14.0, 5.0, 4.0, 1.0, 1.0, 4.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6126707792282104, -0.5808740258216858, -0.5490772128105164, -0.5172804594039917, -0.48548367619514465, -0.4536868929862976, -0.42189013957977295, -0.3900933563709259, -0.35829657316207886, -0.3264997899532318, -0.29470300674438477, -0.2629062533378601, -0.23110947012901306, -0.19931268692016602, -0.16751591861248016, -0.1357191503047943, -0.10392236709594727, -0.07212559133768082, -0.04032881557941437, -0.008532039821147919, 0.02326473593711853, 0.055061519145965576, 0.08685828745365143, 0.11865505576133728, 0.15045183897018433, 0.18224862217903137, 0.21404539048671722, 0.24584215879440308, 0.2776389420032501, 0.30943572521209717, 0.3412324786186218, 0.37302926182746887, 0.40482592582702637, 0.4366227090358734, 0.46841949224472046, 0.5002162456512451, 0.5320130586624146, 0.5638098120689392, 0.5956065654754639, 0.6274033784866333, 0.659200131893158, 0.6909968852996826, 0.722793698310852, 0.7545904517173767, 0.7863872051239014, 0.8181840181350708, 0.8499807715415955, 0.8817775249481201, 0.9135743379592896, 0.9453710913658142, 0.9771679043769836, 1.0089646577835083, 1.0407614707946777, 1.0725581645965576, 1.104354977607727, 1.1361517906188965, 1.1679484844207764, 1.1997452974319458, 1.2315419912338257, 1.2633388042449951, 1.2951356172561646, 1.326932430267334, 1.3587291240692139, 1.3905259370803833, 1.4223227500915527]}, "gradients/encoder.encoder.layers.12.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 1.0, 0.0, 2.0, 6.0, 1.0, 6.0, 11.0, 23.0, 30.0, 49.0, 62.0, 82.0, 88.0, 101.0, 107.0, 100.0, 74.0, 81.0, 71.0, 49.0, 24.0, 13.0, 9.0, 10.0, 6.0, 3.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 2.0, 1.0], "bins": [-0.6076111197471619, -0.5939531326293945, -0.580295205116272, -0.5666372179985046, -0.5529792308807373, -0.5393213033676147, -0.5256633162498474, -0.5120053291320801, -0.49834737181663513, -0.4846894145011902, -0.47103142738342285, -0.4573734700679779, -0.44371551275253296, -0.4300575256347656, -0.4163995683193207, -0.40274161100387573, -0.3890836238861084, -0.37542566657066345, -0.3617676794528961, -0.34810972213745117, -0.33445173501968384, -0.3207937777042389, -0.30713582038879395, -0.2934778332710266, -0.27981987595558167, -0.2661619186401367, -0.2525039315223694, -0.23884597420692444, -0.2251880019903183, -0.21153002977371216, -0.1978720724582672, -0.18421410024166107, -0.17055612802505493, -0.1568981558084488, -0.14324018359184265, -0.1295822262763977, -0.11592425405979156, -0.10226628184318542, -0.08860831707715988, -0.07495035231113434, -0.0612923800945282, -0.04763441160321236, -0.033976443111896515, -0.020318474620580673, -0.0066605061292648315, 0.006997466087341309, 0.020655430853366852, 0.034313395619392395, 0.047971367835998535, 0.06162933632731438, 0.07528730481863022, 0.08894526958465576, 0.1026032418012619, 0.11626121401786804, 0.129919171333313, 0.14357714354991913, 0.15723511576652527, 0.1708930879831314, 0.18455106019973755, 0.1982090175151825, 0.21186698973178864, 0.22552496194839478, 0.23918291926383972, 0.25284087657928467, 0.266498863697052]}, "gradients/encoder.encoder.layers.12.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 4.0, 0.0, 0.0, 5.0, 1.0, 7.0, 4.0, 9.0, 10.0, 12.0, 15.0, 26.0, 37.0, 55.0, 77.0, 100.0, 179.0, 291.0, 542.0, 1193.0, 3166.0, 12695.0, 106800.0, 803124.0, 102413.0, 12149.0, 3096.0, 1179.0, 527.0, 305.0, 173.0, 106.0, 74.0, 56.0, 35.0, 24.0, 22.0, 14.0, 13.0, 5.0, 7.0, 2.0, 5.0, 4.0, 2.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.2548828125, -0.24727249145507812, -0.23966217041015625, -0.23205184936523438, -0.2244415283203125, -0.21683120727539062, -0.20922088623046875, -0.20161056518554688, -0.194000244140625, -0.18638992309570312, -0.17877960205078125, -0.17116928100585938, -0.1635589599609375, -0.15594863891601562, -0.14833831787109375, -0.14072799682617188, -0.13311767578125, -0.12550735473632812, -0.11789703369140625, -0.11028671264648438, -0.1026763916015625, -0.09506607055664062, -0.08745574951171875, -0.07984542846679688, -0.072235107421875, -0.06462478637695312, -0.05701446533203125, -0.049404144287109375, -0.0417938232421875, -0.034183502197265625, -0.02657318115234375, -0.018962860107421875, -0.0113525390625, -0.003742218017578125, 0.00386810302734375, 0.011478424072265625, 0.0190887451171875, 0.026699066162109375, 0.03430938720703125, 0.041919708251953125, 0.049530029296875, 0.057140350341796875, 0.06475067138671875, 0.07236099243164062, 0.0799713134765625, 0.08758163452148438, 0.09519195556640625, 0.10280227661132812, 0.11041259765625, 0.11802291870117188, 0.12563323974609375, 0.13324356079101562, 0.1408538818359375, 0.14846420288085938, 0.15607452392578125, 0.16368484497070312, 0.171295166015625, 0.17890548706054688, 0.18651580810546875, 0.19412612915039062, 0.2017364501953125, 0.20934677124023438, 0.21695709228515625, 0.22456741333007812, 0.232177734375]}, "gradients/encoder.encoder.layers.12.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 1.0, 4.0, 7.0, 11.0, 14.0, 36.0, 58.0, 67.0, 92.0, 95.0, 104.0, 131.0, 94.0, 86.0, 73.0, 46.0, 29.0, 18.0, 20.0, 11.0, 3.0, 6.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0], "bins": [-0.10955810546875, -0.10701227188110352, -0.10446643829345703, -0.10192060470581055, -0.09937477111816406, -0.09682893753051758, -0.0942831039428711, -0.09173727035522461, -0.08919143676757812, -0.08664560317993164, -0.08409976959228516, -0.08155393600463867, -0.07900810241699219, -0.0764622688293457, -0.07391643524169922, -0.07137060165405273, -0.06882476806640625, -0.06627893447875977, -0.06373310089111328, -0.0611872673034668, -0.05864143371582031, -0.05609560012817383, -0.053549766540527344, -0.05100393295288086, -0.048458099365234375, -0.04591226577758789, -0.043366432189941406, -0.04082059860229492, -0.03827476501464844, -0.03572893142700195, -0.03318309783935547, -0.030637264251708984, -0.0280914306640625, -0.025545597076416016, -0.02299976348876953, -0.020453929901123047, -0.017908096313476562, -0.015362262725830078, -0.012816429138183594, -0.01027059555053711, -0.007724761962890625, -0.005178928375244141, -0.0026330947875976562, -8.726119995117188e-05, 0.0024585723876953125, 0.005004405975341797, 0.007550239562988281, 0.010096073150634766, 0.01264190673828125, 0.015187740325927734, 0.01773357391357422, 0.020279407501220703, 0.022825241088867188, 0.025371074676513672, 0.027916908264160156, 0.03046274185180664, 0.033008575439453125, 0.03555440902709961, 0.038100242614746094, 0.04064607620239258, 0.04319190979003906, 0.04573774337768555, 0.04828357696533203, 0.050829410552978516, 0.053375244140625]}, "gradients/encoder.encoder.layers.12.attention.v_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 7.0, 4.0, 3.0, 13.0, 8.0, 20.0, 28.0, 51.0, 93.0, 137.0, 277.0, 542.0, 1375.0, 4000.0, 18642.0, 224283.0, 730598.0, 56302.0, 8199.0, 2273.0, 807.0, 415.0, 221.0, 99.0, 58.0, 36.0, 27.0, 11.0, 11.0, 6.0, 6.0, 5.0, 4.0, 1.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.16455078125, -0.15892601013183594, -0.15330123901367188, -0.1476764678955078, -0.14205169677734375, -0.1364269256591797, -0.13080215454101562, -0.12517738342285156, -0.1195526123046875, -0.11392784118652344, -0.10830307006835938, -0.10267829895019531, -0.09705352783203125, -0.09142875671386719, -0.08580398559570312, -0.08017921447753906, -0.074554443359375, -0.06892967224121094, -0.06330490112304688, -0.05768013000488281, -0.05205535888671875, -0.04643058776855469, -0.040805816650390625, -0.03518104553222656, -0.0295562744140625, -0.023931503295898438, -0.018306732177734375, -0.012681961059570312, -0.00705718994140625, -0.0014324188232421875, 0.004192352294921875, 0.009817123413085938, 0.01544189453125, 0.021066665649414062, 0.026691436767578125, 0.03231620788574219, 0.03794097900390625, 0.04356575012207031, 0.049190521240234375, 0.05481529235839844, 0.0604400634765625, 0.06606483459472656, 0.07168960571289062, 0.07731437683105469, 0.08293914794921875, 0.08856391906738281, 0.09418869018554688, 0.09981346130371094, 0.105438232421875, 0.11106300354003906, 0.11668777465820312, 0.12231254577636719, 0.12793731689453125, 0.1335620880126953, 0.13918685913085938, 0.14481163024902344, 0.1504364013671875, 0.15606117248535156, 0.16168594360351562, 0.1673107147216797, 0.17293548583984375, 0.1785602569580078, 0.18418502807617188, 0.18980979919433594, 0.1954345703125]}, "gradients/encoder.encoder.layers.12.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 3.0, 3.0, 1.0, 4.0, 2.0, 5.0, 6.0, 7.0, 6.0, 10.0, 17.0, 21.0, 26.0, 25.0, 16.0, 24.0, 36.0, 29.0, 35.0, 39.0, 35.0, 51.0, 36.0, 51.0, 42.0, 54.0, 49.0, 50.0, 42.0, 35.0, 45.0, 32.0, 24.0, 23.0, 23.0, 24.0, 19.0, 7.0, 7.0, 12.0, 9.0, 12.0, 5.0, 4.0, 3.0, 3.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.125732421875, -0.12141799926757812, -0.11710357666015625, -0.11278915405273438, -0.1084747314453125, -0.10416030883789062, -0.09984588623046875, -0.09553146362304688, -0.091217041015625, -0.08690261840820312, -0.08258819580078125, -0.07827377319335938, -0.0739593505859375, -0.06964492797851562, -0.06533050537109375, -0.061016082763671875, -0.05670166015625, -0.052387237548828125, -0.04807281494140625, -0.043758392333984375, -0.0394439697265625, -0.035129547119140625, -0.03081512451171875, -0.026500701904296875, -0.022186279296875, -0.017871856689453125, -0.01355743408203125, -0.009243011474609375, -0.0049285888671875, -0.000614166259765625, 0.00370025634765625, 0.008014678955078125, 0.0123291015625, 0.016643524169921875, 0.02095794677734375, 0.025272369384765625, 0.0295867919921875, 0.033901214599609375, 0.03821563720703125, 0.042530059814453125, 0.046844482421875, 0.051158905029296875, 0.05547332763671875, 0.059787750244140625, 0.0641021728515625, 0.06841659545898438, 0.07273101806640625, 0.07704544067382812, 0.08135986328125, 0.08567428588867188, 0.08998870849609375, 0.09430313110351562, 0.0986175537109375, 0.10293197631835938, 0.10724639892578125, 0.11156082153320312, 0.115875244140625, 0.12018966674804688, 0.12450408935546875, 0.12881851196289062, 0.1331329345703125, 0.13744735717773438, 0.14176177978515625, 0.14607620239257812, 0.150390625]}, "gradients/encoder.encoder.layers.12.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 5.0, 0.0, 3.0, 2.0, 2.0, 0.0, 2.0, 11.0, 8.0, 17.0, 21.0, 38.0, 55.0, 86.0, 123.0, 220.0, 424.0, 860.0, 2264.0, 6451.0, 24510.0, 119851.0, 572298.0, 258673.0, 45550.0, 10997.0, 3468.0, 1286.0, 590.0, 298.0, 160.0, 104.0, 86.0, 43.0, 14.0, 13.0, 8.0, 10.0, 3.0, 6.0, 1.0, 3.0, 2.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.03741455078125, -0.036196231842041016, -0.03497791290283203, -0.03375959396362305, -0.03254127502441406, -0.03132295608520508, -0.030104637145996094, -0.02888631820678711, -0.027667999267578125, -0.02644968032836914, -0.025231361389160156, -0.024013042449951172, -0.022794723510742188, -0.021576404571533203, -0.02035808563232422, -0.019139766693115234, -0.01792144775390625, -0.016703128814697266, -0.015484809875488281, -0.014266490936279297, -0.013048171997070312, -0.011829853057861328, -0.010611534118652344, -0.00939321517944336, -0.008174896240234375, -0.006956577301025391, -0.005738258361816406, -0.004519939422607422, -0.0033016204833984375, -0.002083301544189453, -0.0008649826049804688, 0.0003533363342285156, 0.0015716552734375, 0.0027899742126464844, 0.004008293151855469, 0.005226612091064453, 0.0064449310302734375, 0.007663249969482422, 0.008881568908691406, 0.01009988784790039, 0.011318206787109375, 0.01253652572631836, 0.013754844665527344, 0.014973163604736328, 0.016191482543945312, 0.017409801483154297, 0.01862812042236328, 0.019846439361572266, 0.02106475830078125, 0.022283077239990234, 0.02350139617919922, 0.024719715118408203, 0.025938034057617188, 0.027156352996826172, 0.028374671936035156, 0.02959299087524414, 0.030811309814453125, 0.03202962875366211, 0.033247947692871094, 0.03446626663208008, 0.03568458557128906, 0.03690290451049805, 0.03812122344970703, 0.039339542388916016, 0.040557861328125]}, "gradients/encoder.encoder.layers.12.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 1.0, 2.0, 2.0, 7.0, 4.0, 14.0, 24.0, 27.0, 49.0, 66.0, 79.0, 104.0, 99.0, 130.0, 102.0, 72.0, 78.0, 53.0, 27.0, 22.0, 15.0, 16.0, 6.0, 7.0, 4.0, 2.0, 1.0, 0.0, 3.0], "bins": [-2.181529998779297e-05, -2.1360814571380615e-05, -2.0906329154968262e-05, -2.0451843738555908e-05, -1.9997358322143555e-05, -1.95428729057312e-05, -1.9088387489318848e-05, -1.8633902072906494e-05, -1.817941665649414e-05, -1.7724931240081787e-05, -1.7270445823669434e-05, -1.681596040725708e-05, -1.6361474990844727e-05, -1.5906989574432373e-05, -1.545250415802002e-05, -1.4998018741607666e-05, -1.4543533325195312e-05, -1.4089047908782959e-05, -1.3634562492370605e-05, -1.3180077075958252e-05, -1.2725591659545898e-05, -1.2271106243133545e-05, -1.1816620826721191e-05, -1.1362135410308838e-05, -1.0907649993896484e-05, -1.0453164577484131e-05, -9.998679161071777e-06, -9.544193744659424e-06, -9.08970832824707e-06, -8.635222911834717e-06, -8.180737495422363e-06, -7.72625207901001e-06, -7.271766662597656e-06, -6.817281246185303e-06, -6.362795829772949e-06, -5.908310413360596e-06, -5.453824996948242e-06, -4.999339580535889e-06, -4.544854164123535e-06, -4.090368747711182e-06, -3.635883331298828e-06, -3.1813979148864746e-06, -2.726912498474121e-06, -2.2724270820617676e-06, -1.817941665649414e-06, -1.3634562492370605e-06, -9.08970832824707e-07, -4.544854164123535e-07, 0.0, 4.544854164123535e-07, 9.08970832824707e-07, 1.3634562492370605e-06, 1.817941665649414e-06, 2.2724270820617676e-06, 2.726912498474121e-06, 3.1813979148864746e-06, 3.635883331298828e-06, 4.090368747711182e-06, 4.544854164123535e-06, 4.999339580535889e-06, 5.453824996948242e-06, 5.908310413360596e-06, 6.362795829772949e-06, 6.817281246185303e-06, 7.271766662597656e-06]}, "gradients/encoder.encoder.layers.12.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 4.0, 2.0, 8.0, 3.0, 8.0, 13.0, 20.0, 14.0, 32.0, 78.0, 129.0, 254.0, 555.0, 1280.0, 3296.0, 10152.0, 40380.0, 232205.0, 591124.0, 131914.0, 25555.0, 7180.0, 2454.0, 1016.0, 413.0, 222.0, 96.0, 55.0, 28.0, 18.0, 15.0, 9.0, 6.0, 7.0, 4.0, 5.0, 1.0, 4.0, 2.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.043731689453125, -0.042290687561035156, -0.04084968566894531, -0.03940868377685547, -0.037967681884765625, -0.03652667999267578, -0.03508567810058594, -0.033644676208496094, -0.03220367431640625, -0.030762672424316406, -0.029321670532226562, -0.02788066864013672, -0.026439666748046875, -0.02499866485595703, -0.023557662963867188, -0.022116661071777344, -0.0206756591796875, -0.019234657287597656, -0.017793655395507812, -0.01635265350341797, -0.014911651611328125, -0.013470649719238281, -0.012029647827148438, -0.010588645935058594, -0.00914764404296875, -0.007706642150878906, -0.0062656402587890625, -0.004824638366699219, -0.003383636474609375, -0.0019426345825195312, -0.0005016326904296875, 0.0009393692016601562, 0.00238037109375, 0.0038213729858398438, 0.0052623748779296875, 0.006703376770019531, 0.008144378662109375, 0.009585380554199219, 0.011026382446289062, 0.012467384338378906, 0.01390838623046875, 0.015349388122558594, 0.016790390014648438, 0.01823139190673828, 0.019672393798828125, 0.02111339569091797, 0.022554397583007812, 0.023995399475097656, 0.0254364013671875, 0.026877403259277344, 0.028318405151367188, 0.02975940704345703, 0.031200408935546875, 0.03264141082763672, 0.03408241271972656, 0.035523414611816406, 0.03696441650390625, 0.038405418395996094, 0.03984642028808594, 0.04128742218017578, 0.042728424072265625, 0.04416942596435547, 0.04561042785644531, 0.047051429748535156, 0.048492431640625]}, "gradients/encoder.encoder.layers.12.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 2.0, 1.0, 1.0, 1.0, 1.0, 4.0, 5.0, 4.0, 6.0, 8.0, 8.0, 11.0, 15.0, 16.0, 26.0, 18.0, 24.0, 25.0, 42.0, 47.0, 55.0, 65.0, 60.0, 73.0, 71.0, 60.0, 61.0, 49.0, 36.0, 35.0, 47.0, 24.0, 29.0, 8.0, 15.0, 17.0, 7.0, 10.0, 7.0, 8.0, 2.0, 2.0, 1.0, 0.0, 3.0, 3.0, 0.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.029449462890625, -0.02852487564086914, -0.02760028839111328, -0.026675701141357422, -0.025751113891601562, -0.024826526641845703, -0.023901939392089844, -0.022977352142333984, -0.022052764892578125, -0.021128177642822266, -0.020203590393066406, -0.019279003143310547, -0.018354415893554688, -0.017429828643798828, -0.01650524139404297, -0.01558065414428711, -0.01465606689453125, -0.01373147964477539, -0.012806892395019531, -0.011882305145263672, -0.010957717895507812, -0.010033130645751953, -0.009108543395996094, -0.008183956146240234, -0.007259368896484375, -0.006334781646728516, -0.005410194396972656, -0.004485607147216797, -0.0035610198974609375, -0.002636432647705078, -0.0017118453979492188, -0.0007872581481933594, 0.0001373291015625, 0.0010619163513183594, 0.0019865036010742188, 0.002911090850830078, 0.0038356781005859375, 0.004760265350341797, 0.005684852600097656, 0.006609439849853516, 0.007534027099609375, 0.008458614349365234, 0.009383201599121094, 0.010307788848876953, 0.011232376098632812, 0.012156963348388672, 0.013081550598144531, 0.01400613784790039, 0.01493072509765625, 0.01585531234741211, 0.01677989959716797, 0.017704486846923828, 0.018629074096679688, 0.019553661346435547, 0.020478248596191406, 0.021402835845947266, 0.022327423095703125, 0.023252010345458984, 0.024176597595214844, 0.025101184844970703, 0.026025772094726562, 0.026950359344482422, 0.02787494659423828, 0.02879953384399414, 0.02972412109375]}, "gradients/encoder.encoder.layers.12.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 0.0, 1.0, 6.0, 3.0, 10.0, 29.0, 86.0, 199.0, 314.0, 242.0, 69.0, 33.0, 13.0, 5.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-2.5178558826446533, -2.4670417308807373, -2.416227340698242, -2.365413188934326, -2.31459903717041, -2.263784646987915, -2.212970495223999, -2.162156105041504, -2.111341953277588, -2.060527801513672, -2.0097134113311768, -1.9588992595672607, -1.9080849885940552, -1.8572707176208496, -1.8064565658569336, -1.755642294883728, -1.7048280239105225, -1.654013752937317, -1.6031994819641113, -1.5523853302001953, -1.5015710592269897, -1.4507567882537842, -1.3999426364898682, -1.3491283655166626, -1.298314094543457, -1.2474998235702515, -1.196685552597046, -1.1458714008331299, -1.0950571298599243, -1.0442428588867188, -0.993428647518158, -0.9426144361495972, -0.8918001651763916, -0.840985894203186, -0.7901716828346252, -0.7393574714660645, -0.6885432004928589, -0.6377289295196533, -0.5869147181510925, -0.5361005067825317, -0.48528623580932617, -0.434471994638443, -0.3836577534675598, -0.33284351229667664, -0.28202927112579346, -0.23121502995491028, -0.1804007887840271, -0.12958654761314392, -0.07877230644226074, -0.027958065271377563, 0.022856175899505615, 0.0736704170703888, 0.12448465824127197, 0.17529889941215515, 0.22611314058303833, 0.2769273817539215, 0.3277416229248047, 0.37855586409568787, 0.42937010526657104, 0.4801843464374542, 0.5309985876083374, 0.581812858581543, 0.6326270699501038, 0.6834412813186646, 0.7342555522918701]}, "gradients/encoder.encoder.layers.12.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 3.0, 1.0, 2.0, 4.0, 7.0, 12.0, 11.0, 14.0, 14.0, 8.0, 22.0, 22.0, 25.0, 24.0, 24.0, 20.0, 34.0, 22.0, 26.0, 28.0, 37.0, 31.0, 39.0, 33.0, 39.0, 46.0, 47.0, 30.0, 37.0, 34.0, 42.0, 26.0, 22.0, 32.0, 25.0, 19.0, 34.0, 11.0, 15.0, 13.0, 14.0, 12.0, 11.0, 6.0, 5.0, 10.0, 5.0, 4.0, 6.0, 0.0, 2.0, 1.0, 1.0, 0.0, 1.0, 2.0, 3.0], "bins": [-0.5584851503372192, -0.5410619378089905, -0.5236387848854065, -0.5062155723571777, -0.48879238963127136, -0.471369206905365, -0.45394599437713623, -0.43652281165122986, -0.4190996289253235, -0.4016764461994171, -0.38425326347351074, -0.366830050945282, -0.3494068682193756, -0.33198368549346924, -0.3145604729652405, -0.2971372902393341, -0.27971410751342773, -0.26229092478752136, -0.2448677271604538, -0.22744452953338623, -0.21002134680747986, -0.1925981640815735, -0.17517496645450592, -0.15775176882743835, -0.14032858610153198, -0.12290539592504501, -0.10548220574855804, -0.08805901557207108, -0.0706358253955841, -0.05321263521909714, -0.03578944504261017, -0.0183662548661232, -0.0009430646896362305, 0.01648012548685074, 0.03390331566333771, 0.051326505839824677, 0.06874969601631165, 0.08617288619279861, 0.10359607636928558, 0.12101926654577255, 0.13844245672225952, 0.1558656394481659, 0.17328883707523346, 0.19071203470230103, 0.2081352174282074, 0.22555840015411377, 0.24298159778118134, 0.2604047954082489, 0.2778279781341553, 0.29525116086006165, 0.312674343585968, 0.3300975561141968, 0.34752073884010315, 0.3649439215660095, 0.3823671340942383, 0.39979031682014465, 0.417213499546051, 0.4346366822719574, 0.45205986499786377, 0.46948307752609253, 0.4869062602519989, 0.5043294429779053, 0.521752655506134, 0.539175808429718, 0.5565990209579468]}, "gradients/encoder.encoder.layers.11.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 1.0, 1.0, 1.0, 8.0, 6.0, 10.0, 13.0, 21.0, 41.0, 80.0, 141.0, 409.0, 5845.0, 4186398.0, 897.0, 195.0, 96.0, 59.0, 31.0, 14.0, 5.0, 5.0, 6.0, 1.0, 3.0, 0.0, 1.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-2.4375, -2.3797454833984375, -2.321990966796875, -2.2642364501953125, -2.20648193359375, -2.1487274169921875, -2.090972900390625, -2.0332183837890625, -1.9754638671875, -1.9177093505859375, -1.859954833984375, -1.8022003173828125, -1.74444580078125, -1.6866912841796875, -1.628936767578125, -1.5711822509765625, -1.513427734375, -1.4556732177734375, -1.397918701171875, -1.3401641845703125, -1.28240966796875, -1.2246551513671875, -1.166900634765625, -1.1091461181640625, -1.0513916015625, -0.9936370849609375, -0.935882568359375, -0.8781280517578125, -0.82037353515625, -0.7626190185546875, -0.704864501953125, -0.6471099853515625, -0.58935546875, -0.5316009521484375, -0.473846435546875, -0.4160919189453125, -0.35833740234375, -0.3005828857421875, -0.242828369140625, -0.1850738525390625, -0.1273193359375, -0.0695648193359375, -0.011810302734375, 0.0459442138671875, 0.10369873046875, 0.1614532470703125, 0.219207763671875, 0.2769622802734375, 0.334716796875, 0.3924713134765625, 0.450225830078125, 0.5079803466796875, 0.56573486328125, 0.6234893798828125, 0.681243896484375, 0.7389984130859375, 0.7967529296875, 0.8545074462890625, 0.912261962890625, 0.9700164794921875, 1.02777099609375, 1.0855255126953125, 1.143280029296875, 1.2010345458984375, 1.2587890625]}, "gradients/encoder.encoder.layers.11.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0, 4.0, 6.0, 8.0, 15.0, 37.0, 43.0, 60.0, 94.0, 79.0, 96.0, 121.0, 110.0, 96.0, 75.0, 57.0, 28.0, 32.0, 18.0, 15.0, 6.0, 5.0, 1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 3.0], "bins": [-0.112060546875, -0.10947990417480469, -0.10689926147460938, -0.10431861877441406, -0.10173797607421875, -0.09915733337402344, -0.09657669067382812, -0.09399604797363281, -0.0914154052734375, -0.08883476257324219, -0.08625411987304688, -0.08367347717285156, -0.08109283447265625, -0.07851219177246094, -0.07593154907226562, -0.07335090637207031, -0.070770263671875, -0.06818962097167969, -0.06560897827148438, -0.06302833557128906, -0.06044769287109375, -0.05786705017089844, -0.055286407470703125, -0.05270576477050781, -0.0501251220703125, -0.04754447937011719, -0.044963836669921875, -0.04238319396972656, -0.03980255126953125, -0.03722190856933594, -0.034641265869140625, -0.03206062316894531, -0.02947998046875, -0.026899337768554688, -0.024318695068359375, -0.021738052368164062, -0.01915740966796875, -0.016576766967773438, -0.013996124267578125, -0.011415481567382812, -0.0088348388671875, -0.0062541961669921875, -0.003673553466796875, -0.0010929107666015625, 0.00148773193359375, 0.0040683746337890625, 0.006649017333984375, 0.009229660034179688, 0.011810302734375, 0.014390945434570312, 0.016971588134765625, 0.019552230834960938, 0.02213287353515625, 0.024713516235351562, 0.027294158935546875, 0.029874801635742188, 0.0324554443359375, 0.03503608703613281, 0.037616729736328125, 0.04019737243652344, 0.04277801513671875, 0.04535865783691406, 0.047939300537109375, 0.05051994323730469, 0.0531005859375]}, "gradients/encoder.encoder.layers.11.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 4.0, 0.0, 1.0, 2.0, 3.0, 4.0, 8.0, 7.0, 12.0, 17.0, 27.0, 25.0, 38.0, 54.0, 75.0, 110.0, 111.0, 183.0, 228.0, 373.0, 540.0, 858.0, 1472.0, 2728.0, 7561.0, 109521.0, 4051027.0, 11129.0, 3365.0, 1765.0, 1001.0, 592.0, 413.0, 306.0, 190.0, 138.0, 111.0, 66.0, 71.0, 56.0, 35.0, 25.0, 12.0, 5.0, 7.0, 5.0, 7.0, 6.0, 4.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.2113037109375, -0.204925537109375, -0.19854736328125, -0.192169189453125, -0.185791015625, -0.179412841796875, -0.17303466796875, -0.166656494140625, -0.1602783203125, -0.153900146484375, -0.14752197265625, -0.141143798828125, -0.134765625, -0.128387451171875, -0.12200927734375, -0.115631103515625, -0.1092529296875, -0.102874755859375, -0.09649658203125, -0.090118408203125, -0.083740234375, -0.077362060546875, -0.07098388671875, -0.064605712890625, -0.0582275390625, -0.051849365234375, -0.04547119140625, -0.039093017578125, -0.03271484375, -0.026336669921875, -0.01995849609375, -0.013580322265625, -0.0072021484375, -0.000823974609375, 0.00555419921875, 0.011932373046875, 0.018310546875, 0.024688720703125, 0.03106689453125, 0.037445068359375, 0.0438232421875, 0.050201416015625, 0.05657958984375, 0.062957763671875, 0.0693359375, 0.075714111328125, 0.08209228515625, 0.088470458984375, 0.0948486328125, 0.101226806640625, 0.10760498046875, 0.113983154296875, 0.120361328125, 0.126739501953125, 0.13311767578125, 0.139495849609375, 0.1458740234375, 0.152252197265625, 0.15863037109375, 0.165008544921875, 0.17138671875, 0.177764892578125, 0.18414306640625, 0.190521240234375, 0.1968994140625]}, "gradients/encoder.encoder.layers.11.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 4.0, 1.0, 4.0, 2.0, 6.0, 5.0, 16.0, 36.0, 423.0, 3516.0, 40.0, 19.0, 5.0, 3.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0447998046875, -0.04330873489379883, -0.041817665100097656, -0.040326595306396484, -0.03883552551269531, -0.03734445571899414, -0.03585338592529297, -0.0343623161315918, -0.032871246337890625, -0.03138017654418945, -0.02988910675048828, -0.02839803695678711, -0.026906967163085938, -0.025415897369384766, -0.023924827575683594, -0.022433757781982422, -0.02094268798828125, -0.019451618194580078, -0.017960548400878906, -0.016469478607177734, -0.014978408813476562, -0.01348733901977539, -0.011996269226074219, -0.010505199432373047, -0.009014129638671875, -0.007523059844970703, -0.006031990051269531, -0.004540920257568359, -0.0030498504638671875, -0.0015587806701660156, -6.771087646484375e-05, 0.0014233589172363281, 0.0029144287109375, 0.004405498504638672, 0.005896568298339844, 0.007387638092041016, 0.008878707885742188, 0.01036977767944336, 0.011860847473144531, 0.013351917266845703, 0.014842987060546875, 0.016334056854248047, 0.01782512664794922, 0.01931619644165039, 0.020807266235351562, 0.022298336029052734, 0.023789405822753906, 0.025280475616455078, 0.02677154541015625, 0.028262615203857422, 0.029753684997558594, 0.031244754791259766, 0.03273582458496094, 0.03422689437866211, 0.03571796417236328, 0.03720903396606445, 0.038700103759765625, 0.0401911735534668, 0.04168224334716797, 0.04317331314086914, 0.04466438293457031, 0.046155452728271484, 0.047646522521972656, 0.04913759231567383, 0.050628662109375]}, "gradients/encoder.encoder.layers.11.final_layer_norm.weight": {"_type": "histogram", "values": [4.0, 3.0, 5.0, 10.0, 27.0, 119.0, 414.0, 311.0, 91.0, 27.0, 7.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.08491823822259903, -0.07251914590597153, -0.060120053589344025, -0.04772096499800682, -0.03532187268137932, -0.022922780364751816, -0.010523691773414612, 0.0018754005432128906, 0.014274492859840393, 0.026673585176467896, 0.0390726774930954, 0.0514717660844326, 0.0638708621263504, 0.0762699544429779, 0.08866903930902481, 0.10106813162565231, 0.11346722394227982, 0.12586630880832672, 0.13826540112495422, 0.15066449344158173, 0.16306358575820923, 0.17546267807483673, 0.18786177039146423, 0.20026086270809174, 0.21265995502471924, 0.22505904734134674, 0.23745813965797424, 0.24985723197460175, 0.26225632429122925, 0.27465540170669556, 0.28705450892448425, 0.29945358633995056, 0.31185266375541687, 0.3242517411708832, 0.3366508483886719, 0.3490499258041382, 0.3614490330219269, 0.3738481104373932, 0.3862472176551819, 0.3986462950706482, 0.4110454022884369, 0.4234444797039032, 0.4358435869216919, 0.4482426643371582, 0.4606417715549469, 0.4730408489704132, 0.4854399561882019, 0.4978390336036682, 0.5102381110191345, 0.5226371884346008, 0.5350362658500671, 0.5474354028701782, 0.5598344802856445, 0.5722335577011108, 0.5846326351165771, 0.5970317721366882, 0.6094308495521545, 0.6218299269676208, 0.6342290043830872, 0.6466281414031982, 0.6590272188186646, 0.6714262962341309, 0.6838253736495972, 0.6962245106697083, 0.7086235880851746]}, "gradients/encoder.encoder.layers.11.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 4.0, 1.0, 8.0, 4.0, 10.0, 12.0, 29.0, 14.0, 29.0, 29.0, 34.0, 46.0, 40.0, 50.0, 39.0, 50.0, 63.0, 70.0, 54.0, 66.0, 56.0, 46.0, 52.0, 30.0, 36.0, 26.0, 22.0, 19.0, 20.0, 11.0, 12.0, 4.0, 5.0, 7.0, 4.0, 5.0, 3.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.06170976161956787, -0.05943150818347931, -0.05715325102210045, -0.05487499386072159, -0.052596740424633026, -0.050318486988544464, -0.048040229827165604, -0.04576197266578674, -0.04348371922969818, -0.04120546579360962, -0.03892720863223076, -0.0366489514708519, -0.034370698034763336, -0.032092444598674774, -0.029814187437295914, -0.027535932138562202, -0.02525767683982849, -0.02297942154109478, -0.02070116624236107, -0.018422910943627357, -0.016144655644893646, -0.013866400346159935, -0.011588145047426224, -0.009309889748692513, -0.007031634449958801, -0.00475337915122509, -0.002475123852491379, -0.00019686855375766754, 0.0020813867449760437, 0.004359642043709755, 0.006637897342443466, 0.008916152641177177, 0.011194407939910889, 0.0134726632386446, 0.01575091853737831, 0.018029173836112022, 0.020307429134845734, 0.022585684433579445, 0.024863939732313156, 0.027142195031046867, 0.02942045032978058, 0.03169870376586914, 0.033976960927248, 0.03625521808862686, 0.038533471524715424, 0.040811724960803986, 0.043089982122182846, 0.04536823928356171, 0.04764649271965027, 0.04992474615573883, 0.05220300331711769, 0.05448126047849655, 0.056759513914585114, 0.059037767350673676, 0.061316024512052536, 0.0635942816734314, 0.06587253510951996, 0.06815078854560852, 0.07042904198169708, 0.07270730286836624, 0.0749855563044548, 0.07726380974054337, 0.07954207062721252, 0.08182032406330109, 0.08409857749938965]}, "gradients/encoder.encoder.layers.11.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 4.0, 8.0, 4.0, 5.0, 12.0, 20.0, 19.0, 37.0, 53.0, 63.0, 127.0, 171.0, 276.0, 509.0, 1165.0, 2945.0, 12195.0, 81837.0, 639029.0, 270624.0, 30210.0, 5761.0, 1750.0, 763.0, 331.0, 208.0, 120.0, 91.0, 55.0, 51.0, 31.0, 17.0, 22.0, 14.0, 10.0, 8.0, 3.0, 3.0, 4.0, 3.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1600341796875, -0.15465545654296875, -0.1492767333984375, -0.14389801025390625, -0.138519287109375, -0.13314056396484375, -0.1277618408203125, -0.12238311767578125, -0.11700439453125, -0.11162567138671875, -0.1062469482421875, -0.10086822509765625, -0.095489501953125, -0.09011077880859375, -0.0847320556640625, -0.07935333251953125, -0.073974609375, -0.06859588623046875, -0.0632171630859375, -0.05783843994140625, -0.052459716796875, -0.04708099365234375, -0.0417022705078125, -0.03632354736328125, -0.03094482421875, -0.02556610107421875, -0.0201873779296875, -0.01480865478515625, -0.009429931640625, -0.00405120849609375, 0.0013275146484375, 0.00670623779296875, 0.0120849609375, 0.01746368408203125, 0.0228424072265625, 0.02822113037109375, 0.033599853515625, 0.03897857666015625, 0.0443572998046875, 0.04973602294921875, 0.05511474609375, 0.06049346923828125, 0.0658721923828125, 0.07125091552734375, 0.076629638671875, 0.08200836181640625, 0.0873870849609375, 0.09276580810546875, 0.09814453125, 0.10352325439453125, 0.1089019775390625, 0.11428070068359375, 0.119659423828125, 0.12503814697265625, 0.1304168701171875, 0.13579559326171875, 0.14117431640625, 0.14655303955078125, 0.1519317626953125, 0.15731048583984375, 0.162689208984375, 0.16806793212890625, 0.1734466552734375, 0.17882537841796875, 0.1842041015625]}, "gradients/encoder.encoder.layers.11.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 3.0, 4.0, 7.0, 12.0, 19.0, 40.0, 45.0, 63.0, 87.0, 79.0, 107.0, 114.0, 95.0, 92.0, 74.0, 53.0, 38.0, 22.0, 24.0, 11.0, 12.0, 6.0, 2.0, 0.0, 2.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 2.0], "bins": [-0.1131591796875, -0.1105494499206543, -0.1079397201538086, -0.10532999038696289, -0.10272026062011719, -0.10011053085327148, -0.09750080108642578, -0.09489107131958008, -0.09228134155273438, -0.08967161178588867, -0.08706188201904297, -0.08445215225219727, -0.08184242248535156, -0.07923269271850586, -0.07662296295166016, -0.07401323318481445, -0.07140350341796875, -0.06879377365112305, -0.06618404388427734, -0.06357431411743164, -0.06096458435058594, -0.058354854583740234, -0.05574512481689453, -0.05313539505004883, -0.050525665283203125, -0.04791593551635742, -0.04530620574951172, -0.042696475982666016, -0.04008674621582031, -0.03747701644897461, -0.034867286682128906, -0.0322575569152832, -0.0296478271484375, -0.027038097381591797, -0.024428367614746094, -0.02181863784790039, -0.019208908081054688, -0.016599178314208984, -0.013989448547363281, -0.011379718780517578, -0.008769989013671875, -0.006160259246826172, -0.0035505294799804688, -0.0009407997131347656, 0.0016689300537109375, 0.004278659820556641, 0.006888389587402344, 0.009498119354248047, 0.01210784912109375, 0.014717578887939453, 0.017327308654785156, 0.01993703842163086, 0.022546768188476562, 0.025156497955322266, 0.02776622772216797, 0.030375957489013672, 0.032985687255859375, 0.03559541702270508, 0.03820514678955078, 0.040814876556396484, 0.04342460632324219, 0.04603433609008789, 0.048644065856933594, 0.0512537956237793, 0.053863525390625]}, "gradients/encoder.encoder.layers.11.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 4.0, 10.0, 10.0, 16.0, 15.0, 25.0, 36.0, 45.0, 80.0, 113.0, 173.0, 347.0, 746.0, 1762.0, 5259.0, 22296.0, 190926.0, 722597.0, 84627.0, 13382.0, 3593.0, 1283.0, 540.0, 252.0, 142.0, 93.0, 65.0, 34.0, 18.0, 15.0, 15.0, 11.0, 8.0, 10.0, 4.0, 1.0, 4.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.149169921875, -0.14469528198242188, -0.14022064208984375, -0.13574600219726562, -0.1312713623046875, -0.12679672241210938, -0.12232208251953125, -0.11784744262695312, -0.113372802734375, -0.10889816284179688, -0.10442352294921875, -0.09994888305664062, -0.0954742431640625, -0.09099960327148438, -0.08652496337890625, -0.08205032348632812, -0.07757568359375, -0.07310104370117188, -0.06862640380859375, -0.06415176391601562, -0.0596771240234375, -0.055202484130859375, -0.05072784423828125, -0.046253204345703125, -0.041778564453125, -0.037303924560546875, -0.03282928466796875, -0.028354644775390625, -0.0238800048828125, -0.019405364990234375, -0.01493072509765625, -0.010456085205078125, -0.0059814453125, -0.001506805419921875, 0.00296783447265625, 0.007442474365234375, 0.0119171142578125, 0.016391754150390625, 0.02086639404296875, 0.025341033935546875, 0.029815673828125, 0.034290313720703125, 0.03876495361328125, 0.043239593505859375, 0.0477142333984375, 0.052188873291015625, 0.05666351318359375, 0.061138153076171875, 0.06561279296875, 0.07008743286132812, 0.07456207275390625, 0.07903671264648438, 0.0835113525390625, 0.08798599243164062, 0.09246063232421875, 0.09693527221679688, 0.101409912109375, 0.10588455200195312, 0.11035919189453125, 0.11483383178710938, 0.1193084716796875, 0.12378311157226562, 0.12825775146484375, 0.13273239135742188, 0.13720703125]}, "gradients/encoder.encoder.layers.11.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 3.0, 4.0, 6.0, 0.0, 0.0, 6.0, 10.0, 14.0, 10.0, 11.0, 28.0, 22.0, 20.0, 24.0, 38.0, 41.0, 29.0, 46.0, 48.0, 36.0, 53.0, 56.0, 46.0, 47.0, 46.0, 47.0, 47.0, 42.0, 37.0, 28.0, 23.0, 28.0, 22.0, 20.0, 16.0, 19.0, 7.0, 10.0, 5.0, 5.0, 4.0, 4.0, 2.0, 1.0, 0.0, 0.0, 3.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.1502685546875, -0.1456775665283203, -0.14108657836914062, -0.13649559020996094, -0.13190460205078125, -0.12731361389160156, -0.12272262573242188, -0.11813163757324219, -0.1135406494140625, -0.10894966125488281, -0.10435867309570312, -0.09976768493652344, -0.09517669677734375, -0.09058570861816406, -0.08599472045898438, -0.08140373229980469, -0.076812744140625, -0.07222175598144531, -0.06763076782226562, -0.06303977966308594, -0.05844879150390625, -0.05385780334472656, -0.049266815185546875, -0.04467582702636719, -0.0400848388671875, -0.03549385070800781, -0.030902862548828125, -0.026311874389648438, -0.02172088623046875, -0.017129898071289062, -0.012538909912109375, -0.007947921752929688, -0.00335693359375, 0.0012340545654296875, 0.005825042724609375, 0.010416030883789062, 0.01500701904296875, 0.019598007202148438, 0.024188995361328125, 0.028779983520507812, 0.0333709716796875, 0.03796195983886719, 0.042552947998046875, 0.04714393615722656, 0.05173492431640625, 0.05632591247558594, 0.060916900634765625, 0.06550788879394531, 0.070098876953125, 0.07468986511230469, 0.07928085327148438, 0.08387184143066406, 0.08846282958984375, 0.09305381774902344, 0.09764480590820312, 0.10223579406738281, 0.1068267822265625, 0.11141777038574219, 0.11600875854492188, 0.12059974670410156, 0.12519073486328125, 0.12978172302246094, 0.13437271118164062, 0.1389636993408203, 0.1435546875]}, "gradients/encoder.encoder.layers.11.attention.k_proj.weight": {"_type": "histogram", "values": [3.0, 4.0, 2.0, 5.0, 6.0, 7.0, 9.0, 6.0, 4.0, 13.0, 19.0, 23.0, 47.0, 66.0, 93.0, 153.0, 292.0, 580.0, 1235.0, 3421.0, 10750.0, 41114.0, 181069.0, 530903.0, 212253.0, 47675.0, 12152.0, 3865.0, 1398.0, 611.0, 318.0, 175.0, 96.0, 57.0, 38.0, 24.0, 19.0, 17.0, 7.0, 13.0, 10.0, 4.0, 5.0, 2.0, 7.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0195465087890625, -0.01871657371520996, -0.017886638641357422, -0.017056703567504883, -0.016226768493652344, -0.015396833419799805, -0.014566898345947266, -0.013736963272094727, -0.012907028198242188, -0.012077093124389648, -0.01124715805053711, -0.01041722297668457, -0.009587287902832031, -0.008757352828979492, -0.007927417755126953, -0.007097482681274414, -0.006267547607421875, -0.005437612533569336, -0.004607677459716797, -0.003777742385864258, -0.0029478073120117188, -0.0021178722381591797, -0.0012879371643066406, -0.00045800209045410156, 0.0003719329833984375, 0.0012018680572509766, 0.0020318031311035156, 0.0028617382049560547, 0.0036916732788085938, 0.004521608352661133, 0.005351543426513672, 0.006181478500366211, 0.00701141357421875, 0.007841348648071289, 0.008671283721923828, 0.009501218795776367, 0.010331153869628906, 0.011161088943481445, 0.011991024017333984, 0.012820959091186523, 0.013650894165039062, 0.014480829238891602, 0.01531076431274414, 0.01614069938659668, 0.01697063446044922, 0.017800569534301758, 0.018630504608154297, 0.019460439682006836, 0.020290374755859375, 0.021120309829711914, 0.021950244903564453, 0.022780179977416992, 0.02361011505126953, 0.02444005012512207, 0.02526998519897461, 0.02609992027282715, 0.026929855346679688, 0.027759790420532227, 0.028589725494384766, 0.029419660568237305, 0.030249595642089844, 0.031079530715942383, 0.03190946578979492, 0.03273940086364746, 0.0335693359375]}, "gradients/encoder.encoder.layers.11.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 3.0, 2.0, 3.0, 6.0, 6.0, 4.0, 16.0, 15.0, 15.0, 21.0, 23.0, 20.0, 42.0, 45.0, 41.0, 40.0, 57.0, 61.0, 64.0, 69.0, 64.0, 65.0, 36.0, 57.0, 54.0, 42.0, 26.0, 25.0, 23.0, 14.0, 12.0, 16.0, 4.0, 4.0, 3.0, 3.0, 2.0, 3.0, 3.0, 1.0, 1.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-7.450580596923828e-06, -7.220543920993805e-06, -6.990507245063782e-06, -6.7604705691337585e-06, -6.530433893203735e-06, -6.300397217273712e-06, -6.070360541343689e-06, -5.840323865413666e-06, -5.610287189483643e-06, -5.380250513553619e-06, -5.150213837623596e-06, -4.920177161693573e-06, -4.69014048576355e-06, -4.460103809833527e-06, -4.230067133903503e-06, -4.00003045797348e-06, -3.769993782043457e-06, -3.539957106113434e-06, -3.3099204301834106e-06, -3.0798837542533875e-06, -2.8498470783233643e-06, -2.619810402393341e-06, -2.389773726463318e-06, -2.1597370505332947e-06, -1.9297003746032715e-06, -1.6996636986732483e-06, -1.469627022743225e-06, -1.239590346813202e-06, -1.0095536708831787e-06, -7.795169949531555e-07, -5.494803190231323e-07, -3.1944364309310913e-07, -8.940696716308594e-08, 1.4062970876693726e-07, 3.7066638469696045e-07, 6.007030606269836e-07, 8.307397365570068e-07, 1.06077641248703e-06, 1.2908130884170532e-06, 1.5208497643470764e-06, 1.7508864402770996e-06, 1.980923116207123e-06, 2.210959792137146e-06, 2.440996468067169e-06, 2.6710331439971924e-06, 2.9010698199272156e-06, 3.1311064958572388e-06, 3.361143171787262e-06, 3.591179847717285e-06, 3.821216523647308e-06, 4.0512531995773315e-06, 4.281289875507355e-06, 4.511326551437378e-06, 4.741363227367401e-06, 4.971399903297424e-06, 5.2014365792274475e-06, 5.431473255157471e-06, 5.661509931087494e-06, 5.891546607017517e-06, 6.12158328294754e-06, 6.3516199588775635e-06, 6.581656634807587e-06, 6.81169331073761e-06, 7.041729986667633e-06, 7.271766662597656e-06]}, "gradients/encoder.encoder.layers.11.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 3.0, 1.0, 2.0, 7.0, 6.0, 4.0, 7.0, 11.0, 14.0, 17.0, 26.0, 42.0, 63.0, 69.0, 121.0, 209.0, 388.0, 724.0, 1430.0, 3258.0, 8585.0, 27070.0, 105453.0, 393850.0, 369518.0, 98123.0, 25580.0, 7872.0, 2996.0, 1448.0, 673.0, 374.0, 225.0, 129.0, 93.0, 49.0, 38.0, 32.0, 9.0, 11.0, 8.0, 5.0, 5.0, 4.0, 3.0, 5.0, 2.0, 3.0, 1.0, 2.0, 0.0, 1.0, 3.0], "bins": [-0.02996826171875, -0.029111385345458984, -0.02825450897216797, -0.027397632598876953, -0.026540756225585938, -0.025683879852294922, -0.024827003479003906, -0.02397012710571289, -0.023113250732421875, -0.02225637435913086, -0.021399497985839844, -0.020542621612548828, -0.019685745239257812, -0.018828868865966797, -0.01797199249267578, -0.017115116119384766, -0.01625823974609375, -0.015401363372802734, -0.014544486999511719, -0.013687610626220703, -0.012830734252929688, -0.011973857879638672, -0.011116981506347656, -0.01026010513305664, -0.009403228759765625, -0.00854635238647461, -0.007689476013183594, -0.006832599639892578, -0.0059757232666015625, -0.005118846893310547, -0.004261970520019531, -0.0034050941467285156, -0.0025482177734375, -0.0016913414001464844, -0.0008344650268554688, 2.2411346435546875e-05, 0.0008792877197265625, 0.0017361640930175781, 0.0025930404663085938, 0.0034499168395996094, 0.004306793212890625, 0.005163669586181641, 0.006020545959472656, 0.006877422332763672, 0.0077342987060546875, 0.008591175079345703, 0.009448051452636719, 0.010304927825927734, 0.01116180419921875, 0.012018680572509766, 0.012875556945800781, 0.013732433319091797, 0.014589309692382812, 0.015446186065673828, 0.016303062438964844, 0.01715993881225586, 0.018016815185546875, 0.01887369155883789, 0.019730567932128906, 0.020587444305419922, 0.021444320678710938, 0.022301197052001953, 0.02315807342529297, 0.024014949798583984, 0.024871826171875]}, "gradients/encoder.encoder.layers.11.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 2.0, 0.0, 1.0, 4.0, 3.0, 4.0, 5.0, 8.0, 11.0, 15.0, 17.0, 23.0, 30.0, 24.0, 38.0, 51.0, 57.0, 70.0, 58.0, 75.0, 70.0, 73.0, 74.0, 58.0, 48.0, 45.0, 35.0, 27.0, 19.0, 29.0, 11.0, 10.0, 5.0, 4.0, 1.0, 3.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.03564453125, -0.034726619720458984, -0.03380870819091797, -0.03289079666137695, -0.03197288513183594, -0.031054973602294922, -0.030137062072753906, -0.02921915054321289, -0.028301239013671875, -0.02738332748413086, -0.026465415954589844, -0.025547504425048828, -0.024629592895507812, -0.023711681365966797, -0.02279376983642578, -0.021875858306884766, -0.02095794677734375, -0.020040035247802734, -0.01912212371826172, -0.018204212188720703, -0.017286300659179688, -0.016368389129638672, -0.015450477600097656, -0.01453256607055664, -0.013614654541015625, -0.01269674301147461, -0.011778831481933594, -0.010860919952392578, -0.009943008422851562, -0.009025096893310547, -0.008107185363769531, -0.007189273834228516, -0.0062713623046875, -0.005353450775146484, -0.004435539245605469, -0.003517627716064453, -0.0025997161865234375, -0.0016818046569824219, -0.0007638931274414062, 0.00015401840209960938, 0.001071929931640625, 0.0019898414611816406, 0.0029077529907226562, 0.003825664520263672, 0.0047435760498046875, 0.005661487579345703, 0.006579399108886719, 0.007497310638427734, 0.00841522216796875, 0.009333133697509766, 0.010251045227050781, 0.011168956756591797, 0.012086868286132812, 0.013004779815673828, 0.013922691345214844, 0.01484060287475586, 0.015758514404296875, 0.01667642593383789, 0.017594337463378906, 0.018512248992919922, 0.019430160522460938, 0.020348072052001953, 0.02126598358154297, 0.022183895111083984, 0.023101806640625]}, "gradients/encoder.encoder.layers.11.layer_norm.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 6.0, 3.0, 6.0, 6.0, 12.0, 19.0, 25.0, 31.0, 49.0, 69.0, 77.0, 122.0, 127.0, 124.0, 91.0, 71.0, 52.0, 41.0, 21.0, 20.0, 16.0, 7.0, 6.0, 6.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.35948291420936584, -0.34035542607307434, -0.3212279677391052, -0.3021004796028137, -0.2829730212688446, -0.2638455331325531, -0.2447180598974228, -0.22559058666229248, -0.20646311342716217, -0.18733564019203186, -0.16820816695690155, -0.14908069372177124, -0.12995320558547974, -0.11082573980093002, -0.09169825911521912, -0.0725707858800888, -0.053443312644958496, -0.034315839409828186, -0.015188362449407578, 0.003939114511013031, 0.02306658774614334, 0.04219406098127365, 0.06132154166698456, 0.08044901490211487, 0.09957648813724518, 0.11870396137237549, 0.1378314346075058, 0.1569589078426361, 0.1760863959789276, 0.19521385431289673, 0.21434134244918823, 0.23346881568431854, 0.25259625911712646, 0.27172374725341797, 0.2908512055873871, 0.3099786937236786, 0.3291061520576477, 0.3482336401939392, 0.3673611283302307, 0.38648858666419983, 0.40561604499816895, 0.42474353313446045, 0.44387099146842957, 0.46299847960472107, 0.4821259379386902, 0.5012534260749817, 0.5203809142112732, 0.5395083427429199, 0.5586358308792114, 0.5777633190155029, 0.5968908071517944, 0.6160182356834412, 0.6351457238197327, 0.6542732119560242, 0.6734007000923157, 0.6925281286239624, 0.7116556763648987, 0.7307831645011902, 0.7499106526374817, 0.7690380811691284, 0.7881655693054199, 0.8072930574417114, 0.8264205455780029, 0.8455480337142944, 0.8646754622459412]}, "gradients/encoder.encoder.layers.11.layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 4.0, 1.0, 5.0, 4.0, 4.0, 9.0, 5.0, 10.0, 9.0, 14.0, 17.0, 18.0, 21.0, 24.0, 31.0, 31.0, 26.0, 38.0, 29.0, 41.0, 59.0, 44.0, 47.0, 61.0, 43.0, 48.0, 43.0, 37.0, 32.0, 38.0, 37.0, 19.0, 25.0, 21.0, 18.0, 15.0, 17.0, 9.0, 7.0, 11.0, 8.0, 8.0, 4.0, 11.0, 1.0, 3.0, 3.0, 4.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-0.7132899761199951, -0.6924432516098022, -0.6715965867042542, -0.6507498621940613, -0.6299031972885132, -0.6090564727783203, -0.5882097482681274, -0.5673630237579346, -0.5465163588523865, -0.5256696343421936, -0.5048229694366455, -0.48397624492645264, -0.46312955021858215, -0.44228285551071167, -0.4214361310005188, -0.4005894362926483, -0.37974274158477783, -0.35889604687690735, -0.33804935216903687, -0.317202627658844, -0.2963559329509735, -0.275509238243103, -0.25466251373291016, -0.23381581902503967, -0.2129691243171692, -0.1921224296092987, -0.17127572000026703, -0.15042901039123535, -0.12958231568336487, -0.10873561352491379, -0.08788891136646271, -0.06704220175743103, -0.04619550704956055, -0.025348804891109467, -0.004502102732658386, 0.016344599425792694, 0.037191301584243774, 0.058038003742694855, 0.07888470590114594, 0.09973141551017761, 0.1205781102180481, 0.14142480492591858, 0.16227151453495026, 0.18311822414398193, 0.20396491885185242, 0.2248116135597229, 0.24565832316875458, 0.26650503277778625, 0.28735172748565674, 0.3081984221935272, 0.3290451169013977, 0.3498918414115906, 0.37073853611946106, 0.39158523082733154, 0.4124319553375244, 0.4332786500453949, 0.4541253447532654, 0.47497203946113586, 0.49581873416900635, 0.5166654586791992, 0.5375121831893921, 0.5583588480949402, 0.5792055726051331, 0.6000522375106812, 0.620898962020874]}, "gradients/encoder.encoder.layers.10.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 2.0, 0.0, 1.0, 2.0, 2.0, 2.0, 4.0, 1.0, 1.0, 1.0, 4.0, 1.0, 5.0, 4.0, 8.0, 13.0, 27.0, 38.0, 78.0, 136.0, 321.0, 730.0, 2346.0, 4166674.0, 21309.0, 1489.0, 550.0, 266.0, 103.0, 61.0, 38.0, 20.0, 15.0, 15.0, 8.0, 6.0, 3.0, 3.0, 2.0, 2.0, 4.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.72607421875, -0.7091255187988281, -0.6921768188476562, -0.6752281188964844, -0.6582794189453125, -0.6413307189941406, -0.6243820190429688, -0.6074333190917969, -0.590484619140625, -0.5735359191894531, -0.5565872192382812, -0.5396385192871094, -0.5226898193359375, -0.5057411193847656, -0.48879241943359375, -0.4718437194824219, -0.45489501953125, -0.4379463195800781, -0.42099761962890625, -0.4040489196777344, -0.3871002197265625, -0.3701515197753906, -0.35320281982421875, -0.3362541198730469, -0.319305419921875, -0.3023567199707031, -0.28540802001953125, -0.2684593200683594, -0.2515106201171875, -0.23456192016601562, -0.21761322021484375, -0.20066452026367188, -0.1837158203125, -0.16676712036132812, -0.14981842041015625, -0.13286972045898438, -0.1159210205078125, -0.09897232055664062, -0.08202362060546875, -0.06507492065429688, -0.048126220703125, -0.031177520751953125, -0.01422882080078125, 0.002719879150390625, 0.0196685791015625, 0.036617279052734375, 0.05356597900390625, 0.07051467895507812, 0.08746337890625, 0.10441207885742188, 0.12136077880859375, 0.13830947875976562, 0.1552581787109375, 0.17220687866210938, 0.18915557861328125, 0.20610427856445312, 0.223052978515625, 0.24000167846679688, 0.25695037841796875, 0.2738990783691406, 0.2908477783203125, 0.3077964782714844, 0.32474517822265625, 0.3416938781738281, 0.358642578125]}, "gradients/encoder.encoder.layers.10.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 3.0, 6.0, 9.0, 11.0, 21.0, 33.0, 42.0, 51.0, 90.0, 106.0, 98.0, 107.0, 109.0, 85.0, 77.0, 58.0, 34.0, 22.0, 16.0, 11.0, 8.0, 9.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.11248779296875, -0.10989141464233398, -0.10729503631591797, -0.10469865798950195, -0.10210227966308594, -0.09950590133666992, -0.0969095230102539, -0.09431314468383789, -0.09171676635742188, -0.08912038803100586, -0.08652400970458984, -0.08392763137817383, -0.08133125305175781, -0.0787348747253418, -0.07613849639892578, -0.07354211807250977, -0.07094573974609375, -0.06834936141967773, -0.06575298309326172, -0.0631566047668457, -0.06056022644042969, -0.05796384811401367, -0.055367469787597656, -0.05277109146118164, -0.050174713134765625, -0.04757833480834961, -0.044981956481933594, -0.04238557815551758, -0.03978919982910156, -0.03719282150268555, -0.03459644317626953, -0.032000064849853516, -0.0294036865234375, -0.026807308197021484, -0.02421092987060547, -0.021614551544189453, -0.019018173217773438, -0.016421794891357422, -0.013825416564941406, -0.01122903823852539, -0.008632659912109375, -0.006036281585693359, -0.0034399032592773438, -0.0008435249328613281, 0.0017528533935546875, 0.004349231719970703, 0.006945610046386719, 0.009541988372802734, 0.01213836669921875, 0.014734745025634766, 0.01733112335205078, 0.019927501678466797, 0.022523880004882812, 0.025120258331298828, 0.027716636657714844, 0.03031301498413086, 0.032909393310546875, 0.03550577163696289, 0.038102149963378906, 0.04069852828979492, 0.04329490661621094, 0.04589128494262695, 0.04848766326904297, 0.051084041595458984, 0.053680419921875]}, "gradients/encoder.encoder.layers.10.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 3.0, 5.0, 9.0, 2.0, 16.0, 19.0, 21.0, 24.0, 37.0, 63.0, 63.0, 111.0, 147.0, 179.0, 277.0, 339.0, 535.0, 732.0, 1195.0, 1899.0, 3454.0, 7707.0, 33345.0, 4052367.0, 70308.0, 10803.0, 4243.0, 2206.0, 1301.0, 811.0, 583.0, 418.0, 277.0, 217.0, 170.0, 91.0, 83.0, 64.0, 49.0, 19.0, 32.0, 16.0, 17.0, 11.0, 10.0, 2.0, 4.0, 5.0, 2.0, 5.0, 2.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.098876953125, -0.09584808349609375, -0.0928192138671875, -0.08979034423828125, -0.086761474609375, -0.08373260498046875, -0.0807037353515625, -0.07767486572265625, -0.07464599609375, -0.07161712646484375, -0.0685882568359375, -0.06555938720703125, -0.062530517578125, -0.05950164794921875, -0.0564727783203125, -0.05344390869140625, -0.0504150390625, -0.04738616943359375, -0.0443572998046875, -0.04132843017578125, -0.038299560546875, -0.03527069091796875, -0.0322418212890625, -0.02921295166015625, -0.02618408203125, -0.02315521240234375, -0.0201263427734375, -0.01709747314453125, -0.014068603515625, -0.01103973388671875, -0.0080108642578125, -0.00498199462890625, -0.001953125, 0.00107574462890625, 0.0041046142578125, 0.00713348388671875, 0.010162353515625, 0.01319122314453125, 0.0162200927734375, 0.01924896240234375, 0.02227783203125, 0.02530670166015625, 0.0283355712890625, 0.03136444091796875, 0.034393310546875, 0.03742218017578125, 0.0404510498046875, 0.04347991943359375, 0.0465087890625, 0.04953765869140625, 0.0525665283203125, 0.05559539794921875, 0.058624267578125, 0.06165313720703125, 0.0646820068359375, 0.06771087646484375, 0.07073974609375, 0.07376861572265625, 0.0767974853515625, 0.07982635498046875, 0.082855224609375, 0.08588409423828125, 0.0889129638671875, 0.09194183349609375, 0.094970703125]}, "gradients/encoder.encoder.layers.10.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 0.0, 0.0, 2.0, 0.0, 2.0, 0.0, 1.0, 2.0, 6.0, 6.0, 8.0, 13.0, 29.0, 47.0, 135.0, 3549.0, 182.0, 36.0, 22.0, 8.0, 7.0, 5.0, 2.0, 7.0, 2.0, 1.0, 4.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 0.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.0277099609375, -0.026972055435180664, -0.026234149932861328, -0.025496244430541992, -0.024758338928222656, -0.02402043342590332, -0.023282527923583984, -0.02254462242126465, -0.021806716918945312, -0.021068811416625977, -0.02033090591430664, -0.019593000411987305, -0.01885509490966797, -0.018117189407348633, -0.017379283905029297, -0.01664137840270996, -0.015903472900390625, -0.015165567398071289, -0.014427661895751953, -0.013689756393432617, -0.012951850891113281, -0.012213945388793945, -0.01147603988647461, -0.010738134384155273, -0.010000228881835938, -0.009262323379516602, -0.008524417877197266, -0.00778651237487793, -0.007048606872558594, -0.006310701370239258, -0.005572795867919922, -0.004834890365600586, -0.00409698486328125, -0.003359079360961914, -0.002621173858642578, -0.0018832683563232422, -0.0011453628540039062, -0.0004074573516845703, 0.0003304481506347656, 0.0010683536529541016, 0.0018062591552734375, 0.0025441646575927734, 0.0032820701599121094, 0.004019975662231445, 0.004757881164550781, 0.005495786666870117, 0.006233692169189453, 0.006971597671508789, 0.007709503173828125, 0.008447408676147461, 0.009185314178466797, 0.009923219680786133, 0.010661125183105469, 0.011399030685424805, 0.01213693618774414, 0.012874841690063477, 0.013612747192382812, 0.014350652694702148, 0.015088558197021484, 0.01582646369934082, 0.016564369201660156, 0.017302274703979492, 0.018040180206298828, 0.018778085708618164, 0.0195159912109375]}, "gradients/encoder.encoder.layers.10.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 3.0, 1.0, 2.0, 0.0, 0.0, 3.0, 5.0, 9.0, 6.0, 22.0, 21.0, 36.0, 57.0, 93.0, 132.0, 163.0, 158.0, 115.0, 81.0, 39.0, 30.0, 15.0, 11.0, 2.0, 4.0, 3.0, 1.0, 2.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.13695700466632843, -0.13332800567150116, -0.1296989917755127, -0.12606999278068542, -0.12244099378585815, -0.11881198734045029, -0.11518298089504242, -0.11155398190021515, -0.10792497545480728, -0.10429596900939941, -0.10066697001457214, -0.09703796356916428, -0.09340895712375641, -0.08977995812892914, -0.08615095168352127, -0.0825219452381134, -0.07889294624328613, -0.07526393979787827, -0.071634940803051, -0.06800593435764313, -0.06437693536281586, -0.06074792891740799, -0.05711892247200012, -0.05348991975188255, -0.049860917031764984, -0.046231914311647415, -0.042602911591529846, -0.03897390514612198, -0.03534490242600441, -0.03171589970588684, -0.028086895123124123, -0.024457890540361404, -0.020828895270824432, -0.017199892550706863, -0.013570887967944145, -0.009941884316504002, -0.006312880665063858, -0.002683877944946289, 0.0009451266378164291, 0.004574131220579147, 0.008203133940696716, 0.01183213759213686, 0.015461141243577003, 0.01909014582633972, 0.02271914854645729, 0.02634815126657486, 0.029977155849337578, 0.033606160432100296, 0.037235163152217865, 0.040864165872335434, 0.044493168592453, 0.04812217503786087, 0.05175117775797844, 0.05538018047809601, 0.059009186923503876, 0.06263819336891174, 0.06626719236373901, 0.06989619880914688, 0.07352519780397415, 0.07715420424938202, 0.08078320324420929, 0.08441220968961716, 0.08804121613502502, 0.0916702151298523, 0.09529922157526016]}, "gradients/encoder.encoder.layers.10.final_layer_norm.bias": {"_type": "histogram", "values": [2.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 6.0, 5.0, 6.0, 3.0, 4.0, 16.0, 14.0, 13.0, 14.0, 12.0, 26.0, 21.0, 29.0, 28.0, 37.0, 37.0, 33.0, 36.0, 39.0, 44.0, 50.0, 33.0, 33.0, 38.0, 45.0, 36.0, 45.0, 46.0, 36.0, 27.0, 23.0, 25.0, 20.0, 17.0, 16.0, 17.0, 21.0, 18.0, 7.0, 7.0, 6.0, 4.0, 6.0, 4.0, 2.0, 0.0, 3.0, 2.0, 1.0, 0.0, 2.0, 1.0, 0.0, 1.0], "bins": [-0.041365206241607666, -0.04009249806404114, -0.03881978988647461, -0.03754708543419838, -0.03627437725663185, -0.03500166907906532, -0.03372896462678909, -0.032456256449222565, -0.031183548271656036, -0.029910840094089508, -0.02863813377916813, -0.02736542746424675, -0.02609271928668022, -0.024820011109113693, -0.023547304794192314, -0.022274598479270935, -0.021001890301704407, -0.01972918212413788, -0.0184564758092165, -0.01718376949429512, -0.015911061316728592, -0.014638354070484638, -0.013365646824240685, -0.01209293957799673, -0.010820232331752777, -0.009547525085508823, -0.00827481783926487, -0.007002110593020916, -0.005729403346776962, -0.004456696100533009, -0.003183988854289055, -0.0019112816080451012, -0.0006385743618011475, 0.0006341328844428062, 0.00190684013068676, 0.0031795473769307137, 0.004452254623174667, 0.005724961869418621, 0.006997669115662575, 0.008270376361906528, 0.009543083608150482, 0.010815790854394436, 0.01208849810063839, 0.013361205346882343, 0.014633912593126297, 0.015906620770692825, 0.017179327085614204, 0.018452033400535583, 0.019724741578102112, 0.02099744975566864, 0.02227015607059002, 0.0235428623855114, 0.024815570563077927, 0.026088278740644455, 0.027360985055565834, 0.028633691370487213, 0.02990639954805374, 0.03117910772562027, 0.0324518159031868, 0.03372452035546303, 0.034997228533029556, 0.036269936710596085, 0.037542641162872314, 0.03881534934043884, 0.04008805751800537]}, "gradients/encoder.encoder.layers.10.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 3.0, 0.0, 3.0, 1.0, 7.0, 7.0, 9.0, 14.0, 14.0, 24.0, 35.0, 36.0, 76.0, 105.0, 209.0, 341.0, 610.0, 1477.0, 4516.0, 23630.0, 353950.0, 615954.0, 37919.0, 6165.0, 1860.0, 701.0, 336.0, 206.0, 107.0, 66.0, 52.0, 34.0, 19.0, 21.0, 15.0, 12.0, 10.0, 7.0, 5.0, 3.0, 1.0, 0.0, 3.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0], "bins": [-0.251708984375, -0.2445392608642578, -0.23736953735351562, -0.23019981384277344, -0.22303009033203125, -0.21586036682128906, -0.20869064331054688, -0.2015209197998047, -0.1943511962890625, -0.1871814727783203, -0.18001174926757812, -0.17284202575683594, -0.16567230224609375, -0.15850257873535156, -0.15133285522460938, -0.1441631317138672, -0.136993408203125, -0.1298236846923828, -0.12265396118164062, -0.11548423767089844, -0.10831451416015625, -0.10114479064941406, -0.09397506713867188, -0.08680534362792969, -0.0796356201171875, -0.07246589660644531, -0.06529617309570312, -0.05812644958496094, -0.05095672607421875, -0.04378700256347656, -0.036617279052734375, -0.029447555541992188, -0.02227783203125, -0.015108108520507812, -0.007938385009765625, -0.0007686614990234375, 0.00640106201171875, 0.013570785522460938, 0.020740509033203125, 0.027910232543945312, 0.0350799560546875, 0.04224967956542969, 0.049419403076171875, 0.05658912658691406, 0.06375885009765625, 0.07092857360839844, 0.07809829711914062, 0.08526802062988281, 0.092437744140625, 0.09960746765136719, 0.10677719116210938, 0.11394691467285156, 0.12111663818359375, 0.12828636169433594, 0.13545608520507812, 0.1426258087158203, 0.1497955322265625, 0.1569652557373047, 0.16413497924804688, 0.17130470275878906, 0.17847442626953125, 0.18564414978027344, 0.19281387329101562, 0.1999835968017578, 0.2071533203125]}, "gradients/encoder.encoder.layers.10.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 4.0, 9.0, 8.0, 12.0, 23.0, 35.0, 53.0, 61.0, 78.0, 98.0, 105.0, 112.0, 102.0, 88.0, 69.0, 53.0, 28.0, 30.0, 11.0, 13.0, 8.0, 5.0, 2.0, 2.0, 1.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.1134033203125, -0.11077499389648438, -0.10814666748046875, -0.10551834106445312, -0.1028900146484375, -0.10026168823242188, -0.09763336181640625, -0.09500503540039062, -0.092376708984375, -0.08974838256835938, -0.08712005615234375, -0.08449172973632812, -0.0818634033203125, -0.07923507690429688, -0.07660675048828125, -0.07397842407226562, -0.07135009765625, -0.06872177124023438, -0.06609344482421875, -0.06346511840820312, -0.0608367919921875, -0.058208465576171875, -0.05558013916015625, -0.052951812744140625, -0.050323486328125, -0.047695159912109375, -0.04506683349609375, -0.042438507080078125, -0.0398101806640625, -0.037181854248046875, -0.03455352783203125, -0.031925201416015625, -0.029296875, -0.026668548583984375, -0.02404022216796875, -0.021411895751953125, -0.0187835693359375, -0.016155242919921875, -0.01352691650390625, -0.010898590087890625, -0.008270263671875, -0.005641937255859375, -0.00301361083984375, -0.000385284423828125, 0.0022430419921875, 0.004871368408203125, 0.00749969482421875, 0.010128021240234375, 0.01275634765625, 0.015384674072265625, 0.01801300048828125, 0.020641326904296875, 0.0232696533203125, 0.025897979736328125, 0.02852630615234375, 0.031154632568359375, 0.033782958984375, 0.036411285400390625, 0.03903961181640625, 0.041667938232421875, 0.0442962646484375, 0.046924591064453125, 0.04955291748046875, 0.052181243896484375, 0.0548095703125]}, "gradients/encoder.encoder.layers.10.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 3.0, 8.0, 8.0, 12.0, 9.0, 11.0, 31.0, 29.0, 41.0, 55.0, 79.0, 126.0, 242.0, 446.0, 972.0, 2383.0, 8276.0, 46398.0, 621943.0, 329603.0, 28517.0, 5976.0, 1887.0, 730.0, 331.0, 153.0, 83.0, 64.0, 44.0, 24.0, 16.0, 16.0, 16.0, 8.0, 7.0, 8.0, 5.0, 2.0, 3.0, 2.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09393310546875, -0.0894327163696289, -0.08493232727050781, -0.08043193817138672, -0.07593154907226562, -0.07143115997314453, -0.06693077087402344, -0.062430381774902344, -0.05792999267578125, -0.053429603576660156, -0.04892921447753906, -0.04442882537841797, -0.039928436279296875, -0.03542804718017578, -0.030927658081054688, -0.026427268981933594, -0.0219268798828125, -0.017426490783691406, -0.012926101684570312, -0.008425712585449219, -0.003925323486328125, 0.0005750656127929688, 0.0050754547119140625, 0.009575843811035156, 0.01407623291015625, 0.018576622009277344, 0.023077011108398438, 0.02757740020751953, 0.032077789306640625, 0.03657817840576172, 0.04107856750488281, 0.045578956604003906, 0.050079345703125, 0.054579734802246094, 0.05908012390136719, 0.06358051300048828, 0.06808090209960938, 0.07258129119873047, 0.07708168029785156, 0.08158206939697266, 0.08608245849609375, 0.09058284759521484, 0.09508323669433594, 0.09958362579345703, 0.10408401489257812, 0.10858440399169922, 0.11308479309082031, 0.1175851821899414, 0.1220855712890625, 0.1265859603881836, 0.1310863494873047, 0.13558673858642578, 0.14008712768554688, 0.14458751678466797, 0.14908790588378906, 0.15358829498291016, 0.15808868408203125, 0.16258907318115234, 0.16708946228027344, 0.17158985137939453, 0.17609024047851562, 0.18059062957763672, 0.1850910186767578, 0.1895914077758789, 0.194091796875]}, "gradients/encoder.encoder.layers.10.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 4.0, 0.0, 5.0, 5.0, 11.0, 3.0, 9.0, 9.0, 16.0, 13.0, 23.0, 32.0, 27.0, 22.0, 25.0, 26.0, 28.0, 42.0, 30.0, 34.0, 46.0, 51.0, 60.0, 48.0, 41.0, 46.0, 48.0, 35.0, 45.0, 32.0, 23.0, 39.0, 22.0, 13.0, 20.0, 10.0, 10.0, 10.0, 5.0, 10.0, 6.0, 8.0, 3.0, 4.0, 3.0, 7.0, 0.0, 1.0, 1.0, 3.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.1292724609375, -0.12497901916503906, -0.12068557739257812, -0.11639213562011719, -0.11209869384765625, -0.10780525207519531, -0.10351181030273438, -0.09921836853027344, -0.0949249267578125, -0.09063148498535156, -0.08633804321289062, -0.08204460144042969, -0.07775115966796875, -0.07345771789550781, -0.06916427612304688, -0.06487083435058594, -0.060577392578125, -0.05628395080566406, -0.051990509033203125, -0.04769706726074219, -0.04340362548828125, -0.03911018371582031, -0.034816741943359375, -0.030523300170898438, -0.0262298583984375, -0.021936416625976562, -0.017642974853515625, -0.013349533081054688, -0.00905609130859375, -0.0047626495361328125, -0.000469207763671875, 0.0038242340087890625, 0.00811767578125, 0.012411117553710938, 0.016704559326171875, 0.020998001098632812, 0.02529144287109375, 0.029584884643554688, 0.033878326416015625, 0.03817176818847656, 0.0424652099609375, 0.04675865173339844, 0.051052093505859375, 0.05534553527832031, 0.05963897705078125, 0.06393241882324219, 0.06822586059570312, 0.07251930236816406, 0.076812744140625, 0.08110618591308594, 0.08539962768554688, 0.08969306945800781, 0.09398651123046875, 0.09827995300292969, 0.10257339477539062, 0.10686683654785156, 0.1111602783203125, 0.11545372009277344, 0.11974716186523438, 0.12404060363769531, 0.12833404541015625, 0.1326274871826172, 0.13692092895507812, 0.14121437072753906, 0.1455078125]}, "gradients/encoder.encoder.layers.10.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 2.0, 4.0, 3.0, 3.0, 3.0, 5.0, 8.0, 13.0, 15.0, 15.0, 31.0, 36.0, 45.0, 65.0, 120.0, 213.0, 421.0, 907.0, 2533.0, 9515.0, 57772.0, 561692.0, 366677.0, 37680.0, 7070.0, 2125.0, 765.0, 323.0, 178.0, 94.0, 58.0, 51.0, 33.0, 23.0, 19.0, 10.0, 9.0, 6.0, 8.0, 7.0, 2.0, 2.0, 1.0, 2.0, 1.0, 1.0, 4.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0374755859375, -0.0363006591796875, -0.035125732421875, -0.0339508056640625, -0.03277587890625, -0.0316009521484375, -0.030426025390625, -0.0292510986328125, -0.028076171875, -0.0269012451171875, -0.025726318359375, -0.0245513916015625, -0.02337646484375, -0.0222015380859375, -0.021026611328125, -0.0198516845703125, -0.0186767578125, -0.0175018310546875, -0.016326904296875, -0.0151519775390625, -0.01397705078125, -0.0128021240234375, -0.011627197265625, -0.0104522705078125, -0.00927734375, -0.0081024169921875, -0.006927490234375, -0.0057525634765625, -0.00457763671875, -0.0034027099609375, -0.002227783203125, -0.0010528564453125, 0.0001220703125, 0.0012969970703125, 0.002471923828125, 0.0036468505859375, 0.00482177734375, 0.0059967041015625, 0.007171630859375, 0.0083465576171875, 0.009521484375, 0.0106964111328125, 0.011871337890625, 0.0130462646484375, 0.01422119140625, 0.0153961181640625, 0.016571044921875, 0.0177459716796875, 0.0189208984375, 0.0200958251953125, 0.021270751953125, 0.0224456787109375, 0.02362060546875, 0.0247955322265625, 0.025970458984375, 0.0271453857421875, 0.0283203125, 0.0294952392578125, 0.030670166015625, 0.0318450927734375, 0.03302001953125, 0.0341949462890625, 0.035369873046875, 0.0365447998046875, 0.0377197265625]}, "gradients/encoder.encoder.layers.10.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 3.0, 2.0, 4.0, 0.0, 8.0, 9.0, 5.0, 10.0, 12.0, 35.0, 15.0, 17.0, 36.0, 24.0, 49.0, 53.0, 39.0, 60.0, 60.0, 67.0, 50.0, 49.0, 61.0, 35.0, 76.0, 29.0, 45.0, 26.0, 20.0, 26.0, 20.0, 16.0, 11.0, 10.0, 8.0, 2.0, 7.0, 4.0, 6.0, 1.0, 0.0, 0.0, 2.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-6.079673767089844e-06, -5.8766454458236694e-06, -5.673617124557495e-06, -5.470588803291321e-06, -5.2675604820251465e-06, -5.064532160758972e-06, -4.861503839492798e-06, -4.6584755182266235e-06, -4.455447196960449e-06, -4.252418875694275e-06, -4.049390554428101e-06, -3.846362233161926e-06, -3.643333911895752e-06, -3.4403055906295776e-06, -3.2372772693634033e-06, -3.034248948097229e-06, -2.8312206268310547e-06, -2.6281923055648804e-06, -2.425163984298706e-06, -2.2221356630325317e-06, -2.0191073417663574e-06, -1.816079020500183e-06, -1.6130506992340088e-06, -1.4100223779678345e-06, -1.2069940567016602e-06, -1.0039657354354858e-06, -8.009374141693115e-07, -5.979090929031372e-07, -3.948807716369629e-07, -1.9185245037078857e-07, 1.1175870895385742e-08, 2.1420419216156006e-07, 4.172325134277344e-07, 6.202608346939087e-07, 8.23289155960083e-07, 1.0263174772262573e-06, 1.2293457984924316e-06, 1.432374119758606e-06, 1.6354024410247803e-06, 1.8384307622909546e-06, 2.041459083557129e-06, 2.2444874048233032e-06, 2.4475157260894775e-06, 2.650544047355652e-06, 2.853572368621826e-06, 3.0566006898880005e-06, 3.259629011154175e-06, 3.462657332420349e-06, 3.6656856536865234e-06, 3.868713974952698e-06, 4.071742296218872e-06, 4.274770617485046e-06, 4.477798938751221e-06, 4.680827260017395e-06, 4.883855581283569e-06, 5.086883902549744e-06, 5.289912223815918e-06, 5.492940545082092e-06, 5.695968866348267e-06, 5.898997187614441e-06, 6.102025508880615e-06, 6.3050538301467896e-06, 6.508082151412964e-06, 6.711110472679138e-06, 6.9141387939453125e-06]}, "gradients/encoder.encoder.layers.10.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 3.0, 0.0, 2.0, 3.0, 6.0, 6.0, 10.0, 14.0, 16.0, 36.0, 45.0, 85.0, 161.0, 290.0, 705.0, 1770.0, 6356.0, 43938.0, 643636.0, 322133.0, 22691.0, 4346.0, 1274.0, 470.0, 254.0, 123.0, 74.0, 48.0, 22.0, 16.0, 12.0, 10.0, 4.0, 6.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.040496826171875, -0.03886747360229492, -0.037238121032714844, -0.035608768463134766, -0.03397941589355469, -0.03235006332397461, -0.03072071075439453, -0.029091358184814453, -0.027462005615234375, -0.025832653045654297, -0.02420330047607422, -0.02257394790649414, -0.020944595336914062, -0.019315242767333984, -0.017685890197753906, -0.016056537628173828, -0.01442718505859375, -0.012797832489013672, -0.011168479919433594, -0.009539127349853516, -0.007909774780273438, -0.006280422210693359, -0.004651069641113281, -0.003021717071533203, -0.001392364501953125, 0.00023698806762695312, 0.0018663406372070312, 0.0034956932067871094, 0.0051250457763671875, 0.006754398345947266, 0.008383750915527344, 0.010013103485107422, 0.0116424560546875, 0.013271808624267578, 0.014901161193847656, 0.016530513763427734, 0.018159866333007812, 0.01978921890258789, 0.02141857147216797, 0.023047924041748047, 0.024677276611328125, 0.026306629180908203, 0.02793598175048828, 0.02956533432006836, 0.031194686889648438, 0.032824039459228516, 0.034453392028808594, 0.03608274459838867, 0.03771209716796875, 0.03934144973754883, 0.040970802307128906, 0.042600154876708984, 0.04422950744628906, 0.04585886001586914, 0.04748821258544922, 0.0491175651550293, 0.050746917724609375, 0.05237627029418945, 0.05400562286376953, 0.05563497543334961, 0.05726432800292969, 0.058893680572509766, 0.060523033142089844, 0.06215238571166992, 0.06378173828125]}, "gradients/encoder.encoder.layers.10.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 4.0, 3.0, 7.0, 6.0, 9.0, 10.0, 6.0, 14.0, 25.0, 33.0, 44.0, 43.0, 61.0, 60.0, 100.0, 73.0, 91.0, 68.0, 63.0, 49.0, 44.0, 36.0, 33.0, 25.0, 22.0, 17.0, 13.0, 16.0, 7.0, 5.0, 2.0, 5.0, 1.0, 4.0, 1.0, 3.0, 1.0, 1.0, 2.0, 2.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0281829833984375, -0.02721858024597168, -0.02625417709350586, -0.02528977394104004, -0.02432537078857422, -0.0233609676361084, -0.022396564483642578, -0.021432161331176758, -0.020467758178710938, -0.019503355026245117, -0.018538951873779297, -0.017574548721313477, -0.016610145568847656, -0.015645742416381836, -0.014681339263916016, -0.013716936111450195, -0.012752532958984375, -0.011788129806518555, -0.010823726654052734, -0.009859323501586914, -0.008894920349121094, -0.007930517196655273, -0.006966114044189453, -0.006001710891723633, -0.0050373077392578125, -0.004072904586791992, -0.003108501434326172, -0.0021440982818603516, -0.0011796951293945312, -0.00021529197692871094, 0.0007491111755371094, 0.0017135143280029297, 0.00267791748046875, 0.0036423206329345703, 0.004606723785400391, 0.005571126937866211, 0.006535530090332031, 0.0074999332427978516, 0.008464336395263672, 0.009428739547729492, 0.010393142700195312, 0.011357545852661133, 0.012321949005126953, 0.013286352157592773, 0.014250755310058594, 0.015215158462524414, 0.016179561614990234, 0.017143964767456055, 0.018108367919921875, 0.019072771072387695, 0.020037174224853516, 0.021001577377319336, 0.021965980529785156, 0.022930383682250977, 0.023894786834716797, 0.024859189987182617, 0.025823593139648438, 0.026787996292114258, 0.027752399444580078, 0.0287168025970459, 0.02968120574951172, 0.03064560890197754, 0.03161001205444336, 0.03257441520690918, 0.033538818359375]}, "gradients/encoder.encoder.layers.10.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 2.0, 11.0, 22.0, 88.0, 294.0, 396.0, 144.0, 40.0, 13.0, 3.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.22335684299469, -1.1562827825546265, -1.0892088413238525, -1.022134780883789, -0.9550607204437256, -0.8879867196083069, -0.8209127187728882, -0.7538386583328247, -0.686764657497406, -0.6196906566619873, -0.5526165962219238, -0.4855425953865051, -0.41846856474876404, -0.35139453411102295, -0.28432053327560425, -0.21724650263786316, -0.15017247200012207, -0.08309844881296158, -0.016024425625801086, 0.05104959011077881, 0.1181236207485199, 0.185197651386261, 0.2522716522216797, 0.3193456828594208, 0.38641971349716187, 0.45349374413490295, 0.520567774772644, 0.5876417756080627, 0.6547157764434814, 0.7217898368835449, 0.7888638377189636, 0.8559378385543823, 0.9230120182037354, 0.990086019039154, 1.0571600198745728, 1.1242340803146362, 1.1913081407546997, 1.2583820819854736, 1.325456142425537, 1.3925302028656006, 1.459604263305664, 1.5266783237457275, 1.5937522649765015, 1.660826325416565, 1.7279003858566284, 1.7949743270874023, 1.8620483875274658, 1.9291224479675293, 1.9961963891983032, 2.063270330429077, 2.1303443908691406, 2.197418451309204, 2.2644925117492676, 2.331566572189331, 2.3986406326293945, 2.465714454650879, 2.5327885150909424, 2.599862575531006, 2.6669366359710693, 2.734010696411133, 2.801084518432617, 2.8681585788726807, 2.935232639312744, 3.0023066997528076, 3.069380760192871]}, "gradients/encoder.encoder.layers.10.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 3.0, 1.0, 4.0, 2.0, 3.0, 8.0, 6.0, 5.0, 6.0, 5.0, 12.0, 13.0, 13.0, 11.0, 26.0, 34.0, 27.0, 44.0, 37.0, 42.0, 35.0, 46.0, 56.0, 60.0, 42.0, 63.0, 67.0, 44.0, 50.0, 42.0, 33.0, 33.0, 27.0, 22.0, 19.0, 11.0, 12.0, 14.0, 12.0, 8.0, 3.0, 4.0, 4.0, 4.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0], "bins": [-0.7941553592681885, -0.7700571417808533, -0.7459589838981628, -0.7218607664108276, -0.6977625489234924, -0.673664391040802, -0.6495661735534668, -0.6254680156707764, -0.6013697981834412, -0.577271580696106, -0.5531734228134155, -0.5290752053260803, -0.5049769878387451, -0.4808788299560547, -0.4567806124687195, -0.43268242478370667, -0.40858420729637146, -0.38448601961135864, -0.36038780212402344, -0.3362896144390106, -0.3121914267539978, -0.2880932092666626, -0.2639950215816498, -0.23989683389663696, -0.21579863131046295, -0.19170042872428894, -0.16760224103927612, -0.1435040384531021, -0.1194058433175087, -0.09530764818191528, -0.07120944559574127, -0.047111257910728455, -0.023013055324554443, 0.0010851416736841202, 0.025183338671922684, 0.049281537532806396, 0.07337973266839981, 0.09747792780399323, 0.12157613039016724, 0.14567431807518005, 0.16977252066135406, 0.19387072324752808, 0.2179689109325409, 0.2420671135187149, 0.2661653161048889, 0.29026350378990173, 0.31436169147491455, 0.33845990896224976, 0.3625580966472626, 0.3866562843322754, 0.4107545018196106, 0.4348526895046234, 0.45895087718963623, 0.48304909467697144, 0.5071473121643066, 0.5312454700469971, 0.5553436875343323, 0.5794419050216675, 0.6035400629043579, 0.6276382803916931, 0.6517364978790283, 0.6758346557617188, 0.699932873249054, 0.7240310907363892, 0.7481292486190796]}, "gradients/encoder.encoder.layers.9.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 4.0, 1.0, 16.0, 14.0, 18.0, 44.0, 60.0, 119.0, 280.0, 3370.0, 4189478.0, 535.0, 148.0, 88.0, 49.0, 33.0, 11.0, 6.0, 4.0, 2.0, 5.0, 1.0, 1.0, 3.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 2.0], "bins": [-2.8125, -2.747406005859375, -2.68231201171875, -2.617218017578125, -2.5521240234375, -2.487030029296875, -2.42193603515625, -2.356842041015625, -2.291748046875, -2.226654052734375, -2.16156005859375, -2.096466064453125, -2.0313720703125, -1.966278076171875, -1.90118408203125, -1.836090087890625, -1.77099609375, -1.705902099609375, -1.64080810546875, -1.575714111328125, -1.5106201171875, -1.445526123046875, -1.38043212890625, -1.315338134765625, -1.250244140625, -1.185150146484375, -1.12005615234375, -1.054962158203125, -0.9898681640625, -0.924774169921875, -0.85968017578125, -0.794586181640625, -0.7294921875, -0.664398193359375, -0.59930419921875, -0.534210205078125, -0.4691162109375, -0.404022216796875, -0.33892822265625, -0.273834228515625, -0.208740234375, -0.143646240234375, -0.07855224609375, -0.013458251953125, 0.0516357421875, 0.116729736328125, 0.18182373046875, 0.246917724609375, 0.31201171875, 0.377105712890625, 0.44219970703125, 0.507293701171875, 0.5723876953125, 0.637481689453125, 0.70257568359375, 0.767669677734375, 0.832763671875, 0.897857666015625, 0.96295166015625, 1.028045654296875, 1.0931396484375, 1.158233642578125, 1.22332763671875, 1.288421630859375, 1.353515625]}, "gradients/encoder.encoder.layers.9.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 3.0, 6.0, 6.0, 22.0, 25.0, 33.0, 65.0, 68.0, 82.0, 83.0, 111.0, 119.0, 88.0, 81.0, 65.0, 51.0, 40.0, 24.0, 10.0, 11.0, 9.0, 6.0, 1.0, 2.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.1124267578125, -0.10981082916259766, -0.10719490051269531, -0.10457897186279297, -0.10196304321289062, -0.09934711456298828, -0.09673118591308594, -0.0941152572631836, -0.09149932861328125, -0.0888833999633789, -0.08626747131347656, -0.08365154266357422, -0.08103561401367188, -0.07841968536376953, -0.07580375671386719, -0.07318782806396484, -0.0705718994140625, -0.06795597076416016, -0.06534004211425781, -0.06272411346435547, -0.060108184814453125, -0.05749225616455078, -0.05487632751464844, -0.052260398864746094, -0.04964447021484375, -0.047028541564941406, -0.04441261291503906, -0.04179668426513672, -0.039180755615234375, -0.03656482696533203, -0.03394889831542969, -0.031332969665527344, -0.028717041015625, -0.026101112365722656, -0.023485183715820312, -0.02086925506591797, -0.018253326416015625, -0.01563739776611328, -0.013021469116210938, -0.010405540466308594, -0.00778961181640625, -0.005173683166503906, -0.0025577545166015625, 5.817413330078125e-05, 0.002674102783203125, 0.005290031433105469, 0.007905960083007812, 0.010521888732910156, 0.0131378173828125, 0.015753746032714844, 0.018369674682617188, 0.02098560333251953, 0.023601531982421875, 0.02621746063232422, 0.028833389282226562, 0.031449317932128906, 0.03406524658203125, 0.036681175231933594, 0.03929710388183594, 0.04191303253173828, 0.044528961181640625, 0.04714488983154297, 0.04976081848144531, 0.052376747131347656, 0.05499267578125]}, "gradients/encoder.encoder.layers.9.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0, 0.0, 0.0, 3.0, 6.0, 4.0, 8.0, 10.0, 16.0, 22.0, 28.0, 31.0, 53.0, 51.0, 96.0, 135.0, 188.0, 265.0, 382.0, 604.0, 1011.0, 2279.0, 7906.0, 4107507.0, 64341.0, 5058.0, 1714.0, 841.0, 504.0, 328.0, 226.0, 176.0, 137.0, 90.0, 81.0, 54.0, 31.0, 39.0, 13.0, 17.0, 10.0, 7.0, 7.0, 5.0, 4.0, 3.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.3017578125, -0.29378700256347656, -0.2858161926269531, -0.2778453826904297, -0.26987457275390625, -0.2619037628173828, -0.2539329528808594, -0.24596214294433594, -0.2379913330078125, -0.23002052307128906, -0.22204971313476562, -0.2140789031982422, -0.20610809326171875, -0.1981372833251953, -0.19016647338867188, -0.18219566345214844, -0.174224853515625, -0.16625404357910156, -0.15828323364257812, -0.1503124237060547, -0.14234161376953125, -0.1343708038330078, -0.12639999389648438, -0.11842918395996094, -0.1104583740234375, -0.10248756408691406, -0.09451675415039062, -0.08654594421386719, -0.07857513427734375, -0.07060432434082031, -0.06263351440429688, -0.05466270446777344, -0.04669189453125, -0.03872108459472656, -0.030750274658203125, -0.022779464721679688, -0.01480865478515625, -0.0068378448486328125, 0.001132965087890625, 0.009103775024414062, 0.0170745849609375, 0.025045394897460938, 0.033016204833984375, 0.04098701477050781, 0.04895782470703125, 0.05692863464355469, 0.06489944458007812, 0.07287025451660156, 0.080841064453125, 0.08881187438964844, 0.09678268432617188, 0.10475349426269531, 0.11272430419921875, 0.12069511413574219, 0.12866592407226562, 0.13663673400878906, 0.1446075439453125, 0.15257835388183594, 0.16054916381835938, 0.1685199737548828, 0.17649078369140625, 0.1844615936279297, 0.19243240356445312, 0.20040321350097656, 0.2083740234375]}, "gradients/encoder.encoder.layers.9.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [3.0, 1.0, 5.0, 1.0, 4.0, 2.0, 6.0, 22.0, 86.0, 3832.0, 70.0, 26.0, 8.0, 5.0, 3.0, 4.0, 3.0, 1.0, 1.0, 0.0, 1.0, 1.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0140838623046875, -0.012603044509887695, -0.01112222671508789, -0.009641408920288086, -0.008160591125488281, -0.0066797733306884766, -0.005198955535888672, -0.003718137741088867, -0.0022373199462890625, -0.0007565021514892578, 0.0007243156433105469, 0.0022051334381103516, 0.0036859512329101562, 0.005166769027709961, 0.006647586822509766, 0.00812840461730957, 0.009609222412109375, 0.01109004020690918, 0.012570858001708984, 0.014051675796508789, 0.015532493591308594, 0.0170133113861084, 0.018494129180908203, 0.019974946975708008, 0.021455764770507812, 0.022936582565307617, 0.024417400360107422, 0.025898218154907227, 0.02737903594970703, 0.028859853744506836, 0.03034067153930664, 0.031821489334106445, 0.03330230712890625, 0.034783124923706055, 0.03626394271850586, 0.037744760513305664, 0.03922557830810547, 0.04070639610290527, 0.04218721389770508, 0.04366803169250488, 0.04514884948730469, 0.04662966728210449, 0.0481104850769043, 0.0495913028717041, 0.051072120666503906, 0.05255293846130371, 0.054033756256103516, 0.05551457405090332, 0.056995391845703125, 0.05847620964050293, 0.059957027435302734, 0.06143784523010254, 0.06291866302490234, 0.06439948081970215, 0.06588029861450195, 0.06736111640930176, 0.06884193420410156, 0.07032275199890137, 0.07180356979370117, 0.07328438758850098, 0.07476520538330078, 0.07624602317810059, 0.07772684097290039, 0.0792076587677002, 0.0806884765625]}, "gradients/encoder.encoder.layers.9.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 2.0, 3.0, 4.0, 17.0, 81.0, 455.0, 301.0, 101.0, 30.0, 14.0, 4.0, 5.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7454192638397217, -0.7239395380020142, -0.7024598121643066, -0.6809800267219543, -0.6595003008842468, -0.6380205750465393, -0.6165408492088318, -0.5950610637664795, -0.573581337928772, -0.5521016120910645, -0.5306218862533569, -0.5091421008110046, -0.4876623749732971, -0.4661826491355896, -0.4447029232978821, -0.4232231676578522, -0.40174344182014465, -0.38026371598243713, -0.3587839603424072, -0.3373042345046997, -0.3158244788646698, -0.2943447530269623, -0.2728649973869324, -0.25138527154922485, -0.22990553081035614, -0.20842579007148743, -0.1869460493326187, -0.16546630859375, -0.14398658275604248, -0.12250683456659317, -0.10102710127830505, -0.07954736053943634, -0.05806761980056763, -0.036587879061698914, -0.015108142048120499, 0.006371594965457916, 0.02785133570432663, 0.04933107644319534, 0.07081080973148346, 0.09229055047035217, 0.11377029120922089, 0.1352500319480896, 0.1567297726869583, 0.17820951342582703, 0.19968923926353455, 0.22116899490356445, 0.24264872074127197, 0.2641284465789795, 0.2856082022190094, 0.3070879280567169, 0.3285676836967468, 0.35004740953445435, 0.37152716517448425, 0.3930068910121918, 0.4144866466522217, 0.4359663724899292, 0.4574460983276367, 0.47892582416534424, 0.5004055500030518, 0.521885335445404, 0.5433650612831116, 0.5648447871208191, 0.5863245129585266, 0.6078042984008789, 0.6292840242385864]}, "gradients/encoder.encoder.layers.9.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 0.0, 4.0, 2.0, 0.0, 7.0, 8.0, 7.0, 9.0, 10.0, 4.0, 17.0, 21.0, 23.0, 16.0, 21.0, 29.0, 30.0, 25.0, 33.0, 42.0, 32.0, 26.0, 41.0, 32.0, 41.0, 43.0, 42.0, 33.0, 24.0, 37.0, 40.0, 25.0, 38.0, 30.0, 21.0, 28.0, 19.0, 29.0, 14.0, 19.0, 22.0, 8.0, 11.0, 12.0, 7.0, 6.0, 12.0, 3.0, 3.0, 3.0, 4.0, 1.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.09581488370895386, -0.09286656975746155, -0.08991824835538864, -0.08696992695331573, -0.08402161300182343, -0.08107329905033112, -0.07812497764825821, -0.0751766562461853, -0.072228342294693, -0.06928002834320068, -0.06633170694112778, -0.06338338553905487, -0.06043507158756256, -0.05748675391077995, -0.054538436233997345, -0.05159011855721474, -0.04864180088043213, -0.04569348320364952, -0.04274516552686691, -0.039796847850084305, -0.0368485301733017, -0.03390021249651909, -0.03095189481973648, -0.028003577142953873, -0.025055259466171265, -0.022106941789388657, -0.01915862411260605, -0.01621030643582344, -0.013261988759040833, -0.010313671082258224, -0.0073653534054756165, -0.004417035728693008, -0.0014687180519104004, 0.0014795996248722076, 0.004427917301654816, 0.007376234978437424, 0.010324552655220032, 0.01327287033200264, 0.016221188008785248, 0.019169505685567856, 0.022117823362350464, 0.025066141039133072, 0.02801445871591568, 0.030962776392698288, 0.033911094069480896, 0.036859411746263504, 0.03980772942304611, 0.04275604709982872, 0.04570436477661133, 0.048652682453393936, 0.051601000130176544, 0.05454931780695915, 0.05749763548374176, 0.06044595316052437, 0.06339427083730698, 0.06634259223937988, 0.06929090619087219, 0.0722392201423645, 0.07518754154443741, 0.07813586294651031, 0.08108417689800262, 0.08403249084949493, 0.08698081225156784, 0.08992913365364075, 0.09287744760513306]}, "gradients/encoder.encoder.layers.9.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 5.0, 3.0, 1.0, 6.0, 7.0, 12.0, 10.0, 21.0, 25.0, 47.0, 67.0, 115.0, 274.0, 435.0, 1043.0, 2463.0, 7880.0, 80139.0, 893844.0, 51483.0, 6617.0, 2237.0, 909.0, 430.0, 219.0, 109.0, 62.0, 27.0, 10.0, 27.0, 10.0, 4.0, 4.0, 7.0, 4.0, 2.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0], "bins": [-0.25830078125, -0.25008392333984375, -0.2418670654296875, -0.23365020751953125, -0.225433349609375, -0.21721649169921875, -0.2089996337890625, -0.20078277587890625, -0.19256591796875, -0.18434906005859375, -0.1761322021484375, -0.16791534423828125, -0.159698486328125, -0.15148162841796875, -0.1432647705078125, -0.13504791259765625, -0.1268310546875, -0.11861419677734375, -0.1103973388671875, -0.10218048095703125, -0.093963623046875, -0.08574676513671875, -0.0775299072265625, -0.06931304931640625, -0.06109619140625, -0.05287933349609375, -0.0446624755859375, -0.03644561767578125, -0.028228759765625, -0.02001190185546875, -0.0117950439453125, -0.00357818603515625, 0.004638671875, 0.01285552978515625, 0.0210723876953125, 0.02928924560546875, 0.037506103515625, 0.04572296142578125, 0.0539398193359375, 0.06215667724609375, 0.07037353515625, 0.07859039306640625, 0.0868072509765625, 0.09502410888671875, 0.103240966796875, 0.11145782470703125, 0.1196746826171875, 0.12789154052734375, 0.1361083984375, 0.14432525634765625, 0.1525421142578125, 0.16075897216796875, 0.168975830078125, 0.17719268798828125, 0.1854095458984375, 0.19362640380859375, 0.20184326171875, 0.21006011962890625, 0.2182769775390625, 0.22649383544921875, 0.234710693359375, 0.24292755126953125, 0.2511444091796875, 0.25936126708984375, 0.267578125]}, "gradients/encoder.encoder.layers.9.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 3.0, 1.0, 13.0, 16.0, 21.0, 37.0, 41.0, 50.0, 53.0, 68.0, 86.0, 91.0, 101.0, 80.0, 85.0, 55.0, 64.0, 45.0, 34.0, 25.0, 11.0, 9.0, 8.0, 6.0, 0.0, 5.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0], "bins": [-0.1080322265625, -0.10552740097045898, -0.10302257537841797, -0.10051774978637695, -0.09801292419433594, -0.09550809860229492, -0.0930032730102539, -0.09049844741821289, -0.08799362182617188, -0.08548879623413086, -0.08298397064208984, -0.08047914505004883, -0.07797431945800781, -0.0754694938659668, -0.07296466827392578, -0.07045984268188477, -0.06795501708984375, -0.06545019149780273, -0.06294536590576172, -0.0604405403137207, -0.05793571472167969, -0.05543088912963867, -0.052926063537597656, -0.05042123794555664, -0.047916412353515625, -0.04541158676147461, -0.042906761169433594, -0.04040193557739258, -0.03789710998535156, -0.03539228439331055, -0.03288745880126953, -0.030382633209228516, -0.0278778076171875, -0.025372982025146484, -0.02286815643310547, -0.020363330841064453, -0.017858505249023438, -0.015353679656982422, -0.012848854064941406, -0.01034402847290039, -0.007839202880859375, -0.005334377288818359, -0.0028295516967773438, -0.0003247261047363281, 0.0021800994873046875, 0.004684925079345703, 0.007189750671386719, 0.009694576263427734, 0.01219940185546875, 0.014704227447509766, 0.01720905303955078, 0.019713878631591797, 0.022218704223632812, 0.024723529815673828, 0.027228355407714844, 0.02973318099975586, 0.032238006591796875, 0.03474283218383789, 0.037247657775878906, 0.03975248336791992, 0.04225730895996094, 0.04476213455200195, 0.04726696014404297, 0.049771785736083984, 0.052276611328125]}, "gradients/encoder.encoder.layers.9.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 2.0, 2.0, 3.0, 5.0, 8.0, 17.0, 27.0, 39.0, 49.0, 107.0, 238.0, 645.0, 2244.0, 24142.0, 962729.0, 53846.0, 3192.0, 704.0, 237.0, 141.0, 81.0, 45.0, 33.0, 9.0, 11.0, 4.0, 1.0, 2.0, 2.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.355224609375, -0.34668540954589844, -0.3381462097167969, -0.3296070098876953, -0.32106781005859375, -0.3125286102294922, -0.3039894104003906, -0.29545021057128906, -0.2869110107421875, -0.27837181091308594, -0.2698326110839844, -0.2612934112548828, -0.25275421142578125, -0.2442150115966797, -0.23567581176757812, -0.22713661193847656, -0.218597412109375, -0.21005821228027344, -0.20151901245117188, -0.1929798126220703, -0.18444061279296875, -0.1759014129638672, -0.16736221313476562, -0.15882301330566406, -0.1502838134765625, -0.14174461364746094, -0.13320541381835938, -0.12466621398925781, -0.11612701416015625, -0.10758781433105469, -0.09904861450195312, -0.09050941467285156, -0.08197021484375, -0.07343101501464844, -0.06489181518554688, -0.05635261535644531, -0.04781341552734375, -0.03927421569824219, -0.030735015869140625, -0.022195816040039062, -0.0136566162109375, -0.0051174163818359375, 0.003421783447265625, 0.011960983276367188, 0.02050018310546875, 0.029039382934570312, 0.037578582763671875, 0.04611778259277344, 0.054656982421875, 0.06319618225097656, 0.07173538208007812, 0.08027458190917969, 0.08881378173828125, 0.09735298156738281, 0.10589218139648438, 0.11443138122558594, 0.1229705810546875, 0.13150978088378906, 0.14004898071289062, 0.1485881805419922, 0.15712738037109375, 0.1656665802001953, 0.17420578002929688, 0.18274497985839844, 0.1912841796875]}, "gradients/encoder.encoder.layers.9.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 0.0, 4.0, 1.0, 4.0, 4.0, 4.0, 9.0, 12.0, 7.0, 32.0, 19.0, 30.0, 45.0, 42.0, 52.0, 48.0, 50.0, 57.0, 67.0, 78.0, 65.0, 76.0, 51.0, 48.0, 36.0, 38.0, 34.0, 31.0, 19.0, 14.0, 8.0, 11.0, 7.0, 3.0, 3.0, 4.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.2266845703125, -0.2198486328125, -0.2130126953125, -0.2061767578125, -0.1993408203125, -0.1925048828125, -0.1856689453125, -0.1788330078125, -0.1719970703125, -0.1651611328125, -0.1583251953125, -0.1514892578125, -0.1446533203125, -0.1378173828125, -0.1309814453125, -0.1241455078125, -0.1173095703125, -0.1104736328125, -0.1036376953125, -0.0968017578125, -0.0899658203125, -0.0831298828125, -0.0762939453125, -0.0694580078125, -0.0626220703125, -0.0557861328125, -0.0489501953125, -0.0421142578125, -0.0352783203125, -0.0284423828125, -0.0216064453125, -0.0147705078125, -0.0079345703125, -0.0010986328125, 0.0057373046875, 0.0125732421875, 0.0194091796875, 0.0262451171875, 0.0330810546875, 0.0399169921875, 0.0467529296875, 0.0535888671875, 0.0604248046875, 0.0672607421875, 0.0740966796875, 0.0809326171875, 0.0877685546875, 0.0946044921875, 0.1014404296875, 0.1082763671875, 0.1151123046875, 0.1219482421875, 0.1287841796875, 0.1356201171875, 0.1424560546875, 0.1492919921875, 0.1561279296875, 0.1629638671875, 0.1697998046875, 0.1766357421875, 0.1834716796875, 0.1903076171875, 0.1971435546875, 0.2039794921875, 0.2108154296875]}, "gradients/encoder.encoder.layers.9.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 3.0, 3.0, 4.0, 2.0, 5.0, 8.0, 10.0, 18.0, 17.0, 32.0, 41.0, 55.0, 96.0, 199.0, 362.0, 671.0, 1687.0, 4997.0, 22166.0, 232006.0, 716064.0, 56046.0, 9296.0, 2671.0, 977.0, 499.0, 260.0, 150.0, 80.0, 49.0, 33.0, 15.0, 16.0, 5.0, 8.0, 2.0, 4.0, 4.0, 2.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.049407958984375, -0.047921180725097656, -0.04643440246582031, -0.04494762420654297, -0.043460845947265625, -0.04197406768798828, -0.04048728942871094, -0.039000511169433594, -0.03751373291015625, -0.036026954650878906, -0.03454017639160156, -0.03305339813232422, -0.031566619873046875, -0.03007984161376953, -0.028593063354492188, -0.027106285095214844, -0.0256195068359375, -0.024132728576660156, -0.022645950317382812, -0.02115917205810547, -0.019672393798828125, -0.01818561553955078, -0.016698837280273438, -0.015212059020996094, -0.01372528076171875, -0.012238502502441406, -0.010751724243164062, -0.009264945983886719, -0.007778167724609375, -0.006291389465332031, -0.0048046112060546875, -0.0033178329467773438, -0.0018310546875, -0.00034427642822265625, 0.0011425018310546875, 0.0026292800903320312, 0.004116058349609375, 0.005602836608886719, 0.0070896148681640625, 0.008576393127441406, 0.01006317138671875, 0.011549949645996094, 0.013036727905273438, 0.014523506164550781, 0.016010284423828125, 0.01749706268310547, 0.018983840942382812, 0.020470619201660156, 0.0219573974609375, 0.023444175720214844, 0.024930953979492188, 0.02641773223876953, 0.027904510498046875, 0.02939128875732422, 0.030878067016601562, 0.032364845275878906, 0.03385162353515625, 0.035338401794433594, 0.03682518005371094, 0.03831195831298828, 0.039798736572265625, 0.04128551483154297, 0.04277229309082031, 0.044259071350097656, 0.045745849609375]}, "gradients/encoder.encoder.layers.9.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 0.0, 0.0, 3.0, 0.0, 0.0, 3.0, 4.0, 4.0, 0.0, 4.0, 6.0, 9.0, 11.0, 19.0, 25.0, 15.0, 32.0, 38.0, 47.0, 51.0, 64.0, 68.0, 83.0, 57.0, 82.0, 63.0, 70.0, 54.0, 39.0, 39.0, 23.0, 32.0, 18.0, 10.0, 6.0, 8.0, 8.0, 6.0, 2.0, 1.0, 2.0, 0.0, 1.0, 1.0, 2.0, 0.0, 2.0, 2.0], "bins": [-1.0848045349121094e-05, -1.0574236512184143e-05, -1.0300427675247192e-05, -1.0026618838310242e-05, -9.752810001373291e-06, -9.47900116443634e-06, -9.20519232749939e-06, -8.931383490562439e-06, -8.657574653625488e-06, -8.383765816688538e-06, -8.109956979751587e-06, -7.836148142814636e-06, -7.5623393058776855e-06, -7.288530468940735e-06, -7.014721632003784e-06, -6.7409127950668335e-06, -6.467103958129883e-06, -6.193295121192932e-06, -5.9194862842559814e-06, -5.645677447319031e-06, -5.37186861038208e-06, -5.098059773445129e-06, -4.824250936508179e-06, -4.550442099571228e-06, -4.276633262634277e-06, -4.002824425697327e-06, -3.729015588760376e-06, -3.4552067518234253e-06, -3.1813979148864746e-06, -2.907589077949524e-06, -2.6337802410125732e-06, -2.3599714040756226e-06, -2.086162567138672e-06, -1.8123537302017212e-06, -1.5385448932647705e-06, -1.2647360563278198e-06, -9.909272193908691e-07, -7.171183824539185e-07, -4.4330954551696777e-07, -1.695007085800171e-07, 1.043081283569336e-07, 3.781169652938843e-07, 6.51925802230835e-07, 9.257346391677856e-07, 1.1995434761047363e-06, 1.473352313041687e-06, 1.7471611499786377e-06, 2.0209699869155884e-06, 2.294778823852539e-06, 2.5685876607894897e-06, 2.8423964977264404e-06, 3.116205334663391e-06, 3.390014171600342e-06, 3.6638230085372925e-06, 3.937631845474243e-06, 4.211440682411194e-06, 4.4852495193481445e-06, 4.759058356285095e-06, 5.032867193222046e-06, 5.306676030158997e-06, 5.580484867095947e-06, 5.854293704032898e-06, 6.128102540969849e-06, 6.401911377906799e-06, 6.67572021484375e-06]}, "gradients/encoder.encoder.layers.9.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 3.0, 0.0, 0.0, 1.0, 2.0, 0.0, 3.0, 2.0, 2.0, 6.0, 3.0, 5.0, 4.0, 3.0, 9.0, 13.0, 12.0, 34.0, 72.0, 120.0, 356.0, 998.0, 4069.0, 28392.0, 835430.0, 166022.0, 9995.0, 2023.0, 579.0, 215.0, 86.0, 47.0, 20.0, 8.0, 6.0, 5.0, 4.0, 3.0, 1.0, 3.0, 2.0, 2.0, 2.0, 2.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.08612060546875, -0.08332157135009766, -0.08052253723144531, -0.07772350311279297, -0.07492446899414062, -0.07212543487548828, -0.06932640075683594, -0.0665273666381836, -0.06372833251953125, -0.060929298400878906, -0.05813026428222656, -0.05533123016357422, -0.052532196044921875, -0.04973316192626953, -0.04693412780761719, -0.044135093688964844, -0.0413360595703125, -0.038537025451660156, -0.03573799133300781, -0.03293895721435547, -0.030139923095703125, -0.02734088897705078, -0.024541854858398438, -0.021742820739746094, -0.01894378662109375, -0.016144752502441406, -0.013345718383789062, -0.010546684265136719, -0.007747650146484375, -0.004948616027832031, -0.0021495819091796875, 0.0006494522094726562, 0.003448486328125, 0.006247520446777344, 0.009046554565429688, 0.011845588684082031, 0.014644622802734375, 0.01744365692138672, 0.020242691040039062, 0.023041725158691406, 0.02584075927734375, 0.028639793395996094, 0.03143882751464844, 0.03423786163330078, 0.037036895751953125, 0.03983592987060547, 0.04263496398925781, 0.045433998107910156, 0.0482330322265625, 0.051032066345214844, 0.05383110046386719, 0.05663013458251953, 0.059429168701171875, 0.06222820281982422, 0.06502723693847656, 0.0678262710571289, 0.07062530517578125, 0.0734243392944336, 0.07622337341308594, 0.07902240753173828, 0.08182144165039062, 0.08462047576904297, 0.08741950988769531, 0.09021854400634766, 0.093017578125]}, "gradients/encoder.encoder.layers.9.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 3.0, 1.0, 0.0, 1.0, 0.0, 2.0, 2.0, 3.0, 1.0, 1.0, 1.0, 3.0, 6.0, 4.0, 5.0, 19.0, 28.0, 53.0, 123.0, 210.0, 232.0, 157.0, 71.0, 32.0, 18.0, 9.0, 7.0, 8.0, 4.0, 0.0, 4.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.12109375, -0.11774730682373047, -0.11440086364746094, -0.1110544204711914, -0.10770797729492188, -0.10436153411865234, -0.10101509094238281, -0.09766864776611328, -0.09432220458984375, -0.09097576141357422, -0.08762931823730469, -0.08428287506103516, -0.08093643188476562, -0.0775899887084961, -0.07424354553222656, -0.07089710235595703, -0.0675506591796875, -0.06420421600341797, -0.06085777282714844, -0.057511329650878906, -0.054164886474609375, -0.050818443298339844, -0.04747200012207031, -0.04412555694580078, -0.04077911376953125, -0.03743267059326172, -0.03408622741699219, -0.030739784240722656, -0.027393341064453125, -0.024046897888183594, -0.020700454711914062, -0.01735401153564453, -0.014007568359375, -0.010661125183105469, -0.0073146820068359375, -0.003968238830566406, -0.000621795654296875, 0.0027246475219726562, 0.0060710906982421875, 0.009417533874511719, 0.01276397705078125, 0.01611042022705078, 0.019456863403320312, 0.022803306579589844, 0.026149749755859375, 0.029496192932128906, 0.03284263610839844, 0.03618907928466797, 0.0395355224609375, 0.04288196563720703, 0.04622840881347656, 0.049574851989746094, 0.052921295166015625, 0.056267738342285156, 0.05961418151855469, 0.06296062469482422, 0.06630706787109375, 0.06965351104736328, 0.07299995422363281, 0.07634639739990234, 0.07969284057617188, 0.0830392837524414, 0.08638572692871094, 0.08973217010498047, 0.09307861328125]}, "gradients/encoder.encoder.layers.9.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 3.0, 3.0, 3.0, 13.0, 35.0, 50.0, 129.0, 245.0, 226.0, 173.0, 72.0, 31.0, 17.0, 7.0, 5.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.337580919265747, -1.2996152639389038, -1.2616496086120605, -1.2236839532852173, -1.185718297958374, -1.1477526426315308, -1.1097869873046875, -1.0718213319778442, -1.033855676651001, -0.9958900213241577, -0.9579243659973145, -0.9199587106704712, -0.8819930553436279, -0.8440274000167847, -0.8060617446899414, -0.7680960893630981, -0.7301303744316101, -0.6921647191047668, -0.6541990637779236, -0.6162334084510803, -0.5782677531242371, -0.5403020977973938, -0.5023363828659058, -0.4643707573413849, -0.4264051020145416, -0.38843944668769836, -0.3504737913608551, -0.31250810623168945, -0.2745424509048462, -0.23657681047916412, -0.19861114025115967, -0.1606454849243164, -0.12267982959747314, -0.08471417427062988, -0.046748511493206024, -0.008782848715782166, 0.029182806611061096, 0.06714846193790436, 0.10511413216590881, 0.14307978749275208, 0.18104544281959534, 0.2190110981464386, 0.25697675347328186, 0.2949424386024475, 0.33290809392929077, 0.37087374925613403, 0.4088394045829773, 0.44680505990982056, 0.4847707152366638, 0.5227363705635071, 0.5607020258903503, 0.5986676812171936, 0.6366333365440369, 0.6745989918708801, 0.7125647068023682, 0.7505303621292114, 0.7884960174560547, 0.826461672782898, 0.8644273281097412, 0.9023929834365845, 0.9403586387634277, 0.978324294090271, 1.0162899494171143, 1.0542556047439575, 1.0922212600708008]}, "gradients/encoder.encoder.layers.9.layer_norm.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 4.0, 4.0, 3.0, 9.0, 5.0, 13.0, 15.0, 22.0, 17.0, 25.0, 28.0, 21.0, 31.0, 36.0, 35.0, 48.0, 39.0, 56.0, 51.0, 51.0, 58.0, 45.0, 60.0, 46.0, 41.0, 25.0, 30.0, 37.0, 30.0, 24.0, 21.0, 14.0, 9.0, 7.0, 13.0, 14.0, 8.0, 4.0, 6.0, 4.0, 3.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6151859760284424, -0.5917969942092896, -0.5684080123901367, -0.5450190305709839, -0.521630048751831, -0.4982410669326782, -0.4748521149158478, -0.45146313309669495, -0.4280741512775421, -0.4046851694583893, -0.38129618763923645, -0.357907235622406, -0.3345182538032532, -0.31112927198410034, -0.2877402901649475, -0.2643513083457947, -0.24096232652664185, -0.217573344707489, -0.19418436288833618, -0.17079539597034454, -0.1474064141511917, -0.12401743233203888, -0.10062846541404724, -0.07723948359489441, -0.05385050177574158, -0.030461523681879044, -0.00707254558801651, 0.016316428780555725, 0.03970541059970856, 0.06309439241886139, 0.08648335933685303, 0.10987234115600586, 0.13326138257980347, 0.1566503643989563, 0.18003934621810913, 0.20342831313610077, 0.2268172949552536, 0.2502062916755676, 0.27359524369239807, 0.2969842255115509, 0.32037320733070374, 0.34376218914985657, 0.3671511709690094, 0.39054012298583984, 0.4139291048049927, 0.4373180866241455, 0.46070706844329834, 0.48409605026245117, 0.507485032081604, 0.5308740139007568, 0.5542629957199097, 0.5776519775390625, 0.6010409593582153, 0.6244299411773682, 0.647818922996521, 0.6712079048156738, 0.6945968866348267, 0.7179858684539795, 0.7413748502731323, 0.7647638320922852, 0.788152813911438, 0.8115417957305908, 0.8349307775497437, 0.8583197593688965, 0.8817086815834045]}, "gradients/encoder.encoder.layers.8.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 6.0, 4.0, 12.0, 14.0, 22.0, 29.0, 55.0, 88.0, 121.0, 490.0, 4189031.0, 3965.0, 183.0, 91.0, 67.0, 44.0, 25.0, 20.0, 7.0, 3.0, 5.0, 5.0, 1.0, 1.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0], "bins": [-1.6328125, -1.59368896484375, -1.5545654296875, -1.51544189453125, -1.476318359375, -1.43719482421875, -1.3980712890625, -1.35894775390625, -1.31982421875, -1.28070068359375, -1.2415771484375, -1.20245361328125, -1.163330078125, -1.12420654296875, -1.0850830078125, -1.04595947265625, -1.0068359375, -0.96771240234375, -0.9285888671875, -0.88946533203125, -0.850341796875, -0.81121826171875, -0.7720947265625, -0.73297119140625, -0.69384765625, -0.65472412109375, -0.6156005859375, -0.57647705078125, -0.537353515625, -0.49822998046875, -0.4591064453125, -0.41998291015625, -0.380859375, -0.34173583984375, -0.3026123046875, -0.26348876953125, -0.224365234375, -0.18524169921875, -0.1461181640625, -0.10699462890625, -0.06787109375, -0.02874755859375, 0.0103759765625, 0.04949951171875, 0.088623046875, 0.12774658203125, 0.1668701171875, 0.20599365234375, 0.2451171875, 0.28424072265625, 0.3233642578125, 0.36248779296875, 0.401611328125, 0.44073486328125, 0.4798583984375, 0.51898193359375, 0.55810546875, 0.59722900390625, 0.6363525390625, 0.67547607421875, 0.714599609375, 0.75372314453125, 0.7928466796875, 0.83197021484375, 0.87109375]}, "gradients/encoder.encoder.layers.8.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 3.0, 1.0, 4.0, 9.0, 3.0, 13.0, 20.0, 21.0, 31.0, 55.0, 56.0, 76.0, 59.0, 77.0, 117.0, 94.0, 83.0, 59.0, 54.0, 51.0, 39.0, 27.0, 13.0, 20.0, 9.0, 5.0, 5.0, 0.0, 4.0, 3.0, 1.0, 1.0, 0.0, 0.0, 1.0, 3.0], "bins": [-0.10418701171875, -0.10174083709716797, -0.09929466247558594, -0.0968484878540039, -0.09440231323242188, -0.09195613861083984, -0.08950996398925781, -0.08706378936767578, -0.08461761474609375, -0.08217144012451172, -0.07972526550292969, -0.07727909088134766, -0.07483291625976562, -0.0723867416381836, -0.06994056701660156, -0.06749439239501953, -0.0650482177734375, -0.06260204315185547, -0.06015586853027344, -0.057709693908691406, -0.055263519287109375, -0.052817344665527344, -0.05037117004394531, -0.04792499542236328, -0.04547882080078125, -0.04303264617919922, -0.04058647155761719, -0.038140296936035156, -0.035694122314453125, -0.033247947692871094, -0.030801773071289062, -0.02835559844970703, -0.025909423828125, -0.02346324920654297, -0.021017074584960938, -0.018570899963378906, -0.016124725341796875, -0.013678550720214844, -0.011232376098632812, -0.008786201477050781, -0.00634002685546875, -0.0038938522338867188, -0.0014476776123046875, 0.0009984970092773438, 0.003444671630859375, 0.005890846252441406, 0.008337020874023438, 0.010783195495605469, 0.0132293701171875, 0.01567554473876953, 0.018121719360351562, 0.020567893981933594, 0.023014068603515625, 0.025460243225097656, 0.027906417846679688, 0.03035259246826172, 0.03279876708984375, 0.03524494171142578, 0.03769111633300781, 0.040137290954589844, 0.042583465576171875, 0.045029640197753906, 0.04747581481933594, 0.04992198944091797, 0.0523681640625]}, "gradients/encoder.encoder.layers.8.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 2.0, 4.0, 3.0, 5.0, 3.0, 2.0, 12.0, 13.0, 32.0, 28.0, 44.0, 61.0, 106.0, 132.0, 239.0, 387.0, 700.0, 1389.0, 2869.0, 7671.0, 40895.0, 4060099.0, 63449.0, 9212.0, 3364.0, 1665.0, 789.0, 408.0, 233.0, 147.0, 124.0, 59.0, 49.0, 27.0, 24.0, 17.0, 14.0, 4.0, 6.0, 4.0, 3.0, 2.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1422119140625, -0.1377086639404297, -0.13320541381835938, -0.12870216369628906, -0.12419891357421875, -0.11969566345214844, -0.11519241333007812, -0.11068916320800781, -0.1061859130859375, -0.10168266296386719, -0.09717941284179688, -0.09267616271972656, -0.08817291259765625, -0.08366966247558594, -0.07916641235351562, -0.07466316223144531, -0.070159912109375, -0.06565666198730469, -0.061153411865234375, -0.05665016174316406, -0.05214691162109375, -0.04764366149902344, -0.043140411376953125, -0.03863716125488281, -0.0341339111328125, -0.029630661010742188, -0.025127410888671875, -0.020624160766601562, -0.01612091064453125, -0.011617660522460938, -0.007114410400390625, -0.0026111602783203125, 0.00189208984375, 0.0063953399658203125, 0.010898590087890625, 0.015401840209960938, 0.01990509033203125, 0.024408340454101562, 0.028911590576171875, 0.03341484069824219, 0.0379180908203125, 0.04242134094238281, 0.046924591064453125, 0.05142784118652344, 0.05593109130859375, 0.06043434143066406, 0.06493759155273438, 0.06944084167480469, 0.073944091796875, 0.07844734191894531, 0.08295059204101562, 0.08745384216308594, 0.09195709228515625, 0.09646034240722656, 0.10096359252929688, 0.10546684265136719, 0.1099700927734375, 0.11447334289550781, 0.11897659301757812, 0.12347984313964844, 0.12798309326171875, 0.13248634338378906, 0.13698959350585938, 0.1414928436279297, 0.14599609375]}, "gradients/encoder.encoder.layers.8.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 1.0, 3.0, 5.0, 5.0, 3.0, 14.0, 20.0, 24.0, 64.0, 111.0, 3376.0, 280.0, 95.0, 35.0, 12.0, 6.0, 4.0, 4.0, 5.0, 4.0, 1.0, 1.0, 1.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 2.0], "bins": [-0.08282470703125, -0.08097314834594727, -0.07912158966064453, -0.0772700309753418, -0.07541847229003906, -0.07356691360473633, -0.0717153549194336, -0.06986379623413086, -0.06801223754882812, -0.06616067886352539, -0.06430912017822266, -0.06245756149291992, -0.06060600280761719, -0.05875444412231445, -0.05690288543701172, -0.055051326751708984, -0.05319976806640625, -0.051348209381103516, -0.04949665069580078, -0.04764509201049805, -0.04579353332519531, -0.04394197463989258, -0.042090415954589844, -0.04023885726928711, -0.038387298583984375, -0.03653573989868164, -0.034684181213378906, -0.03283262252807617, -0.030981063842773438, -0.029129505157470703, -0.02727794647216797, -0.025426387786865234, -0.0235748291015625, -0.021723270416259766, -0.01987171173095703, -0.018020153045654297, -0.016168594360351562, -0.014317035675048828, -0.012465476989746094, -0.01061391830444336, -0.008762359619140625, -0.006910800933837891, -0.005059242248535156, -0.003207683563232422, -0.0013561248779296875, 0.0004954338073730469, 0.0023469924926757812, 0.004198551177978516, 0.00605010986328125, 0.007901668548583984, 0.009753227233886719, 0.011604785919189453, 0.013456344604492188, 0.015307903289794922, 0.017159461975097656, 0.01901102066040039, 0.020862579345703125, 0.02271413803100586, 0.024565696716308594, 0.026417255401611328, 0.028268814086914062, 0.030120372772216797, 0.03197193145751953, 0.033823490142822266, 0.035675048828125]}, "gradients/encoder.encoder.layers.8.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 3.0, 3.0, 9.0, 8.0, 25.0, 59.0, 169.0, 351.0, 285.0, 70.0, 24.0, 6.0, 4.0, 1.0, 0.0, 0.0, 1.0], "bins": [-1.0081409215927124, -0.9898035526275635, -0.9714662432670593, -0.9531288743019104, -0.9347915053367615, -0.9164541959762573, -0.8981168270111084, -0.8797794580459595, -0.8614421486854553, -0.8431047797203064, -0.8247674703598022, -0.8064301013946533, -0.7880927324295044, -0.7697554230690002, -0.7514180541038513, -0.7330807447433472, -0.7147433757781982, -0.6964060068130493, -0.6780686974525452, -0.6597313284873962, -0.6413939595222473, -0.6230566501617432, -0.6047192811965942, -0.5863819122314453, -0.5680445432662964, -0.5497071743011475, -0.5313698649406433, -0.5130324959754944, -0.49469515681266785, -0.4763578176498413, -0.4580204486846924, -0.43968310952186584, -0.4213457405567169, -0.4030084013938904, -0.38467103242874146, -0.3663336932659149, -0.3479963541030884, -0.32965898513793945, -0.3113216459751129, -0.2929843068122864, -0.27464693784713745, -0.2563095986843109, -0.23797224462032318, -0.21963489055633545, -0.2012975513935089, -0.18296019732952118, -0.16462284326553345, -0.1462855041027069, -0.12794816493988037, -0.10961081832647324, -0.0912734717130661, -0.07293611764907837, -0.054598771035671234, -0.0362614244222641, -0.017924070358276367, 0.0004132688045501709, 0.018750622868537903, 0.03708796948194504, 0.05542531982064247, 0.0737626701593399, 0.09210001677274704, 0.11043736338615417, 0.1287747174501419, 0.14711205661296844, 0.16544941067695618]}, "gradients/encoder.encoder.layers.8.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 3.0, 3.0, 4.0, 2.0, 8.0, 8.0, 13.0, 7.0, 15.0, 13.0, 16.0, 30.0, 32.0, 34.0, 35.0, 30.0, 43.0, 37.0, 54.0, 42.0, 60.0, 39.0, 51.0, 49.0, 43.0, 31.0, 33.0, 46.0, 37.0, 27.0, 26.0, 15.0, 20.0, 17.0, 23.0, 14.0, 11.0, 8.0, 10.0, 11.0, 1.0, 6.0, 1.0, 4.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.12366318702697754, -0.11954590678215027, -0.1154286190867424, -0.11131133884191513, -0.10719405114650726, -0.10307677090167999, -0.09895949065685272, -0.09484221041202545, -0.09072492271661758, -0.08660764247179031, -0.08249035477638245, -0.07837307453155518, -0.0742557942867279, -0.07013850659132004, -0.06602122634649277, -0.0619039423763752, -0.05778665840625763, -0.05366937443614006, -0.04955209046602249, -0.04543481022119522, -0.04131752625107765, -0.03720024228096008, -0.03308296203613281, -0.028965678066015244, -0.024848394095897675, -0.020731110125780106, -0.016613828018307686, -0.012496544979512691, -0.008379261940717697, -0.004261977970600128, -0.00014469586312770844, 0.003972586244344711, 0.00808987021446228, 0.012207153253257275, 0.01632443629205227, 0.02044171839952469, 0.024559002369642258, 0.028676286339759827, 0.0327935665845871, 0.036910850554704666, 0.041028134524822235, 0.045145418494939804, 0.04926270246505737, 0.053379982709884644, 0.05749726668000221, 0.06161455065011978, 0.06573183089494705, 0.06984911859035492, 0.07396639883518219, 0.07808367908000946, 0.08220096677541733, 0.0863182470202446, 0.09043553471565247, 0.09455281496047974, 0.098670095205307, 0.10278737545013428, 0.10690466314554214, 0.11102194339036942, 0.11513923108577728, 0.11925651133060455, 0.12337379157543182, 0.1274910867214203, 0.13160836696624756, 0.13572564721107483, 0.1398429274559021]}, "gradients/encoder.encoder.layers.8.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 4.0, 2.0, 7.0, 18.0, 21.0, 28.0, 47.0, 75.0, 119.0, 234.0, 427.0, 1150.0, 3737.0, 19696.0, 183345.0, 719048.0, 103149.0, 12950.0, 2782.0, 913.0, 355.0, 201.0, 95.0, 56.0, 28.0, 23.0, 14.0, 13.0, 8.0, 8.0, 2.0, 3.0, 5.0, 1.0, 0.0, 3.0, 1.0, 1.0, 1.0], "bins": [-0.176513671875, -0.17214298248291016, -0.1677722930908203, -0.16340160369873047, -0.15903091430664062, -0.15466022491455078, -0.15028953552246094, -0.1459188461303711, -0.14154815673828125, -0.1371774673461914, -0.13280677795410156, -0.12843608856201172, -0.12406539916992188, -0.11969470977783203, -0.11532402038574219, -0.11095333099365234, -0.1065826416015625, -0.10221195220947266, -0.09784126281738281, -0.09347057342529297, -0.08909988403320312, -0.08472919464111328, -0.08035850524902344, -0.0759878158569336, -0.07161712646484375, -0.0672464370727539, -0.06287574768066406, -0.05850505828857422, -0.054134368896484375, -0.04976367950439453, -0.04539299011230469, -0.041022300720214844, -0.036651611328125, -0.032280921936035156, -0.027910232543945312, -0.02353954315185547, -0.019168853759765625, -0.014798164367675781, -0.010427474975585938, -0.006056785583496094, -0.00168609619140625, 0.0026845932006835938, 0.0070552825927734375, 0.011425971984863281, 0.015796661376953125, 0.02016735076904297, 0.024538040161132812, 0.028908729553222656, 0.0332794189453125, 0.037650108337402344, 0.04202079772949219, 0.04639148712158203, 0.050762176513671875, 0.05513286590576172, 0.05950355529785156, 0.0638742446899414, 0.06824493408203125, 0.0726156234741211, 0.07698631286621094, 0.08135700225830078, 0.08572769165039062, 0.09009838104248047, 0.09446907043457031, 0.09883975982666016, 0.10321044921875]}, "gradients/encoder.encoder.layers.8.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 2.0, 2.0, 1.0, 4.0, 8.0, 8.0, 19.0, 28.0, 28.0, 45.0, 55.0, 70.0, 75.0, 82.0, 93.0, 94.0, 78.0, 82.0, 48.0, 48.0, 37.0, 36.0, 21.0, 13.0, 13.0, 7.0, 5.0, 1.0, 4.0, 3.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0], "bins": [-0.10662841796875, -0.10413599014282227, -0.10164356231689453, -0.0991511344909668, -0.09665870666503906, -0.09416627883911133, -0.0916738510131836, -0.08918142318725586, -0.08668899536132812, -0.08419656753540039, -0.08170413970947266, -0.07921171188354492, -0.07671928405761719, -0.07422685623168945, -0.07173442840576172, -0.06924200057983398, -0.06674957275390625, -0.06425714492797852, -0.06176471710205078, -0.05927228927612305, -0.05677986145019531, -0.05428743362426758, -0.051795005798339844, -0.04930257797241211, -0.046810150146484375, -0.04431772232055664, -0.041825294494628906, -0.03933286666870117, -0.03684043884277344, -0.0343480110168457, -0.03185558319091797, -0.029363155364990234, -0.0268707275390625, -0.024378299713134766, -0.02188587188720703, -0.019393444061279297, -0.016901016235351562, -0.014408588409423828, -0.011916160583496094, -0.00942373275756836, -0.006931304931640625, -0.004438877105712891, -0.0019464492797851562, 0.0005459785461425781, 0.0030384063720703125, 0.005530834197998047, 0.008023262023925781, 0.010515689849853516, 0.01300811767578125, 0.015500545501708984, 0.01799297332763672, 0.020485401153564453, 0.022977828979492188, 0.025470256805419922, 0.027962684631347656, 0.03045511245727539, 0.032947540283203125, 0.03543996810913086, 0.037932395935058594, 0.04042482376098633, 0.04291725158691406, 0.0454096794128418, 0.04790210723876953, 0.050394535064697266, 0.052886962890625]}, "gradients/encoder.encoder.layers.8.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 2.0, 1.0, 8.0, 3.0, 1.0, 3.0, 7.0, 9.0, 5.0, 11.0, 12.0, 17.0, 23.0, 29.0, 40.0, 52.0, 88.0, 124.0, 236.0, 419.0, 769.0, 2102.0, 8743.0, 82487.0, 826356.0, 112402.0, 10270.0, 2376.0, 889.0, 386.0, 227.0, 136.0, 96.0, 62.0, 32.0, 34.0, 25.0, 23.0, 10.0, 8.0, 9.0, 10.0, 3.0, 3.0, 9.0, 1.0, 3.0, 1.0, 1.0, 1.0, 3.0, 1.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.147705078125, -0.14316749572753906, -0.13862991333007812, -0.1340923309326172, -0.12955474853515625, -0.1250171661376953, -0.12047958374023438, -0.11594200134277344, -0.1114044189453125, -0.10686683654785156, -0.10232925415039062, -0.09779167175292969, -0.09325408935546875, -0.08871650695800781, -0.08417892456054688, -0.07964134216308594, -0.075103759765625, -0.07056617736816406, -0.06602859497070312, -0.06149101257324219, -0.05695343017578125, -0.05241584777832031, -0.047878265380859375, -0.04334068298339844, -0.0388031005859375, -0.03426551818847656, -0.029727935791015625, -0.025190353393554688, -0.02065277099609375, -0.016115188598632812, -0.011577606201171875, -0.0070400238037109375, -0.00250244140625, 0.0020351409912109375, 0.006572723388671875, 0.011110305786132812, 0.01564788818359375, 0.020185470581054688, 0.024723052978515625, 0.029260635375976562, 0.0337982177734375, 0.03833580017089844, 0.042873382568359375, 0.04741096496582031, 0.05194854736328125, 0.05648612976074219, 0.061023712158203125, 0.06556129455566406, 0.070098876953125, 0.07463645935058594, 0.07917404174804688, 0.08371162414550781, 0.08824920654296875, 0.09278678894042969, 0.09732437133789062, 0.10186195373535156, 0.1063995361328125, 0.11093711853027344, 0.11547470092773438, 0.12001228332519531, 0.12454986572265625, 0.1290874481201172, 0.13362503051757812, 0.13816261291503906, 0.1427001953125]}, "gradients/encoder.encoder.layers.8.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 3.0, 1.0, 0.0, 2.0, 0.0, 1.0, 3.0, 7.0, 8.0, 6.0, 4.0, 9.0, 11.0, 11.0, 11.0, 14.0, 17.0, 17.0, 17.0, 31.0, 25.0, 39.0, 36.0, 37.0, 40.0, 41.0, 45.0, 44.0, 53.0, 45.0, 40.0, 41.0, 37.0, 44.0, 23.0, 28.0, 33.0, 33.0, 18.0, 26.0, 13.0, 13.0, 21.0, 9.0, 14.0, 7.0, 14.0, 5.0, 7.0, 2.0, 1.0, 6.0, 2.0, 1.0, 0.0, 3.0, 0.0, 1.0], "bins": [-0.1690673828125, -0.1641979217529297, -0.15932846069335938, -0.15445899963378906, -0.14958953857421875, -0.14472007751464844, -0.13985061645507812, -0.1349811553955078, -0.1301116943359375, -0.1252422332763672, -0.12037277221679688, -0.11550331115722656, -0.11063385009765625, -0.10576438903808594, -0.10089492797851562, -0.09602546691894531, -0.091156005859375, -0.08628654479980469, -0.08141708374023438, -0.07654762268066406, -0.07167816162109375, -0.06680870056152344, -0.061939239501953125, -0.05706977844238281, -0.0522003173828125, -0.04733085632324219, -0.042461395263671875, -0.03759193420410156, -0.03272247314453125, -0.027853012084960938, -0.022983551025390625, -0.018114089965820312, -0.01324462890625, -0.008375167846679688, -0.003505706787109375, 0.0013637542724609375, 0.00623321533203125, 0.011102676391601562, 0.015972137451171875, 0.020841598510742188, 0.0257110595703125, 0.030580520629882812, 0.035449981689453125, 0.04031944274902344, 0.04518890380859375, 0.05005836486816406, 0.054927825927734375, 0.05979728698730469, 0.064666748046875, 0.06953620910644531, 0.07440567016601562, 0.07927513122558594, 0.08414459228515625, 0.08901405334472656, 0.09388351440429688, 0.09875297546386719, 0.1036224365234375, 0.10849189758300781, 0.11336135864257812, 0.11823081970214844, 0.12310028076171875, 0.12796974182128906, 0.13283920288085938, 0.1377086639404297, 0.142578125]}, "gradients/encoder.encoder.layers.8.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 3.0, 5.0, 7.0, 10.0, 13.0, 11.0, 15.0, 22.0, 33.0, 51.0, 76.0, 139.0, 296.0, 640.0, 2006.0, 8955.0, 102162.0, 856275.0, 67996.0, 6962.0, 1674.0, 577.0, 272.0, 140.0, 75.0, 46.0, 23.0, 16.0, 16.0, 12.0, 8.0, 8.0, 3.0, 3.0, 1.0, 2.0, 7.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.048828125, -0.04745006561279297, -0.04607200622558594, -0.044693946838378906, -0.043315887451171875, -0.041937828063964844, -0.04055976867675781, -0.03918170928955078, -0.03780364990234375, -0.03642559051513672, -0.03504753112792969, -0.033669471740722656, -0.032291412353515625, -0.030913352966308594, -0.029535293579101562, -0.02815723419189453, -0.0267791748046875, -0.02540111541748047, -0.024023056030273438, -0.022644996643066406, -0.021266937255859375, -0.019888877868652344, -0.018510818481445312, -0.01713275909423828, -0.01575469970703125, -0.014376640319824219, -0.012998580932617188, -0.011620521545410156, -0.010242462158203125, -0.008864402770996094, -0.0074863433837890625, -0.006108283996582031, -0.004730224609375, -0.0033521652221679688, -0.0019741058349609375, -0.0005960464477539062, 0.000782012939453125, 0.0021600723266601562, 0.0035381317138671875, 0.004916191101074219, 0.00629425048828125, 0.007672309875488281, 0.009050369262695312, 0.010428428649902344, 0.011806488037109375, 0.013184547424316406, 0.014562606811523438, 0.01594066619873047, 0.0173187255859375, 0.01869678497314453, 0.020074844360351562, 0.021452903747558594, 0.022830963134765625, 0.024209022521972656, 0.025587081909179688, 0.02696514129638672, 0.02834320068359375, 0.02972126007080078, 0.031099319458007812, 0.032477378845214844, 0.033855438232421875, 0.035233497619628906, 0.03661155700683594, 0.03798961639404297, 0.03936767578125]}, "gradients/encoder.encoder.layers.8.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0, 2.0, 2.0, 6.0, 1.0, 3.0, 8.0, 18.0, 6.0, 16.0, 20.0, 30.0, 47.0, 42.0, 64.0, 64.0, 73.0, 87.0, 76.0, 69.0, 91.0, 63.0, 49.0, 53.0, 22.0, 23.0, 24.0, 11.0, 12.0, 11.0, 3.0, 3.0, 6.0, 1.0, 3.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-9.834766387939453e-06, -9.57585871219635e-06, -9.316951036453247e-06, -9.058043360710144e-06, -8.799135684967041e-06, -8.540228009223938e-06, -8.281320333480835e-06, -8.022412657737732e-06, -7.763504981994629e-06, -7.504597306251526e-06, -7.245689630508423e-06, -6.98678195476532e-06, -6.727874279022217e-06, -6.468966603279114e-06, -6.210058927536011e-06, -5.951151251792908e-06, -5.692243576049805e-06, -5.433335900306702e-06, -5.174428224563599e-06, -4.915520548820496e-06, -4.656612873077393e-06, -4.3977051973342896e-06, -4.1387975215911865e-06, -3.8798898458480835e-06, -3.6209821701049805e-06, -3.3620744943618774e-06, -3.1031668186187744e-06, -2.8442591428756714e-06, -2.5853514671325684e-06, -2.3264437913894653e-06, -2.0675361156463623e-06, -1.8086284399032593e-06, -1.5497207641601562e-06, -1.2908130884170532e-06, -1.0319054126739502e-06, -7.729977369308472e-07, -5.140900611877441e-07, -2.551823854446411e-07, 3.725290298461914e-09, 2.6263296604156494e-07, 5.21540641784668e-07, 7.80448317527771e-07, 1.039355993270874e-06, 1.298263669013977e-06, 1.55717134475708e-06, 1.816079020500183e-06, 2.074986696243286e-06, 2.333894371986389e-06, 2.592802047729492e-06, 2.8517097234725952e-06, 3.1106173992156982e-06, 3.3695250749588013e-06, 3.6284327507019043e-06, 3.887340426445007e-06, 4.14624810218811e-06, 4.405155777931213e-06, 4.664063453674316e-06, 4.9229711294174194e-06, 5.1818788051605225e-06, 5.4407864809036255e-06, 5.6996941566467285e-06, 5.9586018323898315e-06, 6.2175095081329346e-06, 6.476417183876038e-06, 6.735324859619141e-06]}, "gradients/encoder.encoder.layers.8.attention.q_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 2.0, 4.0, 4.0, 2.0, 5.0, 3.0, 5.0, 15.0, 27.0, 36.0, 53.0, 98.0, 175.0, 394.0, 827.0, 2495.0, 11321.0, 137940.0, 831947.0, 53454.0, 6775.0, 1724.0, 622.0, 284.0, 140.0, 79.0, 48.0, 21.0, 14.0, 12.0, 8.0, 3.0, 6.0, 5.0, 3.0, 2.0, 1.0, 1.0, 4.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.0526123046875, -0.050933837890625, -0.04925537109375, -0.047576904296875, -0.0458984375, -0.044219970703125, -0.04254150390625, -0.040863037109375, -0.0391845703125, -0.037506103515625, -0.03582763671875, -0.034149169921875, -0.032470703125, -0.030792236328125, -0.02911376953125, -0.027435302734375, -0.0257568359375, -0.024078369140625, -0.02239990234375, -0.020721435546875, -0.01904296875, -0.017364501953125, -0.01568603515625, -0.014007568359375, -0.0123291015625, -0.010650634765625, -0.00897216796875, -0.007293701171875, -0.005615234375, -0.003936767578125, -0.00225830078125, -0.000579833984375, 0.0010986328125, 0.002777099609375, 0.00445556640625, 0.006134033203125, 0.0078125, 0.009490966796875, 0.01116943359375, 0.012847900390625, 0.0145263671875, 0.016204833984375, 0.01788330078125, 0.019561767578125, 0.021240234375, 0.022918701171875, 0.02459716796875, 0.026275634765625, 0.0279541015625, 0.029632568359375, 0.03131103515625, 0.032989501953125, 0.03466796875, 0.036346435546875, 0.03802490234375, 0.039703369140625, 0.0413818359375, 0.043060302734375, 0.04473876953125, 0.046417236328125, 0.048095703125, 0.049774169921875, 0.05145263671875, 0.053131103515625, 0.0548095703125]}, "gradients/encoder.encoder.layers.8.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 0.0, 1.0, 3.0, 2.0, 6.0, 3.0, 5.0, 9.0, 9.0, 13.0, 25.0, 35.0, 40.0, 56.0, 77.0, 93.0, 130.0, 91.0, 97.0, 84.0, 50.0, 57.0, 38.0, 22.0, 17.0, 12.0, 15.0, 5.0, 4.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.044342041015625, -0.04303741455078125, -0.0417327880859375, -0.04042816162109375, -0.03912353515625, -0.03781890869140625, -0.0365142822265625, -0.03520965576171875, -0.033905029296875, -0.03260040283203125, -0.0312957763671875, -0.02999114990234375, -0.0286865234375, -0.02738189697265625, -0.0260772705078125, -0.02477264404296875, -0.023468017578125, -0.02216339111328125, -0.0208587646484375, -0.01955413818359375, -0.01824951171875, -0.01694488525390625, -0.0156402587890625, -0.01433563232421875, -0.013031005859375, -0.01172637939453125, -0.0104217529296875, -0.00911712646484375, -0.0078125, -0.00650787353515625, -0.0052032470703125, -0.00389862060546875, -0.002593994140625, -0.00128936767578125, 1.52587890625e-05, 0.00131988525390625, 0.00262451171875, 0.00392913818359375, 0.0052337646484375, 0.00653839111328125, 0.007843017578125, 0.00914764404296875, 0.0104522705078125, 0.01175689697265625, 0.0130615234375, 0.01436614990234375, 0.0156707763671875, 0.01697540283203125, 0.018280029296875, 0.01958465576171875, 0.0208892822265625, 0.02219390869140625, 0.02349853515625, 0.02480316162109375, 0.0261077880859375, 0.02741241455078125, 0.028717041015625, 0.03002166748046875, 0.0313262939453125, 0.03263092041015625, 0.033935546875, 0.03524017333984375, 0.0365447998046875, 0.03784942626953125, 0.039154052734375]}, "gradients/encoder.encoder.layers.8.layer_norm.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 2.0, 2.0, 1.0, 9.0, 12.0, 29.0, 94.0, 228.0, 325.0, 214.0, 65.0, 22.0, 10.0, 4.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.5948129296302795, -0.5469017624855042, -0.49899062514305115, -0.45107948780059814, -0.40316832065582275, -0.35525715351104736, -0.30734601616859436, -0.25943487882614136, -0.21152371168136597, -0.16361255943775177, -0.11570140719413757, -0.06779025495052338, -0.01987910270690918, 0.028032049536705017, 0.07594320178031921, 0.12385433912277222, 0.1717655062675476, 0.2196766585111618, 0.267587810754776, 0.315498948097229, 0.3634101152420044, 0.4113212823867798, 0.4592324197292328, 0.5071435570716858, 0.5550547242164612, 0.6029658913612366, 0.6508769989013672, 0.6987881660461426, 0.746699333190918, 0.7946105003356934, 0.8425216674804688, 0.8904327750205994, 0.9383440017700195, 0.9862551689147949, 1.0341663360595703, 1.0820775032043457, 1.129988670349121, 1.177899718284607, 1.2258108854293823, 1.2737220525741577, 1.321633219718933, 1.3695443868637085, 1.4174555540084839, 1.4653667211532593, 1.5132777690887451, 1.5611889362335205, 1.609100103378296, 1.6570112705230713, 1.7049224376678467, 1.752833604812622, 1.8007447719573975, 1.8486559391021729, 1.8965671062469482, 1.944478154182434, 1.9923893213272095, 2.0403003692626953, 2.0882115364074707, 2.136122703552246, 2.1840338706970215, 2.231945037841797, 2.2798562049865723, 2.3277673721313477, 2.375678539276123, 2.4235897064208984, 2.471500873565674]}, "gradients/encoder.encoder.layers.8.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 2.0, 1.0, 0.0, 3.0, 12.0, 6.0, 10.0, 11.0, 10.0, 16.0, 10.0, 30.0, 20.0, 35.0, 38.0, 44.0, 46.0, 45.0, 60.0, 60.0, 60.0, 47.0, 67.0, 59.0, 43.0, 43.0, 39.0, 44.0, 22.0, 24.0, 22.0, 27.0, 13.0, 8.0, 8.0, 9.0, 5.0, 3.0, 6.0, 4.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7938513159751892, -0.7662767767906189, -0.7387021780014038, -0.7111276388168335, -0.6835530996322632, -0.6559785008430481, -0.6284039616584778, -0.6008293628692627, -0.5732548236846924, -0.5456802845001221, -0.518105685710907, -0.49053114652633667, -0.46295657753944397, -0.43538200855255127, -0.40780746936798096, -0.38023290038108826, -0.35265833139419556, -0.32508376240730286, -0.29750919342041016, -0.26993465423583984, -0.24236008524894714, -0.21478551626205444, -0.18721096217632294, -0.15963640809059143, -0.13206183910369873, -0.10448727756738663, -0.07691271603107452, -0.04933815449476242, -0.021763592958450317, 0.005810976028442383, 0.03338553011417389, 0.060960084199905396, 0.0885346531867981, 0.1161092147231102, 0.1436837762594223, 0.1712583303451538, 0.1988328993320465, 0.2264074683189392, 0.2539820075035095, 0.2815565764904022, 0.3091311454772949, 0.3367057144641876, 0.3642802834510803, 0.39185482263565063, 0.41942939162254333, 0.44700396060943604, 0.47457849979400635, 0.5021530389785767, 0.5297276377677917, 0.5573021769523621, 0.5848767757415771, 0.6124513149261475, 0.6400258541107178, 0.6676004528999329, 0.6951749920845032, 0.7227495908737183, 0.7503241300582886, 0.7778986692428589, 0.805473268032074, 0.8330478072166443, 0.8606224060058594, 0.8881969451904297, 0.915771484375, 0.9433460235595703, 0.9709206223487854]}, "gradients/encoder.encoder.layers.7.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 2.0, 3.0, 2.0, 4.0, 7.0, 6.0, 7.0, 6.0, 8.0, 9.0, 23.0, 23.0, 52.0, 50.0, 86.0, 157.0, 287.0, 602.0, 1683.0, 7791.0, 99799.0, 4025171.0, 49380.0, 6563.0, 1501.0, 513.0, 261.0, 118.0, 70.0, 33.0, 23.0, 20.0, 7.0, 7.0, 4.0, 3.0, 4.0, 2.0, 3.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.31494140625, -0.30733299255371094, -0.2997245788574219, -0.2921161651611328, -0.28450775146484375, -0.2768993377685547, -0.2692909240722656, -0.26168251037597656, -0.2540740966796875, -0.24646568298339844, -0.23885726928710938, -0.2312488555908203, -0.22364044189453125, -0.2160320281982422, -0.20842361450195312, -0.20081520080566406, -0.193206787109375, -0.18559837341308594, -0.17798995971679688, -0.1703815460205078, -0.16277313232421875, -0.1551647186279297, -0.14755630493164062, -0.13994789123535156, -0.1323394775390625, -0.12473106384277344, -0.11712265014648438, -0.10951423645019531, -0.10190582275390625, -0.09429740905761719, -0.08668899536132812, -0.07908058166503906, -0.07147216796875, -0.06386375427246094, -0.056255340576171875, -0.04864692687988281, -0.04103851318359375, -0.03343009948730469, -0.025821685791015625, -0.018213272094726562, -0.0106048583984375, -0.0029964447021484375, 0.004611968994140625, 0.012220382690429688, 0.01982879638671875, 0.027437210083007812, 0.035045623779296875, 0.04265403747558594, 0.050262451171875, 0.05787086486816406, 0.06547927856445312, 0.07308769226074219, 0.08069610595703125, 0.08830451965332031, 0.09591293334960938, 0.10352134704589844, 0.1111297607421875, 0.11873817443847656, 0.12634658813476562, 0.1339550018310547, 0.14156341552734375, 0.1491718292236328, 0.15678024291992188, 0.16438865661621094, 0.1719970703125]}, "gradients/encoder.encoder.layers.7.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 8.0, 9.0, 15.0, 37.0, 47.0, 52.0, 59.0, 82.0, 92.0, 90.0, 94.0, 96.0, 82.0, 82.0, 43.0, 43.0, 24.0, 18.0, 10.0, 9.0, 2.0, 4.0, 3.0, 2.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09735107421875, -0.09479045867919922, -0.09222984313964844, -0.08966922760009766, -0.08710861206054688, -0.0845479965209961, -0.08198738098144531, -0.07942676544189453, -0.07686614990234375, -0.07430553436279297, -0.07174491882324219, -0.0691843032836914, -0.06662368774414062, -0.06406307220458984, -0.06150245666503906, -0.05894184112548828, -0.0563812255859375, -0.05382061004638672, -0.05125999450683594, -0.048699378967285156, -0.046138763427734375, -0.043578147888183594, -0.04101753234863281, -0.03845691680908203, -0.03589630126953125, -0.03333568572998047, -0.030775070190429688, -0.028214454650878906, -0.025653839111328125, -0.023093223571777344, -0.020532608032226562, -0.01797199249267578, -0.015411376953125, -0.012850761413574219, -0.010290145874023438, -0.007729530334472656, -0.005168914794921875, -0.0026082992553710938, -4.76837158203125e-05, 0.0025129318237304688, 0.00507354736328125, 0.007634162902832031, 0.010194778442382812, 0.012755393981933594, 0.015316009521484375, 0.017876625061035156, 0.020437240600585938, 0.02299785614013672, 0.0255584716796875, 0.02811908721923828, 0.030679702758789062, 0.033240318298339844, 0.035800933837890625, 0.038361549377441406, 0.04092216491699219, 0.04348278045654297, 0.04604339599609375, 0.04860401153564453, 0.05116462707519531, 0.053725242614746094, 0.056285858154296875, 0.058846473693847656, 0.06140708923339844, 0.06396770477294922, 0.0665283203125]}, "gradients/encoder.encoder.layers.7.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 3.0, 1.0, 1.0, 5.0, 6.0, 1.0, 1.0, 8.0, 14.0, 8.0, 9.0, 25.0, 29.0, 59.0, 94.0, 206.0, 608.0, 2174.0, 11619.0, 187496.0, 3945584.0, 39759.0, 4753.0, 1132.0, 344.0, 133.0, 77.0, 42.0, 26.0, 23.0, 14.0, 5.0, 6.0, 5.0, 8.0, 4.0, 2.0, 1.0, 2.0, 0.0, 0.0, 1.0, 3.0, 1.0, 1.0, 2.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.2471923828125, -0.2399768829345703, -0.23276138305664062, -0.22554588317871094, -0.21833038330078125, -0.21111488342285156, -0.20389938354492188, -0.1966838836669922, -0.1894683837890625, -0.1822528839111328, -0.17503738403320312, -0.16782188415527344, -0.16060638427734375, -0.15339088439941406, -0.14617538452148438, -0.1389598846435547, -0.131744384765625, -0.12452888488769531, -0.11731338500976562, -0.11009788513183594, -0.10288238525390625, -0.09566688537597656, -0.08845138549804688, -0.08123588562011719, -0.0740203857421875, -0.06680488586425781, -0.059589385986328125, -0.05237388610839844, -0.04515838623046875, -0.03794288635253906, -0.030727386474609375, -0.023511886596679688, -0.01629638671875, -0.009080886840820312, -0.001865386962890625, 0.0053501129150390625, 0.01256561279296875, 0.019781112670898438, 0.026996612548828125, 0.03421211242675781, 0.0414276123046875, 0.04864311218261719, 0.055858612060546875, 0.06307411193847656, 0.07028961181640625, 0.07750511169433594, 0.08472061157226562, 0.09193611145019531, 0.099151611328125, 0.10636711120605469, 0.11358261108398438, 0.12079811096191406, 0.12801361083984375, 0.13522911071777344, 0.14244461059570312, 0.1496601104736328, 0.1568756103515625, 0.1640911102294922, 0.17130661010742188, 0.17852210998535156, 0.18573760986328125, 0.19295310974121094, 0.20016860961914062, 0.2073841094970703, 0.214599609375]}, "gradients/encoder.encoder.layers.7.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 7.0, 12.0, 15.0, 14.0, 15.0, 15.0, 19.0, 22.0, 47.0, 79.0, 132.0, 502.0, 1622.0, 973.0, 277.0, 97.0, 51.0, 38.0, 34.0, 20.0, 17.0, 17.0, 10.0, 11.0, 8.0, 5.0, 5.0, 3.0, 3.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.1353759765625, -0.13147544860839844, -0.12757492065429688, -0.12367439270019531, -0.11977386474609375, -0.11587333679199219, -0.11197280883789062, -0.10807228088378906, -0.1041717529296875, -0.10027122497558594, -0.09637069702148438, -0.09247016906738281, -0.08856964111328125, -0.08466911315917969, -0.08076858520507812, -0.07686805725097656, -0.072967529296875, -0.06906700134277344, -0.06516647338867188, -0.06126594543457031, -0.05736541748046875, -0.05346488952636719, -0.049564361572265625, -0.04566383361816406, -0.0417633056640625, -0.03786277770996094, -0.033962249755859375, -0.030061721801757812, -0.02616119384765625, -0.022260665893554688, -0.018360137939453125, -0.014459609985351562, -0.01055908203125, -0.0066585540771484375, -0.002758026123046875, 0.0011425018310546875, 0.00504302978515625, 0.008943557739257812, 0.012844085693359375, 0.016744613647460938, 0.0206451416015625, 0.024545669555664062, 0.028446197509765625, 0.03234672546386719, 0.03624725341796875, 0.04014778137207031, 0.044048309326171875, 0.04794883728027344, 0.051849365234375, 0.05574989318847656, 0.059650421142578125, 0.06355094909667969, 0.06745147705078125, 0.07135200500488281, 0.07525253295898438, 0.07915306091308594, 0.0830535888671875, 0.08695411682128906, 0.09085464477539062, 0.09475517272949219, 0.09865570068359375, 0.10255622863769531, 0.10645675659179688, 0.11035728454589844, 0.1142578125]}, "gradients/encoder.encoder.layers.7.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 4.0, 1.0, 2.0, 3.0, 7.0, 5.0, 23.0, 26.0, 39.0, 98.0, 166.0, 217.0, 173.0, 108.0, 60.0, 31.0, 13.0, 13.0, 11.0, 4.0, 3.0, 3.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.1522479057312012, -1.1167049407958984, -1.0811619758605957, -1.045619010925293, -1.0100760459899902, -0.9745330810546875, -0.9389901161193848, -0.903447151184082, -0.8679041862487793, -0.8323612213134766, -0.7968182563781738, -0.7612752914428711, -0.7257323265075684, -0.6901893615722656, -0.6546463966369629, -0.6191034317016602, -0.5835604071617126, -0.5480174422264099, -0.5124744772911072, -0.47693151235580444, -0.4413885474205017, -0.405845582485199, -0.37030258774757385, -0.3347596228122711, -0.2992166578769684, -0.26367369294166565, -0.22813072800636292, -0.192587748169899, -0.15704478323459625, -0.12150181829929352, -0.08595883846282959, -0.050415873527526855, -0.014872908592224121, 0.020670060068368912, 0.056213028728961945, 0.09175600111484528, 0.127298966050148, 0.16284193098545074, 0.19838491082191467, 0.2339278757572174, 0.26947084069252014, 0.3050138056278229, 0.3405567705631256, 0.37609976530075073, 0.41164273023605347, 0.4471856951713562, 0.48272866010665894, 0.5182716250419617, 0.5538145899772644, 0.5893575549125671, 0.6249005198478699, 0.6604434847831726, 0.6959864497184753, 0.7315294146537781, 0.7670724391937256, 0.8026154041290283, 0.838158369064331, 0.8737013339996338, 0.9092442989349365, 0.9447872638702393, 0.980330228805542, 1.0158731937408447, 1.0514161586761475, 1.0869591236114502, 1.122502088546753]}, "gradients/encoder.encoder.layers.7.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 2.0, 1.0, 3.0, 1.0, 5.0, 8.0, 11.0, 15.0, 13.0, 8.0, 12.0, 16.0, 24.0, 16.0, 25.0, 28.0, 30.0, 33.0, 31.0, 37.0, 39.0, 32.0, 43.0, 32.0, 41.0, 37.0, 45.0, 35.0, 42.0, 37.0, 41.0, 42.0, 41.0, 27.0, 20.0, 26.0, 19.0, 14.0, 11.0, 8.0, 12.0, 9.0, 11.0, 5.0, 6.0, 4.0, 4.0, 4.0, 4.0, 2.0, 1.0, 1.0, 2.0], "bins": [-0.48485374450683594, -0.47137588262557983, -0.45789802074432373, -0.4444201588630676, -0.4309422969818115, -0.4174644351005554, -0.4039865732192993, -0.3905087113380432, -0.3770308494567871, -0.363552987575531, -0.3500751256942749, -0.3365972638130188, -0.3231194019317627, -0.3096415400505066, -0.2961636781692505, -0.2826858162879944, -0.2692079544067383, -0.2557300925254822, -0.24225223064422607, -0.22877436876296997, -0.21529650688171387, -0.20181864500045776, -0.18834078311920166, -0.17486292123794556, -0.16138502955436707, -0.14790716767311096, -0.13442930579185486, -0.12095144391059875, -0.10747358202934265, -0.09399571269750595, -0.08051785081624985, -0.06703998893499374, -0.05356213450431824, -0.040084272623062134, -0.02660640887916088, -0.013128545135259628, 0.0003493167459964752, 0.013827182352542877, 0.02730504423379898, 0.040782906115055084, 0.05426076799631119, 0.06773862987756729, 0.0812164917588234, 0.0946943610906601, 0.1081722229719162, 0.1216500848531723, 0.1351279467344284, 0.1486058086156845, 0.1620836704969406, 0.17556153237819672, 0.18903939425945282, 0.20251725614070892, 0.21599511802196503, 0.22947299480438232, 0.24295085668563843, 0.25642871856689453, 0.26990658044815063, 0.28338444232940674, 0.29686230421066284, 0.31034016609191895, 0.32381802797317505, 0.33729588985443115, 0.35077375173568726, 0.36425161361694336, 0.37772947549819946]}, "gradients/encoder.encoder.layers.7.attention.out_proj.weight": {"_type": "histogram", "values": [3.0, 3.0, 5.0, 6.0, 7.0, 9.0, 12.0, 4.0, 16.0, 22.0, 43.0, 47.0, 49.0, 66.0, 110.0, 167.0, 285.0, 433.0, 957.0, 2108.0, 5913.0, 21581.0, 120949.0, 619955.0, 227550.0, 34245.0, 8518.0, 2881.0, 1188.0, 546.0, 290.0, 176.0, 105.0, 82.0, 62.0, 39.0, 32.0, 21.0, 21.0, 16.0, 7.0, 5.0, 7.0, 14.0, 3.0, 1.0, 3.0, 3.0, 2.0, 3.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.1029052734375, -0.09855461120605469, -0.09420394897460938, -0.08985328674316406, -0.08550262451171875, -0.08115196228027344, -0.07680130004882812, -0.07245063781738281, -0.0680999755859375, -0.06374931335449219, -0.059398651123046875, -0.05504798889160156, -0.05069732666015625, -0.04634666442871094, -0.041996002197265625, -0.03764533996582031, -0.033294677734375, -0.028944015502929688, -0.024593353271484375, -0.020242691040039062, -0.01589202880859375, -0.011541366577148438, -0.007190704345703125, -0.0028400421142578125, 0.0015106201171875, 0.0058612823486328125, 0.010211944580078125, 0.014562606811523438, 0.01891326904296875, 0.023263931274414062, 0.027614593505859375, 0.03196525573730469, 0.03631591796875, 0.04066658020019531, 0.045017242431640625, 0.04936790466308594, 0.05371856689453125, 0.05806922912597656, 0.062419891357421875, 0.06677055358886719, 0.0711212158203125, 0.07547187805175781, 0.07982254028320312, 0.08417320251464844, 0.08852386474609375, 0.09287452697753906, 0.09722518920898438, 0.10157585144042969, 0.105926513671875, 0.11027717590332031, 0.11462783813476562, 0.11897850036621094, 0.12332916259765625, 0.12767982482910156, 0.13203048706054688, 0.1363811492919922, 0.1407318115234375, 0.1450824737548828, 0.14943313598632812, 0.15378379821777344, 0.15813446044921875, 0.16248512268066406, 0.16683578491210938, 0.1711864471435547, 0.175537109375]}, "gradients/encoder.encoder.layers.7.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 3.0, 0.0, 1.0, 2.0, 7.0, 13.0, 14.0, 28.0, 39.0, 45.0, 56.0, 74.0, 67.0, 80.0, 88.0, 91.0, 81.0, 68.0, 61.0, 65.0, 37.0, 32.0, 17.0, 16.0, 13.0, 6.0, 2.0, 2.0, 2.0, 1.0, 1.0, 2.0, 1.0, 0.0, 2.0, 0.0, 2.0], "bins": [-0.09893798828125, -0.09657478332519531, -0.09421157836914062, -0.09184837341308594, -0.08948516845703125, -0.08712196350097656, -0.08475875854492188, -0.08239555358886719, -0.0800323486328125, -0.07766914367675781, -0.07530593872070312, -0.07294273376464844, -0.07057952880859375, -0.06821632385253906, -0.06585311889648438, -0.06348991394042969, -0.061126708984375, -0.05876350402832031, -0.056400299072265625, -0.05403709411621094, -0.05167388916015625, -0.04931068420410156, -0.046947479248046875, -0.04458427429199219, -0.0422210693359375, -0.03985786437988281, -0.037494659423828125, -0.03513145446777344, -0.03276824951171875, -0.030405044555664062, -0.028041839599609375, -0.025678634643554688, -0.0233154296875, -0.020952224731445312, -0.018589019775390625, -0.016225814819335938, -0.01386260986328125, -0.011499404907226562, -0.009136199951171875, -0.0067729949951171875, -0.0044097900390625, -0.0020465850830078125, 0.000316619873046875, 0.0026798248291015625, 0.00504302978515625, 0.0074062347412109375, 0.009769439697265625, 0.012132644653320312, 0.014495849609375, 0.016859054565429688, 0.019222259521484375, 0.021585464477539062, 0.02394866943359375, 0.026311874389648438, 0.028675079345703125, 0.031038284301757812, 0.0334014892578125, 0.03576469421386719, 0.038127899169921875, 0.04049110412597656, 0.04285430908203125, 0.04521751403808594, 0.047580718994140625, 0.04994392395019531, 0.05230712890625]}, "gradients/encoder.encoder.layers.7.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 4.0, 2.0, 4.0, 2.0, 5.0, 2.0, 3.0, 8.0, 5.0, 6.0, 13.0, 12.0, 16.0, 14.0, 31.0, 42.0, 49.0, 49.0, 100.0, 135.0, 206.0, 324.0, 546.0, 1191.0, 2749.0, 10181.0, 83121.0, 826145.0, 106569.0, 11173.0, 2976.0, 1166.0, 652.0, 364.0, 234.0, 133.0, 88.0, 57.0, 48.0, 37.0, 22.0, 27.0, 14.0, 10.0, 9.0, 7.0, 9.0, 5.0, 1.0, 0.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.151123046875, -0.1458282470703125, -0.140533447265625, -0.1352386474609375, -0.12994384765625, -0.1246490478515625, -0.119354248046875, -0.1140594482421875, -0.1087646484375, -0.1034698486328125, -0.098175048828125, -0.0928802490234375, -0.08758544921875, -0.0822906494140625, -0.076995849609375, -0.0717010498046875, -0.06640625, -0.0611114501953125, -0.055816650390625, -0.0505218505859375, -0.04522705078125, -0.0399322509765625, -0.034637451171875, -0.0293426513671875, -0.0240478515625, -0.0187530517578125, -0.013458251953125, -0.0081634521484375, -0.00286865234375, 0.0024261474609375, 0.007720947265625, 0.0130157470703125, 0.018310546875, 0.0236053466796875, 0.028900146484375, 0.0341949462890625, 0.03948974609375, 0.0447845458984375, 0.050079345703125, 0.0553741455078125, 0.0606689453125, 0.0659637451171875, 0.071258544921875, 0.0765533447265625, 0.08184814453125, 0.0871429443359375, 0.092437744140625, 0.0977325439453125, 0.10302734375, 0.1083221435546875, 0.113616943359375, 0.1189117431640625, 0.12420654296875, 0.1295013427734375, 0.134796142578125, 0.1400909423828125, 0.1453857421875, 0.1506805419921875, 0.155975341796875, 0.1612701416015625, 0.16656494140625, 0.1718597412109375, 0.177154541015625, 0.1824493408203125, 0.187744140625]}, "gradients/encoder.encoder.layers.7.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 0.0, 1.0, 1.0, 0.0, 1.0, 3.0, 4.0, 3.0, 7.0, 9.0, 8.0, 14.0, 22.0, 14.0, 24.0, 27.0, 28.0, 45.0, 47.0, 52.0, 52.0, 52.0, 47.0, 59.0, 58.0, 51.0, 42.0, 50.0, 47.0, 50.0, 43.0, 34.0, 25.0, 21.0, 20.0, 12.0, 15.0, 7.0, 4.0, 4.0, 3.0, 2.0, 4.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.2047119140625, -0.1985454559326172, -0.19237899780273438, -0.18621253967285156, -0.18004608154296875, -0.17387962341308594, -0.16771316528320312, -0.1615467071533203, -0.1553802490234375, -0.1492137908935547, -0.14304733276367188, -0.13688087463378906, -0.13071441650390625, -0.12454795837402344, -0.11838150024414062, -0.11221504211425781, -0.106048583984375, -0.09988212585449219, -0.09371566772460938, -0.08754920959472656, -0.08138275146484375, -0.07521629333496094, -0.06904983520507812, -0.06288337707519531, -0.0567169189453125, -0.05055046081542969, -0.044384002685546875, -0.03821754455566406, -0.03205108642578125, -0.025884628295898438, -0.019718170166015625, -0.013551712036132812, -0.00738525390625, -0.0012187957763671875, 0.004947662353515625, 0.011114120483398438, 0.01728057861328125, 0.023447036743164062, 0.029613494873046875, 0.03577995300292969, 0.0419464111328125, 0.04811286926269531, 0.054279327392578125, 0.06044578552246094, 0.06661224365234375, 0.07277870178222656, 0.07894515991210938, 0.08511161804199219, 0.091278076171875, 0.09744453430175781, 0.10361099243164062, 0.10977745056152344, 0.11594390869140625, 0.12211036682128906, 0.12827682495117188, 0.1344432830810547, 0.1406097412109375, 0.1467761993408203, 0.15294265747070312, 0.15910911560058594, 0.16527557373046875, 0.17144203186035156, 0.17760848999023438, 0.1837749481201172, 0.18994140625]}, "gradients/encoder.encoder.layers.7.attention.k_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 1.0, 2.0, 2.0, 2.0, 1.0, 6.0, 3.0, 6.0, 8.0, 7.0, 19.0, 22.0, 31.0, 37.0, 66.0, 96.0, 150.0, 227.0, 432.0, 779.0, 1667.0, 4273.0, 16599.0, 324409.0, 668460.0, 22295.0, 4950.0, 1919.0, 915.0, 416.0, 263.0, 167.0, 99.0, 70.0, 41.0, 36.0, 22.0, 20.0, 11.0, 4.0, 9.0, 6.0, 3.0, 1.0, 2.0, 4.0, 2.0, 2.0, 1.0, 0.0, 1.0, 2.0], "bins": [-0.080078125, -0.07785844802856445, -0.0756387710571289, -0.07341909408569336, -0.07119941711425781, -0.06897974014282227, -0.06676006317138672, -0.06454038619995117, -0.062320709228515625, -0.06010103225708008, -0.05788135528564453, -0.055661678314208984, -0.05344200134277344, -0.05122232437133789, -0.049002647399902344, -0.0467829704284668, -0.04456329345703125, -0.0423436164855957, -0.040123939514160156, -0.03790426254272461, -0.03568458557128906, -0.033464908599853516, -0.03124523162841797, -0.029025554656982422, -0.026805877685546875, -0.024586200714111328, -0.02236652374267578, -0.020146846771240234, -0.017927169799804688, -0.01570749282836914, -0.013487815856933594, -0.011268138885498047, -0.0090484619140625, -0.006828784942626953, -0.004609107971191406, -0.0023894309997558594, -0.0001697540283203125, 0.0020499229431152344, 0.004269599914550781, 0.006489276885986328, 0.008708953857421875, 0.010928630828857422, 0.013148307800292969, 0.015367984771728516, 0.017587661743164062, 0.01980733871459961, 0.022027015686035156, 0.024246692657470703, 0.02646636962890625, 0.028686046600341797, 0.030905723571777344, 0.03312540054321289, 0.03534507751464844, 0.037564754486083984, 0.03978443145751953, 0.04200410842895508, 0.044223785400390625, 0.04644346237182617, 0.04866313934326172, 0.050882816314697266, 0.05310249328613281, 0.05532217025756836, 0.057541847229003906, 0.05976152420043945, 0.061981201171875]}, "gradients/encoder.encoder.layers.7.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 2.0, 0.0, 2.0, 1.0, 0.0, 1.0, 2.0, 0.0, 2.0, 2.0, 1.0, 3.0, 4.0, 2.0, 7.0, 12.0, 15.0, 12.0, 29.0, 74.0, 108.0, 169.0, 213.0, 150.0, 94.0, 55.0, 26.0, 4.0, 5.0, 10.0, 2.0, 1.0, 2.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.086162567138672e-05, -2.0069070160388947e-05, -1.9276514649391174e-05, -1.8483959138393402e-05, -1.769140362739563e-05, -1.6898848116397858e-05, -1.6106292605400085e-05, -1.5313737094402313e-05, -1.4521181583404541e-05, -1.3728626072406769e-05, -1.2936070561408997e-05, -1.2143515050411224e-05, -1.1350959539413452e-05, -1.055840402841568e-05, -9.765848517417908e-06, -8.973293006420135e-06, -8.180737495422363e-06, -7.388181984424591e-06, -6.595626473426819e-06, -5.803070962429047e-06, -5.010515451431274e-06, -4.217959940433502e-06, -3.42540442943573e-06, -2.6328489184379578e-06, -1.8402934074401855e-06, -1.0477378964424133e-06, -2.551823854446411e-07, 5.373731255531311e-07, 1.3299286365509033e-06, 2.1224841475486755e-06, 2.9150396585464478e-06, 3.70759516954422e-06, 4.500150680541992e-06, 5.292706191539764e-06, 6.085261702537537e-06, 6.877817213535309e-06, 7.670372724533081e-06, 8.462928235530853e-06, 9.255483746528625e-06, 1.0048039257526398e-05, 1.084059476852417e-05, 1.1633150279521942e-05, 1.2425705790519714e-05, 1.3218261301517487e-05, 1.4010816812515259e-05, 1.4803372323513031e-05, 1.5595927834510803e-05, 1.6388483345508575e-05, 1.7181038856506348e-05, 1.797359436750412e-05, 1.8766149878501892e-05, 1.9558705389499664e-05, 2.0351260900497437e-05, 2.114381641149521e-05, 2.193637192249298e-05, 2.2728927433490753e-05, 2.3521482944488525e-05, 2.4314038455486298e-05, 2.510659396648407e-05, 2.5899149477481842e-05, 2.6691704988479614e-05, 2.7484260499477386e-05, 2.827681601047516e-05, 2.906937152147293e-05, 2.9861927032470703e-05]}, "gradients/encoder.encoder.layers.7.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 1.0, 2.0, 3.0, 2.0, 5.0, 5.0, 9.0, 17.0, 24.0, 40.0, 56.0, 127.0, 269.0, 582.0, 1671.0, 6745.0, 75535.0, 923178.0, 33572.0, 4508.0, 1251.0, 489.0, 210.0, 107.0, 60.0, 40.0, 16.0, 11.0, 5.0, 8.0, 3.0, 6.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.091796875, -0.0887746810913086, -0.08575248718261719, -0.08273029327392578, -0.07970809936523438, -0.07668590545654297, -0.07366371154785156, -0.07064151763916016, -0.06761932373046875, -0.06459712982177734, -0.06157493591308594, -0.05855274200439453, -0.055530548095703125, -0.05250835418701172, -0.04948616027832031, -0.046463966369628906, -0.0434417724609375, -0.040419578552246094, -0.03739738464355469, -0.03437519073486328, -0.031352996826171875, -0.02833080291748047, -0.025308609008789062, -0.022286415100097656, -0.01926422119140625, -0.016242027282714844, -0.013219833374023438, -0.010197639465332031, -0.007175445556640625, -0.004153251647949219, -0.0011310577392578125, 0.0018911361694335938, 0.004913330078125, 0.007935523986816406, 0.010957717895507812, 0.013979911804199219, 0.017002105712890625, 0.02002429962158203, 0.023046493530273438, 0.026068687438964844, 0.02909088134765625, 0.032113075256347656, 0.03513526916503906, 0.03815746307373047, 0.041179656982421875, 0.04420185089111328, 0.04722404479980469, 0.050246238708496094, 0.0532684326171875, 0.056290626525878906, 0.05931282043457031, 0.06233501434326172, 0.06535720825195312, 0.06837940216064453, 0.07140159606933594, 0.07442378997802734, 0.07744598388671875, 0.08046817779541016, 0.08349037170410156, 0.08651256561279297, 0.08953475952148438, 0.09255695343017578, 0.09557914733886719, 0.0986013412475586, 0.10162353515625]}, "gradients/encoder.encoder.layers.7.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 2.0, 1.0, 3.0, 5.0, 1.0, 13.0, 13.0, 10.0, 21.0, 32.0, 42.0, 68.0, 82.0, 123.0, 137.0, 134.0, 101.0, 64.0, 43.0, 33.0, 20.0, 18.0, 12.0, 10.0, 1.0, 3.0, 4.0, 4.0, 4.0, 4.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.06280517578125, -0.060944557189941406, -0.05908393859863281, -0.05722332000732422, -0.055362701416015625, -0.05350208282470703, -0.05164146423339844, -0.049780845642089844, -0.04792022705078125, -0.046059608459472656, -0.04419898986816406, -0.04233837127685547, -0.040477752685546875, -0.03861713409423828, -0.03675651550292969, -0.034895896911621094, -0.0330352783203125, -0.031174659729003906, -0.029314041137695312, -0.02745342254638672, -0.025592803955078125, -0.02373218536376953, -0.021871566772460938, -0.020010948181152344, -0.01815032958984375, -0.016289710998535156, -0.014429092407226562, -0.012568473815917969, -0.010707855224609375, -0.008847236633300781, -0.0069866180419921875, -0.005125999450683594, -0.003265380859375, -0.0014047622680664062, 0.0004558563232421875, 0.0023164749145507812, 0.004177093505859375, 0.006037712097167969, 0.007898330688476562, 0.009758949279785156, 0.01161956787109375, 0.013480186462402344, 0.015340805053710938, 0.01720142364501953, 0.019062042236328125, 0.02092266082763672, 0.022783279418945312, 0.024643898010253906, 0.0265045166015625, 0.028365135192871094, 0.030225753784179688, 0.03208637237548828, 0.033946990966796875, 0.03580760955810547, 0.03766822814941406, 0.039528846740722656, 0.04138946533203125, 0.043250083923339844, 0.04511070251464844, 0.04697132110595703, 0.048831939697265625, 0.05069255828857422, 0.05255317687988281, 0.054413795471191406, 0.0562744140625]}, "gradients/encoder.encoder.layers.7.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 4.0, 5.0, 14.0, 31.0, 57.0, 132.0, 214.0, 247.0, 155.0, 77.0, 41.0, 12.0, 6.0, 8.0, 1.0, 2.0, 5.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0], "bins": [-1.839813470840454, -1.7981863021850586, -1.7565590143203735, -1.714931845664978, -1.673304557800293, -1.6316773891448975, -1.590050220489502, -1.5484230518341064, -1.5067957639694214, -1.4651685953140259, -1.4235413074493408, -1.3819141387939453, -1.3402869701385498, -1.2986596822738647, -1.2570325136184692, -1.2154052257537842, -1.1737780570983887, -1.1321508884429932, -1.090523600578308, -1.0488964319229126, -1.0072691440582275, -0.965641975402832, -0.9240148067474365, -0.8823875784873962, -0.840760350227356, -0.7991331219673157, -0.7575058937072754, -0.7158787250518799, -0.6742514967918396, -0.6326242685317993, -0.5909970998764038, -0.5493698716163635, -0.5077426433563232, -0.46611541509628296, -0.42448821663856506, -0.38286101818084717, -0.3412337899208069, -0.2996065616607666, -0.2579793632030487, -0.2163521647453308, -0.17472493648529053, -0.13309772312641144, -0.09147050976753235, -0.04984329640865326, -0.00821608304977417, 0.03341113030910492, 0.07503834366798401, 0.1166655421257019, 0.1582927703857422, 0.19991998374462128, 0.24154719710350037, 0.28317439556121826, 0.32480162382125854, 0.36642885208129883, 0.4080560505390167, 0.4496832489967346, 0.4913104772567749, 0.5329377055168152, 0.5745649337768555, 0.616192102432251, 0.6578193306922913, 0.6994465589523315, 0.741073727607727, 0.7827009558677673, 0.8243281841278076]}, "gradients/encoder.encoder.layers.7.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 3.0, 3.0, 3.0, 7.0, 6.0, 7.0, 11.0, 11.0, 18.0, 14.0, 17.0, 30.0, 35.0, 30.0, 34.0, 29.0, 53.0, 61.0, 60.0, 46.0, 87.0, 54.0, 51.0, 49.0, 39.0, 43.0, 32.0, 27.0, 33.0, 16.0, 16.0, 22.0, 18.0, 13.0, 7.0, 5.0, 9.0, 5.0, 1.0, 1.0, 2.0, 1.0, 1.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7061905860900879, -0.6833276152610779, -0.6604645848274231, -0.6376016139984131, -0.6147386431694031, -0.5918756723403931, -0.5690126419067383, -0.5461496710777283, -0.5232867002487183, -0.5004237294197083, -0.47756072878837585, -0.45469772815704346, -0.43183475732803345, -0.40897175669670105, -0.38610875606536865, -0.36324578523635864, -0.34038278460502625, -0.31751978397369385, -0.29465681314468384, -0.27179381251335144, -0.24893084168434143, -0.22606784105300903, -0.20320485532283783, -0.18034186959266663, -0.15747888386249542, -0.13461589813232422, -0.11175291240215302, -0.08888991922140121, -0.06602693349123001, -0.04316394776105881, -0.020300954580307007, 0.0025620311498641968, 0.0254250168800354, 0.048288002610206604, 0.07115098834037781, 0.09401398152112961, 0.11687696725130081, 0.1397399604320526, 0.16260294616222382, 0.18546593189239502, 0.20832891762256622, 0.23119190335273743, 0.2540549039840698, 0.27691787481307983, 0.29978087544441223, 0.32264384627342224, 0.34550684690475464, 0.36836981773376465, 0.39123281836509705, 0.41409581899642944, 0.43695878982543945, 0.45982179045677185, 0.48268476128578186, 0.5055477619171143, 0.5284107327461243, 0.5512737035751343, 0.5741367340087891, 0.5969997048377991, 0.6198627352714539, 0.6427257061004639, 0.6655886769294739, 0.6884516477584839, 0.7113146781921387, 0.7341776490211487, 0.7570406198501587]}, "gradients/encoder.encoder.layers.6.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 1.0, 2.0, 2.0, 3.0, 4.0, 8.0, 16.0, 56.0, 147.0, 432.0, 4149467.0, 43540.0, 380.0, 147.0, 56.0, 17.0, 8.0, 7.0, 1.0, 1.0, 2.0], "bins": [-4.63671875, -4.550758361816406, -4.4647979736328125, -4.378837585449219, -4.292877197265625, -4.206916809082031, -4.1209564208984375, -4.034996032714844, -3.94903564453125, -3.8630752563476562, -3.7771148681640625, -3.6911544799804688, -3.605194091796875, -3.5192337036132812, -3.4332733154296875, -3.3473129272460938, -3.2613525390625, -3.1753921508789062, -3.0894317626953125, -3.0034713745117188, -2.917510986328125, -2.8315505981445312, -2.7455902099609375, -2.6596298217773438, -2.57366943359375, -2.4877090454101562, -2.4017486572265625, -2.3157882690429688, -2.229827880859375, -2.1438674926757812, -2.0579071044921875, -1.9719467163085938, -1.885986328125, -1.8000259399414062, -1.7140655517578125, -1.6281051635742188, -1.542144775390625, -1.4561843872070312, -1.3702239990234375, -1.2842636108398438, -1.19830322265625, -1.1123428344726562, -1.0263824462890625, -0.9404220581054688, -0.854461669921875, -0.7685012817382812, -0.6825408935546875, -0.5965805053710938, -0.5106201171875, -0.42465972900390625, -0.3386993408203125, -0.25273895263671875, -0.166778564453125, -0.08081817626953125, 0.0051422119140625, 0.09110260009765625, 0.17706298828125, 0.26302337646484375, 0.3489837646484375, 0.43494415283203125, 0.520904541015625, 0.6068649291992188, 0.6928253173828125, 0.7787857055664062, 0.86474609375]}, "gradients/encoder.encoder.layers.6.feed_forward.output_dense.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 3.0, 4.0, 9.0, 11.0, 21.0, 38.0, 37.0, 49.0, 62.0, 73.0, 92.0, 98.0, 81.0, 84.0, 94.0, 57.0, 49.0, 37.0, 26.0, 26.0, 18.0, 11.0, 9.0, 6.0, 2.0, 4.0, 3.0, 4.0, 2.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0], "bins": [-0.09259033203125, -0.09034395217895508, -0.08809757232666016, -0.08585119247436523, -0.08360481262207031, -0.08135843276977539, -0.07911205291748047, -0.07686567306518555, -0.07461929321289062, -0.0723729133605957, -0.07012653350830078, -0.06788015365600586, -0.06563377380371094, -0.06338739395141602, -0.061141014099121094, -0.05889463424682617, -0.05664825439453125, -0.05440187454223633, -0.052155494689941406, -0.049909114837646484, -0.04766273498535156, -0.04541635513305664, -0.04316997528076172, -0.0409235954284668, -0.038677215576171875, -0.03643083572387695, -0.03418445587158203, -0.03193807601928711, -0.029691696166992188, -0.027445316314697266, -0.025198936462402344, -0.022952556610107422, -0.0207061767578125, -0.018459796905517578, -0.016213417053222656, -0.013967037200927734, -0.011720657348632812, -0.00947427749633789, -0.007227897644042969, -0.004981517791748047, -0.002735137939453125, -0.0004887580871582031, 0.0017576217651367188, 0.004004001617431641, 0.0062503814697265625, 0.008496761322021484, 0.010743141174316406, 0.012989521026611328, 0.01523590087890625, 0.017482280731201172, 0.019728660583496094, 0.021975040435791016, 0.024221420288085938, 0.02646780014038086, 0.02871417999267578, 0.030960559844970703, 0.033206939697265625, 0.03545331954956055, 0.03769969940185547, 0.03994607925415039, 0.04219245910644531, 0.044438838958740234, 0.046685218811035156, 0.04893159866333008, 0.051177978515625]}, "gradients/encoder.encoder.layers.6.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 3.0, 1.0, 8.0, 7.0, 8.0, 9.0, 28.0, 42.0, 60.0, 83.0, 183.0, 458.0, 1308.0, 6338.0, 88045.0, 4042050.0, 49152.0, 4508.0, 1091.0, 421.0, 199.0, 93.0, 64.0, 42.0, 35.0, 11.0, 17.0, 10.0, 6.0, 7.0, 3.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.23583984375, -0.22723770141601562, -0.21863555908203125, -0.21003341674804688, -0.2014312744140625, -0.19282913208007812, -0.18422698974609375, -0.17562484741210938, -0.167022705078125, -0.15842056274414062, -0.14981842041015625, -0.14121627807617188, -0.1326141357421875, -0.12401199340820312, -0.11540985107421875, -0.10680770874023438, -0.09820556640625, -0.08960342407226562, -0.08100128173828125, -0.07239913940429688, -0.0637969970703125, -0.055194854736328125, -0.04659271240234375, -0.037990570068359375, -0.029388427734375, -0.020786285400390625, -0.01218414306640625, -0.003582000732421875, 0.0050201416015625, 0.013622283935546875, 0.02222442626953125, 0.030826568603515625, 0.0394287109375, 0.048030853271484375, 0.05663299560546875, 0.06523513793945312, 0.0738372802734375, 0.08243942260742188, 0.09104156494140625, 0.09964370727539062, 0.108245849609375, 0.11684799194335938, 0.12545013427734375, 0.13405227661132812, 0.1426544189453125, 0.15125656127929688, 0.15985870361328125, 0.16846084594726562, 0.17706298828125, 0.18566513061523438, 0.19426727294921875, 0.20286941528320312, 0.2114715576171875, 0.22007369995117188, 0.22867584228515625, 0.23727798461914062, 0.245880126953125, 0.2544822692871094, 0.26308441162109375, 0.2716865539550781, 0.2802886962890625, 0.2888908386230469, 0.29749298095703125, 0.3060951232910156, 0.314697265625]}, "gradients/encoder.encoder.layers.6.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 2.0, 1.0, 6.0, 9.0, 15.0, 29.0, 29.0, 67.0, 160.0, 843.0, 2250.0, 430.0, 116.0, 43.0, 35.0, 19.0, 9.0, 8.0, 2.0, 2.0, 3.0, 0.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.14794921875, -0.14132308959960938, -0.13469696044921875, -0.12807083129882812, -0.1214447021484375, -0.11481857299804688, -0.10819244384765625, -0.10156631469726562, -0.094940185546875, -0.08831405639648438, -0.08168792724609375, -0.07506179809570312, -0.0684356689453125, -0.061809539794921875, -0.05518341064453125, -0.048557281494140625, -0.04193115234375, -0.035305023193359375, -0.02867889404296875, -0.022052764892578125, -0.0154266357421875, -0.008800506591796875, -0.00217437744140625, 0.004451751708984375, 0.011077880859375, 0.017704010009765625, 0.02433013916015625, 0.030956268310546875, 0.0375823974609375, 0.044208526611328125, 0.05083465576171875, 0.057460784912109375, 0.0640869140625, 0.07071304321289062, 0.07733917236328125, 0.08396530151367188, 0.0905914306640625, 0.09721755981445312, 0.10384368896484375, 0.11046981811523438, 0.117095947265625, 0.12372207641601562, 0.13034820556640625, 0.13697433471679688, 0.1436004638671875, 0.15022659301757812, 0.15685272216796875, 0.16347885131835938, 0.17010498046875, 0.17673110961914062, 0.18335723876953125, 0.18998336791992188, 0.1966094970703125, 0.20323562622070312, 0.20986175537109375, 0.21648788452148438, 0.223114013671875, 0.22974014282226562, 0.23636627197265625, 0.24299240112304688, 0.2496185302734375, 0.2562446594238281, 0.26287078857421875, 0.2694969177246094, 0.276123046875]}, "gradients/encoder.encoder.layers.6.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 3.0, 2.0, 1.0, 1.0, 1.0, 0.0, 2.0, 4.0, 11.0, 12.0, 23.0, 52.0, 98.0, 181.0, 216.0, 196.0, 95.0, 53.0, 27.0, 6.0, 7.0, 5.0, 6.0, 2.0, 4.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-1.1641261577606201, -1.1294547319412231, -1.0947834253311157, -1.0601119995117188, -1.0254405736923218, -0.9907692670822144, -0.9560978412628174, -0.9214264750480652, -0.886755108833313, -0.8520837426185608, -0.8174123167991638, -0.7827409505844116, -0.7480695843696594, -0.7133982181549072, -0.6787267923355103, -0.6440554261207581, -0.6093840003013611, -0.5747126340866089, -0.5400412082672119, -0.5053698420524597, -0.4706984758377075, -0.43602707982063293, -0.40135568380355835, -0.36668431758880615, -0.33201292157173157, -0.297341525554657, -0.2626701593399048, -0.2279987633228302, -0.1933273822069168, -0.15865600109100342, -0.12398460507392883, -0.08931322395801544, -0.05464184284210205, -0.01997045800089836, 0.014700926840305328, 0.049372315406799316, 0.08404369652271271, 0.1187150776386261, 0.15338647365570068, 0.18805785477161407, 0.22272923588752747, 0.25740063190460205, 0.29207199811935425, 0.32674339413642883, 0.3614147901535034, 0.3960861563682556, 0.4307575523853302, 0.4654289484024048, 0.500100314617157, 0.5347716808319092, 0.5694431066513062, 0.6041144728660583, 0.6387858390808105, 0.6734572649002075, 0.7081286311149597, 0.7427999973297119, 0.7774714231491089, 0.8121427893638611, 0.8468142151832581, 0.8814855813980103, 0.9161569476127625, 0.9508283138275146, 0.9854997396469116, 1.0201711654663086, 1.054842472076416]}, "gradients/encoder.encoder.layers.6.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 3.0, 4.0, 2.0, 12.0, 14.0, 13.0, 19.0, 28.0, 27.0, 40.0, 41.0, 42.0, 57.0, 60.0, 61.0, 71.0, 74.0, 69.0, 56.0, 53.0, 44.0, 47.0, 37.0, 41.0, 24.0, 20.0, 16.0, 12.0, 4.0, 5.0, 4.0, 4.0, 2.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6333287954330444, -0.6137583255767822, -0.59418785572052, -0.574617326259613, -0.5550468564033508, -0.5354763865470886, -0.5159059166908264, -0.4963354468345642, -0.4767649471759796, -0.4571944773197174, -0.4376239776611328, -0.4180535078048706, -0.3984830379486084, -0.3789125382900238, -0.3593420684337616, -0.339771568775177, -0.3202010989189148, -0.3006306290626526, -0.281060129404068, -0.2614896595478058, -0.24191917479038239, -0.22234869003295898, -0.20277822017669678, -0.18320773541927338, -0.16363725066184998, -0.14406676590442657, -0.12449628859758377, -0.10492581129074097, -0.08535532653331757, -0.06578484177589417, -0.04621436446905136, -0.026643887162208557, -0.007073342800140381, 0.012497138231992722, 0.032067619264125824, 0.051638100296258926, 0.07120858132839203, 0.09077906608581543, 0.11034954339265823, 0.12992002069950104, 0.14949050545692444, 0.16906099021434784, 0.18863147497177124, 0.20820194482803345, 0.22777242958545685, 0.24734291434288025, 0.26691338419914246, 0.28648388385772705, 0.30605435371398926, 0.32562482357025146, 0.34519532322883606, 0.36476579308509827, 0.38433629274368286, 0.40390676259994507, 0.4234772324562073, 0.4430477023124695, 0.4626182019710541, 0.4821886718273163, 0.5017591714859009, 0.5213296413421631, 0.5409001111984253, 0.5604705810546875, 0.5800411105155945, 0.5996115803718567, 0.6191820502281189]}, "gradients/encoder.encoder.layers.6.attention.out_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 3.0, 4.0, 3.0, 5.0, 5.0, 8.0, 12.0, 31.0, 42.0, 92.0, 182.0, 571.0, 2231.0, 19139.0, 715022.0, 299276.0, 9696.0, 1491.0, 431.0, 154.0, 61.0, 45.0, 20.0, 14.0, 8.0, 4.0, 2.0, 3.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 2.0, 1.0, 1.0], "bins": [-0.385986328125, -0.37579345703125, -0.3656005859375, -0.35540771484375, -0.34521484375, -0.33502197265625, -0.3248291015625, -0.31463623046875, -0.304443359375, -0.29425048828125, -0.2840576171875, -0.27386474609375, -0.263671875, -0.25347900390625, -0.2432861328125, -0.23309326171875, -0.222900390625, -0.21270751953125, -0.2025146484375, -0.19232177734375, -0.18212890625, -0.17193603515625, -0.1617431640625, -0.15155029296875, -0.141357421875, -0.13116455078125, -0.1209716796875, -0.11077880859375, -0.1005859375, -0.09039306640625, -0.0802001953125, -0.07000732421875, -0.059814453125, -0.04962158203125, -0.0394287109375, -0.02923583984375, -0.01904296875, -0.00885009765625, 0.0013427734375, 0.01153564453125, 0.021728515625, 0.03192138671875, 0.0421142578125, 0.05230712890625, 0.0625, 0.07269287109375, 0.0828857421875, 0.09307861328125, 0.103271484375, 0.11346435546875, 0.1236572265625, 0.13385009765625, 0.14404296875, 0.15423583984375, 0.1644287109375, 0.17462158203125, 0.184814453125, 0.19500732421875, 0.2052001953125, 0.21539306640625, 0.2255859375, 0.23577880859375, 0.2459716796875, 0.25616455078125, 0.266357421875]}, "gradients/encoder.encoder.layers.6.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 3.0, 1.0, 4.0, 4.0, 10.0, 14.0, 25.0, 24.0, 35.0, 48.0, 62.0, 80.0, 72.0, 95.0, 91.0, 97.0, 58.0, 75.0, 51.0, 47.0, 33.0, 25.0, 16.0, 16.0, 12.0, 1.0, 4.0, 3.0, 2.0, 1.0, 0.0, 2.0, 1.0, 2.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.1031494140625, -0.10059309005737305, -0.0980367660522461, -0.09548044204711914, -0.09292411804199219, -0.09036779403686523, -0.08781147003173828, -0.08525514602661133, -0.08269882202148438, -0.08014249801635742, -0.07758617401123047, -0.07502985000610352, -0.07247352600097656, -0.06991720199584961, -0.06736087799072266, -0.0648045539855957, -0.06224822998046875, -0.0596919059753418, -0.057135581970214844, -0.05457925796508789, -0.05202293395996094, -0.049466609954833984, -0.04691028594970703, -0.04435396194458008, -0.041797637939453125, -0.03924131393432617, -0.03668498992919922, -0.034128665924072266, -0.03157234191894531, -0.02901601791381836, -0.026459693908691406, -0.023903369903564453, -0.0213470458984375, -0.018790721893310547, -0.016234397888183594, -0.01367807388305664, -0.011121749877929688, -0.008565425872802734, -0.006009101867675781, -0.003452777862548828, -0.000896453857421875, 0.0016598701477050781, 0.004216194152832031, 0.006772518157958984, 0.009328842163085938, 0.01188516616821289, 0.014441490173339844, 0.016997814178466797, 0.01955413818359375, 0.022110462188720703, 0.024666786193847656, 0.02722311019897461, 0.029779434204101562, 0.032335758209228516, 0.03489208221435547, 0.03744840621948242, 0.040004730224609375, 0.04256105422973633, 0.04511737823486328, 0.047673702239990234, 0.05023002624511719, 0.05278635025024414, 0.055342674255371094, 0.05789899826049805, 0.060455322265625]}, "gradients/encoder.encoder.layers.6.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 4.0, 1.0, 1.0, 2.0, 3.0, 1.0, 5.0, 3.0, 8.0, 17.0, 15.0, 18.0, 26.0, 34.0, 49.0, 89.0, 129.0, 204.0, 343.0, 738.0, 1640.0, 5040.0, 23183.0, 204426.0, 708546.0, 85916.0, 12519.0, 3104.0, 1186.0, 483.0, 296.0, 159.0, 108.0, 71.0, 56.0, 39.0, 23.0, 23.0, 11.0, 19.0, 7.0, 6.0, 6.0, 4.0, 3.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.160400390625, -0.15558624267578125, -0.1507720947265625, -0.14595794677734375, -0.141143798828125, -0.13632965087890625, -0.1315155029296875, -0.12670135498046875, -0.12188720703125, -0.11707305908203125, -0.1122589111328125, -0.10744476318359375, -0.102630615234375, -0.09781646728515625, -0.0930023193359375, -0.08818817138671875, -0.0833740234375, -0.07855987548828125, -0.0737457275390625, -0.06893157958984375, -0.064117431640625, -0.05930328369140625, -0.0544891357421875, -0.04967498779296875, -0.04486083984375, -0.04004669189453125, -0.0352325439453125, -0.03041839599609375, -0.025604248046875, -0.02079010009765625, -0.0159759521484375, -0.01116180419921875, -0.00634765625, -0.00153350830078125, 0.0032806396484375, 0.00809478759765625, 0.012908935546875, 0.01772308349609375, 0.0225372314453125, 0.02735137939453125, 0.03216552734375, 0.03697967529296875, 0.0417938232421875, 0.04660797119140625, 0.051422119140625, 0.05623626708984375, 0.0610504150390625, 0.06586456298828125, 0.0706787109375, 0.07549285888671875, 0.0803070068359375, 0.08512115478515625, 0.089935302734375, 0.09474945068359375, 0.0995635986328125, 0.10437774658203125, 0.10919189453125, 0.11400604248046875, 0.1188201904296875, 0.12363433837890625, 0.128448486328125, 0.13326263427734375, 0.1380767822265625, 0.14289093017578125, 0.147705078125]}, "gradients/encoder.encoder.layers.6.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 1.0, 1.0, 1.0, 4.0, 8.0, 6.0, 8.0, 9.0, 3.0, 12.0, 7.0, 13.0, 21.0, 19.0, 14.0, 22.0, 31.0, 20.0, 33.0, 40.0, 36.0, 36.0, 36.0, 45.0, 38.0, 44.0, 49.0, 53.0, 44.0, 25.0, 39.0, 26.0, 43.0, 39.0, 20.0, 23.0, 21.0, 20.0, 8.0, 22.0, 15.0, 11.0, 8.0, 8.0, 7.0, 5.0, 4.0, 7.0, 3.0, 5.0, 2.0, 0.0, 2.0, 0.0, 0.0, 2.0], "bins": [-0.168701171875, -0.16376495361328125, -0.1588287353515625, -0.15389251708984375, -0.148956298828125, -0.14402008056640625, -0.1390838623046875, -0.13414764404296875, -0.12921142578125, -0.12427520751953125, -0.1193389892578125, -0.11440277099609375, -0.109466552734375, -0.10453033447265625, -0.0995941162109375, -0.09465789794921875, -0.0897216796875, -0.08478546142578125, -0.0798492431640625, -0.07491302490234375, -0.069976806640625, -0.06504058837890625, -0.0601043701171875, -0.05516815185546875, -0.05023193359375, -0.04529571533203125, -0.0403594970703125, -0.03542327880859375, -0.030487060546875, -0.02555084228515625, -0.0206146240234375, -0.01567840576171875, -0.0107421875, -0.00580596923828125, -0.0008697509765625, 0.00406646728515625, 0.009002685546875, 0.01393890380859375, 0.0188751220703125, 0.02381134033203125, 0.02874755859375, 0.03368377685546875, 0.0386199951171875, 0.04355621337890625, 0.048492431640625, 0.05342864990234375, 0.0583648681640625, 0.06330108642578125, 0.0682373046875, 0.07317352294921875, 0.0781097412109375, 0.08304595947265625, 0.087982177734375, 0.09291839599609375, 0.0978546142578125, 0.10279083251953125, 0.10772705078125, 0.11266326904296875, 0.1175994873046875, 0.12253570556640625, 0.127471923828125, 0.13240814208984375, 0.1373443603515625, 0.14228057861328125, 0.147216796875]}, "gradients/encoder.encoder.layers.6.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 2.0, 1.0, 4.0, 4.0, 1.0, 3.0, 4.0, 16.0, 11.0, 22.0, 34.0, 72.0, 137.0, 271.0, 830.0, 2563.0, 12040.0, 190344.0, 803566.0, 31832.0, 4688.0, 1267.0, 449.0, 183.0, 92.0, 52.0, 28.0, 18.0, 8.0, 5.0, 3.0, 5.0, 3.0, 2.0, 2.0, 2.0, 3.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.08135986328125, -0.07813358306884766, -0.07490730285644531, -0.07168102264404297, -0.06845474243164062, -0.06522846221923828, -0.06200218200683594, -0.058775901794433594, -0.05554962158203125, -0.052323341369628906, -0.04909706115722656, -0.04587078094482422, -0.042644500732421875, -0.03941822052001953, -0.03619194030761719, -0.032965660095214844, -0.0297393798828125, -0.026513099670410156, -0.023286819458007812, -0.02006053924560547, -0.016834259033203125, -0.013607978820800781, -0.010381698608398438, -0.007155418395996094, -0.00392913818359375, -0.0007028579711914062, 0.0025234222412109375, 0.005749702453613281, 0.008975982666015625, 0.012202262878417969, 0.015428543090820312, 0.018654823303222656, 0.021881103515625, 0.025107383728027344, 0.028333663940429688, 0.03155994415283203, 0.034786224365234375, 0.03801250457763672, 0.04123878479003906, 0.044465065002441406, 0.04769134521484375, 0.050917625427246094, 0.05414390563964844, 0.05737018585205078, 0.060596466064453125, 0.06382274627685547, 0.06704902648925781, 0.07027530670166016, 0.0735015869140625, 0.07672786712646484, 0.07995414733886719, 0.08318042755126953, 0.08640670776367188, 0.08963298797607422, 0.09285926818847656, 0.0960855484008789, 0.09931182861328125, 0.1025381088256836, 0.10576438903808594, 0.10899066925048828, 0.11221694946289062, 0.11544322967529297, 0.11866950988769531, 0.12189579010009766, 0.1251220703125]}, "gradients/encoder.encoder.layers.6.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 4.0, 3.0, 3.0, 3.0, 8.0, 8.0, 11.0, 19.0, 36.0, 60.0, 91.0, 163.0, 168.0, 139.0, 111.0, 57.0, 35.0, 31.0, 19.0, 8.0, 8.0, 8.0, 4.0, 5.0, 1.0, 5.0, 2.0, 2.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.187490463256836e-05, -2.10469588637352e-05, -2.021901309490204e-05, -1.9391067326068878e-05, -1.8563121557235718e-05, -1.7735175788402557e-05, -1.6907230019569397e-05, -1.6079284250736237e-05, -1.5251338481903076e-05, -1.4423392713069916e-05, -1.3595446944236755e-05, -1.2767501175403595e-05, -1.1939555406570435e-05, -1.1111609637737274e-05, -1.0283663868904114e-05, -9.455718100070953e-06, -8.627772331237793e-06, -7.799826562404633e-06, -6.971880793571472e-06, -6.143935024738312e-06, -5.315989255905151e-06, -4.488043487071991e-06, -3.6600977182388306e-06, -2.83215194940567e-06, -2.0042061805725098e-06, -1.1762604117393494e-06, -3.4831464290618896e-07, 4.796311259269714e-07, 1.3075768947601318e-06, 2.1355226635932922e-06, 2.9634684324264526e-06, 3.791414201259613e-06, 4.6193599700927734e-06, 5.447305738925934e-06, 6.275251507759094e-06, 7.103197276592255e-06, 7.931143045425415e-06, 8.759088814258575e-06, 9.587034583091736e-06, 1.0414980351924896e-05, 1.1242926120758057e-05, 1.2070871889591217e-05, 1.2898817658424377e-05, 1.3726763427257538e-05, 1.4554709196090698e-05, 1.538265496492386e-05, 1.621060073375702e-05, 1.703854650259018e-05, 1.786649227142334e-05, 1.86944380402565e-05, 1.952238380908966e-05, 2.035032957792282e-05, 2.117827534675598e-05, 2.2006221115589142e-05, 2.2834166884422302e-05, 2.3662112653255463e-05, 2.4490058422088623e-05, 2.5318004190921783e-05, 2.6145949959754944e-05, 2.6973895728588104e-05, 2.7801841497421265e-05, 2.8629787266254425e-05, 2.9457733035087585e-05, 3.0285678803920746e-05, 3.1113624572753906e-05]}, "gradients/encoder.encoder.layers.6.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 4.0, 4.0, 1.0, 7.0, 10.0, 11.0, 14.0, 15.0, 26.0, 39.0, 61.0, 91.0, 154.0, 325.0, 580.0, 1582.0, 4425.0, 17203.0, 110119.0, 734429.0, 149722.0, 21270.0, 5268.0, 1733.0, 651.0, 318.0, 204.0, 104.0, 57.0, 37.0, 31.0, 19.0, 13.0, 4.0, 9.0, 6.0, 9.0, 5.0, 0.0, 1.0, 1.0, 2.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.06170654296875, -0.059546470642089844, -0.05738639831542969, -0.05522632598876953, -0.053066253662109375, -0.05090618133544922, -0.04874610900878906, -0.046586036682128906, -0.04442596435546875, -0.042265892028808594, -0.04010581970214844, -0.03794574737548828, -0.035785675048828125, -0.03362560272216797, -0.03146553039550781, -0.029305458068847656, -0.0271453857421875, -0.024985313415527344, -0.022825241088867188, -0.02066516876220703, -0.018505096435546875, -0.01634502410888672, -0.014184951782226562, -0.012024879455566406, -0.00986480712890625, -0.007704734802246094, -0.0055446624755859375, -0.0033845901489257812, -0.001224517822265625, 0.0009355545043945312, 0.0030956268310546875, 0.005255699157714844, 0.007415771484375, 0.009575843811035156, 0.011735916137695312, 0.013895988464355469, 0.016056060791015625, 0.01821613311767578, 0.020376205444335938, 0.022536277770996094, 0.02469635009765625, 0.026856422424316406, 0.029016494750976562, 0.03117656707763672, 0.033336639404296875, 0.03549671173095703, 0.03765678405761719, 0.039816856384277344, 0.0419769287109375, 0.044137001037597656, 0.04629707336425781, 0.04845714569091797, 0.050617218017578125, 0.05277729034423828, 0.05493736267089844, 0.057097434997558594, 0.05925750732421875, 0.061417579650878906, 0.06357765197753906, 0.06573772430419922, 0.06789779663085938, 0.07005786895751953, 0.07221794128417969, 0.07437801361083984, 0.0765380859375]}, "gradients/encoder.encoder.layers.6.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 4.0, 4.0, 3.0, 5.0, 2.0, 8.0, 4.0, 15.0, 7.0, 10.0, 13.0, 17.0, 30.0, 32.0, 44.0, 30.0, 56.0, 66.0, 66.0, 81.0, 85.0, 78.0, 59.0, 55.0, 46.0, 40.0, 28.0, 25.0, 10.0, 14.0, 12.0, 12.0, 9.0, 9.0, 3.0, 9.0, 6.0, 2.0, 3.0, 2.0, 7.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.047637939453125, -0.04619932174682617, -0.044760704040527344, -0.043322086334228516, -0.04188346862792969, -0.04044485092163086, -0.03900623321533203, -0.0375676155090332, -0.036128997802734375, -0.03469038009643555, -0.03325176239013672, -0.03181314468383789, -0.030374526977539062, -0.028935909271240234, -0.027497291564941406, -0.026058673858642578, -0.02462005615234375, -0.023181438446044922, -0.021742820739746094, -0.020304203033447266, -0.018865585327148438, -0.01742696762084961, -0.01598834991455078, -0.014549732208251953, -0.013111114501953125, -0.011672496795654297, -0.010233879089355469, -0.00879526138305664, -0.0073566436767578125, -0.005918025970458984, -0.004479408264160156, -0.003040790557861328, -0.0016021728515625, -0.00016355514526367188, 0.0012750625610351562, 0.0027136802673339844, 0.0041522979736328125, 0.005590915679931641, 0.007029533386230469, 0.008468151092529297, 0.009906768798828125, 0.011345386505126953, 0.012784004211425781, 0.01422262191772461, 0.015661239624023438, 0.017099857330322266, 0.018538475036621094, 0.019977092742919922, 0.02141571044921875, 0.022854328155517578, 0.024292945861816406, 0.025731563568115234, 0.027170181274414062, 0.02860879898071289, 0.03004741668701172, 0.03148603439331055, 0.032924652099609375, 0.0343632698059082, 0.03580188751220703, 0.03724050521850586, 0.03867912292480469, 0.040117740631103516, 0.041556358337402344, 0.04299497604370117, 0.04443359375]}, "gradients/encoder.encoder.layers.6.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 3.0, 2.0, 12.0, 31.0, 58.0, 193.0, 276.0, 227.0, 101.0, 55.0, 22.0, 11.0, 4.0, 4.0, 4.0, 3.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.9872668981552124, -0.9363853931427002, -0.8855039477348328, -0.8346225023269653, -0.7837409973144531, -0.7328594923019409, -0.6819780468940735, -0.631096601486206, -0.5802150964736938, -0.5293335914611816, -0.4784521460533142, -0.4275706708431244, -0.37668919563293457, -0.32580772042274475, -0.27492624521255493, -0.2240447700023651, -0.1731632947921753, -0.12228181958198547, -0.07140034437179565, -0.020518869161605835, 0.030362606048583984, 0.0812440812587738, 0.13212555646896362, 0.18300703167915344, 0.23388850688934326, 0.2847699820995331, 0.3356514573097229, 0.3865329325199127, 0.43741440773010254, 0.48829588294029236, 0.5391773581504822, 0.5900588035583496, 0.6409404277801514, 0.6918219327926636, 0.742703378200531, 0.7935848236083984, 0.8444663286209106, 0.8953478336334229, 0.9462292790412903, 0.9971107244491577, 1.04799222946167, 1.0988737344741821, 1.1497552394866943, 1.200636625289917, 1.2515181303024292, 1.3023996353149414, 1.353281021118164, 1.4041625261306763, 1.4550440311431885, 1.5059255361557007, 1.556807041168213, 1.6076884269714355, 1.6585699319839478, 1.70945143699646, 1.7603328227996826, 1.8112143278121948, 1.862095832824707, 1.9129773378372192, 1.9638588428497314, 2.014740228652954, 2.065621852874756, 2.1165032386779785, 2.167384624481201, 2.218266248703003, 2.2691476345062256]}, "gradients/encoder.encoder.layers.6.layer_norm.bias": {"_type": "histogram", "values": [3.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 2.0, 3.0, 8.0, 5.0, 9.0, 9.0, 20.0, 19.0, 22.0, 18.0, 28.0, 32.0, 38.0, 29.0, 40.0, 56.0, 50.0, 55.0, 68.0, 67.0, 71.0, 58.0, 42.0, 35.0, 42.0, 35.0, 22.0, 29.0, 19.0, 14.0, 11.0, 17.0, 14.0, 6.0, 4.0, 2.0, 6.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.738601565361023, -0.712329089641571, -0.6860566139221191, -0.6597841382026672, -0.6335116624832153, -0.6072391867637634, -0.5809667110443115, -0.5546941757202148, -0.5284217596054077, -0.5021492838859558, -0.4758768081665039, -0.449604332447052, -0.4233318567276001, -0.3970593810081482, -0.3707868754863739, -0.344514399766922, -0.3182418942451477, -0.2919694185256958, -0.2656969428062439, -0.2394244521856308, -0.2131519764661789, -0.186879500746727, -0.1606070101261139, -0.134334534406662, -0.10806205868721008, -0.08178958296775818, -0.05551709979772568, -0.029244616627693176, -0.002972140908241272, 0.023300334811210632, 0.04957282543182373, 0.07584530115127563, 0.10211777687072754, 0.12839025259017944, 0.15466272830963135, 0.18093521893024445, 0.20720769464969635, 0.23348017036914825, 0.25975266098976135, 0.28602513670921326, 0.31229761242866516, 0.33857008814811707, 0.36484256386756897, 0.39111506938934326, 0.41738754510879517, 0.44366002082824707, 0.469932496547699, 0.4962049722671509, 0.5224774479866028, 0.5487499237060547, 0.5750223994255066, 0.6012948751449585, 0.6275673508644104, 0.6538398265838623, 0.680112361907959, 0.7063847780227661, 0.7326573133468628, 0.7589297890663147, 0.7852022647857666, 0.8114747405052185, 0.8377472162246704, 0.8640196919441223, 0.8902921676635742, 0.9165647029876709, 0.942837119102478]}, "gradients/encoder.encoder.layers.5.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 2.0, 1.0, 0.0, 3.0, 2.0, 1.0, 3.0, 6.0, 12.0, 31.0, 62.0, 170.0, 441.0, 2584.0, 898254.0, 3288101.0, 3985.0, 468.0, 103.0, 33.0, 21.0, 9.0, 1.0, 2.0, 0.0, 2.0, 0.0, 2.0], "bins": [-0.826171875, -0.8099880218505859, -0.7938041687011719, -0.7776203155517578, -0.7614364624023438, -0.7452526092529297, -0.7290687561035156, -0.7128849029541016, -0.6967010498046875, -0.6805171966552734, -0.6643333435058594, -0.6481494903564453, -0.6319656372070312, -0.6157817840576172, -0.5995979309082031, -0.5834140777587891, -0.567230224609375, -0.5510463714599609, -0.5348625183105469, -0.5186786651611328, -0.5024948120117188, -0.4863109588623047, -0.4701271057128906, -0.45394325256347656, -0.4377593994140625, -0.42157554626464844, -0.4053916931152344, -0.3892078399658203, -0.37302398681640625, -0.3568401336669922, -0.3406562805175781, -0.32447242736816406, -0.30828857421875, -0.29210472106933594, -0.2759208679199219, -0.2597370147705078, -0.24355316162109375, -0.2273693084716797, -0.21118545532226562, -0.19500160217285156, -0.1788177490234375, -0.16263389587402344, -0.14645004272460938, -0.1302661895751953, -0.11408233642578125, -0.09789848327636719, -0.08171463012695312, -0.06553077697753906, -0.049346923828125, -0.03316307067871094, -0.016979217529296875, -0.0007953643798828125, 0.01538848876953125, 0.03157234191894531, 0.047756195068359375, 0.06394004821777344, 0.0801239013671875, 0.09630775451660156, 0.11249160766601562, 0.1286754608154297, 0.14485931396484375, 0.1610431671142578, 0.17722702026367188, 0.19341087341308594, 0.2095947265625]}, "gradients/encoder.encoder.layers.5.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 1.0, 8.0, 16.0, 10.0, 23.0, 40.0, 47.0, 57.0, 86.0, 104.0, 94.0, 106.0, 98.0, 93.0, 86.0, 45.0, 38.0, 24.0, 13.0, 5.0, 4.0, 7.0, 2.0, 2.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.1142578125, -0.11147594451904297, -0.10869407653808594, -0.1059122085571289, -0.10313034057617188, -0.10034847259521484, -0.09756660461425781, -0.09478473663330078, -0.09200286865234375, -0.08922100067138672, -0.08643913269042969, -0.08365726470947266, -0.08087539672851562, -0.0780935287475586, -0.07531166076660156, -0.07252979278564453, -0.0697479248046875, -0.06696605682373047, -0.06418418884277344, -0.061402320861816406, -0.058620452880859375, -0.055838584899902344, -0.05305671691894531, -0.05027484893798828, -0.04749298095703125, -0.04471111297607422, -0.04192924499511719, -0.039147377014160156, -0.036365509033203125, -0.033583641052246094, -0.030801773071289062, -0.02801990509033203, -0.025238037109375, -0.02245616912841797, -0.019674301147460938, -0.016892433166503906, -0.014110565185546875, -0.011328697204589844, -0.008546829223632812, -0.005764961242675781, -0.00298309326171875, -0.00020122528076171875, 0.0025806427001953125, 0.005362510681152344, 0.008144378662109375, 0.010926246643066406, 0.013708114624023438, 0.01648998260498047, 0.0192718505859375, 0.02205371856689453, 0.024835586547851562, 0.027617454528808594, 0.030399322509765625, 0.033181190490722656, 0.03596305847167969, 0.03874492645263672, 0.04152679443359375, 0.04430866241455078, 0.04709053039550781, 0.049872398376464844, 0.052654266357421875, 0.055436134338378906, 0.05821800231933594, 0.06099987030029297, 0.06378173828125]}, "gradients/encoder.encoder.layers.5.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 2.0, 4.0, 5.0, 9.0, 14.0, 9.0, 25.0, 27.0, 33.0, 79.0, 142.0, 270.0, 679.0, 1563.0, 5091.0, 24143.0, 439530.0, 3642973.0, 65300.0, 9858.0, 2661.0, 963.0, 444.0, 208.0, 95.0, 52.0, 46.0, 21.0, 9.0, 15.0, 7.0, 5.0, 2.0, 4.0, 0.0, 0.0, 1.0, 2.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.166259765625, -0.16014862060546875, -0.1540374755859375, -0.14792633056640625, -0.141815185546875, -0.13570404052734375, -0.1295928955078125, -0.12348175048828125, -0.11737060546875, -0.11125946044921875, -0.1051483154296875, -0.09903717041015625, -0.092926025390625, -0.08681488037109375, -0.0807037353515625, -0.07459259033203125, -0.0684814453125, -0.06237030029296875, -0.0562591552734375, -0.05014801025390625, -0.044036865234375, -0.03792572021484375, -0.0318145751953125, -0.02570343017578125, -0.01959228515625, -0.01348114013671875, -0.0073699951171875, -0.00125885009765625, 0.004852294921875, 0.01096343994140625, 0.0170745849609375, 0.02318572998046875, 0.029296875, 0.03540802001953125, 0.0415191650390625, 0.04763031005859375, 0.053741455078125, 0.05985260009765625, 0.0659637451171875, 0.07207489013671875, 0.07818603515625, 0.08429718017578125, 0.0904083251953125, 0.09651947021484375, 0.102630615234375, 0.10874176025390625, 0.1148529052734375, 0.12096405029296875, 0.1270751953125, 0.13318634033203125, 0.1392974853515625, 0.14540863037109375, 0.151519775390625, 0.15763092041015625, 0.1637420654296875, 0.16985321044921875, 0.17596435546875, 0.18207550048828125, 0.1881866455078125, 0.19429779052734375, 0.200408935546875, 0.20652008056640625, 0.2126312255859375, 0.21874237060546875, 0.224853515625]}, "gradients/encoder.encoder.layers.5.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 4.0, 0.0, 6.0, 2.0, 2.0, 2.0, 5.0, 13.0, 7.0, 13.0, 17.0, 35.0, 49.0, 78.0, 154.0, 420.0, 1200.0, 1142.0, 419.0, 194.0, 106.0, 65.0, 42.0, 31.0, 24.0, 19.0, 8.0, 6.0, 7.0, 5.0, 1.0, 4.0, 1.0, 3.0, 2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.157470703125, -0.15254974365234375, -0.1476287841796875, -0.14270782470703125, -0.137786865234375, -0.13286590576171875, -0.1279449462890625, -0.12302398681640625, -0.11810302734375, -0.11318206787109375, -0.1082611083984375, -0.10334014892578125, -0.098419189453125, -0.09349822998046875, -0.0885772705078125, -0.08365631103515625, -0.0787353515625, -0.07381439208984375, -0.0688934326171875, -0.06397247314453125, -0.059051513671875, -0.05413055419921875, -0.0492095947265625, -0.04428863525390625, -0.03936767578125, -0.03444671630859375, -0.0295257568359375, -0.02460479736328125, -0.019683837890625, -0.01476287841796875, -0.0098419189453125, -0.00492095947265625, 0.0, 0.00492095947265625, 0.0098419189453125, 0.01476287841796875, 0.019683837890625, 0.02460479736328125, 0.0295257568359375, 0.03444671630859375, 0.03936767578125, 0.04428863525390625, 0.0492095947265625, 0.05413055419921875, 0.059051513671875, 0.06397247314453125, 0.0688934326171875, 0.07381439208984375, 0.0787353515625, 0.08365631103515625, 0.0885772705078125, 0.09349822998046875, 0.098419189453125, 0.10334014892578125, 0.1082611083984375, 0.11318206787109375, 0.11810302734375, 0.12302398681640625, 0.1279449462890625, 0.13286590576171875, 0.137786865234375, 0.14270782470703125, 0.1476287841796875, 0.15254974365234375, 0.157470703125]}, "gradients/encoder.encoder.layers.5.final_layer_norm.weight": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 3.0, 4.0, 4.0, 5.0, 18.0, 44.0, 79.0, 124.0, 221.0, 220.0, 140.0, 68.0, 34.0, 23.0, 5.0, 4.0, 5.0, 4.0, 4.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.9099547266960144, -0.8617578744888306, -0.8135610818862915, -0.7653642296791077, -0.7171673774719238, -0.6689705848693848, -0.6207737326622009, -0.5725768804550171, -0.524380087852478, -0.4761832654476166, -0.42798641324043274, -0.3797895908355713, -0.33159273862838745, -0.283395916223526, -0.23519909381866455, -0.1870022416114807, -0.13880538940429688, -0.09060855209827423, -0.042411722242832184, 0.005785107612609863, 0.05398194491863251, 0.10217878222465515, 0.1503756046295166, 0.19857245683670044, 0.2467692792415619, 0.29496610164642334, 0.3431629538536072, 0.39135977625846863, 0.4395565986633301, 0.4877534508705139, 0.5359503030776978, 0.5841470956802368, 0.6323438882827759, 0.6805407404899597, 0.7287375330924988, 0.7769343852996826, 0.8251312375068665, 0.8733280897140503, 0.9215248823165894, 0.9697217345237732, 1.017918586730957, 1.066115379333496, 1.1143122911453247, 1.1625090837478638, 1.2107058763504028, 1.2589027881622314, 1.3070995807647705, 1.3552963733673096, 1.4034931659698486, 1.4516899585723877, 1.4998868703842163, 1.5480836629867554, 1.5962804555892944, 1.644477367401123, 1.692674160003662, 1.7408709526062012, 1.7890678644180298, 1.8372646570205688, 1.8854615688323975, 1.9336583614349365, 1.9818551540374756, 2.0300519466400146, 2.078248977661133, 2.126445770263672, 2.174642562866211]}, "gradients/encoder.encoder.layers.5.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 3.0, 2.0, 3.0, 6.0, 6.0, 4.0, 12.0, 11.0, 16.0, 19.0, 22.0, 42.0, 37.0, 39.0, 44.0, 46.0, 58.0, 64.0, 67.0, 46.0, 74.0, 63.0, 60.0, 51.0, 35.0, 38.0, 30.0, 28.0, 19.0, 25.0, 10.0, 5.0, 8.0, 12.0, 5.0, 3.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.9162216186523438, -0.8932319283485413, -0.8702422380447388, -0.8472525477409363, -0.8242628574371338, -0.8012731671333313, -0.7782834768295288, -0.7552937865257263, -0.7323040962219238, -0.7093144059181213, -0.6863247156143188, -0.6633350253105164, -0.6403453350067139, -0.6173556447029114, -0.5943659543991089, -0.5713762640953064, -0.5483865737915039, -0.5253968834877014, -0.5024071931838989, -0.47941750288009644, -0.45642781257629395, -0.43343812227249146, -0.41044843196868896, -0.3874587416648865, -0.364469051361084, -0.3414793610572815, -0.318489670753479, -0.2954999804496765, -0.272510290145874, -0.24952059984207153, -0.22653090953826904, -0.20354121923446655, -0.18055158853530884, -0.15756189823150635, -0.13457220792770386, -0.11158251762390137, -0.08859282732009888, -0.06560313701629639, -0.042613446712493896, -0.019623756408691406, 0.003365933895111084, 0.026355624198913574, 0.049345314502716064, 0.07233500480651855, 0.09532469511032104, 0.11831438541412354, 0.14130407571792603, 0.16429376602172852, 0.187283456325531, 0.2102731466293335, 0.233262836933136, 0.2562525272369385, 0.27924221754074097, 0.30223190784454346, 0.32522159814834595, 0.34821128845214844, 0.3712009787559509, 0.3941906690597534, 0.4171803593635559, 0.4401700496673584, 0.4631597399711609, 0.4861494302749634, 0.5091391205787659, 0.5321288108825684, 0.5551185011863708]}, "gradients/encoder.encoder.layers.5.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 2.0, 5.0, 2.0, 2.0, 6.0, 11.0, 11.0, 8.0, 19.0, 22.0, 31.0, 45.0, 74.0, 137.0, 192.0, 402.0, 746.0, 1866.0, 5783.0, 26878.0, 178893.0, 616400.0, 180569.0, 26965.0, 5844.0, 1895.0, 839.0, 356.0, 222.0, 113.0, 75.0, 53.0, 32.0, 26.0, 11.0, 7.0, 7.0, 6.0, 4.0, 4.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.12322998046875, -0.11839771270751953, -0.11356544494628906, -0.1087331771850586, -0.10390090942382812, -0.09906864166259766, -0.09423637390136719, -0.08940410614013672, -0.08457183837890625, -0.07973957061767578, -0.07490730285644531, -0.07007503509521484, -0.06524276733398438, -0.060410499572753906, -0.05557823181152344, -0.05074596405029297, -0.0459136962890625, -0.04108142852783203, -0.03624916076660156, -0.031416893005371094, -0.026584625244140625, -0.021752357482910156, -0.016920089721679688, -0.012087821960449219, -0.00725555419921875, -0.0024232864379882812, 0.0024089813232421875, 0.007241249084472656, 0.012073516845703125, 0.016905784606933594, 0.021738052368164062, 0.02657032012939453, 0.031402587890625, 0.03623485565185547, 0.04106712341308594, 0.045899391174316406, 0.050731658935546875, 0.055563926696777344, 0.06039619445800781, 0.06522846221923828, 0.07006072998046875, 0.07489299774169922, 0.07972526550292969, 0.08455753326416016, 0.08938980102539062, 0.0942220687866211, 0.09905433654785156, 0.10388660430908203, 0.1087188720703125, 0.11355113983154297, 0.11838340759277344, 0.1232156753540039, 0.12804794311523438, 0.13288021087646484, 0.1377124786376953, 0.14254474639892578, 0.14737701416015625, 0.15220928192138672, 0.1570415496826172, 0.16187381744384766, 0.16670608520507812, 0.1715383529663086, 0.17637062072753906, 0.18120288848876953, 0.18603515625]}, "gradients/encoder.encoder.layers.5.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 1.0, 3.0, 2.0, 18.0, 16.0, 30.0, 31.0, 36.0, 48.0, 70.0, 105.0, 95.0, 83.0, 101.0, 102.0, 67.0, 69.0, 44.0, 29.0, 26.0, 10.0, 12.0, 5.0, 1.0, 5.0, 1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.10968017578125, -0.10690593719482422, -0.10413169860839844, -0.10135746002197266, -0.09858322143554688, -0.0958089828491211, -0.09303474426269531, -0.09026050567626953, -0.08748626708984375, -0.08471202850341797, -0.08193778991699219, -0.0791635513305664, -0.07638931274414062, -0.07361507415771484, -0.07084083557128906, -0.06806659698486328, -0.0652923583984375, -0.06251811981201172, -0.05974388122558594, -0.056969642639160156, -0.054195404052734375, -0.051421165466308594, -0.04864692687988281, -0.04587268829345703, -0.04309844970703125, -0.04032421112060547, -0.03754997253417969, -0.034775733947753906, -0.032001495361328125, -0.029227256774902344, -0.026453018188476562, -0.02367877960205078, -0.020904541015625, -0.01813030242919922, -0.015356063842773438, -0.012581825256347656, -0.009807586669921875, -0.007033348083496094, -0.0042591094970703125, -0.0014848709106445312, 0.00128936767578125, 0.004063606262207031, 0.0068378448486328125, 0.009612083435058594, 0.012386322021484375, 0.015160560607910156, 0.017934799194335938, 0.02070903778076172, 0.0234832763671875, 0.02625751495361328, 0.029031753540039062, 0.031805992126464844, 0.034580230712890625, 0.037354469299316406, 0.04012870788574219, 0.04290294647216797, 0.04567718505859375, 0.04845142364501953, 0.05122566223144531, 0.053999900817871094, 0.056774139404296875, 0.059548377990722656, 0.06232261657714844, 0.06509685516357422, 0.06787109375]}, "gradients/encoder.encoder.layers.5.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 3.0, 1.0, 4.0, 3.0, 3.0, 4.0, 10.0, 8.0, 12.0, 11.0, 15.0, 25.0, 32.0, 43.0, 70.0, 99.0, 131.0, 194.0, 325.0, 508.0, 950.0, 2139.0, 6743.0, 41373.0, 660455.0, 302561.0, 24312.0, 4772.0, 1681.0, 852.0, 416.0, 275.0, 144.0, 111.0, 66.0, 58.0, 34.0, 33.0, 28.0, 21.0, 11.0, 14.0, 5.0, 8.0, 3.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.2105712890625, -0.20416641235351562, -0.19776153564453125, -0.19135665893554688, -0.1849517822265625, -0.17854690551757812, -0.17214202880859375, -0.16573715209960938, -0.159332275390625, -0.15292739868164062, -0.14652252197265625, -0.14011764526367188, -0.1337127685546875, -0.12730789184570312, -0.12090301513671875, -0.11449813842773438, -0.10809326171875, -0.10168838500976562, -0.09528350830078125, -0.08887863159179688, -0.0824737548828125, -0.07606887817382812, -0.06966400146484375, -0.06325912475585938, -0.056854248046875, -0.050449371337890625, -0.04404449462890625, -0.037639617919921875, -0.0312347412109375, -0.024829864501953125, -0.01842498779296875, -0.012020111083984375, -0.005615234375, 0.000789642333984375, 0.00719451904296875, 0.013599395751953125, 0.0200042724609375, 0.026409149169921875, 0.03281402587890625, 0.039218902587890625, 0.045623779296875, 0.052028656005859375, 0.05843353271484375, 0.06483840942382812, 0.0712432861328125, 0.07764816284179688, 0.08405303955078125, 0.09045791625976562, 0.09686279296875, 0.10326766967773438, 0.10967254638671875, 0.11607742309570312, 0.1224822998046875, 0.12888717651367188, 0.13529205322265625, 0.14169692993164062, 0.148101806640625, 0.15450668334960938, 0.16091156005859375, 0.16731643676757812, 0.1737213134765625, 0.18012619018554688, 0.18653106689453125, 0.19293594360351562, 0.1993408203125]}, "gradients/encoder.encoder.layers.5.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 3.0, 1.0, 0.0, 0.0, 2.0, 4.0, 5.0, 3.0, 1.0, 6.0, 7.0, 10.0, 14.0, 15.0, 21.0, 21.0, 28.0, 27.0, 29.0, 31.0, 27.0, 35.0, 50.0, 34.0, 38.0, 40.0, 49.0, 61.0, 55.0, 40.0, 40.0, 47.0, 36.0, 42.0, 38.0, 31.0, 25.0, 26.0, 13.0, 14.0, 13.0, 14.0, 8.0, 4.0, 3.0, 3.0, 2.0, 2.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.189453125, -0.18335914611816406, -0.17726516723632812, -0.1711711883544922, -0.16507720947265625, -0.1589832305908203, -0.15288925170898438, -0.14679527282714844, -0.1407012939453125, -0.13460731506347656, -0.12851333618164062, -0.12241935729980469, -0.11632537841796875, -0.11023139953613281, -0.10413742065429688, -0.09804344177246094, -0.091949462890625, -0.08585548400878906, -0.07976150512695312, -0.07366752624511719, -0.06757354736328125, -0.06147956848144531, -0.055385589599609375, -0.04929161071777344, -0.0431976318359375, -0.03710365295410156, -0.031009674072265625, -0.024915695190429688, -0.01882171630859375, -0.012727737426757812, -0.006633758544921875, -0.0005397796630859375, 0.00555419921875, 0.011648178100585938, 0.017742156982421875, 0.023836135864257812, 0.02993011474609375, 0.03602409362792969, 0.042118072509765625, 0.04821205139160156, 0.0543060302734375, 0.06040000915527344, 0.06649398803710938, 0.07258796691894531, 0.07868194580078125, 0.08477592468261719, 0.09086990356445312, 0.09696388244628906, 0.103057861328125, 0.10915184020996094, 0.11524581909179688, 0.12133979797363281, 0.12743377685546875, 0.1335277557373047, 0.13962173461914062, 0.14571571350097656, 0.1518096923828125, 0.15790367126464844, 0.16399765014648438, 0.1700916290283203, 0.17618560791015625, 0.1822795867919922, 0.18837356567382812, 0.19446754455566406, 0.2005615234375]}, "gradients/encoder.encoder.layers.5.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 4.0, 4.0, 6.0, 4.0, 6.0, 5.0, 11.0, 12.0, 23.0, 18.0, 35.0, 22.0, 60.0, 82.0, 126.0, 178.0, 248.0, 401.0, 669.0, 1121.0, 2089.0, 3986.0, 9055.0, 24137.0, 138473.0, 764166.0, 71512.0, 17413.0, 7104.0, 3261.0, 1733.0, 956.0, 550.0, 342.0, 224.0, 145.0, 108.0, 85.0, 42.0, 40.0, 22.0, 18.0, 14.0, 17.0, 8.0, 9.0, 3.0, 5.0, 5.0, 4.0, 3.0, 0.0, 1.0, 1.0, 0.0, 3.0, 2.0], "bins": [-0.07525634765625, -0.07293033599853516, -0.07060432434082031, -0.06827831268310547, -0.06595230102539062, -0.06362628936767578, -0.06130027770996094, -0.058974266052246094, -0.05664825439453125, -0.054322242736816406, -0.05199623107910156, -0.04967021942138672, -0.047344207763671875, -0.04501819610595703, -0.04269218444824219, -0.040366172790527344, -0.0380401611328125, -0.035714149475097656, -0.03338813781738281, -0.03106212615966797, -0.028736114501953125, -0.02641010284423828, -0.024084091186523438, -0.021758079528808594, -0.01943206787109375, -0.017106056213378906, -0.014780044555664062, -0.012454032897949219, -0.010128021240234375, -0.007802009582519531, -0.0054759979248046875, -0.0031499862670898438, -0.000823974609375, 0.0015020370483398438, 0.0038280487060546875, 0.006154060363769531, 0.008480072021484375, 0.010806083679199219, 0.013132095336914062, 0.015458106994628906, 0.01778411865234375, 0.020110130310058594, 0.022436141967773438, 0.02476215362548828, 0.027088165283203125, 0.02941417694091797, 0.03174018859863281, 0.034066200256347656, 0.0363922119140625, 0.038718223571777344, 0.04104423522949219, 0.04337024688720703, 0.045696258544921875, 0.04802227020263672, 0.05034828186035156, 0.052674293518066406, 0.05500030517578125, 0.057326316833496094, 0.05965232849121094, 0.06197834014892578, 0.06430435180664062, 0.06663036346435547, 0.06895637512207031, 0.07128238677978516, 0.0736083984375]}, "gradients/encoder.encoder.layers.5.attention.k_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 3.0, 1.0, 6.0, 0.0, 2.0, 6.0, 6.0, 11.0, 18.0, 32.0, 93.0, 229.0, 316.0, 153.0, 62.0, 24.0, 7.0, 9.0, 6.0, 4.0, 10.0, 5.0, 1.0, 2.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-4.7326087951660156e-05, -4.546064883470535e-05, -4.359520971775055e-05, -4.1729770600795746e-05, -3.986433148384094e-05, -3.799889236688614e-05, -3.6133453249931335e-05, -3.426801413297653e-05, -3.240257501602173e-05, -3.0537135899066925e-05, -2.867169678211212e-05, -2.6806257665157318e-05, -2.4940818548202515e-05, -2.307537943124771e-05, -2.1209940314292908e-05, -1.9344501197338104e-05, -1.74790620803833e-05, -1.5613622963428497e-05, -1.3748183846473694e-05, -1.188274472951889e-05, -1.0017305612564087e-05, -8.151866495609283e-06, -6.28642737865448e-06, -4.4209882616996765e-06, -2.555549144744873e-06, -6.901100277900696e-07, 1.1753290891647339e-06, 3.0407682061195374e-06, 4.906207323074341e-06, 6.771646440029144e-06, 8.637085556983948e-06, 1.0502524673938751e-05, 1.2367963790893555e-05, 1.4233402907848358e-05, 1.609884202480316e-05, 1.7964281141757965e-05, 1.982972025871277e-05, 2.1695159375667572e-05, 2.3560598492622375e-05, 2.542603760957718e-05, 2.7291476726531982e-05, 2.9156915843486786e-05, 3.102235496044159e-05, 3.288779407739639e-05, 3.4753233194351196e-05, 3.6618672311306e-05, 3.84841114282608e-05, 4.034955054521561e-05, 4.221498966217041e-05, 4.4080428779125214e-05, 4.594586789608002e-05, 4.781130701303482e-05, 4.9676746129989624e-05, 5.154218524694443e-05, 5.340762436389923e-05, 5.5273063480854034e-05, 5.713850259780884e-05, 5.900394171476364e-05, 6.0869380831718445e-05, 6.273481994867325e-05, 6.460025906562805e-05, 6.646569818258286e-05, 6.833113729953766e-05, 7.019657641649246e-05, 7.206201553344727e-05]}, "gradients/encoder.encoder.layers.5.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 3.0, 0.0, 2.0, 3.0, 5.0, 13.0, 15.0, 27.0, 39.0, 105.0, 208.0, 515.0, 1830.0, 10297.0, 229421.0, 786421.0, 16019.0, 2535.0, 643.0, 239.0, 112.0, 50.0, 30.0, 14.0, 5.0, 10.0, 2.0, 4.0, 3.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1727294921875, -0.16734886169433594, -0.16196823120117188, -0.1565876007080078, -0.15120697021484375, -0.1458263397216797, -0.14044570922851562, -0.13506507873535156, -0.1296844482421875, -0.12430381774902344, -0.11892318725585938, -0.11354255676269531, -0.10816192626953125, -0.10278129577636719, -0.09740066528320312, -0.09202003479003906, -0.086639404296875, -0.08125877380371094, -0.07587814331054688, -0.07049751281738281, -0.06511688232421875, -0.05973625183105469, -0.054355621337890625, -0.04897499084472656, -0.0435943603515625, -0.03821372985839844, -0.032833099365234375, -0.027452468872070312, -0.02207183837890625, -0.016691207885742188, -0.011310577392578125, -0.0059299468994140625, -0.00054931640625, 0.0048313140869140625, 0.010211944580078125, 0.015592575073242188, 0.02097320556640625, 0.026353836059570312, 0.031734466552734375, 0.03711509704589844, 0.0424957275390625, 0.04787635803222656, 0.053256988525390625, 0.05863761901855469, 0.06401824951171875, 0.06939888000488281, 0.07477951049804688, 0.08016014099121094, 0.085540771484375, 0.09092140197753906, 0.09630203247070312, 0.10168266296386719, 0.10706329345703125, 0.11244392395019531, 0.11782455444335938, 0.12320518493652344, 0.1285858154296875, 0.13396644592285156, 0.13934707641601562, 0.1447277069091797, 0.15010833740234375, 0.1554889678955078, 0.16086959838867188, 0.16625022888183594, 0.171630859375]}, "gradients/encoder.encoder.layers.5.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 3.0, 4.0, 4.0, 2.0, 3.0, 10.0, 13.0, 7.0, 8.0, 15.0, 21.0, 44.0, 72.0, 93.0, 138.0, 169.0, 148.0, 92.0, 57.0, 24.0, 28.0, 7.0, 12.0, 8.0, 7.0, 8.0, 1.0, 8.0, 4.0, 2.0, 1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.10028076171875, -0.09728240966796875, -0.0942840576171875, -0.09128570556640625, -0.088287353515625, -0.08528900146484375, -0.0822906494140625, -0.07929229736328125, -0.0762939453125, -0.07329559326171875, -0.0702972412109375, -0.06729888916015625, -0.064300537109375, -0.06130218505859375, -0.0583038330078125, -0.05530548095703125, -0.05230712890625, -0.04930877685546875, -0.0463104248046875, -0.04331207275390625, -0.040313720703125, -0.03731536865234375, -0.0343170166015625, -0.03131866455078125, -0.0283203125, -0.02532196044921875, -0.0223236083984375, -0.01932525634765625, -0.016326904296875, -0.01332855224609375, -0.0103302001953125, -0.00733184814453125, -0.00433349609375, -0.00133514404296875, 0.0016632080078125, 0.00466156005859375, 0.007659912109375, 0.01065826416015625, 0.0136566162109375, 0.01665496826171875, 0.0196533203125, 0.02265167236328125, 0.0256500244140625, 0.02864837646484375, 0.031646728515625, 0.03464508056640625, 0.0376434326171875, 0.04064178466796875, 0.04364013671875, 0.04663848876953125, 0.0496368408203125, 0.05263519287109375, 0.055633544921875, 0.05863189697265625, 0.0616302490234375, 0.06462860107421875, 0.067626953125, 0.07062530517578125, 0.0736236572265625, 0.07662200927734375, 0.079620361328125, 0.08261871337890625, 0.0856170654296875, 0.08861541748046875, 0.09161376953125]}, "gradients/encoder.encoder.layers.5.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 4.0, 5.0, 1.0, 11.0, 16.0, 40.0, 78.0, 154.0, 318.0, 173.0, 95.0, 62.0, 24.0, 13.0, 6.0, 0.0, 2.0, 2.0, 1.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.1227328777313232, -1.0651572942733765, -1.0075817108154297, -0.9500061273574829, -0.8924304842948914, -0.8348549008369446, -0.7772793173789978, -0.7197036743164062, -0.6621280908584595, -0.6045525074005127, -0.5469769239425659, -0.48940131068229675, -0.4318256974220276, -0.3742501139640808, -0.31667453050613403, -0.25909891724586487, -0.20152336359024048, -0.1439477652311325, -0.08637217432260513, -0.02879658341407776, 0.028779014945030212, 0.08635461330413818, 0.14393019676208496, 0.20150581002235413, 0.2590813934803009, 0.3166569769382477, 0.37423259019851685, 0.4318081736564636, 0.4893837571144104, 0.546959400177002, 0.6045349836349487, 0.6621105670928955, 0.7196861505508423, 0.7772617340087891, 0.8348373174667358, 0.8924129009246826, 0.9499885439872742, 1.0075640678405762, 1.0651397705078125, 1.1227153539657593, 1.180290937423706, 1.2378665208816528, 1.2954421043395996, 1.3530176877975464, 1.4105932712554932, 1.4681689739227295, 1.5257444381713867, 1.583320140838623, 1.6408956050872803, 1.698471188545227, 1.7560467720031738, 1.8136223554611206, 1.8711979389190674, 1.9287736415863037, 1.986349105834961, 2.0439248085021973, 2.1015005111694336, 2.15907621383667, 2.216651678085327, 2.2742273807525635, 2.3318028450012207, 2.389378547668457, 2.4469540119171143, 2.5045297145843506, 2.562105178833008]}, "gradients/encoder.encoder.layers.5.layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 3.0, 0.0, 0.0, 4.0, 4.0, 5.0, 3.0, 7.0, 5.0, 9.0, 9.0, 8.0, 9.0, 11.0, 11.0, 19.0, 17.0, 28.0, 18.0, 24.0, 26.0, 44.0, 32.0, 23.0, 44.0, 49.0, 55.0, 64.0, 57.0, 47.0, 38.0, 27.0, 36.0, 35.0, 36.0, 21.0, 18.0, 17.0, 14.0, 18.0, 15.0, 20.0, 7.0, 15.0, 10.0, 11.0, 7.0, 6.0, 6.0, 8.0, 4.0, 3.0, 0.0, 7.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6883071660995483, -0.6660804748535156, -0.6438537836074829, -0.6216270327568054, -0.5994003415107727, -0.57717365026474, -0.5549468994140625, -0.5327202081680298, -0.5104935169219971, -0.48826682567596436, -0.46604010462760925, -0.44381338357925415, -0.42158669233322144, -0.3993600010871887, -0.3771332800388336, -0.3549065589904785, -0.3326798677444458, -0.3104531764984131, -0.288226455450058, -0.2659997344017029, -0.24377304315567017, -0.22154633700847626, -0.19931963086128235, -0.17709292471408844, -0.15486621856689453, -0.13263951241970062, -0.11041280627250671, -0.0881861001253128, -0.0659593939781189, -0.04373268783092499, -0.02150598168373108, 0.0007207244634628296, 0.02294743061065674, 0.04517413675785065, 0.06740084290504456, 0.08962754905223846, 0.11185425519943237, 0.13408096134662628, 0.1563076674938202, 0.1785343736410141, 0.200761079788208, 0.22298778593540192, 0.24521449208259583, 0.2674412131309509, 0.28966790437698364, 0.31189459562301636, 0.33412131667137146, 0.35634803771972656, 0.3785747289657593, 0.400801420211792, 0.4230281412601471, 0.4452548623085022, 0.4674815535545349, 0.4897082448005676, 0.5119349956512451, 0.5341616868972778, 0.5563883781433105, 0.5786150693893433, 0.600841760635376, 0.6230685114860535, 0.6452952027320862, 0.6675218939781189, 0.6897486448287964, 0.7119753360748291, 0.7342020273208618]}, "gradients/encoder.encoder.layers.4.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 3.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 4.0, 1.0, 2.0, 2.0, 3.0, 4.0, 4.0, 4.0, 4.0, 10.0, 16.0, 22.0, 40.0, 49.0, 81.0, 117.0, 166.0, 248.0, 462.0, 958.0, 2557.0, 10366.0, 175059.0, 3919319.0, 72065.0, 8804.0, 2173.0, 792.0, 436.0, 213.0, 111.0, 70.0, 34.0, 28.0, 23.0, 13.0, 4.0, 10.0, 4.0, 3.0, 3.0, 1.0, 3.0, 3.0], "bins": [-0.318115234375, -0.31078338623046875, -0.3034515380859375, -0.29611968994140625, -0.288787841796875, -0.28145599365234375, -0.2741241455078125, -0.26679229736328125, -0.25946044921875, -0.25212860107421875, -0.2447967529296875, -0.23746490478515625, -0.230133056640625, -0.22280120849609375, -0.2154693603515625, -0.20813751220703125, -0.2008056640625, -0.19347381591796875, -0.1861419677734375, -0.17881011962890625, -0.171478271484375, -0.16414642333984375, -0.1568145751953125, -0.14948272705078125, -0.14215087890625, -0.13481903076171875, -0.1274871826171875, -0.12015533447265625, -0.112823486328125, -0.10549163818359375, -0.0981597900390625, -0.09082794189453125, -0.08349609375, -0.07616424560546875, -0.0688323974609375, -0.06150054931640625, -0.054168701171875, -0.04683685302734375, -0.0395050048828125, -0.03217315673828125, -0.02484130859375, -0.01750946044921875, -0.0101776123046875, -0.00284576416015625, 0.004486083984375, 0.01181793212890625, 0.0191497802734375, 0.02648162841796875, 0.0338134765625, 0.04114532470703125, 0.0484771728515625, 0.05580902099609375, 0.063140869140625, 0.07047271728515625, 0.0778045654296875, 0.08513641357421875, 0.09246826171875, 0.09980010986328125, 0.1071319580078125, 0.11446380615234375, 0.121795654296875, 0.12912750244140625, 0.1364593505859375, 0.14379119873046875, 0.151123046875]}, "gradients/encoder.encoder.layers.4.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 2.0, 7.0, 8.0, 12.0, 20.0, 26.0, 43.0, 49.0, 72.0, 77.0, 108.0, 111.0, 85.0, 79.0, 85.0, 73.0, 51.0, 35.0, 27.0, 13.0, 9.0, 3.0, 11.0, 4.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.115966796875, -0.11325883865356445, -0.1105508804321289, -0.10784292221069336, -0.10513496398925781, -0.10242700576782227, -0.09971904754638672, -0.09701108932495117, -0.09430313110351562, -0.09159517288208008, -0.08888721466064453, -0.08617925643920898, -0.08347129821777344, -0.08076333999633789, -0.07805538177490234, -0.0753474235534668, -0.07263946533203125, -0.0699315071105957, -0.06722354888916016, -0.06451559066772461, -0.06180763244628906, -0.059099674224853516, -0.05639171600341797, -0.05368375778198242, -0.050975799560546875, -0.04826784133911133, -0.04555988311767578, -0.042851924896240234, -0.04014396667480469, -0.03743600845336914, -0.034728050231933594, -0.03202009201049805, -0.0293121337890625, -0.026604175567626953, -0.023896217346191406, -0.02118825912475586, -0.018480300903320312, -0.015772342681884766, -0.013064384460449219, -0.010356426239013672, -0.007648468017578125, -0.004940509796142578, -0.0022325515747070312, 0.0004754066467285156, 0.0031833648681640625, 0.005891323089599609, 0.008599281311035156, 0.011307239532470703, 0.01401519775390625, 0.016723155975341797, 0.019431114196777344, 0.02213907241821289, 0.024847030639648438, 0.027554988861083984, 0.03026294708251953, 0.03297090530395508, 0.035678863525390625, 0.03838682174682617, 0.04109477996826172, 0.043802738189697266, 0.04651069641113281, 0.04921865463256836, 0.051926612854003906, 0.05463457107543945, 0.057342529296875]}, "gradients/encoder.encoder.layers.4.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 1.0, 2.0, 3.0, 1.0, 3.0, 4.0, 7.0, 10.0, 9.0, 18.0, 23.0, 26.0, 24.0, 23.0, 44.0, 66.0, 70.0, 102.0, 177.0, 246.0, 493.0, 1176.0, 4214.0, 27043.0, 1814477.0, 2306661.0, 31557.0, 4757.0, 1464.0, 670.0, 297.0, 199.0, 131.0, 85.0, 64.0, 38.0, 25.0, 18.0, 25.0, 19.0, 9.0, 6.0, 3.0, 0.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.2430419921875, -0.23545074462890625, -0.2278594970703125, -0.22026824951171875, -0.212677001953125, -0.20508575439453125, -0.1974945068359375, -0.18990325927734375, -0.18231201171875, -0.17472076416015625, -0.1671295166015625, -0.15953826904296875, -0.151947021484375, -0.14435577392578125, -0.1367645263671875, -0.12917327880859375, -0.12158203125, -0.11399078369140625, -0.1063995361328125, -0.09880828857421875, -0.091217041015625, -0.08362579345703125, -0.0760345458984375, -0.06844329833984375, -0.06085205078125, -0.05326080322265625, -0.0456695556640625, -0.03807830810546875, -0.030487060546875, -0.02289581298828125, -0.0153045654296875, -0.00771331787109375, -0.0001220703125, 0.00746917724609375, 0.0150604248046875, 0.02265167236328125, 0.030242919921875, 0.03783416748046875, 0.0454254150390625, 0.05301666259765625, 0.06060791015625, 0.06819915771484375, 0.0757904052734375, 0.08338165283203125, 0.090972900390625, 0.09856414794921875, 0.1061553955078125, 0.11374664306640625, 0.121337890625, 0.12892913818359375, 0.1365203857421875, 0.14411163330078125, 0.151702880859375, 0.15929412841796875, 0.1668853759765625, 0.17447662353515625, 0.18206787109375, 0.18965911865234375, 0.1972503662109375, 0.20484161376953125, 0.212432861328125, 0.22002410888671875, 0.2276153564453125, 0.23520660400390625, 0.2427978515625]}, "gradients/encoder.encoder.layers.4.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 2.0, 1.0, 2.0, 3.0, 3.0, 5.0, 7.0, 12.0, 17.0, 29.0, 53.0, 99.0, 227.0, 690.0, 1512.0, 842.0, 289.0, 111.0, 60.0, 42.0, 29.0, 17.0, 7.0, 11.0, 3.0, 4.0, 5.0, 1.0, 1.0, 1.0, 0.0, 3.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.213623046875, -0.20685958862304688, -0.20009613037109375, -0.19333267211914062, -0.1865692138671875, -0.17980575561523438, -0.17304229736328125, -0.16627883911132812, -0.159515380859375, -0.15275192260742188, -0.14598846435546875, -0.13922500610351562, -0.1324615478515625, -0.12569808959960938, -0.11893463134765625, -0.11217117309570312, -0.10540771484375, -0.09864425659179688, -0.09188079833984375, -0.08511734008789062, -0.0783538818359375, -0.07159042358398438, -0.06482696533203125, -0.058063507080078125, -0.051300048828125, -0.044536590576171875, -0.03777313232421875, -0.031009674072265625, -0.0242462158203125, -0.017482757568359375, -0.01071929931640625, -0.003955841064453125, 0.0028076171875, 0.009571075439453125, 0.01633453369140625, 0.023097991943359375, 0.0298614501953125, 0.036624908447265625, 0.04338836669921875, 0.050151824951171875, 0.056915283203125, 0.06367874145507812, 0.07044219970703125, 0.07720565795898438, 0.0839691162109375, 0.09073257446289062, 0.09749603271484375, 0.10425949096679688, 0.11102294921875, 0.11778640747070312, 0.12454986572265625, 0.13131332397460938, 0.1380767822265625, 0.14484024047851562, 0.15160369873046875, 0.15836715698242188, 0.165130615234375, 0.17189407348632812, 0.17865753173828125, 0.18542098999023438, 0.1921844482421875, 0.19894790649414062, 0.20571136474609375, 0.21247482299804688, 0.21923828125]}, "gradients/encoder.encoder.layers.4.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 5.0, 5.0, 2.0, 1.0, 6.0, 6.0, 7.0, 24.0, 25.0, 49.0, 99.0, 184.0, 184.0, 164.0, 96.0, 56.0, 34.0, 22.0, 4.0, 11.0, 10.0, 4.0, 1.0, 1.0, 3.0, 3.0, 3.0, 1.0, 1.0, 2.0, 1.0, 0.0, 2.0], "bins": [-2.1288230419158936, -2.0794289112091064, -2.0300347805023193, -1.9806406497955322, -1.9312465190887451, -1.881852388381958, -1.832458257675171, -1.7830641269683838, -1.7336699962615967, -1.6842758655548096, -1.6348817348480225, -1.5854876041412354, -1.5360934734344482, -1.4866993427276611, -1.437305212020874, -1.387911081314087, -1.3385170698165894, -1.2891229391098022, -1.2397288084030151, -1.190334677696228, -1.140940546989441, -1.0915464162826538, -1.0421524047851562, -0.9927582144737244, -0.9433640837669373, -0.8939699530601501, -0.844575822353363, -0.7951817512512207, -0.7457876205444336, -0.6963934898376465, -0.6469993591308594, -0.5976052284240723, -0.5482112169265747, -0.4988170862197876, -0.4494229555130005, -0.40002885460853577, -0.35063472390174866, -0.30124059319496155, -0.2518464922904968, -0.20245236158370972, -0.1530582308769226, -0.1036641076207161, -0.05426998436450958, -0.004875868558883667, 0.04451826214790344, 0.09391239285469055, 0.14330649375915527, 0.19270062446594238, 0.2420947551727295, 0.2914888858795166, 0.3408830165863037, 0.39027711749076843, 0.43967124819755554, 0.48906537890434265, 0.5384594798088074, 0.5878536105155945, 0.6372477412223816, 0.6866418719291687, 0.7360360026359558, 0.7854300737380981, 0.8348242044448853, 0.8842183351516724, 0.9336124658584595, 0.9830065965652466, 1.0324007272720337]}, "gradients/encoder.encoder.layers.4.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 2.0, 2.0, 0.0, 3.0, 3.0, 3.0, 9.0, 8.0, 13.0, 11.0, 10.0, 17.0, 16.0, 27.0, 24.0, 23.0, 33.0, 29.0, 32.0, 29.0, 47.0, 47.0, 48.0, 48.0, 59.0, 47.0, 52.0, 50.0, 50.0, 36.0, 35.0, 21.0, 31.0, 21.0, 26.0, 15.0, 21.0, 11.0, 7.0, 7.0, 7.0, 10.0, 6.0, 6.0, 2.0, 2.0, 3.0, 4.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.5960007309913635, -0.5767236351966858, -0.5574465394020081, -0.5381694436073303, -0.5188924074172974, -0.49961528182029724, -0.4803382158279419, -0.46106112003326416, -0.4417840242385864, -0.4225069284439087, -0.40322983264923096, -0.3839527666568756, -0.3646756708621979, -0.34539857506752014, -0.3261215090751648, -0.30684441328048706, -0.2875673174858093, -0.2682902216911316, -0.24901314079761505, -0.2297360599040985, -0.21045896410942078, -0.19118186831474304, -0.1719047874212265, -0.15262770652770996, -0.13335061073303223, -0.11407352238893509, -0.09479643404483795, -0.07551934570074081, -0.05624225735664368, -0.03696516901254654, -0.017688080668449402, 0.0015890002250671387, 0.020866096019744873, 0.04014318436384201, 0.05942027270793915, 0.07869736105203629, 0.09797444939613342, 0.11725153774023056, 0.1365286260843277, 0.15580570697784424, 0.17508280277252197, 0.1943598985671997, 0.21363697946071625, 0.2329140603542328, 0.2521911561489105, 0.27146825194358826, 0.2907453179359436, 0.31002241373062134, 0.3292995095252991, 0.3485766053199768, 0.36785370111465454, 0.3871307671070099, 0.4064078629016876, 0.42568495869636536, 0.4449620246887207, 0.46423912048339844, 0.48351621627807617, 0.5027933120727539, 0.5220704078674316, 0.5413475036621094, 0.5606245994567871, 0.5799016356468201, 0.5991787314414978, 0.6184558272361755, 0.6377329230308533]}, "gradients/encoder.encoder.layers.4.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 1.0, 0.0, 4.0, 1.0, 1.0, 2.0, 1.0, 4.0, 4.0, 8.0, 12.0, 8.0, 18.0, 21.0, 34.0, 46.0, 64.0, 130.0, 274.0, 503.0, 1056.0, 2597.0, 7297.0, 24141.0, 116223.0, 571132.0, 260319.0, 45852.0, 11850.0, 3925.0, 1584.0, 624.0, 356.0, 176.0, 103.0, 74.0, 36.0, 31.0, 19.0, 12.0, 3.0, 6.0, 7.0, 4.0, 3.0, 1.0, 0.0, 1.0, 1.0], "bins": [-0.229248046875, -0.2236175537109375, -0.217987060546875, -0.2123565673828125, -0.20672607421875, -0.2010955810546875, -0.195465087890625, -0.1898345947265625, -0.1842041015625, -0.1785736083984375, -0.172943115234375, -0.1673126220703125, -0.16168212890625, -0.1560516357421875, -0.150421142578125, -0.1447906494140625, -0.13916015625, -0.1335296630859375, -0.127899169921875, -0.1222686767578125, -0.11663818359375, -0.1110076904296875, -0.105377197265625, -0.0997467041015625, -0.0941162109375, -0.0884857177734375, -0.082855224609375, -0.0772247314453125, -0.07159423828125, -0.0659637451171875, -0.060333251953125, -0.0547027587890625, -0.049072265625, -0.0434417724609375, -0.037811279296875, -0.0321807861328125, -0.02655029296875, -0.0209197998046875, -0.015289306640625, -0.0096588134765625, -0.0040283203125, 0.0016021728515625, 0.007232666015625, 0.0128631591796875, 0.01849365234375, 0.0241241455078125, 0.029754638671875, 0.0353851318359375, 0.041015625, 0.0466461181640625, 0.052276611328125, 0.0579071044921875, 0.06353759765625, 0.0691680908203125, 0.074798583984375, 0.0804290771484375, 0.0860595703125, 0.0916900634765625, 0.097320556640625, 0.1029510498046875, 0.10858154296875, 0.1142120361328125, 0.119842529296875, 0.1254730224609375, 0.131103515625]}, "gradients/encoder.encoder.layers.4.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 3.0, 0.0, 3.0, 1.0, 0.0, 5.0, 11.0, 8.0, 25.0, 25.0, 41.0, 51.0, 59.0, 78.0, 69.0, 91.0, 88.0, 79.0, 95.0, 69.0, 54.0, 52.0, 36.0, 26.0, 15.0, 11.0, 9.0, 6.0, 3.0, 1.0, 0.0, 1.0, 1.0, 2.0, 1.0, 1.0], "bins": [-0.1280517578125, -0.12516450881958008, -0.12227725982666016, -0.11939001083374023, -0.11650276184082031, -0.11361551284790039, -0.11072826385498047, -0.10784101486206055, -0.10495376586914062, -0.1020665168762207, -0.09917926788330078, -0.09629201889038086, -0.09340476989746094, -0.09051752090454102, -0.0876302719116211, -0.08474302291870117, -0.08185577392578125, -0.07896852493286133, -0.0760812759399414, -0.07319402694702148, -0.07030677795410156, -0.06741952896118164, -0.06453227996826172, -0.0616450309753418, -0.058757781982421875, -0.05587053298950195, -0.05298328399658203, -0.05009603500366211, -0.04720878601074219, -0.044321537017822266, -0.041434288024902344, -0.03854703903198242, -0.0356597900390625, -0.03277254104614258, -0.029885292053222656, -0.026998043060302734, -0.024110794067382812, -0.02122354507446289, -0.01833629608154297, -0.015449047088623047, -0.012561798095703125, -0.009674549102783203, -0.006787300109863281, -0.0039000511169433594, -0.0010128021240234375, 0.0018744468688964844, 0.004761695861816406, 0.007648944854736328, 0.01053619384765625, 0.013423442840576172, 0.016310691833496094, 0.019197940826416016, 0.022085189819335938, 0.02497243881225586, 0.02785968780517578, 0.030746936798095703, 0.033634185791015625, 0.03652143478393555, 0.03940868377685547, 0.04229593276977539, 0.04518318176269531, 0.048070430755615234, 0.050957679748535156, 0.05384492874145508, 0.056732177734375]}, "gradients/encoder.encoder.layers.4.attention.v_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 4.0, 1.0, 2.0, 3.0, 1.0, 3.0, 4.0, 9.0, 9.0, 7.0, 10.0, 10.0, 25.0, 28.0, 55.0, 65.0, 86.0, 123.0, 163.0, 250.0, 353.0, 597.0, 1035.0, 2048.0, 4585.0, 14187.0, 70623.0, 679082.0, 229406.0, 30685.0, 8179.0, 3080.0, 1506.0, 834.0, 520.0, 287.0, 206.0, 136.0, 112.0, 59.0, 45.0, 30.0, 22.0, 34.0, 15.0, 14.0, 7.0, 3.0, 10.0, 2.0, 4.0, 1.0, 4.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.200927734375, -0.19439697265625, -0.1878662109375, -0.18133544921875, -0.1748046875, -0.16827392578125, -0.1617431640625, -0.15521240234375, -0.148681640625, -0.14215087890625, -0.1356201171875, -0.12908935546875, -0.12255859375, -0.11602783203125, -0.1094970703125, -0.10296630859375, -0.096435546875, -0.08990478515625, -0.0833740234375, -0.07684326171875, -0.0703125, -0.06378173828125, -0.0572509765625, -0.05072021484375, -0.044189453125, -0.03765869140625, -0.0311279296875, -0.02459716796875, -0.01806640625, -0.01153564453125, -0.0050048828125, 0.00152587890625, 0.008056640625, 0.01458740234375, 0.0211181640625, 0.02764892578125, 0.0341796875, 0.04071044921875, 0.0472412109375, 0.05377197265625, 0.060302734375, 0.06683349609375, 0.0733642578125, 0.07989501953125, 0.08642578125, 0.09295654296875, 0.0994873046875, 0.10601806640625, 0.112548828125, 0.11907958984375, 0.1256103515625, 0.13214111328125, 0.138671875, 0.14520263671875, 0.1517333984375, 0.15826416015625, 0.164794921875, 0.17132568359375, 0.1778564453125, 0.18438720703125, 0.19091796875, 0.19744873046875, 0.2039794921875, 0.21051025390625, 0.217041015625]}, "gradients/encoder.encoder.layers.4.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 3.0, 1.0, 0.0, 0.0, 1.0, 2.0, 2.0, 4.0, 3.0, 6.0, 2.0, 1.0, 7.0, 4.0, 5.0, 11.0, 15.0, 15.0, 15.0, 25.0, 28.0, 29.0, 30.0, 47.0, 52.0, 42.0, 43.0, 47.0, 49.0, 49.0, 39.0, 42.0, 44.0, 42.0, 37.0, 36.0, 19.0, 32.0, 24.0, 30.0, 24.0, 18.0, 21.0, 15.0, 13.0, 6.0, 4.0, 7.0, 9.0, 2.0, 2.0, 4.0, 0.0, 3.0, 2.0, 1.0, 0.0, 3.0, 2.0, 2.0], "bins": [-0.1929931640625, -0.1872406005859375, -0.181488037109375, -0.1757354736328125, -0.16998291015625, -0.1642303466796875, -0.158477783203125, -0.1527252197265625, -0.14697265625, -0.1412200927734375, -0.135467529296875, -0.1297149658203125, -0.12396240234375, -0.1182098388671875, -0.112457275390625, -0.1067047119140625, -0.1009521484375, -0.0951995849609375, -0.089447021484375, -0.0836944580078125, -0.07794189453125, -0.0721893310546875, -0.066436767578125, -0.0606842041015625, -0.054931640625, -0.0491790771484375, -0.043426513671875, -0.0376739501953125, -0.03192138671875, -0.0261688232421875, -0.020416259765625, -0.0146636962890625, -0.0089111328125, -0.0031585693359375, 0.002593994140625, 0.0083465576171875, 0.01409912109375, 0.0198516845703125, 0.025604248046875, 0.0313568115234375, 0.037109375, 0.0428619384765625, 0.048614501953125, 0.0543670654296875, 0.06011962890625, 0.0658721923828125, 0.071624755859375, 0.0773773193359375, 0.0831298828125, 0.0888824462890625, 0.094635009765625, 0.1003875732421875, 0.10614013671875, 0.1118927001953125, 0.117645263671875, 0.1233978271484375, 0.129150390625, 0.1349029541015625, 0.140655517578125, 0.1464080810546875, 0.15216064453125, 0.1579132080078125, 0.163665771484375, 0.1694183349609375, 0.1751708984375]}, "gradients/encoder.encoder.layers.4.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 2.0, 1.0, 3.0, 3.0, 3.0, 7.0, 8.0, 10.0, 19.0, 20.0, 48.0, 58.0, 81.0, 149.0, 201.0, 351.0, 578.0, 919.0, 1556.0, 2642.0, 4858.0, 9422.0, 22054.0, 68986.0, 546072.0, 296608.0, 55662.0, 19228.0, 8498.0, 4343.0, 2501.0, 1392.0, 914.0, 493.0, 347.0, 189.0, 124.0, 63.0, 59.0, 40.0, 24.0, 11.0, 7.0, 5.0, 4.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0611572265625, -0.058966636657714844, -0.05677604675292969, -0.05458545684814453, -0.052394866943359375, -0.05020427703857422, -0.04801368713378906, -0.045823097229003906, -0.04363250732421875, -0.041441917419433594, -0.03925132751464844, -0.03706073760986328, -0.034870147705078125, -0.03267955780029297, -0.030488967895507812, -0.028298377990722656, -0.0261077880859375, -0.023917198181152344, -0.021726608276367188, -0.01953601837158203, -0.017345428466796875, -0.015154838562011719, -0.012964248657226562, -0.010773658752441406, -0.00858306884765625, -0.006392478942871094, -0.0042018890380859375, -0.0020112991333007812, 0.000179290771484375, 0.0023698806762695312, 0.0045604705810546875, 0.006751060485839844, 0.008941650390625, 0.011132240295410156, 0.013322830200195312, 0.015513420104980469, 0.017704010009765625, 0.01989459991455078, 0.022085189819335938, 0.024275779724121094, 0.02646636962890625, 0.028656959533691406, 0.030847549438476562, 0.03303813934326172, 0.035228729248046875, 0.03741931915283203, 0.03960990905761719, 0.041800498962402344, 0.0439910888671875, 0.046181678771972656, 0.04837226867675781, 0.05056285858154297, 0.052753448486328125, 0.05494403839111328, 0.05713462829589844, 0.059325218200683594, 0.06151580810546875, 0.0637063980102539, 0.06589698791503906, 0.06808757781982422, 0.07027816772460938, 0.07246875762939453, 0.07465934753417969, 0.07684993743896484, 0.07904052734375]}, "gradients/encoder.encoder.layers.4.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 3.0, 2.0, 1.0, 2.0, 3.0, 4.0, 3.0, 3.0, 4.0, 5.0, 3.0, 7.0, 7.0, 20.0, 10.0, 15.0, 18.0, 39.0, 56.0, 86.0, 157.0, 174.0, 123.0, 81.0, 46.0, 30.0, 27.0, 19.0, 8.0, 2.0, 11.0, 8.0, 5.0, 5.0, 5.0, 1.0, 2.0, 4.0, 1.0, 2.0, 4.0, 3.0, 2.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0], "bins": [-3.6835670471191406e-05, -3.5732053220272064e-05, -3.462843596935272e-05, -3.352481871843338e-05, -3.242120146751404e-05, -3.1317584216594696e-05, -3.0213966965675354e-05, -2.9110349714756012e-05, -2.800673246383667e-05, -2.6903115212917328e-05, -2.5799497961997986e-05, -2.4695880711078644e-05, -2.3592263460159302e-05, -2.248864620923996e-05, -2.1385028958320618e-05, -2.0281411707401276e-05, -1.9177794456481934e-05, -1.807417720556259e-05, -1.697055995464325e-05, -1.5866942703723907e-05, -1.4763325452804565e-05, -1.3659708201885223e-05, -1.2556090950965881e-05, -1.145247370004654e-05, -1.0348856449127197e-05, -9.245239198207855e-06, -8.141621947288513e-06, -7.038004696369171e-06, -5.934387445449829e-06, -4.830770194530487e-06, -3.727152943611145e-06, -2.623535692691803e-06, -1.519918441772461e-06, -4.163011908531189e-07, 6.873160600662231e-07, 1.7909333109855652e-06, 2.8945505619049072e-06, 3.998167812824249e-06, 5.101785063743591e-06, 6.205402314662933e-06, 7.309019565582275e-06, 8.412636816501617e-06, 9.51625406742096e-06, 1.0619871318340302e-05, 1.1723488569259644e-05, 1.2827105820178986e-05, 1.3930723071098328e-05, 1.503434032201767e-05, 1.6137957572937012e-05, 1.7241574823856354e-05, 1.8345192074775696e-05, 1.9448809325695038e-05, 2.055242657661438e-05, 2.1656043827533722e-05, 2.2759661078453064e-05, 2.3863278329372406e-05, 2.4966895580291748e-05, 2.607051283121109e-05, 2.7174130082130432e-05, 2.8277747333049774e-05, 2.9381364583969116e-05, 3.0484981834888458e-05, 3.15885990858078e-05, 3.269221633672714e-05, 3.3795833587646484e-05]}, "gradients/encoder.encoder.layers.4.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 3.0, 0.0, 1.0, 3.0, 0.0, 1.0, 6.0, 3.0, 4.0, 12.0, 9.0, 12.0, 21.0, 29.0, 25.0, 55.0, 70.0, 107.0, 196.0, 344.0, 597.0, 1209.0, 2640.0, 6625.0, 20235.0, 94021.0, 728943.0, 151577.0, 27459.0, 8353.0, 3052.0, 1355.0, 635.0, 355.0, 214.0, 123.0, 77.0, 52.0, 40.0, 32.0, 18.0, 14.0, 10.0, 7.0, 1.0, 6.0, 4.0, 7.0, 6.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09173583984375, -0.08873939514160156, -0.08574295043945312, -0.08274650573730469, -0.07975006103515625, -0.07675361633300781, -0.07375717163085938, -0.07076072692871094, -0.0677642822265625, -0.06476783752441406, -0.061771392822265625, -0.05877494812011719, -0.05577850341796875, -0.05278205871582031, -0.049785614013671875, -0.04678916931152344, -0.043792724609375, -0.04079627990722656, -0.037799835205078125, -0.03480339050292969, -0.03180694580078125, -0.028810501098632812, -0.025814056396484375, -0.022817611694335938, -0.0198211669921875, -0.016824722290039062, -0.013828277587890625, -0.010831832885742188, -0.00783538818359375, -0.0048389434814453125, -0.001842498779296875, 0.0011539459228515625, 0.004150390625, 0.0071468353271484375, 0.010143280029296875, 0.013139724731445312, 0.01613616943359375, 0.019132614135742188, 0.022129058837890625, 0.025125503540039062, 0.0281219482421875, 0.031118392944335938, 0.034114837646484375, 0.03711128234863281, 0.04010772705078125, 0.04310417175292969, 0.046100616455078125, 0.04909706115722656, 0.052093505859375, 0.05508995056152344, 0.058086395263671875, 0.06108283996582031, 0.06407928466796875, 0.06707572937011719, 0.07007217407226562, 0.07306861877441406, 0.0760650634765625, 0.07906150817871094, 0.08205795288085938, 0.08505439758300781, 0.08805084228515625, 0.09104728698730469, 0.09404373168945312, 0.09704017639160156, 0.10003662109375]}, "gradients/encoder.encoder.layers.4.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 4.0, 4.0, 2.0, 3.0, 5.0, 6.0, 3.0, 9.0, 7.0, 3.0, 8.0, 17.0, 18.0, 27.0, 39.0, 39.0, 56.0, 72.0, 91.0, 78.0, 95.0, 84.0, 78.0, 60.0, 44.0, 35.0, 25.0, 17.0, 18.0, 9.0, 13.0, 7.0, 9.0, 3.0, 7.0, 1.0, 3.0, 2.0, 3.0, 0.0, 0.0, 0.0, 1.0, 4.0, 0.0, 0.0, 0.0, 1.0, 1.0, 3.0, 2.0], "bins": [-0.07421875, -0.0720052719116211, -0.06979179382324219, -0.06757831573486328, -0.06536483764648438, -0.06315135955810547, -0.06093788146972656, -0.058724403381347656, -0.05651092529296875, -0.054297447204589844, -0.05208396911621094, -0.04987049102783203, -0.047657012939453125, -0.04544353485107422, -0.04323005676269531, -0.041016578674316406, -0.0388031005859375, -0.036589622497558594, -0.03437614440917969, -0.03216266632080078, -0.029949188232421875, -0.02773571014404297, -0.025522232055664062, -0.023308753967285156, -0.02109527587890625, -0.018881797790527344, -0.016668319702148438, -0.014454841613769531, -0.012241363525390625, -0.010027885437011719, -0.007814407348632812, -0.005600929260253906, -0.003387451171875, -0.0011739730834960938, 0.0010395050048828125, 0.0032529830932617188, 0.005466461181640625, 0.007679939270019531, 0.009893417358398438, 0.012106895446777344, 0.01432037353515625, 0.016533851623535156, 0.018747329711914062, 0.02096080780029297, 0.023174285888671875, 0.02538776397705078, 0.027601242065429688, 0.029814720153808594, 0.0320281982421875, 0.034241676330566406, 0.03645515441894531, 0.03866863250732422, 0.040882110595703125, 0.04309558868408203, 0.04530906677246094, 0.047522544860839844, 0.04973602294921875, 0.051949501037597656, 0.05416297912597656, 0.05637645721435547, 0.058589935302734375, 0.06080341339111328, 0.06301689147949219, 0.0652303695678711, 0.06744384765625]}, "gradients/encoder.encoder.layers.4.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 3.0, 1.0, 1.0, 0.0, 2.0, 5.0, 5.0, 21.0, 28.0, 69.0, 115.0, 225.0, 252.0, 122.0, 71.0, 38.0, 22.0, 14.0, 7.0, 2.0, 3.0, 3.0, 0.0, 3.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.1968374252319336, -2.130263328552246, -2.0636892318725586, -1.997115135192871, -1.9305410385131836, -1.863966941833496, -1.7973928451538086, -1.730818748474121, -1.6642446517944336, -1.597670555114746, -1.5310964584350586, -1.464522361755371, -1.3979482650756836, -1.331374168395996, -1.2648000717163086, -1.198225975036621, -1.1316518783569336, -1.065077781677246, -0.9985036849975586, -0.9319295883178711, -0.8653554916381836, -0.7987813949584961, -0.7322072982788086, -0.6656332015991211, -0.5990591049194336, -0.5324850082397461, -0.4659109115600586, -0.3993368148803711, -0.3327627182006836, -0.2661886215209961, -0.1996145248413086, -0.1330404281616211, -0.06646609306335449, 0.00010800361633300781, 0.06668210029602051, 0.133256196975708, 0.1998302936553955, 0.266404390335083, 0.3329784870147705, 0.399552583694458, 0.4661266803741455, 0.532700777053833, 0.5992748737335205, 0.665848970413208, 0.7324230670928955, 0.798997163772583, 0.8655712604522705, 0.932145357131958, 0.9987194538116455, 1.065293550491333, 1.1318676471710205, 1.198441743850708, 1.2650158405303955, 1.331589937210083, 1.3981640338897705, 1.464738130569458, 1.5313122272491455, 1.597886323928833, 1.6644604206085205, 1.731034517288208, 1.7976086139678955, 1.864182710647583, 1.9307568073272705, 1.997330904006958, 2.0639050006866455]}, "gradients/encoder.encoder.layers.4.layer_norm.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 3.0, 1.0, 9.0, 2.0, 9.0, 8.0, 12.0, 10.0, 7.0, 11.0, 12.0, 22.0, 16.0, 22.0, 13.0, 24.0, 34.0, 33.0, 29.0, 37.0, 36.0, 46.0, 64.0, 71.0, 69.0, 67.0, 39.0, 33.0, 42.0, 34.0, 15.0, 26.0, 19.0, 13.0, 20.0, 24.0, 16.0, 11.0, 11.0, 10.0, 6.0, 9.0, 5.0, 2.0, 2.0, 3.0, 3.0, 4.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7328797578811646, -0.7089084982872009, -0.6849372386932373, -0.6609660387039185, -0.6369947791099548, -0.6130235195159912, -0.5890522599220276, -0.565081000328064, -0.5411098003387451, -0.5171385407447815, -0.49316731095314026, -0.46919605135917664, -0.4452248215675354, -0.4212535619735718, -0.39728230237960815, -0.37331104278564453, -0.3493397831916809, -0.3253685235977173, -0.30139729380607605, -0.2774260342121124, -0.2534548044204712, -0.22948354482650757, -0.20551228523254395, -0.18154104053974152, -0.1575697958469391, -0.13359855115413666, -0.10962729901075363, -0.0856560468673706, -0.061684802174568176, -0.03771355748176575, -0.013742297887802124, 0.010228946805000305, 0.03420025110244751, 0.05817149952054024, 0.08214274793863297, 0.10611400008201599, 0.13008524477481842, 0.15405648946762085, 0.17802774906158447, 0.2019989937543869, 0.22597023844718933, 0.24994148313999176, 0.2739127278327942, 0.2978839874267578, 0.32185524702072144, 0.34582647681236267, 0.3697977364063263, 0.39376896619796753, 0.41774022579193115, 0.4417114853858948, 0.465682715177536, 0.48965397477149963, 0.5136252045631409, 0.5375964641571045, 0.5615677237510681, 0.5855389833450317, 0.6095101833343506, 0.6334814429283142, 0.6574527025222778, 0.6814239025115967, 0.7053951621055603, 0.7293664216995239, 0.7533376812934875, 0.7773089408874512, 0.8012802004814148]}, "gradients/encoder.encoder.layers.3.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 2.0, 0.0, 4.0, 2.0, 1.0, 4.0, 4.0, 11.0, 7.0, 7.0, 19.0, 31.0, 47.0, 62.0, 122.0, 199.0, 394.0, 1078.0, 3934.0, 30024.0, 3282362.0, 852971.0, 18537.0, 2996.0, 788.0, 313.0, 146.0, 97.0, 49.0, 29.0, 14.0, 12.0, 9.0, 4.0, 3.0, 6.0, 2.0, 1.0, 3.0, 1.0, 0.0, 1.0], "bins": [-0.3447265625, -0.3366851806640625, -0.328643798828125, -0.3206024169921875, -0.31256103515625, -0.3045196533203125, -0.296478271484375, -0.2884368896484375, -0.2803955078125, -0.2723541259765625, -0.264312744140625, -0.2562713623046875, -0.24822998046875, -0.2401885986328125, -0.232147216796875, -0.2241058349609375, -0.216064453125, -0.2080230712890625, -0.199981689453125, -0.1919403076171875, -0.18389892578125, -0.1758575439453125, -0.167816162109375, -0.1597747802734375, -0.1517333984375, -0.1436920166015625, -0.135650634765625, -0.1276092529296875, -0.11956787109375, -0.1115264892578125, -0.103485107421875, -0.0954437255859375, -0.08740234375, -0.0793609619140625, -0.071319580078125, -0.0632781982421875, -0.05523681640625, -0.0471954345703125, -0.039154052734375, -0.0311126708984375, -0.0230712890625, -0.0150299072265625, -0.006988525390625, 0.0010528564453125, 0.00909423828125, 0.0171356201171875, 0.025177001953125, 0.0332183837890625, 0.041259765625, 0.0493011474609375, 0.057342529296875, 0.0653839111328125, 0.07342529296875, 0.0814666748046875, 0.089508056640625, 0.0975494384765625, 0.1055908203125, 0.1136322021484375, 0.121673583984375, 0.1297149658203125, 0.13775634765625, 0.1457977294921875, 0.153839111328125, 0.1618804931640625, 0.169921875]}, "gradients/encoder.encoder.layers.3.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 0.0, 2.0, 2.0, 3.0, 5.0, 13.0, 22.0, 37.0, 44.0, 44.0, 63.0, 72.0, 80.0, 99.0, 85.0, 107.0, 78.0, 72.0, 51.0, 42.0, 33.0, 19.0, 14.0, 7.0, 9.0, 5.0, 4.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0], "bins": [-0.1253662109375, -0.12258625030517578, -0.11980628967285156, -0.11702632904052734, -0.11424636840820312, -0.1114664077758789, -0.10868644714355469, -0.10590648651123047, -0.10312652587890625, -0.10034656524658203, -0.09756660461425781, -0.0947866439819336, -0.09200668334960938, -0.08922672271728516, -0.08644676208496094, -0.08366680145263672, -0.0808868408203125, -0.07810688018798828, -0.07532691955566406, -0.07254695892333984, -0.06976699829101562, -0.0669870376586914, -0.06420707702636719, -0.06142711639404297, -0.05864715576171875, -0.05586719512939453, -0.05308723449707031, -0.050307273864746094, -0.047527313232421875, -0.044747352600097656, -0.04196739196777344, -0.03918743133544922, -0.036407470703125, -0.03362751007080078, -0.030847549438476562, -0.028067588806152344, -0.025287628173828125, -0.022507667541503906, -0.019727706909179688, -0.01694774627685547, -0.01416778564453125, -0.011387825012207031, -0.008607864379882812, -0.005827903747558594, -0.003047943115234375, -0.00026798248291015625, 0.0025119781494140625, 0.005291938781738281, 0.0080718994140625, 0.010851860046386719, 0.013631820678710938, 0.016411781311035156, 0.019191741943359375, 0.021971702575683594, 0.024751663208007812, 0.02753162384033203, 0.03031158447265625, 0.03309154510498047, 0.03587150573730469, 0.038651466369628906, 0.041431427001953125, 0.044211387634277344, 0.04699134826660156, 0.04977130889892578, 0.05255126953125]}, "gradients/encoder.encoder.layers.3.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 2.0, 1.0, 3.0, 3.0, 4.0, 7.0, 9.0, 18.0, 18.0, 32.0, 56.0, 59.0, 99.0, 164.0, 309.0, 733.0, 2857.0, 41751.0, 4110041.0, 34522.0, 2227.0, 579.0, 261.0, 147.0, 117.0, 82.0, 58.0, 33.0, 24.0, 23.0, 12.0, 11.0, 8.0, 6.0, 7.0, 4.0, 4.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.48583984375, -0.4693603515625, -0.452880859375, -0.4364013671875, -0.419921875, -0.4034423828125, -0.386962890625, -0.3704833984375, -0.35400390625, -0.3375244140625, -0.321044921875, -0.3045654296875, -0.2880859375, -0.2716064453125, -0.255126953125, -0.2386474609375, -0.22216796875, -0.2056884765625, -0.189208984375, -0.1727294921875, -0.15625, -0.1397705078125, -0.123291015625, -0.1068115234375, -0.09033203125, -0.0738525390625, -0.057373046875, -0.0408935546875, -0.0244140625, -0.0079345703125, 0.008544921875, 0.0250244140625, 0.04150390625, 0.0579833984375, 0.074462890625, 0.0909423828125, 0.107421875, 0.1239013671875, 0.140380859375, 0.1568603515625, 0.17333984375, 0.1898193359375, 0.206298828125, 0.2227783203125, 0.2392578125, 0.2557373046875, 0.272216796875, 0.2886962890625, 0.30517578125, 0.3216552734375, 0.338134765625, 0.3546142578125, 0.37109375, 0.3875732421875, 0.404052734375, 0.4205322265625, 0.43701171875, 0.4534912109375, 0.469970703125, 0.4864501953125, 0.5029296875, 0.5194091796875, 0.535888671875, 0.5523681640625, 0.56884765625]}, "gradients/encoder.encoder.layers.3.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 3.0, 1.0, 1.0, 4.0, 4.0, 4.0, 15.0, 25.0, 30.0, 67.0, 159.0, 754.0, 2038.0, 708.0, 157.0, 58.0, 26.0, 18.0, 7.0, 6.0, 2.0, 0.0, 0.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1748046875, -0.1634979248046875, -0.152191162109375, -0.1408843994140625, -0.12957763671875, -0.1182708740234375, -0.106964111328125, -0.0956573486328125, -0.0843505859375, -0.0730438232421875, -0.061737060546875, -0.0504302978515625, -0.03912353515625, -0.0278167724609375, -0.016510009765625, -0.0052032470703125, 0.006103515625, 0.0174102783203125, 0.028717041015625, 0.0400238037109375, 0.05133056640625, 0.0626373291015625, 0.073944091796875, 0.0852508544921875, 0.0965576171875, 0.1078643798828125, 0.119171142578125, 0.1304779052734375, 0.14178466796875, 0.1530914306640625, 0.164398193359375, 0.1757049560546875, 0.18701171875, 0.1983184814453125, 0.209625244140625, 0.2209320068359375, 0.23223876953125, 0.2435455322265625, 0.254852294921875, 0.2661590576171875, 0.2774658203125, 0.2887725830078125, 0.300079345703125, 0.3113861083984375, 0.32269287109375, 0.3339996337890625, 0.345306396484375, 0.3566131591796875, 0.367919921875, 0.3792266845703125, 0.390533447265625, 0.4018402099609375, 0.41314697265625, 0.4244537353515625, 0.435760498046875, 0.4470672607421875, 0.4583740234375, 0.4696807861328125, 0.480987548828125, 0.4922943115234375, 0.50360107421875, 0.5149078369140625, 0.526214599609375, 0.5375213623046875, 0.548828125]}, "gradients/encoder.encoder.layers.3.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 2.0, 1.0, 2.0, 2.0, 11.0, 22.0, 36.0, 48.0, 88.0, 153.0, 176.0, 178.0, 127.0, 66.0, 35.0, 15.0, 10.0, 12.0, 6.0, 4.0, 1.0, 7.0, 5.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.3489787578582764, -2.2850213050842285, -2.2210640907287598, -2.157106637954712, -2.093149423599243, -2.0291919708251953, -1.965234637260437, -1.9012773036956787, -1.8373199701309204, -1.773362636566162, -1.7094053030014038, -1.6454479694366455, -1.5814905166625977, -1.517533302307129, -1.453575849533081, -1.3896185159683228, -1.3256611824035645, -1.2617038488388062, -1.1977465152740479, -1.1337891817092896, -1.0698318481445312, -1.0058743953704834, -0.9419170618057251, -0.8779597282409668, -0.8140023946762085, -0.7500450611114502, -0.6860877275466919, -0.6221303343772888, -0.5581730008125305, -0.4942156672477722, -0.43025830388069153, -0.36630094051361084, -0.302343487739563, -0.2383861392736435, -0.174428790807724, -0.1104714423418045, -0.04651409387588501, 0.01744323968887329, 0.08140060305595398, 0.14535796642303467, 0.20931529998779297, 0.27327263355255127, 0.33722999691963196, 0.40118736028671265, 0.46514469385147095, 0.5291020274162292, 0.5930594205856323, 0.6570167541503906, 0.7209740877151489, 0.7849314212799072, 0.8488887548446655, 0.9128461480140686, 0.9768034815788269, 1.0407607555389404, 1.1047182083129883, 1.1686755418777466, 1.2326328754425049, 1.2965902090072632, 1.3605475425720215, 1.4245048761367798, 1.488462209701538, 1.552419662475586, 1.6163769960403442, 1.6803343296051025, 1.7442916631698608]}, "gradients/encoder.encoder.layers.3.final_layer_norm.bias": {"_type": "histogram", "values": [3.0, 2.0, 1.0, 1.0, 2.0, 2.0, 3.0, 9.0, 5.0, 10.0, 13.0, 15.0, 16.0, 23.0, 31.0, 40.0, 36.0, 38.0, 53.0, 60.0, 52.0, 65.0, 58.0, 54.0, 53.0, 41.0, 39.0, 52.0, 42.0, 39.0, 32.0, 35.0, 29.0, 14.0, 15.0, 9.0, 11.0, 7.0, 6.0, 1.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.5868810415267944, -0.5615463852882385, -0.5362116694450378, -0.5108770132064819, -0.48554229736328125, -0.46020764112472534, -0.43487292528152466, -0.40953826904296875, -0.38420355319976807, -0.35886886715888977, -0.3335341811180115, -0.3081994950771332, -0.2828648090362549, -0.257530152797699, -0.23219545185565948, -0.2068607658147812, -0.1815260946750641, -0.1561914086341858, -0.1308567225933075, -0.1055220440030098, -0.0801873579621315, -0.0548526793718338, -0.029517993330955505, -0.0041833072900772095, 0.021151378750801086, 0.04648606479167938, 0.07182075083255768, 0.09715542942285538, 0.12249011546373367, 0.14782479405403137, 0.17315948009490967, 0.19849416613578796, 0.22382885217666626, 0.24916353821754456, 0.27449822425842285, 0.29983291029930115, 0.32516759634017944, 0.35050225257873535, 0.37583696842193604, 0.40117162466049194, 0.4265063405036926, 0.4518410265445709, 0.4771757125854492, 0.5025103688240051, 0.5278450846672058, 0.5531797409057617, 0.5785144567489624, 0.6038491129875183, 0.6291837692260742, 0.6545184254646301, 0.6798531413078308, 0.7051877975463867, 0.7305225133895874, 0.7558571696281433, 0.781191885471344, 0.8065265417098999, 0.8318612575531006, 0.8571959137916565, 0.8825306296348572, 0.9078652858734131, 0.9332000017166138, 0.9585346579551697, 0.9838693737983704, 1.0092040300369263, 1.034538745880127]}, "gradients/encoder.encoder.layers.3.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 4.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 3.0, 1.0, 1.0, 4.0, 9.0, 9.0, 7.0, 30.0, 26.0, 52.0, 75.0, 128.0, 230.0, 412.0, 809.0, 1532.0, 3455.0, 8049.0, 20745.0, 60022.0, 178617.0, 368388.0, 261071.0, 93115.0, 30932.0, 11662.0, 4765.0, 2111.0, 1100.0, 498.0, 290.0, 144.0, 102.0, 55.0, 39.0, 24.0, 12.0, 9.0, 14.0, 7.0, 4.0, 2.0, 0.0, 1.0, 0.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.11566162109375, -0.1122293472290039, -0.10879707336425781, -0.10536479949951172, -0.10193252563476562, -0.09850025177001953, -0.09506797790527344, -0.09163570404052734, -0.08820343017578125, -0.08477115631103516, -0.08133888244628906, -0.07790660858154297, -0.07447433471679688, -0.07104206085205078, -0.06760978698730469, -0.0641775131225586, -0.0607452392578125, -0.057312965393066406, -0.05388069152832031, -0.05044841766357422, -0.047016143798828125, -0.04358386993408203, -0.04015159606933594, -0.036719322204589844, -0.03328704833984375, -0.029854774475097656, -0.026422500610351562, -0.02299022674560547, -0.019557952880859375, -0.01612567901611328, -0.012693405151367188, -0.009261131286621094, -0.005828857421875, -0.0023965835571289062, 0.0010356903076171875, 0.004467964172363281, 0.007900238037109375, 0.011332511901855469, 0.014764785766601562, 0.018197059631347656, 0.02162933349609375, 0.025061607360839844, 0.028493881225585938, 0.03192615509033203, 0.035358428955078125, 0.03879070281982422, 0.04222297668457031, 0.045655250549316406, 0.0490875244140625, 0.052519798278808594, 0.05595207214355469, 0.05938434600830078, 0.06281661987304688, 0.06624889373779297, 0.06968116760253906, 0.07311344146728516, 0.07654571533203125, 0.07997798919677734, 0.08341026306152344, 0.08684253692626953, 0.09027481079101562, 0.09370708465576172, 0.09713935852050781, 0.1005716323852539, 0.10400390625]}, "gradients/encoder.encoder.layers.3.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0, 4.0, 3.0, 6.0, 6.0, 18.0, 10.0, 19.0, 25.0, 48.0, 57.0, 70.0, 76.0, 60.0, 67.0, 87.0, 65.0, 79.0, 68.0, 62.0, 41.0, 43.0, 34.0, 25.0, 12.0, 6.0, 12.0, 4.0, 3.0, 2.0, 2.0, 0.0, 2.0, 0.0, 2.0], "bins": [-0.1256103515625, -0.12277889251708984, -0.11994743347167969, -0.11711597442626953, -0.11428451538085938, -0.11145305633544922, -0.10862159729003906, -0.1057901382446289, -0.10295867919921875, -0.1001272201538086, -0.09729576110839844, -0.09446430206298828, -0.09163284301757812, -0.08880138397216797, -0.08596992492675781, -0.08313846588134766, -0.0803070068359375, -0.07747554779052734, -0.07464408874511719, -0.07181262969970703, -0.06898117065429688, -0.06614971160888672, -0.06331825256347656, -0.060486793518066406, -0.05765533447265625, -0.054823875427246094, -0.05199241638183594, -0.04916095733642578, -0.046329498291015625, -0.04349803924560547, -0.04066658020019531, -0.037835121154785156, -0.035003662109375, -0.032172203063964844, -0.029340744018554688, -0.02650928497314453, -0.023677825927734375, -0.02084636688232422, -0.018014907836914062, -0.015183448791503906, -0.01235198974609375, -0.009520530700683594, -0.0066890716552734375, -0.0038576126098632812, -0.001026153564453125, 0.0018053054809570312, 0.0046367645263671875, 0.007468223571777344, 0.0102996826171875, 0.013131141662597656, 0.015962600708007812, 0.01879405975341797, 0.021625518798828125, 0.02445697784423828, 0.027288436889648438, 0.030119895935058594, 0.03295135498046875, 0.035782814025878906, 0.03861427307128906, 0.04144573211669922, 0.044277191162109375, 0.04710865020751953, 0.04994010925292969, 0.052771568298339844, 0.05560302734375]}, "gradients/encoder.encoder.layers.3.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 2.0, 3.0, 0.0, 6.0, 5.0, 7.0, 9.0, 9.0, 13.0, 23.0, 39.0, 30.0, 58.0, 97.0, 131.0, 192.0, 330.0, 546.0, 819.0, 1613.0, 3393.0, 8352.0, 29923.0, 144633.0, 620417.0, 183337.0, 36562.0, 9968.0, 3670.0, 1769.0, 987.0, 592.0, 371.0, 206.0, 130.0, 115.0, 53.0, 50.0, 37.0, 21.0, 12.0, 7.0, 8.0, 7.0, 5.0, 1.0, 5.0, 2.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.174560546875, -0.16903114318847656, -0.16350173950195312, -0.1579723358154297, -0.15244293212890625, -0.1469135284423828, -0.14138412475585938, -0.13585472106933594, -0.1303253173828125, -0.12479591369628906, -0.11926651000976562, -0.11373710632324219, -0.10820770263671875, -0.10267829895019531, -0.09714889526367188, -0.09161949157714844, -0.086090087890625, -0.08056068420410156, -0.07503128051757812, -0.06950187683105469, -0.06397247314453125, -0.05844306945800781, -0.052913665771484375, -0.04738426208496094, -0.0418548583984375, -0.03632545471191406, -0.030796051025390625, -0.025266647338867188, -0.01973724365234375, -0.014207839965820312, -0.008678436279296875, -0.0031490325927734375, 0.00238037109375, 0.007909774780273438, 0.013439178466796875, 0.018968582153320312, 0.02449798583984375, 0.030027389526367188, 0.035556793212890625, 0.04108619689941406, 0.0466156005859375, 0.05214500427246094, 0.057674407958984375, 0.06320381164550781, 0.06873321533203125, 0.07426261901855469, 0.07979202270507812, 0.08532142639160156, 0.090850830078125, 0.09638023376464844, 0.10190963745117188, 0.10743904113769531, 0.11296844482421875, 0.11849784851074219, 0.12402725219726562, 0.12955665588378906, 0.1350860595703125, 0.14061546325683594, 0.14614486694335938, 0.1516742706298828, 0.15720367431640625, 0.1627330780029297, 0.16826248168945312, 0.17379188537597656, 0.1793212890625]}, "gradients/encoder.encoder.layers.3.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 2.0, 0.0, 2.0, 2.0, 0.0, 2.0, 2.0, 7.0, 6.0, 5.0, 6.0, 6.0, 13.0, 21.0, 16.0, 23.0, 16.0, 22.0, 27.0, 43.0, 30.0, 31.0, 36.0, 44.0, 33.0, 54.0, 48.0, 46.0, 68.0, 57.0, 53.0, 35.0, 35.0, 26.0, 37.0, 27.0, 24.0, 13.0, 13.0, 20.0, 14.0, 5.0, 9.0, 8.0, 11.0, 6.0, 2.0, 2.0, 2.0, 2.0, 1.0, 1.0, 3.0, 0.0, 0.0, 1.0], "bins": [-0.21923828125, -0.2129840850830078, -0.20672988891601562, -0.20047569274902344, -0.19422149658203125, -0.18796730041503906, -0.18171310424804688, -0.1754589080810547, -0.1692047119140625, -0.1629505157470703, -0.15669631958007812, -0.15044212341308594, -0.14418792724609375, -0.13793373107910156, -0.13167953491210938, -0.1254253387451172, -0.119171142578125, -0.11291694641113281, -0.10666275024414062, -0.10040855407714844, -0.09415435791015625, -0.08790016174316406, -0.08164596557617188, -0.07539176940917969, -0.0691375732421875, -0.06288337707519531, -0.056629180908203125, -0.05037498474121094, -0.04412078857421875, -0.03786659240722656, -0.031612396240234375, -0.025358200073242188, -0.01910400390625, -0.012849807739257812, -0.006595611572265625, -0.0003414154052734375, 0.00591278076171875, 0.012166976928710938, 0.018421173095703125, 0.024675369262695312, 0.0309295654296875, 0.03718376159667969, 0.043437957763671875, 0.04969215393066406, 0.05594635009765625, 0.06220054626464844, 0.06845474243164062, 0.07470893859863281, 0.080963134765625, 0.08721733093261719, 0.09347152709960938, 0.09972572326660156, 0.10597991943359375, 0.11223411560058594, 0.11848831176757812, 0.12474250793457031, 0.1309967041015625, 0.1372509002685547, 0.14350509643554688, 0.14975929260253906, 0.15601348876953125, 0.16226768493652344, 0.16852188110351562, 0.1747760772705078, 0.1810302734375]}, "gradients/encoder.encoder.layers.3.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 3.0, 1.0, 2.0, 4.0, 4.0, 6.0, 14.0, 10.0, 16.0, 20.0, 24.0, 44.0, 70.0, 79.0, 137.0, 177.0, 298.0, 546.0, 841.0, 1563.0, 3537.0, 10530.0, 60562.0, 799486.0, 143950.0, 17095.0, 4808.0, 1959.0, 1043.0, 595.0, 416.0, 243.0, 160.0, 80.0, 72.0, 46.0, 38.0, 23.0, 11.0, 19.0, 8.0, 8.0, 6.0, 2.0, 3.0, 1.0, 2.0, 0.0, 1.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.1802978515625, -0.1746044158935547, -0.16891098022460938, -0.16321754455566406, -0.15752410888671875, -0.15183067321777344, -0.14613723754882812, -0.1404438018798828, -0.1347503662109375, -0.1290569305419922, -0.12336349487304688, -0.11767005920410156, -0.11197662353515625, -0.10628318786621094, -0.10058975219726562, -0.09489631652832031, -0.089202880859375, -0.08350944519042969, -0.07781600952148438, -0.07212257385253906, -0.06642913818359375, -0.06073570251464844, -0.055042266845703125, -0.04934883117675781, -0.0436553955078125, -0.03796195983886719, -0.032268524169921875, -0.026575088500976562, -0.02088165283203125, -0.015188217163085938, -0.009494781494140625, -0.0038013458251953125, 0.00189208984375, 0.0075855255126953125, 0.013278961181640625, 0.018972396850585938, 0.02466583251953125, 0.030359268188476562, 0.036052703857421875, 0.04174613952636719, 0.0474395751953125, 0.05313301086425781, 0.058826446533203125, 0.06451988220214844, 0.07021331787109375, 0.07590675354003906, 0.08160018920898438, 0.08729362487792969, 0.092987060546875, 0.09868049621582031, 0.10437393188476562, 0.11006736755371094, 0.11576080322265625, 0.12145423889160156, 0.12714767456054688, 0.1328411102294922, 0.1385345458984375, 0.1442279815673828, 0.14992141723632812, 0.15561485290527344, 0.16130828857421875, 0.16700172424316406, 0.17269515991210938, 0.1783885955810547, 0.18408203125]}, "gradients/encoder.encoder.layers.3.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 4.0, 3.0, 3.0, 3.0, 2.0, 6.0, 11.0, 19.0, 19.0, 33.0, 85.0, 138.0, 225.0, 209.0, 92.0, 63.0, 27.0, 23.0, 13.0, 7.0, 9.0, 2.0, 2.0, 2.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-9.274482727050781e-05, -9.016040712594986e-05, -8.75759869813919e-05, -8.499156683683395e-05, -8.2407146692276e-05, -7.982272654771805e-05, -7.72383064031601e-05, -7.465388625860214e-05, -7.206946611404419e-05, -6.948504596948624e-05, -6.690062582492828e-05, -6.431620568037033e-05, -6.173178553581238e-05, -5.9147365391254425e-05, -5.656294524669647e-05, -5.397852510213852e-05, -5.1394104957580566e-05, -4.8809684813022614e-05, -4.622526466846466e-05, -4.364084452390671e-05, -4.1056424379348755e-05, -3.84720042347908e-05, -3.588758409023285e-05, -3.3303163945674896e-05, -3.071874380111694e-05, -2.813432365655899e-05, -2.5549903512001038e-05, -2.2965483367443085e-05, -2.0381063222885132e-05, -1.779664307832718e-05, -1.5212222933769226e-05, -1.2627802789211273e-05, -1.004338264465332e-05, -7.4589625000953674e-06, -4.8745423555374146e-06, -2.2901222109794617e-06, 2.942979335784912e-07, 2.878718078136444e-06, 5.463138222694397e-06, 8.04755836725235e-06, 1.0631978511810303e-05, 1.3216398656368256e-05, 1.580081880092621e-05, 1.838523894548416e-05, 2.0969659090042114e-05, 2.3554079234600067e-05, 2.613849937915802e-05, 2.8722919523715973e-05, 3.1307339668273926e-05, 3.389175981283188e-05, 3.647617995738983e-05, 3.9060600101947784e-05, 4.164502024650574e-05, 4.422944039106369e-05, 4.681386053562164e-05, 4.9398280680179596e-05, 5.198270082473755e-05, 5.45671209692955e-05, 5.7151541113853455e-05, 5.973596125841141e-05, 6.232038140296936e-05, 6.490480154752731e-05, 6.748922169208527e-05, 7.007364183664322e-05, 7.265806198120117e-05]}, "gradients/encoder.encoder.layers.3.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 2.0, 1.0, 2.0, 2.0, 3.0, 2.0, 8.0, 13.0, 18.0, 30.0, 55.0, 124.0, 208.0, 412.0, 864.0, 2173.0, 6984.0, 39076.0, 784199.0, 191206.0, 16459.0, 4036.0, 1441.0, 609.0, 277.0, 157.0, 76.0, 51.0, 30.0, 12.0, 11.0, 10.0, 4.0, 3.0, 5.0, 0.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.2017822265625, -0.19500732421875, -0.188232421875, -0.18145751953125, -0.1746826171875, -0.16790771484375, -0.1611328125, -0.15435791015625, -0.1475830078125, -0.14080810546875, -0.134033203125, -0.12725830078125, -0.1204833984375, -0.11370849609375, -0.10693359375, -0.10015869140625, -0.0933837890625, -0.08660888671875, -0.079833984375, -0.07305908203125, -0.0662841796875, -0.05950927734375, -0.052734375, -0.04595947265625, -0.0391845703125, -0.03240966796875, -0.025634765625, -0.01885986328125, -0.0120849609375, -0.00531005859375, 0.00146484375, 0.00823974609375, 0.0150146484375, 0.02178955078125, 0.028564453125, 0.03533935546875, 0.0421142578125, 0.04888916015625, 0.0556640625, 0.06243896484375, 0.0692138671875, 0.07598876953125, 0.082763671875, 0.08953857421875, 0.0963134765625, 0.10308837890625, 0.10986328125, 0.11663818359375, 0.1234130859375, 0.13018798828125, 0.136962890625, 0.14373779296875, 0.1505126953125, 0.15728759765625, 0.1640625, 0.17083740234375, 0.1776123046875, 0.18438720703125, 0.191162109375, 0.19793701171875, 0.2047119140625, 0.21148681640625, 0.21826171875, 0.22503662109375, 0.2318115234375]}, "gradients/encoder.encoder.layers.3.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 0.0, 0.0, 3.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 7.0, 3.0, 17.0, 15.0, 24.0, 30.0, 54.0, 79.0, 127.0, 141.0, 172.0, 120.0, 60.0, 56.0, 37.0, 22.0, 14.0, 8.0, 5.0, 5.0, 2.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1253662109375, -0.1205596923828125, -0.115753173828125, -0.1109466552734375, -0.10614013671875, -0.1013336181640625, -0.096527099609375, -0.0917205810546875, -0.0869140625, -0.0821075439453125, -0.077301025390625, -0.0724945068359375, -0.06768798828125, -0.0628814697265625, -0.058074951171875, -0.0532684326171875, -0.0484619140625, -0.0436553955078125, -0.038848876953125, -0.0340423583984375, -0.02923583984375, -0.0244293212890625, -0.019622802734375, -0.0148162841796875, -0.010009765625, -0.0052032470703125, -0.000396728515625, 0.0044097900390625, 0.00921630859375, 0.0140228271484375, 0.018829345703125, 0.0236358642578125, 0.0284423828125, 0.0332489013671875, 0.038055419921875, 0.0428619384765625, 0.04766845703125, 0.0524749755859375, 0.057281494140625, 0.0620880126953125, 0.06689453125, 0.0717010498046875, 0.076507568359375, 0.0813140869140625, 0.08612060546875, 0.0909271240234375, 0.095733642578125, 0.1005401611328125, 0.1053466796875, 0.1101531982421875, 0.114959716796875, 0.1197662353515625, 0.12457275390625, 0.1293792724609375, 0.134185791015625, 0.1389923095703125, 0.143798828125, 0.1486053466796875, 0.153411865234375, 0.1582183837890625, 0.16302490234375, 0.1678314208984375, 0.172637939453125, 0.1774444580078125, 0.1822509765625]}, "gradients/encoder.encoder.layers.3.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 2.0, 2.0, 6.0, 6.0, 5.0, 19.0, 28.0, 89.0, 225.0, 404.0, 143.0, 45.0, 27.0, 7.0, 2.0, 3.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.8303780555725098, -3.711782455444336, -3.593186855316162, -3.4745912551879883, -3.3559954166412354, -3.2373998165130615, -3.1188042163848877, -3.000208616256714, -2.88161301612854, -2.763017416000366, -2.6444218158721924, -2.5258259773254395, -2.4072303771972656, -2.288634777069092, -2.170039176940918, -2.051443576812744, -1.9328478574752808, -1.814252257347107, -1.6956565380096436, -1.5770609378814697, -1.458465337753296, -1.339869737625122, -1.2212740182876587, -1.1026784181594849, -0.9840827584266663, -0.8654870986938477, -0.7468914985656738, -0.6282958388328552, -0.5097001791000366, -0.3911045789718628, -0.2725089192390442, -0.15391331911087036, -0.03531765937805176, 0.08327797800302505, 0.20187361538410187, 0.3204692602157593, 0.4390648901462555, 0.5576605200767517, 0.6762561798095703, 0.7948517799377441, 0.9134474396705627, 1.0320430994033813, 1.1506386995315552, 1.2692344188690186, 1.3878300189971924, 1.5064256191253662, 1.62502121925354, 1.7436168193817139, 1.8622125387191772, 1.980808138847351, 2.0994038581848145, 2.2179994583129883, 2.336595058441162, 2.455190658569336, 2.5737862586975098, 2.6923818588256836, 2.8109776973724365, 2.9295732975006104, 3.048168897628784, 3.166764736175537, 3.285360336303711, 3.4039559364318848, 3.5225515365600586, 3.6411471366882324, 3.7597427368164062]}, "gradients/encoder.encoder.layers.3.layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 2.0, 4.0, 2.0, 5.0, 3.0, 8.0, 4.0, 6.0, 6.0, 9.0, 11.0, 13.0, 15.0, 12.0, 21.0, 17.0, 18.0, 19.0, 22.0, 35.0, 28.0, 29.0, 39.0, 40.0, 52.0, 79.0, 70.0, 63.0, 41.0, 37.0, 31.0, 40.0, 29.0, 26.0, 27.0, 11.0, 24.0, 19.0, 12.0, 14.0, 10.0, 13.0, 11.0, 9.0, 7.0, 5.0, 5.0, 2.0, 5.0, 6.0, 0.0, 2.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7192637920379639, -0.6943606734275818, -0.6694576144218445, -0.6445544958114624, -0.6196514368057251, -0.594748318195343, -0.5698451995849609, -0.5449421405792236, -0.5200390219688416, -0.49513593316078186, -0.47023284435272217, -0.4453297257423401, -0.4204266369342804, -0.3955235481262207, -0.3706204295158386, -0.34571734070777893, -0.32081425189971924, -0.29591116309165955, -0.27100807428359985, -0.24610495567321777, -0.22120186686515808, -0.1962987780570984, -0.1713956743478775, -0.14649257063865662, -0.12158948183059692, -0.09668638557195663, -0.07178328931331635, -0.046880193054676056, -0.021977096796035767, 0.0029259994626045227, 0.027829095721244812, 0.0527321994304657, 0.07763528823852539, 0.10253838449716568, 0.12744148075580597, 0.15234458446502686, 0.17724767327308655, 0.20215076208114624, 0.22705386579036713, 0.251956969499588, 0.2768600583076477, 0.3017631471157074, 0.3266662359237671, 0.35156935453414917, 0.37647244334220886, 0.40137553215026855, 0.42627865076065063, 0.4511817395687103, 0.47608482837677, 0.5009879469871521, 0.5258910059928894, 0.5507941246032715, 0.5756971836090088, 0.6006003022193909, 0.625503420829773, 0.6504064798355103, 0.6753095984458923, 0.7002127170562744, 0.7251157760620117, 0.7500188946723938, 0.7749220132827759, 0.7998250722885132, 0.8247281908988953, 0.8496313095092773, 0.8745343685150146]}, "gradients/encoder.encoder.layers.2.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 3.0, 2.0, 4.0, 0.0, 5.0, 6.0, 5.0, 7.0, 9.0, 12.0, 21.0, 21.0, 24.0, 43.0, 54.0, 79.0, 128.0, 221.0, 362.0, 639.0, 1235.0, 2638.0, 6355.0, 19562.0, 116290.0, 2639830.0, 1316007.0, 65679.0, 15027.0, 5439.0, 2164.0, 1045.0, 557.0, 312.0, 191.0, 109.0, 80.0, 50.0, 27.0, 14.0, 13.0, 7.0, 6.0, 4.0, 2.0, 3.0, 0.0, 3.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.1939697265625, -0.18884849548339844, -0.18372726440429688, -0.1786060333251953, -0.17348480224609375, -0.1683635711669922, -0.16324234008789062, -0.15812110900878906, -0.1529998779296875, -0.14787864685058594, -0.14275741577148438, -0.1376361846923828, -0.13251495361328125, -0.1273937225341797, -0.12227249145507812, -0.11715126037597656, -0.112030029296875, -0.10690879821777344, -0.10178756713867188, -0.09666633605957031, -0.09154510498046875, -0.08642387390136719, -0.08130264282226562, -0.07618141174316406, -0.0710601806640625, -0.06593894958496094, -0.060817718505859375, -0.05569648742675781, -0.05057525634765625, -0.04545402526855469, -0.040332794189453125, -0.03521156311035156, -0.03009033203125, -0.024969100952148438, -0.019847869873046875, -0.014726638793945312, -0.00960540771484375, -0.0044841766357421875, 0.000637054443359375, 0.0057582855224609375, 0.0108795166015625, 0.016000747680664062, 0.021121978759765625, 0.026243209838867188, 0.03136444091796875, 0.03648567199707031, 0.041606903076171875, 0.04672813415527344, 0.051849365234375, 0.05697059631347656, 0.062091827392578125, 0.06721305847167969, 0.07233428955078125, 0.07745552062988281, 0.08257675170898438, 0.08769798278808594, 0.0928192138671875, 0.09794044494628906, 0.10306167602539062, 0.10818290710449219, 0.11330413818359375, 0.11842536926269531, 0.12354660034179688, 0.12866783142089844, 0.1337890625]}, "gradients/encoder.encoder.layers.2.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 3.0, 8.0, 11.0, 16.0, 18.0, 36.0, 37.0, 58.0, 59.0, 71.0, 80.0, 86.0, 92.0, 81.0, 80.0, 74.0, 52.0, 36.0, 38.0, 19.0, 15.0, 17.0, 11.0, 6.0, 3.0, 4.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.12152099609375, -0.1184988021850586, -0.11547660827636719, -0.11245441436767578, -0.10943222045898438, -0.10641002655029297, -0.10338783264160156, -0.10036563873291016, -0.09734344482421875, -0.09432125091552734, -0.09129905700683594, -0.08827686309814453, -0.08525466918945312, -0.08223247528076172, -0.07921028137207031, -0.0761880874633789, -0.0731658935546875, -0.0701436996459961, -0.06712150573730469, -0.06409931182861328, -0.061077117919921875, -0.05805492401123047, -0.05503273010253906, -0.052010536193847656, -0.04898834228515625, -0.045966148376464844, -0.04294395446777344, -0.03992176055908203, -0.036899566650390625, -0.03387737274169922, -0.030855178833007812, -0.027832984924316406, -0.024810791015625, -0.021788597106933594, -0.018766403198242188, -0.01574420928955078, -0.012722015380859375, -0.009699821472167969, -0.0066776275634765625, -0.0036554336547851562, -0.00063323974609375, 0.0023889541625976562, 0.0054111480712890625, 0.008433341979980469, 0.011455535888671875, 0.014477729797363281, 0.017499923706054688, 0.020522117614746094, 0.0235443115234375, 0.026566505432128906, 0.029588699340820312, 0.03261089324951172, 0.035633087158203125, 0.03865528106689453, 0.04167747497558594, 0.044699668884277344, 0.04772186279296875, 0.050744056701660156, 0.05376625061035156, 0.05678844451904297, 0.059810638427734375, 0.06283283233642578, 0.06585502624511719, 0.0688772201538086, 0.0718994140625]}, "gradients/encoder.encoder.layers.2.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 2.0, 1.0, 3.0, 0.0, 1.0, 3.0, 2.0, 10.0, 7.0, 5.0, 14.0, 23.0, 23.0, 54.0, 73.0, 164.0, 293.0, 740.0, 2249.0, 11637.0, 208856.0, 3915946.0, 46391.0, 5405.0, 1330.0, 539.0, 230.0, 115.0, 56.0, 43.0, 23.0, 19.0, 9.0, 5.0, 10.0, 2.0, 3.0, 5.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.470947265625, -0.4568023681640625, -0.442657470703125, -0.4285125732421875, -0.41436767578125, -0.4002227783203125, -0.386077880859375, -0.3719329833984375, -0.3577880859375, -0.3436431884765625, -0.329498291015625, -0.3153533935546875, -0.30120849609375, -0.2870635986328125, -0.272918701171875, -0.2587738037109375, -0.24462890625, -0.2304840087890625, -0.216339111328125, -0.2021942138671875, -0.18804931640625, -0.1739044189453125, -0.159759521484375, -0.1456146240234375, -0.1314697265625, -0.1173248291015625, -0.103179931640625, -0.0890350341796875, -0.07489013671875, -0.0607452392578125, -0.046600341796875, -0.0324554443359375, -0.018310546875, -0.0041656494140625, 0.009979248046875, 0.0241241455078125, 0.03826904296875, 0.0524139404296875, 0.066558837890625, 0.0807037353515625, 0.0948486328125, 0.1089935302734375, 0.123138427734375, 0.1372833251953125, 0.15142822265625, 0.1655731201171875, 0.179718017578125, 0.1938629150390625, 0.2080078125, 0.2221527099609375, 0.236297607421875, 0.2504425048828125, 0.26458740234375, 0.2787322998046875, 0.292877197265625, 0.3070220947265625, 0.3211669921875, 0.3353118896484375, 0.349456787109375, 0.3636016845703125, 0.37774658203125, 0.3918914794921875, 0.406036376953125, 0.4201812744140625, 0.434326171875]}, "gradients/encoder.encoder.layers.2.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 4.0, 2.0, 1.0, 2.0, 4.0, 1.0, 9.0, 7.0, 15.0, 9.0, 15.0, 21.0, 29.0, 40.0, 74.0, 124.0, 197.0, 460.0, 933.0, 1055.0, 511.0, 222.0, 122.0, 79.0, 37.0, 20.0, 18.0, 15.0, 13.0, 9.0, 10.0, 6.0, 7.0, 3.0, 1.0, 2.0, 2.0, 2.0, 0.0, 2.0, 3.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.2364501953125, -0.22951698303222656, -0.22258377075195312, -0.2156505584716797, -0.20871734619140625, -0.2017841339111328, -0.19485092163085938, -0.18791770935058594, -0.1809844970703125, -0.17405128479003906, -0.16711807250976562, -0.1601848602294922, -0.15325164794921875, -0.1463184356689453, -0.13938522338867188, -0.13245201110839844, -0.125518798828125, -0.11858558654785156, -0.11165237426757812, -0.10471916198730469, -0.09778594970703125, -0.09085273742675781, -0.08391952514648438, -0.07698631286621094, -0.0700531005859375, -0.06311988830566406, -0.056186676025390625, -0.04925346374511719, -0.04232025146484375, -0.03538703918457031, -0.028453826904296875, -0.021520614624023438, -0.01458740234375, -0.0076541900634765625, -0.000720977783203125, 0.0062122344970703125, 0.01314544677734375, 0.020078659057617188, 0.027011871337890625, 0.03394508361816406, 0.0408782958984375, 0.04781150817871094, 0.054744720458984375, 0.06167793273925781, 0.06861114501953125, 0.07554435729980469, 0.08247756958007812, 0.08941078186035156, 0.096343994140625, 0.10327720642089844, 0.11021041870117188, 0.11714363098144531, 0.12407684326171875, 0.1310100555419922, 0.13794326782226562, 0.14487648010253906, 0.1518096923828125, 0.15874290466308594, 0.16567611694335938, 0.1726093292236328, 0.17954254150390625, 0.1864757537841797, 0.19340896606445312, 0.20034217834472656, 0.207275390625]}, "gradients/encoder.encoder.layers.2.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 3.0, 2.0, 5.0, 7.0, 6.0, 7.0, 13.0, 29.0, 39.0, 50.0, 83.0, 130.0, 154.0, 146.0, 120.0, 83.0, 58.0, 34.0, 17.0, 10.0, 7.0, 6.0, 3.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-2.565342903137207, -2.510693311691284, -2.4560437202453613, -2.4013938903808594, -2.3467442989349365, -2.2920947074890137, -2.237445116043091, -2.182795286178589, -2.128145694732666, -2.073496103286743, -2.0188465118408203, -1.964196801185608, -1.9095470905303955, -1.8548974990844727, -1.8002477884292603, -1.7455981969833374, -1.690948486328125, -1.6362988948822021, -1.5816491842269897, -1.526999592781067, -1.4723498821258545, -1.4177002906799316, -1.3630505800247192, -1.3084009885787964, -1.2537513971328735, -1.1991018056869507, -1.1444520950317383, -1.0898025035858154, -1.035152792930603, -0.9805032014846802, -0.9258534908294678, -0.8712038993835449, -0.8165541887283325, -0.7619045376777649, -0.7072548866271973, -0.6526052355766296, -0.597955584526062, -0.5433059930801392, -0.48865631222724915, -0.4340066611766815, -0.3793570101261139, -0.32470735907554626, -0.27005770802497864, -0.2154080718755722, -0.16075842082500458, -0.10610878467559814, -0.05145913362503052, 0.0031905174255371094, 0.057840168476104736, 0.11248981952667236, 0.16713947057724, 0.22178910672664642, 0.27643877267837524, 0.3310883939266205, 0.3857380449771881, 0.44038769602775574, 0.49503734707832336, 0.5496869683265686, 0.6043366193771362, 0.6589862704277039, 0.7136359214782715, 0.7682855725288391, 0.8229352235794067, 0.8775848746299744, 0.932234525680542]}, "gradients/encoder.encoder.layers.2.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 4.0, 7.0, 1.0, 4.0, 6.0, 5.0, 15.0, 10.0, 20.0, 22.0, 21.0, 16.0, 21.0, 25.0, 21.0, 33.0, 41.0, 28.0, 43.0, 31.0, 47.0, 43.0, 44.0, 53.0, 45.0, 46.0, 52.0, 40.0, 38.0, 35.0, 30.0, 28.0, 14.0, 14.0, 25.0, 13.0, 12.0, 13.0, 9.0, 9.0, 8.0, 7.0, 5.0, 1.0, 0.0, 3.0, 3.0, 2.0, 1.0, 3.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.7177403569221497, -0.6949856877326965, -0.6722310185432434, -0.6494764089584351, -0.6267217397689819, -0.6039670705795288, -0.5812124013900757, -0.5584577322006226, -0.5357030630111694, -0.5129483938217163, -0.49019375443458557, -0.46743908524513245, -0.4446844458580017, -0.4219297766685486, -0.39917510747909546, -0.37642043828964233, -0.353665828704834, -0.33091115951538086, -0.3081565201282501, -0.285401850938797, -0.26264721155166626, -0.23989254236221313, -0.21713787317276, -0.19438321888446808, -0.17162856459617615, -0.14887391030788422, -0.12611925601959229, -0.10336458683013916, -0.08060993254184723, -0.0578552782535553, -0.03510060906410217, -0.012345954775810242, 0.01040869951248169, 0.03316335752606392, 0.05591801553964615, 0.07867267727851868, 0.10142733156681061, 0.12418198585510254, 0.14693665504455566, 0.1696913093328476, 0.19244596362113953, 0.21520061790943146, 0.2379552721977234, 0.2607099413871765, 0.28346461057662964, 0.3062192499637604, 0.3289739191532135, 0.35172855854034424, 0.37448322772979736, 0.3972378969192505, 0.4199925363063812, 0.44274720549583435, 0.4655018448829651, 0.4882565140724182, 0.5110111832618713, 0.5337658524513245, 0.5565204620361328, 0.5792751312255859, 0.6020298004150391, 0.6247844696044922, 0.6475390791893005, 0.6702937483787537, 0.6930484175682068, 0.7158030867576599, 0.738557755947113]}, "gradients/encoder.encoder.layers.2.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 2.0, 1.0, 3.0, 3.0, 3.0, 8.0, 7.0, 12.0, 7.0, 27.0, 37.0, 77.0, 103.0, 238.0, 463.0, 1003.0, 2075.0, 4787.0, 11943.0, 31492.0, 90088.0, 258398.0, 385227.0, 169128.0, 58237.0, 20789.0, 8010.0, 3435.0, 1553.0, 646.0, 339.0, 188.0, 106.0, 50.0, 36.0, 18.0, 14.0, 5.0, 1.0, 4.0, 2.0, 2.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.17333984375, -0.1690378189086914, -0.1647357940673828, -0.16043376922607422, -0.15613174438476562, -0.15182971954345703, -0.14752769470214844, -0.14322566986083984, -0.13892364501953125, -0.13462162017822266, -0.13031959533691406, -0.12601757049560547, -0.12171554565429688, -0.11741352081298828, -0.11311149597167969, -0.1088094711303711, -0.1045074462890625, -0.1002054214477539, -0.09590339660644531, -0.09160137176513672, -0.08729934692382812, -0.08299732208251953, -0.07869529724121094, -0.07439327239990234, -0.07009124755859375, -0.06578922271728516, -0.06148719787597656, -0.05718517303466797, -0.052883148193359375, -0.04858112335205078, -0.04427909851074219, -0.039977073669433594, -0.035675048828125, -0.031373023986816406, -0.027070999145507812, -0.02276897430419922, -0.018466949462890625, -0.014164924621582031, -0.009862899780273438, -0.005560874938964844, -0.00125885009765625, 0.0030431747436523438, 0.0073451995849609375, 0.011647224426269531, 0.015949249267578125, 0.02025127410888672, 0.024553298950195312, 0.028855323791503906, 0.0331573486328125, 0.037459373474121094, 0.04176139831542969, 0.04606342315673828, 0.050365447998046875, 0.05466747283935547, 0.05896949768066406, 0.06327152252197266, 0.06757354736328125, 0.07187557220458984, 0.07617759704589844, 0.08047962188720703, 0.08478164672851562, 0.08908367156982422, 0.09338569641113281, 0.0976877212524414, 0.10198974609375]}, "gradients/encoder.encoder.layers.2.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 0.0, 5.0, 1.0, 4.0, 7.0, 11.0, 9.0, 10.0, 19.0, 24.0, 29.0, 38.0, 42.0, 50.0, 47.0, 65.0, 66.0, 68.0, 84.0, 77.0, 65.0, 41.0, 48.0, 49.0, 34.0, 30.0, 29.0, 19.0, 7.0, 16.0, 8.0, 7.0, 2.0, 3.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1231689453125, -0.120086669921875, -0.11700439453125, -0.113922119140625, -0.11083984375, -0.107757568359375, -0.10467529296875, -0.101593017578125, -0.0985107421875, -0.095428466796875, -0.09234619140625, -0.089263916015625, -0.086181640625, -0.083099365234375, -0.08001708984375, -0.076934814453125, -0.0738525390625, -0.070770263671875, -0.06768798828125, -0.064605712890625, -0.0615234375, -0.058441162109375, -0.05535888671875, -0.052276611328125, -0.0491943359375, -0.046112060546875, -0.04302978515625, -0.039947509765625, -0.036865234375, -0.033782958984375, -0.03070068359375, -0.027618408203125, -0.0245361328125, -0.021453857421875, -0.01837158203125, -0.015289306640625, -0.01220703125, -0.009124755859375, -0.00604248046875, -0.002960205078125, 0.0001220703125, 0.003204345703125, 0.00628662109375, 0.009368896484375, 0.012451171875, 0.015533447265625, 0.01861572265625, 0.021697998046875, 0.0247802734375, 0.027862548828125, 0.03094482421875, 0.034027099609375, 0.037109375, 0.040191650390625, 0.04327392578125, 0.046356201171875, 0.0494384765625, 0.052520751953125, 0.05560302734375, 0.058685302734375, 0.061767578125, 0.064849853515625, 0.06793212890625, 0.071014404296875, 0.0740966796875]}, "gradients/encoder.encoder.layers.2.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 1.0, 2.0, 2.0, 6.0, 6.0, 7.0, 5.0, 9.0, 16.0, 24.0, 26.0, 22.0, 63.0, 89.0, 125.0, 170.0, 278.0, 468.0, 911.0, 2481.0, 8905.0, 49185.0, 602038.0, 332079.0, 39727.0, 7695.0, 2168.0, 901.0, 443.0, 236.0, 123.0, 103.0, 69.0, 51.0, 36.0, 24.0, 15.0, 14.0, 9.0, 10.0, 3.0, 4.0, 4.0, 3.0, 2.0, 4.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.283203125, -0.2740516662597656, -0.26490020751953125, -0.2557487487792969, -0.2465972900390625, -0.23744583129882812, -0.22829437255859375, -0.21914291381835938, -0.209991455078125, -0.20083999633789062, -0.19168853759765625, -0.18253707885742188, -0.1733856201171875, -0.16423416137695312, -0.15508270263671875, -0.14593124389648438, -0.13677978515625, -0.12762832641601562, -0.11847686767578125, -0.10932540893554688, -0.1001739501953125, -0.09102249145507812, -0.08187103271484375, -0.07271957397460938, -0.063568115234375, -0.054416656494140625, -0.04526519775390625, -0.036113739013671875, -0.0269622802734375, -0.017810821533203125, -0.00865936279296875, 0.000492095947265625, 0.0096435546875, 0.018795013427734375, 0.02794647216796875, 0.037097930908203125, 0.0462493896484375, 0.055400848388671875, 0.06455230712890625, 0.07370376586914062, 0.082855224609375, 0.09200668334960938, 0.10115814208984375, 0.11030960083007812, 0.1194610595703125, 0.12861251831054688, 0.13776397705078125, 0.14691543579101562, 0.15606689453125, 0.16521835327148438, 0.17436981201171875, 0.18352127075195312, 0.1926727294921875, 0.20182418823242188, 0.21097564697265625, 0.22012710571289062, 0.229278564453125, 0.23843002319335938, 0.24758148193359375, 0.2567329406738281, 0.2658843994140625, 0.2750358581542969, 0.28418731689453125, 0.2933387756347656, 0.302490234375]}, "gradients/encoder.encoder.layers.2.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 3.0, 3.0, 4.0, 3.0, 2.0, 5.0, 4.0, 9.0, 8.0, 17.0, 13.0, 10.0, 16.0, 17.0, 20.0, 17.0, 23.0, 37.0, 35.0, 46.0, 60.0, 52.0, 58.0, 49.0, 63.0, 48.0, 56.0, 58.0, 50.0, 29.0, 32.0, 25.0, 28.0, 24.0, 19.0, 17.0, 16.0, 9.0, 5.0, 5.0, 3.0, 9.0, 1.0, 1.0, 2.0, 2.0, 2.0, 3.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.271240234375, -0.2615699768066406, -0.25189971923828125, -0.24222946166992188, -0.2325592041015625, -0.22288894653320312, -0.21321868896484375, -0.20354843139648438, -0.193878173828125, -0.18420791625976562, -0.17453765869140625, -0.16486740112304688, -0.1551971435546875, -0.14552688598632812, -0.13585662841796875, -0.12618637084960938, -0.11651611328125, -0.10684585571289062, -0.09717559814453125, -0.08750534057617188, -0.0778350830078125, -0.06816482543945312, -0.05849456787109375, -0.048824310302734375, -0.039154052734375, -0.029483795166015625, -0.01981353759765625, -0.010143280029296875, -0.0004730224609375, 0.009197235107421875, 0.01886749267578125, 0.028537750244140625, 0.0382080078125, 0.047878265380859375, 0.05754852294921875, 0.06721878051757812, 0.0768890380859375, 0.08655929565429688, 0.09622955322265625, 0.10589981079101562, 0.115570068359375, 0.12524032592773438, 0.13491058349609375, 0.14458084106445312, 0.1542510986328125, 0.16392135620117188, 0.17359161376953125, 0.18326187133789062, 0.19293212890625, 0.20260238647460938, 0.21227264404296875, 0.22194290161132812, 0.2316131591796875, 0.24128341674804688, 0.25095367431640625, 0.2606239318847656, 0.270294189453125, 0.2799644470214844, 0.28963470458984375, 0.2993049621582031, 0.3089752197265625, 0.3186454772949219, 0.32831573486328125, 0.3379859924316406, 0.34765625]}, "gradients/encoder.encoder.layers.2.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 3.0, 1.0, 1.0, 1.0, 3.0, 8.0, 7.0, 9.0, 9.0, 14.0, 19.0, 25.0, 45.0, 55.0, 91.0, 154.0, 240.0, 455.0, 891.0, 1897.0, 4557.0, 13212.0, 53896.0, 669625.0, 249327.0, 36891.0, 10185.0, 3659.0, 1526.0, 761.0, 408.0, 218.0, 127.0, 67.0, 52.0, 32.0, 35.0, 11.0, 14.0, 9.0, 5.0, 6.0, 5.0, 5.0, 1.0, 2.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-0.1451416015625, -0.14058876037597656, -0.13603591918945312, -0.1314830780029297, -0.12693023681640625, -0.12237739562988281, -0.11782455444335938, -0.11327171325683594, -0.1087188720703125, -0.10416603088378906, -0.09961318969726562, -0.09506034851074219, -0.09050750732421875, -0.08595466613769531, -0.08140182495117188, -0.07684898376464844, -0.072296142578125, -0.06774330139160156, -0.06319046020507812, -0.05863761901855469, -0.05408477783203125, -0.04953193664550781, -0.044979095458984375, -0.04042625427246094, -0.0358734130859375, -0.03132057189941406, -0.026767730712890625, -0.022214889526367188, -0.01766204833984375, -0.013109207153320312, -0.008556365966796875, -0.0040035247802734375, 0.00054931640625, 0.0051021575927734375, 0.009654998779296875, 0.014207839965820312, 0.01876068115234375, 0.023313522338867188, 0.027866363525390625, 0.03241920471191406, 0.0369720458984375, 0.04152488708496094, 0.046077728271484375, 0.05063056945800781, 0.05518341064453125, 0.05973625183105469, 0.06428909301757812, 0.06884193420410156, 0.073394775390625, 0.07794761657714844, 0.08250045776367188, 0.08705329895019531, 0.09160614013671875, 0.09615898132324219, 0.10071182250976562, 0.10526466369628906, 0.1098175048828125, 0.11437034606933594, 0.11892318725585938, 0.12347602844238281, 0.12802886962890625, 0.1325817108154297, 0.13713455200195312, 0.14168739318847656, 0.146240234375]}, "gradients/encoder.encoder.layers.2.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 4.0, 4.0, 1.0, 7.0, 2.0, 4.0, 13.0, 4.0, 6.0, 12.0, 13.0, 20.0, 13.0, 21.0, 24.0, 38.0, 66.0, 79.0, 83.0, 122.0, 126.0, 77.0, 65.0, 49.0, 28.0, 19.0, 19.0, 13.0, 23.0, 6.0, 5.0, 10.0, 7.0, 4.0, 3.0, 6.0, 2.0, 1.0, 2.0, 8.0, 1.0, 0.0, 0.0, 1.0, 4.0, 0.0, 1.0], "bins": [-4.172325134277344e-05, -4.057958722114563e-05, -3.943592309951782e-05, -3.8292258977890015e-05, -3.714859485626221e-05, -3.60049307346344e-05, -3.486126661300659e-05, -3.3717602491378784e-05, -3.2573938369750977e-05, -3.143027424812317e-05, -3.028661012649536e-05, -2.9142946004867554e-05, -2.7999281883239746e-05, -2.685561776161194e-05, -2.571195363998413e-05, -2.4568289518356323e-05, -2.3424625396728516e-05, -2.2280961275100708e-05, -2.11372971534729e-05, -1.9993633031845093e-05, -1.8849968910217285e-05, -1.7706304788589478e-05, -1.656264066696167e-05, -1.5418976545333862e-05, -1.4275312423706055e-05, -1.3131648302078247e-05, -1.198798418045044e-05, -1.0844320058822632e-05, -9.700655937194824e-06, -8.556991815567017e-06, -7.413327693939209e-06, -6.269663572311401e-06, -5.125999450683594e-06, -3.982335329055786e-06, -2.8386712074279785e-06, -1.695007085800171e-06, -5.513429641723633e-07, 5.923211574554443e-07, 1.735985279083252e-06, 2.8796494007110596e-06, 4.023313522338867e-06, 5.166977643966675e-06, 6.310641765594482e-06, 7.45430588722229e-06, 8.597970008850098e-06, 9.741634130477905e-06, 1.0885298252105713e-05, 1.202896237373352e-05, 1.3172626495361328e-05, 1.4316290616989136e-05, 1.5459954738616943e-05, 1.660361886024475e-05, 1.774728298187256e-05, 1.8890947103500366e-05, 2.0034611225128174e-05, 2.117827534675598e-05, 2.232193946838379e-05, 2.3465603590011597e-05, 2.4609267711639404e-05, 2.5752931833267212e-05, 2.689659595489502e-05, 2.8040260076522827e-05, 2.9183924198150635e-05, 3.0327588319778442e-05, 3.147125244140625e-05]}, "gradients/encoder.encoder.layers.2.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 2.0, 4.0, 6.0, 3.0, 14.0, 13.0, 18.0, 23.0, 46.0, 65.0, 103.0, 186.0, 332.0, 677.0, 1588.0, 4419.0, 17706.0, 109097.0, 803240.0, 88828.0, 15264.0, 4037.0, 1425.0, 673.0, 315.0, 208.0, 84.0, 70.0, 34.0, 24.0, 15.0, 8.0, 6.0, 6.0, 8.0, 1.0, 1.0, 5.0, 1.0, 1.0, 3.0, 2.0, 1.0, 1.0], "bins": [-0.2030029296875, -0.197723388671875, -0.19244384765625, -0.187164306640625, -0.181884765625, -0.176605224609375, -0.17132568359375, -0.166046142578125, -0.1607666015625, -0.155487060546875, -0.15020751953125, -0.144927978515625, -0.1396484375, -0.134368896484375, -0.12908935546875, -0.123809814453125, -0.1185302734375, -0.113250732421875, -0.10797119140625, -0.102691650390625, -0.097412109375, -0.092132568359375, -0.08685302734375, -0.081573486328125, -0.0762939453125, -0.071014404296875, -0.06573486328125, -0.060455322265625, -0.05517578125, -0.049896240234375, -0.04461669921875, -0.039337158203125, -0.0340576171875, -0.028778076171875, -0.02349853515625, -0.018218994140625, -0.012939453125, -0.007659912109375, -0.00238037109375, 0.002899169921875, 0.0081787109375, 0.013458251953125, 0.01873779296875, 0.024017333984375, 0.029296875, 0.034576416015625, 0.03985595703125, 0.045135498046875, 0.0504150390625, 0.055694580078125, 0.06097412109375, 0.066253662109375, 0.071533203125, 0.076812744140625, 0.08209228515625, 0.087371826171875, 0.0926513671875, 0.097930908203125, 0.10321044921875, 0.108489990234375, 0.11376953125, 0.119049072265625, 0.12432861328125, 0.129608154296875, 0.1348876953125]}, "gradients/encoder.encoder.layers.2.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 2.0, 0.0, 3.0, 1.0, 1.0, 1.0, 0.0, 2.0, 2.0, 1.0, 5.0, 6.0, 7.0, 7.0, 14.0, 20.0, 19.0, 27.0, 43.0, 47.0, 82.0, 83.0, 107.0, 116.0, 102.0, 77.0, 43.0, 50.0, 31.0, 26.0, 23.0, 20.0, 9.0, 6.0, 6.0, 7.0, 6.0, 2.0, 7.0, 3.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.112548828125, -0.1095266342163086, -0.10650444030761719, -0.10348224639892578, -0.10046005249023438, -0.09743785858154297, -0.09441566467285156, -0.09139347076416016, -0.08837127685546875, -0.08534908294677734, -0.08232688903808594, -0.07930469512939453, -0.07628250122070312, -0.07326030731201172, -0.07023811340332031, -0.0672159194946289, -0.0641937255859375, -0.061171531677246094, -0.05814933776855469, -0.05512714385986328, -0.052104949951171875, -0.04908275604248047, -0.04606056213378906, -0.043038368225097656, -0.04001617431640625, -0.036993980407714844, -0.03397178649902344, -0.03094959259033203, -0.027927398681640625, -0.02490520477294922, -0.021883010864257812, -0.018860816955566406, -0.015838623046875, -0.012816429138183594, -0.009794235229492188, -0.006772041320800781, -0.003749847412109375, -0.0007276535034179688, 0.0022945404052734375, 0.005316734313964844, 0.00833892822265625, 0.011361122131347656, 0.014383316040039062, 0.01740550994873047, 0.020427703857421875, 0.02344989776611328, 0.026472091674804688, 0.029494285583496094, 0.0325164794921875, 0.035538673400878906, 0.03856086730957031, 0.04158306121826172, 0.044605255126953125, 0.04762744903564453, 0.05064964294433594, 0.053671836853027344, 0.05669403076171875, 0.059716224670410156, 0.06273841857910156, 0.06576061248779297, 0.06878280639648438, 0.07180500030517578, 0.07482719421386719, 0.0778493881225586, 0.08087158203125]}, "gradients/encoder.encoder.layers.2.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 1.0, 6.0, 6.0, 13.0, 20.0, 37.0, 79.0, 120.0, 160.0, 314.0, 115.0, 67.0, 29.0, 20.0, 10.0, 4.0, 4.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.135388135910034, -3.053584337234497, -2.971780300140381, -2.8899765014648438, -2.8081727027893066, -2.7263689041137695, -2.6445648670196533, -2.562761068344116, -2.48095703125, -2.399153232574463, -2.3173491954803467, -2.2355453968048096, -2.1537415981292725, -2.0719375610351562, -1.9901337623596191, -1.908329963684082, -1.826526165008545, -1.7447222471237183, -1.6629184484481812, -1.5811145305633545, -1.4993107318878174, -1.4175068140029907, -1.335702896118164, -1.253899097442627, -1.1720951795578003, -1.0902912616729736, -1.0084874629974365, -0.9266835451126099, -0.844879686832428, -0.7630758285522461, -0.6812719106674194, -0.5994680523872375, -0.5176639556884766, -0.4358600974082947, -0.3540562093257904, -0.27225232124328613, -0.19044846296310425, -0.10864460468292236, -0.02684071660041809, 0.05496317148208618, 0.13676702976226807, 0.21857090294361115, 0.3003747761249542, 0.3821786642074585, 0.4639825224876404, 0.5457863807678223, 0.6275902986526489, 0.7093941569328308, 0.7911980152130127, 0.8730018734931946, 0.9548057317733765, 1.0366096496582031, 1.1184134483337402, 1.200217366218567, 1.2820212841033936, 1.3638250827789307, 1.4456290006637573, 1.527432918548584, 1.609236717224121, 1.6910406351089478, 1.7728445529937744, 1.8546483516693115, 1.9364522695541382, 2.018256187438965, 2.100059986114502]}, "gradients/encoder.encoder.layers.2.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 0.0, 2.0, 1.0, 0.0, 0.0, 3.0, 9.0, 6.0, 5.0, 9.0, 8.0, 9.0, 18.0, 14.0, 18.0, 16.0, 27.0, 26.0, 17.0, 19.0, 25.0, 21.0, 44.0, 20.0, 60.0, 57.0, 93.0, 93.0, 62.0, 40.0, 29.0, 17.0, 28.0, 32.0, 22.0, 22.0, 18.0, 16.0, 22.0, 11.0, 17.0, 8.0, 12.0, 12.0, 6.0, 3.0, 2.0, 3.0, 1.0, 5.0, 1.0, 2.0, 2.0, 1.0, 0.0, 1.0, 2.0, 1.0], "bins": [-1.1448421478271484, -1.1100022792816162, -1.075162410736084, -1.0403225421905518, -1.0054826736450195, -0.9706428050994873, -0.9358029365539551, -0.9009630680084229, -0.8661231994628906, -0.8312833309173584, -0.7964434623718262, -0.761603593826294, -0.7267637252807617, -0.6919238567352295, -0.6570839881896973, -0.622244119644165, -0.5874043107032776, -0.5525644421577454, -0.5177245736122131, -0.4828847050666809, -0.4480448365211487, -0.41320496797561646, -0.3783651292324066, -0.3435252606868744, -0.30868539214134216, -0.27384552359580994, -0.2390056550502777, -0.20416580140590668, -0.16932593286037445, -0.13448606431484222, -0.09964621067047119, -0.06480634212493896, -0.02996647357940674, 0.00487339124083519, 0.03971325606107712, 0.07455311715602875, 0.10939298570156097, 0.1442328542470932, 0.17907270789146423, 0.21391257643699646, 0.2487524449825287, 0.2835923135280609, 0.31843218207359314, 0.353272020816803, 0.3881118893623352, 0.42295175790786743, 0.45779162645339966, 0.4926314949989319, 0.5274713635444641, 0.5623112320899963, 0.5971511006355286, 0.6319909691810608, 0.666830837726593, 0.7016707062721252, 0.7365105152130127, 0.7713503837585449, 0.8061902523040771, 0.8410301208496094, 0.8758699893951416, 0.9107098579406738, 0.945549726486206, 0.9803895950317383, 1.0152294635772705, 1.0500693321228027, 1.084909200668335]}, "gradients/encoder.encoder.layers.1.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 4.0, 4.0, 6.0, 2.0, 9.0, 21.0, 15.0, 29.0, 26.0, 38.0, 61.0, 94.0, 149.0, 236.0, 437.0, 735.0, 1513.0, 3550.0, 10771.0, 59271.0, 2318776.0, 1729793.0, 52376.0, 10053.0, 3214.0, 1392.0, 683.0, 388.0, 238.0, 135.0, 88.0, 60.0, 42.0, 22.0, 22.0, 10.0, 11.0, 5.0, 3.0, 4.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.258056640625, -0.250457763671875, -0.24285888671875, -0.235260009765625, -0.2276611328125, -0.220062255859375, -0.21246337890625, -0.204864501953125, -0.197265625, -0.189666748046875, -0.18206787109375, -0.174468994140625, -0.1668701171875, -0.159271240234375, -0.15167236328125, -0.144073486328125, -0.136474609375, -0.128875732421875, -0.12127685546875, -0.113677978515625, -0.1060791015625, -0.098480224609375, -0.09088134765625, -0.083282470703125, -0.07568359375, -0.068084716796875, -0.06048583984375, -0.052886962890625, -0.0452880859375, -0.037689208984375, -0.03009033203125, -0.022491455078125, -0.014892578125, -0.007293701171875, 0.00030517578125, 0.007904052734375, 0.0155029296875, 0.023101806640625, 0.03070068359375, 0.038299560546875, 0.0458984375, 0.053497314453125, 0.06109619140625, 0.068695068359375, 0.0762939453125, 0.083892822265625, 0.09149169921875, 0.099090576171875, 0.106689453125, 0.114288330078125, 0.12188720703125, 0.129486083984375, 0.1370849609375, 0.144683837890625, 0.15228271484375, 0.159881591796875, 0.16748046875, 0.175079345703125, 0.18267822265625, 0.190277099609375, 0.1978759765625, 0.205474853515625, 0.21307373046875, 0.220672607421875, 0.228271484375]}, "gradients/encoder.encoder.layers.1.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 6.0, 6.0, 5.0, 6.0, 9.0, 6.0, 20.0, 16.0, 18.0, 29.0, 31.0, 40.0, 40.0, 40.0, 44.0, 49.0, 59.0, 59.0, 57.0, 64.0, 57.0, 56.0, 45.0, 35.0, 37.0, 32.0, 28.0, 24.0, 25.0, 19.0, 14.0, 15.0, 6.0, 1.0, 3.0, 8.0, 2.0, 3.0, 0.0, 2.0, 0.0, 1.0], "bins": [-0.09503173828125, -0.09267139434814453, -0.09031105041503906, -0.0879507064819336, -0.08559036254882812, -0.08323001861572266, -0.08086967468261719, -0.07850933074951172, -0.07614898681640625, -0.07378864288330078, -0.07142829895019531, -0.06906795501708984, -0.06670761108398438, -0.0643472671508789, -0.06198692321777344, -0.05962657928466797, -0.0572662353515625, -0.05490589141845703, -0.05254554748535156, -0.050185203552246094, -0.047824859619140625, -0.045464515686035156, -0.04310417175292969, -0.04074382781982422, -0.03838348388671875, -0.03602313995361328, -0.03366279602050781, -0.031302452087402344, -0.028942108154296875, -0.026581764221191406, -0.024221420288085938, -0.02186107635498047, -0.019500732421875, -0.01714038848876953, -0.014780044555664062, -0.012419700622558594, -0.010059356689453125, -0.007699012756347656, -0.0053386688232421875, -0.0029783248901367188, -0.00061798095703125, 0.0017423629760742188, 0.0041027069091796875, 0.006463050842285156, 0.008823394775390625, 0.011183738708496094, 0.013544082641601562, 0.01590442657470703, 0.0182647705078125, 0.02062511444091797, 0.022985458374023438, 0.025345802307128906, 0.027706146240234375, 0.030066490173339844, 0.03242683410644531, 0.03478717803955078, 0.03714752197265625, 0.03950786590576172, 0.04186820983886719, 0.044228553771972656, 0.046588897705078125, 0.048949241638183594, 0.05130958557128906, 0.05366992950439453, 0.0560302734375]}, "gradients/encoder.encoder.layers.1.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 4.0, 7.0, 9.0, 13.0, 38.0, 77.0, 187.0, 465.0, 1770.0, 51271.0, 4125518.0, 13250.0, 1131.0, 345.0, 115.0, 41.0, 24.0, 11.0, 9.0, 4.0, 3.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-1.037109375, -1.0040283203125, -0.970947265625, -0.9378662109375, -0.90478515625, -0.8717041015625, -0.838623046875, -0.8055419921875, -0.7724609375, -0.7393798828125, -0.706298828125, -0.6732177734375, -0.64013671875, -0.6070556640625, -0.573974609375, -0.5408935546875, -0.5078125, -0.4747314453125, -0.441650390625, -0.4085693359375, -0.37548828125, -0.3424072265625, -0.309326171875, -0.2762451171875, -0.2431640625, -0.2100830078125, -0.177001953125, -0.1439208984375, -0.11083984375, -0.0777587890625, -0.044677734375, -0.0115966796875, 0.021484375, 0.0545654296875, 0.087646484375, 0.1207275390625, 0.15380859375, 0.1868896484375, 0.219970703125, 0.2530517578125, 0.2861328125, 0.3192138671875, 0.352294921875, 0.3853759765625, 0.41845703125, 0.4515380859375, 0.484619140625, 0.5177001953125, 0.55078125, 0.5838623046875, 0.616943359375, 0.6500244140625, 0.68310546875, 0.7161865234375, 0.749267578125, 0.7823486328125, 0.8154296875, 0.8485107421875, 0.881591796875, 0.9146728515625, 0.94775390625, 0.9808349609375, 1.013916015625, 1.0469970703125, 1.080078125]}, "gradients/encoder.encoder.layers.1.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 2.0, 5.0, 7.0, 7.0, 11.0, 25.0, 29.0, 86.0, 157.0, 556.0, 1798.0, 941.0, 239.0, 99.0, 55.0, 24.0, 15.0, 10.0, 7.0, 4.0, 4.0, 3.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.68359375, -0.6675682067871094, -0.6515426635742188, -0.6355171203613281, -0.6194915771484375, -0.6034660339355469, -0.5874404907226562, -0.5714149475097656, -0.555389404296875, -0.5393638610839844, -0.5233383178710938, -0.5073127746582031, -0.4912872314453125, -0.4752616882324219, -0.45923614501953125, -0.4432106018066406, -0.42718505859375, -0.4111595153808594, -0.39513397216796875, -0.3791084289550781, -0.3630828857421875, -0.3470573425292969, -0.33103179931640625, -0.3150062561035156, -0.298980712890625, -0.2829551696777344, -0.26692962646484375, -0.2509040832519531, -0.2348785400390625, -0.21885299682617188, -0.20282745361328125, -0.18680191040039062, -0.1707763671875, -0.15475082397460938, -0.13872528076171875, -0.12269973754882812, -0.1066741943359375, -0.09064865112304688, -0.07462310791015625, -0.058597564697265625, -0.042572021484375, -0.026546478271484375, -0.01052093505859375, 0.005504608154296875, 0.0215301513671875, 0.037555694580078125, 0.05358123779296875, 0.06960678100585938, 0.08563232421875, 0.10165786743164062, 0.11768341064453125, 0.13370895385742188, 0.1497344970703125, 0.16576004028320312, 0.18178558349609375, 0.19781112670898438, 0.213836669921875, 0.22986221313476562, 0.24588775634765625, 0.2619132995605469, 0.2779388427734375, 0.2939643859863281, 0.30998992919921875, 0.3260154724121094, 0.342041015625]}, "gradients/encoder.encoder.layers.1.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 3.0, 6.0, 8.0, 12.0, 22.0, 28.0, 43.0, 97.0, 145.0, 165.0, 173.0, 115.0, 83.0, 58.0, 21.0, 15.0, 3.0, 7.0, 4.0, 1.0, 0.0, 3.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.0948410034179688, -3.0188400745391846, -2.9428391456604004, -2.8668384552001953, -2.790837526321411, -2.714836597442627, -2.6388356685638428, -2.5628347396850586, -2.4868338108062744, -2.4108328819274902, -2.334831953048706, -2.258831024169922, -2.182830333709717, -2.1068294048309326, -2.0308284759521484, -1.9548275470733643, -1.8788267374038696, -1.8028258085250854, -1.7268249988555908, -1.6508240699768066, -1.5748231410980225, -1.4988222122192383, -1.4228214025497437, -1.3468204736709595, -1.2708196640014648, -1.1948187351226807, -1.118817925453186, -1.0428169965744019, -0.9668160676956177, -0.8908151984214783, -0.8148143291473389, -0.7388134002685547, -0.6628124713897705, -0.5868116021156311, -0.5108106732368469, -0.4348098039627075, -0.3588089048862457, -0.28280800580978394, -0.20680713653564453, -0.13080620765686035, -0.05480533838272095, 0.021195553243160248, 0.09719644486904144, 0.17319732904434204, 0.24919822812080383, 0.3251991271972656, 0.40119999647140503, 0.4772009253501892, 0.5532017946243286, 0.629202663898468, 0.7052035927772522, 0.7812044620513916, 0.8572053909301758, 0.9332062602043152, 1.0092071294784546, 1.0852080583572388, 1.1612088680267334, 1.2372097969055176, 1.3132106065750122, 1.3892115354537964, 1.4652124643325806, 1.5412132740020752, 1.6172142028808594, 1.6932151317596436, 1.7692160606384277]}, "gradients/encoder.encoder.layers.1.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 3.0, 1.0, 3.0, 2.0, 7.0, 4.0, 4.0, 4.0, 5.0, 16.0, 8.0, 11.0, 9.0, 15.0, 13.0, 21.0, 18.0, 26.0, 26.0, 28.0, 28.0, 41.0, 36.0, 53.0, 52.0, 66.0, 61.0, 49.0, 42.0, 49.0, 37.0, 34.0, 35.0, 35.0, 29.0, 18.0, 21.0, 14.0, 14.0, 18.0, 8.0, 9.0, 9.0, 5.0, 6.0, 4.0, 5.0, 4.0, 1.0, 5.0, 2.0, 1.0, 1.0, 0.0, 3.0, 0.0, 0.0, 1.0], "bins": [-1.1733834743499756, -1.1367570161819458, -1.1001306772232056, -1.0635042190551758, -1.026877760887146, -0.990251362323761, -0.953624963760376, -0.9169985055923462, -0.8803721070289612, -0.8437457084655762, -0.8071192502975464, -0.7704928517341614, -0.7338664531707764, -0.6972399950027466, -0.6606135964393616, -0.6239871978759766, -0.5873607397079468, -0.5507343411445618, -0.514107882976532, -0.477481484413147, -0.4408550560474396, -0.4042286276817322, -0.36760222911834717, -0.33097580075263977, -0.2943493723869324, -0.257722944021225, -0.22109653055667877, -0.18447011709213257, -0.14784368872642517, -0.11121726036071777, -0.07459084689617157, -0.037964433431625366, -0.0013381242752075195, 0.03528829663991928, 0.07191471755504608, 0.10854113847017288, 0.14516755938529968, 0.18179398775100708, 0.21842040121555328, 0.2550468146800995, 0.2916732430458069, 0.3282996714115143, 0.3649260997772217, 0.4015524983406067, 0.4381789267063141, 0.4748053550720215, 0.5114317536354065, 0.5480581521987915, 0.5846846103668213, 0.6213110089302063, 0.6579374670982361, 0.6945638656616211, 0.7311903238296509, 0.7678167223930359, 0.8044431209564209, 0.8410695791244507, 0.8776959776878357, 0.9143223762512207, 0.9509488344192505, 0.9875752329826355, 1.0242016315460205, 1.0608280897140503, 1.09745454788208, 1.1340808868408203, 1.17070734500885]}, "gradients/encoder.encoder.layers.1.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 5.0, 2.0, 3.0, 1.0, 5.0, 11.0, 12.0, 8.0, 21.0, 25.0, 34.0, 50.0, 71.0, 103.0, 163.0, 239.0, 330.0, 486.0, 723.0, 1115.0, 1638.0, 2706.0, 4423.0, 7191.0, 12871.0, 22976.0, 44493.0, 92860.0, 199845.0, 296333.0, 182513.0, 84834.0, 41029.0, 21182.0, 11895.0, 7030.0, 4057.0, 2494.0, 1659.0, 1039.0, 670.0, 441.0, 317.0, 213.0, 156.0, 94.0, 45.0, 49.0, 37.0, 19.0, 17.0, 12.0, 6.0, 12.0, 6.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.1177978515625, -0.11392784118652344, -0.11005783081054688, -0.10618782043457031, -0.10231781005859375, -0.09844779968261719, -0.09457778930664062, -0.09070777893066406, -0.0868377685546875, -0.08296775817871094, -0.07909774780273438, -0.07522773742675781, -0.07135772705078125, -0.06748771667480469, -0.06361770629882812, -0.05974769592285156, -0.055877685546875, -0.05200767517089844, -0.048137664794921875, -0.04426765441894531, -0.04039764404296875, -0.03652763366699219, -0.032657623291015625, -0.028787612915039062, -0.0249176025390625, -0.021047592163085938, -0.017177581787109375, -0.013307571411132812, -0.00943756103515625, -0.0055675506591796875, -0.001697540283203125, 0.0021724700927734375, 0.00604248046875, 0.009912490844726562, 0.013782501220703125, 0.017652511596679688, 0.02152252197265625, 0.025392532348632812, 0.029262542724609375, 0.03313255310058594, 0.0370025634765625, 0.04087257385253906, 0.044742584228515625, 0.04861259460449219, 0.05248260498046875, 0.05635261535644531, 0.060222625732421875, 0.06409263610839844, 0.067962646484375, 0.07183265686035156, 0.07570266723632812, 0.07957267761230469, 0.08344268798828125, 0.08731269836425781, 0.09118270874023438, 0.09505271911621094, 0.0989227294921875, 0.10279273986816406, 0.10666275024414062, 0.11053276062011719, 0.11440277099609375, 0.11827278137207031, 0.12214279174804688, 0.12601280212402344, 0.1298828125]}, "gradients/encoder.encoder.layers.1.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 2.0, 1.0, 4.0, 4.0, 6.0, 9.0, 6.0, 8.0, 13.0, 12.0, 19.0, 23.0, 20.0, 19.0, 29.0, 32.0, 42.0, 42.0, 56.0, 47.0, 41.0, 64.0, 55.0, 58.0, 48.0, 51.0, 43.0, 45.0, 38.0, 29.0, 28.0, 14.0, 36.0, 13.0, 15.0, 8.0, 8.0, 7.0, 3.0, 3.0, 2.0, 3.0, 3.0, 4.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.093994140625, -0.09105587005615234, -0.08811759948730469, -0.08517932891845703, -0.08224105834960938, -0.07930278778076172, -0.07636451721191406, -0.0734262466430664, -0.07048797607421875, -0.0675497055053711, -0.06461143493652344, -0.06167316436767578, -0.058734893798828125, -0.05579662322998047, -0.05285835266113281, -0.049920082092285156, -0.0469818115234375, -0.044043540954589844, -0.04110527038574219, -0.03816699981689453, -0.035228729248046875, -0.03229045867919922, -0.029352188110351562, -0.026413917541503906, -0.02347564697265625, -0.020537376403808594, -0.017599105834960938, -0.014660835266113281, -0.011722564697265625, -0.008784294128417969, -0.0058460235595703125, -0.0029077529907226562, 3.0517578125e-05, 0.0029687881469726562, 0.0059070587158203125, 0.008845329284667969, 0.011783599853515625, 0.014721870422363281, 0.017660140991210938, 0.020598411560058594, 0.02353668212890625, 0.026474952697753906, 0.029413223266601562, 0.03235149383544922, 0.035289764404296875, 0.03822803497314453, 0.04116630554199219, 0.044104576110839844, 0.0470428466796875, 0.049981117248535156, 0.05291938781738281, 0.05585765838623047, 0.058795928955078125, 0.06173419952392578, 0.06467247009277344, 0.0676107406616211, 0.07054901123046875, 0.0734872817993164, 0.07642555236816406, 0.07936382293701172, 0.08230209350585938, 0.08524036407470703, 0.08817863464355469, 0.09111690521240234, 0.09405517578125]}, "gradients/encoder.encoder.layers.1.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 2.0, 0.0, 0.0, 2.0, 4.0, 2.0, 3.0, 1.0, 4.0, 9.0, 10.0, 22.0, 20.0, 22.0, 26.0, 44.0, 53.0, 85.0, 133.0, 217.0, 320.0, 557.0, 1094.0, 2423.0, 7714.0, 41131.0, 743827.0, 220251.0, 21574.0, 4983.0, 1873.0, 855.0, 442.0, 254.0, 192.0, 121.0, 82.0, 49.0, 36.0, 27.0, 26.0, 23.0, 9.0, 10.0, 12.0, 6.0, 8.0, 5.0, 1.0, 1.0, 0.0, 1.0, 2.0, 3.0], "bins": [-0.48486328125, -0.4716987609863281, -0.45853424072265625, -0.4453697204589844, -0.4322052001953125, -0.4190406799316406, -0.40587615966796875, -0.3927116394042969, -0.379547119140625, -0.3663825988769531, -0.35321807861328125, -0.3400535583496094, -0.3268890380859375, -0.3137245178222656, -0.30055999755859375, -0.2873954772949219, -0.27423095703125, -0.2610664367675781, -0.24790191650390625, -0.23473739624023438, -0.2215728759765625, -0.20840835571289062, -0.19524383544921875, -0.18207931518554688, -0.168914794921875, -0.15575027465820312, -0.14258575439453125, -0.12942123413085938, -0.1162567138671875, -0.10309219360351562, -0.08992767333984375, -0.07676315307617188, -0.0635986328125, -0.050434112548828125, -0.03726959228515625, -0.024105072021484375, -0.0109405517578125, 0.002223968505859375, 0.01538848876953125, 0.028553009033203125, 0.041717529296875, 0.054882049560546875, 0.06804656982421875, 0.08121109008789062, 0.0943756103515625, 0.10754013061523438, 0.12070465087890625, 0.13386917114257812, 0.14703369140625, 0.16019821166992188, 0.17336273193359375, 0.18652725219726562, 0.1996917724609375, 0.21285629272460938, 0.22602081298828125, 0.23918533325195312, 0.252349853515625, 0.2655143737792969, 0.27867889404296875, 0.2918434143066406, 0.3050079345703125, 0.3181724548339844, 0.33133697509765625, 0.3445014953613281, 0.357666015625]}, "gradients/encoder.encoder.layers.1.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 5.0, 3.0, 5.0, 7.0, 4.0, 4.0, 9.0, 13.0, 12.0, 15.0, 13.0, 23.0, 21.0, 31.0, 34.0, 43.0, 66.0, 63.0, 51.0, 57.0, 70.0, 68.0, 56.0, 41.0, 51.0, 46.0, 35.0, 26.0, 26.0, 17.0, 24.0, 16.0, 9.0, 12.0, 9.0, 4.0, 4.0, 4.0, 5.0, 3.0, 4.0, 4.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0], "bins": [-0.447998046875, -0.4349861145019531, -0.42197418212890625, -0.4089622497558594, -0.3959503173828125, -0.3829383850097656, -0.36992645263671875, -0.3569145202636719, -0.343902587890625, -0.3308906555175781, -0.31787872314453125, -0.3048667907714844, -0.2918548583984375, -0.2788429260253906, -0.26583099365234375, -0.2528190612792969, -0.23980712890625, -0.22679519653320312, -0.21378326416015625, -0.20077133178710938, -0.1877593994140625, -0.17474746704101562, -0.16173553466796875, -0.14872360229492188, -0.135711669921875, -0.12269973754882812, -0.10968780517578125, -0.09667587280273438, -0.0836639404296875, -0.07065200805664062, -0.05764007568359375, -0.044628143310546875, -0.0316162109375, -0.018604278564453125, -0.00559234619140625, 0.007419586181640625, 0.0204315185546875, 0.033443450927734375, 0.04645538330078125, 0.059467315673828125, 0.072479248046875, 0.08549118041992188, 0.09850311279296875, 0.11151504516601562, 0.1245269775390625, 0.13753890991210938, 0.15055084228515625, 0.16356277465820312, 0.17657470703125, 0.18958663940429688, 0.20259857177734375, 0.21561050415039062, 0.2286224365234375, 0.24163436889648438, 0.25464630126953125, 0.2676582336425781, 0.280670166015625, 0.2936820983886719, 0.30669403076171875, 0.3197059631347656, 0.3327178955078125, 0.3457298278808594, 0.35874176025390625, 0.3717536926269531, 0.384765625]}, "gradients/encoder.encoder.layers.1.attention.k_proj.weight": {"_type": "histogram", "values": [3.0, 1.0, 0.0, 2.0, 1.0, 0.0, 1.0, 3.0, 2.0, 2.0, 5.0, 4.0, 3.0, 9.0, 3.0, 11.0, 15.0, 14.0, 27.0, 43.0, 66.0, 95.0, 157.0, 272.0, 421.0, 785.0, 1435.0, 2886.0, 6892.0, 23719.0, 150425.0, 764356.0, 72381.0, 14653.0, 4829.0, 2324.0, 1098.0, 588.0, 353.0, 212.0, 164.0, 94.0, 54.0, 44.0, 40.0, 20.0, 15.0, 9.0, 8.0, 7.0, 4.0, 5.0, 5.0, 1.0, 0.0, 0.0, 0.0, 5.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0], "bins": [-0.10601806640625, -0.10263347625732422, -0.09924888610839844, -0.09586429595947266, -0.09247970581054688, -0.0890951156616211, -0.08571052551269531, -0.08232593536376953, -0.07894134521484375, -0.07555675506591797, -0.07217216491699219, -0.0687875747680664, -0.06540298461914062, -0.062018394470214844, -0.05863380432128906, -0.05524921417236328, -0.0518646240234375, -0.04848003387451172, -0.04509544372558594, -0.041710853576660156, -0.038326263427734375, -0.034941673278808594, -0.03155708312988281, -0.02817249298095703, -0.02478790283203125, -0.02140331268310547, -0.018018722534179688, -0.014634132385253906, -0.011249542236328125, -0.007864952087402344, -0.0044803619384765625, -0.0010957717895507812, 0.002288818359375, 0.005673408508300781, 0.009057998657226562, 0.012442588806152344, 0.015827178955078125, 0.019211769104003906, 0.022596359252929688, 0.02598094940185547, 0.02936553955078125, 0.03275012969970703, 0.03613471984863281, 0.039519309997558594, 0.042903900146484375, 0.046288490295410156, 0.04967308044433594, 0.05305767059326172, 0.0564422607421875, 0.05982685089111328, 0.06321144104003906, 0.06659603118896484, 0.06998062133789062, 0.0733652114868164, 0.07674980163574219, 0.08013439178466797, 0.08351898193359375, 0.08690357208251953, 0.09028816223144531, 0.0936727523803711, 0.09705734252929688, 0.10044193267822266, 0.10382652282714844, 0.10721111297607422, 0.110595703125]}, "gradients/encoder.encoder.layers.1.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 2.0, 2.0, 1.0, 2.0, 2.0, 2.0, 6.0, 5.0, 4.0, 4.0, 3.0, 8.0, 12.0, 13.0, 17.0, 27.0, 53.0, 56.0, 82.0, 98.0, 125.0, 134.0, 88.0, 68.0, 45.0, 26.0, 36.0, 18.0, 16.0, 10.0, 8.0, 6.0, 9.0, 6.0, 6.0, 3.0, 3.0, 4.0, 3.0, 4.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.5703182220458984e-05, -3.426987677812576e-05, -3.283657133579254e-05, -3.140326589345932e-05, -2.99699604511261e-05, -2.8536655008792877e-05, -2.7103349566459656e-05, -2.5670044124126434e-05, -2.4236738681793213e-05, -2.280343323945999e-05, -2.137012779712677e-05, -1.993682235479355e-05, -1.8503516912460327e-05, -1.7070211470127106e-05, -1.5636906027793884e-05, -1.4203600585460663e-05, -1.2770295143127441e-05, -1.133698970079422e-05, -9.903684258460999e-06, -8.470378816127777e-06, -7.037073373794556e-06, -5.603767931461334e-06, -4.170462489128113e-06, -2.7371570467948914e-06, -1.30385160446167e-06, 1.2945383787155151e-07, 1.562759280204773e-06, 2.9960647225379944e-06, 4.429370164871216e-06, 5.862675607204437e-06, 7.295981049537659e-06, 8.72928649187088e-06, 1.0162591934204102e-05, 1.1595897376537323e-05, 1.3029202818870544e-05, 1.4462508261203766e-05, 1.5895813703536987e-05, 1.732911914587021e-05, 1.876242458820343e-05, 2.019573003053665e-05, 2.1629035472869873e-05, 2.3062340915203094e-05, 2.4495646357536316e-05, 2.5928951799869537e-05, 2.736225724220276e-05, 2.879556268453598e-05, 3.02288681268692e-05, 3.166217356920242e-05, 3.3095479011535645e-05, 3.4528784453868866e-05, 3.596208989620209e-05, 3.739539533853531e-05, 3.882870078086853e-05, 4.026200622320175e-05, 4.169531166553497e-05, 4.3128617107868195e-05, 4.4561922550201416e-05, 4.599522799253464e-05, 4.742853343486786e-05, 4.886183887720108e-05, 5.02951443195343e-05, 5.172844976186752e-05, 5.3161755204200745e-05, 5.4595060646533966e-05, 5.602836608886719e-05]}, "gradients/encoder.encoder.layers.1.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 4.0, 3.0, 9.0, 8.0, 12.0, 19.0, 33.0, 51.0, 94.0, 167.0, 355.0, 977.0, 3650.0, 27238.0, 884677.0, 120621.0, 7872.0, 1681.0, 557.0, 234.0, 114.0, 73.0, 44.0, 29.0, 8.0, 10.0, 7.0, 5.0, 3.0, 1.0, 1.0, 0.0, 4.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.187255859375, -0.18116188049316406, -0.17506790161132812, -0.1689739227294922, -0.16287994384765625, -0.1567859649658203, -0.15069198608398438, -0.14459800720214844, -0.1385040283203125, -0.13241004943847656, -0.12631607055664062, -0.12022209167480469, -0.11412811279296875, -0.10803413391113281, -0.10194015502929688, -0.09584617614746094, -0.089752197265625, -0.08365821838378906, -0.07756423950195312, -0.07147026062011719, -0.06537628173828125, -0.05928230285644531, -0.053188323974609375, -0.04709434509277344, -0.0410003662109375, -0.03490638732910156, -0.028812408447265625, -0.022718429565429688, -0.01662445068359375, -0.010530471801757812, -0.004436492919921875, 0.0016574859619140625, 0.00775146484375, 0.013845443725585938, 0.019939422607421875, 0.026033401489257812, 0.03212738037109375, 0.03822135925292969, 0.044315338134765625, 0.05040931701660156, 0.0565032958984375, 0.06259727478027344, 0.06869125366210938, 0.07478523254394531, 0.08087921142578125, 0.08697319030761719, 0.09306716918945312, 0.09916114807128906, 0.105255126953125, 0.11134910583496094, 0.11744308471679688, 0.12353706359863281, 0.12963104248046875, 0.1357250213623047, 0.14181900024414062, 0.14791297912597656, 0.1540069580078125, 0.16010093688964844, 0.16619491577148438, 0.1722888946533203, 0.17838287353515625, 0.1844768524169922, 0.19057083129882812, 0.19666481018066406, 0.2027587890625]}, "gradients/encoder.encoder.layers.1.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 2.0, 3.0, 1.0, 2.0, 10.0, 11.0, 17.0, 26.0, 59.0, 74.0, 143.0, 185.0, 189.0, 114.0, 66.0, 43.0, 27.0, 18.0, 9.0, 6.0, 0.0, 3.0, 1.0, 2.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1571044921875, -0.15201377868652344, -0.14692306518554688, -0.1418323516845703, -0.13674163818359375, -0.1316509246826172, -0.12656021118164062, -0.12146949768066406, -0.1163787841796875, -0.11128807067871094, -0.10619735717773438, -0.10110664367675781, -0.09601593017578125, -0.09092521667480469, -0.08583450317382812, -0.08074378967285156, -0.075653076171875, -0.07056236267089844, -0.06547164916992188, -0.06038093566894531, -0.05529022216796875, -0.05019950866699219, -0.045108795166015625, -0.04001808166503906, -0.0349273681640625, -0.029836654663085938, -0.024745941162109375, -0.019655227661132812, -0.01456451416015625, -0.009473800659179688, -0.004383087158203125, 0.0007076263427734375, 0.00579833984375, 0.010889053344726562, 0.015979766845703125, 0.021070480346679688, 0.02616119384765625, 0.03125190734863281, 0.036342620849609375, 0.04143333435058594, 0.0465240478515625, 0.05161476135253906, 0.056705474853515625, 0.06179618835449219, 0.06688690185546875, 0.07197761535644531, 0.07706832885742188, 0.08215904235839844, 0.087249755859375, 0.09234046936035156, 0.09743118286132812, 0.10252189636230469, 0.10761260986328125, 0.11270332336425781, 0.11779403686523438, 0.12288475036621094, 0.1279754638671875, 0.13306617736816406, 0.13815689086914062, 0.1432476043701172, 0.14833831787109375, 0.1534290313720703, 0.15851974487304688, 0.16361045837402344, 0.168701171875]}, "gradients/encoder.encoder.layers.1.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 2.0, 11.0, 3.0, 6.0, 16.0, 29.0, 45.0, 72.0, 135.0, 386.0, 145.0, 69.0, 39.0, 21.0, 12.0, 5.0, 10.0, 1.0, 1.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-3.1913084983825684, -3.0927889347076416, -2.994269371032715, -2.895749568939209, -2.7972300052642822, -2.6987104415893555, -2.6001908779144287, -2.501671314239502, -2.403151750564575, -2.3046321868896484, -2.2061126232147217, -2.107593059539795, -2.009073257446289, -1.9105536937713623, -1.8120341300964355, -1.7135145664215088, -1.6149948835372925, -1.5164753198623657, -1.4179556369781494, -1.3194360733032227, -1.220916509628296, -1.1223969459533691, -1.0238772630691528, -0.9253576993942261, -0.8268380761146545, -0.728318452835083, -0.6297988891601562, -0.5312792658805847, -0.43275967240333557, -0.3342400789260864, -0.2357204556465149, -0.13720089197158813, -0.0386812686920166, 0.05983833223581314, 0.15835793316364288, 0.2568775415420532, 0.35539713501930237, 0.4539167284965515, 0.552436351776123, 0.6509559154510498, 0.7494755387306213, 0.8479951620101929, 0.9465147256851196, 1.045034408569336, 1.1435539722442627, 1.2420735359191895, 1.3405930995941162, 1.439112663269043, 1.5376323461532593, 1.636151909828186, 1.7346715927124023, 1.833191156387329, 1.9317107200622559, 2.0302302837371826, 2.1287498474121094, 2.2272696495056152, 2.325789213180542, 2.4243087768554688, 2.5228283405303955, 2.6213479042053223, 2.719867706298828, 2.818387269973755, 2.9169068336486816, 3.0154263973236084, 3.113945960998535]}, "gradients/encoder.encoder.layers.1.layer_norm.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 3.0, 2.0, 4.0, 1.0, 5.0, 4.0, 4.0, 8.0, 12.0, 10.0, 10.0, 8.0, 18.0, 14.0, 14.0, 21.0, 23.0, 33.0, 21.0, 29.0, 31.0, 45.0, 49.0, 127.0, 173.0, 53.0, 40.0, 40.0, 26.0, 33.0, 22.0, 15.0, 12.0, 16.0, 18.0, 10.0, 17.0, 9.0, 9.0, 3.0, 5.0, 8.0, 3.0, 1.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-1.8218750953674316, -1.7668397426605225, -1.7118045091629028, -1.6567691564559937, -1.6017338037490845, -1.5466985702514648, -1.4916632175445557, -1.4366278648376465, -1.3815926313400269, -1.3265572786331177, -1.271522045135498, -1.2164866924285889, -1.1614513397216797, -1.10641610622406, -1.0513807535171509, -0.9963454604148865, -0.9413101077079773, -0.8862748146057129, -0.8312394618988037, -0.7762041687965393, -0.7211688756942749, -0.6661335229873657, -0.6110982298851013, -0.5560629367828369, -0.5010275840759277, -0.44599226117134094, -0.39095696806907654, -0.33592164516448975, -0.28088635206222534, -0.22585102915763855, -0.17081570625305176, -0.11578041315078735, -0.06074512004852295, -0.005709808319807053, 0.049325503408908844, 0.10436081886291504, 0.15939612686634064, 0.21443143486976624, 0.269466757774353, 0.32450205087661743, 0.3795373737812042, 0.434572696685791, 0.4896079897880554, 0.5446432828903198, 0.599678635597229, 0.6547139286994934, 0.7097492218017578, 0.764784574508667, 0.8198198676109314, 0.8748551607131958, 0.929890513420105, 0.9849258065223694, 1.0399610996246338, 1.094996452331543, 1.1500318050384521, 1.2050670385360718, 1.260102391242981, 1.3151377439498901, 1.3701729774475098, 1.425208330154419, 1.4802436828613281, 1.5352789163589478, 1.590314269065857, 1.6453495025634766, 1.7003848552703857]}, "gradients/encoder.encoder.layers.0.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 4.0, 0.0, 1.0, 2.0, 5.0, 5.0, 2.0, 2.0, 6.0, 13.0, 18.0, 22.0, 42.0, 37.0, 72.0, 165.0, 270.0, 521.0, 1053.0, 2617.0, 8114.0, 37274.0, 400296.0, 3297437.0, 396007.0, 37522.0, 8183.0, 2533.0, 1026.0, 443.0, 266.0, 139.0, 85.0, 46.0, 25.0, 20.0, 8.0, 7.0, 5.0, 3.0, 1.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.348876953125, -0.33931732177734375, -0.3297576904296875, -0.32019805908203125, -0.310638427734375, -0.30107879638671875, -0.2915191650390625, -0.28195953369140625, -0.27239990234375, -0.26284027099609375, -0.2532806396484375, -0.24372100830078125, -0.234161376953125, -0.22460174560546875, -0.2150421142578125, -0.20548248291015625, -0.1959228515625, -0.18636322021484375, -0.1768035888671875, -0.16724395751953125, -0.157684326171875, -0.14812469482421875, -0.1385650634765625, -0.12900543212890625, -0.11944580078125, -0.10988616943359375, -0.1003265380859375, -0.09076690673828125, -0.081207275390625, -0.07164764404296875, -0.0620880126953125, -0.05252838134765625, -0.04296875, -0.03340911865234375, -0.0238494873046875, -0.01428985595703125, -0.004730224609375, 0.00482940673828125, 0.0143890380859375, 0.02394866943359375, 0.03350830078125, 0.04306793212890625, 0.0526275634765625, 0.06218719482421875, 0.071746826171875, 0.08130645751953125, 0.0908660888671875, 0.10042572021484375, 0.1099853515625, 0.11954498291015625, 0.1291046142578125, 0.13866424560546875, 0.148223876953125, 0.15778350830078125, 0.1673431396484375, 0.17690277099609375, 0.18646240234375, 0.19602203369140625, 0.2055816650390625, 0.21514129638671875, 0.224700927734375, 0.23426055908203125, 0.2438201904296875, 0.25337982177734375, 0.262939453125]}, "gradients/encoder.encoder.layers.0.feed_forward.output_dense.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 2.0, 3.0, 3.0, 5.0, 5.0, 4.0, 3.0, 10.0, 16.0, 12.0, 13.0, 13.0, 22.0, 29.0, 25.0, 22.0, 25.0, 33.0, 39.0, 41.0, 47.0, 44.0, 53.0, 52.0, 42.0, 51.0, 63.0, 36.0, 37.0, 26.0, 34.0, 33.0, 24.0, 20.0, 25.0, 23.0, 20.0, 10.0, 11.0, 12.0, 7.0, 3.0, 4.0, 3.0, 4.0, 2.0, 4.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.07794189453125, -0.07540512084960938, -0.07286834716796875, -0.07033157348632812, -0.0677947998046875, -0.06525802612304688, -0.06272125244140625, -0.060184478759765625, -0.057647705078125, -0.055110931396484375, -0.05257415771484375, -0.050037384033203125, -0.0475006103515625, -0.044963836669921875, -0.04242706298828125, -0.039890289306640625, -0.037353515625, -0.034816741943359375, -0.03227996826171875, -0.029743194580078125, -0.0272064208984375, -0.024669647216796875, -0.02213287353515625, -0.019596099853515625, -0.017059326171875, -0.014522552490234375, -0.01198577880859375, -0.009449005126953125, -0.0069122314453125, -0.004375457763671875, -0.00183868408203125, 0.000698089599609375, 0.00323486328125, 0.005771636962890625, 0.00830841064453125, 0.010845184326171875, 0.0133819580078125, 0.015918731689453125, 0.01845550537109375, 0.020992279052734375, 0.023529052734375, 0.026065826416015625, 0.02860260009765625, 0.031139373779296875, 0.0336761474609375, 0.036212921142578125, 0.03874969482421875, 0.041286468505859375, 0.0438232421875, 0.046360015869140625, 0.04889678955078125, 0.051433563232421875, 0.0539703369140625, 0.056507110595703125, 0.05904388427734375, 0.061580657958984375, 0.064117431640625, 0.06665420532226562, 0.06919097900390625, 0.07172775268554688, 0.0742645263671875, 0.07680130004882812, 0.07933807373046875, 0.08187484741210938, 0.08441162109375]}, "gradients/encoder.encoder.layers.0.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 5.0, 5.0, 5.0, 3.0, 5.0, 16.0, 20.0, 30.0, 50.0, 71.0, 116.0, 204.0, 384.0, 695.0, 1711.0, 8171.0, 364765.0, 3794788.0, 18560.0, 2643.0, 882.0, 518.0, 231.0, 145.0, 96.0, 63.0, 39.0, 27.0, 15.0, 9.0, 4.0, 6.0, 4.0, 2.0, 2.0, 4.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-1.0478515625, -1.0188751220703125, -0.989898681640625, -0.9609222412109375, -0.93194580078125, -0.9029693603515625, -0.873992919921875, -0.8450164794921875, -0.8160400390625, -0.7870635986328125, -0.758087158203125, -0.7291107177734375, -0.70013427734375, -0.6711578369140625, -0.642181396484375, -0.6132049560546875, -0.584228515625, -0.5552520751953125, -0.526275634765625, -0.4972991943359375, -0.46832275390625, -0.4393463134765625, -0.410369873046875, -0.3813934326171875, -0.3524169921875, -0.3234405517578125, -0.294464111328125, -0.2654876708984375, -0.23651123046875, -0.2075347900390625, -0.178558349609375, -0.1495819091796875, -0.12060546875, -0.0916290283203125, -0.062652587890625, -0.0336761474609375, -0.00469970703125, 0.0242767333984375, 0.053253173828125, 0.0822296142578125, 0.1112060546875, 0.1401824951171875, 0.169158935546875, 0.1981353759765625, 0.22711181640625, 0.2560882568359375, 0.285064697265625, 0.3140411376953125, 0.343017578125, 0.3719940185546875, 0.400970458984375, 0.4299468994140625, 0.45892333984375, 0.4878997802734375, 0.516876220703125, 0.5458526611328125, 0.5748291015625, 0.6038055419921875, 0.632781982421875, 0.6617584228515625, 0.69073486328125, 0.7197113037109375, 0.748687744140625, 0.7776641845703125, 0.806640625]}, "gradients/encoder.encoder.layers.0.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 6.0, 5.0, 2.0, 3.0, 6.0, 7.0, 12.0, 25.0, 35.0, 31.0, 43.0, 76.0, 69.0, 137.0, 202.0, 304.0, 626.0, 729.0, 610.0, 396.0, 240.0, 155.0, 118.0, 71.0, 45.0, 31.0, 21.0, 26.0, 15.0, 19.0, 8.0, 6.0, 0.0, 1.0, 2.0, 0.0, 1.0, 3.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.305419921875, -0.29390716552734375, -0.2823944091796875, -0.27088165283203125, -0.259368896484375, -0.24785614013671875, -0.2363433837890625, -0.22483062744140625, -0.21331787109375, -0.20180511474609375, -0.1902923583984375, -0.17877960205078125, -0.167266845703125, -0.15575408935546875, -0.1442413330078125, -0.13272857666015625, -0.1212158203125, -0.10970306396484375, -0.0981903076171875, -0.08667755126953125, -0.075164794921875, -0.06365203857421875, -0.0521392822265625, -0.04062652587890625, -0.02911376953125, -0.01760101318359375, -0.0060882568359375, 0.00542449951171875, 0.016937255859375, 0.02845001220703125, 0.0399627685546875, 0.05147552490234375, 0.06298828125, 0.07450103759765625, 0.0860137939453125, 0.09752655029296875, 0.109039306640625, 0.12055206298828125, 0.1320648193359375, 0.14357757568359375, 0.15509033203125, 0.16660308837890625, 0.1781158447265625, 0.18962860107421875, 0.201141357421875, 0.21265411376953125, 0.2241668701171875, 0.23567962646484375, 0.2471923828125, 0.25870513916015625, 0.2702178955078125, 0.28173065185546875, 0.293243408203125, 0.30475616455078125, 0.3162689208984375, 0.32778167724609375, 0.33929443359375, 0.35080718994140625, 0.3623199462890625, 0.37383270263671875, 0.385345458984375, 0.39685821533203125, 0.4083709716796875, 0.41988372802734375, 0.431396484375]}, "gradients/encoder.encoder.layers.0.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 3.0, 2.0, 3.0, 5.0, 6.0, 18.0, 23.0, 46.0, 104.0, 214.0, 263.0, 191.0, 76.0, 28.0, 11.0, 6.0, 4.0, 3.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-6.403827667236328, -6.197922706604004, -5.992018222808838, -5.786113262176514, -5.5802083015441895, -5.374303817749023, -5.168398857116699, -4.962493896484375, -4.756588935852051, -4.550683975219727, -4.3447794914245605, -4.138874530792236, -3.932969570159912, -3.727064847946167, -3.521160125732422, -3.3152551651000977, -3.1093506813049316, -2.9034459590911865, -2.6975409984588623, -2.491636276245117, -2.285731315612793, -2.079826593399048, -1.8739218711853027, -1.668017029762268, -1.4621121883392334, -1.2562073469161987, -1.050302505493164, -0.844397783279419, -0.6384929418563843, -0.4325881004333496, -0.2266833782196045, -0.020778536796569824, 0.18512582778930664, 0.3910306394100189, 0.5969354510307312, 0.8028402328491211, 1.0087450742721558, 1.2146499156951904, 1.4205546379089355, 1.6264594793319702, 1.8323643207550049, 2.03826904296875, 2.244174003601074, 2.4500787258148193, 2.6559834480285645, 2.8618884086608887, 3.067793130874634, 3.273697853088379, 3.479602813720703, 3.6855075359344482, 3.8914124965667725, 4.097317218780518, 4.303222179412842, 4.509126663208008, 4.715031623840332, 4.920936584472656, 5.1268415451049805, 5.332746505737305, 5.538650989532471, 5.744555950164795, 5.950460910797119, 6.156365394592285, 6.362270355224609, 6.568175315856934, 6.7740797996521]}, "gradients/encoder.encoder.layers.0.final_layer_norm.bias": {"_type": "histogram", "values": [3.0, 1.0, 1.0, 0.0, 4.0, 5.0, 2.0, 4.0, 3.0, 4.0, 12.0, 10.0, 13.0, 9.0, 11.0, 20.0, 21.0, 28.0, 19.0, 17.0, 19.0, 20.0, 29.0, 36.0, 38.0, 42.0, 49.0, 42.0, 44.0, 50.0, 49.0, 51.0, 41.0, 40.0, 32.0, 35.0, 25.0, 28.0, 21.0, 21.0, 21.0, 14.0, 22.0, 6.0, 17.0, 6.0, 9.0, 5.0, 5.0, 5.0, 3.0, 2.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-1.5701229572296143, -1.5177509784698486, -1.465378999710083, -1.4130070209503174, -1.3606350421905518, -1.3082630634307861, -1.25589120388031, -1.2035192251205444, -1.1511472463607788, -1.0987752676010132, -1.0464032888412476, -0.9940313696861267, -0.9416593909263611, -0.8892874121665955, -0.8369154930114746, -0.784543514251709, -0.7321715354919434, -0.6797995567321777, -0.6274275779724121, -0.5750556588172913, -0.5226836800575256, -0.47031170129776, -0.4179397523403168, -0.36556780338287354, -0.3131958246231079, -0.2608238458633423, -0.20845189690589905, -0.15607993304729462, -0.10370796918869019, -0.05133599042892456, 0.0010359585285186768, 0.053407907485961914, 0.10578000545501709, 0.15815196931362152, 0.21052393317222595, 0.2628958821296692, 0.3152678608894348, 0.36763983964920044, 0.4200117886066437, 0.4723837375640869, 0.5247557163238525, 0.5771276950836182, 0.6294996738433838, 0.6818715929985046, 0.7342435717582703, 0.7866155505180359, 0.8389874696731567, 0.8913594484329224, 0.943731427192688, 0.9961034059524536, 1.0484753847122192, 1.1008473634719849, 1.153219223022461, 1.2055912017822266, 1.2579631805419922, 1.3103351593017578, 1.3627071380615234, 1.415079116821289, 1.4674510955810547, 1.5198230743408203, 1.572195053100586, 1.6245670318603516, 1.6769388914108276, 1.7293108701705933, 1.7816828489303589]}, "gradients/encoder.encoder.layers.0.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 2.0, 3.0, 3.0, 4.0, 5.0, 16.0, 16.0, 37.0, 49.0, 96.0, 171.0, 339.0, 641.0, 1732.0, 4597.0, 18917.0, 144942.0, 772434.0, 85129.0, 13278.0, 3615.0, 1327.0, 593.0, 279.0, 144.0, 72.0, 40.0, 30.0, 22.0, 9.0, 13.0, 6.0, 1.0, 2.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.427734375, -0.41452789306640625, -0.4013214111328125, -0.38811492919921875, -0.374908447265625, -0.36170196533203125, -0.3484954833984375, -0.33528900146484375, -0.32208251953125, -0.30887603759765625, -0.2956695556640625, -0.28246307373046875, -0.269256591796875, -0.25605010986328125, -0.2428436279296875, -0.22963714599609375, -0.2164306640625, -0.20322418212890625, -0.1900177001953125, -0.17681121826171875, -0.163604736328125, -0.15039825439453125, -0.1371917724609375, -0.12398529052734375, -0.11077880859375, -0.09757232666015625, -0.0843658447265625, -0.07115936279296875, -0.057952880859375, -0.04474639892578125, -0.0315399169921875, -0.01833343505859375, -0.005126953125, 0.00807952880859375, 0.0212860107421875, 0.03449249267578125, 0.047698974609375, 0.06090545654296875, 0.0741119384765625, 0.08731842041015625, 0.10052490234375, 0.11373138427734375, 0.1269378662109375, 0.14014434814453125, 0.153350830078125, 0.16655731201171875, 0.1797637939453125, 0.19297027587890625, 0.2061767578125, 0.21938323974609375, 0.2325897216796875, 0.24579620361328125, 0.259002685546875, 0.27220916748046875, 0.2854156494140625, 0.29862213134765625, 0.31182861328125, 0.32503509521484375, 0.3382415771484375, 0.35144805908203125, 0.364654541015625, 0.37786102294921875, 0.3910675048828125, 0.40427398681640625, 0.41748046875]}, "gradients/encoder.encoder.layers.0.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 4.0, 3.0, 3.0, 5.0, 5.0, 8.0, 14.0, 22.0, 24.0, 40.0, 59.0, 63.0, 73.0, 96.0, 108.0, 93.0, 90.0, 74.0, 56.0, 59.0, 42.0, 28.0, 18.0, 16.0, 3.0, 4.0, 2.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.2298583984375, -0.2221546173095703, -0.21445083618164062, -0.20674705505371094, -0.19904327392578125, -0.19133949279785156, -0.18363571166992188, -0.1759319305419922, -0.1682281494140625, -0.1605243682861328, -0.15282058715820312, -0.14511680603027344, -0.13741302490234375, -0.12970924377441406, -0.12200546264648438, -0.11430168151855469, -0.106597900390625, -0.09889411926269531, -0.09119033813476562, -0.08348655700683594, -0.07578277587890625, -0.06807899475097656, -0.060375213623046875, -0.05267143249511719, -0.0449676513671875, -0.03726387023925781, -0.029560089111328125, -0.021856307983398438, -0.01415252685546875, -0.0064487457275390625, 0.001255035400390625, 0.008958816528320312, 0.01666259765625, 0.024366378784179688, 0.032070159912109375, 0.03977394104003906, 0.04747772216796875, 0.05518150329589844, 0.06288528442382812, 0.07058906555175781, 0.0782928466796875, 0.08599662780761719, 0.09370040893554688, 0.10140419006347656, 0.10910797119140625, 0.11681175231933594, 0.12451553344726562, 0.1322193145751953, 0.139923095703125, 0.1476268768310547, 0.15533065795898438, 0.16303443908691406, 0.17073822021484375, 0.17844200134277344, 0.18614578247070312, 0.1938495635986328, 0.2015533447265625, 0.2092571258544922, 0.21696090698242188, 0.22466468811035156, 0.23236846923828125, 0.24007225036621094, 0.24777603149414062, 0.2554798126220703, 0.26318359375]}, "gradients/encoder.encoder.layers.0.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 4.0, 6.0, 5.0, 9.0, 11.0, 17.0, 14.0, 22.0, 22.0, 43.0, 68.0, 78.0, 150.0, 261.0, 472.0, 1011.0, 2798.0, 11693.0, 103988.0, 880191.0, 37973.0, 6212.0, 1811.0, 731.0, 348.0, 200.0, 129.0, 86.0, 66.0, 49.0, 32.0, 18.0, 14.0, 8.0, 5.0, 6.0, 8.0, 4.0, 2.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.262939453125, -0.2511482238769531, -0.23935699462890625, -0.22756576538085938, -0.2157745361328125, -0.20398330688476562, -0.19219207763671875, -0.18040084838867188, -0.168609619140625, -0.15681838989257812, -0.14502716064453125, -0.13323593139648438, -0.1214447021484375, -0.10965347290039062, -0.09786224365234375, -0.08607101440429688, -0.07427978515625, -0.062488555908203125, -0.05069732666015625, -0.038906097412109375, -0.0271148681640625, -0.015323638916015625, -0.00353240966796875, 0.008258819580078125, 0.020050048828125, 0.031841278076171875, 0.04363250732421875, 0.055423736572265625, 0.0672149658203125, 0.07900619506835938, 0.09079742431640625, 0.10258865356445312, 0.1143798828125, 0.12617111206054688, 0.13796234130859375, 0.14975357055664062, 0.1615447998046875, 0.17333602905273438, 0.18512725830078125, 0.19691848754882812, 0.208709716796875, 0.22050094604492188, 0.23229217529296875, 0.24408340454101562, 0.2558746337890625, 0.2676658630371094, 0.27945709228515625, 0.2912483215332031, 0.30303955078125, 0.3148307800292969, 0.32662200927734375, 0.3384132385253906, 0.3502044677734375, 0.3619956970214844, 0.37378692626953125, 0.3855781555175781, 0.397369384765625, 0.4091606140136719, 0.42095184326171875, 0.4327430725097656, 0.4445343017578125, 0.4563255310058594, 0.46811676025390625, 0.4799079895019531, 0.49169921875]}, "gradients/encoder.encoder.layers.0.attention.v_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 1.0, 1.0, 2.0, 2.0, 3.0, 2.0, 1.0, 4.0, 8.0, 6.0, 6.0, 10.0, 8.0, 11.0, 14.0, 21.0, 22.0, 26.0, 22.0, 33.0, 29.0, 57.0, 56.0, 70.0, 82.0, 85.0, 77.0, 61.0, 44.0, 44.0, 30.0, 26.0, 21.0, 17.0, 14.0, 15.0, 17.0, 13.0, 12.0, 2.0, 9.0, 9.0, 7.0, 3.0, 3.0, 7.0, 1.0, 2.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.4423828125, -0.426666259765625, -0.41094970703125, -0.395233154296875, -0.3795166015625, -0.363800048828125, -0.34808349609375, -0.332366943359375, -0.316650390625, -0.300933837890625, -0.28521728515625, -0.269500732421875, -0.2537841796875, -0.238067626953125, -0.22235107421875, -0.206634521484375, -0.19091796875, -0.175201416015625, -0.15948486328125, -0.143768310546875, -0.1280517578125, -0.112335205078125, -0.09661865234375, -0.080902099609375, -0.065185546875, -0.049468994140625, -0.03375244140625, -0.018035888671875, -0.0023193359375, 0.013397216796875, 0.02911376953125, 0.044830322265625, 0.060546875, 0.076263427734375, 0.09197998046875, 0.107696533203125, 0.1234130859375, 0.139129638671875, 0.15484619140625, 0.170562744140625, 0.186279296875, 0.201995849609375, 0.21771240234375, 0.233428955078125, 0.2491455078125, 0.264862060546875, 0.28057861328125, 0.296295166015625, 0.31201171875, 0.327728271484375, 0.34344482421875, 0.359161376953125, 0.3748779296875, 0.390594482421875, 0.40631103515625, 0.422027587890625, 0.437744140625, 0.453460693359375, 0.46917724609375, 0.484893798828125, 0.5006103515625, 0.516326904296875, 0.53204345703125, 0.547760009765625, 0.5634765625]}, "gradients/encoder.encoder.layers.0.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 3.0, 1.0, 2.0, 0.0, 1.0, 1.0, 3.0, 1.0, 1.0, 2.0, 5.0, 7.0, 6.0, 12.0, 16.0, 18.0, 20.0, 34.0, 41.0, 60.0, 87.0, 162.0, 341.0, 793.0, 2629.0, 13572.0, 812797.0, 203778.0, 10515.0, 2196.0, 703.0, 297.0, 153.0, 84.0, 49.0, 40.0, 44.0, 26.0, 16.0, 8.0, 13.0, 5.0, 5.0, 4.0, 3.0, 2.0, 4.0, 1.0, 0.0, 4.0, 1.0, 3.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.111572265625, -0.10784530639648438, -0.10411834716796875, -0.10039138793945312, -0.0966644287109375, -0.09293746948242188, -0.08921051025390625, -0.08548355102539062, -0.081756591796875, -0.07802963256835938, -0.07430267333984375, -0.07057571411132812, -0.0668487548828125, -0.06312179565429688, -0.05939483642578125, -0.055667877197265625, -0.05194091796875, -0.048213958740234375, -0.04448699951171875, -0.040760040283203125, -0.0370330810546875, -0.033306121826171875, -0.02957916259765625, -0.025852203369140625, -0.022125244140625, -0.018398284912109375, -0.01467132568359375, -0.010944366455078125, -0.0072174072265625, -0.003490447998046875, 0.00023651123046875, 0.003963470458984375, 0.0076904296875, 0.011417388916015625, 0.01514434814453125, 0.018871307373046875, 0.0225982666015625, 0.026325225830078125, 0.03005218505859375, 0.033779144287109375, 0.037506103515625, 0.041233062744140625, 0.04496002197265625, 0.048686981201171875, 0.0524139404296875, 0.056140899658203125, 0.05986785888671875, 0.06359481811523438, 0.06732177734375, 0.07104873657226562, 0.07477569580078125, 0.07850265502929688, 0.0822296142578125, 0.08595657348632812, 0.08968353271484375, 0.09341049194335938, 0.097137451171875, 0.10086441040039062, 0.10459136962890625, 0.10831832885742188, 0.1120452880859375, 0.11577224731445312, 0.11949920654296875, 0.12322616577148438, 0.126953125]}, "gradients/encoder.encoder.layers.0.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 2.0, 2.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 5.0, 3.0, 6.0, 6.0, 13.0, 8.0, 14.0, 11.0, 19.0, 19.0, 27.0, 44.0, 53.0, 90.0, 124.0, 158.0, 128.0, 65.0, 48.0, 31.0, 30.0, 27.0, 17.0, 11.0, 5.0, 6.0, 6.0, 4.0, 3.0, 4.0, 3.0, 7.0, 0.0, 2.0, 1.0, 1.0, 2.0, 3.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.5822391510009766e-05, -3.4714117646217346e-05, -3.360584378242493e-05, -3.249756991863251e-05, -3.138929605484009e-05, -3.028102219104767e-05, -2.917274832725525e-05, -2.806447446346283e-05, -2.695620059967041e-05, -2.584792673587799e-05, -2.473965287208557e-05, -2.3631379008293152e-05, -2.2523105144500732e-05, -2.1414831280708313e-05, -2.0306557416915894e-05, -1.9198283553123474e-05, -1.8090009689331055e-05, -1.6981735825538635e-05, -1.5873461961746216e-05, -1.4765188097953796e-05, -1.3656914234161377e-05, -1.2548640370368958e-05, -1.1440366506576538e-05, -1.0332092642784119e-05, -9.2238187789917e-06, -8.11554491519928e-06, -7.00727105140686e-06, -5.898997187614441e-06, -4.7907233238220215e-06, -3.682449460029602e-06, -2.5741755962371826e-06, -1.4659017324447632e-06, -3.5762786865234375e-07, 7.506459951400757e-07, 1.8589198589324951e-06, 2.9671937227249146e-06, 4.075467586517334e-06, 5.183741450309753e-06, 6.292015314102173e-06, 7.400289177894592e-06, 8.508563041687012e-06, 9.616836905479431e-06, 1.072511076927185e-05, 1.183338463306427e-05, 1.294165849685669e-05, 1.4049932360649109e-05, 1.5158206224441528e-05, 1.6266480088233948e-05, 1.7374753952026367e-05, 1.8483027815818787e-05, 1.9591301679611206e-05, 2.0699575543403625e-05, 2.1807849407196045e-05, 2.2916123270988464e-05, 2.4024397134780884e-05, 2.5132670998573303e-05, 2.6240944862365723e-05, 2.7349218726158142e-05, 2.845749258995056e-05, 2.956576645374298e-05, 3.06740403175354e-05, 3.178231418132782e-05, 3.289058804512024e-05, 3.399886190891266e-05, 3.510713577270508e-05]}, "gradients/encoder.encoder.layers.0.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 4.0, 4.0, 2.0, 2.0, 3.0, 4.0, 9.0, 9.0, 5.0, 16.0, 35.0, 53.0, 100.0, 262.0, 735.0, 3119.0, 22772.0, 942965.0, 70626.0, 5967.0, 1185.0, 386.0, 150.0, 51.0, 34.0, 24.0, 9.0, 9.0, 5.0, 5.0, 4.0, 2.0, 1.0, 3.0, 2.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1309814453125, -0.1268482208251953, -0.12271499633789062, -0.11858177185058594, -0.11444854736328125, -0.11031532287597656, -0.10618209838867188, -0.10204887390136719, -0.0979156494140625, -0.09378242492675781, -0.08964920043945312, -0.08551597595214844, -0.08138275146484375, -0.07724952697753906, -0.07311630249023438, -0.06898307800292969, -0.064849853515625, -0.06071662902832031, -0.056583404541015625, -0.05245018005371094, -0.04831695556640625, -0.04418373107910156, -0.040050506591796875, -0.03591728210449219, -0.0317840576171875, -0.027650833129882812, -0.023517608642578125, -0.019384384155273438, -0.01525115966796875, -0.011117935180664062, -0.006984710693359375, -0.0028514862060546875, 0.00128173828125, 0.0054149627685546875, 0.009548187255859375, 0.013681411743164062, 0.01781463623046875, 0.021947860717773438, 0.026081085205078125, 0.030214309692382812, 0.0343475341796875, 0.03848075866699219, 0.042613983154296875, 0.04674720764160156, 0.05088043212890625, 0.05501365661621094, 0.059146881103515625, 0.06328010559082031, 0.067413330078125, 0.07154655456542969, 0.07567977905273438, 0.07981300354003906, 0.08394622802734375, 0.08807945251464844, 0.09221267700195312, 0.09634590148925781, 0.1004791259765625, 0.10461235046386719, 0.10874557495117188, 0.11287879943847656, 0.11701202392578125, 0.12114524841308594, 0.12527847290039062, 0.1294116973876953, 0.133544921875]}, "gradients/encoder.encoder.layers.0.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 3.0, 3.0, 4.0, 2.0, 5.0, 9.0, 5.0, 7.0, 4.0, 13.0, 23.0, 21.0, 32.0, 53.0, 75.0, 127.0, 141.0, 149.0, 108.0, 58.0, 45.0, 36.0, 23.0, 14.0, 17.0, 8.0, 4.0, 6.0, 5.0, 6.0, 3.0, 1.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0802001953125, -0.07769107818603516, -0.07518196105957031, -0.07267284393310547, -0.07016372680664062, -0.06765460968017578, -0.06514549255371094, -0.0626363754272461, -0.06012725830078125, -0.057618141174316406, -0.05510902404785156, -0.05259990692138672, -0.050090789794921875, -0.04758167266845703, -0.04507255554199219, -0.042563438415527344, -0.0400543212890625, -0.037545204162597656, -0.03503608703613281, -0.03252696990966797, -0.030017852783203125, -0.02750873565673828, -0.024999618530273438, -0.022490501403808594, -0.01998138427734375, -0.017472267150878906, -0.014963150024414062, -0.012454032897949219, -0.009944915771484375, -0.007435798645019531, -0.0049266815185546875, -0.0024175643920898438, 9.1552734375e-05, 0.0026006698608398438, 0.0051097869873046875, 0.007618904113769531, 0.010128021240234375, 0.012637138366699219, 0.015146255493164062, 0.017655372619628906, 0.02016448974609375, 0.022673606872558594, 0.025182723999023438, 0.02769184112548828, 0.030200958251953125, 0.03271007537841797, 0.03521919250488281, 0.037728309631347656, 0.0402374267578125, 0.042746543884277344, 0.04525566101074219, 0.04776477813720703, 0.050273895263671875, 0.05278301239013672, 0.05529212951660156, 0.057801246643066406, 0.06031036376953125, 0.0628194808959961, 0.06532859802246094, 0.06783771514892578, 0.07034683227539062, 0.07285594940185547, 0.07536506652832031, 0.07787418365478516, 0.08038330078125]}, "gradients/encoder.encoder.layers.0.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 3.0, 1.0, 5.0, 3.0, 4.0, 8.0, 10.0, 11.0, 20.0, 25.0, 46.0, 78.0, 244.0, 361.0, 71.0, 51.0, 22.0, 22.0, 5.0, 6.0, 4.0, 4.0, 4.0, 2.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.7144558429718018, -2.651451587677002, -2.588447093963623, -2.5254428386688232, -2.4624385833740234, -2.3994343280792236, -2.336430072784424, -2.273425579071045, -2.210421323776245, -2.1474170684814453, -2.0844125747680664, -2.0214083194732666, -1.9584040641784668, -1.895399808883667, -1.8323954343795776, -1.7693910598754883, -1.7063868045806885, -1.6433825492858887, -1.5803781747817993, -1.51737380027771, -1.4543695449829102, -1.3913652896881104, -1.328360915184021, -1.2653565406799316, -1.2023522853851318, -1.139348030090332, -1.0763436555862427, -1.0133392810821533, -0.9503350257873535, -0.8873307108879089, -0.8243263959884644, -0.7613220810890198, -0.6983180046081543, -0.6353136897087097, -0.5723093748092651, -0.5093050599098206, -0.446300745010376, -0.3832964301109314, -0.3202921152114868, -0.25728780031204224, -0.19428348541259766, -0.13127917051315308, -0.0682748556137085, -0.005270540714263916, 0.057733774185180664, 0.12073808908462524, 0.18374240398406982, 0.2467467188835144, 0.309751033782959, 0.37275534868240356, 0.43575966358184814, 0.4987639784812927, 0.5617682933807373, 0.6247726082801819, 0.6877769231796265, 0.750781238079071, 0.8137855529785156, 0.8767898678779602, 0.9397941827774048, 1.0027985572814941, 1.065802812576294, 1.1288070678710938, 1.191811442375183, 1.2548158168792725, 1.3178200721740723]}, "gradients/encoder.encoder.layers.0.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 4.0, 3.0, 2.0, 3.0, 4.0, 1.0, 8.0, 5.0, 5.0, 4.0, 3.0, 10.0, 5.0, 15.0, 16.0, 7.0, 18.0, 17.0, 26.0, 21.0, 23.0, 28.0, 36.0, 159.0, 279.0, 67.0, 23.0, 24.0, 27.0, 21.0, 22.0, 13.0, 21.0, 9.0, 13.0, 10.0, 6.0, 8.0, 6.0, 10.0, 4.0, 7.0, 7.0, 9.0, 0.0, 3.0, 3.0, 1.0, 0.0, 0.0, 3.0, 0.0, 1.0], "bins": [-1.4992005825042725, -1.456620693206787, -1.4140408039093018, -1.3714609146118164, -1.3288811445236206, -1.2863012552261353, -1.24372136592865, -1.2011414766311646, -1.1585615873336792, -1.1159816980361938, -1.0734018087387085, -1.0308220386505127, -0.9882420897483826, -0.945662260055542, -0.9030823707580566, -0.8605024814605713, -0.8179226517677307, -0.7753427624702454, -0.7327629327774048, -0.6901830434799194, -0.6476031541824341, -0.6050232648849487, -0.5624434351921082, -0.5198635458946228, -0.47728368639945984, -0.4347038269042969, -0.3921239376068115, -0.34954407811164856, -0.3069642186164856, -0.26438432931900024, -0.22180446982383728, -0.17922458052635193, -0.13664472103118896, -0.09406484663486481, -0.051484979689121246, -0.008905112743377686, 0.03367476165294647, 0.07625463604927063, 0.1188344955444336, 0.16141438484191895, 0.2039942443370819, 0.24657411873340607, 0.2891539931297302, 0.3317338526248932, 0.37431371212005615, 0.4168936014175415, 0.45947346091270447, 0.5020533800125122, 0.5446332097053528, 0.5872130990028381, 0.6297929286956787, 0.6723728179931641, 0.7149527072906494, 0.7575325965881348, 0.8001124262809753, 0.8426923155784607, 0.8852721452713013, 0.9278520345687866, 0.9704318642616272, 1.0130116939544678, 1.0555915832519531, 1.0981714725494385, 1.1407513618469238, 1.1833312511444092, 1.2259111404418945]}, "gradients/encoder.encoder.pos_conv_embed.conv.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 2.0, 3.0, 4.0, 5.0, 9.0, 15.0, 15.0, 29.0, 41.0, 79.0, 88.0, 369.0, 137.0, 69.0, 57.0, 35.0, 28.0, 11.0, 9.0, 4.0, 2.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.39501953125, -0.38425445556640625, -0.3734893798828125, -0.36272430419921875, -0.351959228515625, -0.34119415283203125, -0.3304290771484375, -0.31966400146484375, -0.30889892578125, -0.29813385009765625, -0.2873687744140625, -0.27660369873046875, -0.265838623046875, -0.25507354736328125, -0.2443084716796875, -0.23354339599609375, -0.2227783203125, -0.21201324462890625, -0.2012481689453125, -0.19048309326171875, -0.179718017578125, -0.16895294189453125, -0.1581878662109375, -0.14742279052734375, -0.13665771484375, -0.12589263916015625, -0.1151275634765625, -0.10436248779296875, -0.093597412109375, -0.08283233642578125, -0.0720672607421875, -0.06130218505859375, -0.050537109375, -0.03977203369140625, -0.0290069580078125, -0.01824188232421875, -0.007476806640625, 0.00328826904296875, 0.0140533447265625, 0.02481842041015625, 0.03558349609375, 0.04634857177734375, 0.0571136474609375, 0.06787872314453125, 0.078643798828125, 0.08940887451171875, 0.1001739501953125, 0.11093902587890625, 0.1217041015625, 0.13246917724609375, 0.1432342529296875, 0.15399932861328125, 0.164764404296875, 0.17552947998046875, 0.1862945556640625, 0.19705963134765625, 0.20782470703125, 0.21858978271484375, 0.2293548583984375, 0.24011993408203125, 0.250885009765625, 0.26165008544921875, 0.2724151611328125, 0.28318023681640625, 0.2939453125]}, "gradients/encoder.encoder.pos_conv_embed.conv.weight_v": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 8.0, 5.0, 4.0, 14.0, 12.0, 10.0, 17.0, 10.0, 27.0, 33.0, 51.0, 108.0, 147.0, 291.0, 712.0, 2533.0, 17028.0, 8349241.0, 14556.0, 2401.0, 695.0, 290.0, 162.0, 66.0, 56.0, 36.0, 21.0, 12.0, 10.0, 8.0, 6.0, 4.0, 2.0, 3.0, 4.0, 1.0, 2.0, 4.0, 5.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.5725394487380981, -1.525553822517395, -1.478568196296692, -1.4315824508666992, -1.384596824645996, -1.337611198425293, -1.2906255722045898, -1.2436399459838867, -1.1966543197631836, -1.1496686935424805, -1.1026830673217773, -1.0556974411010742, -1.0087116956710815, -0.9617260694503784, -0.9147404432296753, -0.8677548170089722, -0.8207690715789795, -0.7737834453582764, -0.7267977595329285, -0.6798121333122253, -0.6328264474868774, -0.5858408212661743, -0.5388551950454712, -0.4918695390224457, -0.44488388299942017, -0.39789822697639465, -0.35091257095336914, -0.303926944732666, -0.2569412887096405, -0.209955632686615, -0.16297000646591187, -0.11598435044288635, -0.06899881362915039, -0.022013165056705475, 0.02497248351573944, 0.07195812463760376, 0.11894378066062927, 0.16592943668365479, 0.2129150629043579, 0.2599007189273834, 0.30688637495040894, 0.35387203097343445, 0.40085768699645996, 0.4478433132171631, 0.4948289692401886, 0.5418146252632141, 0.5888002514839172, 0.6357859373092651, 0.6827715635299683, 0.7297571897506714, 0.7767428755760193, 0.8237285017967224, 0.8707141876220703, 0.9176998138427734, 0.9646854400634766, 1.0116710662841797, 1.0586566925048828, 1.105642318725586, 1.152627944946289, 1.1996135711669922, 1.2465993165969849, 1.293584942817688, 1.3405705690383911, 1.3875561952590942, 1.434541940689087]}, "gradients/encoder.encoder.pos_conv_embed.conv.weight_g": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 5.0, 1.0, 1.0, 1.0, 0.0, 3.0, 2.0, 2.0, 5.0, 3.0, 5.0, 5.0, 3.0, 5.0, 3.0, 3.0, 10.0, 5.0, 3.0, 5.0, 3.0, 2.0, 3.0, 4.0, 4.0, 2.0, 3.0, 3.0, 3.0, 4.0, 3.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.8564257025718689, -0.8252968788146973, -0.7941679954528809, -0.7630391716957092, -0.7319103479385376, -0.7007814645767212, -0.6696526408195496, -0.6385238170623779, -0.6073949337005615, -0.5762661099433899, -0.5451372265815735, -0.5140084028244019, -0.48287954926490784, -0.4517506957054138, -0.4206218719482422, -0.38949301838874817, -0.35836416482925415, -0.32723531126976013, -0.2961064577102661, -0.2649776339530945, -0.23384878039360046, -0.20271992683410645, -0.17159108817577362, -0.1404622495174408, -0.10933339595794678, -0.07820454984903336, -0.047075703740119934, -0.015946857631206512, 0.01518198847770691, 0.04631084203720093, 0.07743968069553375, 0.10856851935386658, 0.13969731330871582, 0.17082616686820984, 0.20195500552654266, 0.2330838441848755, 0.2642126977443695, 0.2953415513038635, 0.32647037506103516, 0.3575992286205292, 0.3887280821800232, 0.4198569357395172, 0.45098578929901123, 0.48211461305618286, 0.5132434368133545, 0.5443723201751709, 0.5755011439323425, 0.6066299676895142, 0.6377588510513306, 0.6688876748085022, 0.7000165581703186, 0.7311453819274902, 0.7622742652893066, 0.7934030890464783, 0.8245319128036499, 0.8556607961654663, 0.8867896199226379, 0.9179184436798096, 0.949047327041626, 0.9801761507987976, 1.0113049745559692, 1.0424338579177856, 1.073562741279602, 1.104691505432129, 1.1358203887939453]}, "gradients/encoder.feature_projection.projection.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 4.0, 8.0, 10.0, 18.0, 39.0, 63.0, 136.0, 306.0, 909.0, 3582.0, 24361.0, 325045.0, 154536.0, 12048.0, 2067.0, 635.0, 240.0, 116.0, 48.0, 44.0, 24.0, 15.0, 8.0, 7.0, 6.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.03515625, -2.950469970703125, -2.86578369140625, -2.781097412109375, -2.6964111328125, -2.611724853515625, -2.52703857421875, -2.442352294921875, -2.357666015625, -2.272979736328125, -2.18829345703125, -2.103607177734375, -2.0189208984375, -1.934234619140625, -1.84954833984375, -1.764862060546875, -1.68017578125, -1.595489501953125, -1.51080322265625, -1.426116943359375, -1.3414306640625, -1.256744384765625, -1.17205810546875, -1.087371826171875, -1.002685546875, -0.917999267578125, -0.83331298828125, -0.748626708984375, -0.6639404296875, -0.579254150390625, -0.49456787109375, -0.409881591796875, -0.3251953125, -0.240509033203125, -0.15582275390625, -0.071136474609375, 0.0135498046875, 0.098236083984375, 0.18292236328125, 0.267608642578125, 0.352294921875, 0.436981201171875, 0.52166748046875, 0.606353759765625, 0.6910400390625, 0.775726318359375, 0.86041259765625, 0.945098876953125, 1.02978515625, 1.114471435546875, 1.19915771484375, 1.283843994140625, 1.3685302734375, 1.453216552734375, 1.53790283203125, 1.622589111328125, 1.707275390625, 1.791961669921875, 1.87664794921875, 1.961334228515625, 2.0460205078125, 2.130706787109375, 2.21539306640625, 2.300079345703125, 2.384765625]}, "gradients/encoder.feature_projection.projection.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 4.0, 1.0, 1.0, 4.0, 6.0, 6.0, 12.0, 5.0, 8.0, 8.0, 15.0, 24.0, 27.0, 27.0, 38.0, 42.0, 55.0, 59.0, 80.0, 64.0, 60.0, 66.0, 63.0, 63.0, 47.0, 53.0, 40.0, 32.0, 30.0, 18.0, 11.0, 11.0, 8.0, 8.0, 5.0, 2.0, 2.0, 2.0, 1.0, 2.0, 1.0, 3.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.11474609375, -0.11037445068359375, -0.1060028076171875, -0.10163116455078125, -0.097259521484375, -0.09288787841796875, -0.0885162353515625, -0.08414459228515625, -0.07977294921875, -0.07540130615234375, -0.0710296630859375, -0.06665802001953125, -0.062286376953125, -0.05791473388671875, -0.0535430908203125, -0.04917144775390625, -0.0447998046875, -0.04042816162109375, -0.0360565185546875, -0.03168487548828125, -0.027313232421875, -0.02294158935546875, -0.0185699462890625, -0.01419830322265625, -0.00982666015625, -0.00545501708984375, -0.0010833740234375, 0.00328826904296875, 0.007659912109375, 0.01203155517578125, 0.0164031982421875, 0.02077484130859375, 0.025146484375, 0.02951812744140625, 0.0338897705078125, 0.03826141357421875, 0.042633056640625, 0.04700469970703125, 0.0513763427734375, 0.05574798583984375, 0.06011962890625, 0.06449127197265625, 0.0688629150390625, 0.07323455810546875, 0.077606201171875, 0.08197784423828125, 0.0863494873046875, 0.09072113037109375, 0.0950927734375, 0.09946441650390625, 0.1038360595703125, 0.10820770263671875, 0.112579345703125, 0.11695098876953125, 0.1213226318359375, 0.12569427490234375, 0.13006591796875, 0.13443756103515625, 0.1388092041015625, 0.14318084716796875, 0.147552490234375, 0.15192413330078125, 0.1562957763671875, 0.16066741943359375, 0.1650390625]}, "gradients/encoder.feature_projection.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 3.0, 2.0, 4.0, 1.0, 2.0, 6.0, 3.0, 4.0, 6.0, 15.0, 25.0, 67.0, 109.0, 104.0, 62.0, 25.0, 12.0, 23.0, 12.0, 5.0, 2.0, 3.0, 6.0, 1.0, 1.0, 1.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.0841953754425049, -1.0307884216308594, -0.9773814678192139, -0.9239745140075684, -0.8705675601959229, -0.8171606063842773, -0.7637537121772766, -0.7103467583656311, -0.6569398045539856, -0.6035328507423401, -0.5501258969306946, -0.49671897292137146, -0.44331201910972595, -0.38990506529808044, -0.3364981412887573, -0.2830911874771118, -0.2296842336654663, -0.1762772798538208, -0.12287034094333649, -0.06946340203285217, -0.016056448221206665, 0.03735050559043884, 0.09075742959976196, 0.14416438341140747, 0.19757133722305298, 0.2509782910346985, 0.304385244846344, 0.3577921688556671, 0.4111991226673126, 0.46460607647895813, 0.5180130004882812, 0.5714199542999268, 0.6248269081115723, 0.6782338619232178, 0.7316408157348633, 0.7850477695465088, 0.8384547233581543, 0.8918616771697998, 0.9452685713768005, 0.998675525188446, 1.0520825386047363, 1.1054894924163818, 1.1588964462280273, 1.2123034000396729, 1.2657103538513184, 1.3191173076629639, 1.3725242614746094, 1.4259312152862549, 1.4793380498886108, 1.5327450037002563, 1.5861519575119019, 1.6395589113235474, 1.6929658651351929, 1.7463728189468384, 1.7997796535491943, 1.8531866073608398, 1.9065935611724854, 1.9600005149841309, 2.0134074687957764, 2.066814422607422, 2.1202213764190674, 2.173628330230713, 2.2270352840423584, 2.280442237854004, 2.3338491916656494]}, "gradients/encoder.feature_projection.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 0.0, 3.0, 2.0, 4.0, 4.0, 4.0, 1.0, 6.0, 3.0, 7.0, 2.0, 2.0, 1.0, 3.0, 4.0, 19.0, 44.0, 85.0, 122.0, 68.0, 44.0, 15.0, 7.0, 8.0, 8.0, 4.0, 4.0, 1.0, 4.0, 2.0, 2.0, 1.0, 4.0, 6.0, 3.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6400466561317444, -0.6164583563804626, -0.5928699970245361, -0.5692816972732544, -0.5456933379173279, -0.5221050381660461, -0.49851667881011963, -0.4749283790588379, -0.4513400197029114, -0.42775169014930725, -0.4041633605957031, -0.380575031042099, -0.3569867014884949, -0.33339837193489075, -0.3098100423812866, -0.2862217426300049, -0.26263341307640076, -0.23904508352279663, -0.2154567539691925, -0.19186842441558838, -0.16828009486198425, -0.14469176530838013, -0.1211034506559372, -0.09751512110233307, -0.07392679154872894, -0.05033846199512482, -0.02675013616681099, -0.003161810338497162, 0.020426519215106964, 0.04401484876871109, 0.06760317087173462, 0.09119150042533875, 0.11477982997894287, 0.138368159532547, 0.16195648908615112, 0.18554481863975525, 0.20913314819335938, 0.2327214777469635, 0.2563098073005676, 0.27989810705184937, 0.3034864664077759, 0.32707479596138, 0.35066312551498413, 0.37425145506858826, 0.3978397846221924, 0.4214281141757965, 0.44501644372940063, 0.4686047434806824, 0.4921930730342865, 0.5157814025878906, 0.5393697023391724, 0.5629580616950989, 0.5865463614463806, 0.6101347208023071, 0.6337230205535889, 0.6573113799095154, 0.6808996796607971, 0.7044879794120789, 0.7280763387680054, 0.7516646385192871, 0.7752529978752136, 0.7988412976264954, 0.8224296569824219, 0.8460179567337036, 0.8696063160896301]}}']}, 'wandb-history.jsonl': {'offset': 349, 'content': ['{"gradients/decoder.transformer.ln_f.weight": {"_type": "histogram", "values": [10.0, 39.0, 264.0, 483.0, 192.0, 27.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-15.277154922485352, -10.82435417175293, -6.371552467346191, -1.9187507629394531, 2.5340499877929688, 6.986850738525391, 11.439653396606445, 15.892454147338867, 20.34525489807129, 24.79805564880371, 29.250858306884766, 33.70365905761719, 38.15645980834961, 42.60926055908203, 47.06206512451172, 51.514862060546875, 55.96766662597656, 60.420467376708984, 64.8732681274414, 69.3260726928711, 73.77886962890625, 78.23167419433594, 82.68447875976562, 87.13727569580078, 91.59007263183594, 96.04287719726562, 100.49567413330078, 104.94847869873047, 109.40127563476562, 113.85408020019531, 118.306884765625, 122.75968170166016, 127.21247863769531, 131.665283203125, 136.1180877685547, 140.5708770751953, 145.023681640625, 149.4764862060547, 153.92929077148438, 158.382080078125, 162.8348846435547, 167.28768920898438, 171.74049377441406, 176.1932830810547, 180.64608764648438, 185.09889221191406, 189.55169677734375, 194.00448608398438, 198.45730590820312, 202.9101104736328, 207.3629150390625, 211.81570434570312, 216.2685089111328, 220.7213134765625, 225.1741180419922, 229.62692260742188, 234.0797119140625, 238.5325164794922, 242.98532104492188, 247.4381103515625, 251.8909149169922, 256.3437194824219, 260.7965087890625, 265.24932861328125, 269.7021179199219]}, "gradients/decoder.transformer.ln_f.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 0.0, 1.0, 0.0, 5.0, 3.0, 2.0, 1.0, 4.0, 7.0, 7.0, 7.0, 7.0, 13.0, 10.0, 14.0, 20.0, 12.0, 21.0, 25.0, 30.0, 21.0, 37.0, 38.0, 37.0, 29.0, 35.0, 39.0, 57.0, 40.0, 45.0, 35.0, 48.0, 36.0, 32.0, 27.0, 24.0, 27.0, 25.0, 23.0, 30.0, 18.0, 19.0, 17.0, 13.0, 19.0, 9.0, 9.0, 6.0, 9.0, 7.0, 3.0, 3.0, 3.0, 3.0, 1.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0], "bins": [-44.952396392822266, -43.53903579711914, -42.125675201416016, -40.71231460571289, -39.298954010009766, -37.885597229003906, -36.47223663330078, -35.058876037597656, -33.64551544189453, -32.232154846191406, -30.81879425048828, -29.405433654785156, -27.992074966430664, -26.57871437072754, -25.165353775024414, -23.751995086669922, -22.338632583618164, -20.92527198791504, -19.511911392211914, -18.098552703857422, -16.685192108154297, -15.271831512451172, -13.858470916748047, -12.445111274719238, -11.031750679016113, -9.618390083312988, -8.20503044128418, -6.791669845581055, -5.378309726715088, -3.964949607849121, -2.551589012145996, -1.1382293701171875, 0.2751312255859375, 1.6884914636611938, 3.10185170173645, 4.515212059020996, 5.928572177886963, 7.34193229675293, 8.755292892456055, 10.168652534484863, 11.582013130187988, 12.995373725891113, 14.408733367919922, 15.822093963623047, 17.235454559326172, 18.648815155029297, 20.062175750732422, 21.475534439086914, 22.88889503479004, 24.302255630493164, 25.71561622619629, 27.12897491455078, 28.542335510253906, 29.95569610595703, 31.369056701660156, 32.78241729736328, 34.195777893066406, 35.60913848876953, 37.022499084472656, 38.43585968017578, 39.849220275878906, 41.26258087158203, 42.675941467285156, 44.089298248291016, 45.50265884399414]}, "gradients/decoder.transformer.h.23.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 2.0, 5.0, 3.0, 3.0, 2.0, 8.0, 8.0, 6.0, 10.0, 13.0, 4.0, 12.0, 18.0, 22.0, 19.0, 19.0, 23.0, 31.0, 27.0, 28.0, 19.0, 42.0, 33.0, 42.0, 38.0, 47.0, 59.0, 50.0, 41.0, 34.0, 34.0, 32.0, 32.0, 27.0, 30.0, 23.0, 23.0, 18.0, 20.0, 16.0, 13.0, 11.0, 12.0, 13.0, 6.0, 9.0, 6.0, 6.0, 5.0, 4.0, 3.0, 3.0, 3.0, 2.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.9609375, -3.8385009765625, -3.716064453125, -3.5936279296875, -3.47119140625, -3.3487548828125, -3.226318359375, -3.1038818359375, -2.9814453125, -2.8590087890625, -2.736572265625, -2.6141357421875, -2.49169921875, -2.3692626953125, -2.246826171875, -2.1243896484375, -2.001953125, -1.8795166015625, -1.757080078125, -1.6346435546875, -1.51220703125, -1.3897705078125, -1.267333984375, -1.1448974609375, -1.0224609375, -0.9000244140625, -0.777587890625, -0.6551513671875, -0.53271484375, -0.4102783203125, -0.287841796875, -0.1654052734375, -0.04296875, 0.0794677734375, 0.201904296875, 0.3243408203125, 0.44677734375, 0.5692138671875, 0.691650390625, 0.8140869140625, 0.9365234375, 1.0589599609375, 1.181396484375, 1.3038330078125, 1.42626953125, 1.5487060546875, 1.671142578125, 1.7935791015625, 1.916015625, 2.0384521484375, 2.160888671875, 2.2833251953125, 2.40576171875, 2.5281982421875, 2.650634765625, 2.7730712890625, 2.8955078125, 3.0179443359375, 3.140380859375, 3.2628173828125, 3.38525390625, 3.5076904296875, 3.630126953125, 3.7525634765625, 3.875]}, "gradients/decoder.transformer.h.23.mlp.c_proj.weight": {"_type": "histogram", "values": [4.0, 3.0, 4.0, 3.0, 6.0, 7.0, 21.0, 13.0, 11.0, 13.0, 21.0, 20.0, 36.0, 42.0, 55.0, 61.0, 91.0, 96.0, 136.0, 183.0, 262.0, 325.0, 384.0, 550.0, 830.0, 1313.0, 2078.0, 3574.0, 7154.0, 18124.0, 77818.0, 682302.0, 2368741.0, 881023.0, 108809.0, 21833.0, 8051.0, 3742.0, 2140.0, 1336.0, 819.0, 557.0, 439.0, 284.0, 250.0, 159.0, 111.0, 112.0, 59.0, 58.0, 50.0, 42.0, 32.0, 34.0, 31.0, 10.0, 12.0, 9.0, 7.0, 2.0, 4.0, 1.0, 4.0, 3.0], "bins": [-10.2890625, -9.97314453125, -9.6572265625, -9.34130859375, -9.025390625, -8.70947265625, -8.3935546875, -8.07763671875, -7.76171875, -7.44580078125, -7.1298828125, -6.81396484375, -6.498046875, -6.18212890625, -5.8662109375, -5.55029296875, -5.234375, -4.91845703125, -4.6025390625, -4.28662109375, -3.970703125, -3.65478515625, -3.3388671875, -3.02294921875, -2.70703125, -2.39111328125, -2.0751953125, -1.75927734375, -1.443359375, -1.12744140625, -0.8115234375, -0.49560546875, -0.1796875, 0.13623046875, 0.4521484375, 0.76806640625, 1.083984375, 1.39990234375, 1.7158203125, 2.03173828125, 2.34765625, 2.66357421875, 2.9794921875, 3.29541015625, 3.611328125, 3.92724609375, 4.2431640625, 4.55908203125, 4.875, 5.19091796875, 5.5068359375, 5.82275390625, 6.138671875, 6.45458984375, 6.7705078125, 7.08642578125, 7.40234375, 7.71826171875, 8.0341796875, 8.35009765625, 8.666015625, 8.98193359375, 9.2978515625, 9.61376953125, 9.9296875]}, "gradients/decoder.transformer.h.23.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 3.0, 2.0, 5.0, 0.0, 6.0, 3.0, 11.0, 10.0, 10.0, 11.0, 10.0, 18.0, 31.0, 21.0, 52.0, 76.0, 107.0, 174.0, 253.0, 376.0, 517.0, 614.0, 504.0, 370.0, 249.0, 178.0, 128.0, 96.0, 66.0, 47.0, 29.0, 23.0, 20.0, 18.0, 5.0, 7.0, 11.0, 9.0, 5.0, 3.0, 1.0, 1.0, 2.0, 2.0, 2.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-16.296875, -15.810791015625, -15.32470703125, -14.838623046875, -14.3525390625, -13.866455078125, -13.38037109375, -12.894287109375, -12.408203125, -11.922119140625, -11.43603515625, -10.949951171875, -10.4638671875, -9.977783203125, -9.49169921875, -9.005615234375, -8.51953125, -8.033447265625, -7.54736328125, -7.061279296875, -6.5751953125, -6.089111328125, -5.60302734375, -5.116943359375, -4.630859375, -4.144775390625, -3.65869140625, -3.172607421875, -2.6865234375, -2.200439453125, -1.71435546875, -1.228271484375, -0.7421875, -0.256103515625, 0.22998046875, 0.716064453125, 1.2021484375, 1.688232421875, 2.17431640625, 2.660400390625, 3.146484375, 3.632568359375, 4.11865234375, 4.604736328125, 5.0908203125, 5.576904296875, 6.06298828125, 6.549072265625, 7.03515625, 7.521240234375, 8.00732421875, 8.493408203125, 8.9794921875, 9.465576171875, 9.95166015625, 10.437744140625, 10.923828125, 11.409912109375, 11.89599609375, 12.382080078125, 12.8681640625, 13.354248046875, 13.84033203125, 14.326416015625, 14.8125]}, "gradients/decoder.transformer.h.23.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 2.0, 3.0, 2.0, 1.0, 10.0, 5.0, 11.0, 9.0, 14.0, 15.0, 27.0, 25.0, 44.0, 59.0, 76.0, 123.0, 164.0, 266.0, 480.0, 860.0, 2306.0, 25238.0, 4034410.0, 124196.0, 3194.0, 1177.0, 622.0, 322.0, 188.0, 118.0, 82.0, 65.0, 60.0, 19.0, 21.0, 16.0, 16.0, 13.0, 10.0, 8.0, 4.0, 7.0, 3.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-58.46875, -56.62109375, -54.7734375, -52.92578125, -51.078125, -49.23046875, -47.3828125, -45.53515625, -43.6875, -41.83984375, -39.9921875, -38.14453125, -36.296875, -34.44921875, -32.6015625, -30.75390625, -28.90625, -27.05859375, -25.2109375, -23.36328125, -21.515625, -19.66796875, -17.8203125, -15.97265625, -14.125, -12.27734375, -10.4296875, -8.58203125, -6.734375, -4.88671875, -3.0390625, -1.19140625, 0.65625, 2.50390625, 4.3515625, 6.19921875, 8.046875, 9.89453125, 11.7421875, 13.58984375, 15.4375, 17.28515625, 19.1328125, 20.98046875, 22.828125, 24.67578125, 26.5234375, 28.37109375, 30.21875, 32.06640625, 33.9140625, 35.76171875, 37.609375, 39.45703125, 41.3046875, 43.15234375, 45.0, 46.84765625, 48.6953125, 50.54296875, 52.390625, 54.23828125, 56.0859375, 57.93359375, 59.78125]}, "gradients/decoder.transformer.h.23.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 5.0, 109.0, 653.0, 234.0, 13.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-303.5772399902344, -297.3272399902344, -291.0772705078125, -284.8272705078125, -278.5772705078125, -272.3272705078125, -266.0773010253906, -259.8273010253906, -253.57730102539062, -247.3273162841797, -241.0773162841797, -234.82733154296875, -228.57733154296875, -222.3273468017578, -216.07736206054688, -209.82736206054688, -203.57737731933594, -197.327392578125, -191.077392578125, -184.82740783691406, -178.57740783691406, -172.32742309570312, -166.07742309570312, -159.8274383544922, -153.57745361328125, -147.3274688720703, -141.0774688720703, -134.82748413085938, -128.57748413085938, -122.32749938964844, -116.07750701904297, -109.8275146484375, -103.57750701904297, -97.3275146484375, -91.07752227783203, -84.82752990722656, -78.57754516601562, -72.32754516601562, -66.07756042480469, -59.82756805419922, -53.57757568359375, -47.32758331298828, -41.07759094238281, -34.82760238647461, -28.57761001586914, -22.327617645263672, -16.07762908935547, -9.82763671875, -3.5776443481445312, 2.672347068786621, 8.922338485717773, 15.17232894897461, 21.422321319580078, 27.672313690185547, 33.92230224609375, 40.17229461669922, 46.42228698730469, 52.672279357910156, 58.922271728515625, 65.17225646972656, 71.42225646972656, 77.6722412109375, 83.92223358154297, 90.17222595214844, 96.4222183227539]}, "gradients/decoder.transformer.h.23.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 3.0, 1.0, 1.0, 5.0, 10.0, 6.0, 6.0, 7.0, 7.0, 11.0, 15.0, 11.0, 17.0, 23.0, 30.0, 19.0, 23.0, 22.0, 30.0, 34.0, 31.0, 41.0, 32.0, 31.0, 39.0, 32.0, 33.0, 55.0, 41.0, 44.0, 38.0, 38.0, 45.0, 33.0, 27.0, 22.0, 24.0, 16.0, 22.0, 17.0, 8.0, 11.0, 13.0, 9.0, 7.0, 4.0, 6.0, 4.0, 3.0, 2.0, 4.0, 1.0, 2.0, 2.0, 1.0], "bins": [-53.098175048828125, -51.56222915649414, -50.02627944946289, -48.490333557128906, -46.954383850097656, -45.41843795776367, -43.88248825073242, -42.34654235839844, -40.81059265136719, -39.2746467590332, -37.73869705200195, -36.20275115966797, -34.66680145263672, -33.130855560302734, -31.594905853271484, -30.0589599609375, -28.523012161254883, -26.987064361572266, -25.45111656188965, -23.91516876220703, -22.379220962524414, -20.843273162841797, -19.307327270507812, -17.771377563476562, -16.235431671142578, -14.699483871459961, -13.163536071777344, -11.627588272094727, -10.09164047241211, -8.555692672729492, -7.019745826721191, -5.483798027038574, -3.9478492736816406, -2.4119014739990234, -0.8759539127349854, 0.6599936485290527, 2.19594144821167, 3.731889247894287, 5.267836570739746, 6.803784370422363, 8.33973217010498, 9.875679969787598, 11.411627769470215, 12.947574615478516, 14.483522415161133, 16.01947021484375, 17.555418014526367, 19.091365814208984, 20.6273136138916, 22.16326141357422, 23.699209213256836, 25.235157012939453, 26.77110481262207, 28.307052612304688, 29.842998504638672, 31.378948211669922, 32.914894104003906, 34.45083999633789, 35.98678970336914, 37.522735595703125, 39.058685302734375, 40.59463119506836, 42.13058090209961, 43.666526794433594, 45.202476501464844]}, "gradients/decoder.transformer.h.23.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 5.0, 2.0, 6.0, 3.0, 11.0, 3.0, 2.0, 8.0, 10.0, 12.0, 7.0, 23.0, 21.0, 18.0, 20.0, 33.0, 25.0, 32.0, 37.0, 34.0, 44.0, 38.0, 43.0, 43.0, 43.0, 48.0, 43.0, 50.0, 34.0, 35.0, 34.0, 25.0, 23.0, 24.0, 26.0, 26.0, 16.0, 18.0, 19.0, 12.0, 15.0, 5.0, 7.0, 7.0, 10.0, 4.0, 6.0, 2.0, 3.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-4.42578125, -4.2813720703125, -4.136962890625, -3.9925537109375, -3.84814453125, -3.7037353515625, -3.559326171875, -3.4149169921875, -3.2705078125, -3.1260986328125, -2.981689453125, -2.8372802734375, -2.69287109375, -2.5484619140625, -2.404052734375, -2.2596435546875, -2.115234375, -1.9708251953125, -1.826416015625, -1.6820068359375, -1.53759765625, -1.3931884765625, -1.248779296875, -1.1043701171875, -0.9599609375, -0.8155517578125, -0.671142578125, -0.5267333984375, -0.38232421875, -0.2379150390625, -0.093505859375, 0.0509033203125, 0.1953125, 0.3397216796875, 0.484130859375, 0.6285400390625, 0.77294921875, 0.9173583984375, 1.061767578125, 1.2061767578125, 1.3505859375, 1.4949951171875, 1.639404296875, 1.7838134765625, 1.92822265625, 2.0726318359375, 2.217041015625, 2.3614501953125, 2.505859375, 2.6502685546875, 2.794677734375, 2.9390869140625, 3.08349609375, 3.2279052734375, 3.372314453125, 3.5167236328125, 3.6611328125, 3.8055419921875, 3.949951171875, 4.0943603515625, 4.23876953125, 4.3831787109375, 4.527587890625, 4.6719970703125, 4.81640625]}, "gradients/decoder.transformer.h.23.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 6.0, 5.0, 12.0, 14.0, 19.0, 31.0, 42.0, 60.0, 96.0, 153.0, 221.0, 290.0, 466.0, 626.0, 953.0, 1405.0, 2074.0, 3165.0, 4830.0, 7410.0, 11393.0, 17880.0, 28417.0, 47238.0, 84857.0, 181850.0, 344518.0, 134386.0, 68072.0, 39717.0, 24475.0, 15291.0, 9693.0, 6383.0, 4112.0, 2779.0, 1826.0, 1147.0, 874.0, 594.0, 340.0, 269.0, 185.0, 137.0, 83.0, 56.0, 39.0, 18.0, 22.0, 17.0, 13.0, 3.0, 2.0, 2.0, 1.0, 2.0, 2.0], "bins": [-1.2509765625, -1.21343994140625, -1.1759033203125, -1.13836669921875, -1.100830078125, -1.06329345703125, -1.0257568359375, -0.98822021484375, -0.95068359375, -0.91314697265625, -0.8756103515625, -0.83807373046875, -0.800537109375, -0.76300048828125, -0.7254638671875, -0.68792724609375, -0.650390625, -0.61285400390625, -0.5753173828125, -0.53778076171875, -0.500244140625, -0.46270751953125, -0.4251708984375, -0.38763427734375, -0.35009765625, -0.31256103515625, -0.2750244140625, -0.23748779296875, -0.199951171875, -0.16241455078125, -0.1248779296875, -0.08734130859375, -0.0498046875, -0.01226806640625, 0.0252685546875, 0.06280517578125, 0.100341796875, 0.13787841796875, 0.1754150390625, 0.21295166015625, 0.25048828125, 0.28802490234375, 0.3255615234375, 0.36309814453125, 0.400634765625, 0.43817138671875, 0.4757080078125, 0.51324462890625, 0.55078125, 0.58831787109375, 0.6258544921875, 0.66339111328125, 0.700927734375, 0.73846435546875, 0.7760009765625, 0.81353759765625, 0.85107421875, 0.88861083984375, 0.9261474609375, 0.96368408203125, 1.001220703125, 1.03875732421875, 1.0762939453125, 1.11383056640625, 1.1513671875]}, "gradients/decoder.transformer.h.23.crossattention.c_attn.bias": {"_type": "histogram", "values": [3.0, 2.0, 3.0, 1.0, 0.0, 3.0, 2.0, 3.0, 5.0, 7.0, 12.0, 10.0, 10.0, 13.0, 16.0, 12.0, 14.0, 23.0, 23.0, 25.0, 36.0, 40.0, 27.0, 35.0, 32.0, 42.0, 27.0, 41.0, 1070.0, 40.0, 33.0, 39.0, 32.0, 50.0, 37.0, 31.0, 35.0, 28.0, 28.0, 25.0, 19.0, 18.0, 18.0, 10.0, 12.0, 10.0, 12.0, 8.0, 5.0, 6.0, 2.0, 4.0, 3.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.599609375, -2.50946044921875, -2.4193115234375, -2.32916259765625, -2.239013671875, -2.14886474609375, -2.0587158203125, -1.96856689453125, -1.87841796875, -1.78826904296875, -1.6981201171875, -1.60797119140625, -1.517822265625, -1.42767333984375, -1.3375244140625, -1.24737548828125, -1.1572265625, -1.06707763671875, -0.9769287109375, -0.88677978515625, -0.796630859375, -0.70648193359375, -0.6163330078125, -0.52618408203125, -0.43603515625, -0.34588623046875, -0.2557373046875, -0.16558837890625, -0.075439453125, 0.01470947265625, 0.1048583984375, 0.19500732421875, 0.28515625, 0.37530517578125, 0.4654541015625, 0.55560302734375, 0.645751953125, 0.73590087890625, 0.8260498046875, 0.91619873046875, 1.00634765625, 1.09649658203125, 1.1866455078125, 1.27679443359375, 1.366943359375, 1.45709228515625, 1.5472412109375, 1.63739013671875, 1.7275390625, 1.81768798828125, 1.9078369140625, 1.99798583984375, 2.088134765625, 2.17828369140625, 2.2684326171875, 2.35858154296875, 2.44873046875, 2.53887939453125, 2.6290283203125, 2.71917724609375, 2.809326171875, 2.89947509765625, 2.9896240234375, 3.07977294921875, 3.169921875]}, "gradients/decoder.transformer.h.23.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 2.0, 5.0, 4.0, 7.0, 9.0, 14.0, 18.0, 24.0, 35.0, 54.0, 105.0, 165.0, 220.0, 401.0, 680.0, 1249.0, 2063.0, 3749.0, 6558.0, 12055.0, 22558.0, 45082.0, 92285.0, 221497.0, 1435320.0, 127789.0, 60298.0, 29879.0, 15506.0, 8465.0, 4709.0, 2647.0, 1512.0, 857.0, 522.0, 298.0, 186.0, 104.0, 64.0, 39.0, 25.0, 26.0, 23.0, 10.0, 7.0, 4.0, 3.0, 3.0, 1.0, 1.0, 2.0, 3.0, 3.0], "bins": [-1.7314453125, -1.6822662353515625, -1.633087158203125, -1.5839080810546875, -1.53472900390625, -1.4855499267578125, -1.436370849609375, -1.3871917724609375, -1.3380126953125, -1.2888336181640625, -1.239654541015625, -1.1904754638671875, -1.14129638671875, -1.0921173095703125, -1.042938232421875, -0.9937591552734375, -0.944580078125, -0.8954010009765625, -0.846221923828125, -0.7970428466796875, -0.74786376953125, -0.6986846923828125, -0.649505615234375, -0.6003265380859375, -0.5511474609375, -0.5019683837890625, -0.452789306640625, -0.4036102294921875, -0.35443115234375, -0.3052520751953125, -0.256072998046875, -0.2068939208984375, -0.15771484375, -0.1085357666015625, -0.059356689453125, -0.0101776123046875, 0.03900146484375, 0.0881805419921875, 0.137359619140625, 0.1865386962890625, 0.2357177734375, 0.2848968505859375, 0.334075927734375, 0.3832550048828125, 0.43243408203125, 0.4816131591796875, 0.530792236328125, 0.5799713134765625, 0.629150390625, 0.6783294677734375, 0.727508544921875, 0.7766876220703125, 0.82586669921875, 0.8750457763671875, 0.924224853515625, 0.9734039306640625, 1.0225830078125, 1.0717620849609375, 1.120941162109375, 1.1701202392578125, 1.21929931640625, 1.2684783935546875, 1.317657470703125, 1.3668365478515625, 1.416015625]}, "gradients/decoder.transformer.h.23.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 2.0, 2.0, 3.0, 3.0, 5.0, 3.0, 11.0, 11.0, 16.0, 24.0, 27.0, 34.0, 45.0, 79.0, 94.0, 91.0, 104.0, 103.0, 89.0, 65.0, 43.0, 48.0, 24.0, 28.0, 12.0, 10.0, 7.0, 1.0, 9.0, 4.0, 4.0, 1.0, 3.0, 0.0, 4.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0010557174682617188, -0.0010241195559501648, -0.0009925216436386108, -0.0009609237313270569, -0.0009293258190155029, -0.000897727906703949, -0.000866129994392395, -0.0008345320820808411, -0.0008029341697692871, -0.0007713362574577332, -0.0007397383451461792, -0.0007081404328346252, -0.0006765425205230713, -0.0006449446082115173, -0.0006133466958999634, -0.0005817487835884094, -0.0005501508712768555, -0.0005185529589653015, -0.00048695504665374756, -0.0004553571343421936, -0.00042375922203063965, -0.0003921613097190857, -0.00036056339740753174, -0.0003289654850959778, -0.00029736757278442383, -0.0002657696604728699, -0.00023417174816131592, -0.00020257383584976196, -0.000170975923538208, -0.00013937801122665405, -0.0001077800989151001, -7.618218660354614e-05, -4.458427429199219e-05, -1.2986361980438232e-05, 1.8611550331115723e-05, 5.020946264266968e-05, 8.180737495422363e-05, 0.00011340528726577759, 0.00014500319957733154, 0.0001766011118888855, 0.00020819902420043945, 0.0002397969365119934, 0.00027139484882354736, 0.0003029927611351013, 0.0003345906734466553, 0.00036618858575820923, 0.0003977864980697632, 0.00042938441038131714, 0.0004609823226928711, 0.000492580235004425, 0.000524178147315979, 0.000555776059627533, 0.0005873739719390869, 0.0006189718842506409, 0.0006505697965621948, 0.0006821677088737488, 0.0007137656211853027, 0.0007453635334968567, 0.0007769614458084106, 0.0008085593581199646, 0.0008401572704315186, 0.0008717551827430725, 0.0009033530950546265, 0.0009349510073661804, 0.0009665489196777344]}, "gradients/decoder.transformer.h.23.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 2.0, 5.0, 1.0, 3.0, 5.0, 9.0, 12.0, 10.0, 9.0, 22.0, 25.0, 36.0, 61.0, 98.0, 159.0, 215.0, 353.0, 701.0, 5142.0, 1039413.0, 996.0, 465.0, 295.0, 185.0, 109.0, 63.0, 53.0, 30.0, 22.0, 18.0, 14.0, 9.0, 10.0, 2.0, 5.0, 1.0, 1.0, 1.0, 3.0, 0.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0203399658203125, -0.01968860626220703, -0.019037246704101562, -0.018385887145996094, -0.017734527587890625, -0.017083168029785156, -0.016431808471679688, -0.01578044891357422, -0.01512908935546875, -0.014477729797363281, -0.013826370239257812, -0.013175010681152344, -0.012523651123046875, -0.011872291564941406, -0.011220932006835938, -0.010569572448730469, -0.009918212890625, -0.009266853332519531, -0.008615493774414062, -0.007964134216308594, -0.007312774658203125, -0.006661415100097656, -0.0060100555419921875, -0.005358695983886719, -0.00470733642578125, -0.004055976867675781, -0.0034046173095703125, -0.0027532577514648438, -0.002101898193359375, -0.0014505386352539062, -0.0007991790771484375, -0.00014781951904296875, 0.0005035400390625, 0.0011548995971679688, 0.0018062591552734375, 0.0024576187133789062, 0.003108978271484375, 0.0037603378295898438, 0.0044116973876953125, 0.005063056945800781, 0.00571441650390625, 0.006365776062011719, 0.0070171356201171875, 0.007668495178222656, 0.008319854736328125, 0.008971214294433594, 0.009622573852539062, 0.010273933410644531, 0.01092529296875, 0.011576652526855469, 0.012228012084960938, 0.012879371643066406, 0.013530731201171875, 0.014182090759277344, 0.014833450317382812, 0.015484809875488281, 0.01613616943359375, 0.01678752899169922, 0.017438888549804688, 0.018090248107910156, 0.018741607666015625, 0.019392967224121094, 0.020044326782226562, 0.02069568634033203, 0.0213470458984375]}, "gradients/decoder.transformer.h.23.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 8.0, 859.0, 150.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.003812056966125965, -0.0037229156587272882, -0.0036337743513286114, -0.003544632811099291, -0.003455491503700614, -0.003366350196301937, -0.0032772086560726166, -0.0031880673486739397, -0.003098926041275263, -0.003009784733876586, -0.002920643426477909, -0.0028315018862485886, -0.0027423605788499117, -0.002653219271451235, -0.0025640777312219143, -0.0024749364238232374, -0.0023857951164245605, -0.0022966538090258837, -0.002207512501627207, -0.0021183709613978863, -0.0020292296539992094, -0.0019400883466005325, -0.0018509469227865338, -0.0017618054989725351, -0.0016726641915738583, -0.0015835228841751814, -0.0014943814603611827, -0.001405240036547184, -0.0013160987291485071, -0.0012269574217498302, -0.0011378159979358315, -0.0010486745741218328, -0.0009595331503078341, -0.0008703917847014964, -0.0007812504190951586, -0.0006921090534888208, -0.000602967687882483, -0.0005138263222761452, -0.00042468495666980743, -0.00033554359106346965, -0.00024640222545713186, -0.00015726085985079408, -6.811949424445629e-05, 2.1021871361881495e-05, 0.00011016323696821928, 0.00019930460257455707, 0.00028844596818089485, 0.00037758733378723264, 0.0004667286993935704, 0.0005558700649999082, 0.000645011430606246, 0.0007341527962125838, 0.0008232941618189216, 0.0009124355274252594, 0.0010015768930315971, 0.001090718200430274, 0.0011798596242442727, 0.0012690010480582714, 0.0013581423554569483, 0.0014472836628556252, 0.0015364250866696239, 0.0016255665104836226, 0.0017147078178822994, 0.0018038491252809763, 0.001892990549094975]}, "gradients/decoder.transformer.h.23.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 1.0, 3.0, 2.0, 3.0, 4.0, 6.0, 5.0, 9.0, 11.0, 14.0, 12.0, 22.0, 23.0, 29.0, 22.0, 30.0, 29.0, 45.0, 41.0, 38.0, 40.0, 40.0, 47.0, 44.0, 35.0, 48.0, 34.0, 34.0, 41.0, 49.0, 42.0, 36.0, 28.0, 37.0, 19.0, 19.0, 14.0, 8.0, 10.0, 7.0, 11.0, 10.0, 4.0, 2.0, 3.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 2.0], "bins": [-0.0004277825355529785, -0.00041530653834342957, -0.0004028305411338806, -0.00039035454392433167, -0.0003778785467147827, -0.00036540254950523376, -0.0003529265522956848, -0.00034045055508613586, -0.0003279745578765869, -0.00031549856066703796, -0.000303022563457489, -0.00029054656624794006, -0.0002780705690383911, -0.00026559457182884216, -0.0002531185746192932, -0.00024064257740974426, -0.0002281665802001953, -0.00021569058299064636, -0.0002032145857810974, -0.00019073858857154846, -0.0001782625913619995, -0.00016578659415245056, -0.0001533105969429016, -0.00014083459973335266, -0.0001283586025238037, -0.00011588260531425476, -0.00010340660810470581, -9.093061089515686e-05, -7.845461368560791e-05, -6.597861647605896e-05, -5.350261926651001e-05, -4.102662205696106e-05, -2.855062484741211e-05, -1.607462763786316e-05, -3.598630428314209e-06, 8.877366781234741e-06, 2.135336399078369e-05, 3.382936120033264e-05, 4.630535840988159e-05, 5.878135561943054e-05, 7.125735282897949e-05, 8.373335003852844e-05, 9.620934724807739e-05, 0.00010868534445762634, 0.00012116134166717529, 0.00013363733887672424, 0.0001461133360862732, 0.00015858933329582214, 0.0001710653305053711, 0.00018354132771492004, 0.000196017324924469, 0.00020849332213401794, 0.0002209693193435669, 0.00023344531655311584, 0.0002459213137626648, 0.00025839731097221375, 0.0002708733081817627, 0.00028334930539131165, 0.0002958253026008606, 0.00030830129981040955, 0.0003207772970199585, 0.00033325329422950745, 0.0003457292914390564, 0.00035820528864860535, 0.0003706812858581543]}, "gradients/decoder.transformer.h.23.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 5.0, 2.0, 6.0, 3.0, 11.0, 3.0, 2.0, 8.0, 10.0, 12.0, 7.0, 23.0, 21.0, 18.0, 20.0, 33.0, 25.0, 32.0, 37.0, 34.0, 44.0, 38.0, 43.0, 43.0, 43.0, 48.0, 43.0, 50.0, 34.0, 35.0, 34.0, 25.0, 23.0, 24.0, 26.0, 26.0, 16.0, 18.0, 19.0, 12.0, 15.0, 5.0, 7.0, 7.0, 10.0, 4.0, 6.0, 2.0, 3.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-4.42578125, -4.2813720703125, -4.136962890625, -3.9925537109375, -3.84814453125, -3.7037353515625, -3.559326171875, -3.4149169921875, -3.2705078125, -3.1260986328125, -2.981689453125, -2.8372802734375, -2.69287109375, -2.5484619140625, -2.404052734375, -2.2596435546875, -2.115234375, -1.9708251953125, -1.826416015625, -1.6820068359375, -1.53759765625, -1.3931884765625, -1.248779296875, -1.1043701171875, -0.9599609375, -0.8155517578125, -0.671142578125, -0.5267333984375, -0.38232421875, -0.2379150390625, -0.093505859375, 0.0509033203125, 0.1953125, 0.3397216796875, 0.484130859375, 0.6285400390625, 0.77294921875, 0.9173583984375, 1.061767578125, 1.2061767578125, 1.3505859375, 1.4949951171875, 1.639404296875, 1.7838134765625, 1.92822265625, 2.0726318359375, 2.217041015625, 2.3614501953125, 2.505859375, 2.6502685546875, 2.794677734375, 2.9390869140625, 3.08349609375, 3.2279052734375, 3.372314453125, 3.5167236328125, 3.6611328125, 3.8055419921875, 3.949951171875, 4.0943603515625, 4.23876953125, 4.3831787109375, 4.527587890625, 4.6719970703125, 4.81640625]}, "gradients/decoder.transformer.h.23.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 3.0, 1.0, 5.0, 2.0, 4.0, 7.0, 13.0, 8.0, 18.0, 20.0, 24.0, 35.0, 48.0, 66.0, 90.0, 130.0, 175.0, 256.0, 350.0, 467.0, 711.0, 1017.0, 1430.0, 2131.0, 3305.0, 5436.0, 10812.0, 35930.0, 883127.0, 71501.0, 13456.0, 6389.0, 3770.0, 2460.0, 1645.0, 1061.0, 761.0, 525.0, 385.0, 253.0, 224.0, 130.0, 113.0, 69.0, 57.0, 51.0, 21.0, 19.0, 17.0, 12.0, 9.0, 5.0, 5.0, 6.0, 0.0, 2.0, 4.0, 1.0, 0.0, 1.0, 1.0], "bins": [-36.71875, -35.55810546875, -34.3974609375, -33.23681640625, -32.076171875, -30.91552734375, -29.7548828125, -28.59423828125, -27.43359375, -26.27294921875, -25.1123046875, -23.95166015625, -22.791015625, -21.63037109375, -20.4697265625, -19.30908203125, -18.1484375, -16.98779296875, -15.8271484375, -14.66650390625, -13.505859375, -12.34521484375, -11.1845703125, -10.02392578125, -8.86328125, -7.70263671875, -6.5419921875, -5.38134765625, -4.220703125, -3.06005859375, -1.8994140625, -0.73876953125, 0.421875, 1.58251953125, 2.7431640625, 3.90380859375, 5.064453125, 6.22509765625, 7.3857421875, 8.54638671875, 9.70703125, 10.86767578125, 12.0283203125, 13.18896484375, 14.349609375, 15.51025390625, 16.6708984375, 17.83154296875, 18.9921875, 20.15283203125, 21.3134765625, 22.47412109375, 23.634765625, 24.79541015625, 25.9560546875, 27.11669921875, 28.27734375, 29.43798828125, 30.5986328125, 31.75927734375, 32.919921875, 34.08056640625, 35.2412109375, 36.40185546875, 37.5625]}, "gradients/decoder.transformer.h.23.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 1.0, 3.0, 5.0, 4.0, 7.0, 6.0, 12.0, 12.0, 9.0, 14.0, 18.0, 18.0, 24.0, 22.0, 21.0, 27.0, 24.0, 42.0, 31.0, 31.0, 36.0, 58.0, 90.0, 262.0, 1650.0, 190.0, 58.0, 46.0, 36.0, 38.0, 35.0, 33.0, 22.0, 27.0, 21.0, 15.0, 18.0, 23.0, 7.0, 10.0, 11.0, 7.0, 9.0, 6.0, 4.0, 7.0, 5.0, 2.0, 1.0, 0.0, 2.0, 0.0, 1.0, 2.0], "bins": [-14.9296875, -14.4954833984375, -14.061279296875, -13.6270751953125, -13.19287109375, -12.7586669921875, -12.324462890625, -11.8902587890625, -11.4560546875, -11.0218505859375, -10.587646484375, -10.1534423828125, -9.71923828125, -9.2850341796875, -8.850830078125, -8.4166259765625, -7.982421875, -7.5482177734375, -7.114013671875, -6.6798095703125, -6.24560546875, -5.8114013671875, -5.377197265625, -4.9429931640625, -4.5087890625, -4.0745849609375, -3.640380859375, -3.2061767578125, -2.77197265625, -2.3377685546875, -1.903564453125, -1.4693603515625, -1.03515625, -0.6009521484375, -0.166748046875, 0.2674560546875, 0.70166015625, 1.1358642578125, 1.570068359375, 2.0042724609375, 2.4384765625, 2.8726806640625, 3.306884765625, 3.7410888671875, 4.17529296875, 4.6094970703125, 5.043701171875, 5.4779052734375, 5.912109375, 6.3463134765625, 6.780517578125, 7.2147216796875, 7.64892578125, 8.0831298828125, 8.517333984375, 8.9515380859375, 9.3857421875, 9.8199462890625, 10.254150390625, 10.6883544921875, 11.12255859375, 11.5567626953125, 11.990966796875, 12.4251708984375, 12.859375]}, "gradients/decoder.transformer.h.23.attn.c_attn.weight": {"_type": "histogram", "values": [2.0, 0.0, 2.0, 1.0, 1.0, 1.0, 3.0, 3.0, 3.0, 10.0, 4.0, 5.0, 11.0, 12.0, 10.0, 13.0, 16.0, 12.0, 29.0, 19.0, 16.0, 28.0, 30.0, 46.0, 54.0, 87.0, 189.0, 630.0, 5452.0, 3130980.0, 6665.0, 708.0, 210.0, 99.0, 66.0, 40.0, 43.0, 30.0, 24.0, 24.0, 17.0, 16.0, 24.0, 16.0, 17.0, 9.0, 6.0, 9.0, 7.0, 6.0, 5.0, 5.0, 3.0, 3.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 2.0], "bins": [-58.71875, -56.72998046875, -54.7412109375, -52.75244140625, -50.763671875, -48.77490234375, -46.7861328125, -44.79736328125, -42.80859375, -40.81982421875, -38.8310546875, -36.84228515625, -34.853515625, -32.86474609375, -30.8759765625, -28.88720703125, -26.8984375, -24.90966796875, -22.9208984375, -20.93212890625, -18.943359375, -16.95458984375, -14.9658203125, -12.97705078125, -10.98828125, -8.99951171875, -7.0107421875, -5.02197265625, -3.033203125, -1.04443359375, 0.9443359375, 2.93310546875, 4.921875, 6.91064453125, 8.8994140625, 10.88818359375, 12.876953125, 14.86572265625, 16.8544921875, 18.84326171875, 20.83203125, 22.82080078125, 24.8095703125, 26.79833984375, 28.787109375, 30.77587890625, 32.7646484375, 34.75341796875, 36.7421875, 38.73095703125, 40.7197265625, 42.70849609375, 44.697265625, 46.68603515625, 48.6748046875, 50.66357421875, 52.65234375, 54.64111328125, 56.6298828125, 58.61865234375, 60.607421875, 62.59619140625, 64.5849609375, 66.57373046875, 68.5625]}, "gradients/decoder.transformer.h.23.ln_1.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 4.0, 407.0, 608.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-59.58740997314453, -49.696773529052734, -39.80613708496094, -29.915504455566406, -20.02486801147461, -10.134231567382812, -0.24359893798828125, 9.647041320800781, 19.537673950195312, 29.42831039428711, 39.318946838378906, 49.20957946777344, 59.100215911865234, 68.99085235595703, 78.88148498535156, 88.77212524414062, 98.66275787353516, 108.55339050292969, 118.44403076171875, 128.33465576171875, 138.2252960205078, 148.11593627929688, 158.00656127929688, 167.897216796875, 177.787841796875, 187.67848205566406, 197.56910705566406, 207.45974731445312, 217.3503875732422, 227.24102783203125, 237.13165283203125, 247.0222930908203, 256.9129333496094, 266.8035583496094, 276.6942138671875, 286.5848388671875, 296.4754638671875, 306.3661193847656, 316.2567443847656, 326.14739990234375, 336.03802490234375, 345.92864990234375, 355.8193054199219, 365.7099304199219, 375.6005554199219, 385.4912109375, 395.3818359375, 405.2724609375, 415.1630859375, 425.0537109375, 434.9443664550781, 444.8349914550781, 454.7256164550781, 464.61627197265625, 474.50689697265625, 484.39752197265625, 494.2881774902344, 504.1788024902344, 514.0694580078125, 523.9600830078125, 533.8507080078125, 543.7413330078125, 553.6319580078125, 563.5226440429688, 573.4132690429688]}, "gradients/decoder.transformer.h.23.ln_1.bias": {"_type": "histogram", "values": [5.0, 4.0, 4.0, 4.0, 3.0, 3.0, 3.0, 10.0, 8.0, 5.0, 11.0, 9.0, 19.0, 14.0, 16.0, 22.0, 23.0, 21.0, 21.0, 27.0, 27.0, 31.0, 29.0, 27.0, 30.0, 39.0, 37.0, 41.0, 26.0, 41.0, 36.0, 47.0, 45.0, 30.0, 34.0, 33.0, 32.0, 25.0, 28.0, 22.0, 15.0, 21.0, 20.0, 7.0, 12.0, 8.0, 7.0, 3.0, 7.0, 3.0, 6.0, 4.0, 3.0, 3.0, 2.0, 3.0, 0.0, 3.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-31.174589157104492, -29.939983367919922, -28.70537757873535, -27.47077178955078, -26.236164093017578, -25.00156021118164, -23.766952514648438, -22.532346725463867, -21.297740936279297, -20.063135147094727, -18.828529357910156, -17.593923568725586, -16.359317779541016, -15.124711036682129, -13.890104293823242, -12.655498504638672, -11.420892715454102, -10.186286926269531, -8.951681137084961, -7.717074394226074, -6.482468605041504, -5.247862815856934, -4.013256549835205, -2.7786502838134766, -1.5440444946289062, -0.30943846702575684, 0.9251675605773926, 2.159773588180542, 3.3943796157836914, 4.628985404968262, 5.86359167098999, 7.098197937011719, 8.332801818847656, 9.567407608032227, 10.802013397216797, 12.036620140075684, 13.271225929260254, 14.505831718444824, 15.740438461303711, 16.97504425048828, 18.20965003967285, 19.444255828857422, 20.678861618041992, 21.913467407226562, 23.148075103759766, 24.382678985595703, 25.617286682128906, 26.851892471313477, 28.086498260498047, 29.321104049682617, 30.555709838867188, 31.790315628051758, 33.02492141723633, 34.25952911376953, 35.49413299560547, 36.72874069213867, 37.963348388671875, 39.19795608520508, 40.432559967041016, 41.66716766357422, 42.901771545410156, 44.13637924194336, 45.3709831237793, 46.6055908203125, 47.84019470214844]}, "gradients/decoder.transformer.h.22.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 0.0, 3.0, 1.0, 3.0, 3.0, 10.0, 4.0, 3.0, 8.0, 4.0, 8.0, 8.0, 12.0, 10.0, 15.0, 31.0, 25.0, 24.0, 20.0, 34.0, 35.0, 34.0, 30.0, 50.0, 41.0, 44.0, 38.0, 44.0, 49.0, 36.0, 43.0, 39.0, 34.0, 28.0, 31.0, 27.0, 21.0, 21.0, 16.0, 24.0, 22.0, 13.0, 15.0, 12.0, 6.0, 8.0, 6.0, 7.0, 8.0, 3.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-4.70703125, -4.55645751953125, -4.4058837890625, -4.25531005859375, -4.104736328125, -3.95416259765625, -3.8035888671875, -3.65301513671875, -3.50244140625, -3.35186767578125, -3.2012939453125, -3.05072021484375, -2.900146484375, -2.74957275390625, -2.5989990234375, -2.44842529296875, -2.2978515625, -2.14727783203125, -1.9967041015625, -1.84613037109375, -1.695556640625, -1.54498291015625, -1.3944091796875, -1.24383544921875, -1.09326171875, -0.94268798828125, -0.7921142578125, -0.64154052734375, -0.490966796875, -0.34039306640625, -0.1898193359375, -0.03924560546875, 0.111328125, 0.26190185546875, 0.4124755859375, 0.56304931640625, 0.713623046875, 0.86419677734375, 1.0147705078125, 1.16534423828125, 1.31591796875, 1.46649169921875, 1.6170654296875, 1.76763916015625, 1.918212890625, 2.06878662109375, 2.2193603515625, 2.36993408203125, 2.5205078125, 2.67108154296875, 2.8216552734375, 2.97222900390625, 3.122802734375, 3.27337646484375, 3.4239501953125, 3.57452392578125, 3.72509765625, 3.87567138671875, 4.0262451171875, 4.17681884765625, 4.327392578125, 4.47796630859375, 4.6285400390625, 4.77911376953125, 4.9296875]}, "gradients/decoder.transformer.h.22.mlp.c_proj.weight": {"_type": "histogram", "values": [3.0, 1.0, 2.0, 1.0, 1.0, 0.0, 3.0, 5.0, 4.0, 4.0, 4.0, 6.0, 4.0, 7.0, 18.0, 20.0, 28.0, 29.0, 49.0, 60.0, 98.0, 152.0, 266.0, 433.0, 700.0, 1394.0, 2683.0, 5659.0, 13781.0, 42436.0, 681364.0, 3318202.0, 88889.0, 22199.0, 8477.0, 3631.0, 1675.0, 808.0, 446.0, 263.0, 168.0, 93.0, 56.0, 50.0, 29.0, 26.0, 23.0, 14.0, 9.0, 7.0, 4.0, 7.0, 2.0, 2.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-33.875, -32.7900390625, -31.705078125, -30.6201171875, -29.53515625, -28.4501953125, -27.365234375, -26.2802734375, -25.1953125, -24.1103515625, -23.025390625, -21.9404296875, -20.85546875, -19.7705078125, -18.685546875, -17.6005859375, -16.515625, -15.4306640625, -14.345703125, -13.2607421875, -12.17578125, -11.0908203125, -10.005859375, -8.9208984375, -7.8359375, -6.7509765625, -5.666015625, -4.5810546875, -3.49609375, -2.4111328125, -1.326171875, -0.2412109375, 0.84375, 1.9287109375, 3.013671875, 4.0986328125, 5.18359375, 6.2685546875, 7.353515625, 8.4384765625, 9.5234375, 10.6083984375, 11.693359375, 12.7783203125, 13.86328125, 14.9482421875, 16.033203125, 17.1181640625, 18.203125, 19.2880859375, 20.373046875, 21.4580078125, 22.54296875, 23.6279296875, 24.712890625, 25.7978515625, 26.8828125, 27.9677734375, 29.052734375, 30.1376953125, 31.22265625, 32.3076171875, 33.392578125, 34.4775390625, 35.5625]}, "gradients/decoder.transformer.h.22.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 6.0, 1.0, 2.0, 1.0, 6.0, 4.0, 10.0, 13.0, 20.0, 19.0, 26.0, 38.0, 58.0, 78.0, 95.0, 160.0, 299.0, 499.0, 747.0, 756.0, 452.0, 251.0, 178.0, 106.0, 68.0, 61.0, 39.0, 23.0, 18.0, 14.0, 13.0, 7.0, 6.0, 4.0, 0.0, 2.0, 5.0, 2.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-20.578125, -19.642822265625, -18.70751953125, -17.772216796875, -16.8369140625, -15.901611328125, -14.96630859375, -14.031005859375, -13.095703125, -12.160400390625, -11.22509765625, -10.289794921875, -9.3544921875, -8.419189453125, -7.48388671875, -6.548583984375, -5.61328125, -4.677978515625, -3.74267578125, -2.807373046875, -1.8720703125, -0.936767578125, -0.00146484375, 0.933837890625, 1.869140625, 2.804443359375, 3.73974609375, 4.675048828125, 5.6103515625, 6.545654296875, 7.48095703125, 8.416259765625, 9.3515625, 10.286865234375, 11.22216796875, 12.157470703125, 13.0927734375, 14.028076171875, 14.96337890625, 15.898681640625, 16.833984375, 17.769287109375, 18.70458984375, 19.639892578125, 20.5751953125, 21.510498046875, 22.44580078125, 23.381103515625, 24.31640625, 25.251708984375, 26.18701171875, 27.122314453125, 28.0576171875, 28.992919921875, 29.92822265625, 30.863525390625, 31.798828125, 32.734130859375, 33.66943359375, 34.604736328125, 35.5400390625, 36.475341796875, 37.41064453125, 38.345947265625, 39.28125]}, "gradients/decoder.transformer.h.22.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 3.0, 2.0, 2.0, 4.0, 3.0, 4.0, 8.0, 14.0, 14.0, 20.0, 33.0, 47.0, 69.0, 98.0, 188.0, 279.0, 492.0, 3615.0, 4184496.0, 3565.0, 527.0, 283.0, 173.0, 101.0, 79.0, 58.0, 28.0, 25.0, 24.0, 12.0, 8.0, 9.0, 2.0, 1.0, 3.0, 3.0, 4.0, 0.0, 1.0], "bins": [-216.375, -211.3994140625, -206.423828125, -201.4482421875, -196.47265625, -191.4970703125, -186.521484375, -181.5458984375, -176.5703125, -171.5947265625, -166.619140625, -161.6435546875, -156.66796875, -151.6923828125, -146.716796875, -141.7412109375, -136.765625, -131.7900390625, -126.814453125, -121.8388671875, -116.86328125, -111.8876953125, -106.912109375, -101.9365234375, -96.9609375, -91.9853515625, -87.009765625, -82.0341796875, -77.05859375, -72.0830078125, -67.107421875, -62.1318359375, -57.15625, -52.1806640625, -47.205078125, -42.2294921875, -37.25390625, -32.2783203125, -27.302734375, -22.3271484375, -17.3515625, -12.3759765625, -7.400390625, -2.4248046875, 2.55078125, 7.5263671875, 12.501953125, 17.4775390625, 22.453125, 27.4287109375, 32.404296875, 37.3798828125, 42.35546875, 47.3310546875, 52.306640625, 57.2822265625, 62.2578125, 67.2333984375, 72.208984375, 77.1845703125, 82.16015625, 87.1357421875, 92.111328125, 97.0869140625, 102.0625]}, "gradients/decoder.transformer.h.22.ln_2.weight": {"_type": "histogram", "values": [2.0, 0.0, 4.0, 142.0, 705.0, 163.0, 4.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-35.7835693359375, -27.871990203857422, -19.960412979125977, -12.048835754394531, -4.137256622314453, 3.774322509765625, 11.685897827148438, 19.597476959228516, 27.509056091308594, 35.42063522338867, 43.33221435546875, 51.24378967285156, 59.15536880493164, 67.06694793701172, 74.97852325439453, 82.89010620117188, 90.80168151855469, 98.7132568359375, 106.62483978271484, 114.53641510009766, 122.447998046875, 130.3595733642578, 138.27114868164062, 146.18272399902344, 154.09429931640625, 162.00587463378906, 169.91744995117188, 177.82904052734375, 185.74061584472656, 193.65219116210938, 201.5637664794922, 209.475341796875, 217.38693237304688, 225.2985076904297, 233.2100830078125, 241.12167358398438, 249.0332489013672, 256.94482421875, 264.85638427734375, 272.7679748535156, 280.6795654296875, 288.5911560058594, 296.5027160644531, 304.414306640625, 312.32586669921875, 320.2374572753906, 328.1490478515625, 336.06060791015625, 343.97216796875, 351.8837585449219, 359.7953186035156, 367.7069091796875, 375.61846923828125, 383.5300598144531, 391.441650390625, 399.35321044921875, 407.2648010253906, 415.1763916015625, 423.08795166015625, 430.9995422363281, 438.9111022949219, 446.82269287109375, 454.7342529296875, 462.6458435058594, 470.55743408203125]}, "gradients/decoder.transformer.h.22.ln_2.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 4.0, 3.0, 4.0, 5.0, 10.0, 21.0, 15.0, 22.0, 24.0, 24.0, 27.0, 26.0, 35.0, 38.0, 43.0, 41.0, 54.0, 40.0, 50.0, 54.0, 55.0, 50.0, 41.0, 43.0, 32.0, 36.0, 42.0, 32.0, 21.0, 16.0, 21.0, 18.0, 12.0, 11.0, 10.0, 9.0, 7.0, 5.0, 6.0, 2.0, 1.0, 2.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-82.56156158447266, -79.8252944946289, -77.08902740478516, -74.3527603149414, -71.61648559570312, -68.88021850585938, -66.14395141601562, -63.407684326171875, -60.671417236328125, -57.935150146484375, -55.198883056640625, -52.46261215209961, -49.72634506225586, -46.99007797241211, -44.253807067871094, -41.517539978027344, -38.781272888183594, -36.045005798339844, -33.308738708496094, -30.572467803955078, -27.836200714111328, -25.099933624267578, -22.363664627075195, -19.627395629882812, -16.891128540039062, -14.154860496520996, -11.41859245300293, -8.682324409484863, -5.946056365966797, -3.2097883224487305, -0.47352027893066406, 2.2627487182617188, 4.999015808105469, 7.735283851623535, 10.471551895141602, 13.207819938659668, 15.944087982177734, 18.680355072021484, 21.416624069213867, 24.15289306640625, 26.88916015625, 29.62542724609375, 32.3616943359375, 35.097965240478516, 37.834232330322266, 40.570499420166016, 43.30677032470703, 46.04303741455078, 48.77930450439453, 51.51557159423828, 54.25183868408203, 56.98810958862305, 59.7243766784668, 62.46064376831055, 65.19691467285156, 67.93318176269531, 70.66944885253906, 73.40571594238281, 76.14198303222656, 78.87825012207031, 81.61451721191406, 84.35079193115234, 87.0870590209961, 89.82332611083984, 92.5595932006836]}, "gradients/decoder.transformer.h.22.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 4.0, 2.0, 0.0, 6.0, 6.0, 6.0, 5.0, 2.0, 8.0, 4.0, 14.0, 7.0, 12.0, 10.0, 26.0, 24.0, 30.0, 31.0, 21.0, 33.0, 43.0, 42.0, 37.0, 39.0, 51.0, 41.0, 42.0, 43.0, 41.0, 48.0, 38.0, 33.0, 28.0, 28.0, 35.0, 19.0, 24.0, 22.0, 19.0, 15.0, 15.0, 15.0, 5.0, 8.0, 9.0, 7.0, 7.0, 3.0, 5.0, 2.0, 0.0, 0.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.76171875, -4.60430908203125, -4.4468994140625, -4.28948974609375, -4.132080078125, -3.97467041015625, -3.8172607421875, -3.65985107421875, -3.50244140625, -3.34503173828125, -3.1876220703125, -3.03021240234375, -2.872802734375, -2.71539306640625, -2.5579833984375, -2.40057373046875, -2.2431640625, -2.08575439453125, -1.9283447265625, -1.77093505859375, -1.613525390625, -1.45611572265625, -1.2987060546875, -1.14129638671875, -0.98388671875, -0.82647705078125, -0.6690673828125, -0.51165771484375, -0.354248046875, -0.19683837890625, -0.0394287109375, 0.11798095703125, 0.275390625, 0.43280029296875, 0.5902099609375, 0.74761962890625, 0.905029296875, 1.06243896484375, 1.2198486328125, 1.37725830078125, 1.53466796875, 1.69207763671875, 1.8494873046875, 2.00689697265625, 2.164306640625, 2.32171630859375, 2.4791259765625, 2.63653564453125, 2.7939453125, 2.95135498046875, 3.1087646484375, 3.26617431640625, 3.423583984375, 3.58099365234375, 3.7384033203125, 3.89581298828125, 4.05322265625, 4.21063232421875, 4.3680419921875, 4.52545166015625, 4.682861328125, 4.84027099609375, 4.9976806640625, 5.15509033203125, 5.3125]}, "gradients/decoder.transformer.h.22.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 1.0, 2.0, 1.0, 6.0, 3.0, 13.0, 12.0, 16.0, 19.0, 33.0, 56.0, 98.0, 136.0, 215.0, 384.0, 562.0, 976.0, 1697.0, 2810.0, 5012.0, 8632.0, 15533.0, 29427.0, 57835.0, 129428.0, 426571.0, 203716.0, 80029.0, 38744.0, 20473.0, 11041.0, 6213.0, 3572.0, 2134.0, 1263.0, 735.0, 455.0, 260.0, 156.0, 88.0, 67.0, 50.0, 18.0, 27.0, 11.0, 9.0, 8.0, 8.0, 4.0, 2.0, 3.0, 4.0, 0.0, 0.0, 0.0, 2.0], "bins": [-1.873046875, -1.8174896240234375, -1.761932373046875, -1.7063751220703125, -1.65081787109375, -1.5952606201171875, -1.539703369140625, -1.4841461181640625, -1.4285888671875, -1.3730316162109375, -1.317474365234375, -1.2619171142578125, -1.20635986328125, -1.1508026123046875, -1.095245361328125, -1.0396881103515625, -0.984130859375, -0.9285736083984375, -0.873016357421875, -0.8174591064453125, -0.76190185546875, -0.7063446044921875, -0.650787353515625, -0.5952301025390625, -0.5396728515625, -0.4841156005859375, -0.428558349609375, -0.3730010986328125, -0.31744384765625, -0.2618865966796875, -0.206329345703125, -0.1507720947265625, -0.09521484375, -0.0396575927734375, 0.015899658203125, 0.0714569091796875, 0.12701416015625, 0.1825714111328125, 0.238128662109375, 0.2936859130859375, 0.3492431640625, 0.4048004150390625, 0.460357666015625, 0.5159149169921875, 0.57147216796875, 0.6270294189453125, 0.682586669921875, 0.7381439208984375, 0.793701171875, 0.8492584228515625, 0.904815673828125, 0.9603729248046875, 1.01593017578125, 1.0714874267578125, 1.127044677734375, 1.1826019287109375, 1.2381591796875, 1.2937164306640625, 1.349273681640625, 1.4048309326171875, 1.46038818359375, 1.5159454345703125, 1.571502685546875, 1.6270599365234375, 1.6826171875]}, "gradients/decoder.transformer.h.22.crossattention.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 4.0, 1.0, 3.0, 2.0, 3.0, 5.0, 6.0, 6.0, 6.0, 11.0, 11.0, 6.0, 14.0, 26.0, 22.0, 22.0, 23.0, 34.0, 30.0, 30.0, 37.0, 45.0, 42.0, 42.0, 40.0, 37.0, 1063.0, 36.0, 33.0, 30.0, 30.0, 24.0, 38.0, 39.0, 26.0, 34.0, 34.0, 28.0, 19.0, 15.0, 15.0, 13.0, 14.0, 12.0, 10.0, 7.0, 6.0, 2.0, 3.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-3.056640625, -2.95751953125, -2.8583984375, -2.75927734375, -2.66015625, -2.56103515625, -2.4619140625, -2.36279296875, -2.263671875, -2.16455078125, -2.0654296875, -1.96630859375, -1.8671875, -1.76806640625, -1.6689453125, -1.56982421875, -1.470703125, -1.37158203125, -1.2724609375, -1.17333984375, -1.07421875, -0.97509765625, -0.8759765625, -0.77685546875, -0.677734375, -0.57861328125, -0.4794921875, -0.38037109375, -0.28125, -0.18212890625, -0.0830078125, 0.01611328125, 0.115234375, 0.21435546875, 0.3134765625, 0.41259765625, 0.51171875, 0.61083984375, 0.7099609375, 0.80908203125, 0.908203125, 1.00732421875, 1.1064453125, 1.20556640625, 1.3046875, 1.40380859375, 1.5029296875, 1.60205078125, 1.701171875, 1.80029296875, 1.8994140625, 1.99853515625, 2.09765625, 2.19677734375, 2.2958984375, 2.39501953125, 2.494140625, 2.59326171875, 2.6923828125, 2.79150390625, 2.890625, 2.98974609375, 3.0888671875, 3.18798828125, 3.287109375]}, "gradients/decoder.transformer.h.22.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 6.0, 5.0, 9.0, 10.0, 14.0, 28.0, 37.0, 60.0, 77.0, 148.0, 240.0, 370.0, 637.0, 1119.0, 1936.0, 3747.0, 6593.0, 12496.0, 24109.0, 48334.0, 100126.0, 243320.0, 1408620.0, 123933.0, 59360.0, 29177.0, 14967.0, 7902.0, 4203.0, 2371.0, 1313.0, 752.0, 425.0, 264.0, 147.0, 88.0, 62.0, 39.0, 32.0, 19.0, 8.0, 11.0, 8.0, 6.0, 5.0, 3.0, 2.0, 1.0, 4.0, 0.0, 0.0, 0.0, 2.0], "bins": [-1.7978515625, -1.7436065673828125, -1.689361572265625, -1.6351165771484375, -1.58087158203125, -1.5266265869140625, -1.472381591796875, -1.4181365966796875, -1.3638916015625, -1.3096466064453125, -1.255401611328125, -1.2011566162109375, -1.14691162109375, -1.0926666259765625, -1.038421630859375, -0.9841766357421875, -0.929931640625, -0.8756866455078125, -0.821441650390625, -0.7671966552734375, -0.71295166015625, -0.6587066650390625, -0.604461669921875, -0.5502166748046875, -0.4959716796875, -0.4417266845703125, -0.387481689453125, -0.3332366943359375, -0.27899169921875, -0.2247467041015625, -0.170501708984375, -0.1162567138671875, -0.06201171875, -0.0077667236328125, 0.046478271484375, 0.1007232666015625, 0.15496826171875, 0.2092132568359375, 0.263458251953125, 0.3177032470703125, 0.3719482421875, 0.4261932373046875, 0.480438232421875, 0.5346832275390625, 0.58892822265625, 0.6431732177734375, 0.697418212890625, 0.7516632080078125, 0.805908203125, 0.8601531982421875, 0.914398193359375, 0.9686431884765625, 1.02288818359375, 1.0771331787109375, 1.131378173828125, 1.1856231689453125, 1.2398681640625, 1.2941131591796875, 1.348358154296875, 1.4026031494140625, 1.45684814453125, 1.5110931396484375, 1.565338134765625, 1.6195831298828125, 1.673828125]}, "gradients/decoder.transformer.h.22.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 2.0, 0.0, 1.0, 2.0, 0.0, 1.0, 1.0, 3.0, 0.0, 7.0, 2.0, 7.0, 5.0, 4.0, 8.0, 10.0, 17.0, 21.0, 20.0, 20.0, 39.0, 39.0, 40.0, 42.0, 78.0, 60.0, 65.0, 78.0, 93.0, 59.0, 55.0, 52.0, 35.0, 32.0, 16.0, 19.0, 19.0, 8.0, 16.0, 9.0, 6.0, 8.0, 2.0, 2.0, 2.0, 3.0, 3.0, 0.0, 2.0, 4.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.0009136199951171875, -0.0008880943059921265, -0.0008625686168670654, -0.0008370429277420044, -0.0008115172386169434, -0.0007859915494918823, -0.0007604658603668213, -0.0007349401712417603, -0.0007094144821166992, -0.0006838887929916382, -0.0006583631038665771, -0.0006328374147415161, -0.0006073117256164551, -0.000581786036491394, -0.000556260347366333, -0.000530734658241272, -0.0005052089691162109, -0.0004796832799911499, -0.00045415759086608887, -0.00042863190174102783, -0.0004031062126159668, -0.00037758052349090576, -0.0003520548343658447, -0.0003265291452407837, -0.00030100345611572266, -0.0002754777669906616, -0.0002499520778656006, -0.00022442638874053955, -0.00019890069961547852, -0.00017337501049041748, -0.00014784932136535645, -0.0001223236322402954, -9.679794311523438e-05, -7.127225399017334e-05, -4.5746564865112305e-05, -2.022087574005127e-05, 5.304813385009766e-06, 3.08305025100708e-05, 5.6356191635131836e-05, 8.188188076019287e-05, 0.0001074075698852539, 0.00013293325901031494, 0.00015845894813537598, 0.000183984637260437, 0.00020951032638549805, 0.00023503601551055908, 0.0002605617046356201, 0.00028608739376068115, 0.0003116130828857422, 0.0003371387720108032, 0.00036266446113586426, 0.0003881901502609253, 0.00041371583938598633, 0.00043924152851104736, 0.0004647672176361084, 0.0004902929067611694, 0.0005158185958862305, 0.0005413442850112915, 0.0005668699741363525, 0.0005923956632614136, 0.0006179213523864746, 0.0006434470415115356, 0.0006689727306365967, 0.0006944984197616577, 0.0007200241088867188]}, "gradients/decoder.transformer.h.22.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 3.0, 0.0, 2.0, 3.0, 0.0, 1.0, 3.0, 3.0, 5.0, 5.0, 3.0, 5.0, 10.0, 11.0, 15.0, 18.0, 16.0, 23.0, 33.0, 64.0, 96.0, 129.0, 199.0, 610.0, 895977.0, 150232.0, 520.0, 196.0, 118.0, 58.0, 51.0, 32.0, 24.0, 26.0, 18.0, 11.0, 10.0, 6.0, 11.0, 5.0, 3.0, 3.0, 4.0, 3.0, 0.0, 1.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.0207061767578125, -0.019937992095947266, -0.01916980743408203, -0.018401622772216797, -0.017633438110351562, -0.016865253448486328, -0.016097068786621094, -0.01532888412475586, -0.014560699462890625, -0.01379251480102539, -0.013024330139160156, -0.012256145477294922, -0.011487960815429688, -0.010719776153564453, -0.009951591491699219, -0.009183406829833984, -0.00841522216796875, -0.007647037506103516, -0.006878852844238281, -0.006110668182373047, -0.0053424835205078125, -0.004574298858642578, -0.0038061141967773438, -0.0030379295349121094, -0.002269744873046875, -0.0015015602111816406, -0.0007333755493164062, 3.4809112548828125e-05, 0.0008029937744140625, 0.0015711784362792969, 0.0023393630981445312, 0.0031075477600097656, 0.003875732421875, 0.004643917083740234, 0.005412101745605469, 0.006180286407470703, 0.0069484710693359375, 0.007716655731201172, 0.008484840393066406, 0.00925302505493164, 0.010021209716796875, 0.01078939437866211, 0.011557579040527344, 0.012325763702392578, 0.013093948364257812, 0.013862133026123047, 0.014630317687988281, 0.015398502349853516, 0.01616668701171875, 0.016934871673583984, 0.01770305633544922, 0.018471240997314453, 0.019239425659179688, 0.020007610321044922, 0.020775794982910156, 0.02154397964477539, 0.022312164306640625, 0.02308034896850586, 0.023848533630371094, 0.024616718292236328, 0.025384902954101562, 0.026153087615966797, 0.02692127227783203, 0.027689456939697266, 0.0284576416015625]}, "gradients/decoder.transformer.h.22.ln_cross_attn.weight": {"_type": "histogram", "values": [7.0, 25.0, 62.0, 175.0, 290.0, 270.0, 127.0, 42.0, 17.0, 4.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-9.382082498632371e-05, -7.435216684825718e-05, -5.488350143423304e-05, -3.54148396581877e-05, -1.5946177882142365e-05, 3.5224802559241652e-06, 2.299114566994831e-05, 4.2459811083972454e-05, 6.192846922203898e-05, 8.139712736010551e-05, 0.00010086579277412966, 0.0001203344581881538, 0.00013980311632622033, 0.00015927177446428686, 0.00017874044715426862, 0.00019820910529233515, 0.00021767776343040168, 0.0002371464215684682, 0.00025661507970653474, 0.0002760837378446013, 0.00029555242508649826, 0.0003150210832245648, 0.0003344897413626313, 0.00035395839950069785, 0.0003734270576387644, 0.0003928957157768309, 0.00041236437391489744, 0.00043183303205296397, 0.0004513016901910305, 0.00047077034832909703, 0.0004902390064671636, 0.000509707722812891, 0.0005291763227432966, 0.0005486449808813632, 0.0005681136390194297, 0.0005875822971574962, 0.0006070509552955627, 0.0006265196134336293, 0.0006459882715716958, 0.0006654569879174232, 0.0006849255878478289, 0.0007043942459858954, 0.0007238629041239619, 0.0007433315622620285, 0.000762800220400095, 0.0007822688785381615, 0.000801737536676228, 0.0008212062530219555, 0.000840674911160022, 0.0008601435692980886, 0.0008796122274361551, 0.0008990808855742216, 0.0009185495437122881, 0.0009380182018503547, 0.0009574868599884212, 0.0009769555181264877, 0.0009964242344722152, 0.0010158929508179426, 0.0010353615507483482, 0.0010548302670940757, 0.0010742988670244813, 0.0010937675833702087, 0.0011132361833006144, 0.0011327048996463418, 0.0011521734995767474]}, "gradients/decoder.transformer.h.22.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 1.0, 4.0, 2.0, 4.0, 7.0, 9.0, 6.0, 18.0, 13.0, 21.0, 12.0, 19.0, 25.0, 23.0, 22.0, 24.0, 27.0, 21.0, 34.0, 32.0, 31.0, 37.0, 52.0, 41.0, 38.0, 37.0, 45.0, 39.0, 28.0, 37.0, 38.0, 30.0, 38.0, 32.0, 23.0, 22.0, 16.0, 18.0, 10.0, 14.0, 7.0, 10.0, 11.0, 13.0, 2.0, 8.0, 2.0, 1.0, 2.0, 1.0, 2.0, 4.0, 2.0, 1.0], "bins": [-0.00039637088775634766, -0.0003847982734441757, -0.0003732256591320038, -0.00036165304481983185, -0.0003500804305076599, -0.000338507816195488, -0.00032693520188331604, -0.0003153625875711441, -0.00030378997325897217, -0.00029221735894680023, -0.0002806447446346283, -0.00026907213032245636, -0.0002574995160102844, -0.0002459269016981125, -0.00023435428738594055, -0.00022278167307376862, -0.00021120905876159668, -0.00019963644444942474, -0.0001880638301372528, -0.00017649121582508087, -0.00016491860151290894, -0.000153345987200737, -0.00014177337288856506, -0.00013020075857639313, -0.00011862814426422119, -0.00010705552995204926, -9.548291563987732e-05, -8.391030132770538e-05, -7.233768701553345e-05, -6.076507270336151e-05, -4.9192458391189575e-05, -3.761984407901764e-05, -2.6047229766845703e-05, -1.4474615454673767e-05, -2.902001142501831e-06, 8.670613169670105e-06, 2.024322748184204e-05, 3.181584179401398e-05, 4.338845610618591e-05, 5.496107041835785e-05, 6.653368473052979e-05, 7.810629904270172e-05, 8.967891335487366e-05, 0.0001012515276670456, 0.00011282414197921753, 0.00012439675629138947, 0.0001359693706035614, 0.00014754198491573334, 0.00015911459922790527, 0.0001706872135400772, 0.00018225982785224915, 0.00019383244216442108, 0.00020540505647659302, 0.00021697767078876495, 0.0002285502851009369, 0.00024012289941310883, 0.00025169551372528076, 0.0002632681280374527, 0.00027484074234962463, 0.00028641335666179657, 0.0002979859709739685, 0.00030955858528614044, 0.0003211311995983124, 0.0003327038139104843, 0.00034427642822265625]}, "gradients/decoder.transformer.h.22.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 4.0, 2.0, 0.0, 6.0, 6.0, 6.0, 5.0, 2.0, 8.0, 4.0, 14.0, 7.0, 12.0, 10.0, 26.0, 24.0, 30.0, 31.0, 21.0, 33.0, 43.0, 42.0, 37.0, 39.0, 51.0, 41.0, 42.0, 43.0, 41.0, 48.0, 38.0, 33.0, 28.0, 28.0, 35.0, 19.0, 24.0, 22.0, 19.0, 15.0, 15.0, 15.0, 5.0, 8.0, 9.0, 7.0, 7.0, 3.0, 5.0, 2.0, 0.0, 0.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.76171875, -4.60430908203125, -4.4468994140625, -4.28948974609375, -4.132080078125, -3.97467041015625, -3.8172607421875, -3.65985107421875, -3.50244140625, -3.34503173828125, -3.1876220703125, -3.03021240234375, -2.872802734375, -2.71539306640625, -2.5579833984375, -2.40057373046875, -2.2431640625, -2.08575439453125, -1.9283447265625, -1.77093505859375, -1.613525390625, -1.45611572265625, -1.2987060546875, -1.14129638671875, -0.98388671875, -0.82647705078125, -0.6690673828125, -0.51165771484375, -0.354248046875, -0.19683837890625, -0.0394287109375, 0.11798095703125, 0.275390625, 0.43280029296875, 0.5902099609375, 0.74761962890625, 0.905029296875, 1.06243896484375, 1.2198486328125, 1.37725830078125, 1.53466796875, 1.69207763671875, 1.8494873046875, 2.00689697265625, 2.164306640625, 2.32171630859375, 2.4791259765625, 2.63653564453125, 2.7939453125, 2.95135498046875, 3.1087646484375, 3.26617431640625, 3.423583984375, 3.58099365234375, 3.7384033203125, 3.89581298828125, 4.05322265625, 4.21063232421875, 4.3680419921875, 4.52545166015625, 4.682861328125, 4.84027099609375, 4.9976806640625, 5.15509033203125, 5.3125]}, "gradients/decoder.transformer.h.22.attn.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 5.0, 2.0, 9.0, 4.0, 5.0, 13.0, 15.0, 24.0, 24.0, 28.0, 36.0, 54.0, 74.0, 72.0, 134.0, 192.0, 221.0, 379.0, 586.0, 1002.0, 1862.0, 3826.0, 9041.0, 24359.0, 78787.0, 305932.0, 444189.0, 119997.0, 34729.0, 12320.0, 5028.0, 2287.0, 1248.0, 668.0, 379.0, 298.0, 186.0, 142.0, 91.0, 79.0, 67.0, 44.0, 35.0, 12.0, 22.0, 19.0, 15.0, 11.0, 6.0, 4.0, 1.0, 5.0, 0.0, 3.0, 2.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.57421875, -4.42742919921875, -4.2806396484375, -4.13385009765625, -3.987060546875, -3.84027099609375, -3.6934814453125, -3.54669189453125, -3.39990234375, -3.25311279296875, -3.1063232421875, -2.95953369140625, -2.812744140625, -2.66595458984375, -2.5191650390625, -2.37237548828125, -2.2255859375, -2.07879638671875, -1.9320068359375, -1.78521728515625, -1.638427734375, -1.49163818359375, -1.3448486328125, -1.19805908203125, -1.05126953125, -0.90447998046875, -0.7576904296875, -0.61090087890625, -0.464111328125, -0.31732177734375, -0.1705322265625, -0.02374267578125, 0.123046875, 0.26983642578125, 0.4166259765625, 0.56341552734375, 0.710205078125, 0.85699462890625, 1.0037841796875, 1.15057373046875, 1.29736328125, 1.44415283203125, 1.5909423828125, 1.73773193359375, 1.884521484375, 2.03131103515625, 2.1781005859375, 2.32489013671875, 2.4716796875, 2.61846923828125, 2.7652587890625, 2.91204833984375, 3.058837890625, 3.20562744140625, 3.3524169921875, 3.49920654296875, 3.64599609375, 3.79278564453125, 3.9395751953125, 4.08636474609375, 4.233154296875, 4.37994384765625, 4.5267333984375, 4.67352294921875, 4.8203125]}, "gradients/decoder.transformer.h.22.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 1.0, 0.0, 3.0, 1.0, 0.0, 2.0, 3.0, 4.0, 3.0, 8.0, 7.0, 11.0, 6.0, 17.0, 18.0, 19.0, 16.0, 24.0, 26.0, 19.0, 42.0, 22.0, 35.0, 37.0, 50.0, 46.0, 76.0, 186.0, 1724.0, 184.0, 54.0, 47.0, 41.0, 25.0, 36.0, 26.0, 28.0, 36.0, 22.0, 26.0, 21.0, 16.0, 20.0, 19.0, 8.0, 7.0, 9.0, 11.0, 4.0, 1.0, 5.0, 2.0, 1.0, 4.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0, 3.0], "bins": [-16.65625, -16.128173828125, -15.60009765625, -15.072021484375, -14.5439453125, -14.015869140625, -13.48779296875, -12.959716796875, -12.431640625, -11.903564453125, -11.37548828125, -10.847412109375, -10.3193359375, -9.791259765625, -9.26318359375, -8.735107421875, -8.20703125, -7.678955078125, -7.15087890625, -6.622802734375, -6.0947265625, -5.566650390625, -5.03857421875, -4.510498046875, -3.982421875, -3.454345703125, -2.92626953125, -2.398193359375, -1.8701171875, -1.342041015625, -0.81396484375, -0.285888671875, 0.2421875, 0.770263671875, 1.29833984375, 1.826416015625, 2.3544921875, 2.882568359375, 3.41064453125, 3.938720703125, 4.466796875, 4.994873046875, 5.52294921875, 6.051025390625, 6.5791015625, 7.107177734375, 7.63525390625, 8.163330078125, 8.69140625, 9.219482421875, 9.74755859375, 10.275634765625, 10.8037109375, 11.331787109375, 11.85986328125, 12.387939453125, 12.916015625, 13.444091796875, 13.97216796875, 14.500244140625, 15.0283203125, 15.556396484375, 16.08447265625, 16.612548828125, 17.140625]}, "gradients/decoder.transformer.h.22.attn.c_attn.weight": {"_type": "histogram", "values": [2.0, 3.0, 0.0, 1.0, 1.0, 1.0, 3.0, 2.0, 1.0, 3.0, 5.0, 1.0, 4.0, 7.0, 10.0, 8.0, 9.0, 20.0, 19.0, 18.0, 19.0, 29.0, 37.0, 32.0, 33.0, 49.0, 77.0, 99.0, 162.0, 351.0, 946.0, 12686.0, 3121754.0, 7599.0, 817.0, 312.0, 162.0, 85.0, 75.0, 39.0, 43.0, 30.0, 29.0, 24.0, 17.0, 21.0, 13.0, 13.0, 10.0, 9.0, 12.0, 7.0, 2.0, 3.0, 4.0, 2.0, 0.0, 0.0, 4.0, 0.0, 1.0, 0.0, 2.0, 1.0], "bins": [-39.78125, -38.55517578125, -37.3291015625, -36.10302734375, -34.876953125, -33.65087890625, -32.4248046875, -31.19873046875, -29.97265625, -28.74658203125, -27.5205078125, -26.29443359375, -25.068359375, -23.84228515625, -22.6162109375, -21.39013671875, -20.1640625, -18.93798828125, -17.7119140625, -16.48583984375, -15.259765625, -14.03369140625, -12.8076171875, -11.58154296875, -10.35546875, -9.12939453125, -7.9033203125, -6.67724609375, -5.451171875, -4.22509765625, -2.9990234375, -1.77294921875, -0.546875, 0.67919921875, 1.9052734375, 3.13134765625, 4.357421875, 5.58349609375, 6.8095703125, 8.03564453125, 9.26171875, 10.48779296875, 11.7138671875, 12.93994140625, 14.166015625, 15.39208984375, 16.6181640625, 17.84423828125, 19.0703125, 20.29638671875, 21.5224609375, 22.74853515625, 23.974609375, 25.20068359375, 26.4267578125, 27.65283203125, 28.87890625, 30.10498046875, 31.3310546875, 32.55712890625, 33.783203125, 35.00927734375, 36.2353515625, 37.46142578125, 38.6875]}, "gradients/decoder.transformer.h.22.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 28.0, 428.0, 507.0, 46.0, 2.0, 2.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-76.03648376464844, -74.2300796508789, -72.4236831665039, -70.61727905273438, -68.81087493896484, -67.00447082519531, -65.19807434082031, -63.39167022705078, -61.58526611328125, -59.778865814208984, -57.97246170043945, -56.16606140136719, -54.359657287597656, -52.55325698852539, -50.746856689453125, -48.940452575683594, -47.13405227661133, -45.32765197753906, -43.52124786376953, -41.714847564697266, -39.908443450927734, -38.10204315185547, -36.29563903808594, -34.48923873901367, -32.682838439941406, -30.876436233520508, -29.07003402709961, -27.263633728027344, -25.457229614257812, -23.650829315185547, -21.84442710876465, -20.03802490234375, -18.231624603271484, -16.425222396850586, -14.618820190429688, -12.812418937683105, -11.006016731262207, -9.199614524841309, -7.393213272094727, -5.586811065673828, -3.7804088592529297, -1.9740068912506104, -0.16760492324829102, 1.6387968063354492, 3.4451990127563477, 5.251601219177246, 7.058002471923828, 8.864404678344727, 10.670806884765625, 12.477209091186523, 14.283611297607422, 16.090011596679688, 17.89641571044922, 19.702816009521484, 21.509218215942383, 23.31562042236328, 25.12202262878418, 26.928424835205078, 28.734827041625977, 30.541229248046875, 32.34762954711914, 34.15403366088867, 35.96043395996094, 37.76683807373047, 39.573238372802734]}, "gradients/decoder.transformer.h.22.ln_1.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 4.0, 2.0, 3.0, 4.0, 3.0, 4.0, 3.0, 3.0, 12.0, 10.0, 10.0, 6.0, 11.0, 18.0, 20.0, 16.0, 23.0, 21.0, 39.0, 22.0, 37.0, 34.0, 36.0, 32.0, 48.0, 25.0, 44.0, 41.0, 39.0, 41.0, 36.0, 44.0, 32.0, 36.0, 26.0, 33.0, 34.0, 25.0, 20.0, 17.0, 18.0, 12.0, 15.0, 15.0, 9.0, 7.0, 9.0, 3.0, 3.0, 2.0, 0.0, 3.0, 3.0, 1.0, 1.0, 3.0], "bins": [-56.42034912109375, -54.805362701416016, -53.19037628173828, -51.57538986206055, -49.96040344238281, -48.34541320800781, -46.73042678833008, -45.115440368652344, -43.50045394897461, -41.885467529296875, -40.27048110961914, -38.655494689941406, -37.040504455566406, -35.42552185058594, -33.81053161621094, -32.1955451965332, -30.58055877685547, -28.965572357177734, -27.3505859375, -25.735597610473633, -24.1206111907959, -22.505624771118164, -20.890636444091797, -19.275650024414062, -17.660663604736328, -16.045677185058594, -14.430689811706543, -12.815702438354492, -11.200716018676758, -9.585729598999023, -7.970742225646973, -6.355754852294922, -4.7407684326171875, -3.125781536102295, -1.5107946395874023, 0.10419225692749023, 1.7191791534423828, 3.334165573120117, 4.949152946472168, 6.564140319824219, 8.179126739501953, 9.794113159179688, 11.409100532531738, 13.024087905883789, 14.639074325561523, 16.254060745239258, 17.869049072265625, 19.48403549194336, 21.099021911621094, 22.714008331298828, 24.328994750976562, 25.94398307800293, 27.558969497680664, 29.1739559173584, 30.788944244384766, 32.4039306640625, 34.018917083740234, 35.63390350341797, 37.2488899230957, 38.86387634277344, 40.47886657714844, 42.093849182128906, 43.708839416503906, 45.32382583618164, 46.938812255859375]}, "gradients/decoder.transformer.h.21.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 3.0, 0.0, 2.0, 3.0, 4.0, 4.0, 7.0, 3.0, 4.0, 5.0, 6.0, 6.0, 8.0, 12.0, 9.0, 17.0, 16.0, 26.0, 21.0, 35.0, 29.0, 28.0, 39.0, 32.0, 35.0, 42.0, 47.0, 36.0, 47.0, 34.0, 51.0, 40.0, 43.0, 34.0, 32.0, 28.0, 30.0, 25.0, 27.0, 17.0, 23.0, 18.0, 7.0, 17.0, 12.0, 12.0, 7.0, 11.0, 5.0, 3.0, 6.0, 4.0, 2.0, 1.0, 1.0, 1.0, 0.0, 3.0, 1.0, 0.0, 0.0, 1.0], "bins": [-4.74609375, -4.59173583984375, -4.4373779296875, -4.28302001953125, -4.128662109375, -3.97430419921875, -3.8199462890625, -3.66558837890625, -3.51123046875, -3.35687255859375, -3.2025146484375, -3.04815673828125, -2.893798828125, -2.73944091796875, -2.5850830078125, -2.43072509765625, -2.2763671875, -2.12200927734375, -1.9676513671875, -1.81329345703125, -1.658935546875, -1.50457763671875, -1.3502197265625, -1.19586181640625, -1.04150390625, -0.88714599609375, -0.7327880859375, -0.57843017578125, -0.424072265625, -0.26971435546875, -0.1153564453125, 0.03900146484375, 0.193359375, 0.34771728515625, 0.5020751953125, 0.65643310546875, 0.810791015625, 0.96514892578125, 1.1195068359375, 1.27386474609375, 1.42822265625, 1.58258056640625, 1.7369384765625, 1.89129638671875, 2.045654296875, 2.20001220703125, 2.3543701171875, 2.50872802734375, 2.6630859375, 2.81744384765625, 2.9718017578125, 3.12615966796875, 3.280517578125, 3.43487548828125, 3.5892333984375, 3.74359130859375, 3.89794921875, 4.05230712890625, 4.2066650390625, 4.36102294921875, 4.515380859375, 4.66973876953125, 4.8240966796875, 4.97845458984375, 5.1328125]}, "gradients/decoder.transformer.h.21.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 2.0, 2.0, 5.0, 3.0, 1.0, 11.0, 8.0, 8.0, 11.0, 15.0, 13.0, 19.0, 26.0, 39.0, 48.0, 81.0, 86.0, 138.0, 282.0, 632.0, 2139.0, 12728.0, 245761.0, 2767235.0, 1105776.0, 52793.0, 4485.0, 984.0, 339.0, 188.0, 105.0, 80.0, 48.0, 44.0, 36.0, 20.0, 18.0, 23.0, 17.0, 10.0, 10.0, 6.0, 5.0, 3.0, 3.0, 2.0, 3.0, 1.0, 0.0, 2.0, 1.0, 2.0, 0.0, 1.0], "bins": [-16.5625, -16.07177734375, -15.5810546875, -15.09033203125, -14.599609375, -14.10888671875, -13.6181640625, -13.12744140625, -12.63671875, -12.14599609375, -11.6552734375, -11.16455078125, -10.673828125, -10.18310546875, -9.6923828125, -9.20166015625, -8.7109375, -8.22021484375, -7.7294921875, -7.23876953125, -6.748046875, -6.25732421875, -5.7666015625, -5.27587890625, -4.78515625, -4.29443359375, -3.8037109375, -3.31298828125, -2.822265625, -2.33154296875, -1.8408203125, -1.35009765625, -0.859375, -0.36865234375, 0.1220703125, 0.61279296875, 1.103515625, 1.59423828125, 2.0849609375, 2.57568359375, 3.06640625, 3.55712890625, 4.0478515625, 4.53857421875, 5.029296875, 5.52001953125, 6.0107421875, 6.50146484375, 6.9921875, 7.48291015625, 7.9736328125, 8.46435546875, 8.955078125, 9.44580078125, 9.9365234375, 10.42724609375, 10.91796875, 11.40869140625, 11.8994140625, 12.39013671875, 12.880859375, 13.37158203125, 13.8623046875, 14.35302734375, 14.84375]}, "gradients/decoder.transformer.h.21.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 2.0, 2.0, 4.0, 3.0, 9.0, 8.0, 18.0, 18.0, 28.0, 26.0, 52.0, 67.0, 94.0, 114.0, 154.0, 225.0, 373.0, 469.0, 624.0, 510.0, 404.0, 256.0, 202.0, 121.0, 80.0, 53.0, 48.0, 36.0, 30.0, 18.0, 11.0, 6.0, 6.0, 6.0, 3.0, 4.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-22.859375, -22.199951171875, -21.54052734375, -20.881103515625, -20.2216796875, -19.562255859375, -18.90283203125, -18.243408203125, -17.583984375, -16.924560546875, -16.26513671875, -15.605712890625, -14.9462890625, -14.286865234375, -13.62744140625, -12.968017578125, -12.30859375, -11.649169921875, -10.98974609375, -10.330322265625, -9.6708984375, -9.011474609375, -8.35205078125, -7.692626953125, -7.033203125, -6.373779296875, -5.71435546875, -5.054931640625, -4.3955078125, -3.736083984375, -3.07666015625, -2.417236328125, -1.7578125, -1.098388671875, -0.43896484375, 0.220458984375, 0.8798828125, 1.539306640625, 2.19873046875, 2.858154296875, 3.517578125, 4.177001953125, 4.83642578125, 5.495849609375, 6.1552734375, 6.814697265625, 7.47412109375, 8.133544921875, 8.79296875, 9.452392578125, 10.11181640625, 10.771240234375, 11.4306640625, 12.090087890625, 12.74951171875, 13.408935546875, 14.068359375, 14.727783203125, 15.38720703125, 16.046630859375, 16.7060546875, 17.365478515625, 18.02490234375, 18.684326171875, 19.34375]}, "gradients/decoder.transformer.h.21.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 0.0, 0.0, 0.0, 4.0, 2.0, 4.0, 7.0, 5.0, 21.0, 19.0, 25.0, 33.0, 40.0, 67.0, 95.0, 121.0, 189.0, 295.0, 595.0, 2059.0, 439873.0, 3744932.0, 4056.0, 695.0, 360.0, 214.0, 159.0, 122.0, 81.0, 71.0, 40.0, 24.0, 18.0, 34.0, 10.0, 5.0, 5.0, 4.0, 4.0, 3.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 2.0], "bins": [-83.625, -81.2451171875, -78.865234375, -76.4853515625, -74.10546875, -71.7255859375, -69.345703125, -66.9658203125, -64.5859375, -62.2060546875, -59.826171875, -57.4462890625, -55.06640625, -52.6865234375, -50.306640625, -47.9267578125, -45.546875, -43.1669921875, -40.787109375, -38.4072265625, -36.02734375, -33.6474609375, -31.267578125, -28.8876953125, -26.5078125, -24.1279296875, -21.748046875, -19.3681640625, -16.98828125, -14.6083984375, -12.228515625, -9.8486328125, -7.46875, -5.0888671875, -2.708984375, -0.3291015625, 2.05078125, 4.4306640625, 6.810546875, 9.1904296875, 11.5703125, 13.9501953125, 16.330078125, 18.7099609375, 21.08984375, 23.4697265625, 25.849609375, 28.2294921875, 30.609375, 32.9892578125, 35.369140625, 37.7490234375, 40.12890625, 42.5087890625, 44.888671875, 47.2685546875, 49.6484375, 52.0283203125, 54.408203125, 56.7880859375, 59.16796875, 61.5478515625, 63.927734375, 66.3076171875, 68.6875]}, "gradients/decoder.transformer.h.21.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 5.0, 66.0, 396.0, 439.0, 107.0, 6.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-47.977500915527344, -42.07735061645508, -36.17719650268555, -30.27704620361328, -24.376893997192383, -18.476741790771484, -12.576591491699219, -6.6764373779296875, -0.7762870788574219, 5.123864650726318, 11.024016380310059, 16.92416763305664, 22.82431983947754, 28.724472045898438, 34.6246223449707, 40.524776458740234, 46.4249267578125, 52.325077056884766, 58.2252311706543, 64.12538146972656, 70.0255355834961, 75.92568969726562, 81.82583618164062, 87.72599029541016, 93.62614440917969, 99.52629852294922, 105.42644500732422, 111.32659912109375, 117.22675323486328, 123.12690734863281, 129.0270538330078, 134.92721557617188, 140.8273468017578, 146.7274932861328, 152.62765502929688, 158.52780151367188, 164.42794799804688, 170.32810974121094, 176.22825622558594, 182.12841796875, 188.028564453125, 193.9287109375, 199.82887268066406, 205.72901916503906, 211.62916564941406, 217.52932739257812, 223.42947387695312, 229.32962036132812, 235.22976684570312, 241.12991333007812, 247.0300750732422, 252.9302215576172, 258.83038330078125, 264.73052978515625, 270.63067626953125, 276.53082275390625, 282.4309997558594, 288.3311462402344, 294.2312927246094, 300.1314697265625, 306.0316162109375, 311.9317626953125, 317.8319091796875, 323.7320556640625, 329.6322021484375]}, "gradients/decoder.transformer.h.21.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 3.0, 5.0, 2.0, 3.0, 3.0, 3.0, 7.0, 11.0, 10.0, 12.0, 21.0, 16.0, 14.0, 16.0, 19.0, 31.0, 23.0, 23.0, 34.0, 36.0, 35.0, 42.0, 42.0, 53.0, 40.0, 42.0, 32.0, 41.0, 39.0, 49.0, 40.0, 45.0, 24.0, 30.0, 26.0, 20.0, 24.0, 17.0, 19.0, 20.0, 12.0, 7.0, 11.0, 4.0, 2.0, 5.0, 2.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 3.0], "bins": [-75.74655151367188, -73.60922241210938, -71.47189331054688, -69.3345718383789, -67.1972427368164, -65.0599136352539, -62.92258834838867, -60.78526306152344, -58.64793395996094, -56.51060485839844, -54.3732795715332, -52.23595428466797, -50.09862518310547, -47.96129608154297, -45.823970794677734, -43.6866455078125, -41.54931640625, -39.4119873046875, -37.274662017822266, -35.13733673095703, -33.00000762939453, -30.862680435180664, -28.725353240966797, -26.58802604675293, -24.450698852539062, -22.313371658325195, -20.176044464111328, -18.03871726989746, -15.901390075683594, -13.764062881469727, -11.62673568725586, -9.489408493041992, -7.352088928222656, -5.214761734008789, -3.077434539794922, -0.9401073455810547, 1.1972198486328125, 3.3345470428466797, 5.471874237060547, 7.609201431274414, 9.746528625488281, 11.883855819702148, 14.021183013916016, 16.158510208129883, 18.29583740234375, 20.433164596557617, 22.570491790771484, 24.70781898498535, 26.84514617919922, 28.982473373413086, 31.119800567626953, 33.25712585449219, 35.39445495605469, 37.53178405761719, 39.66910934448242, 41.806434631347656, 43.943763732910156, 46.081092834472656, 48.21841812133789, 50.355743408203125, 52.493072509765625, 54.630401611328125, 56.76772689819336, 58.905052185058594, 61.042381286621094]}, "gradients/decoder.transformer.h.21.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 1.0, 2.0, 2.0, 3.0, 4.0, 2.0, 7.0, 4.0, 3.0, 4.0, 4.0, 6.0, 10.0, 18.0, 14.0, 16.0, 20.0, 15.0, 22.0, 32.0, 31.0, 34.0, 32.0, 45.0, 38.0, 34.0, 44.0, 53.0, 40.0, 37.0, 50.0, 43.0, 36.0, 31.0, 33.0, 22.0, 26.0, 22.0, 30.0, 14.0, 22.0, 17.0, 13.0, 17.0, 14.0, 12.0, 8.0, 8.0, 5.0, 3.0, 1.0, 5.0, 0.0, 4.0, 2.0, 1.0, 1.0, 2.0, 0.0, 1.0, 1.0], "bins": [-4.9609375, -4.803466796875, -4.64599609375, -4.488525390625, -4.3310546875, -4.173583984375, -4.01611328125, -3.858642578125, -3.701171875, -3.543701171875, -3.38623046875, -3.228759765625, -3.0712890625, -2.913818359375, -2.75634765625, -2.598876953125, -2.44140625, -2.283935546875, -2.12646484375, -1.968994140625, -1.8115234375, -1.654052734375, -1.49658203125, -1.339111328125, -1.181640625, -1.024169921875, -0.86669921875, -0.709228515625, -0.5517578125, -0.394287109375, -0.23681640625, -0.079345703125, 0.078125, 0.235595703125, 0.39306640625, 0.550537109375, 0.7080078125, 0.865478515625, 1.02294921875, 1.180419921875, 1.337890625, 1.495361328125, 1.65283203125, 1.810302734375, 1.9677734375, 2.125244140625, 2.28271484375, 2.440185546875, 2.59765625, 2.755126953125, 2.91259765625, 3.070068359375, 3.2275390625, 3.385009765625, 3.54248046875, 3.699951171875, 3.857421875, 4.014892578125, 4.17236328125, 4.329833984375, 4.4873046875, 4.644775390625, 4.80224609375, 4.959716796875, 5.1171875]}, "gradients/decoder.transformer.h.21.crossattention.c_proj.weight": {"_type": "histogram", "values": [3.0, 3.0, 1.0, 6.0, 10.0, 11.0, 18.0, 25.0, 39.0, 38.0, 69.0, 95.0, 135.0, 193.0, 251.0, 360.0, 519.0, 699.0, 999.0, 1439.0, 1995.0, 2886.0, 4180.0, 6054.0, 8761.0, 13163.0, 19494.0, 29880.0, 46534.0, 77361.0, 145831.0, 324173.0, 146854.0, 78142.0, 46996.0, 30010.0, 19650.0, 12901.0, 8857.0, 5997.0, 4148.0, 2878.0, 1986.0, 1406.0, 1019.0, 733.0, 516.0, 361.0, 274.0, 169.0, 131.0, 93.0, 74.0, 40.0, 39.0, 21.0, 23.0, 13.0, 4.0, 6.0, 5.0, 3.0, 1.0, 1.0], "bins": [-1.2099609375, -1.1715545654296875, -1.133148193359375, -1.0947418212890625, -1.05633544921875, -1.0179290771484375, -0.979522705078125, -0.9411163330078125, -0.9027099609375, -0.8643035888671875, -0.825897216796875, -0.7874908447265625, -0.74908447265625, -0.7106781005859375, -0.672271728515625, -0.6338653564453125, -0.595458984375, -0.5570526123046875, -0.518646240234375, -0.4802398681640625, -0.44183349609375, -0.4034271240234375, -0.365020751953125, -0.3266143798828125, -0.2882080078125, -0.2498016357421875, -0.211395263671875, -0.1729888916015625, -0.13458251953125, -0.0961761474609375, -0.057769775390625, -0.0193634033203125, 0.01904296875, 0.0574493408203125, 0.095855712890625, 0.1342620849609375, 0.17266845703125, 0.2110748291015625, 0.249481201171875, 0.2878875732421875, 0.3262939453125, 0.3647003173828125, 0.403106689453125, 0.4415130615234375, 0.47991943359375, 0.5183258056640625, 0.556732177734375, 0.5951385498046875, 0.633544921875, 0.6719512939453125, 0.710357666015625, 0.7487640380859375, 0.78717041015625, 0.8255767822265625, 0.863983154296875, 0.9023895263671875, 0.9407958984375, 0.9792022705078125, 1.017608642578125, 1.0560150146484375, 1.09442138671875, 1.1328277587890625, 1.171234130859375, 1.2096405029296875, 1.248046875]}, "gradients/decoder.transformer.h.21.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 3.0, 4.0, 2.0, 6.0, 8.0, 4.0, 5.0, 10.0, 14.0, 16.0, 15.0, 15.0, 28.0, 24.0, 31.0, 24.0, 37.0, 30.0, 31.0, 38.0, 32.0, 42.0, 42.0, 1062.0, 38.0, 57.0, 42.0, 39.0, 38.0, 30.0, 31.0, 44.0, 35.0, 18.0, 18.0, 24.0, 20.0, 20.0, 9.0, 12.0, 9.0, 4.0, 12.0, 6.0, 9.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-3.453125, -3.3465576171875, -3.239990234375, -3.1334228515625, -3.02685546875, -2.9202880859375, -2.813720703125, -2.7071533203125, -2.6005859375, -2.4940185546875, -2.387451171875, -2.2808837890625, -2.17431640625, -2.0677490234375, -1.961181640625, -1.8546142578125, -1.748046875, -1.6414794921875, -1.534912109375, -1.4283447265625, -1.32177734375, -1.2152099609375, -1.108642578125, -1.0020751953125, -0.8955078125, -0.7889404296875, -0.682373046875, -0.5758056640625, -0.46923828125, -0.3626708984375, -0.256103515625, -0.1495361328125, -0.04296875, 0.0635986328125, 0.170166015625, 0.2767333984375, 0.38330078125, 0.4898681640625, 0.596435546875, 0.7030029296875, 0.8095703125, 0.9161376953125, 1.022705078125, 1.1292724609375, 1.23583984375, 1.3424072265625, 1.448974609375, 1.5555419921875, 1.662109375, 1.7686767578125, 1.875244140625, 1.9818115234375, 2.08837890625, 2.1949462890625, 2.301513671875, 2.4080810546875, 2.5146484375, 2.6212158203125, 2.727783203125, 2.8343505859375, 2.94091796875, 3.0474853515625, 3.154052734375, 3.2606201171875, 3.3671875]}, "gradients/decoder.transformer.h.21.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 10.0, 9.0, 14.0, 6.0, 14.0, 25.0, 36.0, 49.0, 91.0, 148.0, 247.0, 416.0, 754.0, 1345.0, 2625.0, 4650.0, 8965.0, 17006.0, 34297.0, 70523.0, 157951.0, 1479280.0, 169716.0, 74463.0, 36155.0, 18022.0, 9390.0, 4859.0, 2751.0, 1367.0, 855.0, 465.0, 251.0, 123.0, 90.0, 62.0, 43.0, 22.0, 12.0, 6.0, 9.0, 9.0, 2.0, 6.0, 2.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.845703125, -1.78717041015625, -1.7286376953125, -1.67010498046875, -1.611572265625, -1.55303955078125, -1.4945068359375, -1.43597412109375, -1.37744140625, -1.31890869140625, -1.2603759765625, -1.20184326171875, -1.143310546875, -1.08477783203125, -1.0262451171875, -0.96771240234375, -0.9091796875, -0.85064697265625, -0.7921142578125, -0.73358154296875, -0.675048828125, -0.61651611328125, -0.5579833984375, -0.49945068359375, -0.44091796875, -0.38238525390625, -0.3238525390625, -0.26531982421875, -0.206787109375, -0.14825439453125, -0.0897216796875, -0.03118896484375, 0.02734375, 0.08587646484375, 0.1444091796875, 0.20294189453125, 0.261474609375, 0.32000732421875, 0.3785400390625, 0.43707275390625, 0.49560546875, 0.55413818359375, 0.6126708984375, 0.67120361328125, 0.729736328125, 0.78826904296875, 0.8468017578125, 0.90533447265625, 0.9638671875, 1.02239990234375, 1.0809326171875, 1.13946533203125, 1.197998046875, 1.25653076171875, 1.3150634765625, 1.37359619140625, 1.43212890625, 1.49066162109375, 1.5491943359375, 1.60772705078125, 1.666259765625, 1.72479248046875, 1.7833251953125, 1.84185791015625, 1.900390625]}, "gradients/decoder.transformer.h.21.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 5.0, 2.0, 0.0, 1.0, 3.0, 2.0, 3.0, 3.0, 2.0, 6.0, 10.0, 8.0, 14.0, 18.0, 20.0, 26.0, 19.0, 29.0, 35.0, 37.0, 49.0, 71.0, 62.0, 50.0, 52.0, 64.0, 55.0, 52.0, 45.0, 44.0, 55.0, 25.0, 31.0, 17.0, 16.0, 14.0, 17.0, 19.0, 13.0, 7.0, 2.0, 3.0, 3.0, 5.0, 3.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.00106048583984375, -0.00102977454662323, -0.00099906325340271, -0.0009683519601821899, -0.0009376406669616699, -0.0009069293737411499, -0.0008762180805206299, -0.0008455067873001099, -0.0008147954940795898, -0.0007840842008590698, -0.0007533729076385498, -0.0007226616144180298, -0.0006919503211975098, -0.0006612390279769897, -0.0006305277347564697, -0.0005998164415359497, -0.0005691051483154297, -0.0005383938550949097, -0.0005076825618743896, -0.00047697126865386963, -0.0004462599754333496, -0.0004155486822128296, -0.00038483738899230957, -0.00035412609577178955, -0.00032341480255126953, -0.0002927035093307495, -0.0002619922161102295, -0.00023128092288970947, -0.00020056962966918945, -0.00016985833644866943, -0.00013914704322814941, -0.0001084357500076294, -7.772445678710938e-05, -4.7013163566589355e-05, -1.6301870346069336e-05, 1.4409422874450684e-05, 4.51207160949707e-05, 7.583200931549072e-05, 0.00010654330253601074, 0.00013725459575653076, 0.00016796588897705078, 0.0001986771821975708, 0.00022938847541809082, 0.00026009976863861084, 0.00029081106185913086, 0.0003215223550796509, 0.0003522336483001709, 0.0003829449415206909, 0.00041365623474121094, 0.00044436752796173096, 0.000475078821182251, 0.000505790114402771, 0.000536501407623291, 0.000567212700843811, 0.0005979239940643311, 0.0006286352872848511, 0.0006593465805053711, 0.0006900578737258911, 0.0007207691669464111, 0.0007514804601669312, 0.0007821917533874512, 0.0008129030466079712, 0.0008436143398284912, 0.0008743256330490112, 0.0009050369262695312]}, "gradients/decoder.transformer.h.21.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 4.0, 5.0, 4.0, 4.0, 4.0, 7.0, 9.0, 17.0, 16.0, 21.0, 15.0, 16.0, 31.0, 47.0, 39.0, 50.0, 69.0, 102.0, 144.0, 289.0, 646.0, 276027.0, 769418.0, 659.0, 271.0, 175.0, 119.0, 89.0, 52.0, 43.0, 23.0, 26.0, 22.0, 18.0, 18.0, 16.0, 16.0, 7.0, 4.0, 6.0, 4.0, 2.0, 2.0, 4.0, 3.0, 2.0, 0.0, 2.0, 1.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-0.0225677490234375, -0.021863222122192383, -0.021158695220947266, -0.02045416831970215, -0.01974964141845703, -0.019045114517211914, -0.018340587615966797, -0.01763606071472168, -0.016931533813476562, -0.016227006912231445, -0.015522480010986328, -0.014817953109741211, -0.014113426208496094, -0.013408899307250977, -0.01270437240600586, -0.011999845504760742, -0.011295318603515625, -0.010590791702270508, -0.00988626480102539, -0.009181737899780273, -0.008477210998535156, -0.007772684097290039, -0.007068157196044922, -0.006363630294799805, -0.0056591033935546875, -0.00495457649230957, -0.004250049591064453, -0.003545522689819336, -0.0028409957885742188, -0.0021364688873291016, -0.0014319419860839844, -0.0007274150848388672, -2.288818359375e-05, 0.0006816387176513672, 0.0013861656188964844, 0.0020906925201416016, 0.0027952194213867188, 0.003499746322631836, 0.004204273223876953, 0.00490880012512207, 0.0056133270263671875, 0.006317853927612305, 0.007022380828857422, 0.007726907730102539, 0.008431434631347656, 0.009135961532592773, 0.00984048843383789, 0.010545015335083008, 0.011249542236328125, 0.011954069137573242, 0.01265859603881836, 0.013363122940063477, 0.014067649841308594, 0.014772176742553711, 0.015476703643798828, 0.016181230545043945, 0.016885757446289062, 0.01759028434753418, 0.018294811248779297, 0.018999338150024414, 0.01970386505126953, 0.02040839195251465, 0.021112918853759766, 0.021817445755004883, 0.02252197265625]}, "gradients/decoder.transformer.h.21.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 3.0, 9.0, 689.0, 319.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.003000686876475811, -0.0028889442328363657, -0.0027772013563662767, -0.0026654587127268314, -0.0025537158362567425, -0.002441973192617297, -0.002330230548977852, -0.002218487672507763, -0.002106744796037674, -0.0019950021523982286, -0.0018832592759281397, -0.0017715166322886944, -0.0016597737558186054, -0.0015480311121791601, -0.001436288352124393, -0.0013245455920696259, -0.0012128029484301805, -0.0011010601883754134, -0.0009893174283206463, -0.0008775747264735401, -0.0007658319664187729, -0.0006540892063640058, -0.0005423465045168996, -0.00043060374446213245, -0.0003188609844073653, -0.00020711823890451342, -9.537549340166152e-05, 1.636723754927516e-05, 0.0001281099976040423, 0.00023985275765880942, 0.00035159545950591564, 0.0004633382195606828, 0.0005750809796154499, 0.000686823739670217, 0.0007985664997249842, 0.0009103092015720904, 0.0010220520198345184, 0.0011337946634739637, 0.0012455374235287309, 0.001357280183583498, 0.0014690229436382651, 0.0015807657036930323, 0.0016925084637477994, 0.0018042512238025665, 0.0019159938674420118, 0.002027736743912101, 0.002139479387551546, 0.002251222264021635, 0.0023629649076610804, 0.0024747075513005257, 0.0025864504277706146, 0.00269819307141006, 0.002809935947880149, 0.002921678591519594, 0.003033421467989683, 0.0031451641116291285, 0.0032569067552685738, 0.003368649398908019, 0.003480392275378108, 0.0035921349190175533, 0.0037038777954876423, 0.0038156204391270876, 0.003927363082766533, 0.004039105959236622, 0.004150848835706711]}, "gradients/decoder.transformer.h.21.ln_cross_attn.bias": {"_type": "histogram", "values": [4.0, 0.0, 1.0, 2.0, 1.0, 4.0, 4.0, 5.0, 6.0, 7.0, 6.0, 6.0, 8.0, 16.0, 18.0, 17.0, 20.0, 23.0, 30.0, 27.0, 22.0, 27.0, 32.0, 32.0, 37.0, 39.0, 38.0, 41.0, 40.0, 43.0, 37.0, 26.0, 33.0, 48.0, 36.0, 33.0, 20.0, 26.0, 20.0, 27.0, 21.0, 15.0, 18.0, 17.0, 11.0, 13.0, 5.0, 12.0, 6.0, 6.0, 6.0, 6.0, 6.0, 4.0, 2.0, 2.0, 2.0, 3.0, 2.0, 0.0, 2.0, 0.0, 2.0, 1.0], "bins": [-0.0004120469093322754, -0.0003979327157139778, -0.00038381852209568024, -0.00036970432847738266, -0.0003555901348590851, -0.0003414759412407875, -0.00032736174762248993, -0.00031324755400419235, -0.0002991333603858948, -0.0002850191667675972, -0.0002709049731492996, -0.00025679077953100204, -0.00024267658591270447, -0.0002285623922944069, -0.00021444819867610931, -0.00020033400505781174, -0.00018621981143951416, -0.00017210561782121658, -0.000157991424202919, -0.00014387723058462143, -0.00012976303696632385, -0.00011564884334802628, -0.0001015346497297287, -8.742045611143112e-05, -7.330626249313354e-05, -5.919206887483597e-05, -4.507787525653839e-05, -3.0963681638240814e-05, -1.6849488019943237e-05, -2.7352944016456604e-06, 1.1378899216651917e-05, 2.5493092834949493e-05, 3.960728645324707e-05, 5.372148007154465e-05, 6.783567368984222e-05, 8.19498673081398e-05, 9.606406092643738e-05, 0.00011017825454473495, 0.00012429244816303253, 0.0001384066417813301, 0.00015252083539962769, 0.00016663502901792526, 0.00018074922263622284, 0.00019486341625452042, 0.000208977609872818, 0.00022309180349111557, 0.00023720599710941315, 0.0002513201907277107, 0.0002654343843460083, 0.0002795485779643059, 0.00029366277158260345, 0.00030777696520090103, 0.0003218911588191986, 0.0003360053524374962, 0.00035011954605579376, 0.00036423373967409134, 0.0003783479332923889, 0.0003924621269106865, 0.00040657632052898407, 0.00042069051414728165, 0.0004348047077655792, 0.0004489189013838768, 0.0004630330950021744, 0.00047714728862047195, 0.0004912614822387695]}, "gradients/decoder.transformer.h.21.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 1.0, 2.0, 2.0, 3.0, 4.0, 2.0, 7.0, 4.0, 3.0, 4.0, 4.0, 6.0, 10.0, 18.0, 14.0, 16.0, 20.0, 15.0, 22.0, 32.0, 31.0, 34.0, 32.0, 45.0, 38.0, 34.0, 44.0, 53.0, 40.0, 37.0, 50.0, 43.0, 36.0, 31.0, 33.0, 22.0, 26.0, 22.0, 30.0, 14.0, 22.0, 17.0, 13.0, 17.0, 14.0, 12.0, 8.0, 8.0, 5.0, 3.0, 1.0, 5.0, 0.0, 4.0, 2.0, 1.0, 1.0, 2.0, 0.0, 1.0, 1.0], "bins": [-4.9609375, -4.803466796875, -4.64599609375, -4.488525390625, -4.3310546875, -4.173583984375, -4.01611328125, -3.858642578125, -3.701171875, -3.543701171875, -3.38623046875, -3.228759765625, -3.0712890625, -2.913818359375, -2.75634765625, -2.598876953125, -2.44140625, -2.283935546875, -2.12646484375, -1.968994140625, -1.8115234375, -1.654052734375, -1.49658203125, -1.339111328125, -1.181640625, -1.024169921875, -0.86669921875, -0.709228515625, -0.5517578125, -0.394287109375, -0.23681640625, -0.079345703125, 0.078125, 0.235595703125, 0.39306640625, 0.550537109375, 0.7080078125, 0.865478515625, 1.02294921875, 1.180419921875, 1.337890625, 1.495361328125, 1.65283203125, 1.810302734375, 1.9677734375, 2.125244140625, 2.28271484375, 2.440185546875, 2.59765625, 2.755126953125, 2.91259765625, 3.070068359375, 3.2275390625, 3.385009765625, 3.54248046875, 3.699951171875, 3.857421875, 4.014892578125, 4.17236328125, 4.329833984375, 4.4873046875, 4.644775390625, 4.80224609375, 4.959716796875, 5.1171875]}, "gradients/decoder.transformer.h.21.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 3.0, 6.0, 2.0, 3.0, 9.0, 8.0, 5.0, 12.0, 20.0, 24.0, 42.0, 61.0, 95.0, 104.0, 194.0, 283.0, 464.0, 700.0, 1213.0, 2268.0, 4130.0, 8592.0, 19686.0, 49087.0, 140041.0, 454050.0, 238867.0, 75086.0, 28889.0, 12255.0, 5511.0, 2793.0, 1522.0, 914.0, 540.0, 351.0, 224.0, 147.0, 110.0, 73.0, 52.0, 34.0, 33.0, 22.0, 16.0, 4.0, 5.0, 3.0, 4.0, 3.0, 2.0, 3.0, 0.0, 1.0, 2.0, 1.0, 0.0, 1.0], "bins": [-3.35546875, -3.249664306640625, -3.14385986328125, -3.038055419921875, -2.9322509765625, -2.826446533203125, -2.72064208984375, -2.614837646484375, -2.509033203125, -2.403228759765625, -2.29742431640625, -2.191619873046875, -2.0858154296875, -1.980010986328125, -1.87420654296875, -1.768402099609375, -1.66259765625, -1.556793212890625, -1.45098876953125, -1.345184326171875, -1.2393798828125, -1.133575439453125, -1.02777099609375, -0.921966552734375, -0.816162109375, -0.710357666015625, -0.60455322265625, -0.498748779296875, -0.3929443359375, -0.287139892578125, -0.18133544921875, -0.075531005859375, 0.0302734375, 0.136077880859375, 0.24188232421875, 0.347686767578125, 0.4534912109375, 0.559295654296875, 0.66510009765625, 0.770904541015625, 0.876708984375, 0.982513427734375, 1.08831787109375, 1.194122314453125, 1.2999267578125, 1.405731201171875, 1.51153564453125, 1.617340087890625, 1.72314453125, 1.828948974609375, 1.93475341796875, 2.040557861328125, 2.1463623046875, 2.252166748046875, 2.35797119140625, 2.463775634765625, 2.569580078125, 2.675384521484375, 2.78118896484375, 2.886993408203125, 2.9927978515625, 3.098602294921875, 3.20440673828125, 3.310211181640625, 3.416015625]}, "gradients/decoder.transformer.h.21.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 4.0, 6.0, 5.0, 3.0, 11.0, 4.0, 12.0, 12.0, 12.0, 15.0, 25.0, 21.0, 24.0, 29.0, 36.0, 42.0, 40.0, 49.0, 58.0, 59.0, 175.0, 1865.0, 106.0, 40.0, 53.0, 40.0, 37.0, 36.0, 31.0, 39.0, 28.0, 22.0, 21.0, 19.0, 22.0, 8.0, 13.0, 7.0, 8.0, 10.0, 6.0, 0.0, 2.0, 1.0, 1.0, 5.0, 1.0, 2.0], "bins": [-23.109375, -22.4912109375, -21.873046875, -21.2548828125, -20.63671875, -20.0185546875, -19.400390625, -18.7822265625, -18.1640625, -17.5458984375, -16.927734375, -16.3095703125, -15.69140625, -15.0732421875, -14.455078125, -13.8369140625, -13.21875, -12.6005859375, -11.982421875, -11.3642578125, -10.74609375, -10.1279296875, -9.509765625, -8.8916015625, -8.2734375, -7.6552734375, -7.037109375, -6.4189453125, -5.80078125, -5.1826171875, -4.564453125, -3.9462890625, -3.328125, -2.7099609375, -2.091796875, -1.4736328125, -0.85546875, -0.2373046875, 0.380859375, 0.9990234375, 1.6171875, 2.2353515625, 2.853515625, 3.4716796875, 4.08984375, 4.7080078125, 5.326171875, 5.9443359375, 6.5625, 7.1806640625, 7.798828125, 8.4169921875, 9.03515625, 9.6533203125, 10.271484375, 10.8896484375, 11.5078125, 12.1259765625, 12.744140625, 13.3623046875, 13.98046875, 14.5986328125, 15.216796875, 15.8349609375, 16.453125]}, "gradients/decoder.transformer.h.21.attn.c_attn.weight": {"_type": "histogram", "values": [2.0, 1.0, 3.0, 2.0, 3.0, 1.0, 2.0, 6.0, 7.0, 7.0, 8.0, 11.0, 8.0, 22.0, 15.0, 25.0, 24.0, 26.0, 38.0, 57.0, 58.0, 92.0, 154.0, 323.0, 611.0, 2809.0, 2673919.0, 464093.0, 2045.0, 580.0, 261.0, 132.0, 96.0, 59.0, 41.0, 29.0, 30.0, 26.0, 16.0, 17.0, 9.0, 13.0, 10.0, 9.0, 5.0, 1.0, 7.0, 5.0, 3.0, 1.0, 2.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-27.96875, -26.93017578125, -25.8916015625, -24.85302734375, -23.814453125, -22.77587890625, -21.7373046875, -20.69873046875, -19.66015625, -18.62158203125, -17.5830078125, -16.54443359375, -15.505859375, -14.46728515625, -13.4287109375, -12.39013671875, -11.3515625, -10.31298828125, -9.2744140625, -8.23583984375, -7.197265625, -6.15869140625, -5.1201171875, -4.08154296875, -3.04296875, -2.00439453125, -0.9658203125, 0.07275390625, 1.111328125, 2.14990234375, 3.1884765625, 4.22705078125, 5.265625, 6.30419921875, 7.3427734375, 8.38134765625, 9.419921875, 10.45849609375, 11.4970703125, 12.53564453125, 13.57421875, 14.61279296875, 15.6513671875, 16.68994140625, 17.728515625, 18.76708984375, 19.8056640625, 20.84423828125, 21.8828125, 22.92138671875, 23.9599609375, 24.99853515625, 26.037109375, 27.07568359375, 28.1142578125, 29.15283203125, 30.19140625, 31.22998046875, 32.2685546875, 33.30712890625, 34.345703125, 35.38427734375, 36.4228515625, 37.46142578125, 38.5]}, "gradients/decoder.transformer.h.21.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 1.0, 2.0, 4.0, 5.0, 1.0, 12.0, 8.0, 20.0, 22.0, 27.0, 56.0, 61.0, 67.0, 91.0, 84.0, 119.0, 87.0, 79.0, 75.0, 58.0, 46.0, 27.0, 20.0, 20.0, 12.0, 2.0, 3.0, 2.0, 1.0, 3.0, 1.0], "bins": [-14.43320369720459, -14.134761810302734, -13.836319923400879, -13.537878036499023, -13.239436149597168, -12.940994262695312, -12.642552375793457, -12.344110488891602, -12.04566764831543, -11.747225761413574, -11.448783874511719, -11.150341987609863, -10.851900100708008, -10.553458213806152, -10.255016326904297, -9.956573486328125, -9.658132553100586, -9.35969066619873, -9.061248779296875, -8.76280689239502, -8.464365005493164, -8.165923118591309, -7.867480754852295, -7.5690388679504395, -7.270596981048584, -6.9721550941467285, -6.673713207244873, -6.375271320343018, -6.076828956604004, -5.778387069702148, -5.479945182800293, -5.1815032958984375, -4.883060455322266, -4.58461856842041, -4.286176681518555, -3.98773455619812, -3.6892926692962646, -3.390850782394409, -3.0924086570739746, -2.793966770172119, -2.4955248832702637, -2.197082996368408, -1.8986409902572632, -1.6001989841461182, -1.3017570972442627, -1.0033152103424072, -0.7048732042312622, -0.4064311981201172, -0.10798931121826172, 0.19045263528823853, 0.48889458179473877, 0.787336528301239, 1.0857784748077393, 1.3842203617095947, 1.6826623678207397, 1.9811043739318848, 2.2795462608337402, 2.5779881477355957, 2.876430034637451, 3.1748721599578857, 3.473314046859741, 3.7717559337615967, 4.070198059082031, 4.368639945983887, 4.667081832885742]}, "gradients/decoder.transformer.h.21.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 3.0, 3.0, 6.0, 3.0, 7.0, 5.0, 9.0, 5.0, 11.0, 13.0, 14.0, 13.0, 25.0, 27.0, 19.0, 36.0, 24.0, 34.0, 37.0, 38.0, 44.0, 48.0, 50.0, 48.0, 48.0, 42.0, 48.0, 49.0, 31.0, 32.0, 21.0, 36.0, 27.0, 24.0, 29.0, 17.0, 14.0, 10.0, 15.0, 12.0, 8.0, 7.0, 4.0, 5.0, 6.0, 3.0, 1.0, 4.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-62.50450134277344, -60.536373138427734, -58.5682487487793, -56.600120544433594, -54.631996154785156, -52.66386795043945, -50.695743560791016, -48.72761535644531, -46.759490966796875, -44.79136276245117, -42.823238372802734, -40.85511016845703, -38.886985778808594, -36.91885757446289, -34.95073318481445, -32.98260498046875, -31.01447868347168, -29.04635238647461, -27.07822608947754, -25.11009979248047, -23.1419734954834, -21.173847198486328, -19.205718994140625, -17.237594604492188, -15.2694673538208, -13.30134105682373, -11.33321475982666, -9.365087509155273, -7.396961688995361, -5.428834915161133, -3.4607086181640625, -1.4925823211669922, 0.4755439758300781, 2.4436702728271484, 4.411796569824219, 6.379923343658447, 8.34804916381836, 10.316176414489746, 12.284302711486816, 14.252429008483887, 16.22055435180664, 18.18868064880371, 20.15680694580078, 22.12493324279785, 24.093059539794922, 26.061187744140625, 28.029312133789062, 29.997440338134766, 31.965566635131836, 33.933692932128906, 35.90182113647461, 37.86994552612305, 39.83807373046875, 41.80619812011719, 43.77432632446289, 45.74245071411133, 47.71057891845703, 49.678707122802734, 51.64683151245117, 53.614959716796875, 55.58308410644531, 57.551212310791016, 59.51933670043945, 61.487464904785156, 63.455589294433594]}, "gradients/decoder.transformer.h.20.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 2.0, 2.0, 3.0, 2.0, 4.0, 4.0, 4.0, 7.0, 1.0, 5.0, 5.0, 8.0, 7.0, 17.0, 10.0, 23.0, 15.0, 20.0, 22.0, 26.0, 35.0, 30.0, 33.0, 41.0, 40.0, 29.0, 55.0, 32.0, 45.0, 33.0, 59.0, 37.0, 45.0, 27.0, 31.0, 24.0, 36.0, 21.0, 20.0, 22.0, 14.0, 18.0, 16.0, 17.0, 14.0, 10.0, 6.0, 11.0, 8.0, 4.0, 3.0, 2.0, 1.0, 3.0, 5.0, 1.0, 2.0, 0.0, 1.0, 3.0], "bins": [-5.17578125, -5.0174560546875, -4.859130859375, -4.7008056640625, -4.54248046875, -4.3841552734375, -4.225830078125, -4.0675048828125, -3.9091796875, -3.7508544921875, -3.592529296875, -3.4342041015625, -3.27587890625, -3.1175537109375, -2.959228515625, -2.8009033203125, -2.642578125, -2.4842529296875, -2.325927734375, -2.1676025390625, -2.00927734375, -1.8509521484375, -1.692626953125, -1.5343017578125, -1.3759765625, -1.2176513671875, -1.059326171875, -0.9010009765625, -0.74267578125, -0.5843505859375, -0.426025390625, -0.2677001953125, -0.109375, 0.0489501953125, 0.207275390625, 0.3656005859375, 0.52392578125, 0.6822509765625, 0.840576171875, 0.9989013671875, 1.1572265625, 1.3155517578125, 1.473876953125, 1.6322021484375, 1.79052734375, 1.9488525390625, 2.107177734375, 2.2655029296875, 2.423828125, 2.5821533203125, 2.740478515625, 2.8988037109375, 3.05712890625, 3.2154541015625, 3.373779296875, 3.5321044921875, 3.6904296875, 3.8487548828125, 4.007080078125, 4.1654052734375, 4.32373046875, 4.4820556640625, 4.640380859375, 4.7987060546875, 4.95703125]}, "gradients/decoder.transformer.h.20.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 3.0, 6.0, 1.0, 4.0, 3.0, 5.0, 5.0, 4.0, 3.0, 5.0, 8.0, 10.0, 11.0, 17.0, 14.0, 21.0, 18.0, 24.0, 34.0, 37.0, 25.0, 32.0, 47.0, 61.0, 193.0, 1555.0, 377972.0, 3794261.0, 18916.0, 528.0, 119.0, 52.0, 29.0, 23.0, 34.0, 27.0, 23.0, 26.0, 18.0, 14.0, 18.0, 13.0, 20.0, 10.0, 15.0, 7.0, 6.0, 4.0, 5.0, 2.0, 6.0, 0.0, 2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0], "bins": [-35.4375, -34.33984375, -33.2421875, -32.14453125, -31.046875, -29.94921875, -28.8515625, -27.75390625, -26.65625, -25.55859375, -24.4609375, -23.36328125, -22.265625, -21.16796875, -20.0703125, -18.97265625, -17.875, -16.77734375, -15.6796875, -14.58203125, -13.484375, -12.38671875, -11.2890625, -10.19140625, -9.09375, -7.99609375, -6.8984375, -5.80078125, -4.703125, -3.60546875, -2.5078125, -1.41015625, -0.3125, 0.78515625, 1.8828125, 2.98046875, 4.078125, 5.17578125, 6.2734375, 7.37109375, 8.46875, 9.56640625, 10.6640625, 11.76171875, 12.859375, 13.95703125, 15.0546875, 16.15234375, 17.25, 18.34765625, 19.4453125, 20.54296875, 21.640625, 22.73828125, 23.8359375, 24.93359375, 26.03125, 27.12890625, 28.2265625, 29.32421875, 30.421875, 31.51953125, 32.6171875, 33.71484375, 34.8125]}, "gradients/decoder.transformer.h.20.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 3.0, 2.0, 4.0, 4.0, 3.0, 6.0, 10.0, 15.0, 15.0, 30.0, 26.0, 41.0, 45.0, 53.0, 100.0, 121.0, 188.0, 274.0, 338.0, 499.0, 581.0, 503.0, 357.0, 259.0, 171.0, 114.0, 84.0, 61.0, 42.0, 29.0, 40.0, 15.0, 11.0, 11.0, 10.0, 7.0, 5.0, 8.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-16.0, -15.458251953125, -14.91650390625, -14.374755859375, -13.8330078125, -13.291259765625, -12.74951171875, -12.207763671875, -11.666015625, -11.124267578125, -10.58251953125, -10.040771484375, -9.4990234375, -8.957275390625, -8.41552734375, -7.873779296875, -7.33203125, -6.790283203125, -6.24853515625, -5.706787109375, -5.1650390625, -4.623291015625, -4.08154296875, -3.539794921875, -2.998046875, -2.456298828125, -1.91455078125, -1.372802734375, -0.8310546875, -0.289306640625, 0.25244140625, 0.794189453125, 1.3359375, 1.877685546875, 2.41943359375, 2.961181640625, 3.5029296875, 4.044677734375, 4.58642578125, 5.128173828125, 5.669921875, 6.211669921875, 6.75341796875, 7.295166015625, 7.8369140625, 8.378662109375, 8.92041015625, 9.462158203125, 10.00390625, 10.545654296875, 11.08740234375, 11.629150390625, 12.1708984375, 12.712646484375, 13.25439453125, 13.796142578125, 14.337890625, 14.879638671875, 15.42138671875, 15.963134765625, 16.5048828125, 17.046630859375, 17.58837890625, 18.130126953125, 18.671875]}, "gradients/decoder.transformer.h.20.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 3.0, 2.0, 4.0, 8.0, 9.0, 11.0, 23.0, 28.0, 58.0, 60.0, 88.0, 90.0, 117.0, 173.0, 332.0, 654.0, 3093.0, 3879648.0, 306870.0, 1487.0, 550.0, 295.0, 174.0, 128.0, 100.0, 58.0, 57.0, 57.0, 32.0, 19.0, 13.0, 15.0, 10.0, 9.0, 8.0, 5.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 2.0], "bins": [-87.375, -84.79296875, -82.2109375, -79.62890625, -77.046875, -74.46484375, -71.8828125, -69.30078125, -66.71875, -64.13671875, -61.5546875, -58.97265625, -56.390625, -53.80859375, -51.2265625, -48.64453125, -46.0625, -43.48046875, -40.8984375, -38.31640625, -35.734375, -33.15234375, -30.5703125, -27.98828125, -25.40625, -22.82421875, -20.2421875, -17.66015625, -15.078125, -12.49609375, -9.9140625, -7.33203125, -4.75, -2.16796875, 0.4140625, 2.99609375, 5.578125, 8.16015625, 10.7421875, 13.32421875, 15.90625, 18.48828125, 21.0703125, 23.65234375, 26.234375, 28.81640625, 31.3984375, 33.98046875, 36.5625, 39.14453125, 41.7265625, 44.30859375, 46.890625, 49.47265625, 52.0546875, 54.63671875, 57.21875, 59.80078125, 62.3828125, 64.96484375, 67.546875, 70.12890625, 72.7109375, 75.29296875, 77.875]}, "gradients/decoder.transformer.h.20.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 9.0, 181.0, 618.0, 193.0, 15.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-72.6153564453125, -64.99711608886719, -57.378875732421875, -49.76063919067383, -42.142398834228516, -34.5241584777832, -26.905921936035156, -19.287681579589844, -11.669441223144531, -4.051201820373535, 3.567037582397461, 11.18527603149414, 18.803516387939453, 26.421756744384766, 34.03999328613281, 41.658233642578125, 49.27647399902344, 56.89471435546875, 64.51295471191406, 72.13119506835938, 79.74943542480469, 87.36767578125, 94.98590850830078, 102.6041488647461, 110.2223892211914, 117.84062957763672, 125.45886993408203, 133.0771026611328, 140.69534301757812, 148.31358337402344, 155.93182373046875, 163.55006408691406, 171.16830444335938, 178.7865447998047, 186.40478515625, 194.0230255126953, 201.64126586914062, 209.25950622558594, 216.87774658203125, 224.4959716796875, 232.11422729492188, 239.7324676513672, 247.3507080078125, 254.9689483642578, 262.5871887207031, 270.2054138183594, 277.82366943359375, 285.44189453125, 293.06011962890625, 300.6783447265625, 308.2966003417969, 315.9148254394531, 323.5330810546875, 331.15130615234375, 338.7695617675781, 346.3877868652344, 354.00604248046875, 361.624267578125, 369.2425231933594, 376.8607482910156, 384.47900390625, 392.09722900390625, 399.7154846191406, 407.3337097167969, 414.95196533203125]}, "gradients/decoder.transformer.h.20.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 3.0, 5.0, 4.0, 4.0, 3.0, 5.0, 10.0, 5.0, 4.0, 5.0, 8.0, 16.0, 12.0, 14.0, 32.0, 22.0, 25.0, 33.0, 27.0, 37.0, 37.0, 39.0, 41.0, 39.0, 43.0, 40.0, 41.0, 42.0, 39.0, 33.0, 41.0, 48.0, 33.0, 29.0, 24.0, 16.0, 23.0, 26.0, 13.0, 31.0, 8.0, 15.0, 7.0, 8.0, 6.0, 5.0, 6.0, 3.0, 3.0, 1.0, 1.0, 1.0, 0.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-55.48941421508789, -53.65110778808594, -51.812801361083984, -49.97449493408203, -48.13618850708008, -46.297882080078125, -44.45957946777344, -42.621273040771484, -40.78296661376953, -38.94466018676758, -37.106353759765625, -35.26804733276367, -33.42974090576172, -31.5914363861084, -29.753129959106445, -27.914825439453125, -26.07651710510254, -24.238210678100586, -22.399904251098633, -20.561599731445312, -18.72329330444336, -16.884986877441406, -15.046680450439453, -13.208374977111816, -11.370068550109863, -9.53176212310791, -7.693456649780273, -5.85515022277832, -4.016844272613525, -2.1785383224487305, -0.34023189544677734, 1.4980735778808594, 3.3363800048828125, 5.174685955047607, 7.012991905212402, 8.851298332214355, 10.689603805541992, 12.527910232543945, 14.366216659545898, 16.20452117919922, 18.042827606201172, 19.881134033203125, 21.719440460205078, 23.55774688720703, 25.39605140686035, 27.234357833862305, 29.072664260864258, 30.910968780517578, 32.74927520751953, 34.587581634521484, 36.42588806152344, 38.26419448852539, 40.102500915527344, 41.94080352783203, 43.77911376953125, 45.61741638183594, 47.455726623535156, 49.29403305053711, 51.13233947753906, 52.970645904541016, 54.80895233154297, 56.647254943847656, 58.485565185546875, 60.32386779785156, 62.162174224853516]}, "gradients/decoder.transformer.h.20.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 2.0, 2.0, 2.0, 7.0, 4.0, 3.0, 6.0, 7.0, 6.0, 15.0, 7.0, 12.0, 23.0, 19.0, 14.0, 21.0, 29.0, 39.0, 37.0, 30.0, 39.0, 38.0, 39.0, 41.0, 45.0, 54.0, 38.0, 43.0, 45.0, 45.0, 34.0, 31.0, 28.0, 29.0, 26.0, 17.0, 23.0, 16.0, 12.0, 18.0, 19.0, 13.0, 6.0, 7.0, 3.0, 4.0, 4.0, 5.0, 2.0, 2.0, 2.0, 2.0, 0.0, 1.0, 3.0, 1.0], "bins": [-5.734375, -5.56292724609375, -5.3914794921875, -5.22003173828125, -5.048583984375, -4.87713623046875, -4.7056884765625, -4.53424072265625, -4.36279296875, -4.19134521484375, -4.0198974609375, -3.84844970703125, -3.677001953125, -3.50555419921875, -3.3341064453125, -3.16265869140625, -2.9912109375, -2.81976318359375, -2.6483154296875, -2.47686767578125, -2.305419921875, -2.13397216796875, -1.9625244140625, -1.79107666015625, -1.61962890625, -1.44818115234375, -1.2767333984375, -1.10528564453125, -0.933837890625, -0.76239013671875, -0.5909423828125, -0.41949462890625, -0.248046875, -0.07659912109375, 0.0948486328125, 0.26629638671875, 0.437744140625, 0.60919189453125, 0.7806396484375, 0.95208740234375, 1.12353515625, 1.29498291015625, 1.4664306640625, 1.63787841796875, 1.809326171875, 1.98077392578125, 2.1522216796875, 2.32366943359375, 2.4951171875, 2.66656494140625, 2.8380126953125, 3.00946044921875, 3.180908203125, 3.35235595703125, 3.5238037109375, 3.69525146484375, 3.86669921875, 4.03814697265625, 4.2095947265625, 4.38104248046875, 4.552490234375, 4.72393798828125, 4.8953857421875, 5.06683349609375, 5.23828125]}, "gradients/decoder.transformer.h.20.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 3.0, 3.0, 2.0, 4.0, 5.0, 7.0, 12.0, 21.0, 20.0, 36.0, 57.0, 64.0, 108.0, 176.0, 237.0, 358.0, 491.0, 808.0, 1190.0, 1794.0, 2739.0, 4199.0, 6408.0, 10184.0, 15831.0, 25140.0, 41872.0, 72065.0, 139274.0, 358576.0, 164324.0, 80479.0, 45865.0, 27583.0, 17158.0, 11038.0, 7052.0, 4538.0, 2992.0, 1932.0, 1290.0, 912.0, 543.0, 389.0, 233.0, 186.0, 115.0, 81.0, 58.0, 44.0, 19.0, 22.0, 15.0, 8.0, 6.0, 3.0, 2.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0], "bins": [-1.4140625, -1.3678436279296875, -1.321624755859375, -1.2754058837890625, -1.22918701171875, -1.1829681396484375, -1.136749267578125, -1.0905303955078125, -1.0443115234375, -0.9980926513671875, -0.951873779296875, -0.9056549072265625, -0.85943603515625, -0.8132171630859375, -0.766998291015625, -0.7207794189453125, -0.674560546875, -0.6283416748046875, -0.582122802734375, -0.5359039306640625, -0.48968505859375, -0.4434661865234375, -0.397247314453125, -0.3510284423828125, -0.3048095703125, -0.2585906982421875, -0.212371826171875, -0.1661529541015625, -0.11993408203125, -0.0737152099609375, -0.027496337890625, 0.0187225341796875, 0.06494140625, 0.1111602783203125, 0.157379150390625, 0.2035980224609375, 0.24981689453125, 0.2960357666015625, 0.342254638671875, 0.3884735107421875, 0.4346923828125, 0.4809112548828125, 0.527130126953125, 0.5733489990234375, 0.61956787109375, 0.6657867431640625, 0.712005615234375, 0.7582244873046875, 0.804443359375, 0.8506622314453125, 0.896881103515625, 0.9430999755859375, 0.98931884765625, 1.0355377197265625, 1.081756591796875, 1.1279754638671875, 1.1741943359375, 1.2204132080078125, 1.266632080078125, 1.3128509521484375, 1.35906982421875, 1.4052886962890625, 1.451507568359375, 1.4977264404296875, 1.5439453125]}, "gradients/decoder.transformer.h.20.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 3.0, 1.0, 3.0, 5.0, 4.0, 4.0, 7.0, 5.0, 10.0, 10.0, 10.0, 19.0, 21.0, 21.0, 15.0, 37.0, 33.0, 33.0, 36.0, 30.0, 28.0, 34.0, 53.0, 53.0, 1064.0, 41.0, 56.0, 32.0, 50.0, 27.0, 28.0, 24.0, 21.0, 29.0, 33.0, 17.0, 22.0, 17.0, 18.0, 17.0, 7.0, 9.0, 7.0, 13.0, 8.0, 8.0, 3.0, 6.0, 2.0, 2.0, 1.0, 3.0, 1.0, 1.0, 0.0, 1.0], "bins": [-3.607421875, -3.49755859375, -3.3876953125, -3.27783203125, -3.16796875, -3.05810546875, -2.9482421875, -2.83837890625, -2.728515625, -2.61865234375, -2.5087890625, -2.39892578125, -2.2890625, -2.17919921875, -2.0693359375, -1.95947265625, -1.849609375, -1.73974609375, -1.6298828125, -1.52001953125, -1.41015625, -1.30029296875, -1.1904296875, -1.08056640625, -0.970703125, -0.86083984375, -0.7509765625, -0.64111328125, -0.53125, -0.42138671875, -0.3115234375, -0.20166015625, -0.091796875, 0.01806640625, 0.1279296875, 0.23779296875, 0.34765625, 0.45751953125, 0.5673828125, 0.67724609375, 0.787109375, 0.89697265625, 1.0068359375, 1.11669921875, 1.2265625, 1.33642578125, 1.4462890625, 1.55615234375, 1.666015625, 1.77587890625, 1.8857421875, 1.99560546875, 2.10546875, 2.21533203125, 2.3251953125, 2.43505859375, 2.544921875, 2.65478515625, 2.7646484375, 2.87451171875, 2.984375, 3.09423828125, 3.2041015625, 3.31396484375, 3.423828125]}, "gradients/decoder.transformer.h.20.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 3.0, 2.0, 2.0, 2.0, 4.0, 6.0, 9.0, 12.0, 14.0, 17.0, 24.0, 25.0, 61.0, 84.0, 155.0, 217.0, 385.0, 694.0, 1161.0, 2082.0, 3627.0, 6380.0, 11703.0, 22363.0, 43636.0, 89154.0, 229860.0, 1448847.0, 117531.0, 56604.0, 28834.0, 14949.0, 7978.0, 4560.0, 2541.0, 1456.0, 907.0, 496.0, 278.0, 187.0, 110.0, 67.0, 44.0, 18.0, 20.0, 10.0, 8.0, 4.0, 7.0, 3.0, 4.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-1.88671875, -1.8262176513671875, -1.765716552734375, -1.7052154541015625, -1.64471435546875, -1.5842132568359375, -1.523712158203125, -1.4632110595703125, -1.4027099609375, -1.3422088623046875, -1.281707763671875, -1.2212066650390625, -1.16070556640625, -1.1002044677734375, -1.039703369140625, -0.9792022705078125, -0.918701171875, -0.8582000732421875, -0.797698974609375, -0.7371978759765625, -0.67669677734375, -0.6161956787109375, -0.555694580078125, -0.4951934814453125, -0.4346923828125, -0.3741912841796875, -0.313690185546875, -0.2531890869140625, -0.19268798828125, -0.1321868896484375, -0.071685791015625, -0.0111846923828125, 0.04931640625, 0.1098175048828125, 0.170318603515625, 0.2308197021484375, 0.29132080078125, 0.3518218994140625, 0.412322998046875, 0.4728240966796875, 0.5333251953125, 0.5938262939453125, 0.654327392578125, 0.7148284912109375, 0.77532958984375, 0.8358306884765625, 0.896331787109375, 0.9568328857421875, 1.017333984375, 1.0778350830078125, 1.138336181640625, 1.1988372802734375, 1.25933837890625, 1.3198394775390625, 1.380340576171875, 1.4408416748046875, 1.5013427734375, 1.5618438720703125, 1.622344970703125, 1.6828460693359375, 1.74334716796875, 1.8038482666015625, 1.864349365234375, 1.9248504638671875, 1.9853515625]}, "gradients/decoder.transformer.h.20.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 4.0, 3.0, 2.0, 6.0, 3.0, 7.0, 6.0, 9.0, 5.0, 4.0, 9.0, 17.0, 16.0, 19.0, 24.0, 26.0, 28.0, 39.0, 44.0, 65.0, 69.0, 107.0, 59.0, 85.0, 60.0, 52.0, 43.0, 36.0, 30.0, 21.0, 24.0, 16.0, 12.0, 9.0, 7.0, 10.0, 4.0, 6.0, 6.0, 6.0, 4.0, 5.0, 1.0, 5.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.0008206367492675781, -0.0007958412170410156, -0.0007710456848144531, -0.0007462501525878906, -0.0007214546203613281, -0.0006966590881347656, -0.0006718635559082031, -0.0006470680236816406, -0.0006222724914550781, -0.0005974769592285156, -0.0005726814270019531, -0.0005478858947753906, -0.0005230903625488281, -0.0004982948303222656, -0.0004734992980957031, -0.0004487037658691406, -0.0004239082336425781, -0.0003991127014160156, -0.0003743171691894531, -0.0003495216369628906, -0.0003247261047363281, -0.0002999305725097656, -0.0002751350402832031, -0.0002503395080566406, -0.00022554397583007812, -0.00020074844360351562, -0.00017595291137695312, -0.00015115737915039062, -0.00012636184692382812, -0.00010156631469726562, -7.677078247070312e-05, -5.1975250244140625e-05, -2.7179718017578125e-05, -2.384185791015625e-06, 2.2411346435546875e-05, 4.7206878662109375e-05, 7.200241088867188e-05, 9.679794311523438e-05, 0.00012159347534179688, 0.00014638900756835938, 0.00017118453979492188, 0.00019598007202148438, 0.00022077560424804688, 0.0002455711364746094, 0.0002703666687011719, 0.0002951622009277344, 0.0003199577331542969, 0.0003447532653808594, 0.0003695487976074219, 0.0003943443298339844, 0.0004191398620605469, 0.0004439353942871094, 0.0004687309265136719, 0.0004935264587402344, 0.0005183219909667969, 0.0005431175231933594, 0.0005679130554199219, 0.0005927085876464844, 0.0006175041198730469, 0.0006422996520996094, 0.0006670951843261719, 0.0006918907165527344, 0.0007166862487792969, 0.0007414817810058594, 0.0007662773132324219]}, "gradients/decoder.transformer.h.20.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 2.0, 3.0, 5.0, 1.0, 3.0, 1.0, 5.0, 4.0, 4.0, 4.0, 2.0, 5.0, 7.0, 7.0, 11.0, 9.0, 23.0, 34.0, 46.0, 49.0, 60.0, 107.0, 122.0, 238.0, 433.0, 2567.0, 1040210.0, 3445.0, 442.0, 230.0, 121.0, 86.0, 73.0, 51.0, 22.0, 30.0, 22.0, 14.0, 11.0, 13.0, 7.0, 6.0, 6.0, 7.0, 2.0, 1.0, 5.0, 4.0, 2.0, 2.0, 1.0, 3.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.0172119140625, -0.01662898063659668, -0.01604604721069336, -0.015463113784790039, -0.014880180358886719, -0.014297246932983398, -0.013714313507080078, -0.013131380081176758, -0.012548446655273438, -0.011965513229370117, -0.011382579803466797, -0.010799646377563477, -0.010216712951660156, -0.009633779525756836, -0.009050846099853516, -0.008467912673950195, -0.007884979248046875, -0.007302045822143555, -0.006719112396240234, -0.006136178970336914, -0.005553245544433594, -0.0049703121185302734, -0.004387378692626953, -0.003804445266723633, -0.0032215118408203125, -0.002638578414916992, -0.002055644989013672, -0.0014727115631103516, -0.0008897781372070312, -0.00030684471130371094, 0.0002760887145996094, 0.0008590221405029297, 0.00144195556640625, 0.0020248889923095703, 0.0026078224182128906, 0.003190755844116211, 0.0037736892700195312, 0.0043566226959228516, 0.004939556121826172, 0.005522489547729492, 0.0061054229736328125, 0.006688356399536133, 0.007271289825439453, 0.007854223251342773, 0.008437156677246094, 0.009020090103149414, 0.009603023529052734, 0.010185956954956055, 0.010768890380859375, 0.011351823806762695, 0.011934757232666016, 0.012517690658569336, 0.013100624084472656, 0.013683557510375977, 0.014266490936279297, 0.014849424362182617, 0.015432357788085938, 0.016015291213989258, 0.016598224639892578, 0.0171811580657959, 0.01776409149169922, 0.01834702491760254, 0.01892995834350586, 0.01951289176940918, 0.0200958251953125]}, "gradients/decoder.transformer.h.20.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 39.0, 277.0, 515.0, 168.0, 14.0, 3.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0013334067771211267, -0.0012828508624807, -0.0012322949478402734, -0.0011817390331998467, -0.0011311830021440983, -0.0010806270875036716, -0.001030071172863245, -0.0009795152582228184, -0.0009289593435823917, -0.0008784034289419651, -0.0008278475143015385, -0.0007772915414534509, -0.0007267356268130243, -0.0006761797121725976, -0.0006256237393245101, -0.0005750678246840835, -0.0005245119100436568, -0.0004739559954032302, -0.0004234000516589731, -0.000372844107914716, -0.00032228819327428937, -0.00027173227863386273, -0.00022117633488960564, -0.00017062039114534855, -0.00012006447650492191, -6.950854731258005e-05, -1.8952618120238185e-05, 3.160331107210368e-05, 8.215924026444554e-05, 0.00013271515490487218, 0.00018327109864912927, 0.00023382704239338636, 0.000284382957033813, 0.00033493887167423964, 0.00038549481541849673, 0.0004360507591627538, 0.00048660667380318046, 0.0005371625884436071, 0.0005877185612916946, 0.0006382744759321213, 0.0006888303905725479, 0.0007393863052129745, 0.0007899422198534012, 0.0008404981927014887, 0.0008910541073419154, 0.000941610021982342, 0.0009921659948304296, 0.0010427219094708562, 0.0010932778241112828, 0.0011438337387517095, 0.001194389653392136, 0.0012449455680325627, 0.0012955015990883112, 0.0013460575137287378, 0.0013966134283691645, 0.001447169343009591, 0.0014977252576500177, 0.0015482811722904444, 0.001598837086930871, 0.0016493930015712976, 0.0016999489162117243, 0.001750504830852151, 0.0018010608619078994, 0.001851616776548326, 0.0019021726911887527]}, "gradients/decoder.transformer.h.20.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 5.0, 1.0, 3.0, 4.0, 4.0, 7.0, 4.0, 3.0, 11.0, 15.0, 9.0, 16.0, 21.0, 23.0, 22.0, 22.0, 33.0, 34.0, 34.0, 29.0, 35.0, 50.0, 46.0, 43.0, 47.0, 40.0, 46.0, 36.0, 40.0, 42.0, 40.0, 42.0, 31.0, 35.0, 28.0, 29.0, 18.0, 14.0, 14.0, 9.0, 9.0, 4.0, 4.0, 3.0, 8.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.00042945146560668945, -0.00041625555604696274, -0.000403059646487236, -0.0003898637369275093, -0.0003766678273677826, -0.0003634719178080559, -0.00035027600824832916, -0.00033708009868860245, -0.00032388418912887573, -0.000310688279569149, -0.0002974923700094223, -0.0002842964604496956, -0.00027110055088996887, -0.00025790464133024216, -0.00024470873177051544, -0.00023151282221078873, -0.000218316912651062, -0.0002051210030913353, -0.00019192509353160858, -0.00017872918397188187, -0.00016553327441215515, -0.00015233736485242844, -0.00013914145529270172, -0.000125945545732975, -0.00011274963617324829, -9.955372661352158e-05, -8.635781705379486e-05, -7.316190749406815e-05, -5.996599793434143e-05, -4.6770088374614716e-05, -3.3574178814888e-05, -2.0378269255161285e-05, -7.18235969543457e-06, 6.013549864292145e-06, 1.920945942401886e-05, 3.2405368983745575e-05, 4.560127854347229e-05, 5.8797188103199005e-05, 7.199309766292572e-05, 8.518900722265244e-05, 9.838491678237915e-05, 0.00011158082634210587, 0.00012477673590183258, 0.0001379726454615593, 0.000151168555021286, 0.00016436446458101273, 0.00017756037414073944, 0.00019075628370046616, 0.00020395219326019287, 0.00021714810281991959, 0.0002303440123796463, 0.00024353992193937302, 0.00025673583149909973, 0.00026993174105882645, 0.00028312765061855316, 0.0002963235601782799, 0.0003095194697380066, 0.0003227153792977333, 0.00033591128885746, 0.00034910719841718674, 0.00036230310797691345, 0.00037549901753664017, 0.0003886949270963669, 0.0004018908366560936, 0.0004150867462158203]}, "gradients/decoder.transformer.h.20.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 2.0, 2.0, 2.0, 7.0, 4.0, 3.0, 6.0, 8.0, 5.0, 15.0, 7.0, 12.0, 23.0, 19.0, 14.0, 21.0, 29.0, 39.0, 37.0, 30.0, 39.0, 38.0, 39.0, 41.0, 45.0, 54.0, 38.0, 43.0, 45.0, 45.0, 34.0, 31.0, 28.0, 29.0, 26.0, 17.0, 23.0, 16.0, 12.0, 18.0, 19.0, 13.0, 6.0, 7.0, 3.0, 4.0, 4.0, 5.0, 2.0, 2.0, 2.0, 2.0, 0.0, 1.0, 3.0, 1.0], "bins": [-5.734375, -5.56292724609375, -5.3914794921875, -5.22003173828125, -5.048583984375, -4.87713623046875, -4.7056884765625, -4.53424072265625, -4.36279296875, -4.19134521484375, -4.0198974609375, -3.84844970703125, -3.677001953125, -3.50555419921875, -3.3341064453125, -3.16265869140625, -2.9912109375, -2.81976318359375, -2.6483154296875, -2.47686767578125, -2.305419921875, -2.13397216796875, -1.9625244140625, -1.79107666015625, -1.61962890625, -1.44818115234375, -1.2767333984375, -1.10528564453125, -0.933837890625, -0.76239013671875, -0.5909423828125, -0.41949462890625, -0.248046875, -0.07659912109375, 0.0948486328125, 0.26629638671875, 0.437744140625, 0.60919189453125, 0.7806396484375, 0.95208740234375, 1.12353515625, 1.29498291015625, 1.4664306640625, 1.63787841796875, 1.809326171875, 1.98077392578125, 2.1522216796875, 2.32366943359375, 2.4951171875, 2.66656494140625, 2.8380126953125, 3.00946044921875, 3.180908203125, 3.35235595703125, 3.5238037109375, 3.69525146484375, 3.86669921875, 4.03814697265625, 4.2095947265625, 4.38104248046875, 4.552490234375, 4.72393798828125, 4.8953857421875, 5.06683349609375, 5.23828125]}, "gradients/decoder.transformer.h.20.attn.c_proj.weight": {"_type": "histogram", "values": [3.0, 0.0, 1.0, 0.0, 3.0, 5.0, 3.0, 2.0, 6.0, 12.0, 12.0, 19.0, 40.0, 35.0, 60.0, 97.0, 142.0, 204.0, 294.0, 432.0, 756.0, 1312.0, 2209.0, 3949.0, 7978.0, 17344.0, 43932.0, 149032.0, 520676.0, 203381.0, 55753.0, 20968.0, 9239.0, 4582.0, 2392.0, 1449.0, 751.0, 496.0, 313.0, 211.0, 146.0, 95.0, 79.0, 50.0, 30.0, 19.0, 19.0, 11.0, 9.0, 3.0, 8.0, 5.0, 3.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.2421875, -3.12884521484375, -3.0155029296875, -2.90216064453125, -2.788818359375, -2.67547607421875, -2.5621337890625, -2.44879150390625, -2.33544921875, -2.22210693359375, -2.1087646484375, -1.99542236328125, -1.882080078125, -1.76873779296875, -1.6553955078125, -1.54205322265625, -1.4287109375, -1.31536865234375, -1.2020263671875, -1.08868408203125, -0.975341796875, -0.86199951171875, -0.7486572265625, -0.63531494140625, -0.52197265625, -0.40863037109375, -0.2952880859375, -0.18194580078125, -0.068603515625, 0.04473876953125, 0.1580810546875, 0.27142333984375, 0.384765625, 0.49810791015625, 0.6114501953125, 0.72479248046875, 0.838134765625, 0.95147705078125, 1.0648193359375, 1.17816162109375, 1.29150390625, 1.40484619140625, 1.5181884765625, 1.63153076171875, 1.744873046875, 1.85821533203125, 1.9715576171875, 2.08489990234375, 2.1982421875, 2.31158447265625, 2.4249267578125, 2.53826904296875, 2.651611328125, 2.76495361328125, 2.8782958984375, 2.99163818359375, 3.10498046875, 3.21832275390625, 3.3316650390625, 3.44500732421875, 3.558349609375, 3.67169189453125, 3.7850341796875, 3.89837646484375, 4.01171875]}, "gradients/decoder.transformer.h.20.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 2.0, 0.0, 2.0, 0.0, 5.0, 4.0, 5.0, 0.0, 2.0, 7.0, 5.0, 14.0, 4.0, 14.0, 14.0, 24.0, 23.0, 31.0, 18.0, 20.0, 21.0, 32.0, 36.0, 30.0, 38.0, 40.0, 59.0, 100.0, 1916.0, 119.0, 64.0, 45.0, 36.0, 46.0, 35.0, 36.0, 29.0, 23.0, 26.0, 32.0, 15.0, 12.0, 11.0, 19.0, 11.0, 7.0, 7.0, 5.0, 9.0, 3.0, 0.0, 3.0, 2.0, 1.0, 1.0, 2.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0], "bins": [-17.84375, -17.25927734375, -16.6748046875, -16.09033203125, -15.505859375, -14.92138671875, -14.3369140625, -13.75244140625, -13.16796875, -12.58349609375, -11.9990234375, -11.41455078125, -10.830078125, -10.24560546875, -9.6611328125, -9.07666015625, -8.4921875, -7.90771484375, -7.3232421875, -6.73876953125, -6.154296875, -5.56982421875, -4.9853515625, -4.40087890625, -3.81640625, -3.23193359375, -2.6474609375, -2.06298828125, -1.478515625, -0.89404296875, -0.3095703125, 0.27490234375, 0.859375, 1.44384765625, 2.0283203125, 2.61279296875, 3.197265625, 3.78173828125, 4.3662109375, 4.95068359375, 5.53515625, 6.11962890625, 6.7041015625, 7.28857421875, 7.873046875, 8.45751953125, 9.0419921875, 9.62646484375, 10.2109375, 10.79541015625, 11.3798828125, 11.96435546875, 12.548828125, 13.13330078125, 13.7177734375, 14.30224609375, 14.88671875, 15.47119140625, 16.0556640625, 16.64013671875, 17.224609375, 17.80908203125, 18.3935546875, 18.97802734375, 19.5625]}, "gradients/decoder.transformer.h.20.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 0.0, 2.0, 0.0, 2.0, 1.0, 0.0, 2.0, 3.0, 1.0, 4.0, 6.0, 6.0, 11.0, 7.0, 9.0, 17.0, 12.0, 15.0, 14.0, 36.0, 33.0, 36.0, 44.0, 61.0, 90.0, 131.0, 195.0, 362.0, 865.0, 5166.0, 2900640.0, 233644.0, 2688.0, 674.0, 271.0, 174.0, 104.0, 88.0, 64.0, 38.0, 36.0, 24.0, 27.0, 22.0, 20.0, 19.0, 9.0, 4.0, 11.0, 5.0, 9.0, 1.0, 0.0, 5.0, 5.0, 4.0, 1.0, 2.0, 2.0, 2.0], "bins": [-32.65625, -31.7197265625, -30.783203125, -29.8466796875, -28.91015625, -27.9736328125, -27.037109375, -26.1005859375, -25.1640625, -24.2275390625, -23.291015625, -22.3544921875, -21.41796875, -20.4814453125, -19.544921875, -18.6083984375, -17.671875, -16.7353515625, -15.798828125, -14.8623046875, -13.92578125, -12.9892578125, -12.052734375, -11.1162109375, -10.1796875, -9.2431640625, -8.306640625, -7.3701171875, -6.43359375, -5.4970703125, -4.560546875, -3.6240234375, -2.6875, -1.7509765625, -0.814453125, 0.1220703125, 1.05859375, 1.9951171875, 2.931640625, 3.8681640625, 4.8046875, 5.7412109375, 6.677734375, 7.6142578125, 8.55078125, 9.4873046875, 10.423828125, 11.3603515625, 12.296875, 13.2333984375, 14.169921875, 15.1064453125, 16.04296875, 16.9794921875, 17.916015625, 18.8525390625, 19.7890625, 20.7255859375, 21.662109375, 22.5986328125, 23.53515625, 24.4716796875, 25.408203125, 26.3447265625, 27.28125]}, "gradients/decoder.transformer.h.20.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 14.0, 348.0, 592.0, 60.0, 1.0, 1.0, 2.0], "bins": [-144.5220184326172, -142.07830810546875, -139.63461303710938, -137.19090270996094, -134.74720764160156, -132.30349731445312, -129.85980224609375, -127.41609191894531, -124.9723892211914, -122.5286865234375, -120.0849838256836, -117.64128112792969, -115.19757080078125, -112.75386810302734, -110.31016540527344, -107.86646270751953, -105.42276000976562, -102.97905731201172, -100.53535461425781, -98.09164428710938, -95.64794158935547, -93.20423889160156, -90.76053619384766, -88.31683349609375, -85.87312316894531, -83.4294204711914, -80.9857177734375, -78.54200744628906, -76.09830474853516, -73.65460205078125, -71.21089935302734, -68.76719665527344, -66.32349395751953, -63.879791259765625, -61.43608474731445, -58.99238204956055, -56.54867935180664, -54.10497283935547, -51.66127014160156, -49.217567443847656, -46.77386474609375, -44.330162048339844, -41.88645553588867, -39.442752838134766, -36.99905014038086, -34.55534362792969, -32.11164093017578, -29.667938232421875, -27.22423553466797, -24.78053092956543, -22.336828231811523, -19.893123626708984, -17.449420928955078, -15.005716323852539, -12.56201171875, -10.118309020996094, -7.674603462219238, -5.230899810791016, -2.7871956825256348, -0.3434915542602539, 2.1002120971679688, 4.543915748596191, 6.9876203536987305, 9.431323051452637, 11.875027656555176]}, "gradients/decoder.transformer.h.20.ln_1.bias": {"_type": "histogram", "values": [2.0, 2.0, 1.0, 4.0, 7.0, 5.0, 5.0, 6.0, 5.0, 10.0, 7.0, 12.0, 16.0, 15.0, 14.0, 21.0, 20.0, 25.0, 16.0, 22.0, 21.0, 22.0, 28.0, 51.0, 38.0, 34.0, 42.0, 52.0, 43.0, 34.0, 36.0, 41.0, 41.0, 35.0, 29.0, 30.0, 25.0, 25.0, 16.0, 24.0, 20.0, 21.0, 20.0, 15.0, 9.0, 10.0, 14.0, 7.0, 4.0, 5.0, 1.0, 3.0, 2.0, 3.0, 1.0, 0.0, 1.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-48.98492431640625, -47.245452880859375, -45.5059814453125, -43.766510009765625, -42.02703857421875, -40.287567138671875, -38.548095703125, -36.808624267578125, -35.06915283203125, -33.329681396484375, -31.5902099609375, -29.850738525390625, -28.11126708984375, -26.371795654296875, -24.632326126098633, -22.892854690551758, -21.153385162353516, -19.41391372680664, -17.674442291259766, -15.934971809387207, -14.195500373840332, -12.456028938293457, -10.716558456420898, -8.977087020874023, -7.237615585327148, -5.498144149780273, -3.7586731910705566, -2.01920223236084, -0.27973079681396484, 1.4597406387329102, 3.1992111206054688, 4.938682556152344, 6.678153991699219, 8.417625427246094, 10.157096862792969, 11.896567344665527, 13.636038780212402, 15.375510215759277, 17.114980697631836, 18.85445213317871, 20.593923568725586, 22.33339500427246, 24.072866439819336, 25.812335968017578, 27.551807403564453, 29.291278839111328, 31.030750274658203, 32.77022171020508, 34.50969314575195, 36.24916458129883, 37.9886360168457, 39.72810745239258, 41.46757888793945, 43.20705032348633, 44.94651794433594, 46.68598937988281, 48.42546081542969, 50.16493225097656, 51.90440368652344, 53.64387512207031, 55.38334655761719, 57.12281799316406, 58.86228942871094, 60.60176086425781, 62.34123229980469]}, "gradients/decoder.transformer.h.19.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 1.0, 3.0, 5.0, 6.0, 4.0, 4.0, 6.0, 6.0, 12.0, 8.0, 12.0, 15.0, 24.0, 19.0, 14.0, 29.0, 36.0, 25.0, 36.0, 41.0, 38.0, 41.0, 43.0, 38.0, 56.0, 45.0, 44.0, 47.0, 32.0, 43.0, 31.0, 33.0, 23.0, 33.0, 16.0, 23.0, 17.0, 14.0, 17.0, 17.0, 15.0, 10.0, 6.0, 6.0, 3.0, 2.0, 4.0, 4.0, 2.0, 3.0, 0.0, 2.0, 0.0, 4.0, 1.0], "bins": [-5.9140625, -5.73931884765625, -5.5645751953125, -5.38983154296875, -5.215087890625, -5.04034423828125, -4.8656005859375, -4.69085693359375, -4.51611328125, -4.34136962890625, -4.1666259765625, -3.99188232421875, -3.817138671875, -3.64239501953125, -3.4676513671875, -3.29290771484375, -3.1181640625, -2.94342041015625, -2.7686767578125, -2.59393310546875, -2.419189453125, -2.24444580078125, -2.0697021484375, -1.89495849609375, -1.72021484375, -1.54547119140625, -1.3707275390625, -1.19598388671875, -1.021240234375, -0.84649658203125, -0.6717529296875, -0.49700927734375, -0.322265625, -0.14752197265625, 0.0272216796875, 0.20196533203125, 0.376708984375, 0.55145263671875, 0.7261962890625, 0.90093994140625, 1.07568359375, 1.25042724609375, 1.4251708984375, 1.59991455078125, 1.774658203125, 1.94940185546875, 2.1241455078125, 2.29888916015625, 2.4736328125, 2.64837646484375, 2.8231201171875, 2.99786376953125, 3.172607421875, 3.34735107421875, 3.5220947265625, 3.69683837890625, 3.87158203125, 4.04632568359375, 4.2210693359375, 4.39581298828125, 4.570556640625, 4.74530029296875, 4.9200439453125, 5.09478759765625, 5.26953125]}, "gradients/decoder.transformer.h.19.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 5.0, 2.0, 4.0, 2.0, 6.0, 8.0, 7.0, 5.0, 9.0, 10.0, 20.0, 20.0, 30.0, 48.0, 64.0, 90.0, 157.0, 364.0, 1073.0, 4831.0, 35371.0, 370993.0, 2154583.0, 1434340.0, 170370.0, 17788.0, 2706.0, 675.0, 273.0, 137.0, 86.0, 64.0, 33.0, 27.0, 18.0, 23.0, 5.0, 10.0, 7.0, 6.0, 10.0, 3.0, 4.0, 4.0, 2.0, 4.0, 2.0, 1.0, 2.0], "bins": [-13.75, -13.3865966796875, -13.023193359375, -12.6597900390625, -12.29638671875, -11.9329833984375, -11.569580078125, -11.2061767578125, -10.8427734375, -10.4793701171875, -10.115966796875, -9.7525634765625, -9.38916015625, -9.0257568359375, -8.662353515625, -8.2989501953125, -7.935546875, -7.5721435546875, -7.208740234375, -6.8453369140625, -6.48193359375, -6.1185302734375, -5.755126953125, -5.3917236328125, -5.0283203125, -4.6649169921875, -4.301513671875, -3.9381103515625, -3.57470703125, -3.2113037109375, -2.847900390625, -2.4844970703125, -2.12109375, -1.7576904296875, -1.394287109375, -1.0308837890625, -0.66748046875, -0.3040771484375, 0.059326171875, 0.4227294921875, 0.7861328125, 1.1495361328125, 1.512939453125, 1.8763427734375, 2.23974609375, 2.6031494140625, 2.966552734375, 3.3299560546875, 3.693359375, 4.0567626953125, 4.420166015625, 4.7835693359375, 5.14697265625, 5.5103759765625, 5.873779296875, 6.2371826171875, 6.6005859375, 6.9639892578125, 7.327392578125, 7.6907958984375, 8.05419921875, 8.4176025390625, 8.781005859375, 9.1444091796875, 9.5078125]}, "gradients/decoder.transformer.h.19.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 0.0, 7.0, 3.0, 8.0, 2.0, 5.0, 16.0, 11.0, 21.0, 35.0, 42.0, 36.0, 65.0, 95.0, 114.0, 199.0, 277.0, 388.0, 554.0, 596.0, 492.0, 348.0, 240.0, 156.0, 108.0, 70.0, 56.0, 38.0, 28.0, 20.0, 21.0, 7.0, 6.0, 7.0, 3.0, 1.0, 3.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0], "bins": [-19.84375, -19.246337890625, -18.64892578125, -18.051513671875, -17.4541015625, -16.856689453125, -16.25927734375, -15.661865234375, -15.064453125, -14.467041015625, -13.86962890625, -13.272216796875, -12.6748046875, -12.077392578125, -11.47998046875, -10.882568359375, -10.28515625, -9.687744140625, -9.09033203125, -8.492919921875, -7.8955078125, -7.298095703125, -6.70068359375, -6.103271484375, -5.505859375, -4.908447265625, -4.31103515625, -3.713623046875, -3.1162109375, -2.518798828125, -1.92138671875, -1.323974609375, -0.7265625, -0.129150390625, 0.46826171875, 1.065673828125, 1.6630859375, 2.260498046875, 2.85791015625, 3.455322265625, 4.052734375, 4.650146484375, 5.24755859375, 5.844970703125, 6.4423828125, 7.039794921875, 7.63720703125, 8.234619140625, 8.83203125, 9.429443359375, 10.02685546875, 10.624267578125, 11.2216796875, 11.819091796875, 12.41650390625, 13.013916015625, 13.611328125, 14.208740234375, 14.80615234375, 15.403564453125, 16.0009765625, 16.598388671875, 17.19580078125, 17.793212890625, 18.390625]}, "gradients/decoder.transformer.h.19.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 5.0, 3.0, 5.0, 7.0, 3.0, 8.0, 21.0, 13.0, 24.0, 31.0, 39.0, 55.0, 94.0, 133.0, 200.0, 376.0, 1096.0, 154511.0, 4033207.0, 3151.0, 479.0, 258.0, 178.0, 122.0, 67.0, 60.0, 32.0, 41.0, 25.0, 12.0, 11.0, 7.0, 5.0, 4.0, 2.0, 4.0, 3.0, 1.0, 3.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-88.875, -85.9326171875, -82.990234375, -80.0478515625, -77.10546875, -74.1630859375, -71.220703125, -68.2783203125, -65.3359375, -62.3935546875, -59.451171875, -56.5087890625, -53.56640625, -50.6240234375, -47.681640625, -44.7392578125, -41.796875, -38.8544921875, -35.912109375, -32.9697265625, -30.02734375, -27.0849609375, -24.142578125, -21.2001953125, -18.2578125, -15.3154296875, -12.373046875, -9.4306640625, -6.48828125, -3.5458984375, -0.603515625, 2.3388671875, 5.28125, 8.2236328125, 11.166015625, 14.1083984375, 17.05078125, 19.9931640625, 22.935546875, 25.8779296875, 28.8203125, 31.7626953125, 34.705078125, 37.6474609375, 40.58984375, 43.5322265625, 46.474609375, 49.4169921875, 52.359375, 55.3017578125, 58.244140625, 61.1865234375, 64.12890625, 67.0712890625, 70.013671875, 72.9560546875, 75.8984375, 78.8408203125, 81.783203125, 84.7255859375, 87.66796875, 90.6103515625, 93.552734375, 96.4951171875, 99.4375]}, "gradients/decoder.transformer.h.19.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 5.0, 15.0, 25.0, 60.0, 128.0, 182.0, 204.0, 170.0, 118.0, 70.0, 25.0, 9.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-60.368309020996094, -57.78787612915039, -55.20744323730469, -52.627010345458984, -50.04657745361328, -47.46614074707031, -44.88570785522461, -42.305274963378906, -39.7248420715332, -37.1444091796875, -34.5639762878418, -31.98354148864746, -29.403108596801758, -26.822675704956055, -24.24224090576172, -21.661808013916016, -19.081375122070312, -16.50094223022461, -13.92050838470459, -11.34007453918457, -8.759641647338867, -6.179208755493164, -3.5987749099731445, -1.018341064453125, 1.5620918273925781, 4.1425251960754395, 6.722958564758301, 9.30339241027832, 11.883825302124023, 14.464258193969727, 17.044692993164062, 19.625125885009766, 22.205551147460938, 24.78598403930664, 27.366416931152344, 29.94685173034668, 32.52728271484375, 35.10771942138672, 37.68815231323242, 40.268585205078125, 42.84901809692383, 45.42945098876953, 48.009883880615234, 50.59031677246094, 53.170753479003906, 55.751182556152344, 58.33161926269531, 60.912052154541016, 63.49248504638672, 66.07292175292969, 68.65335083007812, 71.2337875366211, 73.81421661376953, 76.3946533203125, 78.97508239746094, 81.5555191040039, 84.13595581054688, 86.71639251708984, 89.29682159423828, 91.87725830078125, 94.45768737792969, 97.03812408447266, 99.6185531616211, 102.19898986816406, 104.7794189453125]}, "gradients/decoder.transformer.h.19.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 3.0, 2.0, 1.0, 5.0, 7.0, 5.0, 5.0, 8.0, 14.0, 13.0, 18.0, 18.0, 20.0, 30.0, 21.0, 26.0, 31.0, 37.0, 46.0, 39.0, 44.0, 40.0, 45.0, 49.0, 42.0, 31.0, 43.0, 41.0, 41.0, 37.0, 33.0, 35.0, 23.0, 29.0, 13.0, 25.0, 14.0, 16.0, 14.0, 8.0, 9.0, 8.0, 5.0, 7.0, 3.0, 3.0, 3.0, 3.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0], "bins": [-54.46820068359375, -52.644866943359375, -50.821533203125, -48.998199462890625, -47.174861907958984, -45.35152816772461, -43.528194427490234, -41.70486068725586, -39.88152313232422, -38.058189392089844, -36.23485565185547, -34.411521911621094, -32.58818435668945, -30.764850616455078, -28.941516876220703, -27.118183135986328, -25.294849395751953, -23.471515655517578, -21.64818000793457, -19.824846267700195, -18.001510620117188, -16.178176879882812, -14.354843139648438, -12.531508445739746, -10.708173751831055, -8.884839057922363, -7.06150484085083, -5.238170623779297, -3.4148359298706055, -1.591501235961914, 0.23183250427246094, 2.0551671981811523, 3.878498077392578, 5.7018327713012695, 7.525166988372803, 9.348501205444336, 11.171835899353027, 12.995170593261719, 14.818504333496094, 16.64183807373047, 18.465173721313477, 20.28850746154785, 22.11184310913086, 23.935176849365234, 25.75851058959961, 27.581846237182617, 29.405179977416992, 31.228515625, 33.051849365234375, 34.87518310546875, 36.698516845703125, 38.5218505859375, 40.34518814086914, 42.168521881103516, 43.99185562133789, 45.815189361572266, 47.638526916503906, 49.46186065673828, 51.285194396972656, 53.10852813720703, 54.93186569213867, 56.75519943237305, 58.57853317260742, 60.4018669128418, 62.22520065307617]}, "gradients/decoder.transformer.h.19.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 4.0, 1.0, 1.0, 3.0, 2.0, 4.0, 3.0, 4.0, 6.0, 5.0, 4.0, 12.0, 9.0, 7.0, 9.0, 15.0, 24.0, 23.0, 22.0, 21.0, 24.0, 32.0, 27.0, 38.0, 42.0, 43.0, 33.0, 37.0, 42.0, 41.0, 42.0, 45.0, 36.0, 43.0, 28.0, 34.0, 28.0, 40.0, 19.0, 18.0, 24.0, 19.0, 15.0, 11.0, 18.0, 13.0, 11.0, 7.0, 3.0, 6.0, 3.0, 4.0, 1.0, 2.0, 4.0, 1.0, 1.0, 2.0, 1.0, 2.0, 2.0, 1.0], "bins": [-5.1796875, -5.01409912109375, -4.8485107421875, -4.68292236328125, -4.517333984375, -4.35174560546875, -4.1861572265625, -4.02056884765625, -3.85498046875, -3.68939208984375, -3.5238037109375, -3.35821533203125, -3.192626953125, -3.02703857421875, -2.8614501953125, -2.69586181640625, -2.5302734375, -2.36468505859375, -2.1990966796875, -2.03350830078125, -1.867919921875, -1.70233154296875, -1.5367431640625, -1.37115478515625, -1.20556640625, -1.03997802734375, -0.8743896484375, -0.70880126953125, -0.543212890625, -0.37762451171875, -0.2120361328125, -0.04644775390625, 0.119140625, 0.28472900390625, 0.4503173828125, 0.61590576171875, 0.781494140625, 0.94708251953125, 1.1126708984375, 1.27825927734375, 1.44384765625, 1.60943603515625, 1.7750244140625, 1.94061279296875, 2.106201171875, 2.27178955078125, 2.4373779296875, 2.60296630859375, 2.7685546875, 2.93414306640625, 3.0997314453125, 3.26531982421875, 3.430908203125, 3.59649658203125, 3.7620849609375, 3.92767333984375, 4.09326171875, 4.25885009765625, 4.4244384765625, 4.59002685546875, 4.755615234375, 4.92120361328125, 5.0867919921875, 5.25238037109375, 5.41796875]}, "gradients/decoder.transformer.h.19.crossattention.c_proj.weight": {"_type": "histogram", "values": [2.0, 1.0, 5.0, 3.0, 5.0, 7.0, 12.0, 18.0, 28.0, 40.0, 66.0, 99.0, 141.0, 176.0, 241.0, 354.0, 484.0, 703.0, 990.0, 1491.0, 2083.0, 2923.0, 4213.0, 6035.0, 9074.0, 13711.0, 20609.0, 31666.0, 52834.0, 92467.0, 200504.0, 313973.0, 117750.0, 63924.0, 38378.0, 24226.0, 15636.0, 10490.0, 7101.0, 5026.0, 3284.0, 2282.0, 1621.0, 1126.0, 815.0, 544.0, 409.0, 332.0, 210.0, 156.0, 113.0, 70.0, 46.0, 33.0, 17.0, 8.0, 8.0, 2.0, 2.0, 3.0, 1.0, 2.0, 2.0, 1.0], "bins": [-1.2705078125, -1.229766845703125, -1.18902587890625, -1.148284912109375, -1.1075439453125, -1.066802978515625, -1.02606201171875, -0.985321044921875, -0.944580078125, -0.903839111328125, -0.86309814453125, -0.822357177734375, -0.7816162109375, -0.740875244140625, -0.70013427734375, -0.659393310546875, -0.61865234375, -0.577911376953125, -0.53717041015625, -0.496429443359375, -0.4556884765625, -0.414947509765625, -0.37420654296875, -0.333465576171875, -0.292724609375, -0.251983642578125, -0.21124267578125, -0.170501708984375, -0.1297607421875, -0.089019775390625, -0.04827880859375, -0.007537841796875, 0.033203125, 0.073944091796875, 0.11468505859375, 0.155426025390625, 0.1961669921875, 0.236907958984375, 0.27764892578125, 0.318389892578125, 0.359130859375, 0.399871826171875, 0.44061279296875, 0.481353759765625, 0.5220947265625, 0.562835693359375, 0.60357666015625, 0.644317626953125, 0.68505859375, 0.725799560546875, 0.76654052734375, 0.807281494140625, 0.8480224609375, 0.888763427734375, 0.92950439453125, 0.970245361328125, 1.010986328125, 1.051727294921875, 1.09246826171875, 1.133209228515625, 1.1739501953125, 1.214691162109375, 1.25543212890625, 1.296173095703125, 1.3369140625]}, "gradients/decoder.transformer.h.19.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 2.0, 3.0, 4.0, 5.0, 5.0, 5.0, 5.0, 13.0, 14.0, 8.0, 13.0, 19.0, 22.0, 32.0, 27.0, 32.0, 28.0, 40.0, 32.0, 44.0, 40.0, 40.0, 34.0, 1072.0, 46.0, 43.0, 41.0, 40.0, 42.0, 40.0, 32.0, 33.0, 31.0, 16.0, 16.0, 20.0, 17.0, 19.0, 13.0, 11.0, 12.0, 4.0, 3.0, 9.0, 3.0, 2.0, 3.0, 1.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0], "bins": [-3.712890625, -3.59796142578125, -3.4830322265625, -3.36810302734375, -3.253173828125, -3.13824462890625, -3.0233154296875, -2.90838623046875, -2.79345703125, -2.67852783203125, -2.5635986328125, -2.44866943359375, -2.333740234375, -2.21881103515625, -2.1038818359375, -1.98895263671875, -1.8740234375, -1.75909423828125, -1.6441650390625, -1.52923583984375, -1.414306640625, -1.29937744140625, -1.1844482421875, -1.06951904296875, -0.95458984375, -0.83966064453125, -0.7247314453125, -0.60980224609375, -0.494873046875, -0.37994384765625, -0.2650146484375, -0.15008544921875, -0.03515625, 0.07977294921875, 0.1947021484375, 0.30963134765625, 0.424560546875, 0.53948974609375, 0.6544189453125, 0.76934814453125, 0.88427734375, 0.99920654296875, 1.1141357421875, 1.22906494140625, 1.343994140625, 1.45892333984375, 1.5738525390625, 1.68878173828125, 1.8037109375, 1.91864013671875, 2.0335693359375, 2.14849853515625, 2.263427734375, 2.37835693359375, 2.4932861328125, 2.60821533203125, 2.72314453125, 2.83807373046875, 2.9530029296875, 3.06793212890625, 3.182861328125, 3.29779052734375, 3.4127197265625, 3.52764892578125, 3.642578125]}, "gradients/decoder.transformer.h.19.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 4.0, 1.0, 2.0, 8.0, 7.0, 5.0, 14.0, 22.0, 28.0, 42.0, 55.0, 95.0, 167.0, 289.0, 448.0, 813.0, 1478.0, 2582.0, 4778.0, 8739.0, 16628.0, 33522.0, 70097.0, 162389.0, 1483254.0, 167962.0, 72197.0, 34186.0, 17247.0, 8926.0, 4853.0, 2689.0, 1532.0, 831.0, 517.0, 288.0, 160.0, 105.0, 55.0, 51.0, 21.0, 22.0, 5.0, 8.0, 6.0, 6.0, 5.0, 1.0, 2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.966796875, -1.904388427734375, -1.84197998046875, -1.779571533203125, -1.7171630859375, -1.654754638671875, -1.59234619140625, -1.529937744140625, -1.467529296875, -1.405120849609375, -1.34271240234375, -1.280303955078125, -1.2178955078125, -1.155487060546875, -1.09307861328125, -1.030670166015625, -0.96826171875, -0.905853271484375, -0.84344482421875, -0.781036376953125, -0.7186279296875, -0.656219482421875, -0.59381103515625, -0.531402587890625, -0.468994140625, -0.406585693359375, -0.34417724609375, -0.281768798828125, -0.2193603515625, -0.156951904296875, -0.09454345703125, -0.032135009765625, 0.0302734375, 0.092681884765625, 0.15509033203125, 0.217498779296875, 0.2799072265625, 0.342315673828125, 0.40472412109375, 0.467132568359375, 0.529541015625, 0.591949462890625, 0.65435791015625, 0.716766357421875, 0.7791748046875, 0.841583251953125, 0.90399169921875, 0.966400146484375, 1.02880859375, 1.091217041015625, 1.15362548828125, 1.216033935546875, 1.2784423828125, 1.340850830078125, 1.40325927734375, 1.465667724609375, 1.528076171875, 1.590484619140625, 1.65289306640625, 1.715301513671875, 1.7777099609375, 1.840118408203125, 1.90252685546875, 1.964935302734375, 2.02734375]}, "gradients/decoder.transformer.h.19.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 0.0, 0.0, 3.0, 0.0, 3.0, 1.0, 2.0, 5.0, 6.0, 8.0, 7.0, 7.0, 13.0, 17.0, 30.0, 25.0, 44.0, 67.0, 123.0, 122.0, 117.0, 105.0, 74.0, 67.0, 50.0, 28.0, 26.0, 14.0, 15.0, 11.0, 6.0, 1.0, 3.0, 3.0, 3.0, 0.0, 1.0, 1.0, 3.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.0019245147705078125, -0.0018700659275054932, -0.0018156170845031738, -0.0017611682415008545, -0.0017067193984985352, -0.0016522705554962158, -0.0015978217124938965, -0.0015433728694915771, -0.0014889240264892578, -0.0014344751834869385, -0.0013800263404846191, -0.0013255774974822998, -0.0012711286544799805, -0.0012166798114776611, -0.0011622309684753418, -0.0011077821254730225, -0.0010533332824707031, -0.0009988844394683838, -0.0009444355964660645, -0.0008899867534637451, -0.0008355379104614258, -0.0007810890674591064, -0.0007266402244567871, -0.0006721913814544678, -0.0006177425384521484, -0.0005632936954498291, -0.0005088448524475098, -0.00045439600944519043, -0.0003999471664428711, -0.00034549832344055176, -0.0002910494804382324, -0.00023660063743591309, -0.00018215179443359375, -0.00012770295143127441, -7.325410842895508e-05, -1.8805265426635742e-05, 3.5643577575683594e-05, 9.009242057800293e-05, 0.00014454126358032227, 0.0001989901065826416, 0.00025343894958496094, 0.0003078877925872803, 0.0003623366355895996, 0.00041678547859191895, 0.0004712343215942383, 0.0005256831645965576, 0.000580132007598877, 0.0006345808506011963, 0.0006890296936035156, 0.000743478536605835, 0.0007979273796081543, 0.0008523762226104736, 0.000906825065612793, 0.0009612739086151123, 0.0010157227516174316, 0.001070171594619751, 0.0011246204376220703, 0.0011790692806243896, 0.001233518123626709, 0.0012879669666290283, 0.0013424158096313477, 0.001396864652633667, 0.0014513134956359863, 0.0015057623386383057, 0.001560211181640625]}, "gradients/decoder.transformer.h.19.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 3.0, 2.0, 1.0, 2.0, 4.0, 1.0, 10.0, 12.0, 16.0, 15.0, 27.0, 41.0, 55.0, 107.0, 168.0, 421.0, 393969.0, 652778.0, 445.0, 184.0, 107.0, 56.0, 37.0, 34.0, 17.0, 9.0, 8.0, 10.0, 5.0, 5.0, 0.0, 0.0, 3.0, 5.0, 2.0, 1.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.039520263671875, -0.03810930252075195, -0.036698341369628906, -0.03528738021850586, -0.03387641906738281, -0.032465457916259766, -0.03105449676513672, -0.029643535614013672, -0.028232574462890625, -0.026821613311767578, -0.02541065216064453, -0.023999691009521484, -0.022588729858398438, -0.02117776870727539, -0.019766807556152344, -0.018355846405029297, -0.01694488525390625, -0.015533924102783203, -0.014122962951660156, -0.01271200180053711, -0.011301040649414062, -0.009890079498291016, -0.008479118347167969, -0.007068157196044922, -0.005657196044921875, -0.004246234893798828, -0.0028352737426757812, -0.0014243125915527344, -1.33514404296875e-05, 0.0013976097106933594, 0.0028085708618164062, 0.004219532012939453, 0.0056304931640625, 0.007041454315185547, 0.008452415466308594, 0.00986337661743164, 0.011274337768554688, 0.012685298919677734, 0.014096260070800781, 0.015507221221923828, 0.016918182373046875, 0.018329143524169922, 0.01974010467529297, 0.021151065826416016, 0.022562026977539062, 0.02397298812866211, 0.025383949279785156, 0.026794910430908203, 0.02820587158203125, 0.029616832733154297, 0.031027793884277344, 0.03243875503540039, 0.03384971618652344, 0.035260677337646484, 0.03667163848876953, 0.03808259963989258, 0.039493560791015625, 0.04090452194213867, 0.04231548309326172, 0.043726444244384766, 0.04513740539550781, 0.04654836654663086, 0.047959327697753906, 0.04937028884887695, 0.05078125]}, "gradients/decoder.transformer.h.19.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 14.0, 76.0, 227.0, 326.0, 241.0, 94.0, 26.0, 9.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.0019064029911532998, -0.0018708306597545743, -0.001835258211940527, -0.0017996858805418015, -0.001764113549143076, -0.0017285411013290286, -0.001692968769930303, -0.0016573963221162558, -0.0016218239907175303, -0.0015862516593188047, -0.0015506792115047574, -0.001515106880106032, -0.0014795344322919846, -0.001443962100893259, -0.0014083897694945335, -0.001372817438095808, -0.0013372449902817607, -0.0013016726588830352, -0.0012661002110689878, -0.0012305278796702623, -0.0011949555482715368, -0.0011593831004574895, -0.001123810769058764, -0.0010882383212447166, -0.0010526659898459911, -0.0010170936584472656, -0.0009815212106332183, -0.0009459488792344928, -0.0009103764896281064, -0.0008748041000217199, -0.0008392317686229944, -0.000803659379016608, -0.0007680871058255434, -0.000732514716219157, -0.0006969423266127706, -0.000661369995214045, -0.0006257976056076586, -0.0005902252160012722, -0.0005546528846025467, -0.0005190804949961603, -0.0004835080762859434, -0.0004479357157833874, -0.000412363326177001, -0.0003767909365706146, -0.0003412185760680586, -0.00030564621556550264, -0.0002700738259591162, -0.00023450146545656025, -0.00019892907585017383, -0.00016335670079570264, -0.00012778432574123144, -9.221195068676025e-05, -5.663957563228905e-05, -2.1067200577817857e-05, 1.4505174476653337e-05, 5.0077534979209304e-05, 8.564992458559573e-05, 0.00012122229964006692, 0.00015679467469453812, 0.0001923670497490093, 0.0002279394248034805, 0.00026351178530603647, 0.0002990841749124229, 0.00033465653541497886, 0.0003702289250213653]}, "gradients/decoder.transformer.h.19.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 3.0, 2.0, 5.0, 2.0, 7.0, 3.0, 8.0, 11.0, 10.0, 12.0, 18.0, 17.0, 27.0, 23.0, 17.0, 21.0, 43.0, 36.0, 28.0, 46.0, 45.0, 46.0, 48.0, 36.0, 48.0, 59.0, 43.0, 37.0, 43.0, 36.0, 33.0, 29.0, 24.0, 17.0, 15.0, 15.0, 15.0, 21.0, 15.0, 11.0, 10.0, 16.0, 7.0, 1.0, 5.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0005411505699157715, -0.0005207424983382225, -0.0005003344267606735, -0.00047992635518312454, -0.00045951828360557556, -0.0004391102120280266, -0.0004187021404504776, -0.0003982940688729286, -0.00037788599729537964, -0.00035747792571783066, -0.0003370698541402817, -0.0003166617825627327, -0.0002962537109851837, -0.00027584563940763474, -0.00025543756783008575, -0.00023502949625253677, -0.0002146214246749878, -0.0001942133530974388, -0.00017380528151988983, -0.00015339720994234085, -0.00013298913836479187, -0.00011258106678724289, -9.217299520969391e-05, -7.176492363214493e-05, -5.135685205459595e-05, -3.0948780477046967e-05, -1.0540708899497986e-05, 9.867362678050995e-06, 3.0275434255599976e-05, 5.0683505833148956e-05, 7.109157741069794e-05, 9.149964898824692e-05, 0.0001119077205657959, 0.00013231579214334488, 0.00015272386372089386, 0.00017313193529844284, 0.00019354000687599182, 0.0002139480784535408, 0.00023435615003108978, 0.00025476422160863876, 0.00027517229318618774, 0.0002955803647637367, 0.0003159884363412857, 0.0003363965079188347, 0.00035680457949638367, 0.00037721265107393265, 0.00039762072265148163, 0.0004180287942290306, 0.0004384368658065796, 0.00045884493738412857, 0.00047925300896167755, 0.0004996610805392265, 0.0005200691521167755, 0.0005404772236943245, 0.0005608852952718735, 0.0005812933668494225, 0.0006017014384269714, 0.0006221095100045204, 0.0006425175815820694, 0.0006629256531596184, 0.0006833337247371674, 0.0007037417963147163, 0.0007241498678922653, 0.0007445579394698143, 0.0007649660110473633]}, "gradients/decoder.transformer.h.19.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 4.0, 1.0, 1.0, 3.0, 2.0, 4.0, 3.0, 4.0, 6.0, 5.0, 4.0, 12.0, 9.0, 7.0, 9.0, 15.0, 24.0, 23.0, 22.0, 21.0, 24.0, 32.0, 27.0, 38.0, 42.0, 43.0, 33.0, 37.0, 42.0, 41.0, 42.0, 45.0, 36.0, 43.0, 28.0, 34.0, 28.0, 40.0, 19.0, 18.0, 24.0, 19.0, 15.0, 11.0, 18.0, 13.0, 11.0, 7.0, 3.0, 6.0, 3.0, 4.0, 1.0, 2.0, 4.0, 1.0, 1.0, 2.0, 1.0, 2.0, 2.0, 1.0], "bins": [-5.1796875, -5.01409912109375, -4.8485107421875, -4.68292236328125, -4.517333984375, -4.35174560546875, -4.1861572265625, -4.02056884765625, -3.85498046875, -3.68939208984375, -3.5238037109375, -3.35821533203125, -3.192626953125, -3.02703857421875, -2.8614501953125, -2.69586181640625, -2.5302734375, -2.36468505859375, -2.1990966796875, -2.03350830078125, -1.867919921875, -1.70233154296875, -1.5367431640625, -1.37115478515625, -1.20556640625, -1.03997802734375, -0.8743896484375, -0.70880126953125, -0.543212890625, -0.37762451171875, -0.2120361328125, -0.04644775390625, 0.119140625, 0.28472900390625, 0.4503173828125, 0.61590576171875, 0.781494140625, 0.94708251953125, 1.1126708984375, 1.27825927734375, 1.44384765625, 1.60943603515625, 1.7750244140625, 1.94061279296875, 2.106201171875, 2.27178955078125, 2.4373779296875, 2.60296630859375, 2.7685546875, 2.93414306640625, 3.0997314453125, 3.26531982421875, 3.430908203125, 3.59649658203125, 3.7620849609375, 3.92767333984375, 4.09326171875, 4.25885009765625, 4.4244384765625, 4.59002685546875, 4.755615234375, 4.92120361328125, 5.0867919921875, 5.25238037109375, 5.41796875]}, "gradients/decoder.transformer.h.19.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 4.0, 1.0, 0.0, 4.0, 1.0, 4.0, 3.0, 4.0, 4.0, 8.0, 6.0, 17.0, 10.0, 9.0, 17.0, 30.0, 52.0, 65.0, 99.0, 181.0, 286.0, 562.0, 1120.0, 2262.0, 4720.0, 11403.0, 30211.0, 92976.0, 299120.0, 396095.0, 138113.0, 43419.0, 15514.0, 6422.0, 2832.0, 1378.0, 737.0, 339.0, 176.0, 115.0, 69.0, 51.0, 28.0, 14.0, 26.0, 15.0, 10.0, 8.0, 8.0, 3.0, 4.0, 2.0, 1.0, 1.0, 6.0, 0.0, 2.0, 1.0, 2.0, 2.0, 1.0, 1.0], "bins": [-3.177734375, -3.075775146484375, -2.97381591796875, -2.871856689453125, -2.7698974609375, -2.667938232421875, -2.56597900390625, -2.464019775390625, -2.362060546875, -2.260101318359375, -2.15814208984375, -2.056182861328125, -1.9542236328125, -1.852264404296875, -1.75030517578125, -1.648345947265625, -1.54638671875, -1.444427490234375, -1.34246826171875, -1.240509033203125, -1.1385498046875, -1.036590576171875, -0.93463134765625, -0.832672119140625, -0.730712890625, -0.628753662109375, -0.52679443359375, -0.424835205078125, -0.3228759765625, -0.220916748046875, -0.11895751953125, -0.016998291015625, 0.0849609375, 0.186920166015625, 0.28887939453125, 0.390838623046875, 0.4927978515625, 0.594757080078125, 0.69671630859375, 0.798675537109375, 0.900634765625, 1.002593994140625, 1.10455322265625, 1.206512451171875, 1.3084716796875, 1.410430908203125, 1.51239013671875, 1.614349365234375, 1.71630859375, 1.818267822265625, 1.92022705078125, 2.022186279296875, 2.1241455078125, 2.226104736328125, 2.32806396484375, 2.430023193359375, 2.531982421875, 2.633941650390625, 2.73590087890625, 2.837860107421875, 2.9398193359375, 3.041778564453125, 3.14373779296875, 3.245697021484375, 3.34765625]}, "gradients/decoder.transformer.h.19.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 2.0, 1.0, 2.0, 5.0, 7.0, 6.0, 12.0, 14.0, 20.0, 16.0, 24.0, 24.0, 28.0, 29.0, 35.0, 42.0, 48.0, 52.0, 60.0, 89.0, 1782.0, 270.0, 70.0, 54.0, 47.0, 54.0, 47.0, 34.0, 30.0, 22.0, 30.0, 17.0, 20.0, 12.0, 9.0, 6.0, 7.0, 12.0, 1.0, 4.0, 8.0, 3.0, 2.0, 1.0, 1.0, 2.0, 2.0, 1.0], "bins": [-26.109375, -25.400390625, -24.69140625, -23.982421875, -23.2734375, -22.564453125, -21.85546875, -21.146484375, -20.4375, -19.728515625, -19.01953125, -18.310546875, -17.6015625, -16.892578125, -16.18359375, -15.474609375, -14.765625, -14.056640625, -13.34765625, -12.638671875, -11.9296875, -11.220703125, -10.51171875, -9.802734375, -9.09375, -8.384765625, -7.67578125, -6.966796875, -6.2578125, -5.548828125, -4.83984375, -4.130859375, -3.421875, -2.712890625, -2.00390625, -1.294921875, -0.5859375, 0.123046875, 0.83203125, 1.541015625, 2.25, 2.958984375, 3.66796875, 4.376953125, 5.0859375, 5.794921875, 6.50390625, 7.212890625, 7.921875, 8.630859375, 9.33984375, 10.048828125, 10.7578125, 11.466796875, 12.17578125, 12.884765625, 13.59375, 14.302734375, 15.01171875, 15.720703125, 16.4296875, 17.138671875, 17.84765625, 18.556640625, 19.265625]}, "gradients/decoder.transformer.h.19.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 4.0, 1.0, 0.0, 1.0, 6.0, 6.0, 4.0, 4.0, 4.0, 10.0, 5.0, 11.0, 7.0, 11.0, 23.0, 22.0, 35.0, 34.0, 52.0, 73.0, 111.0, 152.0, 282.0, 591.0, 2802.0, 2657891.0, 480139.0, 2119.0, 522.0, 254.0, 172.0, 92.0, 52.0, 49.0, 37.0, 29.0, 19.0, 20.0, 19.0, 15.0, 14.0, 7.0, 8.0, 4.0, 2.0, 2.0, 2.0, 1.0, 0.0, 1.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-31.296875, -30.135009765625, -28.97314453125, -27.811279296875, -26.6494140625, -25.487548828125, -24.32568359375, -23.163818359375, -22.001953125, -20.840087890625, -19.67822265625, -18.516357421875, -17.3544921875, -16.192626953125, -15.03076171875, -13.868896484375, -12.70703125, -11.545166015625, -10.38330078125, -9.221435546875, -8.0595703125, -6.897705078125, -5.73583984375, -4.573974609375, -3.412109375, -2.250244140625, -1.08837890625, 0.073486328125, 1.2353515625, 2.397216796875, 3.55908203125, 4.720947265625, 5.8828125, 7.044677734375, 8.20654296875, 9.368408203125, 10.5302734375, 11.692138671875, 12.85400390625, 14.015869140625, 15.177734375, 16.339599609375, 17.50146484375, 18.663330078125, 19.8251953125, 20.987060546875, 22.14892578125, 23.310791015625, 24.47265625, 25.634521484375, 26.79638671875, 27.958251953125, 29.1201171875, 30.281982421875, 31.44384765625, 32.605712890625, 33.767578125, 34.929443359375, 36.09130859375, 37.253173828125, 38.4150390625, 39.576904296875, 40.73876953125, 41.900634765625, 43.0625]}, "gradients/decoder.transformer.h.19.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 8.0, 10.0, 36.0, 101.0, 182.0, 212.0, 188.0, 157.0, 74.0, 25.0, 10.0, 8.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-29.64617919921875, -28.768712997436523, -27.891246795654297, -27.01378059387207, -26.136314392089844, -25.258848190307617, -24.38138198852539, -23.503917694091797, -22.626449584960938, -21.74898338317871, -20.871517181396484, -19.994050979614258, -19.11658477783203, -18.239118576049805, -17.361652374267578, -16.484188079833984, -15.606721878051758, -14.729255676269531, -13.851789474487305, -12.974323272705078, -12.096857070922852, -11.219390869140625, -10.341925621032715, -9.464459419250488, -8.586993217468262, -7.709527015686035, -6.832060813903809, -5.95459508895874, -5.077128887176514, -4.199662685394287, -3.3221969604492188, -2.444730758666992, -1.5672626495361328, -0.6897965669631958, 0.1876695156097412, 1.0651354789733887, 1.9426016807556152, 2.820067882537842, 3.69753360748291, 4.574999809265137, 5.452466011047363, 6.32993221282959, 7.207398414611816, 8.084863662719727, 8.962329864501953, 9.83979606628418, 10.717262268066406, 11.594728469848633, 12.47219467163086, 13.349660873413086, 14.227127075195312, 15.104593276977539, 15.982059478759766, 16.859525680541992, 17.73699188232422, 18.614456176757812, 19.491924285888672, 20.3693904876709, 21.246856689453125, 22.12432289123535, 23.001789093017578, 23.879255294799805, 24.75672149658203, 25.634185791015625, 26.51165199279785]}, "gradients/decoder.transformer.h.19.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 4.0, 3.0, 3.0, 4.0, 10.0, 6.0, 8.0, 3.0, 15.0, 14.0, 18.0, 19.0, 21.0, 21.0, 25.0, 35.0, 31.0, 43.0, 32.0, 38.0, 48.0, 46.0, 41.0, 53.0, 50.0, 44.0, 37.0, 34.0, 30.0, 30.0, 24.0, 31.0, 31.0, 16.0, 25.0, 23.0, 25.0, 16.0, 12.0, 6.0, 8.0, 7.0, 6.0, 2.0, 4.0, 4.0, 1.0, 3.0, 3.0, 2.0, 0.0, 2.0], "bins": [-69.41703033447266, -67.44730377197266, -65.47757720947266, -63.50784683227539, -61.53812026977539, -59.568389892578125, -57.598663330078125, -55.628936767578125, -53.659210205078125, -51.689483642578125, -49.71975326538086, -47.75002670288086, -45.78030014038086, -43.810569763183594, -41.840843200683594, -39.871116638183594, -37.90138626098633, -35.93165969848633, -33.96192932128906, -31.992202758789062, -30.022476196289062, -28.05274772644043, -26.083019256591797, -24.113292694091797, -22.143564224243164, -20.17383575439453, -18.20410919189453, -16.2343807220459, -14.264653205871582, -12.294925689697266, -10.325197219848633, -8.355469703674316, -6.385746002197266, -4.416018486022949, -2.4462904930114746, -0.4765625, 1.4931650161743164, 3.462892532348633, 5.432621002197266, 7.402348518371582, 9.372076034545898, 11.341803550720215, 13.311531066894531, 15.281259536743164, 17.250988006591797, 19.220714569091797, 21.19044303894043, 23.160171508789062, 25.129898071289062, 27.099626541137695, 29.069353103637695, 31.039081573486328, 33.00880813598633, 34.978538513183594, 36.948265075683594, 38.917991638183594, 40.887718200683594, 42.857444763183594, 44.82717514038086, 46.79690170288086, 48.76662826538086, 50.736358642578125, 52.706085205078125, 54.675811767578125, 56.64554214477539]}, "gradients/decoder.transformer.h.18.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 2.0, 1.0, 2.0, 3.0, 1.0, 2.0, 4.0, 3.0, 7.0, 4.0, 8.0, 7.0, 7.0, 9.0, 13.0, 10.0, 17.0, 23.0, 20.0, 29.0, 24.0, 29.0, 30.0, 39.0, 34.0, 30.0, 50.0, 40.0, 41.0, 30.0, 54.0, 38.0, 43.0, 42.0, 36.0, 34.0, 27.0, 31.0, 23.0, 24.0, 22.0, 12.0, 22.0, 15.0, 13.0, 15.0, 11.0, 6.0, 6.0, 7.0, 0.0, 5.0, 1.0, 3.0, 0.0, 3.0, 2.0, 3.0, 1.0, 2.0, 0.0, 1.0, 1.0], "bins": [-5.2265625, -5.05560302734375, -4.8846435546875, -4.71368408203125, -4.542724609375, -4.37176513671875, -4.2008056640625, -4.02984619140625, -3.85888671875, -3.68792724609375, -3.5169677734375, -3.34600830078125, -3.175048828125, -3.00408935546875, -2.8331298828125, -2.66217041015625, -2.4912109375, -2.32025146484375, -2.1492919921875, -1.97833251953125, -1.807373046875, -1.63641357421875, -1.4654541015625, -1.29449462890625, -1.12353515625, -0.95257568359375, -0.7816162109375, -0.61065673828125, -0.439697265625, -0.26873779296875, -0.0977783203125, 0.07318115234375, 0.244140625, 0.41510009765625, 0.5860595703125, 0.75701904296875, 0.927978515625, 1.09893798828125, 1.2698974609375, 1.44085693359375, 1.61181640625, 1.78277587890625, 1.9537353515625, 2.12469482421875, 2.295654296875, 2.46661376953125, 2.6375732421875, 2.80853271484375, 2.9794921875, 3.15045166015625, 3.3214111328125, 3.49237060546875, 3.663330078125, 3.83428955078125, 4.0052490234375, 4.17620849609375, 4.34716796875, 4.51812744140625, 4.6890869140625, 4.86004638671875, 5.031005859375, 5.20196533203125, 5.3729248046875, 5.54388427734375, 5.71484375]}, "gradients/decoder.transformer.h.18.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 4.0, 3.0, 3.0, 7.0, 6.0, 6.0, 10.0, 5.0, 10.0, 11.0, 9.0, 17.0, 18.0, 28.0, 31.0, 40.0, 47.0, 64.0, 92.0, 168.0, 530.0, 3087.0, 37998.0, 1082544.0, 2882811.0, 176219.0, 8707.0, 1077.0, 277.0, 111.0, 76.0, 44.0, 46.0, 27.0, 30.0, 24.0, 20.0, 13.0, 16.0, 14.0, 9.0, 11.0, 6.0, 3.0, 5.0, 2.0, 1.0, 3.0, 0.0, 2.0, 2.0, 2.0, 2.0, 2.0], "bins": [-19.296875, -18.733154296875, -18.16943359375, -17.605712890625, -17.0419921875, -16.478271484375, -15.91455078125, -15.350830078125, -14.787109375, -14.223388671875, -13.65966796875, -13.095947265625, -12.5322265625, -11.968505859375, -11.40478515625, -10.841064453125, -10.27734375, -9.713623046875, -9.14990234375, -8.586181640625, -8.0224609375, -7.458740234375, -6.89501953125, -6.331298828125, -5.767578125, -5.203857421875, -4.64013671875, -4.076416015625, -3.5126953125, -2.948974609375, -2.38525390625, -1.821533203125, -1.2578125, -0.694091796875, -0.13037109375, 0.433349609375, 0.9970703125, 1.560791015625, 2.12451171875, 2.688232421875, 3.251953125, 3.815673828125, 4.37939453125, 4.943115234375, 5.5068359375, 6.070556640625, 6.63427734375, 7.197998046875, 7.76171875, 8.325439453125, 8.88916015625, 9.452880859375, 10.0166015625, 10.580322265625, 11.14404296875, 11.707763671875, 12.271484375, 12.835205078125, 13.39892578125, 13.962646484375, 14.5263671875, 15.090087890625, 15.65380859375, 16.217529296875, 16.78125]}, "gradients/decoder.transformer.h.18.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 3.0, 7.0, 7.0, 9.0, 10.0, 18.0, 34.0, 43.0, 48.0, 80.0, 116.0, 145.0, 229.0, 302.0, 483.0, 544.0, 574.0, 403.0, 329.0, 207.0, 160.0, 98.0, 65.0, 51.0, 42.0, 26.0, 8.0, 12.0, 9.0, 6.0, 5.0, 3.0, 3.0, 2.0, 0.0, 2.0, 0.0, 1.0], "bins": [-24.765625, -24.17578125, -23.5859375, -22.99609375, -22.40625, -21.81640625, -21.2265625, -20.63671875, -20.046875, -19.45703125, -18.8671875, -18.27734375, -17.6875, -17.09765625, -16.5078125, -15.91796875, -15.328125, -14.73828125, -14.1484375, -13.55859375, -12.96875, -12.37890625, -11.7890625, -11.19921875, -10.609375, -10.01953125, -9.4296875, -8.83984375, -8.25, -7.66015625, -7.0703125, -6.48046875, -5.890625, -5.30078125, -4.7109375, -4.12109375, -3.53125, -2.94140625, -2.3515625, -1.76171875, -1.171875, -0.58203125, 0.0078125, 0.59765625, 1.1875, 1.77734375, 2.3671875, 2.95703125, 3.546875, 4.13671875, 4.7265625, 5.31640625, 5.90625, 6.49609375, 7.0859375, 7.67578125, 8.265625, 8.85546875, 9.4453125, 10.03515625, 10.625, 11.21484375, 11.8046875, 12.39453125, 12.984375]}, "gradients/decoder.transformer.h.18.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 4.0, 2.0, 2.0, 3.0, 6.0, 7.0, 6.0, 8.0, 22.0, 32.0, 36.0, 62.0, 54.0, 101.0, 120.0, 166.0, 354.0, 729.0, 4181.0, 3832153.0, 352965.0, 1777.0, 556.0, 312.0, 200.0, 117.0, 89.0, 70.0, 37.0, 39.0, 21.0, 16.0, 13.0, 7.0, 7.0, 3.0, 6.0, 3.0, 2.0, 5.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-60.4375, -57.90234375, -55.3671875, -52.83203125, -50.296875, -47.76171875, -45.2265625, -42.69140625, -40.15625, -37.62109375, -35.0859375, -32.55078125, -30.015625, -27.48046875, -24.9453125, -22.41015625, -19.875, -17.33984375, -14.8046875, -12.26953125, -9.734375, -7.19921875, -4.6640625, -2.12890625, 0.40625, 2.94140625, 5.4765625, 8.01171875, 10.546875, 13.08203125, 15.6171875, 18.15234375, 20.6875, 23.22265625, 25.7578125, 28.29296875, 30.828125, 33.36328125, 35.8984375, 38.43359375, 40.96875, 43.50390625, 46.0390625, 48.57421875, 51.109375, 53.64453125, 56.1796875, 58.71484375, 61.25, 63.78515625, 66.3203125, 68.85546875, 71.390625, 73.92578125, 76.4609375, 78.99609375, 81.53125, 84.06640625, 86.6015625, 89.13671875, 91.671875, 94.20703125, 96.7421875, 99.27734375, 101.8125]}, "gradients/decoder.transformer.h.18.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 7.0, 91.0, 533.0, 340.0, 42.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-92.80313873291016, -83.39895629882812, -73.99476623535156, -64.59058380126953, -55.1864013671875, -45.78221893310547, -36.37803268432617, -26.973846435546875, -17.569664001464844, -8.16547966003418, 1.2387046813964844, 10.642889022827148, 20.047073364257812, 29.451255798339844, 38.85544204711914, 48.25962829589844, 57.66381072998047, 67.0679931640625, 76.47218322753906, 85.8763656616211, 95.28054809570312, 104.68473052978516, 114.08891296386719, 123.49310302734375, 132.89727783203125, 142.3014678955078, 151.7056427001953, 161.10983276367188, 170.51400756835938, 179.91819763183594, 189.3223876953125, 198.7265625, 208.13076782226562, 217.5349578857422, 226.9391326904297, 236.34332275390625, 245.74749755859375, 255.1516876220703, 264.5558776855469, 273.9600524902344, 283.3642578125, 292.7684326171875, 302.1726379394531, 311.5768127441406, 320.9809875488281, 330.38519287109375, 339.78936767578125, 349.19354248046875, 358.59771728515625, 368.00189208984375, 377.4060974121094, 386.8102722167969, 396.2144470214844, 405.61865234375, 415.0228271484375, 424.427001953125, 433.8311767578125, 443.2353515625, 452.6395568847656, 462.0437316894531, 471.4479064941406, 480.85211181640625, 490.25628662109375, 499.66046142578125, 509.0646667480469]}, "gradients/decoder.transformer.h.18.ln_2.bias": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 4.0, 1.0, 2.0, 7.0, 3.0, 2.0, 12.0, 12.0, 15.0, 11.0, 19.0, 24.0, 29.0, 24.0, 27.0, 36.0, 30.0, 24.0, 38.0, 29.0, 30.0, 38.0, 44.0, 46.0, 50.0, 35.0, 43.0, 39.0, 38.0, 39.0, 30.0, 29.0, 24.0, 25.0, 29.0, 15.0, 18.0, 13.0, 14.0, 14.0, 10.0, 12.0, 8.0, 4.0, 6.0, 4.0, 4.0, 1.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-54.68250274658203, -52.85968017578125, -51.036861419677734, -49.21403884887695, -47.39122009277344, -45.568397521972656, -43.745574951171875, -41.92275619506836, -40.09993362426758, -38.2771110534668, -36.45429229736328, -34.6314697265625, -32.808650970458984, -30.985828399658203, -29.163007736206055, -27.340187072753906, -25.517366409301758, -23.69454574584961, -21.87172508239746, -20.048904418945312, -18.22608184814453, -16.403261184692383, -14.580440521240234, -12.75761890411377, -10.934798240661621, -9.111977577209473, -7.289155960083008, -5.466335296630859, -3.6435141563415527, -1.820693016052246, 0.0021276473999023438, 1.8249492645263672, 3.6477699279785156, 5.470591068267822, 7.293412208557129, 9.116232872009277, 10.939054489135742, 12.76187515258789, 14.584695816040039, 16.407516479492188, 18.23033905029297, 20.053159713745117, 21.875980377197266, 23.698802947998047, 25.521623611450195, 27.344444274902344, 29.167264938354492, 30.99008560180664, 32.812904357910156, 34.63572692871094, 36.45854568481445, 38.281368255615234, 40.10418701171875, 41.92700958251953, 43.74983215332031, 45.57265090942383, 47.39547348022461, 49.21829605102539, 51.041114807128906, 52.86393737792969, 54.6867561340332, 56.509578704833984, 58.3323974609375, 60.15522003173828, 61.97804260253906]}, "gradients/decoder.transformer.h.18.crossattention.c_proj.bias": {"_type": "histogram", "values": [3.0, 3.0, 2.0, 1.0, 2.0, 3.0, 5.0, 2.0, 4.0, 3.0, 8.0, 9.0, 12.0, 9.0, 10.0, 13.0, 15.0, 17.0, 23.0, 22.0, 26.0, 24.0, 24.0, 39.0, 44.0, 37.0, 46.0, 32.0, 44.0, 36.0, 59.0, 32.0, 43.0, 39.0, 37.0, 36.0, 26.0, 36.0, 23.0, 24.0, 18.0, 21.0, 19.0, 21.0, 12.0, 8.0, 10.0, 7.0, 5.0, 8.0, 2.0, 4.0, 4.0, 1.0, 0.0, 4.0, 2.0, 0.0, 3.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-5.2734375, -5.09552001953125, -4.9176025390625, -4.73968505859375, -4.561767578125, -4.38385009765625, -4.2059326171875, -4.02801513671875, -3.85009765625, -3.67218017578125, -3.4942626953125, -3.31634521484375, -3.138427734375, -2.96051025390625, -2.7825927734375, -2.60467529296875, -2.4267578125, -2.24884033203125, -2.0709228515625, -1.89300537109375, -1.715087890625, -1.53717041015625, -1.3592529296875, -1.18133544921875, -1.00341796875, -0.82550048828125, -0.6475830078125, -0.46966552734375, -0.291748046875, -0.11383056640625, 0.0640869140625, 0.24200439453125, 0.419921875, 0.59783935546875, 0.7757568359375, 0.95367431640625, 1.131591796875, 1.30950927734375, 1.4874267578125, 1.66534423828125, 1.84326171875, 2.02117919921875, 2.1990966796875, 2.37701416015625, 2.554931640625, 2.73284912109375, 2.9107666015625, 3.08868408203125, 3.2666015625, 3.44451904296875, 3.6224365234375, 3.80035400390625, 3.978271484375, 4.15618896484375, 4.3341064453125, 4.51202392578125, 4.68994140625, 4.86785888671875, 5.0457763671875, 5.22369384765625, 5.401611328125, 5.57952880859375, 5.7574462890625, 5.93536376953125, 6.11328125]}, "gradients/decoder.transformer.h.18.crossattention.c_proj.weight": {"_type": "histogram", "values": [3.0, 3.0, 3.0, 4.0, 3.0, 7.0, 10.0, 11.0, 18.0, 33.0, 49.0, 72.0, 119.0, 165.0, 263.0, 381.0, 542.0, 862.0, 1236.0, 1907.0, 2757.0, 4087.0, 6463.0, 9930.0, 15191.0, 24199.0, 39392.0, 68817.0, 131765.0, 354953.0, 174561.0, 84423.0, 47043.0, 28227.0, 17781.0, 11325.0, 7425.0, 4886.0, 3227.0, 2133.0, 1390.0, 971.0, 643.0, 445.0, 290.0, 199.0, 97.0, 101.0, 49.0, 38.0, 22.0, 21.0, 13.0, 7.0, 3.0, 4.0, 2.0, 0.0, 3.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.421875, -1.3739471435546875, -1.326019287109375, -1.2780914306640625, -1.23016357421875, -1.1822357177734375, -1.134307861328125, -1.0863800048828125, -1.0384521484375, -0.9905242919921875, -0.942596435546875, -0.8946685791015625, -0.84674072265625, -0.7988128662109375, -0.750885009765625, -0.7029571533203125, -0.655029296875, -0.6071014404296875, -0.559173583984375, -0.5112457275390625, -0.46331787109375, -0.4153900146484375, -0.367462158203125, -0.3195343017578125, -0.2716064453125, -0.2236785888671875, -0.175750732421875, -0.1278228759765625, -0.07989501953125, -0.0319671630859375, 0.015960693359375, 0.0638885498046875, 0.11181640625, 0.1597442626953125, 0.207672119140625, 0.2555999755859375, 0.30352783203125, 0.3514556884765625, 0.399383544921875, 0.4473114013671875, 0.4952392578125, 0.5431671142578125, 0.591094970703125, 0.6390228271484375, 0.68695068359375, 0.7348785400390625, 0.782806396484375, 0.8307342529296875, 0.878662109375, 0.9265899658203125, 0.974517822265625, 1.0224456787109375, 1.07037353515625, 1.1183013916015625, 1.166229248046875, 1.2141571044921875, 1.2620849609375, 1.3100128173828125, 1.357940673828125, 1.4058685302734375, 1.45379638671875, 1.5017242431640625, 1.549652099609375, 1.5975799560546875, 1.6455078125]}, "gradients/decoder.transformer.h.18.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 4.0, 4.0, 3.0, 5.0, 6.0, 11.0, 3.0, 6.0, 4.0, 11.0, 22.0, 17.0, 24.0, 30.0, 33.0, 39.0, 39.0, 48.0, 50.0, 52.0, 42.0, 1065.0, 41.0, 43.0, 33.0, 37.0, 33.0, 45.0, 34.0, 41.0, 38.0, 33.0, 22.0, 23.0, 22.0, 11.0, 23.0, 6.0, 17.0, 2.0, 6.0, 3.0, 1.0, 1.0, 3.0, 3.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.890625, -3.75640869140625, -3.6221923828125, -3.48797607421875, -3.353759765625, -3.21954345703125, -3.0853271484375, -2.95111083984375, -2.81689453125, -2.68267822265625, -2.5484619140625, -2.41424560546875, -2.280029296875, -2.14581298828125, -2.0115966796875, -1.87738037109375, -1.7431640625, -1.60894775390625, -1.4747314453125, -1.34051513671875, -1.206298828125, -1.07208251953125, -0.9378662109375, -0.80364990234375, -0.66943359375, -0.53521728515625, -0.4010009765625, -0.26678466796875, -0.132568359375, 0.00164794921875, 0.1358642578125, 0.27008056640625, 0.404296875, 0.53851318359375, 0.6727294921875, 0.80694580078125, 0.941162109375, 1.07537841796875, 1.2095947265625, 1.34381103515625, 1.47802734375, 1.61224365234375, 1.7464599609375, 1.88067626953125, 2.014892578125, 2.14910888671875, 2.2833251953125, 2.41754150390625, 2.5517578125, 2.68597412109375, 2.8201904296875, 2.95440673828125, 3.088623046875, 3.22283935546875, 3.3570556640625, 3.49127197265625, 3.62548828125, 3.75970458984375, 3.8939208984375, 4.02813720703125, 4.162353515625, 4.29656982421875, 4.4307861328125, 4.56500244140625, 4.69921875]}, "gradients/decoder.transformer.h.18.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 3.0, 2.0, 3.0, 1.0, 5.0, 6.0, 10.0, 21.0, 21.0, 39.0, 62.0, 88.0, 138.0, 267.0, 412.0, 750.0, 1331.0, 2483.0, 4823.0, 9215.0, 18998.0, 41224.0, 94556.0, 255864.0, 1443172.0, 122523.0, 52857.0, 24009.0, 11634.0, 5733.0, 3093.0, 1687.0, 939.0, 456.0, 267.0, 186.0, 98.0, 55.0, 34.0, 15.0, 24.0, 12.0, 11.0, 5.0, 4.0, 4.0, 3.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0], "bins": [-2.59765625, -2.52374267578125, -2.4498291015625, -2.37591552734375, -2.302001953125, -2.22808837890625, -2.1541748046875, -2.08026123046875, -2.00634765625, -1.93243408203125, -1.8585205078125, -1.78460693359375, -1.710693359375, -1.63677978515625, -1.5628662109375, -1.48895263671875, -1.4150390625, -1.34112548828125, -1.2672119140625, -1.19329833984375, -1.119384765625, -1.04547119140625, -0.9715576171875, -0.89764404296875, -0.82373046875, -0.74981689453125, -0.6759033203125, -0.60198974609375, -0.528076171875, -0.45416259765625, -0.3802490234375, -0.30633544921875, -0.232421875, -0.15850830078125, -0.0845947265625, -0.01068115234375, 0.063232421875, 0.13714599609375, 0.2110595703125, 0.28497314453125, 0.35888671875, 0.43280029296875, 0.5067138671875, 0.58062744140625, 0.654541015625, 0.72845458984375, 0.8023681640625, 0.87628173828125, 0.9501953125, 1.02410888671875, 1.0980224609375, 1.17193603515625, 1.245849609375, 1.31976318359375, 1.3936767578125, 1.46759033203125, 1.54150390625, 1.61541748046875, 1.6893310546875, 1.76324462890625, 1.837158203125, 1.91107177734375, 1.9849853515625, 2.05889892578125, 2.1328125]}, "gradients/decoder.transformer.h.18.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 2.0, 1.0, 3.0, 6.0, 4.0, 10.0, 14.0, 18.0, 14.0, 24.0, 20.0, 32.0, 35.0, 37.0, 34.0, 60.0, 55.0, 98.0, 91.0, 76.0, 61.0, 43.0, 34.0, 39.0, 38.0, 32.0, 24.0, 23.0, 17.0, 10.0, 12.0, 8.0, 7.0, 6.0, 5.0, 4.0, 4.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0009975433349609375, -0.0009656399488449097, -0.0009337365627288818, -0.000901833176612854, -0.0008699297904968262, -0.0008380264043807983, -0.0008061230182647705, -0.0007742196321487427, -0.0007423162460327148, -0.000710412859916687, -0.0006785094738006592, -0.0006466060876846313, -0.0006147027015686035, -0.0005827993154525757, -0.0005508959293365479, -0.00051899254322052, -0.0004870891571044922, -0.00045518577098846436, -0.0004232823848724365, -0.0003913789987564087, -0.00035947561264038086, -0.00032757222652435303, -0.0002956688404083252, -0.00026376545429229736, -0.00023186206817626953, -0.0001999586820602417, -0.00016805529594421387, -0.00013615190982818604, -0.0001042485237121582, -7.234513759613037e-05, -4.044175148010254e-05, -8.538365364074707e-06, 2.3365020751953125e-05, 5.526840686798096e-05, 8.717179298400879e-05, 0.00011907517910003662, 0.00015097856521606445, 0.00018288195133209229, 0.00021478533744812012, 0.00024668872356414795, 0.0002785921096801758, 0.0003104954957962036, 0.00034239888191223145, 0.0003743022680282593, 0.0004062056541442871, 0.00043810904026031494, 0.0004700124263763428, 0.0005019158124923706, 0.0005338191986083984, 0.0005657225847244263, 0.0005976259708404541, 0.0006295293569564819, 0.0006614327430725098, 0.0006933361291885376, 0.0007252395153045654, 0.0007571429014205933, 0.0007890462875366211, 0.0008209496736526489, 0.0008528530597686768, 0.0008847564458847046, 0.0009166598320007324, 0.0009485632181167603, 0.000980466604232788, 0.001012369990348816, 0.0010442733764648438]}, "gradients/decoder.transformer.h.18.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 2.0, 3.0, 4.0, 5.0, 3.0, 1.0, 6.0, 9.0, 9.0, 11.0, 13.0, 17.0, 24.0, 36.0, 38.0, 66.0, 105.0, 178.0, 358.0, 867.0, 884533.0, 160687.0, 789.0, 335.0, 164.0, 79.0, 51.0, 39.0, 42.0, 26.0, 14.0, 8.0, 7.0, 8.0, 7.0, 9.0, 2.0, 4.0, 2.0, 3.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.0288543701171875, -0.027978181838989258, -0.027101993560791016, -0.026225805282592773, -0.02534961700439453, -0.02447342872619629, -0.023597240447998047, -0.022721052169799805, -0.021844863891601562, -0.02096867561340332, -0.020092487335205078, -0.019216299057006836, -0.018340110778808594, -0.01746392250061035, -0.01658773422241211, -0.015711545944213867, -0.014835357666015625, -0.013959169387817383, -0.01308298110961914, -0.012206792831420898, -0.011330604553222656, -0.010454416275024414, -0.009578227996826172, -0.00870203971862793, -0.007825851440429688, -0.006949663162231445, -0.006073474884033203, -0.005197286605834961, -0.004321098327636719, -0.0034449100494384766, -0.0025687217712402344, -0.0016925334930419922, -0.00081634521484375, 5.984306335449219e-05, 0.0009360313415527344, 0.0018122196197509766, 0.0026884078979492188, 0.003564596176147461, 0.004440784454345703, 0.005316972732543945, 0.0061931610107421875, 0.00706934928894043, 0.007945537567138672, 0.008821725845336914, 0.009697914123535156, 0.010574102401733398, 0.01145029067993164, 0.012326478958129883, 0.013202667236328125, 0.014078855514526367, 0.01495504379272461, 0.01583123207092285, 0.016707420349121094, 0.017583608627319336, 0.018459796905517578, 0.01933598518371582, 0.020212173461914062, 0.021088361740112305, 0.021964550018310547, 0.02284073829650879, 0.02371692657470703, 0.024593114852905273, 0.025469303131103516, 0.026345491409301758, 0.0272216796875]}, "gradients/decoder.transformer.h.18.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 6.0, 41.0, 347.0, 516.0, 97.0, 8.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0035275027621537447, -0.0034557655453681946, -0.0033840283285826445, -0.0033122911117970943, -0.0032405538950115442, -0.003168816678225994, -0.003097079461440444, -0.003025342244654894, -0.0029536052606999874, -0.0028818680439144373, -0.002810130827128887, -0.002738393610343337, -0.002666656393557787, -0.002594919176772237, -0.0025231819599866867, -0.0024514449760317802, -0.0023797075264155865, -0.0023079703096300364, -0.0022362330928444862, -0.002164495876058936, -0.002092758659273386, -0.002021021442487836, -0.0019492843421176076, -0.0018775471253320575, -0.0018058099085465074, -0.0017340726917609572, -0.0016623354749754071, -0.001590598258189857, -0.0015188611578196287, -0.0014471239410340786, -0.0013753867242485285, -0.0013036495074629784, -0.001231912523508072, -0.0011601753067225218, -0.0010884380899369717, -0.0010167008731514215, -0.0009449637145735323, -0.0008732264977879822, -0.000801489339210093, -0.0007297521224245429, -0.0006580149056389928, -0.0005862776888534427, -0.0005145404720678926, -0.00044280331349000335, -0.00037106609670445323, -0.0002993288799189031, -0.00022759169223718345, -0.0001558545045554638, -8.411728776991367e-05, -1.2380085536278784e-05, 5.9357116697356105e-05, 0.000131094318930991, 0.00020283152116462588, 0.000274568737950176, 0.00034630592563189566, 0.0004180431133136153, 0.0004897803300991654, 0.0005615175468847156, 0.0006332547636702657, 0.0007049919222481549, 0.000776729139033705, 0.0008484663558192551, 0.0009202035143971443, 0.0009919407311826944, 0.0010636779479682446]}, "gradients/decoder.transformer.h.18.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 6.0, 1.0, 4.0, 4.0, 5.0, 1.0, 11.0, 8.0, 5.0, 12.0, 4.0, 19.0, 14.0, 11.0, 15.0, 23.0, 22.0, 29.0, 23.0, 18.0, 36.0, 39.0, 37.0, 39.0, 40.0, 45.0, 34.0, 36.0, 33.0, 41.0, 41.0, 26.0, 37.0, 36.0, 26.0, 23.0, 36.0, 23.0, 23.0, 15.0, 20.0, 19.0, 12.0, 16.0, 10.0, 6.0, 6.0, 9.0, 4.0, 7.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 5.0, 1.0, 0.0, 1.0], "bins": [-0.0004329085350036621, -0.00041897501796483994, -0.00040504150092601776, -0.0003911079838871956, -0.0003771744668483734, -0.00036324094980955124, -0.00034930743277072906, -0.0003353739157319069, -0.0003214403986930847, -0.00030750688165426254, -0.00029357336461544037, -0.0002796398475766182, -0.000265706330537796, -0.00025177281349897385, -0.00023783929646015167, -0.0002239057794213295, -0.00020997226238250732, -0.00019603874534368515, -0.00018210522830486298, -0.0001681717112660408, -0.00015423819422721863, -0.00014030467718839645, -0.00012637116014957428, -0.0001124376431107521, -9.850412607192993e-05, -8.457060903310776e-05, -7.063709199428558e-05, -5.670357495546341e-05, -4.2770057916641235e-05, -2.883654087781906e-05, -1.4903023838996887e-05, -9.695068001747131e-07, 1.2964010238647461e-05, 2.6897527277469635e-05, 4.083104431629181e-05, 5.476456135511398e-05, 6.869807839393616e-05, 8.263159543275833e-05, 9.65651124715805e-05, 0.00011049862951040268, 0.00012443214654922485, 0.00013836566358804703, 0.0001522991806268692, 0.00016623269766569138, 0.00018016621470451355, 0.00019409973174333572, 0.0002080332487821579, 0.00022196676582098007, 0.00023590028285980225, 0.0002498337998986244, 0.0002637673169374466, 0.00027770083397626877, 0.00029163435101509094, 0.0003055678680539131, 0.0003195013850927353, 0.00033343490213155746, 0.00034736841917037964, 0.0003613019362092018, 0.000375235453248024, 0.00038916897028684616, 0.00040310248732566833, 0.0004170360043644905, 0.0004309695214033127, 0.00044490303844213486, 0.00045883655548095703]}, "gradients/decoder.transformer.h.18.attn.c_proj.bias": {"_type": "histogram", "values": [3.0, 3.0, 2.0, 1.0, 2.0, 3.0, 5.0, 2.0, 4.0, 3.0, 8.0, 9.0, 12.0, 9.0, 10.0, 13.0, 15.0, 17.0, 23.0, 22.0, 26.0, 24.0, 24.0, 40.0, 43.0, 37.0, 46.0, 32.0, 44.0, 36.0, 59.0, 32.0, 43.0, 39.0, 37.0, 36.0, 26.0, 36.0, 23.0, 24.0, 18.0, 21.0, 19.0, 21.0, 12.0, 8.0, 10.0, 7.0, 5.0, 8.0, 2.0, 4.0, 4.0, 1.0, 0.0, 4.0, 2.0, 0.0, 3.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-5.2734375, -5.09552001953125, -4.9176025390625, -4.73968505859375, -4.561767578125, -4.38385009765625, -4.2059326171875, -4.02801513671875, -3.85009765625, -3.67218017578125, -3.4942626953125, -3.31634521484375, -3.138427734375, -2.96051025390625, -2.7825927734375, -2.60467529296875, -2.4267578125, -2.24884033203125, -2.0709228515625, -1.89300537109375, -1.715087890625, -1.53717041015625, -1.3592529296875, -1.18133544921875, -1.00341796875, -0.82550048828125, -0.6475830078125, -0.46966552734375, -0.291748046875, -0.11383056640625, 0.0640869140625, 0.24200439453125, 0.419921875, 0.59783935546875, 0.7757568359375, 0.95367431640625, 1.131591796875, 1.30950927734375, 1.4874267578125, 1.66534423828125, 1.84326171875, 2.02117919921875, 2.1990966796875, 2.37701416015625, 2.554931640625, 2.73284912109375, 2.9107666015625, 3.08868408203125, 3.2666015625, 3.44451904296875, 3.6224365234375, 3.80035400390625, 3.978271484375, 4.15618896484375, 4.3341064453125, 4.51202392578125, 4.68994140625, 4.86785888671875, 5.0457763671875, 5.22369384765625, 5.401611328125, 5.57952880859375, 5.7574462890625, 5.93536376953125, 6.11328125]}, "gradients/decoder.transformer.h.18.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 1.0, 1.0, 3.0, 1.0, 2.0, 6.0, 7.0, 6.0, 15.0, 26.0, 43.0, 44.0, 62.0, 113.0, 157.0, 206.0, 317.0, 503.0, 680.0, 997.0, 1470.0, 2216.0, 3216.0, 4847.0, 7508.0, 12538.0, 21805.0, 43421.0, 100642.0, 320403.0, 324302.0, 102170.0, 43847.0, 22076.0, 12294.0, 7714.0, 4808.0, 3244.0, 2193.0, 1455.0, 1009.0, 689.0, 472.0, 318.0, 209.0, 154.0, 95.0, 90.0, 37.0, 44.0, 37.0, 16.0, 11.0, 14.0, 8.0, 1.0, 2.0, 4.0, 0.0, 2.0, 1.0], "bins": [-3.3984375, -3.29547119140625, -3.1925048828125, -3.08953857421875, -2.986572265625, -2.88360595703125, -2.7806396484375, -2.67767333984375, -2.57470703125, -2.47174072265625, -2.3687744140625, -2.26580810546875, -2.162841796875, -2.05987548828125, -1.9569091796875, -1.85394287109375, -1.7509765625, -1.64801025390625, -1.5450439453125, -1.44207763671875, -1.339111328125, -1.23614501953125, -1.1331787109375, -1.03021240234375, -0.92724609375, -0.82427978515625, -0.7213134765625, -0.61834716796875, -0.515380859375, -0.41241455078125, -0.3094482421875, -0.20648193359375, -0.103515625, -0.00054931640625, 0.1024169921875, 0.20538330078125, 0.308349609375, 0.41131591796875, 0.5142822265625, 0.61724853515625, 0.72021484375, 0.82318115234375, 0.9261474609375, 1.02911376953125, 1.132080078125, 1.23504638671875, 1.3380126953125, 1.44097900390625, 1.5439453125, 1.64691162109375, 1.7498779296875, 1.85284423828125, 1.955810546875, 2.05877685546875, 2.1617431640625, 2.26470947265625, 2.36767578125, 2.47064208984375, 2.5736083984375, 2.67657470703125, 2.779541015625, 2.88250732421875, 2.9854736328125, 3.08843994140625, 3.19140625]}, "gradients/decoder.transformer.h.18.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 3.0, 0.0, 0.0, 2.0, 3.0, 1.0, 2.0, 5.0, 1.0, 5.0, 5.0, 17.0, 17.0, 14.0, 11.0, 16.0, 19.0, 22.0, 32.0, 32.0, 18.0, 31.0, 41.0, 30.0, 27.0, 45.0, 58.0, 147.0, 1761.0, 200.0, 75.0, 48.0, 35.0, 38.0, 45.0, 28.0, 20.0, 28.0, 29.0, 19.0, 23.0, 21.0, 14.0, 9.0, 21.0, 9.0, 9.0, 7.0, 7.0, 6.0, 2.0, 0.0, 1.0, 2.0, 2.0, 1.0, 0.0, 1.0, 2.0, 0.0, 1.0], "bins": [-17.890625, -17.326171875, -16.76171875, -16.197265625, -15.6328125, -15.068359375, -14.50390625, -13.939453125, -13.375, -12.810546875, -12.24609375, -11.681640625, -11.1171875, -10.552734375, -9.98828125, -9.423828125, -8.859375, -8.294921875, -7.73046875, -7.166015625, -6.6015625, -6.037109375, -5.47265625, -4.908203125, -4.34375, -3.779296875, -3.21484375, -2.650390625, -2.0859375, -1.521484375, -0.95703125, -0.392578125, 0.171875, 0.736328125, 1.30078125, 1.865234375, 2.4296875, 2.994140625, 3.55859375, 4.123046875, 4.6875, 5.251953125, 5.81640625, 6.380859375, 6.9453125, 7.509765625, 8.07421875, 8.638671875, 9.203125, 9.767578125, 10.33203125, 10.896484375, 11.4609375, 12.025390625, 12.58984375, 13.154296875, 13.71875, 14.283203125, 14.84765625, 15.412109375, 15.9765625, 16.541015625, 17.10546875, 17.669921875, 18.234375]}, "gradients/decoder.transformer.h.18.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 1.0, 0.0, 3.0, 3.0, 2.0, 3.0, 2.0, 10.0, 10.0, 9.0, 11.0, 15.0, 14.0, 20.0, 31.0, 31.0, 31.0, 58.0, 80.0, 103.0, 152.0, 198.0, 349.0, 709.0, 4810.0, 1267925.0, 1864134.0, 5111.0, 759.0, 318.0, 216.0, 159.0, 108.0, 65.0, 48.0, 44.0, 35.0, 29.0, 18.0, 18.0, 19.0, 17.0, 10.0, 8.0, 3.0, 6.0, 3.0, 2.0, 1.0, 2.0, 1.0, 3.0, 2.0, 0.0, 1.0, 2.0], "bins": [-37.5625, -36.4580078125, -35.353515625, -34.2490234375, -33.14453125, -32.0400390625, -30.935546875, -29.8310546875, -28.7265625, -27.6220703125, -26.517578125, -25.4130859375, -24.30859375, -23.2041015625, -22.099609375, -20.9951171875, -19.890625, -18.7861328125, -17.681640625, -16.5771484375, -15.47265625, -14.3681640625, -13.263671875, -12.1591796875, -11.0546875, -9.9501953125, -8.845703125, -7.7412109375, -6.63671875, -5.5322265625, -4.427734375, -3.3232421875, -2.21875, -1.1142578125, -0.009765625, 1.0947265625, 2.19921875, 3.3037109375, 4.408203125, 5.5126953125, 6.6171875, 7.7216796875, 8.826171875, 9.9306640625, 11.03515625, 12.1396484375, 13.244140625, 14.3486328125, 15.453125, 16.5576171875, 17.662109375, 18.7666015625, 19.87109375, 20.9755859375, 22.080078125, 23.1845703125, 24.2890625, 25.3935546875, 26.498046875, 27.6025390625, 28.70703125, 29.8115234375, 30.916015625, 32.0205078125, 33.125]}, "gradients/decoder.transformer.h.18.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 36.0, 278.0, 548.0, 141.0, 11.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-92.32762908935547, -89.78218078613281, -87.23673248291016, -84.6912841796875, -82.14583587646484, -79.60038757324219, -77.05493927001953, -74.50949096679688, -71.96403503417969, -69.41858673095703, -66.87313842773438, -64.32769012451172, -61.78224182128906, -59.236793518066406, -56.691341400146484, -54.14589309692383, -51.60044860839844, -49.05500030517578, -46.509552001953125, -43.96410369873047, -41.41865539550781, -38.873207092285156, -36.327754974365234, -33.78230667114258, -31.236858367919922, -28.691410064697266, -26.14596176147461, -23.60051155090332, -21.055063247680664, -18.509614944458008, -15.964165687561035, -13.418716430664062, -10.873268127441406, -8.32781982421875, -5.782370567321777, -3.236921787261963, -0.6914730072021484, 1.8539752960205078, 4.3994245529174805, 6.944873809814453, 9.49032211303711, 12.035770416259766, 14.581219673156738, 17.12666893005371, 19.672117233276367, 22.217565536499023, 24.763015747070312, 27.30846405029297, 29.853912353515625, 32.39936065673828, 34.94480895996094, 37.490257263183594, 40.03570556640625, 42.581153869628906, 45.12660598754883, 47.672054290771484, 50.21750259399414, 52.7629508972168, 55.30839920043945, 57.85384750366211, 60.39929962158203, 62.94474792480469, 65.49019622802734, 68.03564453125, 70.58109283447266]}, "gradients/decoder.transformer.h.18.ln_1.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 3.0, 3.0, 6.0, 3.0, 6.0, 7.0, 6.0, 9.0, 8.0, 14.0, 16.0, 21.0, 16.0, 18.0, 19.0, 22.0, 23.0, 34.0, 29.0, 37.0, 46.0, 41.0, 38.0, 42.0, 45.0, 41.0, 48.0, 46.0, 44.0, 25.0, 35.0, 43.0, 36.0, 27.0, 29.0, 10.0, 22.0, 15.0, 14.0, 12.0, 10.0, 8.0, 9.0, 10.0, 0.0, 5.0, 7.0, 4.0, 2.0, 3.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-52.57984924316406, -50.75518035888672, -48.930511474609375, -47.10584259033203, -45.28117752075195, -43.45650863647461, -41.631839752197266, -39.80717086791992, -37.98250198364258, -36.157833099365234, -34.33316421508789, -32.50849914550781, -30.683828353881836, -28.859161376953125, -27.03449249267578, -25.209823608398438, -23.385156631469727, -21.560487747192383, -19.735820770263672, -17.911151885986328, -16.086483001708984, -14.261815071105957, -12.43714714050293, -10.612478256225586, -8.787810325622559, -6.963141918182373, -5.1384735107421875, -3.31380558013916, -1.4891371726989746, 0.33553123474121094, 2.1601991653442383, 3.984868049621582, 5.809535980224609, 7.634204387664795, 9.45887279510498, 11.283540725708008, 13.108209609985352, 14.932877540588379, 16.757545471191406, 18.58221435546875, 20.406883239746094, 22.231552124023438, 24.05621910095215, 25.880887985229492, 27.705556869506836, 29.530223846435547, 31.35489273071289, 33.179561614990234, 35.00422668457031, 36.828895568847656, 38.653564453125, 40.478233337402344, 42.30289840698242, 44.127567291259766, 45.95223617553711, 47.77690505981445, 49.6015739440918, 51.42624282836914, 53.250911712646484, 55.07557678222656, 56.900245666503906, 58.72491455078125, 60.549583435058594, 62.37425231933594, 64.19892120361328]}, "gradients/decoder.transformer.h.17.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 3.0, 2.0, 1.0, 1.0, 4.0, 4.0, 3.0, 2.0, 5.0, 9.0, 6.0, 14.0, 6.0, 18.0, 9.0, 15.0, 20.0, 23.0, 20.0, 32.0, 34.0, 32.0, 36.0, 36.0, 38.0, 44.0, 38.0, 48.0, 48.0, 50.0, 39.0, 40.0, 38.0, 31.0, 49.0, 20.0, 27.0, 31.0, 17.0, 21.0, 20.0, 16.0, 8.0, 11.0, 11.0, 8.0, 6.0, 8.0, 2.0, 2.0, 4.0, 2.0, 0.0, 4.0, 1.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-5.46484375, -5.27679443359375, -5.0887451171875, -4.90069580078125, -4.712646484375, -4.52459716796875, -4.3365478515625, -4.14849853515625, -3.96044921875, -3.77239990234375, -3.5843505859375, -3.39630126953125, -3.208251953125, -3.02020263671875, -2.8321533203125, -2.64410400390625, -2.4560546875, -2.26800537109375, -2.0799560546875, -1.89190673828125, -1.703857421875, -1.51580810546875, -1.3277587890625, -1.13970947265625, -0.95166015625, -0.76361083984375, -0.5755615234375, -0.38751220703125, -0.199462890625, -0.01141357421875, 0.1766357421875, 0.36468505859375, 0.552734375, 0.74078369140625, 0.9288330078125, 1.11688232421875, 1.304931640625, 1.49298095703125, 1.6810302734375, 1.86907958984375, 2.05712890625, 2.24517822265625, 2.4332275390625, 2.62127685546875, 2.809326171875, 2.99737548828125, 3.1854248046875, 3.37347412109375, 3.5615234375, 3.74957275390625, 3.9376220703125, 4.12567138671875, 4.313720703125, 4.50177001953125, 4.6898193359375, 4.87786865234375, 5.06591796875, 5.25396728515625, 5.4420166015625, 5.63006591796875, 5.818115234375, 6.00616455078125, 6.1942138671875, 6.38226318359375, 6.5703125]}, "gradients/decoder.transformer.h.17.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 3.0, 0.0, 4.0, 0.0, 4.0, 5.0, 1.0, 6.0, 13.0, 12.0, 16.0, 25.0, 29.0, 32.0, 38.0, 66.0, 66.0, 96.0, 196.0, 253.0, 426.0, 691.0, 1242.0, 2242.0, 4572.0, 10241.0, 25813.0, 69927.0, 200910.0, 536122.0, 1101646.0, 1203087.0, 647089.0, 245882.0, 86858.0, 32579.0, 12562.0, 5415.0, 2619.0, 1402.0, 739.0, 412.0, 304.0, 172.0, 146.0, 85.0, 56.0, 49.0, 27.0, 29.0, 23.0, 23.0, 14.0, 5.0, 4.0, 9.0, 4.0, 4.0, 3.0, 0.0, 1.0, 1.0, 3.0], "bins": [-5.359375, -5.19244384765625, -5.0255126953125, -4.85858154296875, -4.691650390625, -4.52471923828125, -4.3577880859375, -4.19085693359375, -4.02392578125, -3.85699462890625, -3.6900634765625, -3.52313232421875, -3.356201171875, -3.18927001953125, -3.0223388671875, -2.85540771484375, -2.6884765625, -2.52154541015625, -2.3546142578125, -2.18768310546875, -2.020751953125, -1.85382080078125, -1.6868896484375, -1.51995849609375, -1.35302734375, -1.18609619140625, -1.0191650390625, -0.85223388671875, -0.685302734375, -0.51837158203125, -0.3514404296875, -0.18450927734375, -0.017578125, 0.14935302734375, 0.3162841796875, 0.48321533203125, 0.650146484375, 0.81707763671875, 0.9840087890625, 1.15093994140625, 1.31787109375, 1.48480224609375, 1.6517333984375, 1.81866455078125, 1.985595703125, 2.15252685546875, 2.3194580078125, 2.48638916015625, 2.6533203125, 2.82025146484375, 2.9871826171875, 3.15411376953125, 3.321044921875, 3.48797607421875, 3.6549072265625, 3.82183837890625, 3.98876953125, 4.15570068359375, 4.3226318359375, 4.48956298828125, 4.656494140625, 4.82342529296875, 4.9903564453125, 5.15728759765625, 5.32421875]}, "gradients/decoder.transformer.h.17.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 2.0, 2.0, 3.0, 5.0, 5.0, 3.0, 7.0, 7.0, 7.0, 8.0, 14.0, 12.0, 22.0, 32.0, 42.0, 44.0, 60.0, 68.0, 83.0, 115.0, 166.0, 225.0, 273.0, 335.0, 353.0, 397.0, 355.0, 355.0, 229.0, 210.0, 143.0, 102.0, 93.0, 75.0, 59.0, 40.0, 36.0, 28.0, 18.0, 15.0, 14.0, 7.0, 6.0, 6.0, 2.0, 3.0, 1.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-14.0859375, -13.6759033203125, -13.265869140625, -12.8558349609375, -12.44580078125, -12.0357666015625, -11.625732421875, -11.2156982421875, -10.8056640625, -10.3956298828125, -9.985595703125, -9.5755615234375, -9.16552734375, -8.7554931640625, -8.345458984375, -7.9354248046875, -7.525390625, -7.1153564453125, -6.705322265625, -6.2952880859375, -5.88525390625, -5.4752197265625, -5.065185546875, -4.6551513671875, -4.2451171875, -3.8350830078125, -3.425048828125, -3.0150146484375, -2.60498046875, -2.1949462890625, -1.784912109375, -1.3748779296875, -0.96484375, -0.5548095703125, -0.144775390625, 0.2652587890625, 0.67529296875, 1.0853271484375, 1.495361328125, 1.9053955078125, 2.3154296875, 2.7254638671875, 3.135498046875, 3.5455322265625, 3.95556640625, 4.3656005859375, 4.775634765625, 5.1856689453125, 5.595703125, 6.0057373046875, 6.415771484375, 6.8258056640625, 7.23583984375, 7.6458740234375, 8.055908203125, 8.4659423828125, 8.8759765625, 9.2860107421875, 9.696044921875, 10.1060791015625, 10.51611328125, 10.9261474609375, 11.336181640625, 11.7462158203125, 12.15625]}, "gradients/decoder.transformer.h.17.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 2.0, 5.0, 3.0, 2.0, 3.0, 7.0, 12.0, 11.0, 13.0, 27.0, 26.0, 43.0, 45.0, 57.0, 84.0, 116.0, 175.0, 288.0, 478.0, 2464.0, 1855896.0, 2330452.0, 2622.0, 528.0, 281.0, 179.0, 124.0, 100.0, 60.0, 41.0, 29.0, 34.0, 22.0, 13.0, 12.0, 10.0, 6.0, 5.0, 6.0, 4.0, 2.0, 2.0, 1.0, 1.0, 1.0, 0.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-73.1875, -70.9013671875, -68.615234375, -66.3291015625, -64.04296875, -61.7568359375, -59.470703125, -57.1845703125, -54.8984375, -52.6123046875, -50.326171875, -48.0400390625, -45.75390625, -43.4677734375, -41.181640625, -38.8955078125, -36.609375, -34.3232421875, -32.037109375, -29.7509765625, -27.46484375, -25.1787109375, -22.892578125, -20.6064453125, -18.3203125, -16.0341796875, -13.748046875, -11.4619140625, -9.17578125, -6.8896484375, -4.603515625, -2.3173828125, -0.03125, 2.2548828125, 4.541015625, 6.8271484375, 9.11328125, 11.3994140625, 13.685546875, 15.9716796875, 18.2578125, 20.5439453125, 22.830078125, 25.1162109375, 27.40234375, 29.6884765625, 31.974609375, 34.2607421875, 36.546875, 38.8330078125, 41.119140625, 43.4052734375, 45.69140625, 47.9775390625, 50.263671875, 52.5498046875, 54.8359375, 57.1220703125, 59.408203125, 61.6943359375, 63.98046875, 66.2666015625, 68.552734375, 70.8388671875, 73.125]}, "gradients/decoder.transformer.h.17.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 39.0, 200.0, 464.0, 265.0, 45.0, 1.0, 2.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-166.6898193359375, -159.6375274658203, -152.58523559570312, -145.53292846679688, -138.4806365966797, -131.4283447265625, -124.37605285644531, -117.32376098632812, -110.27146911621094, -103.21917724609375, -96.16687774658203, -89.11458587646484, -82.06229400634766, -75.00999450683594, -67.95770263671875, -60.90541076660156, -53.853111267089844, -46.80081558227539, -39.7485237121582, -32.69622802734375, -25.64393424987793, -18.59164047241211, -11.539344787597656, -4.487052917480469, 2.5652427673339844, 9.617536544799805, 16.669830322265625, 23.722126007080078, 30.7744197845459, 37.82671356201172, 44.87900924682617, 51.93130111694336, 58.98359680175781, 66.035888671875, 73.08818817138672, 80.1404800415039, 87.1927719116211, 94.24507141113281, 101.29736328125, 108.34965515136719, 115.40194702148438, 122.45423889160156, 129.50653076171875, 136.558837890625, 143.6111297607422, 150.66342163085938, 157.71571350097656, 164.76800537109375, 171.8203125, 178.8726043701172, 185.92489624023438, 192.97720336914062, 200.0294952392578, 207.081787109375, 214.1340789794922, 221.18637084960938, 228.23866271972656, 235.29095458984375, 242.34324645996094, 249.39553833007812, 256.4478454589844, 263.5001220703125, 270.55242919921875, 277.604736328125, 284.6570129394531]}, "gradients/decoder.transformer.h.17.ln_2.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 1.0, 1.0, 2.0, 4.0, 5.0, 4.0, 6.0, 10.0, 9.0, 8.0, 10.0, 11.0, 16.0, 22.0, 13.0, 16.0, 30.0, 42.0, 35.0, 26.0, 39.0, 26.0, 28.0, 39.0, 45.0, 41.0, 43.0, 28.0, 36.0, 37.0, 28.0, 37.0, 45.0, 25.0, 29.0, 26.0, 29.0, 27.0, 19.0, 20.0, 18.0, 14.0, 14.0, 2.0, 11.0, 7.0, 9.0, 4.0, 9.0, 2.0, 3.0, 5.0, 1.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-46.62152099609375, -45.07754898071289, -43.53357696533203, -41.98960876464844, -40.44563674926758, -38.90166473388672, -37.357696533203125, -35.813724517822266, -34.269752502441406, -32.72578048706055, -31.18181037902832, -29.637840270996094, -28.093868255615234, -26.549896240234375, -25.00592613220215, -23.461956024169922, -21.917984008789062, -20.374011993408203, -18.830041885375977, -17.28607177734375, -15.74209976196289, -14.198128700256348, -12.654157638549805, -11.110186576843262, -9.566215515136719, -8.022244453430176, -6.478273391723633, -4.93430233001709, -3.390331268310547, -1.846360206604004, -0.30238914489746094, 1.241581916809082, 2.785552978515625, 4.329524040222168, 5.873495101928711, 7.417466163635254, 8.961437225341797, 10.50540828704834, 12.049379348754883, 13.593350410461426, 15.137321472167969, 16.681293487548828, 18.225263595581055, 19.76923370361328, 21.31320571899414, 22.857177734375, 24.401147842407227, 25.945117950439453, 27.489089965820312, 29.033061981201172, 30.5770320892334, 32.121002197265625, 33.664974212646484, 35.208946228027344, 36.75291442871094, 38.2968864440918, 39.840858459472656, 41.384830474853516, 42.928802490234375, 44.47277069091797, 46.01674270629883, 47.56071472167969, 49.10468292236328, 50.64865493774414, 52.192626953125]}, "gradients/decoder.transformer.h.17.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 2.0, 2.0, 0.0, 0.0, 3.0, 4.0, 4.0, 4.0, 4.0, 3.0, 5.0, 8.0, 8.0, 20.0, 11.0, 14.0, 15.0, 18.0, 28.0, 25.0, 33.0, 33.0, 38.0, 34.0, 50.0, 34.0, 37.0, 44.0, 47.0, 47.0, 52.0, 39.0, 53.0, 42.0, 28.0, 31.0, 30.0, 25.0, 25.0, 12.0, 21.0, 8.0, 12.0, 16.0, 14.0, 11.0, 6.0, 5.0, 5.0, 2.0, 3.0, 1.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-6.16796875, -5.971435546875, -5.77490234375, -5.578369140625, -5.3818359375, -5.185302734375, -4.98876953125, -4.792236328125, -4.595703125, -4.399169921875, -4.20263671875, -4.006103515625, -3.8095703125, -3.613037109375, -3.41650390625, -3.219970703125, -3.0234375, -2.826904296875, -2.63037109375, -2.433837890625, -2.2373046875, -2.040771484375, -1.84423828125, -1.647705078125, -1.451171875, -1.254638671875, -1.05810546875, -0.861572265625, -0.6650390625, -0.468505859375, -0.27197265625, -0.075439453125, 0.12109375, 0.317626953125, 0.51416015625, 0.710693359375, 0.9072265625, 1.103759765625, 1.30029296875, 1.496826171875, 1.693359375, 1.889892578125, 2.08642578125, 2.282958984375, 2.4794921875, 2.676025390625, 2.87255859375, 3.069091796875, 3.265625, 3.462158203125, 3.65869140625, 3.855224609375, 4.0517578125, 4.248291015625, 4.44482421875, 4.641357421875, 4.837890625, 5.034423828125, 5.23095703125, 5.427490234375, 5.6240234375, 5.820556640625, 6.01708984375, 6.213623046875, 6.41015625]}, "gradients/decoder.transformer.h.17.crossattention.c_proj.weight": {"_type": "histogram", "values": [3.0, 0.0, 5.0, 9.0, 11.0, 14.0, 17.0, 33.0, 45.0, 60.0, 65.0, 111.0, 168.0, 219.0, 281.0, 416.0, 570.0, 786.0, 1209.0, 1640.0, 2104.0, 3149.0, 4229.0, 5846.0, 8546.0, 12470.0, 17831.0, 26353.0, 40503.0, 63535.0, 110549.0, 266065.0, 207910.0, 98101.0, 58349.0, 37241.0, 24264.0, 16621.0, 11512.0, 7909.0, 5674.0, 4037.0, 2790.0, 2028.0, 1511.0, 1130.0, 771.0, 515.0, 386.0, 292.0, 194.0, 146.0, 100.0, 79.0, 49.0, 33.0, 29.0, 27.0, 12.0, 13.0, 5.0, 2.0, 2.0, 2.0], "bins": [-1.283203125, -1.24298095703125, -1.2027587890625, -1.16253662109375, -1.122314453125, -1.08209228515625, -1.0418701171875, -1.00164794921875, -0.96142578125, -0.92120361328125, -0.8809814453125, -0.84075927734375, -0.800537109375, -0.76031494140625, -0.7200927734375, -0.67987060546875, -0.6396484375, -0.59942626953125, -0.5592041015625, -0.51898193359375, -0.478759765625, -0.43853759765625, -0.3983154296875, -0.35809326171875, -0.31787109375, -0.27764892578125, -0.2374267578125, -0.19720458984375, -0.156982421875, -0.11676025390625, -0.0765380859375, -0.03631591796875, 0.00390625, 0.04412841796875, 0.0843505859375, 0.12457275390625, 0.164794921875, 0.20501708984375, 0.2452392578125, 0.28546142578125, 0.32568359375, 0.36590576171875, 0.4061279296875, 0.44635009765625, 0.486572265625, 0.52679443359375, 0.5670166015625, 0.60723876953125, 0.6474609375, 0.68768310546875, 0.7279052734375, 0.76812744140625, 0.808349609375, 0.84857177734375, 0.8887939453125, 0.92901611328125, 0.96923828125, 1.00946044921875, 1.0496826171875, 1.08990478515625, 1.130126953125, 1.17034912109375, 1.2105712890625, 1.25079345703125, 1.291015625]}, "gradients/decoder.transformer.h.17.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 5.0, 2.0, 1.0, 5.0, 4.0, 3.0, 10.0, 10.0, 15.0, 8.0, 16.0, 24.0, 19.0, 23.0, 29.0, 30.0, 24.0, 31.0, 34.0, 43.0, 27.0, 46.0, 40.0, 54.0, 1073.0, 40.0, 36.0, 33.0, 36.0, 37.0, 48.0, 29.0, 33.0, 16.0, 21.0, 22.0, 21.0, 20.0, 15.0, 18.0, 10.0, 6.0, 5.0, 5.0, 2.0, 3.0, 4.0, 2.0, 5.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-4.35546875, -4.226409912109375, -4.09735107421875, -3.968292236328125, -3.8392333984375, -3.710174560546875, -3.58111572265625, -3.452056884765625, -3.322998046875, -3.193939208984375, -3.06488037109375, -2.935821533203125, -2.8067626953125, -2.677703857421875, -2.54864501953125, -2.419586181640625, -2.29052734375, -2.161468505859375, -2.03240966796875, -1.903350830078125, -1.7742919921875, -1.645233154296875, -1.51617431640625, -1.387115478515625, -1.258056640625, -1.128997802734375, -0.99993896484375, -0.870880126953125, -0.7418212890625, -0.612762451171875, -0.48370361328125, -0.354644775390625, -0.2255859375, -0.096527099609375, 0.03253173828125, 0.161590576171875, 0.2906494140625, 0.419708251953125, 0.54876708984375, 0.677825927734375, 0.806884765625, 0.935943603515625, 1.06500244140625, 1.194061279296875, 1.3231201171875, 1.452178955078125, 1.58123779296875, 1.710296630859375, 1.83935546875, 1.968414306640625, 2.09747314453125, 2.226531982421875, 2.3555908203125, 2.484649658203125, 2.61370849609375, 2.742767333984375, 2.871826171875, 3.000885009765625, 3.12994384765625, 3.259002685546875, 3.3880615234375, 3.517120361328125, 3.64617919921875, 3.775238037109375, 3.904296875]}, "gradients/decoder.transformer.h.17.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 3.0, 4.0, 4.0, 5.0, 3.0, 6.0, 11.0, 12.0, 28.0, 43.0, 46.0, 99.0, 177.0, 266.0, 466.0, 893.0, 1572.0, 2760.0, 5251.0, 10042.0, 19170.0, 38006.0, 79791.0, 183331.0, 1480555.0, 143425.0, 65365.0, 31411.0, 15986.0, 8430.0, 4518.0, 2381.0, 1328.0, 720.0, 376.0, 270.0, 131.0, 90.0, 52.0, 35.0, 32.0, 17.0, 9.0, 7.0, 4.0, 7.0, 2.0, 2.0, 4.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.1484375, -2.07769775390625, -2.0069580078125, -1.93621826171875, -1.865478515625, -1.79473876953125, -1.7239990234375, -1.65325927734375, -1.58251953125, -1.51177978515625, -1.4410400390625, -1.37030029296875, -1.299560546875, -1.22882080078125, -1.1580810546875, -1.08734130859375, -1.0166015625, -0.94586181640625, -0.8751220703125, -0.80438232421875, -0.733642578125, -0.66290283203125, -0.5921630859375, -0.52142333984375, -0.45068359375, -0.37994384765625, -0.3092041015625, -0.23846435546875, -0.167724609375, -0.09698486328125, -0.0262451171875, 0.04449462890625, 0.115234375, 0.18597412109375, 0.2567138671875, 0.32745361328125, 0.398193359375, 0.46893310546875, 0.5396728515625, 0.61041259765625, 0.68115234375, 0.75189208984375, 0.8226318359375, 0.89337158203125, 0.964111328125, 1.03485107421875, 1.1055908203125, 1.17633056640625, 1.2470703125, 1.31781005859375, 1.3885498046875, 1.45928955078125, 1.530029296875, 1.60076904296875, 1.6715087890625, 1.74224853515625, 1.81298828125, 1.88372802734375, 1.9544677734375, 2.02520751953125, 2.095947265625, 2.16668701171875, 2.2374267578125, 2.30816650390625, 2.37890625]}, "gradients/decoder.transformer.h.17.crossattention.q_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 3.0, 1.0, 2.0, 2.0, 1.0, 5.0, 3.0, 5.0, 5.0, 11.0, 4.0, 7.0, 12.0, 14.0, 10.0, 16.0, 16.0, 32.0, 29.0, 24.0, 36.0, 46.0, 58.0, 65.0, 65.0, 79.0, 62.0, 49.0, 67.0, 41.0, 32.0, 38.0, 28.0, 23.0, 22.0, 18.0, 18.0, 10.0, 13.0, 7.0, 10.0, 5.0, 3.0, 2.0, 3.0, 6.0, 3.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0009646415710449219, -0.0009372234344482422, -0.0009098052978515625, -0.0008823871612548828, -0.0008549690246582031, -0.0008275508880615234, -0.0008001327514648438, -0.0007727146148681641, -0.0007452964782714844, -0.0007178783416748047, -0.000690460205078125, -0.0006630420684814453, -0.0006356239318847656, -0.0006082057952880859, -0.0005807876586914062, -0.0005533695220947266, -0.0005259513854980469, -0.0004985332489013672, -0.0004711151123046875, -0.0004436969757080078, -0.0004162788391113281, -0.00038886070251464844, -0.00036144256591796875, -0.00033402442932128906, -0.0003066062927246094, -0.0002791881561279297, -0.00025177001953125, -0.0002243518829345703, -0.00019693374633789062, -0.00016951560974121094, -0.00014209747314453125, -0.00011467933654785156, -8.726119995117188e-05, -5.984306335449219e-05, -3.24249267578125e-05, -5.0067901611328125e-06, 2.2411346435546875e-05, 4.982948303222656e-05, 7.724761962890625e-05, 0.00010466575622558594, 0.00013208389282226562, 0.0001595020294189453, 0.000186920166015625, 0.0002143383026123047, 0.00024175643920898438, 0.00026917457580566406, 0.00029659271240234375, 0.00032401084899902344, 0.0003514289855957031, 0.0003788471221923828, 0.0004062652587890625, 0.0004336833953857422, 0.0004611015319824219, 0.0004885196685791016, 0.0005159378051757812, 0.0005433559417724609, 0.0005707740783691406, 0.0005981922149658203, 0.0006256103515625, 0.0006530284881591797, 0.0006804466247558594, 0.0007078647613525391, 0.0007352828979492188, 0.0007627010345458984, 0.0007901191711425781]}, "gradients/decoder.transformer.h.17.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 3.0, 4.0, 5.0, 3.0, 7.0, 2.0, 6.0, 8.0, 14.0, 22.0, 6.0, 22.0, 16.0, 38.0, 48.0, 54.0, 84.0, 141.0, 287.0, 568.0, 4240.0, 1034887.0, 6645.0, 657.0, 297.0, 155.0, 104.0, 55.0, 43.0, 24.0, 18.0, 22.0, 14.0, 16.0, 7.0, 9.0, 8.0, 1.0, 7.0, 2.0, 4.0, 2.0, 1.0, 1.0, 2.0, 1.0, 2.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 2.0], "bins": [-0.019256591796875, -0.018581867218017578, -0.017907142639160156, -0.017232418060302734, -0.016557693481445312, -0.01588296890258789, -0.015208244323730469, -0.014533519744873047, -0.013858795166015625, -0.013184070587158203, -0.012509346008300781, -0.01183462142944336, -0.011159896850585938, -0.010485172271728516, -0.009810447692871094, -0.009135723114013672, -0.00846099853515625, -0.007786273956298828, -0.007111549377441406, -0.006436824798583984, -0.0057621002197265625, -0.005087375640869141, -0.004412651062011719, -0.003737926483154297, -0.003063201904296875, -0.002388477325439453, -0.0017137527465820312, -0.0010390281677246094, -0.0003643035888671875, 0.0003104209899902344, 0.0009851455688476562, 0.0016598701477050781, 0.0023345947265625, 0.003009319305419922, 0.0036840438842773438, 0.004358768463134766, 0.0050334930419921875, 0.005708217620849609, 0.006382942199707031, 0.007057666778564453, 0.007732391357421875, 0.008407115936279297, 0.009081840515136719, 0.00975656509399414, 0.010431289672851562, 0.011106014251708984, 0.011780738830566406, 0.012455463409423828, 0.01313018798828125, 0.013804912567138672, 0.014479637145996094, 0.015154361724853516, 0.015829086303710938, 0.01650381088256836, 0.01717853546142578, 0.017853260040283203, 0.018527984619140625, 0.019202709197998047, 0.01987743377685547, 0.02055215835571289, 0.021226882934570312, 0.021901607513427734, 0.022576332092285156, 0.023251056671142578, 0.02392578125]}, "gradients/decoder.transformer.h.17.ln_cross_attn.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 4.0, 12.0, 27.0, 42.0, 99.0, 165.0, 195.0, 168.0, 135.0, 90.0, 42.0, 17.0, 12.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.0005094473017379642, -0.0004850604454986751, -0.00046067358925938606, -0.000436286733020097, -0.00041189987678080797, -0.0003875130205415189, -0.0003631261351983994, -0.0003387392789591104, -0.00031435242271982133, -0.0002899655664805323, -0.00026557871024124324, -0.00024119183945003897, -0.00021680498321074992, -0.00019241812697146088, -0.0001680312561802566, -0.00014364439994096756, -0.00011925754370167851, -9.487068746238947e-05, -7.048382394714281e-05, -4.609696043189615e-05, -2.1710104192607105e-05, 2.6767520466819406e-06, 2.7063622837886214e-05, 5.145047907717526e-05, 7.58373353164643e-05, 0.00010022419155575335, 0.0001246110477950424, 0.00014899791858624667, 0.00017338477482553571, 0.00019777163106482476, 0.00022215850185602903, 0.0002465453580953181, 0.0002709322143346071, 0.00029531907057389617, 0.0003197059268131852, 0.00034409278305247426, 0.0003684796392917633, 0.00039286649553105235, 0.00041725338087417185, 0.0004416402371134609, 0.00046602709335274994, 0.0004904139786958694, 0.0005148008349351585, 0.0005391876911744475, 0.0005635745474137366, 0.0005879614036530256, 0.0006123482598923147, 0.0006367351161316037, 0.0006611219723708928, 0.0006855088286101818, 0.0007098956848494709, 0.0007342825410887599, 0.0007586693973280489, 0.000783056253567338, 0.000807443168014288, 0.0008318299660459161, 0.000856216880492866, 0.0008806037367321551, 0.0009049905929714441, 0.0009293774492107332, 0.0009537643054500222, 0.0009781512198969722, 0.0010025380179286003, 0.0010269249323755503, 0.0010513117304071784]}, "gradients/decoder.transformer.h.17.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 3.0, 3.0, 3.0, 5.0, 2.0, 4.0, 9.0, 10.0, 14.0, 11.0, 11.0, 12.0, 13.0, 20.0, 25.0, 23.0, 29.0, 29.0, 35.0, 37.0, 35.0, 39.0, 51.0, 42.0, 42.0, 47.0, 37.0, 50.0, 51.0, 36.0, 23.0, 36.0, 36.0, 29.0, 31.0, 25.0, 15.0, 17.0, 10.0, 12.0, 10.0, 11.0, 13.0, 8.0, 3.0, 3.0, 1.0, 1.0, 4.0, 1.0, 1.0], "bins": [-0.0005670785903930664, -0.0005522938445210457, -0.000537509098649025, -0.0005227243527770042, -0.0005079396069049835, -0.0004931548610329628, -0.0004783701151609421, -0.00046358536928892136, -0.00044880062341690063, -0.0004340158775448799, -0.0004192311316728592, -0.00040444638580083847, -0.00038966163992881775, -0.00037487689405679703, -0.0003600921481847763, -0.0003453074023127556, -0.00033052265644073486, -0.00031573791056871414, -0.0003009531646966934, -0.0002861684188246727, -0.000271383672952652, -0.00025659892708063126, -0.00024181418120861053, -0.0002270294353365898, -0.0002122446894645691, -0.00019745994359254837, -0.00018267519772052765, -0.00016789045184850693, -0.0001531057059764862, -0.00013832096010446548, -0.00012353621423244476, -0.00010875146836042404, -9.396672248840332e-05, -7.91819766163826e-05, -6.439723074436188e-05, -4.9612484872341156e-05, -3.4827739000320435e-05, -2.0042993128299713e-05, -5.258247256278992e-06, 9.52649861574173e-06, 2.431124448776245e-05, 3.909599035978317e-05, 5.3880736231803894e-05, 6.866548210382462e-05, 8.345022797584534e-05, 9.823497384786606e-05, 0.00011301971971988678, 0.0001278044655919075, 0.00014258921146392822, 0.00015737395733594894, 0.00017215870320796967, 0.0001869434490799904, 0.0002017281949520111, 0.00021651294082403183, 0.00023129768669605255, 0.00024608243256807327, 0.000260867178440094, 0.0002756519243121147, 0.00029043667018413544, 0.00030522141605615616, 0.0003200061619281769, 0.0003347909078001976, 0.0003495756536722183, 0.00036436039954423904, 0.00037914514541625977]}, "gradients/decoder.transformer.h.17.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 2.0, 2.0, 0.0, 0.0, 3.0, 4.0, 4.0, 4.0, 4.0, 3.0, 5.0, 8.0, 8.0, 20.0, 11.0, 14.0, 15.0, 18.0, 28.0, 25.0, 33.0, 33.0, 38.0, 34.0, 50.0, 34.0, 37.0, 44.0, 47.0, 47.0, 52.0, 39.0, 53.0, 42.0, 28.0, 31.0, 30.0, 25.0, 25.0, 12.0, 21.0, 8.0, 12.0, 16.0, 14.0, 11.0, 6.0, 5.0, 5.0, 2.0, 3.0, 1.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-6.16796875, -5.971435546875, -5.77490234375, -5.578369140625, -5.3818359375, -5.185302734375, -4.98876953125, -4.792236328125, -4.595703125, -4.399169921875, -4.20263671875, -4.006103515625, -3.8095703125, -3.613037109375, -3.41650390625, -3.219970703125, -3.0234375, -2.826904296875, -2.63037109375, -2.433837890625, -2.2373046875, -2.040771484375, -1.84423828125, -1.647705078125, -1.451171875, -1.254638671875, -1.05810546875, -0.861572265625, -0.6650390625, -0.468505859375, -0.27197265625, -0.075439453125, 0.12109375, 0.317626953125, 0.51416015625, 0.710693359375, 0.9072265625, 1.103759765625, 1.30029296875, 1.496826171875, 1.693359375, 1.889892578125, 2.08642578125, 2.282958984375, 2.4794921875, 2.676025390625, 2.87255859375, 3.069091796875, 3.265625, 3.462158203125, 3.65869140625, 3.855224609375, 4.0517578125, 4.248291015625, 4.44482421875, 4.641357421875, 4.837890625, 5.034423828125, 5.23095703125, 5.427490234375, 5.6240234375, 5.820556640625, 6.01708984375, 6.213623046875, 6.41015625]}, "gradients/decoder.transformer.h.17.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 6.0, 3.0, 5.0, 3.0, 15.0, 12.0, 14.0, 27.0, 33.0, 57.0, 62.0, 117.0, 129.0, 296.0, 498.0, 987.0, 1962.0, 4580.0, 10094.0, 24673.0, 65858.0, 216465.0, 456626.0, 173252.0, 55471.0, 20821.0, 8849.0, 3880.0, 1738.0, 890.0, 456.0, 240.0, 134.0, 107.0, 63.0, 34.0, 29.0, 20.0, 18.0, 13.0, 6.0, 8.0, 1.0, 3.0, 3.0, 4.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-4.59375, -4.4520263671875, -4.310302734375, -4.1685791015625, -4.02685546875, -3.8851318359375, -3.743408203125, -3.6016845703125, -3.4599609375, -3.3182373046875, -3.176513671875, -3.0347900390625, -2.89306640625, -2.7513427734375, -2.609619140625, -2.4678955078125, -2.326171875, -2.1844482421875, -2.042724609375, -1.9010009765625, -1.75927734375, -1.6175537109375, -1.475830078125, -1.3341064453125, -1.1923828125, -1.0506591796875, -0.908935546875, -0.7672119140625, -0.62548828125, -0.4837646484375, -0.342041015625, -0.2003173828125, -0.05859375, 0.0831298828125, 0.224853515625, 0.3665771484375, 0.50830078125, 0.6500244140625, 0.791748046875, 0.9334716796875, 1.0751953125, 1.2169189453125, 1.358642578125, 1.5003662109375, 1.64208984375, 1.7838134765625, 1.925537109375, 2.0672607421875, 2.208984375, 2.3507080078125, 2.492431640625, 2.6341552734375, 2.77587890625, 2.9176025390625, 3.059326171875, 3.2010498046875, 3.3427734375, 3.4844970703125, 3.626220703125, 3.7679443359375, 3.90966796875, 4.0513916015625, 4.193115234375, 4.3348388671875, 4.4765625]}, "gradients/decoder.transformer.h.17.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 3.0, 3.0, 4.0, 0.0, 9.0, 5.0, 5.0, 9.0, 11.0, 12.0, 16.0, 30.0, 28.0, 25.0, 30.0, 30.0, 40.0, 37.0, 43.0, 46.0, 77.0, 133.0, 1783.0, 211.0, 72.0, 37.0, 59.0, 46.0, 46.0, 35.0, 18.0, 30.0, 27.0, 27.0, 13.0, 6.0, 12.0, 8.0, 11.0, 4.0, 8.0, 2.0, 1.0, 5.0, 4.0, 3.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-20.65625, -19.981201171875, -19.30615234375, -18.631103515625, -17.9560546875, -17.281005859375, -16.60595703125, -15.930908203125, -15.255859375, -14.580810546875, -13.90576171875, -13.230712890625, -12.5556640625, -11.880615234375, -11.20556640625, -10.530517578125, -9.85546875, -9.180419921875, -8.50537109375, -7.830322265625, -7.1552734375, -6.480224609375, -5.80517578125, -5.130126953125, -4.455078125, -3.780029296875, -3.10498046875, -2.429931640625, -1.7548828125, -1.079833984375, -0.40478515625, 0.270263671875, 0.9453125, 1.620361328125, 2.29541015625, 2.970458984375, 3.6455078125, 4.320556640625, 4.99560546875, 5.670654296875, 6.345703125, 7.020751953125, 7.69580078125, 8.370849609375, 9.0458984375, 9.720947265625, 10.39599609375, 11.071044921875, 11.74609375, 12.421142578125, 13.09619140625, 13.771240234375, 14.4462890625, 15.121337890625, 15.79638671875, 16.471435546875, 17.146484375, 17.821533203125, 18.49658203125, 19.171630859375, 19.8466796875, 20.521728515625, 21.19677734375, 21.871826171875, 22.546875]}, "gradients/decoder.transformer.h.17.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 5.0, 5.0, 2.0, 4.0, 6.0, 6.0, 14.0, 10.0, 13.0, 15.0, 19.0, 34.0, 43.0, 47.0, 57.0, 94.0, 118.0, 186.0, 242.0, 424.0, 1593.0, 71597.0, 3061876.0, 7414.0, 766.0, 350.0, 196.0, 144.0, 109.0, 79.0, 47.0, 43.0, 39.0, 29.0, 26.0, 13.0, 14.0, 9.0, 5.0, 6.0, 7.0, 4.0, 3.0, 2.0, 3.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-45.40625, -44.0419921875, -42.677734375, -41.3134765625, -39.94921875, -38.5849609375, -37.220703125, -35.8564453125, -34.4921875, -33.1279296875, -31.763671875, -30.3994140625, -29.03515625, -27.6708984375, -26.306640625, -24.9423828125, -23.578125, -22.2138671875, -20.849609375, -19.4853515625, -18.12109375, -16.7568359375, -15.392578125, -14.0283203125, -12.6640625, -11.2998046875, -9.935546875, -8.5712890625, -7.20703125, -5.8427734375, -4.478515625, -3.1142578125, -1.75, -0.3857421875, 0.978515625, 2.3427734375, 3.70703125, 5.0712890625, 6.435546875, 7.7998046875, 9.1640625, 10.5283203125, 11.892578125, 13.2568359375, 14.62109375, 15.9853515625, 17.349609375, 18.7138671875, 20.078125, 21.4423828125, 22.806640625, 24.1708984375, 25.53515625, 26.8994140625, 28.263671875, 29.6279296875, 30.9921875, 32.3564453125, 33.720703125, 35.0849609375, 36.44921875, 37.8134765625, 39.177734375, 40.5419921875, 41.90625]}, "gradients/decoder.transformer.h.17.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 13.0, 82.0, 339.0, 450.0, 115.0, 17.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-96.70613861083984, -94.51325988769531, -92.32038116455078, -90.12750244140625, -87.93463134765625, -85.74175262451172, -83.54887390136719, -81.35599517822266, -79.16311645507812, -76.9702377319336, -74.77735900878906, -72.58448791503906, -70.39160919189453, -68.19873046875, -66.00585174560547, -63.81297302246094, -61.62009811401367, -59.42721939086914, -57.234344482421875, -55.041465759277344, -52.84858703613281, -50.65570831298828, -48.462833404541016, -46.269954681396484, -44.07707977294922, -41.88420104980469, -39.69132614135742, -37.49844741821289, -35.30556869506836, -33.112693786621094, -30.919815063476562, -28.72693634033203, -26.534053802490234, -24.341176986694336, -22.148298263549805, -19.955421447753906, -17.762542724609375, -15.569665908813477, -13.376789093017578, -11.183911323547363, -8.991033554077148, -6.798155784606934, -4.605278491973877, -2.4124011993408203, -0.21952342987060547, 1.9733543395996094, 4.166231155395508, 6.359108924865723, 8.551986694335938, 10.744864463806152, 12.937742233276367, 15.130619049072266, 17.323497772216797, 19.516374588012695, 21.709251403808594, 23.902130126953125, 26.095006942749023, 28.287883758544922, 30.480762481689453, 32.67363739013672, 34.86651611328125, 37.05939483642578, 39.25227355957031, 41.44514846801758, 43.63802719116211]}, "gradients/decoder.transformer.h.17.ln_1.bias": {"_type": "histogram", "values": [2.0, 0.0, 3.0, 5.0, 4.0, 2.0, 2.0, 7.0, 9.0, 2.0, 6.0, 11.0, 7.0, 22.0, 6.0, 17.0, 19.0, 17.0, 20.0, 17.0, 22.0, 29.0, 28.0, 22.0, 32.0, 31.0, 29.0, 31.0, 37.0, 29.0, 50.0, 38.0, 29.0, 28.0, 40.0, 33.0, 45.0, 31.0, 33.0, 23.0, 25.0, 28.0, 19.0, 17.0, 18.0, 12.0, 11.0, 13.0, 11.0, 11.0, 6.0, 6.0, 4.0, 5.0, 9.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 2.0], "bins": [-40.42657470703125, -39.086795806884766, -37.74701690673828, -36.40723419189453, -35.06745529174805, -33.72767639160156, -32.38789367675781, -31.048114776611328, -29.708335876464844, -28.36855697631836, -27.028776168823242, -25.688995361328125, -24.34921646118164, -23.009437561035156, -21.66965675354004, -20.329875946044922, -18.990097045898438, -17.650318145751953, -16.310537338256836, -14.970757484436035, -13.630977630615234, -12.291197776794434, -10.951417922973633, -9.611638069152832, -8.271858215332031, -6.9320783615112305, -5.59229850769043, -4.252518653869629, -2.912738800048828, -1.5729589462280273, -0.23317909240722656, 1.1066007614135742, 2.446380615234375, 3.786160469055176, 5.125940322875977, 6.465720176696777, 7.805500030517578, 9.145279884338379, 10.48505973815918, 11.82483959197998, 13.164619445800781, 14.504399299621582, 15.844179153442383, 17.1839599609375, 18.523738861083984, 19.86351776123047, 21.203298568725586, 22.543079376220703, 23.882858276367188, 25.222637176513672, 26.56241798400879, 27.902198791503906, 29.24197769165039, 30.581756591796875, 31.921537399291992, 33.26131820678711, 34.601097106933594, 35.94087600708008, 37.28065490722656, 38.62043762207031, 39.9602165222168, 41.29999542236328, 42.63977813720703, 43.979557037353516, 45.3193359375]}, "gradients/decoder.transformer.h.16.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 5.0, 3.0, 3.0, 2.0, 7.0, 7.0, 10.0, 15.0, 15.0, 13.0, 17.0, 22.0, 25.0, 25.0, 29.0, 34.0, 30.0, 39.0, 32.0, 46.0, 45.0, 38.0, 46.0, 53.0, 47.0, 48.0, 49.0, 35.0, 35.0, 31.0, 30.0, 24.0, 31.0, 20.0, 14.0, 12.0, 10.0, 10.0, 21.0, 9.0, 4.0, 5.0, 5.0, 5.0, 1.0, 0.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-6.15625, -5.955078125, -5.75390625, -5.552734375, -5.3515625, -5.150390625, -4.94921875, -4.748046875, -4.546875, -4.345703125, -4.14453125, -3.943359375, -3.7421875, -3.541015625, -3.33984375, -3.138671875, -2.9375, -2.736328125, -2.53515625, -2.333984375, -2.1328125, -1.931640625, -1.73046875, -1.529296875, -1.328125, -1.126953125, -0.92578125, -0.724609375, -0.5234375, -0.322265625, -0.12109375, 0.080078125, 0.28125, 0.482421875, 0.68359375, 0.884765625, 1.0859375, 1.287109375, 1.48828125, 1.689453125, 1.890625, 2.091796875, 2.29296875, 2.494140625, 2.6953125, 2.896484375, 3.09765625, 3.298828125, 3.5, 3.701171875, 3.90234375, 4.103515625, 4.3046875, 4.505859375, 4.70703125, 4.908203125, 5.109375, 5.310546875, 5.51171875, 5.712890625, 5.9140625, 6.115234375, 6.31640625, 6.517578125, 6.71875]}, "gradients/decoder.transformer.h.16.mlp.c_proj.weight": {"_type": "histogram", "values": [2.0, 4.0, 1.0, 0.0, 2.0, 2.0, 0.0, 6.0, 3.0, 4.0, 5.0, 10.0, 8.0, 9.0, 11.0, 13.0, 19.0, 23.0, 14.0, 27.0, 36.0, 28.0, 49.0, 53.0, 86.0, 159.0, 314.0, 2603.0, 447856.0, 3716851.0, 24789.0, 610.0, 210.0, 125.0, 78.0, 50.0, 32.0, 34.0, 30.0, 20.0, 20.0, 11.0, 11.0, 18.0, 10.0, 16.0, 10.0, 5.0, 8.0, 6.0, 3.0, 2.0, 2.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-33.0, -31.87255859375, -30.7451171875, -29.61767578125, -28.490234375, -27.36279296875, -26.2353515625, -25.10791015625, -23.98046875, -22.85302734375, -21.7255859375, -20.59814453125, -19.470703125, -18.34326171875, -17.2158203125, -16.08837890625, -14.9609375, -13.83349609375, -12.7060546875, -11.57861328125, -10.451171875, -9.32373046875, -8.1962890625, -7.06884765625, -5.94140625, -4.81396484375, -3.6865234375, -2.55908203125, -1.431640625, -0.30419921875, 0.8232421875, 1.95068359375, 3.078125, 4.20556640625, 5.3330078125, 6.46044921875, 7.587890625, 8.71533203125, 9.8427734375, 10.97021484375, 12.09765625, 13.22509765625, 14.3525390625, 15.47998046875, 16.607421875, 17.73486328125, 18.8623046875, 19.98974609375, 21.1171875, 22.24462890625, 23.3720703125, 24.49951171875, 25.626953125, 26.75439453125, 27.8818359375, 29.00927734375, 30.13671875, 31.26416015625, 32.3916015625, 33.51904296875, 34.646484375, 35.77392578125, 36.9013671875, 38.02880859375, 39.15625]}, "gradients/decoder.transformer.h.16.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 3.0, 3.0, 4.0, 5.0, 14.0, 14.0, 23.0, 39.0, 55.0, 95.0, 144.0, 207.0, 292.0, 469.0, 595.0, 589.0, 499.0, 350.0, 234.0, 145.0, 103.0, 74.0, 49.0, 23.0, 24.0, 12.0, 9.0, 5.0, 3.0, 2.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-16.5625, -15.951904296875, -15.34130859375, -14.730712890625, -14.1201171875, -13.509521484375, -12.89892578125, -12.288330078125, -11.677734375, -11.067138671875, -10.45654296875, -9.845947265625, -9.2353515625, -8.624755859375, -8.01416015625, -7.403564453125, -6.79296875, -6.182373046875, -5.57177734375, -4.961181640625, -4.3505859375, -3.739990234375, -3.12939453125, -2.518798828125, -1.908203125, -1.297607421875, -0.68701171875, -0.076416015625, 0.5341796875, 1.144775390625, 1.75537109375, 2.365966796875, 2.9765625, 3.587158203125, 4.19775390625, 4.808349609375, 5.4189453125, 6.029541015625, 6.64013671875, 7.250732421875, 7.861328125, 8.471923828125, 9.08251953125, 9.693115234375, 10.3037109375, 10.914306640625, 11.52490234375, 12.135498046875, 12.74609375, 13.356689453125, 13.96728515625, 14.577880859375, 15.1884765625, 15.799072265625, 16.40966796875, 17.020263671875, 17.630859375, 18.241455078125, 18.85205078125, 19.462646484375, 20.0732421875, 20.683837890625, 21.29443359375, 21.905029296875, 22.515625]}, "gradients/decoder.transformer.h.16.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 4.0, 2.0, 2.0, 3.0, 3.0, 11.0, 13.0, 15.0, 14.0, 26.0, 28.0, 45.0, 50.0, 58.0, 91.0, 91.0, 145.0, 212.0, 352.0, 747.0, 2256.0, 253842.0, 3925162.0, 8556.0, 1070.0, 473.0, 295.0, 204.0, 113.0, 88.0, 69.0, 60.0, 47.0, 34.0, 25.0, 18.0, 11.0, 10.0, 13.0, 7.0, 8.0, 4.0, 5.0, 0.0, 3.0, 4.0, 3.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0], "bins": [-68.6875, -66.556640625, -64.42578125, -62.294921875, -60.1640625, -58.033203125, -55.90234375, -53.771484375, -51.640625, -49.509765625, -47.37890625, -45.248046875, -43.1171875, -40.986328125, -38.85546875, -36.724609375, -34.59375, -32.462890625, -30.33203125, -28.201171875, -26.0703125, -23.939453125, -21.80859375, -19.677734375, -17.546875, -15.416015625, -13.28515625, -11.154296875, -9.0234375, -6.892578125, -4.76171875, -2.630859375, -0.5, 1.630859375, 3.76171875, 5.892578125, 8.0234375, 10.154296875, 12.28515625, 14.416015625, 16.546875, 18.677734375, 20.80859375, 22.939453125, 25.0703125, 27.201171875, 29.33203125, 31.462890625, 33.59375, 35.724609375, 37.85546875, 39.986328125, 42.1171875, 44.248046875, 46.37890625, 48.509765625, 50.640625, 52.771484375, 54.90234375, 57.033203125, 59.1640625, 61.294921875, 63.42578125, 65.556640625, 67.6875]}, "gradients/decoder.transformer.h.16.ln_2.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 5.0, 8.0, 75.0, 388.0, 407.0, 122.0, 14.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-64.16033172607422, -56.15629196166992, -48.152252197265625, -40.14820861816406, -32.144168853759766, -24.14012908935547, -16.136085510253906, -8.13204574584961, -0.1280059814453125, 7.876034736633301, 15.880075454711914, 23.884117126464844, 31.88815689086914, 39.89219665527344, 47.896240234375, 55.9002799987793, 63.904319763183594, 71.90836334228516, 79.91239929199219, 87.91644287109375, 95.92048645019531, 103.92452239990234, 111.9285659790039, 119.93260192871094, 127.9366455078125, 135.94068908691406, 143.94473266601562, 151.94876098632812, 159.9528045654297, 167.95684814453125, 175.9608917236328, 183.96493530273438, 191.96896362304688, 199.97300720214844, 207.97705078125, 215.9810791015625, 223.98512268066406, 231.98916625976562, 239.9932098388672, 247.99725341796875, 256.00128173828125, 264.00531005859375, 272.0093688964844, 280.0133972167969, 288.0174560546875, 296.021484375, 304.0255126953125, 312.0295715332031, 320.03363037109375, 328.03765869140625, 336.0417175292969, 344.0457458496094, 352.0498046875, 360.0538330078125, 368.057861328125, 376.0619201660156, 384.0659484863281, 392.0699768066406, 400.07403564453125, 408.07806396484375, 416.0821228027344, 424.0861511230469, 432.0902099609375, 440.09423828125, 448.0982666015625]}, "gradients/decoder.transformer.h.16.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 3.0, 2.0, 4.0, 2.0, 9.0, 8.0, 1.0, 5.0, 11.0, 9.0, 10.0, 14.0, 27.0, 21.0, 23.0, 14.0, 29.0, 36.0, 32.0, 51.0, 40.0, 57.0, 38.0, 38.0, 34.0, 41.0, 34.0, 40.0, 47.0, 36.0, 51.0, 37.0, 29.0, 22.0, 29.0, 19.0, 18.0, 12.0, 20.0, 13.0, 10.0, 10.0, 8.0, 2.0, 3.0, 8.0, 2.0, 2.0, 1.0, 4.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0], "bins": [-50.43025207519531, -48.888065338134766, -47.34587860107422, -45.80369186401367, -44.261505126953125, -42.719322204589844, -41.1771354675293, -39.63494873046875, -38.0927619934082, -36.550575256347656, -35.00838851928711, -33.46620178222656, -31.92401695251465, -30.3818302154541, -28.839645385742188, -27.29745864868164, -25.755271911621094, -24.213085174560547, -22.6708984375, -21.128713607788086, -19.58652687072754, -18.044340133666992, -16.502155303955078, -14.959968566894531, -13.417781829833984, -11.875595092773438, -10.333409309387207, -8.791223526000977, -7.24903678894043, -5.706850528717041, -4.164664268493652, -2.622478485107422, -1.080291748046875, 0.46189451217651367, 2.0040807723999023, 3.546267032623291, 5.08845329284668, 6.630639553070068, 8.172825813293457, 9.715011596679688, 11.257198333740234, 12.799385070800781, 14.341570854187012, 15.883756637573242, 17.42594337463379, 18.968130111694336, 20.51031494140625, 22.052501678466797, 23.594688415527344, 25.13687515258789, 26.679061889648438, 28.22124671936035, 29.7634334564209, 31.305620193481445, 32.84780502319336, 34.389991760253906, 35.93217849731445, 37.474365234375, 39.01655197143555, 40.558738708496094, 42.100921630859375, 43.64310836791992, 45.18529510498047, 46.727481842041016, 48.26966857910156]}, "gradients/decoder.transformer.h.16.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 3.0, 2.0, 2.0, 4.0, 3.0, 3.0, 5.0, 8.0, 8.0, 10.0, 9.0, 16.0, 11.0, 14.0, 25.0, 27.0, 31.0, 32.0, 38.0, 29.0, 35.0, 33.0, 37.0, 39.0, 43.0, 46.0, 38.0, 52.0, 37.0, 49.0, 47.0, 29.0, 34.0, 33.0, 28.0, 20.0, 28.0, 19.0, 12.0, 10.0, 13.0, 14.0, 12.0, 4.0, 6.0, 3.0, 3.0, 4.0, 3.0, 2.0, 3.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 2.0], "bins": [-6.28125, -6.08349609375, -5.8857421875, -5.68798828125, -5.490234375, -5.29248046875, -5.0947265625, -4.89697265625, -4.69921875, -4.50146484375, -4.3037109375, -4.10595703125, -3.908203125, -3.71044921875, -3.5126953125, -3.31494140625, -3.1171875, -2.91943359375, -2.7216796875, -2.52392578125, -2.326171875, -2.12841796875, -1.9306640625, -1.73291015625, -1.53515625, -1.33740234375, -1.1396484375, -0.94189453125, -0.744140625, -0.54638671875, -0.3486328125, -0.15087890625, 0.046875, 0.24462890625, 0.4423828125, 0.64013671875, 0.837890625, 1.03564453125, 1.2333984375, 1.43115234375, 1.62890625, 1.82666015625, 2.0244140625, 2.22216796875, 2.419921875, 2.61767578125, 2.8154296875, 3.01318359375, 3.2109375, 3.40869140625, 3.6064453125, 3.80419921875, 4.001953125, 4.19970703125, 4.3974609375, 4.59521484375, 4.79296875, 4.99072265625, 5.1884765625, 5.38623046875, 5.583984375, 5.78173828125, 5.9794921875, 6.17724609375, 6.375]}, "gradients/decoder.transformer.h.16.crossattention.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 5.0, 6.0, 5.0, 7.0, 2.0, 19.0, 23.0, 27.0, 42.0, 70.0, 98.0, 158.0, 230.0, 384.0, 618.0, 1011.0, 1665.0, 2776.0, 4754.0, 8055.0, 14368.0, 26138.0, 48659.0, 96855.0, 245774.0, 353390.0, 116390.0, 57344.0, 30414.0, 16524.0, 9358.0, 5346.0, 3111.0, 1909.0, 1140.0, 669.0, 439.0, 280.0, 155.0, 125.0, 71.0, 43.0, 34.0, 25.0, 18.0, 9.0, 10.0, 5.0, 2.0, 4.0, 4.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-2.064453125, -2.00018310546875, -1.9359130859375, -1.87164306640625, -1.807373046875, -1.74310302734375, -1.6788330078125, -1.61456298828125, -1.55029296875, -1.48602294921875, -1.4217529296875, -1.35748291015625, -1.293212890625, -1.22894287109375, -1.1646728515625, -1.10040283203125, -1.0361328125, -0.97186279296875, -0.9075927734375, -0.84332275390625, -0.779052734375, -0.71478271484375, -0.6505126953125, -0.58624267578125, -0.52197265625, -0.45770263671875, -0.3934326171875, -0.32916259765625, -0.264892578125, -0.20062255859375, -0.1363525390625, -0.07208251953125, -0.0078125, 0.05645751953125, 0.1207275390625, 0.18499755859375, 0.249267578125, 0.31353759765625, 0.3778076171875, 0.44207763671875, 0.50634765625, 0.57061767578125, 0.6348876953125, 0.69915771484375, 0.763427734375, 0.82769775390625, 0.8919677734375, 0.95623779296875, 1.0205078125, 1.08477783203125, 1.1490478515625, 1.21331787109375, 1.277587890625, 1.34185791015625, 1.4061279296875, 1.47039794921875, 1.53466796875, 1.59893798828125, 1.6632080078125, 1.72747802734375, 1.791748046875, 1.85601806640625, 1.9202880859375, 1.98455810546875, 2.048828125]}, "gradients/decoder.transformer.h.16.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 2.0, 5.0, 6.0, 3.0, 3.0, 5.0, 10.0, 7.0, 7.0, 13.0, 13.0, 21.0, 27.0, 22.0, 31.0, 27.0, 39.0, 50.0, 42.0, 33.0, 52.0, 42.0, 1078.0, 49.0, 60.0, 44.0, 48.0, 40.0, 28.0, 41.0, 36.0, 16.0, 20.0, 21.0, 16.0, 18.0, 14.0, 8.0, 5.0, 8.0, 7.0, 4.0, 7.0, 6.0, 3.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.04296875, -3.90228271484375, -3.7615966796875, -3.62091064453125, -3.480224609375, -3.33953857421875, -3.1988525390625, -3.05816650390625, -2.91748046875, -2.77679443359375, -2.6361083984375, -2.49542236328125, -2.354736328125, -2.21405029296875, -2.0733642578125, -1.93267822265625, -1.7919921875, -1.65130615234375, -1.5106201171875, -1.36993408203125, -1.229248046875, -1.08856201171875, -0.9478759765625, -0.80718994140625, -0.66650390625, -0.52581787109375, -0.3851318359375, -0.24444580078125, -0.103759765625, 0.03692626953125, 0.1776123046875, 0.31829833984375, 0.458984375, 0.59967041015625, 0.7403564453125, 0.88104248046875, 1.021728515625, 1.16241455078125, 1.3031005859375, 1.44378662109375, 1.58447265625, 1.72515869140625, 1.8658447265625, 2.00653076171875, 2.147216796875, 2.28790283203125, 2.4285888671875, 2.56927490234375, 2.7099609375, 2.85064697265625, 2.9913330078125, 3.13201904296875, 3.272705078125, 3.41339111328125, 3.5540771484375, 3.69476318359375, 3.83544921875, 3.97613525390625, 4.1168212890625, 4.25750732421875, 4.398193359375, 4.53887939453125, 4.6795654296875, 4.82025146484375, 4.9609375]}, "gradients/decoder.transformer.h.16.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 0.0, 2.0, 2.0, 3.0, 6.0, 7.0, 14.0, 10.0, 17.0, 21.0, 33.0, 94.0, 111.0, 159.0, 313.0, 537.0, 867.0, 1687.0, 2885.0, 5450.0, 10330.0, 20879.0, 44616.0, 104598.0, 1420678.0, 305494.0, 95880.0, 41387.0, 19595.0, 9819.0, 5089.0, 2829.0, 1575.0, 852.0, 547.0, 297.0, 153.0, 103.0, 91.0, 29.0, 25.0, 17.0, 12.0, 7.0, 8.0, 5.0, 7.0, 1.0, 0.0, 1.0, 2.0, 1.0, 0.0, 1.0], "bins": [-2.705078125, -2.627685546875, -2.55029296875, -2.472900390625, -2.3955078125, -2.318115234375, -2.24072265625, -2.163330078125, -2.0859375, -2.008544921875, -1.93115234375, -1.853759765625, -1.7763671875, -1.698974609375, -1.62158203125, -1.544189453125, -1.466796875, -1.389404296875, -1.31201171875, -1.234619140625, -1.1572265625, -1.079833984375, -1.00244140625, -0.925048828125, -0.84765625, -0.770263671875, -0.69287109375, -0.615478515625, -0.5380859375, -0.460693359375, -0.38330078125, -0.305908203125, -0.228515625, -0.151123046875, -0.07373046875, 0.003662109375, 0.0810546875, 0.158447265625, 0.23583984375, 0.313232421875, 0.390625, 0.468017578125, 0.54541015625, 0.622802734375, 0.7001953125, 0.777587890625, 0.85498046875, 0.932373046875, 1.009765625, 1.087158203125, 1.16455078125, 1.241943359375, 1.3193359375, 1.396728515625, 1.47412109375, 1.551513671875, 1.62890625, 1.706298828125, 1.78369140625, 1.861083984375, 1.9384765625, 2.015869140625, 2.09326171875, 2.170654296875, 2.248046875]}, "gradients/decoder.transformer.h.16.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 4.0, 0.0, 3.0, 3.0, 3.0, 0.0, 6.0, 3.0, 2.0, 8.0, 6.0, 8.0, 8.0, 10.0, 17.0, 11.0, 25.0, 45.0, 48.0, 86.0, 90.0, 118.0, 121.0, 104.0, 67.0, 45.0, 35.0, 33.0, 14.0, 22.0, 17.0, 15.0, 6.0, 8.0, 7.0, 5.0, 4.0, 5.0, 0.0, 2.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.0016431808471679688, -0.0015916526317596436, -0.0015401244163513184, -0.0014885962009429932, -0.001437067985534668, -0.0013855397701263428, -0.0013340115547180176, -0.0012824833393096924, -0.0012309551239013672, -0.001179426908493042, -0.0011278986930847168, -0.0010763704776763916, -0.0010248422622680664, -0.0009733140468597412, -0.000921785831451416, -0.0008702576160430908, -0.0008187294006347656, -0.0007672011852264404, -0.0007156729698181152, -0.00066414475440979, -0.0006126165390014648, -0.0005610883235931396, -0.0005095601081848145, -0.00045803189277648926, -0.00040650367736816406, -0.00035497546195983887, -0.00030344724655151367, -0.0002519190311431885, -0.00020039081573486328, -0.00014886260032653809, -9.733438491821289e-05, -4.5806169509887695e-05, 5.7220458984375e-06, 5.7250261306762695e-05, 0.00010877847671508789, 0.00016030669212341309, 0.00021183490753173828, 0.0002633631229400635, 0.00031489133834838867, 0.00036641955375671387, 0.00041794776916503906, 0.00046947598457336426, 0.0005210041999816895, 0.0005725324153900146, 0.0006240606307983398, 0.000675588846206665, 0.0007271170616149902, 0.0007786452770233154, 0.0008301734924316406, 0.0008817017078399658, 0.000933229923248291, 0.0009847581386566162, 0.0010362863540649414, 0.0010878145694732666, 0.0011393427848815918, 0.001190871000289917, 0.0012423992156982422, 0.0012939274311065674, 0.0013454556465148926, 0.0013969838619232178, 0.001448512077331543, 0.0015000402927398682, 0.0015515685081481934, 0.0016030967235565186, 0.0016546249389648438]}, "gradients/decoder.transformer.h.16.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 2.0, 2.0, 0.0, 5.0, 2.0, 4.0, 10.0, 6.0, 9.0, 6.0, 22.0, 16.0, 25.0, 42.0, 65.0, 90.0, 140.0, 281.0, 776.0, 165961.0, 879346.0, 949.0, 347.0, 163.0, 88.0, 56.0, 43.0, 29.0, 18.0, 13.0, 11.0, 6.0, 8.0, 2.0, 2.0, 4.0, 2.0, 1.0, 1.0, 4.0, 2.0, 3.0, 3.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.035308837890625, -0.034207820892333984, -0.03310680389404297, -0.03200578689575195, -0.030904769897460938, -0.029803752899169922, -0.028702735900878906, -0.02760171890258789, -0.026500701904296875, -0.02539968490600586, -0.024298667907714844, -0.023197650909423828, -0.022096633911132812, -0.020995616912841797, -0.01989459991455078, -0.018793582916259766, -0.01769256591796875, -0.016591548919677734, -0.015490531921386719, -0.014389514923095703, -0.013288497924804688, -0.012187480926513672, -0.011086463928222656, -0.00998544692993164, -0.008884429931640625, -0.007783412933349609, -0.006682395935058594, -0.005581378936767578, -0.0044803619384765625, -0.003379344940185547, -0.0022783279418945312, -0.0011773109436035156, -7.62939453125e-05, 0.0010247230529785156, 0.0021257400512695312, 0.003226757049560547, 0.0043277740478515625, 0.005428791046142578, 0.006529808044433594, 0.007630825042724609, 0.008731842041015625, 0.00983285903930664, 0.010933876037597656, 0.012034893035888672, 0.013135910034179688, 0.014236927032470703, 0.015337944030761719, 0.016438961029052734, 0.01753997802734375, 0.018640995025634766, 0.01974201202392578, 0.020843029022216797, 0.021944046020507812, 0.023045063018798828, 0.024146080017089844, 0.02524709701538086, 0.026348114013671875, 0.02744913101196289, 0.028550148010253906, 0.029651165008544922, 0.030752182006835938, 0.03185319900512695, 0.03295421600341797, 0.034055233001708984, 0.03515625]}, "gradients/decoder.transformer.h.16.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 4.0, 12.0, 46.0, 150.0, 284.0, 293.0, 147.0, 53.0, 19.0, 5.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0005440074019134045, -0.0004947661655023694, -0.0004455248999875039, -0.0003962836635764688, -0.0003470423980616033, -0.00029780116165056825, -0.0002485599252395332, -0.00019931865972466767, -0.0001500774233136326, -0.00010083617235068232, -5.159492866368964e-05, -2.353684976696968e-06, 4.688756598625332e-05, 9.612881694920361e-05, 0.00014537005336023867, 0.0001946113188751042, 0.00024385255528613925, 0.0002930937916971743, 0.00034233505721203983, 0.0003915762936230749, 0.0004408175591379404, 0.0004900587955489755, 0.0005393000319600105, 0.0005885412683710456, 0.0006377825047820807, 0.0006870237411931157, 0.0007362649776041508, 0.0007855062140151858, 0.0008347475086338818, 0.0008839887450449169, 0.0009332299814559519, 0.000982471276074648, 0.001031712512485683, 0.001080953748896718, 0.001130194985307753, 0.0011794362217187881, 0.0012286774581298232, 0.0012779186945408583, 0.0013271600473672152, 0.0013764012837782502, 0.0014256425201892853, 0.0014748837566003203, 0.0015241249930113554, 0.0015733662294223905, 0.0016226074658334255, 0.0016718488186597824, 0.0017210899386554956, 0.0017703312914818525, 0.0018195724114775658, 0.0018688136478886008, 0.0019180548842996359, 0.0019672962371259928, 0.002016537357121706, 0.002065778709948063, 0.002115019829943776, 0.002164261182770133, 0.00221350253559649, 0.002262743888422847, 0.00231198500841856, 0.002361226361244917, 0.00241046748124063, 0.002459708834066987, 0.0025089499540627003, 0.002558191306889057, 0.0026074324268847704]}, "gradients/decoder.transformer.h.16.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 2.0, 3.0, 9.0, 4.0, 8.0, 11.0, 7.0, 18.0, 13.0, 18.0, 27.0, 25.0, 28.0, 17.0, 24.0, 36.0, 31.0, 32.0, 38.0, 43.0, 36.0, 35.0, 44.0, 33.0, 42.0, 28.0, 36.0, 52.0, 36.0, 34.0, 26.0, 28.0, 28.0, 17.0, 24.0, 16.0, 32.0, 11.0, 14.0, 9.0, 5.0, 5.0, 3.0, 8.0, 5.0, 3.0, 3.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 0.0, 1.0], "bins": [-0.0006296038627624512, -0.0006101317703723907, -0.0005906596779823303, -0.0005711875855922699, -0.0005517154932022095, -0.000532243400812149, -0.0005127713084220886, -0.0004932992160320282, -0.0004738271236419678, -0.00045435503125190735, -0.0004348829388618469, -0.0004154108464717865, -0.0003959387540817261, -0.00037646666169166565, -0.0003569945693016052, -0.0003375224769115448, -0.0003180503845214844, -0.00029857829213142395, -0.0002791061997413635, -0.0002596341073513031, -0.00024016201496124268, -0.00022068992257118225, -0.00020121783018112183, -0.0001817457377910614, -0.00016227364540100098, -0.00014280155301094055, -0.00012332946062088013, -0.0001038573682308197, -8.438527584075928e-05, -6.491318345069885e-05, -4.544109106063843e-05, -2.5968998670578003e-05, -6.496906280517578e-06, 1.2975186109542847e-05, 3.244727849960327e-05, 5.1919370889663696e-05, 7.139146327972412e-05, 9.086355566978455e-05, 0.00011033564805984497, 0.0001298077404499054, 0.00014927983283996582, 0.00016875192523002625, 0.00018822401762008667, 0.0002076961100101471, 0.00022716820240020752, 0.00024664029479026794, 0.00026611238718032837, 0.0002855844795703888, 0.0003050565719604492, 0.00032452866435050964, 0.00034400075674057007, 0.0003634728491306305, 0.0003829449415206909, 0.00040241703391075134, 0.00042188912630081177, 0.0004413612186908722, 0.0004608333110809326, 0.00048030540347099304, 0.0004997774958610535, 0.0005192495882511139, 0.0005387216806411743, 0.0005581937730312347, 0.0005776658654212952, 0.0005971379578113556, 0.000616610050201416]}, "gradients/decoder.transformer.h.16.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 3.0, 2.0, 2.0, 4.0, 3.0, 3.0, 5.0, 8.0, 8.0, 10.0, 9.0, 16.0, 11.0, 14.0, 25.0, 27.0, 31.0, 32.0, 38.0, 29.0, 35.0, 33.0, 37.0, 39.0, 43.0, 46.0, 38.0, 52.0, 37.0, 49.0, 47.0, 29.0, 34.0, 33.0, 28.0, 20.0, 28.0, 19.0, 12.0, 10.0, 13.0, 14.0, 12.0, 4.0, 6.0, 3.0, 3.0, 4.0, 3.0, 2.0, 3.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 2.0], "bins": [-6.28125, -6.08349609375, -5.8857421875, -5.68798828125, -5.490234375, -5.29248046875, -5.0947265625, -4.89697265625, -4.69921875, -4.50146484375, -4.3037109375, -4.10595703125, -3.908203125, -3.71044921875, -3.5126953125, -3.31494140625, -3.1171875, -2.91943359375, -2.7216796875, -2.52392578125, -2.326171875, -2.12841796875, -1.9306640625, -1.73291015625, -1.53515625, -1.33740234375, -1.1396484375, -0.94189453125, -0.744140625, -0.54638671875, -0.3486328125, -0.15087890625, 0.046875, 0.24462890625, 0.4423828125, 0.64013671875, 0.837890625, 1.03564453125, 1.2333984375, 1.43115234375, 1.62890625, 1.82666015625, 2.0244140625, 2.22216796875, 2.419921875, 2.61767578125, 2.8154296875, 3.01318359375, 3.2109375, 3.40869140625, 3.6064453125, 3.80419921875, 4.001953125, 4.19970703125, 4.3974609375, 4.59521484375, 4.79296875, 4.99072265625, 5.1884765625, 5.38623046875, 5.583984375, 5.78173828125, 5.9794921875, 6.17724609375, 6.375]}, "gradients/decoder.transformer.h.16.attn.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 4.0, 2.0, 4.0, 0.0, 3.0, 4.0, 7.0, 9.0, 10.0, 18.0, 30.0, 38.0, 84.0, 115.0, 180.0, 323.0, 554.0, 912.0, 1641.0, 2835.0, 5306.0, 9713.0, 18316.0, 37322.0, 81869.0, 216900.0, 394368.0, 152754.0, 62762.0, 29760.0, 14856.0, 7938.0, 4298.0, 2366.0, 1351.0, 772.0, 464.0, 268.0, 160.0, 99.0, 47.0, 37.0, 23.0, 11.0, 9.0, 7.0, 3.0, 5.0, 1.0, 2.0, 2.0, 1.0, 3.0, 0.0, 2.0, 1.0, 0.0, 2.0], "bins": [-3.646484375, -3.533721923828125, -3.42095947265625, -3.308197021484375, -3.1954345703125, -3.082672119140625, -2.96990966796875, -2.857147216796875, -2.744384765625, -2.631622314453125, -2.51885986328125, -2.406097412109375, -2.2933349609375, -2.180572509765625, -2.06781005859375, -1.955047607421875, -1.84228515625, -1.729522705078125, -1.61676025390625, -1.503997802734375, -1.3912353515625, -1.278472900390625, -1.16571044921875, -1.052947998046875, -0.940185546875, -0.827423095703125, -0.71466064453125, -0.601898193359375, -0.4891357421875, -0.376373291015625, -0.26361083984375, -0.150848388671875, -0.0380859375, 0.074676513671875, 0.18743896484375, 0.300201416015625, 0.4129638671875, 0.525726318359375, 0.63848876953125, 0.751251220703125, 0.864013671875, 0.976776123046875, 1.08953857421875, 1.202301025390625, 1.3150634765625, 1.427825927734375, 1.54058837890625, 1.653350830078125, 1.76611328125, 1.878875732421875, 1.99163818359375, 2.104400634765625, 2.2171630859375, 2.329925537109375, 2.44268798828125, 2.555450439453125, 2.668212890625, 2.780975341796875, 2.89373779296875, 3.006500244140625, 3.1192626953125, 3.232025146484375, 3.34478759765625, 3.457550048828125, 3.5703125]}, "gradients/decoder.transformer.h.16.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 3.0, 2.0, 6.0, 4.0, 8.0, 6.0, 8.0, 13.0, 12.0, 16.0, 16.0, 21.0, 35.0, 34.0, 22.0, 34.0, 23.0, 45.0, 50.0, 60.0, 115.0, 366.0, 1602.0, 108.0, 68.0, 60.0, 43.0, 29.0, 34.0, 30.0, 36.0, 25.0, 32.0, 15.0, 14.0, 17.0, 14.0, 9.0, 5.0, 9.0, 2.0, 8.0, 2.0, 2.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-24.0625, -23.356689453125, -22.65087890625, -21.945068359375, -21.2392578125, -20.533447265625, -19.82763671875, -19.121826171875, -18.416015625, -17.710205078125, -17.00439453125, -16.298583984375, -15.5927734375, -14.886962890625, -14.18115234375, -13.475341796875, -12.76953125, -12.063720703125, -11.35791015625, -10.652099609375, -9.9462890625, -9.240478515625, -8.53466796875, -7.828857421875, -7.123046875, -6.417236328125, -5.71142578125, -5.005615234375, -4.2998046875, -3.593994140625, -2.88818359375, -2.182373046875, -1.4765625, -0.770751953125, -0.06494140625, 0.640869140625, 1.3466796875, 2.052490234375, 2.75830078125, 3.464111328125, 4.169921875, 4.875732421875, 5.58154296875, 6.287353515625, 6.9931640625, 7.698974609375, 8.40478515625, 9.110595703125, 9.81640625, 10.522216796875, 11.22802734375, 11.933837890625, 12.6396484375, 13.345458984375, 14.05126953125, 14.757080078125, 15.462890625, 16.168701171875, 16.87451171875, 17.580322265625, 18.2861328125, 18.991943359375, 19.69775390625, 20.403564453125, 21.109375]}, "gradients/decoder.transformer.h.16.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 4.0, 3.0, 5.0, 5.0, 10.0, 7.0, 12.0, 14.0, 25.0, 28.0, 35.0, 47.0, 47.0, 90.0, 101.0, 163.0, 259.0, 423.0, 1149.0, 15939.0, 3095994.0, 28726.0, 1288.0, 450.0, 262.0, 168.0, 119.0, 91.0, 72.0, 32.0, 34.0, 23.0, 25.0, 13.0, 16.0, 11.0, 7.0, 8.0, 3.0, 3.0, 3.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 2.0, 0.0, 1.0], "bins": [-53.8125, -52.25537109375, -50.6982421875, -49.14111328125, -47.583984375, -46.02685546875, -44.4697265625, -42.91259765625, -41.35546875, -39.79833984375, -38.2412109375, -36.68408203125, -35.126953125, -33.56982421875, -32.0126953125, -30.45556640625, -28.8984375, -27.34130859375, -25.7841796875, -24.22705078125, -22.669921875, -21.11279296875, -19.5556640625, -17.99853515625, -16.44140625, -14.88427734375, -13.3271484375, -11.77001953125, -10.212890625, -8.65576171875, -7.0986328125, -5.54150390625, -3.984375, -2.42724609375, -0.8701171875, 0.68701171875, 2.244140625, 3.80126953125, 5.3583984375, 6.91552734375, 8.47265625, 10.02978515625, 11.5869140625, 13.14404296875, 14.701171875, 16.25830078125, 17.8154296875, 19.37255859375, 20.9296875, 22.48681640625, 24.0439453125, 25.60107421875, 27.158203125, 28.71533203125, 30.2724609375, 31.82958984375, 33.38671875, 34.94384765625, 36.5009765625, 38.05810546875, 39.615234375, 41.17236328125, 42.7294921875, 44.28662109375, 45.84375]}, "gradients/decoder.transformer.h.16.ln_1.weight": {"_type": "histogram", "values": [2.0, 801.0, 217.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-23.72690200805664, -11.05643081665039, 1.6140403747558594, 14.28451156616211, 26.95498275756836, 39.62545394897461, 52.29592514038086, 64.96640014648438, 77.63687133789062, 90.30734252929688, 102.97781372070312, 115.64828491210938, 128.31875610351562, 140.98922729492188, 153.65969848632812, 166.33016967773438, 179.00064086914062, 191.67111206054688, 204.34158325195312, 217.01205444335938, 229.68252563476562, 242.35299682617188, 255.02346801757812, 267.6939392089844, 280.3644104003906, 293.0348815917969, 305.7053527832031, 318.3758239746094, 331.0462951660156, 343.7167663574219, 356.3872375488281, 369.0577087402344, 381.7281494140625, 394.39862060546875, 407.069091796875, 419.73956298828125, 432.4100341796875, 445.08050537109375, 457.7509765625, 470.42144775390625, 483.0919189453125, 495.76239013671875, 508.432861328125, 521.1033325195312, 533.7738037109375, 546.4442749023438, 559.11474609375, 571.7852172851562, 584.4556884765625, 597.1261596679688, 609.796630859375, 622.4671020507812, 635.1375732421875, 647.8080444335938, 660.478515625, 673.1489868164062, 685.8194580078125, 698.4899291992188, 711.160400390625, 723.8308715820312, 736.5013427734375, 749.1718139648438, 761.84228515625, 774.5127563476562, 787.1832275390625]}, "gradients/decoder.transformer.h.16.ln_1.bias": {"_type": "histogram", "values": [4.0, 2.0, 0.0, 1.0, 2.0, 3.0, 3.0, 5.0, 3.0, 4.0, 6.0, 8.0, 6.0, 7.0, 13.0, 12.0, 10.0, 12.0, 19.0, 19.0, 21.0, 28.0, 28.0, 28.0, 27.0, 27.0, 34.0, 37.0, 44.0, 53.0, 35.0, 46.0, 37.0, 41.0, 30.0, 48.0, 39.0, 29.0, 36.0, 22.0, 31.0, 27.0, 21.0, 15.0, 13.0, 13.0, 10.0, 19.0, 11.0, 9.0, 4.0, 3.0, 2.0, 3.0, 3.0, 1.0, 3.0, 4.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-56.92397689819336, -55.117820739746094, -53.311668395996094, -51.50551223754883, -49.69935607910156, -47.89320373535156, -46.0870475769043, -44.28089141845703, -42.47473907470703, -40.668582916259766, -38.862430572509766, -37.0562744140625, -35.2501220703125, -33.443965911865234, -31.63780975341797, -29.831655502319336, -28.025501251220703, -26.21934700012207, -24.413192749023438, -22.607036590576172, -20.80088233947754, -18.994728088378906, -17.18857192993164, -15.382417678833008, -13.576263427734375, -11.770109176635742, -9.963953971862793, -8.157798767089844, -6.351644515991211, -4.545490264892578, -2.739335060119629, -0.9331798553466797, 0.8729705810546875, 2.6791253089904785, 4.4852800369262695, 6.2914347648620605, 8.097589492797852, 9.903743743896484, 11.709898948669434, 13.516054153442383, 15.322208404541016, 17.12836265563965, 18.93451690673828, 20.740673065185547, 22.54682731628418, 24.352981567382812, 26.159137725830078, 27.96529197692871, 29.771446228027344, 31.577600479125977, 33.38375473022461, 35.189910888671875, 36.996063232421875, 38.80221939086914, 40.608375549316406, 42.414527893066406, 44.22068405151367, 46.02684020996094, 47.83299255371094, 49.6391487121582, 51.44530487060547, 53.25145721435547, 55.057613372802734, 56.86376953125, 58.669921875]}, "gradients/decoder.transformer.h.15.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 0.0, 0.0, 2.0, 3.0, 3.0, 3.0, 3.0, 3.0, 4.0, 3.0, 7.0, 13.0, 14.0, 11.0, 15.0, 14.0, 20.0, 28.0, 35.0, 35.0, 33.0, 35.0, 36.0, 20.0, 39.0, 34.0, 36.0, 55.0, 41.0, 44.0, 51.0, 35.0, 34.0, 43.0, 27.0, 36.0, 30.0, 30.0, 26.0, 19.0, 15.0, 8.0, 16.0, 13.0, 10.0, 7.0, 7.0, 5.0, 2.0, 3.0, 2.0, 2.0, 4.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-6.26171875, -6.0599365234375, -5.858154296875, -5.6563720703125, -5.45458984375, -5.2528076171875, -5.051025390625, -4.8492431640625, -4.6474609375, -4.4456787109375, -4.243896484375, -4.0421142578125, -3.84033203125, -3.6385498046875, -3.436767578125, -3.2349853515625, -3.033203125, -2.8314208984375, -2.629638671875, -2.4278564453125, -2.22607421875, -2.0242919921875, -1.822509765625, -1.6207275390625, -1.4189453125, -1.2171630859375, -1.015380859375, -0.8135986328125, -0.61181640625, -0.4100341796875, -0.208251953125, -0.0064697265625, 0.1953125, 0.3970947265625, 0.598876953125, 0.8006591796875, 1.00244140625, 1.2042236328125, 1.406005859375, 1.6077880859375, 1.8095703125, 2.0113525390625, 2.213134765625, 2.4149169921875, 2.61669921875, 2.8184814453125, 3.020263671875, 3.2220458984375, 3.423828125, 3.6256103515625, 3.827392578125, 4.0291748046875, 4.23095703125, 4.4327392578125, 4.634521484375, 4.8363037109375, 5.0380859375, 5.2398681640625, 5.441650390625, 5.6434326171875, 5.84521484375, 6.0469970703125, 6.248779296875, 6.4505615234375, 6.65234375]}, "gradients/decoder.transformer.h.15.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 4.0, 3.0, 4.0, 7.0, 8.0, 13.0, 13.0, 16.0, 23.0, 30.0, 39.0, 44.0, 78.0, 83.0, 150.0, 229.0, 337.0, 527.0, 1101.0, 2132.0, 5194.0, 14623.0, 48632.0, 188721.0, 681076.0, 1494286.0, 1193975.0, 412348.0, 105783.0, 28528.0, 9180.0, 3449.0, 1497.0, 751.0, 469.0, 291.0, 172.0, 118.0, 91.0, 73.0, 55.0, 36.0, 21.0, 29.0, 14.0, 10.0, 8.0, 8.0, 2.0, 7.0, 5.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-7.1796875, -6.953857421875, -6.72802734375, -6.502197265625, -6.2763671875, -6.050537109375, -5.82470703125, -5.598876953125, -5.373046875, -5.147216796875, -4.92138671875, -4.695556640625, -4.4697265625, -4.243896484375, -4.01806640625, -3.792236328125, -3.56640625, -3.340576171875, -3.11474609375, -2.888916015625, -2.6630859375, -2.437255859375, -2.21142578125, -1.985595703125, -1.759765625, -1.533935546875, -1.30810546875, -1.082275390625, -0.8564453125, -0.630615234375, -0.40478515625, -0.178955078125, 0.046875, 0.272705078125, 0.49853515625, 0.724365234375, 0.9501953125, 1.176025390625, 1.40185546875, 1.627685546875, 1.853515625, 2.079345703125, 2.30517578125, 2.531005859375, 2.7568359375, 2.982666015625, 3.20849609375, 3.434326171875, 3.66015625, 3.885986328125, 4.11181640625, 4.337646484375, 4.5634765625, 4.789306640625, 5.01513671875, 5.240966796875, 5.466796875, 5.692626953125, 5.91845703125, 6.144287109375, 6.3701171875, 6.595947265625, 6.82177734375, 7.047607421875, 7.2734375]}, "gradients/decoder.transformer.h.15.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 2.0, 1.0, 1.0, 3.0, 6.0, 5.0, 6.0, 9.0, 4.0, 9.0, 9.0, 19.0, 20.0, 29.0, 38.0, 43.0, 45.0, 73.0, 88.0, 102.0, 163.0, 207.0, 239.0, 316.0, 371.0, 396.0, 381.0, 321.0, 262.0, 185.0, 179.0, 136.0, 97.0, 76.0, 65.0, 42.0, 36.0, 28.0, 14.0, 15.0, 10.0, 7.0, 10.0, 7.0, 5.0, 2.0, 3.0, 1.0, 0.0, 1.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-12.546875, -12.16064453125, -11.7744140625, -11.38818359375, -11.001953125, -10.61572265625, -10.2294921875, -9.84326171875, -9.45703125, -9.07080078125, -8.6845703125, -8.29833984375, -7.912109375, -7.52587890625, -7.1396484375, -6.75341796875, -6.3671875, -5.98095703125, -5.5947265625, -5.20849609375, -4.822265625, -4.43603515625, -4.0498046875, -3.66357421875, -3.27734375, -2.89111328125, -2.5048828125, -2.11865234375, -1.732421875, -1.34619140625, -0.9599609375, -0.57373046875, -0.1875, 0.19873046875, 0.5849609375, 0.97119140625, 1.357421875, 1.74365234375, 2.1298828125, 2.51611328125, 2.90234375, 3.28857421875, 3.6748046875, 4.06103515625, 4.447265625, 4.83349609375, 5.2197265625, 5.60595703125, 5.9921875, 6.37841796875, 6.7646484375, 7.15087890625, 7.537109375, 7.92333984375, 8.3095703125, 8.69580078125, 9.08203125, 9.46826171875, 9.8544921875, 10.24072265625, 10.626953125, 11.01318359375, 11.3994140625, 11.78564453125, 12.171875]}, "gradients/decoder.transformer.h.15.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 1.0, 4.0, 6.0, 7.0, 10.0, 9.0, 12.0, 20.0, 28.0, 28.0, 28.0, 41.0, 53.0, 66.0, 97.0, 97.0, 159.0, 215.0, 400.0, 878.0, 5917.0, 3516542.0, 665239.0, 2697.0, 631.0, 291.0, 204.0, 140.0, 102.0, 82.0, 49.0, 48.0, 35.0, 33.0, 18.0, 22.0, 14.0, 11.0, 14.0, 10.0, 4.0, 8.0, 6.0, 4.0, 4.0, 4.0, 1.0, 1.0, 1.0, 0.0, 3.0], "bins": [-70.625, -68.599609375, -66.57421875, -64.548828125, -62.5234375, -60.498046875, -58.47265625, -56.447265625, -54.421875, -52.396484375, -50.37109375, -48.345703125, -46.3203125, -44.294921875, -42.26953125, -40.244140625, -38.21875, -36.193359375, -34.16796875, -32.142578125, -30.1171875, -28.091796875, -26.06640625, -24.041015625, -22.015625, -19.990234375, -17.96484375, -15.939453125, -13.9140625, -11.888671875, -9.86328125, -7.837890625, -5.8125, -3.787109375, -1.76171875, 0.263671875, 2.2890625, 4.314453125, 6.33984375, 8.365234375, 10.390625, 12.416015625, 14.44140625, 16.466796875, 18.4921875, 20.517578125, 22.54296875, 24.568359375, 26.59375, 28.619140625, 30.64453125, 32.669921875, 34.6953125, 36.720703125, 38.74609375, 40.771484375, 42.796875, 44.822265625, 46.84765625, 48.873046875, 50.8984375, 52.923828125, 54.94921875, 56.974609375, 59.0]}, "gradients/decoder.transformer.h.15.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 0.0, 6.0, 12.0, 13.0, 61.0, 106.0, 175.0, 200.0, 200.0, 127.0, 64.0, 34.0, 12.0, 0.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-60.44974899291992, -57.365997314453125, -54.28224182128906, -51.198486328125, -48.1147346496582, -45.030982971191406, -41.947227478027344, -38.86347198486328, -35.779720306396484, -32.69596862792969, -29.612213134765625, -26.528459548950195, -23.444705963134766, -20.360952377319336, -17.277198791503906, -14.193445205688477, -11.109691619873047, -8.025938034057617, -4.9421844482421875, -1.8584308624267578, 1.2253227233886719, 4.309076309204102, 7.392829895019531, 10.476583480834961, 13.56033706665039, 16.64409065246582, 19.72784423828125, 22.81159782409668, 25.89535140991211, 28.97910499572754, 32.06285858154297, 35.14661407470703, 38.23036193847656, 41.314117431640625, 44.39786911010742, 47.48162078857422, 50.56537628173828, 53.649131774902344, 56.73288345336914, 59.81663513183594, 62.900390625, 65.98414611816406, 69.06790161132812, 72.15164947509766, 75.23540496826172, 78.31916046142578, 81.40290832519531, 84.48666381835938, 87.57041931152344, 90.6541748046875, 93.73793029785156, 96.8216781616211, 99.90543365478516, 102.98918914794922, 106.07293701171875, 109.15669250488281, 112.24044799804688, 115.32420349121094, 118.407958984375, 121.49170684814453, 124.5754623413086, 127.65921783447266, 130.7429656982422, 133.82672119140625, 136.9104766845703]}, "gradients/decoder.transformer.h.15.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 1.0, 1.0, 3.0, 3.0, 6.0, 0.0, 10.0, 5.0, 10.0, 9.0, 9.0, 15.0, 10.0, 20.0, 23.0, 22.0, 17.0, 27.0, 31.0, 29.0, 34.0, 36.0, 37.0, 45.0, 44.0, 53.0, 39.0, 48.0, 43.0, 30.0, 35.0, 36.0, 30.0, 37.0, 20.0, 40.0, 16.0, 24.0, 24.0, 19.0, 16.0, 13.0, 8.0, 2.0, 5.0, 10.0, 5.0, 3.0, 6.0, 1.0, 3.0, 0.0, 4.0, 0.0, 0.0, 0.0, 2.0], "bins": [-49.761070251464844, -48.26976013183594, -46.77845001220703, -45.287139892578125, -43.79582977294922, -42.30451965332031, -40.813209533691406, -39.321895599365234, -37.83058547973633, -36.33927536010742, -34.847965240478516, -33.35665512084961, -31.86534309387207, -30.374032974243164, -28.882722854614258, -27.39141082763672, -25.900102615356445, -24.40879249572754, -22.917482376098633, -21.426170349121094, -19.934860229492188, -18.44355010986328, -16.952239990234375, -15.460928916931152, -13.969618797302246, -12.47830867767334, -10.986997604370117, -9.495687484741211, -8.004377365112305, -6.513066291809082, -5.021756172180176, -3.530445098876953, -2.039134979248047, -0.547824501991272, 0.9434859752655029, 2.4347963333129883, 3.9261069297790527, 5.417417526245117, 6.908727645874023, 8.400038719177246, 9.891348838806152, 11.382658958435059, 12.873970031738281, 14.365280151367188, 15.856590270996094, 17.347900390625, 18.839210510253906, 20.330522537231445, 21.82183265686035, 23.313142776489258, 24.804452896118164, 26.295764923095703, 27.78707504272461, 29.278385162353516, 30.769695281982422, 32.26100540161133, 33.752315521240234, 35.24362564086914, 36.73493576049805, 38.22624588012695, 39.71755599975586, 41.20886993408203, 42.70018005371094, 44.191490173339844, 45.68280029296875]}, "gradients/decoder.transformer.h.15.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 5.0, 6.0, 4.0, 3.0, 6.0, 5.0, 9.0, 14.0, 15.0, 15.0, 17.0, 20.0, 29.0, 23.0, 44.0, 31.0, 33.0, 36.0, 37.0, 45.0, 34.0, 43.0, 36.0, 47.0, 30.0, 47.0, 40.0, 44.0, 28.0, 38.0, 29.0, 19.0, 28.0, 33.0, 24.0, 14.0, 20.0, 14.0, 8.0, 6.0, 5.0, 7.0, 3.0, 7.0, 3.0, 3.0, 3.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-6.26953125, -6.066650390625, -5.86376953125, -5.660888671875, -5.4580078125, -5.255126953125, -5.05224609375, -4.849365234375, -4.646484375, -4.443603515625, -4.24072265625, -4.037841796875, -3.8349609375, -3.632080078125, -3.42919921875, -3.226318359375, -3.0234375, -2.820556640625, -2.61767578125, -2.414794921875, -2.2119140625, -2.009033203125, -1.80615234375, -1.603271484375, -1.400390625, -1.197509765625, -0.99462890625, -0.791748046875, -0.5888671875, -0.385986328125, -0.18310546875, 0.019775390625, 0.22265625, 0.425537109375, 0.62841796875, 0.831298828125, 1.0341796875, 1.237060546875, 1.43994140625, 1.642822265625, 1.845703125, 2.048583984375, 2.25146484375, 2.454345703125, 2.6572265625, 2.860107421875, 3.06298828125, 3.265869140625, 3.46875, 3.671630859375, 3.87451171875, 4.077392578125, 4.2802734375, 4.483154296875, 4.68603515625, 4.888916015625, 5.091796875, 5.294677734375, 5.49755859375, 5.700439453125, 5.9033203125, 6.106201171875, 6.30908203125, 6.511962890625, 6.71484375]}, "gradients/decoder.transformer.h.15.crossattention.c_proj.weight": {"_type": "histogram", "values": [2.0, 4.0, 4.0, 3.0, 3.0, 5.0, 9.0, 15.0, 21.0, 40.0, 34.0, 66.0, 98.0, 128.0, 182.0, 257.0, 368.0, 558.0, 813.0, 1199.0, 1777.0, 2624.0, 4065.0, 6178.0, 9693.0, 14763.0, 23760.0, 38303.0, 61849.0, 109927.0, 250536.0, 246472.0, 108812.0, 61797.0, 37536.0, 23773.0, 15084.0, 9424.0, 6201.0, 3977.0, 2668.0, 1806.0, 1162.0, 813.0, 558.0, 373.0, 251.0, 173.0, 124.0, 90.0, 59.0, 54.0, 25.0, 11.0, 15.0, 11.0, 6.0, 8.0, 5.0, 1.0, 0.0, 1.0, 1.0, 1.0], "bins": [-1.5185546875, -1.4695587158203125, -1.420562744140625, -1.3715667724609375, -1.32257080078125, -1.2735748291015625, -1.224578857421875, -1.1755828857421875, -1.1265869140625, -1.0775909423828125, -1.028594970703125, -0.9795989990234375, -0.93060302734375, -0.8816070556640625, -0.832611083984375, -0.7836151123046875, -0.734619140625, -0.6856231689453125, -0.636627197265625, -0.5876312255859375, -0.53863525390625, -0.4896392822265625, -0.440643310546875, -0.3916473388671875, -0.3426513671875, -0.2936553955078125, -0.244659423828125, -0.1956634521484375, -0.14666748046875, -0.0976715087890625, -0.048675537109375, 0.0003204345703125, 0.04931640625, 0.0983123779296875, 0.147308349609375, 0.1963043212890625, 0.24530029296875, 0.2942962646484375, 0.343292236328125, 0.3922882080078125, 0.4412841796875, 0.4902801513671875, 0.539276123046875, 0.5882720947265625, 0.63726806640625, 0.6862640380859375, 0.735260009765625, 0.7842559814453125, 0.833251953125, 0.8822479248046875, 0.931243896484375, 0.9802398681640625, 1.02923583984375, 1.0782318115234375, 1.127227783203125, 1.1762237548828125, 1.2252197265625, 1.2742156982421875, 1.323211669921875, 1.3722076416015625, 1.42120361328125, 1.4701995849609375, 1.519195556640625, 1.5681915283203125, 1.6171875]}, "gradients/decoder.transformer.h.15.crossattention.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 2.0, 1.0, 1.0, 5.0, 5.0, 3.0, 5.0, 7.0, 6.0, 13.0, 8.0, 16.0, 15.0, 16.0, 23.0, 30.0, 25.0, 16.0, 32.0, 36.0, 31.0, 39.0, 49.0, 47.0, 49.0, 1063.0, 45.0, 41.0, 44.0, 40.0, 36.0, 33.0, 40.0, 24.0, 34.0, 34.0, 17.0, 18.0, 13.0, 10.0, 13.0, 14.0, 7.0, 17.0, 8.0, 3.0, 2.0, 3.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.65625, -3.5291748046875, -3.402099609375, -3.2750244140625, -3.14794921875, -3.0208740234375, -2.893798828125, -2.7667236328125, -2.6396484375, -2.5125732421875, -2.385498046875, -2.2584228515625, -2.13134765625, -2.0042724609375, -1.877197265625, -1.7501220703125, -1.623046875, -1.4959716796875, -1.368896484375, -1.2418212890625, -1.11474609375, -0.9876708984375, -0.860595703125, -0.7335205078125, -0.6064453125, -0.4793701171875, -0.352294921875, -0.2252197265625, -0.09814453125, 0.0289306640625, 0.156005859375, 0.2830810546875, 0.41015625, 0.5372314453125, 0.664306640625, 0.7913818359375, 0.91845703125, 1.0455322265625, 1.172607421875, 1.2996826171875, 1.4267578125, 1.5538330078125, 1.680908203125, 1.8079833984375, 1.93505859375, 2.0621337890625, 2.189208984375, 2.3162841796875, 2.443359375, 2.5704345703125, 2.697509765625, 2.8245849609375, 2.95166015625, 3.0787353515625, 3.205810546875, 3.3328857421875, 3.4599609375, 3.5870361328125, 3.714111328125, 3.8411865234375, 3.96826171875, 4.0953369140625, 4.222412109375, 4.3494873046875, 4.4765625]}, "gradients/decoder.transformer.h.15.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 2.0, 3.0, 2.0, 9.0, 8.0, 18.0, 20.0, 31.0, 40.0, 75.0, 91.0, 200.0, 291.0, 530.0, 872.0, 1657.0, 3055.0, 5600.0, 10343.0, 19957.0, 40595.0, 85642.0, 222134.0, 1464277.0, 126630.0, 57159.0, 27462.0, 14030.0, 7376.0, 3988.0, 2136.0, 1247.0, 684.0, 381.0, 217.0, 136.0, 82.0, 48.0, 29.0, 32.0, 12.0, 15.0, 7.0, 3.0, 5.0, 6.0, 2.0, 2.0, 1.0, 2.0, 0.0, 2.0], "bins": [-2.458984375, -2.389190673828125, -2.31939697265625, -2.249603271484375, -2.1798095703125, -2.110015869140625, -2.04022216796875, -1.970428466796875, -1.900634765625, -1.830841064453125, -1.76104736328125, -1.691253662109375, -1.6214599609375, -1.551666259765625, -1.48187255859375, -1.412078857421875, -1.34228515625, -1.272491455078125, -1.20269775390625, -1.132904052734375, -1.0631103515625, -0.993316650390625, -0.92352294921875, -0.853729248046875, -0.783935546875, -0.714141845703125, -0.64434814453125, -0.574554443359375, -0.5047607421875, -0.434967041015625, -0.36517333984375, -0.295379638671875, -0.2255859375, -0.155792236328125, -0.08599853515625, -0.016204833984375, 0.0535888671875, 0.123382568359375, 0.19317626953125, 0.262969970703125, 0.332763671875, 0.402557373046875, 0.47235107421875, 0.542144775390625, 0.6119384765625, 0.681732177734375, 0.75152587890625, 0.821319580078125, 0.89111328125, 0.960906982421875, 1.03070068359375, 1.100494384765625, 1.1702880859375, 1.240081787109375, 1.30987548828125, 1.379669189453125, 1.449462890625, 1.519256591796875, 1.58905029296875, 1.658843994140625, 1.7286376953125, 1.798431396484375, 1.86822509765625, 1.938018798828125, 2.0078125]}, "gradients/decoder.transformer.h.15.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 1.0, 9.0, 5.0, 6.0, 4.0, 11.0, 7.0, 12.0, 14.0, 13.0, 23.0, 29.0, 28.0, 29.0, 64.0, 79.0, 119.0, 132.0, 104.0, 53.0, 55.0, 46.0, 27.0, 29.0, 11.0, 22.0, 13.0, 14.0, 15.0, 8.0, 7.0, 5.0, 3.0, 2.0, 4.0, 3.0, 4.0, 0.0, 1.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0013170242309570312, -0.001270383596420288, -0.001223742961883545, -0.0011771023273468018, -0.0011304616928100586, -0.0010838210582733154, -0.0010371804237365723, -0.000990539789199829, -0.0009438991546630859, -0.0008972585201263428, -0.0008506178855895996, -0.0008039772510528564, -0.0007573366165161133, -0.0007106959819793701, -0.000664055347442627, -0.0006174147129058838, -0.0005707740783691406, -0.0005241334438323975, -0.0004774928092956543, -0.00043085217475891113, -0.00038421154022216797, -0.0003375709056854248, -0.00029093027114868164, -0.0002442896366119385, -0.0001976490020751953, -0.00015100836753845215, -0.00010436773300170898, -5.772709846496582e-05, -1.1086463928222656e-05, 3.555417060852051e-05, 8.219480514526367e-05, 0.00012883543968200684, 0.00017547607421875, 0.00022211670875549316, 0.00026875734329223633, 0.0003153979778289795, 0.00036203861236572266, 0.0004086792469024658, 0.000455319881439209, 0.0005019605159759521, 0.0005486011505126953, 0.0005952417850494385, 0.0006418824195861816, 0.0006885230541229248, 0.000735163688659668, 0.0007818043231964111, 0.0008284449577331543, 0.0008750855922698975, 0.0009217262268066406, 0.0009683668613433838, 0.001015007495880127, 0.0010616481304168701, 0.0011082887649536133, 0.0011549293994903564, 0.0012015700340270996, 0.0012482106685638428, 0.001294851303100586, 0.001341491937637329, 0.0013881325721740723, 0.0014347732067108154, 0.0014814138412475586, 0.0015280544757843018, 0.001574695110321045, 0.001621335744857788, 0.0016679763793945312]}, "gradients/decoder.transformer.h.15.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 2.0, 0.0, 3.0, 1.0, 1.0, 1.0, 3.0, 7.0, 4.0, 4.0, 8.0, 12.0, 13.0, 20.0, 17.0, 31.0, 28.0, 33.0, 69.0, 130.0, 236.0, 786.0, 801198.0, 244718.0, 680.0, 229.0, 99.0, 65.0, 48.0, 21.0, 24.0, 14.0, 12.0, 19.0, 7.0, 5.0, 7.0, 3.0, 2.0, 6.0, 0.0, 4.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.03924560546875, -0.0380549430847168, -0.036864280700683594, -0.03567361831665039, -0.03448295593261719, -0.033292293548583984, -0.03210163116455078, -0.030910968780517578, -0.029720306396484375, -0.028529644012451172, -0.02733898162841797, -0.026148319244384766, -0.024957656860351562, -0.02376699447631836, -0.022576332092285156, -0.021385669708251953, -0.02019500732421875, -0.019004344940185547, -0.017813682556152344, -0.01662302017211914, -0.015432357788085938, -0.014241695404052734, -0.013051033020019531, -0.011860370635986328, -0.010669708251953125, -0.009479045867919922, -0.008288383483886719, -0.007097721099853516, -0.0059070587158203125, -0.004716396331787109, -0.0035257339477539062, -0.002335071563720703, -0.0011444091796875, 4.6253204345703125e-05, 0.0012369155883789062, 0.0024275779724121094, 0.0036182403564453125, 0.004808902740478516, 0.005999565124511719, 0.007190227508544922, 0.008380889892578125, 0.009571552276611328, 0.010762214660644531, 0.011952877044677734, 0.013143539428710938, 0.01433420181274414, 0.015524864196777344, 0.016715526580810547, 0.01790618896484375, 0.019096851348876953, 0.020287513732910156, 0.02147817611694336, 0.022668838500976562, 0.023859500885009766, 0.02505016326904297, 0.026240825653076172, 0.027431488037109375, 0.028622150421142578, 0.02981281280517578, 0.031003475189208984, 0.03219413757324219, 0.03338479995727539, 0.034575462341308594, 0.0357661247253418, 0.036956787109375]}, "gradients/decoder.transformer.h.15.ln_cross_attn.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 6.0, 29.0, 160.0, 429.0, 310.0, 68.0, 12.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0019975234754383564, -0.001925585325807333, -0.0018536471761763096, -0.0017817090265452862, -0.0017097708769142628, -0.0016378327272832394, -0.001565894577652216, -0.0014939564280211926, -0.0014220182783901691, -0.0013500801287591457, -0.0012781419791281223, -0.001206203829497099, -0.0011342656798660755, -0.0010623275302350521, -0.0009903893806040287, -0.0009184512309730053, -0.0008465130813419819, -0.0007745749317109585, -0.0007026367820799351, -0.0006306986324489117, -0.0005587604828178883, -0.00048682233318686485, -0.00041488418355584145, -0.00034294603392481804, -0.00027100788429379463, -0.00019906973466277122, -0.00012713158503174782, -5.519343540072441e-05, 1.6744714230298996e-05, 8.86828638613224e-05, 0.0001606210134923458, 0.00023255916312336922, 0.00030449707992374897, 0.0003764352295547724, 0.0004483733791857958, 0.0005203115288168192, 0.0005922496784478426, 0.000664187828078866, 0.0007361259777098894, 0.0008080641273409128, 0.0008800022769719362, 0.0009519404266029596, 0.001023878576233983, 0.0010958167258650064, 0.0011677548754960299, 0.0012396930251270533, 0.0013116311747580767, 0.0013835693243891, 0.0014555074740201235, 0.0015274456236511469, 0.0015993837732821703, 0.0016713219229131937, 0.0017432600725442171, 0.0018151982221752405, 0.001887136371806264, 0.0019590745214372873, 0.0020310126710683107, 0.002102950820699334, 0.0021748889703303576, 0.002246827119961381, 0.0023187652695924044, 0.0023907034192234278, 0.002462641568854451, 0.0025345797184854746, 0.002606517868116498]}, "gradients/decoder.transformer.h.15.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 3.0, 2.0, 3.0, 1.0, 3.0, 4.0, 9.0, 6.0, 9.0, 6.0, 16.0, 22.0, 21.0, 18.0, 18.0, 28.0, 37.0, 31.0, 33.0, 39.0, 35.0, 42.0, 33.0, 44.0, 43.0, 40.0, 41.0, 48.0, 41.0, 32.0, 51.0, 28.0, 26.0, 29.0, 28.0, 23.0, 18.0, 17.0, 20.0, 17.0, 14.0, 6.0, 10.0, 6.0, 4.0, 3.0, 1.0, 1.0, 1.0, 2.0, 4.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0006338357925415039, -0.0006140219047665596, -0.0005942080169916153, -0.000574394129216671, -0.0005545802414417267, -0.0005347663536667824, -0.0005149524658918381, -0.0004951385781168938, -0.00047532469034194946, -0.00045551080256700516, -0.00043569691479206085, -0.00041588302701711655, -0.00039606913924217224, -0.00037625525146722794, -0.00035644136369228363, -0.0003366274759173393, -0.000316813588142395, -0.0002969997003674507, -0.0002771858125925064, -0.0002573719248175621, -0.0002375580370426178, -0.0002177441492676735, -0.0001979302614927292, -0.00017811637371778488, -0.00015830248594284058, -0.00013848859816789627, -0.00011867471039295197, -9.886082261800766e-05, -7.904693484306335e-05, -5.923304706811905e-05, -3.9419159293174744e-05, -1.9605271518230438e-05, 2.086162567138672e-07, 2.0022504031658173e-05, 3.983639180660248e-05, 5.9650279581546783e-05, 7.946416735649109e-05, 9.92780551314354e-05, 0.0001190919429063797, 0.000138905830681324, 0.0001587197184562683, 0.00017853360623121262, 0.00019834749400615692, 0.00021816138178110123, 0.00023797526955604553, 0.00025778915733098984, 0.00027760304510593414, 0.00029741693288087845, 0.00031723082065582275, 0.00033704470843076706, 0.00035685859620571136, 0.00037667248398065567, 0.0003964863717556, 0.0004163002595305443, 0.0004361141473054886, 0.0004559280350804329, 0.0004757419228553772, 0.0004955558106303215, 0.0005153696984052658, 0.0005351835861802101, 0.0005549974739551544, 0.0005748113617300987, 0.000594625249505043, 0.0006144391372799873, 0.0006342530250549316]}, "gradients/decoder.transformer.h.15.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 5.0, 6.0, 4.0, 3.0, 6.0, 5.0, 9.0, 14.0, 15.0, 15.0, 17.0, 20.0, 29.0, 23.0, 44.0, 31.0, 33.0, 36.0, 37.0, 45.0, 34.0, 43.0, 36.0, 47.0, 30.0, 47.0, 40.0, 44.0, 28.0, 38.0, 29.0, 19.0, 28.0, 34.0, 23.0, 14.0, 20.0, 14.0, 8.0, 6.0, 5.0, 7.0, 3.0, 7.0, 3.0, 3.0, 3.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-6.26953125, -6.066650390625, -5.86376953125, -5.660888671875, -5.4580078125, -5.255126953125, -5.05224609375, -4.849365234375, -4.646484375, -4.443603515625, -4.24072265625, -4.037841796875, -3.8349609375, -3.632080078125, -3.42919921875, -3.226318359375, -3.0234375, -2.820556640625, -2.61767578125, -2.414794921875, -2.2119140625, -2.009033203125, -1.80615234375, -1.603271484375, -1.400390625, -1.197509765625, -0.99462890625, -0.791748046875, -0.5888671875, -0.385986328125, -0.18310546875, 0.019775390625, 0.22265625, 0.425537109375, 0.62841796875, 0.831298828125, 1.0341796875, 1.237060546875, 1.43994140625, 1.642822265625, 1.845703125, 2.048583984375, 2.25146484375, 2.454345703125, 2.6572265625, 2.860107421875, 3.06298828125, 3.265869140625, 3.46875, 3.671630859375, 3.87451171875, 4.077392578125, 4.2802734375, 4.483154296875, 4.68603515625, 4.888916015625, 5.091796875, 5.294677734375, 5.49755859375, 5.700439453125, 5.9033203125, 6.106201171875, 6.30908203125, 6.511962890625, 6.71484375]}, "gradients/decoder.transformer.h.15.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 3.0, 4.0, 6.0, 9.0, 9.0, 18.0, 32.0, 44.0, 52.0, 93.0, 161.0, 253.0, 355.0, 620.0, 1177.0, 2194.0, 4553.0, 9890.0, 23833.0, 64983.0, 213248.0, 463455.0, 171645.0, 54616.0, 20238.0, 8651.0, 3834.0, 1943.0, 1111.0, 570.0, 361.0, 213.0, 133.0, 87.0, 57.0, 40.0, 27.0, 17.0, 15.0, 4.0, 2.0, 3.0, 2.0, 4.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-6.234375, -6.06317138671875, -5.8919677734375, -5.72076416015625, -5.549560546875, -5.37835693359375, -5.2071533203125, -5.03594970703125, -4.86474609375, -4.69354248046875, -4.5223388671875, -4.35113525390625, -4.179931640625, -4.00872802734375, -3.8375244140625, -3.66632080078125, -3.4951171875, -3.32391357421875, -3.1527099609375, -2.98150634765625, -2.810302734375, -2.63909912109375, -2.4678955078125, -2.29669189453125, -2.12548828125, -1.95428466796875, -1.7830810546875, -1.61187744140625, -1.440673828125, -1.26947021484375, -1.0982666015625, -0.92706298828125, -0.755859375, -0.58465576171875, -0.4134521484375, -0.24224853515625, -0.071044921875, 0.10015869140625, 0.2713623046875, 0.44256591796875, 0.61376953125, 0.78497314453125, 0.9561767578125, 1.12738037109375, 1.298583984375, 1.46978759765625, 1.6409912109375, 1.81219482421875, 1.9833984375, 2.15460205078125, 2.3258056640625, 2.49700927734375, 2.668212890625, 2.83941650390625, 3.0106201171875, 3.18182373046875, 3.35302734375, 3.52423095703125, 3.6954345703125, 3.86663818359375, 4.037841796875, 4.20904541015625, 4.3802490234375, 4.55145263671875, 4.72265625]}, "gradients/decoder.transformer.h.15.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 4.0, 3.0, 4.0, 6.0, 7.0, 5.0, 7.0, 11.0, 18.0, 20.0, 26.0, 32.0, 23.0, 32.0, 51.0, 36.0, 44.0, 51.0, 66.0, 129.0, 1507.0, 398.0, 125.0, 70.0, 59.0, 49.0, 42.0, 50.0, 28.0, 37.0, 23.0, 30.0, 14.0, 9.0, 15.0, 5.0, 7.0, 4.0, 5.0, 2.0, 3.0, 1.0, 3.0, 0.0, 4.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-22.546875, -21.767822265625, -20.98876953125, -20.209716796875, -19.4306640625, -18.651611328125, -17.87255859375, -17.093505859375, -16.314453125, -15.535400390625, -14.75634765625, -13.977294921875, -13.1982421875, -12.419189453125, -11.64013671875, -10.861083984375, -10.08203125, -9.302978515625, -8.52392578125, -7.744873046875, -6.9658203125, -6.186767578125, -5.40771484375, -4.628662109375, -3.849609375, -3.070556640625, -2.29150390625, -1.512451171875, -0.7333984375, 0.045654296875, 0.82470703125, 1.603759765625, 2.3828125, 3.161865234375, 3.94091796875, 4.719970703125, 5.4990234375, 6.278076171875, 7.05712890625, 7.836181640625, 8.615234375, 9.394287109375, 10.17333984375, 10.952392578125, 11.7314453125, 12.510498046875, 13.28955078125, 14.068603515625, 14.84765625, 15.626708984375, 16.40576171875, 17.184814453125, 17.9638671875, 18.742919921875, 19.52197265625, 20.301025390625, 21.080078125, 21.859130859375, 22.63818359375, 23.417236328125, 24.1962890625, 24.975341796875, 25.75439453125, 26.533447265625, 27.3125]}, "gradients/decoder.transformer.h.15.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 4.0, 3.0, 1.0, 4.0, 3.0, 9.0, 3.0, 10.0, 5.0, 18.0, 12.0, 19.0, 30.0, 39.0, 50.0, 64.0, 107.0, 139.0, 163.0, 259.0, 405.0, 829.0, 6193.0, 2980428.0, 152992.0, 2284.0, 532.0, 263.0, 202.0, 165.0, 119.0, 82.0, 67.0, 43.0, 41.0, 39.0, 26.0, 15.0, 12.0, 6.0, 9.0, 9.0, 4.0, 6.0, 2.0, 2.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-58.65625, -56.97119140625, -55.2861328125, -53.60107421875, -51.916015625, -50.23095703125, -48.5458984375, -46.86083984375, -45.17578125, -43.49072265625, -41.8056640625, -40.12060546875, -38.435546875, -36.75048828125, -35.0654296875, -33.38037109375, -31.6953125, -30.01025390625, -28.3251953125, -26.64013671875, -24.955078125, -23.27001953125, -21.5849609375, -19.89990234375, -18.21484375, -16.52978515625, -14.8447265625, -13.15966796875, -11.474609375, -9.78955078125, -8.1044921875, -6.41943359375, -4.734375, -3.04931640625, -1.3642578125, 0.32080078125, 2.005859375, 3.69091796875, 5.3759765625, 7.06103515625, 8.74609375, 10.43115234375, 12.1162109375, 13.80126953125, 15.486328125, 17.17138671875, 18.8564453125, 20.54150390625, 22.2265625, 23.91162109375, 25.5966796875, 27.28173828125, 28.966796875, 30.65185546875, 32.3369140625, 34.02197265625, 35.70703125, 37.39208984375, 39.0771484375, 40.76220703125, 42.447265625, 44.13232421875, 45.8173828125, 47.50244140625, 49.1875]}, "gradients/decoder.transformer.h.15.ln_1.weight": {"_type": "histogram", "values": [6.0, 85.0, 528.0, 356.0, 42.0, 1.0, 1.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-10.14566421508789, -6.601571083068848, -3.057478427886963, 0.4866142272949219, 4.030707359313965, 7.574800491333008, 11.118892669677734, 14.662986755371094, 18.20707893371582, 21.751171112060547, 25.295265197753906, 28.839357376098633, 32.38344955444336, 35.92754364013672, 39.47163391113281, 43.01573181152344, 46.55982208251953, 50.10391616821289, 53.648006439208984, 57.192100524902344, 60.7361946105957, 64.28028869628906, 67.82437896728516, 71.36846923828125, 74.91256713867188, 78.45665740966797, 82.0007553100586, 85.54484558105469, 89.08893585205078, 92.6330337524414, 96.1771240234375, 99.72122192382812, 103.26531219482422, 106.80940246582031, 110.35350036621094, 113.89759063720703, 117.44168090820312, 120.98577880859375, 124.52986907958984, 128.07395935058594, 131.61805725097656, 135.1621551513672, 138.70623779296875, 142.25033569335938, 145.79443359375, 149.33851623535156, 152.8826141357422, 156.4267120361328, 159.97079467773438, 163.514892578125, 167.05897521972656, 170.6030731201172, 174.1471710205078, 177.69125366210938, 181.2353515625, 184.77944946289062, 188.32354736328125, 191.86764526367188, 195.41172790527344, 198.95582580566406, 202.4999237060547, 206.04400634765625, 209.58810424804688, 213.1322021484375, 216.67628479003906]}, "gradients/decoder.transformer.h.15.ln_1.bias": {"_type": "histogram", "values": [3.0, 4.0, 3.0, 3.0, 0.0, 4.0, 2.0, 2.0, 5.0, 15.0, 3.0, 10.0, 19.0, 12.0, 16.0, 24.0, 30.0, 22.0, 22.0, 33.0, 51.0, 33.0, 40.0, 38.0, 42.0, 42.0, 48.0, 39.0, 48.0, 46.0, 50.0, 32.0, 46.0, 31.0, 37.0, 26.0, 19.0, 16.0, 20.0, 20.0, 14.0, 13.0, 7.0, 9.0, 4.0, 7.0, 3.0, 2.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-47.633689880371094, -45.84664535522461, -44.05959701538086, -42.272552490234375, -40.485504150390625, -38.69845962524414, -36.911415100097656, -35.124366760253906, -33.33732223510742, -31.550275802612305, -29.763229370117188, -27.976184844970703, -26.189138412475586, -24.40209197998047, -22.61504554748535, -20.827999114990234, -19.040952682495117, -17.25390625, -15.4668607711792, -13.679814338684082, -11.892768859863281, -10.105722427368164, -8.318675994873047, -6.531630516052246, -4.744584083557129, -2.95753812789917, -1.1704919338226318, 0.6165542602539062, 2.4036002159118652, 4.190646171569824, 5.977692604064941, 7.764738082885742, 9.55178451538086, 11.338830947875977, 13.125876426696777, 14.912922859191895, 16.699968338012695, 18.487014770507812, 20.27406120300293, 22.061107635498047, 23.84815216064453, 25.63519859313965, 27.422245025634766, 29.20928955078125, 30.996335983276367, 32.783382415771484, 34.57042694091797, 36.35747528076172, 38.14452362060547, 39.93156814575195, 41.7186164855957, 43.50566101074219, 45.29270935058594, 47.07975387573242, 48.866798400878906, 50.653846740722656, 52.44089126586914, 54.227935791015625, 56.014984130859375, 57.80202865600586, 59.58907699584961, 61.376121520996094, 63.163169860839844, 64.95021057128906, 66.73725891113281]}, "gradients/decoder.transformer.h.14.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 0.0, 4.0, 3.0, 4.0, 5.0, 3.0, 9.0, 10.0, 14.0, 6.0, 15.0, 21.0, 19.0, 23.0, 32.0, 37.0, 38.0, 26.0, 31.0, 30.0, 51.0, 45.0, 38.0, 35.0, 36.0, 48.0, 42.0, 51.0, 39.0, 37.0, 17.0, 37.0, 33.0, 29.0, 28.0, 23.0, 14.0, 13.0, 13.0, 14.0, 9.0, 6.0, 7.0, 3.0, 3.0, 4.0, 6.0, 0.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-6.6640625, -6.449951171875, -6.23583984375, -6.021728515625, -5.8076171875, -5.593505859375, -5.37939453125, -5.165283203125, -4.951171875, -4.737060546875, -4.52294921875, -4.308837890625, -4.0947265625, -3.880615234375, -3.66650390625, -3.452392578125, -3.23828125, -3.024169921875, -2.81005859375, -2.595947265625, -2.3818359375, -2.167724609375, -1.95361328125, -1.739501953125, -1.525390625, -1.311279296875, -1.09716796875, -0.883056640625, -0.6689453125, -0.454833984375, -0.24072265625, -0.026611328125, 0.1875, 0.401611328125, 0.61572265625, 0.829833984375, 1.0439453125, 1.258056640625, 1.47216796875, 1.686279296875, 1.900390625, 2.114501953125, 2.32861328125, 2.542724609375, 2.7568359375, 2.970947265625, 3.18505859375, 3.399169921875, 3.61328125, 3.827392578125, 4.04150390625, 4.255615234375, 4.4697265625, 4.683837890625, 4.89794921875, 5.112060546875, 5.326171875, 5.540283203125, 5.75439453125, 5.968505859375, 6.1826171875, 6.396728515625, 6.61083984375, 6.824951171875, 7.0390625]}, "gradients/decoder.transformer.h.14.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 6.0, 7.0, 8.0, 8.0, 9.0, 14.0, 17.0, 10.0, 22.0, 26.0, 29.0, 32.0, 45.0, 58.0, 67.0, 131.0, 259.0, 682.0, 4329.0, 155508.0, 3543288.0, 479923.0, 8095.0, 869.0, 283.0, 157.0, 87.0, 68.0, 56.0, 42.0, 23.0, 21.0, 16.0, 23.0, 9.0, 16.0, 9.0, 8.0, 5.0, 8.0, 3.0, 3.0, 5.0, 2.0, 2.0, 3.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-24.28125, -23.48828125, -22.6953125, -21.90234375, -21.109375, -20.31640625, -19.5234375, -18.73046875, -17.9375, -17.14453125, -16.3515625, -15.55859375, -14.765625, -13.97265625, -13.1796875, -12.38671875, -11.59375, -10.80078125, -10.0078125, -9.21484375, -8.421875, -7.62890625, -6.8359375, -6.04296875, -5.25, -4.45703125, -3.6640625, -2.87109375, -2.078125, -1.28515625, -0.4921875, 0.30078125, 1.09375, 1.88671875, 2.6796875, 3.47265625, 4.265625, 5.05859375, 5.8515625, 6.64453125, 7.4375, 8.23046875, 9.0234375, 9.81640625, 10.609375, 11.40234375, 12.1953125, 12.98828125, 13.78125, 14.57421875, 15.3671875, 16.16015625, 16.953125, 17.74609375, 18.5390625, 19.33203125, 20.125, 20.91796875, 21.7109375, 22.50390625, 23.296875, 24.08984375, 24.8828125, 25.67578125, 26.46875]}, "gradients/decoder.transformer.h.14.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 4.0, 1.0, 2.0, 2.0, 6.0, 6.0, 13.0, 11.0, 13.0, 25.0, 21.0, 46.0, 62.0, 92.0, 117.0, 185.0, 313.0, 448.0, 536.0, 550.0, 457.0, 381.0, 280.0, 175.0, 94.0, 69.0, 47.0, 32.0, 36.0, 22.0, 13.0, 9.0, 3.0, 6.0, 3.0, 1.0, 5.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-13.8671875, -13.3375244140625, -12.807861328125, -12.2781982421875, -11.74853515625, -11.2188720703125, -10.689208984375, -10.1595458984375, -9.6298828125, -9.1002197265625, -8.570556640625, -8.0408935546875, -7.51123046875, -6.9815673828125, -6.451904296875, -5.9222412109375, -5.392578125, -4.8629150390625, -4.333251953125, -3.8035888671875, -3.27392578125, -2.7442626953125, -2.214599609375, -1.6849365234375, -1.1552734375, -0.6256103515625, -0.095947265625, 0.4337158203125, 0.96337890625, 1.4930419921875, 2.022705078125, 2.5523681640625, 3.08203125, 3.6116943359375, 4.141357421875, 4.6710205078125, 5.20068359375, 5.7303466796875, 6.260009765625, 6.7896728515625, 7.3193359375, 7.8489990234375, 8.378662109375, 8.9083251953125, 9.43798828125, 9.9676513671875, 10.497314453125, 11.0269775390625, 11.556640625, 12.0863037109375, 12.615966796875, 13.1456298828125, 13.67529296875, 14.2049560546875, 14.734619140625, 15.2642822265625, 15.7939453125, 16.3236083984375, 16.853271484375, 17.3829345703125, 17.91259765625, 18.4422607421875, 18.971923828125, 19.5015869140625, 20.03125]}, "gradients/decoder.transformer.h.14.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 3.0, 6.0, 5.0, 1.0, 4.0, 3.0, 13.0, 7.0, 18.0, 23.0, 28.0, 27.0, 31.0, 40.0, 71.0, 66.0, 153.0, 167.0, 253.0, 575.0, 1694.0, 571076.0, 3615588.0, 2909.0, 581.0, 279.0, 181.0, 97.0, 86.0, 51.0, 75.0, 32.0, 32.0, 28.0, 17.0, 17.0, 11.0, 11.0, 8.0, 5.0, 7.0, 3.0, 8.0, 3.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-77.0, -74.525390625, -72.05078125, -69.576171875, -67.1015625, -64.626953125, -62.15234375, -59.677734375, -57.203125, -54.728515625, -52.25390625, -49.779296875, -47.3046875, -44.830078125, -42.35546875, -39.880859375, -37.40625, -34.931640625, -32.45703125, -29.982421875, -27.5078125, -25.033203125, -22.55859375, -20.083984375, -17.609375, -15.134765625, -12.66015625, -10.185546875, -7.7109375, -5.236328125, -2.76171875, -0.287109375, 2.1875, 4.662109375, 7.13671875, 9.611328125, 12.0859375, 14.560546875, 17.03515625, 19.509765625, 21.984375, 24.458984375, 26.93359375, 29.408203125, 31.8828125, 34.357421875, 36.83203125, 39.306640625, 41.78125, 44.255859375, 46.73046875, 49.205078125, 51.6796875, 54.154296875, 56.62890625, 59.103515625, 61.578125, 64.052734375, 66.52734375, 69.001953125, 71.4765625, 73.951171875, 76.42578125, 78.900390625, 81.375]}, "gradients/decoder.transformer.h.14.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 6.0, 20.0, 64.0, 140.0, 241.0, 261.0, 166.0, 81.0, 24.0, 7.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-83.1891860961914, -79.55642700195312, -75.92367553710938, -72.2909164428711, -68.65815734863281, -65.02540588378906, -61.39264678955078, -57.7598876953125, -54.127132415771484, -50.49437713623047, -46.86161804199219, -43.22886276245117, -39.596107482910156, -35.963348388671875, -32.33059310913086, -28.69783592224121, -25.065078735351562, -21.432321548461914, -17.799564361572266, -14.16680908203125, -10.534051895141602, -6.901294708251953, -3.2685394287109375, 0.36421775817871094, 3.9969749450683594, 7.62973165512085, 11.26248836517334, 14.895244598388672, 18.52800178527832, 22.16075897216797, 25.793514251708984, 29.426271438598633, 33.05903625488281, 36.69179153442383, 40.32455062866211, 43.957305908203125, 47.590065002441406, 51.22282028198242, 54.85557556152344, 58.48833465576172, 62.121089935302734, 65.75384521484375, 69.38660430908203, 73.01936340332031, 76.65211486816406, 80.28487396240234, 83.91763305664062, 87.55038452148438, 91.18314361572266, 94.81590270996094, 98.44865417480469, 102.08141326904297, 105.71417236328125, 109.346923828125, 112.97968292236328, 116.61244201660156, 120.24519348144531, 123.8779525756836, 127.51070404052734, 131.14346313476562, 134.77621459960938, 138.4089813232422, 142.04173278808594, 145.6744842529297, 149.3072509765625]}, "gradients/decoder.transformer.h.14.ln_2.bias": {"_type": "histogram", "values": [2.0, 0.0, 2.0, 1.0, 4.0, 1.0, 3.0, 5.0, 9.0, 5.0, 5.0, 9.0, 7.0, 17.0, 12.0, 21.0, 18.0, 24.0, 22.0, 16.0, 24.0, 26.0, 26.0, 32.0, 43.0, 43.0, 43.0, 36.0, 43.0, 35.0, 47.0, 45.0, 38.0, 39.0, 31.0, 32.0, 22.0, 29.0, 26.0, 23.0, 25.0, 16.0, 15.0, 16.0, 17.0, 8.0, 14.0, 5.0, 6.0, 7.0, 6.0, 7.0, 7.0, 2.0, 0.0, 2.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-39.503021240234375, -38.1512451171875, -36.799468994140625, -35.44769287109375, -34.095916748046875, -32.744136810302734, -31.39236068725586, -30.040584564208984, -28.68880844116211, -27.337032318115234, -25.98525619506836, -24.63347816467285, -23.281702041625977, -21.9299259185791, -20.578147888183594, -19.22637176513672, -17.874595642089844, -16.52281951904297, -15.171042442321777, -13.819265365600586, -12.467489242553711, -11.115713119506836, -9.763936042785645, -8.412158966064453, -7.060382843017578, -5.708606243133545, -4.356829643249512, -3.0050530433654785, -1.6532764434814453, -0.3014998435974121, 1.050276756286621, 2.4020538330078125, 3.7538299560546875, 5.105606555938721, 6.457383155822754, 7.809159755706787, 9.16093635559082, 10.512712478637695, 11.864489555358887, 13.216266632080078, 14.568042755126953, 15.919818878173828, 17.271595001220703, 18.62337303161621, 19.975149154663086, 21.32692527770996, 22.67870330810547, 24.030479431152344, 25.38225555419922, 26.734031677246094, 28.08580780029297, 29.437585830688477, 30.78936195373535, 32.14113998413086, 33.492916107177734, 34.84469223022461, 36.196468353271484, 37.54824447631836, 38.900020599365234, 40.25179672241211, 41.60357666015625, 42.955352783203125, 44.30712890625, 45.658905029296875, 47.01068115234375]}, "gradients/decoder.transformer.h.14.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 4.0, 4.0, 5.0, 2.0, 3.0, 7.0, 4.0, 10.0, 11.0, 18.0, 14.0, 20.0, 25.0, 17.0, 27.0, 21.0, 32.0, 37.0, 36.0, 45.0, 37.0, 46.0, 48.0, 35.0, 38.0, 49.0, 44.0, 32.0, 40.0, 33.0, 42.0, 35.0, 24.0, 24.0, 32.0, 20.0, 15.0, 13.0, 18.0, 10.0, 6.0, 9.0, 4.0, 6.0, 6.0, 3.0, 1.0, 2.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-6.87109375, -6.65704345703125, -6.4429931640625, -6.22894287109375, -6.014892578125, -5.80084228515625, -5.5867919921875, -5.37274169921875, -5.15869140625, -4.94464111328125, -4.7305908203125, -4.51654052734375, -4.302490234375, -4.08843994140625, -3.8743896484375, -3.66033935546875, -3.4462890625, -3.23223876953125, -3.0181884765625, -2.80413818359375, -2.590087890625, -2.37603759765625, -2.1619873046875, -1.94793701171875, -1.73388671875, -1.51983642578125, -1.3057861328125, -1.09173583984375, -0.877685546875, -0.66363525390625, -0.4495849609375, -0.23553466796875, -0.021484375, 0.19256591796875, 0.4066162109375, 0.62066650390625, 0.834716796875, 1.04876708984375, 1.2628173828125, 1.47686767578125, 1.69091796875, 1.90496826171875, 2.1190185546875, 2.33306884765625, 2.547119140625, 2.76116943359375, 2.9752197265625, 3.18927001953125, 3.4033203125, 3.61737060546875, 3.8314208984375, 4.04547119140625, 4.259521484375, 4.47357177734375, 4.6876220703125, 4.90167236328125, 5.11572265625, 5.32977294921875, 5.5438232421875, 5.75787353515625, 5.971923828125, 6.18597412109375, 6.4000244140625, 6.61407470703125, 6.828125]}, "gradients/decoder.transformer.h.14.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 6.0, 4.0, 5.0, 11.0, 10.0, 20.0, 29.0, 37.0, 62.0, 114.0, 168.0, 254.0, 418.0, 670.0, 1024.0, 1682.0, 2701.0, 4337.0, 7020.0, 11364.0, 18474.0, 30803.0, 54174.0, 100761.0, 227106.0, 309204.0, 122586.0, 63850.0, 36024.0, 21617.0, 12949.0, 7916.0, 4826.0, 3122.0, 1928.0, 1183.0, 786.0, 465.0, 319.0, 216.0, 111.0, 65.0, 58.0, 35.0, 21.0, 13.0, 7.0, 5.0, 2.0, 6.0, 4.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-1.8828125, -1.8242340087890625, -1.765655517578125, -1.7070770263671875, -1.64849853515625, -1.5899200439453125, -1.531341552734375, -1.4727630615234375, -1.4141845703125, -1.3556060791015625, -1.297027587890625, -1.2384490966796875, -1.17987060546875, -1.1212921142578125, -1.062713623046875, -1.0041351318359375, -0.945556640625, -0.8869781494140625, -0.828399658203125, -0.7698211669921875, -0.71124267578125, -0.6526641845703125, -0.594085693359375, -0.5355072021484375, -0.4769287109375, -0.4183502197265625, -0.359771728515625, -0.3011932373046875, -0.24261474609375, -0.1840362548828125, -0.125457763671875, -0.0668792724609375, -0.00830078125, 0.0502777099609375, 0.108856201171875, 0.1674346923828125, 0.22601318359375, 0.2845916748046875, 0.343170166015625, 0.4017486572265625, 0.4603271484375, 0.5189056396484375, 0.577484130859375, 0.6360626220703125, 0.69464111328125, 0.7532196044921875, 0.811798095703125, 0.8703765869140625, 0.928955078125, 0.9875335693359375, 1.046112060546875, 1.1046905517578125, 1.16326904296875, 1.2218475341796875, 1.280426025390625, 1.3390045166015625, 1.3975830078125, 1.4561614990234375, 1.514739990234375, 1.5733184814453125, 1.63189697265625, 1.6904754638671875, 1.749053955078125, 1.8076324462890625, 1.8662109375]}, "gradients/decoder.transformer.h.14.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 2.0, 5.0, 3.0, 4.0, 6.0, 6.0, 10.0, 9.0, 9.0, 13.0, 14.0, 16.0, 21.0, 19.0, 28.0, 25.0, 36.0, 33.0, 36.0, 33.0, 35.0, 52.0, 37.0, 40.0, 1069.0, 56.0, 41.0, 48.0, 42.0, 33.0, 25.0, 25.0, 23.0, 32.0, 20.0, 20.0, 17.0, 19.0, 14.0, 11.0, 12.0, 8.0, 9.0, 7.0, 4.0, 4.0, 5.0, 2.0, 2.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-4.0625, -3.92889404296875, -3.7952880859375, -3.66168212890625, -3.528076171875, -3.39447021484375, -3.2608642578125, -3.12725830078125, -2.99365234375, -2.86004638671875, -2.7264404296875, -2.59283447265625, -2.459228515625, -2.32562255859375, -2.1920166015625, -2.05841064453125, -1.9248046875, -1.79119873046875, -1.6575927734375, -1.52398681640625, -1.390380859375, -1.25677490234375, -1.1231689453125, -0.98956298828125, -0.85595703125, -0.72235107421875, -0.5887451171875, -0.45513916015625, -0.321533203125, -0.18792724609375, -0.0543212890625, 0.07928466796875, 0.212890625, 0.34649658203125, 0.4801025390625, 0.61370849609375, 0.747314453125, 0.88092041015625, 1.0145263671875, 1.14813232421875, 1.28173828125, 1.41534423828125, 1.5489501953125, 1.68255615234375, 1.816162109375, 1.94976806640625, 2.0833740234375, 2.21697998046875, 2.3505859375, 2.48419189453125, 2.6177978515625, 2.75140380859375, 2.885009765625, 3.01861572265625, 3.1522216796875, 3.28582763671875, 3.41943359375, 3.55303955078125, 3.6866455078125, 3.82025146484375, 3.953857421875, 4.08746337890625, 4.2210693359375, 4.35467529296875, 4.48828125]}, "gradients/decoder.transformer.h.14.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 2.0, 2.0, 7.0, 7.0, 6.0, 8.0, 14.0, 16.0, 25.0, 50.0, 69.0, 107.0, 176.0, 311.0, 512.0, 885.0, 1634.0, 2804.0, 5197.0, 9452.0, 17762.0, 34821.0, 70598.0, 163034.0, 1493259.0, 156033.0, 68279.0, 34060.0, 17243.0, 9168.0, 5047.0, 2784.0, 1610.0, 855.0, 520.0, 305.0, 180.0, 98.0, 57.0, 52.0, 31.0, 16.0, 14.0, 9.0, 8.0, 6.0, 6.0, 2.0, 3.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-2.451171875, -2.377960205078125, -2.30474853515625, -2.231536865234375, -2.1583251953125, -2.085113525390625, -2.01190185546875, -1.938690185546875, -1.865478515625, -1.792266845703125, -1.71905517578125, -1.645843505859375, -1.5726318359375, -1.499420166015625, -1.42620849609375, -1.352996826171875, -1.27978515625, -1.206573486328125, -1.13336181640625, -1.060150146484375, -0.9869384765625, -0.913726806640625, -0.84051513671875, -0.767303466796875, -0.694091796875, -0.620880126953125, -0.54766845703125, -0.474456787109375, -0.4012451171875, -0.328033447265625, -0.25482177734375, -0.181610107421875, -0.1083984375, -0.035186767578125, 0.03802490234375, 0.111236572265625, 0.1844482421875, 0.257659912109375, 0.33087158203125, 0.404083251953125, 0.477294921875, 0.550506591796875, 0.62371826171875, 0.696929931640625, 0.7701416015625, 0.843353271484375, 0.91656494140625, 0.989776611328125, 1.06298828125, 1.136199951171875, 1.20941162109375, 1.282623291015625, 1.3558349609375, 1.429046630859375, 1.50225830078125, 1.575469970703125, 1.648681640625, 1.721893310546875, 1.79510498046875, 1.868316650390625, 1.9415283203125, 2.014739990234375, 2.08795166015625, 2.161163330078125, 2.234375]}, "gradients/decoder.transformer.h.14.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 0.0, 1.0, 1.0, 2.0, 2.0, 4.0, 7.0, 5.0, 10.0, 11.0, 9.0, 22.0, 27.0, 25.0, 48.0, 55.0, 77.0, 95.0, 106.0, 97.0, 95.0, 72.0, 54.0, 50.0, 29.0, 26.0, 22.0, 13.0, 19.0, 7.0, 6.0, 2.0, 6.0, 3.0, 3.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.00106048583984375, -0.0010221302509307861, -0.0009837746620178223, -0.0009454190731048584, -0.0009070634841918945, -0.0008687078952789307, -0.0008303523063659668, -0.0007919967174530029, -0.0007536411285400391, -0.0007152855396270752, -0.0006769299507141113, -0.0006385743618011475, -0.0006002187728881836, -0.0005618631839752197, -0.0005235075950622559, -0.000485152006149292, -0.0004467964172363281, -0.00040844082832336426, -0.0003700852394104004, -0.0003317296504974365, -0.00029337406158447266, -0.0002550184726715088, -0.00021666288375854492, -0.00017830729484558105, -0.0001399517059326172, -0.00010159611701965332, -6.324052810668945e-05, -2.4884939193725586e-05, 1.3470649719238281e-05, 5.182623863220215e-05, 9.018182754516602e-05, 0.00012853741645812988, 0.00016689300537109375, 0.00020524859428405762, 0.00024360418319702148, 0.00028195977210998535, 0.0003203153610229492, 0.0003586709499359131, 0.00039702653884887695, 0.0004353821277618408, 0.0004737377166748047, 0.0005120933055877686, 0.0005504488945007324, 0.0005888044834136963, 0.0006271600723266602, 0.000665515661239624, 0.0007038712501525879, 0.0007422268390655518, 0.0007805824279785156, 0.0008189380168914795, 0.0008572936058044434, 0.0008956491947174072, 0.0009340047836303711, 0.000972360372543335, 0.0010107159614562988, 0.0010490715503692627, 0.0010874271392822266, 0.0011257827281951904, 0.0011641383171081543, 0.0012024939060211182, 0.001240849494934082, 0.001279205083847046, 0.0013175606727600098, 0.0013559162616729736, 0.0013942718505859375]}, "gradients/decoder.transformer.h.14.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 1.0, 3.0, 4.0, 5.0, 10.0, 10.0, 12.0, 22.0, 23.0, 30.0, 41.0, 64.0, 95.0, 180.0, 289.0, 1018.0, 838657.0, 206637.0, 838.0, 256.0, 124.0, 61.0, 42.0, 46.0, 26.0, 18.0, 10.0, 8.0, 12.0, 5.0, 5.0, 4.0, 4.0, 5.0, 2.0, 1.0, 0.0, 1.0, 2.0], "bins": [-0.037139892578125, -0.036254167556762695, -0.03536844253540039, -0.034482717514038086, -0.03359699249267578, -0.03271126747131348, -0.03182554244995117, -0.030939817428588867, -0.030054092407226562, -0.029168367385864258, -0.028282642364501953, -0.02739691734313965, -0.026511192321777344, -0.02562546730041504, -0.024739742279052734, -0.02385401725769043, -0.022968292236328125, -0.02208256721496582, -0.021196842193603516, -0.02031111717224121, -0.019425392150878906, -0.0185396671295166, -0.017653942108154297, -0.016768217086791992, -0.015882492065429688, -0.014996767044067383, -0.014111042022705078, -0.013225317001342773, -0.012339591979980469, -0.011453866958618164, -0.01056814193725586, -0.009682416915893555, -0.00879669189453125, -0.007910966873168945, -0.007025241851806641, -0.006139516830444336, -0.005253791809082031, -0.0043680667877197266, -0.003482341766357422, -0.002596616744995117, -0.0017108917236328125, -0.0008251667022705078, 6.0558319091796875e-05, 0.0009462833404541016, 0.0018320083618164062, 0.002717733383178711, 0.0036034584045410156, 0.00448918342590332, 0.005374908447265625, 0.00626063346862793, 0.007146358489990234, 0.008032083511352539, 0.008917808532714844, 0.009803533554077148, 0.010689258575439453, 0.011574983596801758, 0.012460708618164062, 0.013346433639526367, 0.014232158660888672, 0.015117883682250977, 0.01600360870361328, 0.016889333724975586, 0.01777505874633789, 0.018660783767700195, 0.0195465087890625]}, "gradients/decoder.transformer.h.14.ln_cross_attn.weight": {"_type": "histogram", "values": [2.0, 2.0, 1.0, 6.0, 107.0, 617.0, 259.0, 25.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0006217927439138293, -0.0005131011130288243, -0.00040440948214381933, -0.0002957178803626448, -0.0001870262494776398, -7.833464769646525e-05, 3.0356983188539743e-05, 0.00013904861407354474, 0.00024774024495854974, 0.00035643187584355474, 0.00046512350672855973, 0.0005738150794059038, 0.0006825067102909088, 0.0007911983411759138, 0.0008998899720609188, 0.0010085816029459238, 0.0011172732338309288, 0.0012259648647159338, 0.0013346564956009388, 0.0014433481264859438, 0.0015520397573709488, 0.0016607313882559538, 0.0017694230191409588, 0.0018781146500259638, 0.0019868062809109688, 0.0020954979117959738, 0.0022041895426809788, 0.0023128811735659838, 0.0024215728044509888, 0.0025302644353359938, 0.0026389560662209988, 0.0027476476971060038, 0.002856339095160365, 0.00296503072604537, 0.003073722356930375, 0.00318241398781538, 0.003291105618700385, 0.00339979724958539, 0.003508488880470395, 0.0036171805113554, 0.003725872142240405, 0.00383456377312541, 0.003943255171179771, 0.00405194703489542, 0.004160638432949781, 0.00426933029666543, 0.004378021694719791, 0.00448671355843544, 0.004595404956489801, 0.004704096354544163, 0.004812788218259811, 0.004921479616314173, 0.005030171480029821, 0.005138862878084183, 0.005247554741799831, 0.005356246139854193, 0.005464938003569841, 0.005573629401624203, 0.005682321265339851, 0.005791012663394213, 0.005899704527109861, 0.006008395925164223, 0.006117087788879871, 0.006225779186934233, 0.006334471050649881]}, "gradients/decoder.transformer.h.14.ln_cross_attn.bias": {"_type": "histogram", "values": [3.0, 0.0, 0.0, 3.0, 0.0, 6.0, 4.0, 1.0, 3.0, 6.0, 4.0, 6.0, 9.0, 10.0, 15.0, 14.0, 8.0, 12.0, 17.0, 24.0, 22.0, 25.0, 28.0, 34.0, 39.0, 43.0, 33.0, 35.0, 41.0, 40.0, 39.0, 47.0, 53.0, 40.0, 40.0, 31.0, 41.0, 24.0, 26.0, 23.0, 16.0, 22.0, 22.0, 17.0, 12.0, 17.0, 11.0, 12.0, 11.0, 5.0, 6.0, 5.0, 6.0, 3.0, 1.0, 1.0, 2.0, 2.0, 1.0, 2.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0003898739814758301, -0.00037703942507505417, -0.00036420486867427826, -0.00035137031227350235, -0.00033853575587272644, -0.00032570119947195053, -0.0003128666430711746, -0.0003000320866703987, -0.0002871975302696228, -0.0002743629738688469, -0.000261528417468071, -0.0002486938610672951, -0.00023585930466651917, -0.00022302474826574326, -0.00021019019186496735, -0.00019735563546419144, -0.00018452107906341553, -0.00017168652266263962, -0.0001588519662618637, -0.0001460174098610878, -0.0001331828534603119, -0.00012034829705953598, -0.00010751374065876007, -9.467918425798416e-05, -8.184462785720825e-05, -6.901007145643234e-05, -5.617551505565643e-05, -4.3340958654880524e-05, -3.0506402254104614e-05, -1.7671845853328705e-05, -4.837289452552795e-06, 7.997266948223114e-06, 2.0831823348999023e-05, 3.366637974977493e-05, 4.650093615055084e-05, 5.933549255132675e-05, 7.217004895210266e-05, 8.500460535287857e-05, 9.783916175365448e-05, 0.00011067371815443039, 0.0001235082745552063, 0.0001363428309559822, 0.00014917738735675812, 0.00016201194375753403, 0.00017484650015830994, 0.00018768105655908585, 0.00020051561295986176, 0.00021335016936063766, 0.00022618472576141357, 0.00023901928216218948, 0.0002518538385629654, 0.0002646883949637413, 0.0002775229513645172, 0.0002903575077652931, 0.00030319206416606903, 0.00031602662056684494, 0.00032886117696762085, 0.00034169573336839676, 0.00035453028976917267, 0.0003673648461699486, 0.0003801994025707245, 0.0003930339589715004, 0.0004058685153722763, 0.0004187030717730522, 0.0004315376281738281]}, "gradients/decoder.transformer.h.14.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 4.0, 4.0, 5.0, 2.0, 3.0, 7.0, 4.0, 10.0, 12.0, 17.0, 14.0, 20.0, 25.0, 17.0, 27.0, 21.0, 32.0, 37.0, 36.0, 45.0, 37.0, 46.0, 48.0, 35.0, 38.0, 49.0, 44.0, 32.0, 40.0, 33.0, 42.0, 35.0, 24.0, 24.0, 32.0, 20.0, 15.0, 13.0, 18.0, 10.0, 6.0, 9.0, 4.0, 6.0, 6.0, 3.0, 1.0, 2.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-6.87109375, -6.65704345703125, -6.4429931640625, -6.22894287109375, -6.014892578125, -5.80084228515625, -5.5867919921875, -5.37274169921875, -5.15869140625, -4.94464111328125, -4.7305908203125, -4.51654052734375, -4.302490234375, -4.08843994140625, -3.8743896484375, -3.66033935546875, -3.4462890625, -3.23223876953125, -3.0181884765625, -2.80413818359375, -2.590087890625, -2.37603759765625, -2.1619873046875, -1.94793701171875, -1.73388671875, -1.51983642578125, -1.3057861328125, -1.09173583984375, -0.877685546875, -0.66363525390625, -0.4495849609375, -0.23553466796875, -0.021484375, 0.19256591796875, 0.4066162109375, 0.62066650390625, 0.834716796875, 1.04876708984375, 1.2628173828125, 1.47686767578125, 1.69091796875, 1.90496826171875, 2.1190185546875, 2.33306884765625, 2.547119140625, 2.76116943359375, 2.9752197265625, 3.18927001953125, 3.4033203125, 3.61737060546875, 3.8314208984375, 4.04547119140625, 4.259521484375, 4.47357177734375, 4.6876220703125, 4.90167236328125, 5.11572265625, 5.32977294921875, 5.5438232421875, 5.75787353515625, 5.971923828125, 6.18597412109375, 6.4000244140625, 6.61407470703125, 6.828125]}, "gradients/decoder.transformer.h.14.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 4.0, 6.0, 7.0, 13.0, 15.0, 10.0, 17.0, 19.0, 32.0, 52.0, 70.0, 86.0, 131.0, 184.0, 267.0, 350.0, 556.0, 856.0, 1392.0, 2450.0, 4936.0, 12084.0, 36199.0, 135031.0, 557557.0, 211600.0, 54123.0, 16565.0, 6394.0, 2925.0, 1553.0, 1002.0, 648.0, 427.0, 254.0, 205.0, 144.0, 121.0, 91.0, 59.0, 29.0, 23.0, 21.0, 19.0, 18.0, 7.0, 7.0, 3.0, 5.0, 0.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0], "bins": [-7.53125, -7.2933349609375, -7.055419921875, -6.8175048828125, -6.57958984375, -6.3416748046875, -6.103759765625, -5.8658447265625, -5.6279296875, -5.3900146484375, -5.152099609375, -4.9141845703125, -4.67626953125, -4.4383544921875, -4.200439453125, -3.9625244140625, -3.724609375, -3.4866943359375, -3.248779296875, -3.0108642578125, -2.77294921875, -2.5350341796875, -2.297119140625, -2.0592041015625, -1.8212890625, -1.5833740234375, -1.345458984375, -1.1075439453125, -0.86962890625, -0.6317138671875, -0.393798828125, -0.1558837890625, 0.08203125, 0.3199462890625, 0.557861328125, 0.7957763671875, 1.03369140625, 1.2716064453125, 1.509521484375, 1.7474365234375, 1.9853515625, 2.2232666015625, 2.461181640625, 2.6990966796875, 2.93701171875, 3.1749267578125, 3.412841796875, 3.6507568359375, 3.888671875, 4.1265869140625, 4.364501953125, 4.6024169921875, 4.84033203125, 5.0782470703125, 5.316162109375, 5.5540771484375, 5.7919921875, 6.0299072265625, 6.267822265625, 6.5057373046875, 6.74365234375, 6.9815673828125, 7.219482421875, 7.4573974609375, 7.6953125]}, "gradients/decoder.transformer.h.14.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 2.0, 4.0, 4.0, 6.0, 9.0, 8.0, 11.0, 19.0, 21.0, 23.0, 28.0, 32.0, 39.0, 43.0, 51.0, 60.0, 47.0, 97.0, 238.0, 1686.0, 136.0, 88.0, 65.0, 59.0, 41.0, 46.0, 39.0, 27.0, 24.0, 16.0, 22.0, 21.0, 13.0, 11.0, 8.0, 6.0, 5.0, 2.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0], "bins": [-31.625, -30.819580078125, -30.01416015625, -29.208740234375, -28.4033203125, -27.597900390625, -26.79248046875, -25.987060546875, -25.181640625, -24.376220703125, -23.57080078125, -22.765380859375, -21.9599609375, -21.154541015625, -20.34912109375, -19.543701171875, -18.73828125, -17.932861328125, -17.12744140625, -16.322021484375, -15.5166015625, -14.711181640625, -13.90576171875, -13.100341796875, -12.294921875, -11.489501953125, -10.68408203125, -9.878662109375, -9.0732421875, -8.267822265625, -7.46240234375, -6.656982421875, -5.8515625, -5.046142578125, -4.24072265625, -3.435302734375, -2.6298828125, -1.824462890625, -1.01904296875, -0.213623046875, 0.591796875, 1.397216796875, 2.20263671875, 3.008056640625, 3.8134765625, 4.618896484375, 5.42431640625, 6.229736328125, 7.03515625, 7.840576171875, 8.64599609375, 9.451416015625, 10.2568359375, 11.062255859375, 11.86767578125, 12.673095703125, 13.478515625, 14.283935546875, 15.08935546875, 15.894775390625, 16.7001953125, 17.505615234375, 18.31103515625, 19.116455078125, 19.921875]}, "gradients/decoder.transformer.h.14.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 1.0, 5.0, 2.0, 10.0, 6.0, 11.0, 30.0, 48.0, 96.0, 197.0, 511.0, 2768.0, 3137727.0, 3379.0, 521.0, 198.0, 99.0, 46.0, 33.0, 12.0, 6.0, 5.0, 3.0, 5.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-94.5, -89.66015625, -84.8203125, -79.98046875, -75.140625, -70.30078125, -65.4609375, -60.62109375, -55.78125, -50.94140625, -46.1015625, -41.26171875, -36.421875, -31.58203125, -26.7421875, -21.90234375, -17.0625, -12.22265625, -7.3828125, -2.54296875, 2.296875, 7.13671875, 11.9765625, 16.81640625, 21.65625, 26.49609375, 31.3359375, 36.17578125, 41.015625, 45.85546875, 50.6953125, 55.53515625, 60.375, 65.21484375, 70.0546875, 74.89453125, 79.734375, 84.57421875, 89.4140625, 94.25390625, 99.09375, 103.93359375, 108.7734375, 113.61328125, 118.453125, 123.29296875, 128.1328125, 132.97265625, 137.8125, 142.65234375, 147.4921875, 152.33203125, 157.171875, 162.01171875, 166.8515625, 171.69140625, 176.53125, 181.37109375, 186.2109375, 191.05078125, 195.890625, 200.73046875, 205.5703125, 210.41015625, 215.25]}, "gradients/decoder.transformer.h.14.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 1.0, 185.0, 831.0, 3.0], "bins": [-1059.2291259765625, -1042.181884765625, -1025.134521484375, -1008.0872802734375, -991.0400390625, -973.9927368164062, -956.9454956054688, -939.898193359375, -922.8509521484375, -905.8036499023438, -888.7564086914062, -871.7091064453125, -854.661865234375, -837.6145629882812, -820.5673217773438, -803.52001953125, -786.4727783203125, -769.4254760742188, -752.3782348632812, -735.3309326171875, -718.28369140625, -701.2363891601562, -684.1891479492188, -667.141845703125, -650.0945434570312, -633.0472412109375, -616.0, -598.9526977539062, -581.9054565429688, -564.858154296875, -547.8109130859375, -530.7636108398438, -513.7164306640625, -496.6691589355469, -479.62188720703125, -462.5746154785156, -445.52734375, -428.4800720214844, -411.43280029296875, -394.385498046875, -377.3382263183594, -360.29095458984375, -343.2436828613281, -326.1964111328125, -309.1491394042969, -292.10186767578125, -275.0545654296875, -258.00732421875, -240.9600372314453, -223.9127655029297, -206.86549377441406, -189.81820678710938, -172.77093505859375, -155.72366333007812, -138.6763916015625, -121.62911987304688, -104.58185577392578, -87.53458404541016, -70.4873046875, -53.440032958984375, -36.39276123046875, -19.345489501953125, -2.2982101440429688, 14.749061584472656, 31.796335220336914]}, "gradients/decoder.transformer.h.14.ln_1.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 4.0, 4.0, 2.0, 8.0, 4.0, 4.0, 11.0, 4.0, 4.0, 20.0, 15.0, 18.0, 14.0, 21.0, 23.0, 24.0, 35.0, 32.0, 28.0, 33.0, 36.0, 34.0, 41.0, 51.0, 42.0, 40.0, 35.0, 43.0, 48.0, 44.0, 38.0, 28.0, 38.0, 28.0, 25.0, 20.0, 24.0, 21.0, 10.0, 16.0, 8.0, 10.0, 8.0, 10.0, 2.0, 2.0, 1.0, 1.0, 0.0, 2.0, 3.0, 0.0, 2.0], "bins": [-68.77639770507812, -66.82051086425781, -64.8646240234375, -62.90873718261719, -60.95284652709961, -58.9969596862793, -57.041072845458984, -55.08518600463867, -53.12929916381836, -51.17341232299805, -49.217525482177734, -47.261634826660156, -45.305747985839844, -43.34986114501953, -41.39397430419922, -39.438087463378906, -37.482200622558594, -35.52631378173828, -33.57042694091797, -31.614538192749023, -29.65865135192871, -27.702762603759766, -25.746875762939453, -23.79098892211914, -21.835098266601562, -19.87921142578125, -17.923322677612305, -15.967435836791992, -14.01154899597168, -12.05566120147705, -10.099773406982422, -8.14388656616211, -6.187999725341797, -4.232112407684326, -2.2762248516082764, -0.32033729553222656, 1.6355500221252441, 3.591437339782715, 5.547325134277344, 7.503211975097656, 9.459099769592285, 11.414987564086914, 13.370874404907227, 15.326762199401855, 17.282649993896484, 19.238536834716797, 21.19442367553711, 23.150310516357422, 25.106199264526367, 27.06208610534668, 29.017974853515625, 30.973861694335938, 32.92974853515625, 34.88563537597656, 36.841522216796875, 38.79740905761719, 40.753299713134766, 42.70918655395508, 44.66507339477539, 46.62096405029297, 48.57685089111328, 50.532737731933594, 52.488624572753906, 54.44451141357422, 56.40039825439453]}, "gradients/decoder.transformer.h.13.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 3.0, 1.0, 5.0, 6.0, 6.0, 9.0, 16.0, 17.0, 11.0, 20.0, 23.0, 20.0, 18.0, 23.0, 33.0, 30.0, 50.0, 49.0, 46.0, 48.0, 35.0, 48.0, 52.0, 47.0, 34.0, 36.0, 33.0, 50.0, 33.0, 26.0, 29.0, 23.0, 31.0, 15.0, 24.0, 12.0, 13.0, 5.0, 8.0, 9.0, 3.0, 3.0, 5.0, 3.0, 1.0, 2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-7.81640625, -7.57867431640625, -7.3409423828125, -7.10321044921875, -6.865478515625, -6.62774658203125, -6.3900146484375, -6.15228271484375, -5.91455078125, -5.67681884765625, -5.4390869140625, -5.20135498046875, -4.963623046875, -4.72589111328125, -4.4881591796875, -4.25042724609375, -4.0126953125, -3.77496337890625, -3.5372314453125, -3.29949951171875, -3.061767578125, -2.82403564453125, -2.5863037109375, -2.34857177734375, -2.11083984375, -1.87310791015625, -1.6353759765625, -1.39764404296875, -1.159912109375, -0.92218017578125, -0.6844482421875, -0.44671630859375, -0.208984375, 0.02874755859375, 0.2664794921875, 0.50421142578125, 0.741943359375, 0.97967529296875, 1.2174072265625, 1.45513916015625, 1.69287109375, 1.93060302734375, 2.1683349609375, 2.40606689453125, 2.643798828125, 2.88153076171875, 3.1192626953125, 3.35699462890625, 3.5947265625, 3.83245849609375, 4.0701904296875, 4.30792236328125, 4.545654296875, 4.78338623046875, 5.0211181640625, 5.25885009765625, 5.49658203125, 5.73431396484375, 5.9720458984375, 6.20977783203125, 6.447509765625, 6.68524169921875, 6.9229736328125, 7.16070556640625, 7.3984375]}, "gradients/decoder.transformer.h.13.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 2.0, 1.0, 0.0, 4.0, 7.0, 11.0, 11.0, 18.0, 10.0, 12.0, 21.0, 15.0, 29.0, 24.0, 26.0, 44.0, 24.0, 63.0, 143.0, 548.0, 3206.0, 105534.0, 3557091.0, 518952.0, 7129.0, 807.0, 192.0, 83.0, 46.0, 30.0, 33.0, 29.0, 22.0, 28.0, 17.0, 21.0, 7.0, 12.0, 12.0, 8.0, 4.0, 5.0, 3.0, 1.0, 6.0, 2.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 2.0], "bins": [-29.828125, -28.94189453125, -28.0556640625, -27.16943359375, -26.283203125, -25.39697265625, -24.5107421875, -23.62451171875, -22.73828125, -21.85205078125, -20.9658203125, -20.07958984375, -19.193359375, -18.30712890625, -17.4208984375, -16.53466796875, -15.6484375, -14.76220703125, -13.8759765625, -12.98974609375, -12.103515625, -11.21728515625, -10.3310546875, -9.44482421875, -8.55859375, -7.67236328125, -6.7861328125, -5.89990234375, -5.013671875, -4.12744140625, -3.2412109375, -2.35498046875, -1.46875, -0.58251953125, 0.3037109375, 1.18994140625, 2.076171875, 2.96240234375, 3.8486328125, 4.73486328125, 5.62109375, 6.50732421875, 7.3935546875, 8.27978515625, 9.166015625, 10.05224609375, 10.9384765625, 11.82470703125, 12.7109375, 13.59716796875, 14.4833984375, 15.36962890625, 16.255859375, 17.14208984375, 18.0283203125, 18.91455078125, 19.80078125, 20.68701171875, 21.5732421875, 22.45947265625, 23.345703125, 24.23193359375, 25.1181640625, 26.00439453125, 26.890625]}, "gradients/decoder.transformer.h.13.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 3.0, 0.0, 2.0, 2.0, 2.0, 3.0, 3.0, 2.0, 5.0, 12.0, 5.0, 12.0, 13.0, 17.0, 30.0, 23.0, 34.0, 43.0, 52.0, 64.0, 84.0, 111.0, 122.0, 151.0, 230.0, 258.0, 314.0, 347.0, 339.0, 364.0, 297.0, 249.0, 202.0, 143.0, 118.0, 93.0, 67.0, 57.0, 46.0, 40.0, 29.0, 23.0, 16.0, 17.0, 13.0, 4.0, 6.0, 5.0, 7.0, 2.0, 2.0, 1.0, 2.0, 2.0, 2.0, 0.0, 1.0], "bins": [-12.203125, -11.857177734375, -11.51123046875, -11.165283203125, -10.8193359375, -10.473388671875, -10.12744140625, -9.781494140625, -9.435546875, -9.089599609375, -8.74365234375, -8.397705078125, -8.0517578125, -7.705810546875, -7.35986328125, -7.013916015625, -6.66796875, -6.322021484375, -5.97607421875, -5.630126953125, -5.2841796875, -4.938232421875, -4.59228515625, -4.246337890625, -3.900390625, -3.554443359375, -3.20849609375, -2.862548828125, -2.5166015625, -2.170654296875, -1.82470703125, -1.478759765625, -1.1328125, -0.786865234375, -0.44091796875, -0.094970703125, 0.2509765625, 0.596923828125, 0.94287109375, 1.288818359375, 1.634765625, 1.980712890625, 2.32666015625, 2.672607421875, 3.0185546875, 3.364501953125, 3.71044921875, 4.056396484375, 4.40234375, 4.748291015625, 5.09423828125, 5.440185546875, 5.7861328125, 6.132080078125, 6.47802734375, 6.823974609375, 7.169921875, 7.515869140625, 7.86181640625, 8.207763671875, 8.5537109375, 8.899658203125, 9.24560546875, 9.591552734375, 9.9375]}, "gradients/decoder.transformer.h.13.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 4.0, 4.0, 1.0, 4.0, 7.0, 13.0, 8.0, 7.0, 15.0, 29.0, 16.0, 27.0, 42.0, 47.0, 51.0, 56.0, 71.0, 69.0, 89.0, 125.0, 154.0, 241.0, 293.0, 458.0, 1062.0, 22369.0, 4048622.0, 116749.0, 1632.0, 550.0, 319.0, 240.0, 180.0, 134.0, 129.0, 92.0, 84.0, 56.0, 47.0, 34.0, 35.0, 16.0, 23.0, 13.0, 15.0, 8.0, 12.0, 5.0, 10.0, 8.0, 7.0, 1.0, 6.0, 4.0, 1.0, 1.0, 0.0, 2.0, 1.0, 0.0, 4.0], "bins": [-53.90625, -52.08740234375, -50.2685546875, -48.44970703125, -46.630859375, -44.81201171875, -42.9931640625, -41.17431640625, -39.35546875, -37.53662109375, -35.7177734375, -33.89892578125, -32.080078125, -30.26123046875, -28.4423828125, -26.62353515625, -24.8046875, -22.98583984375, -21.1669921875, -19.34814453125, -17.529296875, -15.71044921875, -13.8916015625, -12.07275390625, -10.25390625, -8.43505859375, -6.6162109375, -4.79736328125, -2.978515625, -1.15966796875, 0.6591796875, 2.47802734375, 4.296875, 6.11572265625, 7.9345703125, 9.75341796875, 11.572265625, 13.39111328125, 15.2099609375, 17.02880859375, 18.84765625, 20.66650390625, 22.4853515625, 24.30419921875, 26.123046875, 27.94189453125, 29.7607421875, 31.57958984375, 33.3984375, 35.21728515625, 37.0361328125, 38.85498046875, 40.673828125, 42.49267578125, 44.3115234375, 46.13037109375, 47.94921875, 49.76806640625, 51.5869140625, 53.40576171875, 55.224609375, 57.04345703125, 58.8623046875, 60.68115234375, 62.5]}, "gradients/decoder.transformer.h.13.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 6.0, 28.0, 112.0, 221.0, 320.0, 213.0, 81.0, 27.0, 5.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-188.25103759765625, -184.0236053466797, -179.7961883544922, -175.56875610351562, -171.34133911132812, -167.11390686035156, -162.88648986816406, -158.6590576171875, -154.431640625, -150.20420837402344, -145.97679138183594, -141.74935913085938, -137.52194213867188, -133.2945098876953, -129.0670928955078, -124.83966064453125, -120.61223602294922, -116.38481140136719, -112.15738677978516, -107.92996215820312, -103.7025375366211, -99.47511291503906, -95.2476806640625, -91.020263671875, -86.79283142089844, -82.5654067993164, -78.33798217773438, -74.11055755615234, -69.88313293457031, -65.65570831298828, -61.428279876708984, -57.20085525512695, -52.973426818847656, -48.746002197265625, -44.518577575683594, -40.29115295410156, -36.06372833251953, -31.836301803588867, -27.608875274658203, -23.381450653076172, -19.15402603149414, -14.92660140991211, -10.699175834655762, -6.471750259399414, -2.244325637817383, 1.9830989837646484, 6.2105255126953125, 10.437950134277344, 14.665374755859375, 18.892799377441406, 23.120223999023438, 27.3476505279541, 31.575075149536133, 35.80249786376953, 40.02992630004883, 44.25735092163086, 48.48477554321289, 52.71220016479492, 56.93962478637695, 61.16705322265625, 65.39447784423828, 69.62190246582031, 73.84932708740234, 78.07675170898438, 82.3041763305664]}, "gradients/decoder.transformer.h.13.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 3.0, 3.0, 3.0, 1.0, 2.0, 2.0, 4.0, 14.0, 10.0, 8.0, 12.0, 11.0, 19.0, 23.0, 19.0, 30.0, 26.0, 24.0, 33.0, 26.0, 40.0, 37.0, 48.0, 38.0, 33.0, 37.0, 46.0, 36.0, 49.0, 44.0, 33.0, 32.0, 30.0, 26.0, 29.0, 26.0, 24.0, 25.0, 14.0, 18.0, 7.0, 14.0, 9.0, 7.0, 8.0, 5.0, 8.0, 2.0, 3.0, 5.0, 4.0, 2.0, 4.0, 2.0, 1.0, 1.0, 0.0, 0.0, 2.0], "bins": [-40.45892333984375, -39.16551208496094, -37.87209701538086, -36.57868576049805, -35.28527069091797, -33.991859436035156, -32.698448181152344, -31.4050350189209, -30.111621856689453, -28.818208694458008, -27.524795532226562, -26.23138427734375, -24.937971115112305, -23.64455795288086, -22.351146697998047, -21.0577335357666, -19.764320373535156, -18.47090721130371, -17.177494049072266, -15.884082794189453, -14.590669631958008, -13.297256469726562, -12.003844261169434, -10.710432052612305, -9.41701889038086, -8.123605728149414, -6.830193519592285, -5.536780834197998, -4.243368148803711, -2.949955463409424, -1.6565427780151367, -0.3631305694580078, 0.9302825927734375, 2.2236952781677246, 3.5171079635620117, 4.810520648956299, 6.103933334350586, 7.397346019744873, 8.69075870513916, 9.984170913696289, 11.277584075927734, 12.57099723815918, 13.864409446716309, 15.157821655273438, 16.451234817504883, 17.744647979736328, 19.03805923461914, 20.331472396850586, 21.62488555908203, 22.918298721313477, 24.211711883544922, 25.505123138427734, 26.79853630065918, 28.091949462890625, 29.385360717773438, 30.678773880004883, 31.972187042236328, 33.26559829711914, 34.55901336669922, 35.85242462158203, 37.145835876464844, 38.43925094604492, 39.732662200927734, 41.02607727050781, 42.319488525390625]}, "gradients/decoder.transformer.h.13.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 5.0, 3.0, 0.0, 7.0, 6.0, 9.0, 13.0, 9.0, 11.0, 22.0, 22.0, 24.0, 26.0, 33.0, 23.0, 38.0, 45.0, 53.0, 38.0, 43.0, 41.0, 42.0, 45.0, 53.0, 49.0, 41.0, 32.0, 40.0, 30.0, 36.0, 32.0, 31.0, 16.0, 12.0, 18.0, 15.0, 12.0, 12.0, 9.0, 4.0, 4.0, 4.0, 3.0, 3.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 0.0, 1.0], "bins": [-8.0234375, -7.78564453125, -7.5478515625, -7.31005859375, -7.072265625, -6.83447265625, -6.5966796875, -6.35888671875, -6.12109375, -5.88330078125, -5.6455078125, -5.40771484375, -5.169921875, -4.93212890625, -4.6943359375, -4.45654296875, -4.21875, -3.98095703125, -3.7431640625, -3.50537109375, -3.267578125, -3.02978515625, -2.7919921875, -2.55419921875, -2.31640625, -2.07861328125, -1.8408203125, -1.60302734375, -1.365234375, -1.12744140625, -0.8896484375, -0.65185546875, -0.4140625, -0.17626953125, 0.0615234375, 0.29931640625, 0.537109375, 0.77490234375, 1.0126953125, 1.25048828125, 1.48828125, 1.72607421875, 1.9638671875, 2.20166015625, 2.439453125, 2.67724609375, 2.9150390625, 3.15283203125, 3.390625, 3.62841796875, 3.8662109375, 4.10400390625, 4.341796875, 4.57958984375, 4.8173828125, 5.05517578125, 5.29296875, 5.53076171875, 5.7685546875, 6.00634765625, 6.244140625, 6.48193359375, 6.7197265625, 6.95751953125, 7.1953125]}, "gradients/decoder.transformer.h.13.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 4.0, 10.0, 5.0, 5.0, 14.0, 15.0, 26.0, 35.0, 72.0, 88.0, 126.0, 201.0, 307.0, 466.0, 792.0, 1099.0, 1751.0, 2807.0, 4401.0, 6935.0, 11404.0, 18178.0, 29990.0, 52122.0, 96714.0, 225827.0, 324280.0, 118937.0, 61312.0, 34591.0, 21277.0, 12918.0, 7898.0, 5071.0, 3120.0, 2081.0, 1326.0, 791.0, 506.0, 357.0, 268.0, 151.0, 97.0, 51.0, 42.0, 39.0, 22.0, 12.0, 13.0, 5.0, 1.0, 5.0, 0.0, 1.0, 3.0, 0.0, 0.0, 2.0], "bins": [-1.8681640625, -1.8100433349609375, -1.751922607421875, -1.6938018798828125, -1.63568115234375, -1.5775604248046875, -1.519439697265625, -1.4613189697265625, -1.4031982421875, -1.3450775146484375, -1.286956787109375, -1.2288360595703125, -1.17071533203125, -1.1125946044921875, -1.054473876953125, -0.9963531494140625, -0.938232421875, -0.8801116943359375, -0.821990966796875, -0.7638702392578125, -0.70574951171875, -0.6476287841796875, -0.589508056640625, -0.5313873291015625, -0.4732666015625, -0.4151458740234375, -0.357025146484375, -0.2989044189453125, -0.24078369140625, -0.1826629638671875, -0.124542236328125, -0.0664215087890625, -0.00830078125, 0.0498199462890625, 0.107940673828125, 0.1660614013671875, 0.22418212890625, 0.2823028564453125, 0.340423583984375, 0.3985443115234375, 0.4566650390625, 0.5147857666015625, 0.572906494140625, 0.6310272216796875, 0.68914794921875, 0.7472686767578125, 0.805389404296875, 0.8635101318359375, 0.921630859375, 0.9797515869140625, 1.037872314453125, 1.0959930419921875, 1.15411376953125, 1.2122344970703125, 1.270355224609375, 1.3284759521484375, 1.3865966796875, 1.4447174072265625, 1.502838134765625, 1.5609588623046875, 1.61907958984375, 1.6772003173828125, 1.735321044921875, 1.7934417724609375, 1.8515625]}, "gradients/decoder.transformer.h.13.crossattention.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 3.0, 2.0, 1.0, 2.0, 3.0, 4.0, 3.0, 10.0, 7.0, 3.0, 7.0, 13.0, 10.0, 25.0, 23.0, 29.0, 23.0, 31.0, 37.0, 31.0, 46.0, 32.0, 47.0, 36.0, 36.0, 50.0, 1065.0, 35.0, 28.0, 37.0, 36.0, 34.0, 40.0, 23.0, 31.0, 29.0, 27.0, 25.0, 23.0, 18.0, 16.0, 9.0, 12.0, 7.0, 6.0, 9.0, 6.0, 5.0, 1.0, 3.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.0390625, -3.8953857421875, -3.751708984375, -3.6080322265625, -3.46435546875, -3.3206787109375, -3.177001953125, -3.0333251953125, -2.8896484375, -2.7459716796875, -2.602294921875, -2.4586181640625, -2.31494140625, -2.1712646484375, -2.027587890625, -1.8839111328125, -1.740234375, -1.5965576171875, -1.452880859375, -1.3092041015625, -1.16552734375, -1.0218505859375, -0.878173828125, -0.7344970703125, -0.5908203125, -0.4471435546875, -0.303466796875, -0.1597900390625, -0.01611328125, 0.1275634765625, 0.271240234375, 0.4149169921875, 0.55859375, 0.7022705078125, 0.845947265625, 0.9896240234375, 1.13330078125, 1.2769775390625, 1.420654296875, 1.5643310546875, 1.7080078125, 1.8516845703125, 1.995361328125, 2.1390380859375, 2.28271484375, 2.4263916015625, 2.570068359375, 2.7137451171875, 2.857421875, 3.0010986328125, 3.144775390625, 3.2884521484375, 3.43212890625, 3.5758056640625, 3.719482421875, 3.8631591796875, 4.0068359375, 4.1505126953125, 4.294189453125, 4.4378662109375, 4.58154296875, 4.7252197265625, 4.868896484375, 5.0125732421875, 5.15625]}, "gradients/decoder.transformer.h.13.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 2.0, 0.0, 4.0, 2.0, 4.0, 5.0, 16.0, 10.0, 16.0, 23.0, 31.0, 56.0, 107.0, 140.0, 229.0, 378.0, 657.0, 1117.0, 1951.0, 3670.0, 6652.0, 12626.0, 24722.0, 50147.0, 106831.0, 287786.0, 1377782.0, 113007.0, 53434.0, 26313.0, 13462.0, 7040.0, 3810.0, 2136.0, 1188.0, 747.0, 393.0, 233.0, 153.0, 96.0, 50.0, 34.0, 22.0, 17.0, 17.0, 9.0, 7.0, 3.0, 2.0, 2.0, 2.0, 4.0, 0.0, 2.0], "bins": [-2.83203125, -2.753448486328125, -2.67486572265625, -2.596282958984375, -2.5177001953125, -2.439117431640625, -2.36053466796875, -2.281951904296875, -2.203369140625, -2.124786376953125, -2.04620361328125, -1.967620849609375, -1.8890380859375, -1.810455322265625, -1.73187255859375, -1.653289794921875, -1.57470703125, -1.496124267578125, -1.41754150390625, -1.338958740234375, -1.2603759765625, -1.181793212890625, -1.10321044921875, -1.024627685546875, -0.946044921875, -0.867462158203125, -0.78887939453125, -0.710296630859375, -0.6317138671875, -0.553131103515625, -0.47454833984375, -0.395965576171875, -0.3173828125, -0.238800048828125, -0.16021728515625, -0.081634521484375, -0.0030517578125, 0.075531005859375, 0.15411376953125, 0.232696533203125, 0.311279296875, 0.389862060546875, 0.46844482421875, 0.547027587890625, 0.6256103515625, 0.704193115234375, 0.78277587890625, 0.861358642578125, 0.93994140625, 1.018524169921875, 1.09710693359375, 1.175689697265625, 1.2542724609375, 1.332855224609375, 1.41143798828125, 1.490020751953125, 1.568603515625, 1.647186279296875, 1.72576904296875, 1.804351806640625, 1.8829345703125, 1.961517333984375, 2.04010009765625, 2.118682861328125, 2.197265625]}, "gradients/decoder.transformer.h.13.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 3.0, 2.0, 3.0, 1.0, 1.0, 2.0, 5.0, 9.0, 5.0, 6.0, 15.0, 12.0, 12.0, 11.0, 25.0, 36.0, 54.0, 62.0, 74.0, 94.0, 100.0, 116.0, 76.0, 71.0, 54.0, 36.0, 25.0, 21.0, 17.0, 11.0, 12.0, 5.0, 11.0, 10.0, 3.0, 4.0, 1.0, 4.0, 2.0, 4.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0012006759643554688, -0.0011558085680007935, -0.0011109411716461182, -0.0010660737752914429, -0.0010212063789367676, -0.0009763389825820923, -0.000931471586227417, -0.0008866041898727417, -0.0008417367935180664, -0.0007968693971633911, -0.0007520020008087158, -0.0007071346044540405, -0.0006622672080993652, -0.0006173998117446899, -0.0005725324153900146, -0.0005276650190353394, -0.00048279762268066406, -0.00043793022632598877, -0.0003930628299713135, -0.0003481954336166382, -0.0003033280372619629, -0.0002584606409072876, -0.0002135932445526123, -0.000168725848197937, -0.00012385845184326172, -7.899105548858643e-05, -3.412365913391113e-05, 1.074373722076416e-05, 5.561113357543945e-05, 0.00010047852993011475, 0.00014534592628479004, 0.00019021332263946533, 0.00023508071899414062, 0.0002799481153488159, 0.0003248155117034912, 0.0003696829080581665, 0.0004145503044128418, 0.0004594177007675171, 0.0005042850971221924, 0.0005491524934768677, 0.000594019889831543, 0.0006388872861862183, 0.0006837546825408936, 0.0007286220788955688, 0.0007734894752502441, 0.0008183568716049194, 0.0008632242679595947, 0.00090809166431427, 0.0009529590606689453, 0.0009978264570236206, 0.001042693853378296, 0.0010875612497329712, 0.0011324286460876465, 0.0011772960424423218, 0.001222163438796997, 0.0012670308351516724, 0.0013118982315063477, 0.001356765627861023, 0.0014016330242156982, 0.0014465004205703735, 0.0014913678169250488, 0.0015362352132797241, 0.0015811026096343994, 0.0016259700059890747, 0.00167083740234375]}, "gradients/decoder.transformer.h.13.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 2.0, 1.0, 2.0, 2.0, 3.0, 7.0, 4.0, 3.0, 9.0, 13.0, 8.0, 14.0, 18.0, 30.0, 43.0, 80.0, 126.0, 295.0, 1309.0, 1040783.0, 4962.0, 418.0, 166.0, 93.0, 54.0, 27.0, 26.0, 12.0, 11.0, 12.0, 7.0, 3.0, 6.0, 5.0, 4.0, 1.0, 1.0, 4.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.048919677734375, -0.047654151916503906, -0.04638862609863281, -0.04512310028076172, -0.043857574462890625, -0.04259204864501953, -0.04132652282714844, -0.040060997009277344, -0.03879547119140625, -0.037529945373535156, -0.03626441955566406, -0.03499889373779297, -0.033733367919921875, -0.03246784210205078, -0.031202316284179688, -0.029936790466308594, -0.0286712646484375, -0.027405738830566406, -0.026140213012695312, -0.02487468719482422, -0.023609161376953125, -0.02234363555908203, -0.021078109741210938, -0.019812583923339844, -0.01854705810546875, -0.017281532287597656, -0.016016006469726562, -0.014750480651855469, -0.013484954833984375, -0.012219429016113281, -0.010953903198242188, -0.009688377380371094, -0.0084228515625, -0.007157325744628906, -0.0058917999267578125, -0.004626274108886719, -0.003360748291015625, -0.0020952224731445312, -0.0008296966552734375, 0.00043582916259765625, 0.00170135498046875, 0.0029668807983398438, 0.0042324066162109375, 0.005497932434082031, 0.006763458251953125, 0.008028984069824219, 0.009294509887695312, 0.010560035705566406, 0.0118255615234375, 0.013091087341308594, 0.014356613159179688, 0.015622138977050781, 0.016887664794921875, 0.01815319061279297, 0.019418716430664062, 0.020684242248535156, 0.02194976806640625, 0.023215293884277344, 0.024480819702148438, 0.02574634552001953, 0.027011871337890625, 0.02827739715576172, 0.029542922973632812, 0.030808448791503906, 0.032073974609375]}, "gradients/decoder.transformer.h.13.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 25.0, 519.0, 449.0, 21.0, 5.0, 2.0], "bins": [-0.007656607311218977, -0.007528883405029774, -0.0074011594988405704, -0.007273435592651367, -0.007145711686462164, -0.007017987780272961, -0.006890263874083757, -0.006762539967894554, -0.006634816061705351, -0.006507092155516148, -0.006379368249326944, -0.006251644343137741, -0.006123920436948538, -0.0059961965307593346, -0.005868472624570131, -0.005740748718380928, -0.0056130243465304375, -0.005485300440341234, -0.005357576534152031, -0.005229852627962828, -0.005102128721773624, -0.004974404815584421, -0.004846680909395218, -0.004718957003206015, -0.004591233097016811, -0.004463509190827608, -0.004335785284638405, -0.004208061378449202, -0.004080337472259998, -0.003952613566070795, -0.003824889659881592, -0.0036971657536923885, -0.0035694423131644726, -0.0034417184069752693, -0.003313994500786066, -0.003186270594596863, -0.0030585466884076595, -0.0029308227822184563, -0.002803098876029253, -0.0026753749698400497, -0.002547650830820203, -0.0024199269246309996, -0.0022922030184417963, -0.002164479112252593, -0.0020367552060633898, -0.0019090312998741865, -0.0017813072772696614, -0.0016535833710804582, -0.0015258595813065767, -0.0013981356751173735, -0.0012704117689281702, -0.001142687862738967, -0.0010149639565497637, -0.0008872399921528995, -0.0007595160277560353, -0.0006317921215668321, -0.0005040681571699679, -0.00037634425098076463, -0.0002486203156877309, -0.00012089638039469719, 6.827525794506073e-06, 0.00013455143198370934, 0.0002622753963805735, 0.0003899993025697768, 0.00051772320875898]}, "gradients/decoder.transformer.h.13.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 3.0, 0.0, 0.0, 1.0, 2.0, 3.0, 3.0, 6.0, 8.0, 3.0, 15.0, 15.0, 16.0, 13.0, 21.0, 23.0, 23.0, 30.0, 34.0, 43.0, 28.0, 41.0, 48.0, 37.0, 48.0, 44.0, 48.0, 54.0, 48.0, 58.0, 45.0, 34.0, 37.0, 31.0, 22.0, 25.0, 18.0, 14.0, 19.0, 12.0, 10.0, 13.0, 4.0, 8.0, 3.0, 2.0, 0.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.0006244182586669922, -0.0006045559421181679, -0.0005846936255693436, -0.0005648313090205193, -0.000544968992471695, -0.0005251066759228706, -0.0005052443593740463, -0.000485382042825222, -0.0004655197262763977, -0.0004456574097275734, -0.0004257950931787491, -0.0004059327766299248, -0.00038607046008110046, -0.00036620814353227615, -0.00034634582698345184, -0.00032648351043462753, -0.0003066211938858032, -0.0002867588773369789, -0.0002668965607881546, -0.0002470342442393303, -0.00022717192769050598, -0.00020730961114168167, -0.00018744729459285736, -0.00016758497804403305, -0.00014772266149520874, -0.00012786034494638443, -0.00010799802839756012, -8.813571184873581e-05, -6.82733952999115e-05, -4.841107875108719e-05, -2.854876220226288e-05, -8.686445653438568e-06, 1.1175870895385742e-05, 3.103818744421005e-05, 5.090050399303436e-05, 7.076282054185867e-05, 9.062513709068298e-05, 0.0001104874536395073, 0.0001303497701883316, 0.00015021208673715591, 0.00017007440328598022, 0.00018993671983480453, 0.00020979903638362885, 0.00022966135293245316, 0.00024952366948127747, 0.0002693859860301018, 0.0002892483025789261, 0.0003091106191277504, 0.0003289729356765747, 0.000348835252225399, 0.00036869756877422333, 0.00038855988532304764, 0.00040842220187187195, 0.00042828451842069626, 0.00044814683496952057, 0.0004680091515183449, 0.0004878714680671692, 0.0005077337846159935, 0.0005275961011648178, 0.0005474584177136421, 0.0005673207342624664, 0.0005871830508112907, 0.000607045367360115, 0.0006269076839089394, 0.0006467700004577637]}, "gradients/decoder.transformer.h.13.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 5.0, 3.0, 0.0, 7.0, 6.0, 9.0, 13.0, 9.0, 11.0, 22.0, 22.0, 24.0, 26.0, 33.0, 23.0, 38.0, 45.0, 53.0, 38.0, 43.0, 41.0, 42.0, 45.0, 53.0, 49.0, 41.0, 32.0, 40.0, 30.0, 36.0, 32.0, 31.0, 16.0, 12.0, 18.0, 15.0, 12.0, 12.0, 9.0, 4.0, 4.0, 4.0, 3.0, 3.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 0.0, 1.0], "bins": [-8.0234375, -7.78564453125, -7.5478515625, -7.31005859375, -7.072265625, -6.83447265625, -6.5966796875, -6.35888671875, -6.12109375, -5.88330078125, -5.6455078125, -5.40771484375, -5.169921875, -4.93212890625, -4.6943359375, -4.45654296875, -4.21875, -3.98095703125, -3.7431640625, -3.50537109375, -3.267578125, -3.02978515625, -2.7919921875, -2.55419921875, -2.31640625, -2.07861328125, -1.8408203125, -1.60302734375, -1.365234375, -1.12744140625, -0.8896484375, -0.65185546875, -0.4140625, -0.17626953125, 0.0615234375, 0.29931640625, 0.537109375, 0.77490234375, 1.0126953125, 1.25048828125, 1.48828125, 1.72607421875, 1.9638671875, 2.20166015625, 2.439453125, 2.67724609375, 2.9150390625, 3.15283203125, 3.390625, 3.62841796875, 3.8662109375, 4.10400390625, 4.341796875, 4.57958984375, 4.8173828125, 5.05517578125, 5.29296875, 5.53076171875, 5.7685546875, 6.00634765625, 6.244140625, 6.48193359375, 6.7197265625, 6.95751953125, 7.1953125]}, "gradients/decoder.transformer.h.13.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 3.0, 5.0, 2.0, 8.0, 6.0, 7.0, 4.0, 13.0, 12.0, 19.0, 31.0, 38.0, 53.0, 74.0, 111.0, 140.0, 232.0, 298.0, 467.0, 693.0, 1075.0, 1778.0, 3091.0, 5594.0, 11042.0, 24225.0, 59263.0, 162070.0, 391689.0, 235828.0, 85681.0, 33545.0, 14720.0, 7067.0, 3741.0, 2148.0, 1262.0, 773.0, 545.0, 343.0, 241.0, 179.0, 115.0, 88.0, 73.0, 53.0, 22.0, 23.0, 19.0, 16.0, 20.0, 9.0, 7.0, 3.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 2.0], "bins": [-4.5234375, -4.38067626953125, -4.2379150390625, -4.09515380859375, -3.952392578125, -3.80963134765625, -3.6668701171875, -3.52410888671875, -3.38134765625, -3.23858642578125, -3.0958251953125, -2.95306396484375, -2.810302734375, -2.66754150390625, -2.5247802734375, -2.38201904296875, -2.2392578125, -2.09649658203125, -1.9537353515625, -1.81097412109375, -1.668212890625, -1.52545166015625, -1.3826904296875, -1.23992919921875, -1.09716796875, -0.95440673828125, -0.8116455078125, -0.66888427734375, -0.526123046875, -0.38336181640625, -0.2406005859375, -0.09783935546875, 0.044921875, 0.18768310546875, 0.3304443359375, 0.47320556640625, 0.615966796875, 0.75872802734375, 0.9014892578125, 1.04425048828125, 1.18701171875, 1.32977294921875, 1.4725341796875, 1.61529541015625, 1.758056640625, 1.90081787109375, 2.0435791015625, 2.18634033203125, 2.3291015625, 2.47186279296875, 2.6146240234375, 2.75738525390625, 2.900146484375, 3.04290771484375, 3.1856689453125, 3.32843017578125, 3.47119140625, 3.61395263671875, 3.7567138671875, 3.89947509765625, 4.042236328125, 4.18499755859375, 4.3277587890625, 4.47052001953125, 4.61328125]}, "gradients/decoder.transformer.h.13.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 0.0, 2.0, 1.0, 5.0, 4.0, 5.0, 6.0, 3.0, 9.0, 15.0, 26.0, 16.0, 27.0, 29.0, 20.0, 32.0, 33.0, 42.0, 45.0, 43.0, 73.0, 84.0, 322.0, 1643.0, 118.0, 61.0, 63.0, 57.0, 44.0, 35.0, 39.0, 25.0, 25.0, 23.0, 12.0, 18.0, 13.0, 9.0, 6.0, 8.0, 6.0, 2.0, 3.0, 2.0, 4.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 0.0, 1.0], "bins": [-23.625, -22.889892578125, -22.15478515625, -21.419677734375, -20.6845703125, -19.949462890625, -19.21435546875, -18.479248046875, -17.744140625, -17.009033203125, -16.27392578125, -15.538818359375, -14.8037109375, -14.068603515625, -13.33349609375, -12.598388671875, -11.86328125, -11.128173828125, -10.39306640625, -9.657958984375, -8.9228515625, -8.187744140625, -7.45263671875, -6.717529296875, -5.982421875, -5.247314453125, -4.51220703125, -3.777099609375, -3.0419921875, -2.306884765625, -1.57177734375, -0.836669921875, -0.1015625, 0.633544921875, 1.36865234375, 2.103759765625, 2.8388671875, 3.573974609375, 4.30908203125, 5.044189453125, 5.779296875, 6.514404296875, 7.24951171875, 7.984619140625, 8.7197265625, 9.454833984375, 10.18994140625, 10.925048828125, 11.66015625, 12.395263671875, 13.13037109375, 13.865478515625, 14.6005859375, 15.335693359375, 16.07080078125, 16.805908203125, 17.541015625, 18.276123046875, 19.01123046875, 19.746337890625, 20.4814453125, 21.216552734375, 21.95166015625, 22.686767578125, 23.421875]}, "gradients/decoder.transformer.h.13.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 2.0, 3.0, 3.0, 3.0, 4.0, 8.0, 13.0, 11.0, 13.0, 29.0, 35.0, 57.0, 79.0, 126.0, 209.0, 337.0, 578.0, 2094.0, 3100183.0, 39846.0, 896.0, 396.0, 243.0, 175.0, 119.0, 83.0, 61.0, 34.0, 24.0, 18.0, 15.0, 4.0, 5.0, 5.0, 3.0, 1.0, 1.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-66.375, -63.984375, -61.59375, -59.203125, -56.8125, -54.421875, -52.03125, -49.640625, -47.25, -44.859375, -42.46875, -40.078125, -37.6875, -35.296875, -32.90625, -30.515625, -28.125, -25.734375, -23.34375, -20.953125, -18.5625, -16.171875, -13.78125, -11.390625, -9.0, -6.609375, -4.21875, -1.828125, 0.5625, 2.953125, 5.34375, 7.734375, 10.125, 12.515625, 14.90625, 17.296875, 19.6875, 22.078125, 24.46875, 26.859375, 29.25, 31.640625, 34.03125, 36.421875, 38.8125, 41.203125, 43.59375, 45.984375, 48.375, 50.765625, 53.15625, 55.546875, 57.9375, 60.328125, 62.71875, 65.109375, 67.5, 69.890625, 72.28125, 74.671875, 77.0625, 79.453125, 81.84375, 84.234375, 86.625]}, "gradients/decoder.transformer.h.13.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 17.0, 917.0, 84.0, 2.0, 1.0], "bins": [-407.33349609375, -400.61187744140625, -393.8902282714844, -387.1686096191406, -380.4469909667969, -373.7253723144531, -367.00372314453125, -360.2821044921875, -353.56048583984375, -346.8388671875, -340.1172180175781, -333.3955993652344, -326.6739807128906, -319.9523620605469, -313.230712890625, -306.50909423828125, -299.7874755859375, -293.06585693359375, -286.3442077636719, -279.6225891113281, -272.9009704589844, -266.1793518066406, -259.45770263671875, -252.736083984375, -246.0144500732422, -239.29281616210938, -232.57119750976562, -225.8495635986328, -219.12794494628906, -212.40631103515625, -205.6846923828125, -198.9630584716797, -192.24142456054688, -185.51979064941406, -178.7981719970703, -172.0765380859375, -165.35491943359375, -158.63328552246094, -151.9116668701172, -145.19003295898438, -138.46841430664062, -131.7467803955078, -125.02516174316406, -118.30353546142578, -111.5819091796875, -104.86027526855469, -98.13865661621094, -91.41702270507812, -84.69540405273438, -77.9737777709961, -71.25215148925781, -64.53052520751953, -57.80889892578125, -51.0872688293457, -44.36564254760742, -37.64401626586914, -30.922388076782227, -24.200761795043945, -17.47913360595703, -10.75750732421875, -4.035881042480469, 2.6857471466064453, 9.407373428344727, 16.128999710083008, 22.85062599182129]}, "gradients/decoder.transformer.h.13.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 4.0, 1.0, 4.0, 3.0, 12.0, 10.0, 9.0, 6.0, 16.0, 13.0, 25.0, 23.0, 39.0, 28.0, 31.0, 39.0, 47.0, 50.0, 43.0, 51.0, 61.0, 54.0, 52.0, 36.0, 43.0, 47.0, 46.0, 42.0, 29.0, 36.0, 19.0, 22.0, 14.0, 15.0, 4.0, 10.0, 5.0, 5.0, 5.0, 3.0, 3.0, 3.0, 3.0, 1.0, 3.0, 1.0, 0.0, 2.0, 0.0, 0.0, 2.0], "bins": [-74.61872863769531, -72.40953063964844, -70.20033264160156, -67.99113464355469, -65.78193664550781, -63.57273864746094, -61.36354064941406, -59.15434265136719, -56.94514465332031, -54.73594665527344, -52.52674865722656, -50.31755065917969, -48.10835266113281, -45.89915466308594, -43.68995666503906, -41.48075866699219, -39.27156066894531, -37.06236267089844, -34.85316467285156, -32.64396667480469, -30.434768676757812, -28.225570678710938, -26.016372680664062, -23.807174682617188, -21.597976684570312, -19.388778686523438, -17.179580688476562, -14.970382690429688, -12.761184692382812, -10.551986694335938, -8.342788696289062, -6.1335906982421875, -3.9243850708007812, -1.7151870727539062, 0.49401092529296875, 2.7032089233398438, 4.912406921386719, 7.121604919433594, 9.330802917480469, 11.540000915527344, 13.749198913574219, 15.958396911621094, 18.16759490966797, 20.376792907714844, 22.58599090576172, 24.795188903808594, 27.00438690185547, 29.213584899902344, 31.42278289794922, 33.631980895996094, 35.84117889404297, 38.050376892089844, 40.25957489013672, 42.468772888183594, 44.67797088623047, 46.887168884277344, 49.09636688232422, 51.305564880371094, 53.51476287841797, 55.723960876464844, 57.93315887451172, 60.142356872558594, 62.35155487060547, 64.56075286865234, 66.76995086669922]}, "gradients/decoder.transformer.h.12.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 3.0, 5.0, 3.0, 5.0, 5.0, 9.0, 16.0, 11.0, 11.0, 21.0, 20.0, 25.0, 22.0, 27.0, 31.0, 31.0, 48.0, 42.0, 42.0, 37.0, 41.0, 43.0, 55.0, 39.0, 43.0, 48.0, 30.0, 40.0, 35.0, 32.0, 29.0, 30.0, 30.0, 20.0, 14.0, 12.0, 11.0, 16.0, 7.0, 9.0, 3.0, 5.0, 1.0, 3.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-8.1484375, -7.90924072265625, -7.6700439453125, -7.43084716796875, -7.191650390625, -6.95245361328125, -6.7132568359375, -6.47406005859375, -6.23486328125, -5.99566650390625, -5.7564697265625, -5.51727294921875, -5.278076171875, -5.03887939453125, -4.7996826171875, -4.56048583984375, -4.3212890625, -4.08209228515625, -3.8428955078125, -3.60369873046875, -3.364501953125, -3.12530517578125, -2.8861083984375, -2.64691162109375, -2.40771484375, -2.16851806640625, -1.9293212890625, -1.69012451171875, -1.450927734375, -1.21173095703125, -0.9725341796875, -0.73333740234375, -0.494140625, -0.25494384765625, -0.0157470703125, 0.22344970703125, 0.462646484375, 0.70184326171875, 0.9410400390625, 1.18023681640625, 1.41943359375, 1.65863037109375, 1.8978271484375, 2.13702392578125, 2.376220703125, 2.61541748046875, 2.8546142578125, 3.09381103515625, 3.3330078125, 3.57220458984375, 3.8114013671875, 4.05059814453125, 4.289794921875, 4.52899169921875, 4.7681884765625, 5.00738525390625, 5.24658203125, 5.48577880859375, 5.7249755859375, 5.96417236328125, 6.203369140625, 6.44256591796875, 6.6817626953125, 6.92095947265625, 7.16015625]}, "gradients/decoder.transformer.h.12.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 4.0, 1.0, 2.0, 7.0, 7.0, 11.0, 11.0, 8.0, 18.0, 29.0, 18.0, 28.0, 26.0, 42.0, 61.0, 104.0, 157.0, 344.0, 1105.0, 5928.0, 103319.0, 2431147.0, 1595914.0, 50734.0, 3811.0, 725.0, 274.0, 141.0, 63.0, 56.0, 37.0, 32.0, 26.0, 21.0, 18.0, 16.0, 12.0, 8.0, 6.0, 8.0, 5.0, 1.0, 4.0, 2.0, 2.0, 2.0, 1.0, 2.0, 1.0], "bins": [-23.375, -22.741455078125, -22.10791015625, -21.474365234375, -20.8408203125, -20.207275390625, -19.57373046875, -18.940185546875, -18.306640625, -17.673095703125, -17.03955078125, -16.406005859375, -15.7724609375, -15.138916015625, -14.50537109375, -13.871826171875, -13.23828125, -12.604736328125, -11.97119140625, -11.337646484375, -10.7041015625, -10.070556640625, -9.43701171875, -8.803466796875, -8.169921875, -7.536376953125, -6.90283203125, -6.269287109375, -5.6357421875, -5.002197265625, -4.36865234375, -3.735107421875, -3.1015625, -2.468017578125, -1.83447265625, -1.200927734375, -0.5673828125, 0.066162109375, 0.69970703125, 1.333251953125, 1.966796875, 2.600341796875, 3.23388671875, 3.867431640625, 4.5009765625, 5.134521484375, 5.76806640625, 6.401611328125, 7.03515625, 7.668701171875, 8.30224609375, 8.935791015625, 9.5693359375, 10.202880859375, 10.83642578125, 11.469970703125, 12.103515625, 12.737060546875, 13.37060546875, 14.004150390625, 14.6376953125, 15.271240234375, 15.90478515625, 16.538330078125, 17.171875]}, "gradients/decoder.transformer.h.12.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 2.0, 3.0, 4.0, 4.0, 0.0, 6.0, 10.0, 10.0, 12.0, 14.0, 12.0, 15.0, 35.0, 32.0, 61.0, 42.0, 74.0, 90.0, 118.0, 163.0, 197.0, 249.0, 311.0, 392.0, 409.0, 399.0, 330.0, 232.0, 209.0, 145.0, 106.0, 93.0, 74.0, 53.0, 37.0, 36.0, 19.0, 15.0, 17.0, 14.0, 9.0, 9.0, 9.0, 4.0, 4.0, 4.0, 4.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-9.953125, -9.5908203125, -9.228515625, -8.8662109375, -8.50390625, -8.1416015625, -7.779296875, -7.4169921875, -7.0546875, -6.6923828125, -6.330078125, -5.9677734375, -5.60546875, -5.2431640625, -4.880859375, -4.5185546875, -4.15625, -3.7939453125, -3.431640625, -3.0693359375, -2.70703125, -2.3447265625, -1.982421875, -1.6201171875, -1.2578125, -0.8955078125, -0.533203125, -0.1708984375, 0.19140625, 0.5537109375, 0.916015625, 1.2783203125, 1.640625, 2.0029296875, 2.365234375, 2.7275390625, 3.08984375, 3.4521484375, 3.814453125, 4.1767578125, 4.5390625, 4.9013671875, 5.263671875, 5.6259765625, 5.98828125, 6.3505859375, 6.712890625, 7.0751953125, 7.4375, 7.7998046875, 8.162109375, 8.5244140625, 8.88671875, 9.2490234375, 9.611328125, 9.9736328125, 10.3359375, 10.6982421875, 11.060546875, 11.4228515625, 11.78515625, 12.1474609375, 12.509765625, 12.8720703125, 13.234375]}, "gradients/decoder.transformer.h.12.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 3.0, 6.0, 8.0, 4.0, 5.0, 8.0, 14.0, 17.0, 22.0, 26.0, 35.0, 43.0, 61.0, 47.0, 87.0, 122.0, 143.0, 208.0, 313.0, 618.0, 2145.0, 3836362.0, 351291.0, 1127.0, 508.0, 279.0, 195.0, 151.0, 97.0, 77.0, 75.0, 45.0, 31.0, 26.0, 23.0, 11.0, 10.0, 17.0, 10.0, 7.0, 5.0, 5.0, 2.0, 2.0, 2.0, 1.0, 2.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-77.375, -74.783203125, -72.19140625, -69.599609375, -67.0078125, -64.416015625, -61.82421875, -59.232421875, -56.640625, -54.048828125, -51.45703125, -48.865234375, -46.2734375, -43.681640625, -41.08984375, -38.498046875, -35.90625, -33.314453125, -30.72265625, -28.130859375, -25.5390625, -22.947265625, -20.35546875, -17.763671875, -15.171875, -12.580078125, -9.98828125, -7.396484375, -4.8046875, -2.212890625, 0.37890625, 2.970703125, 5.5625, 8.154296875, 10.74609375, 13.337890625, 15.9296875, 18.521484375, 21.11328125, 23.705078125, 26.296875, 28.888671875, 31.48046875, 34.072265625, 36.6640625, 39.255859375, 41.84765625, 44.439453125, 47.03125, 49.623046875, 52.21484375, 54.806640625, 57.3984375, 59.990234375, 62.58203125, 65.173828125, 67.765625, 70.357421875, 72.94921875, 75.541015625, 78.1328125, 80.724609375, 83.31640625, 85.908203125, 88.5]}, "gradients/decoder.transformer.h.12.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 4.0, 8.0, 15.0, 17.0, 37.0, 49.0, 76.0, 128.0, 118.0, 149.0, 117.0, 115.0, 77.0, 47.0, 33.0, 14.0, 7.0, 1.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-63.00080871582031, -61.27339172363281, -59.54597473144531, -57.81855773925781, -56.09113693237305, -54.36371994018555, -52.63630294799805, -50.90888595581055, -49.18146514892578, -47.45404815673828, -45.72663116455078, -43.99921417236328, -42.271793365478516, -40.544376373291016, -38.816959381103516, -37.089542388916016, -35.362125396728516, -33.634708404541016, -31.907289505004883, -30.179872512817383, -28.45245361328125, -26.72503662109375, -24.99761962890625, -23.27020263671875, -21.542783737182617, -19.815366744995117, -18.087947845458984, -16.360530853271484, -14.633112907409668, -12.905694961547852, -11.178277969360352, -9.450860023498535, -7.723438262939453, -5.996020317077637, -4.2686028480529785, -2.5411853790283203, -0.8137674331665039, 0.9136505126953125, 2.6410675048828125, 4.368485450744629, 6.095903396606445, 7.823321342468262, 9.550739288330078, 11.278156280517578, 13.005574226379395, 14.732992172241211, 16.46040916442871, 18.187828063964844, 19.915245056152344, 21.642662048339844, 23.370080947875977, 25.097497940063477, 26.82491683959961, 28.55233383178711, 30.27975082397461, 32.00716781616211, 33.734588623046875, 35.462005615234375, 37.189422607421875, 38.916839599609375, 40.64426040649414, 42.37167739868164, 44.09909439086914, 45.82651138305664, 47.55392837524414]}, "gradients/decoder.transformer.h.12.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 1.0, 6.0, 5.0, 6.0, 3.0, 11.0, 11.0, 16.0, 27.0, 11.0, 18.0, 23.0, 22.0, 38.0, 33.0, 37.0, 49.0, 51.0, 51.0, 31.0, 49.0, 38.0, 44.0, 54.0, 48.0, 26.0, 30.0, 45.0, 27.0, 35.0, 32.0, 28.0, 15.0, 24.0, 18.0, 11.0, 13.0, 6.0, 4.0, 3.0, 5.0, 4.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-42.092201232910156, -40.60743713378906, -39.1226692199707, -37.63790512084961, -36.15313720703125, -34.668373107910156, -33.18360900878906, -31.698841094970703, -30.214075088500977, -28.72930908203125, -27.244543075561523, -25.759777069091797, -24.275012969970703, -22.790245056152344, -21.30548095703125, -19.820714950561523, -18.335948944091797, -16.85118293762207, -15.366416931152344, -13.881651878356934, -12.396885871887207, -10.91211986541748, -9.42735481262207, -7.942588806152344, -6.457822799682617, -4.973056793212891, -3.4882912635803223, -2.003525733947754, -0.5187597274780273, 0.9660062789916992, 2.4507713317871094, 3.935537338256836, 5.420307159423828, 6.905073165893555, 8.389839172363281, 9.874604225158691, 11.359370231628418, 12.844136238098145, 14.328901290893555, 15.813667297363281, 17.298433303833008, 18.783199310302734, 20.26796531677246, 21.752731323242188, 23.23749542236328, 24.72226333618164, 26.207027435302734, 27.69179344177246, 29.176559448242188, 30.661325454711914, 32.14609146118164, 33.630855560302734, 35.115623474121094, 36.60038757324219, 38.08515167236328, 39.56991958618164, 41.0546875, 42.539451599121094, 44.02421951293945, 45.50898361206055, 46.993751525878906, 48.478515625, 49.963279724121094, 51.44804763793945, 52.93281173706055]}, "gradients/decoder.transformer.h.12.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 5.0, 7.0, 3.0, 2.0, 5.0, 12.0, 19.0, 19.0, 27.0, 26.0, 22.0, 16.0, 34.0, 41.0, 41.0, 41.0, 41.0, 54.0, 50.0, 50.0, 48.0, 49.0, 48.0, 49.0, 51.0, 33.0, 26.0, 33.0, 38.0, 24.0, 24.0, 13.0, 12.0, 12.0, 6.0, 9.0, 4.0, 8.0, 3.0, 3.0, 3.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.484375, -8.2255859375, -7.966796875, -7.7080078125, -7.44921875, -7.1904296875, -6.931640625, -6.6728515625, -6.4140625, -6.1552734375, -5.896484375, -5.6376953125, -5.37890625, -5.1201171875, -4.861328125, -4.6025390625, -4.34375, -4.0849609375, -3.826171875, -3.5673828125, -3.30859375, -3.0498046875, -2.791015625, -2.5322265625, -2.2734375, -2.0146484375, -1.755859375, -1.4970703125, -1.23828125, -0.9794921875, -0.720703125, -0.4619140625, -0.203125, 0.0556640625, 0.314453125, 0.5732421875, 0.83203125, 1.0908203125, 1.349609375, 1.6083984375, 1.8671875, 2.1259765625, 2.384765625, 2.6435546875, 2.90234375, 3.1611328125, 3.419921875, 3.6787109375, 3.9375, 4.1962890625, 4.455078125, 4.7138671875, 4.97265625, 5.2314453125, 5.490234375, 5.7490234375, 6.0078125, 6.2666015625, 6.525390625, 6.7841796875, 7.04296875, 7.3017578125, 7.560546875, 7.8193359375, 8.078125]}, "gradients/decoder.transformer.h.12.crossattention.c_proj.weight": {"_type": "histogram", "values": [4.0, 3.0, 3.0, 2.0, 5.0, 6.0, 8.0, 13.0, 14.0, 26.0, 41.0, 54.0, 58.0, 115.0, 160.0, 223.0, 297.0, 462.0, 683.0, 1035.0, 1637.0, 2384.0, 3597.0, 5552.0, 8625.0, 13161.0, 20467.0, 32768.0, 54628.0, 97105.0, 222580.0, 304729.0, 114048.0, 62182.0, 36791.0, 23008.0, 14701.0, 9430.0, 6112.0, 3899.0, 2633.0, 1729.0, 1167.0, 731.0, 518.0, 369.0, 241.0, 178.0, 110.0, 95.0, 57.0, 43.0, 23.0, 10.0, 22.0, 11.0, 6.0, 6.0, 2.0, 5.0, 2.0, 0.0, 0.0, 2.0], "bins": [-1.8271484375, -1.7684326171875, -1.709716796875, -1.6510009765625, -1.59228515625, -1.5335693359375, -1.474853515625, -1.4161376953125, -1.357421875, -1.2987060546875, -1.239990234375, -1.1812744140625, -1.12255859375, -1.0638427734375, -1.005126953125, -0.9464111328125, -0.8876953125, -0.8289794921875, -0.770263671875, -0.7115478515625, -0.65283203125, -0.5941162109375, -0.535400390625, -0.4766845703125, -0.41796875, -0.3592529296875, -0.300537109375, -0.2418212890625, -0.18310546875, -0.1243896484375, -0.065673828125, -0.0069580078125, 0.0517578125, 0.1104736328125, 0.169189453125, 0.2279052734375, 0.28662109375, 0.3453369140625, 0.404052734375, 0.4627685546875, 0.521484375, 0.5802001953125, 0.638916015625, 0.6976318359375, 0.75634765625, 0.8150634765625, 0.873779296875, 0.9324951171875, 0.9912109375, 1.0499267578125, 1.108642578125, 1.1673583984375, 1.22607421875, 1.2847900390625, 1.343505859375, 1.4022216796875, 1.4609375, 1.5196533203125, 1.578369140625, 1.6370849609375, 1.69580078125, 1.7545166015625, 1.813232421875, 1.8719482421875, 1.9306640625]}, "gradients/decoder.transformer.h.12.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 2.0, 3.0, 5.0, 6.0, 8.0, 8.0, 9.0, 7.0, 19.0, 16.0, 12.0, 18.0, 26.0, 27.0, 22.0, 22.0, 36.0, 48.0, 32.0, 38.0, 35.0, 36.0, 40.0, 40.0, 1058.0, 49.0, 32.0, 27.0, 36.0, 39.0, 30.0, 29.0, 25.0, 32.0, 28.0, 22.0, 20.0, 15.0, 13.0, 14.0, 13.0, 12.0, 7.0, 5.0, 4.0, 2.0, 3.0, 2.0, 1.0, 3.0, 2.0, 2.0, 1.0], "bins": [-4.8828125, -4.7437744140625, -4.604736328125, -4.4656982421875, -4.32666015625, -4.1876220703125, -4.048583984375, -3.9095458984375, -3.7705078125, -3.6314697265625, -3.492431640625, -3.3533935546875, -3.21435546875, -3.0753173828125, -2.936279296875, -2.7972412109375, -2.658203125, -2.5191650390625, -2.380126953125, -2.2410888671875, -2.10205078125, -1.9630126953125, -1.823974609375, -1.6849365234375, -1.5458984375, -1.4068603515625, -1.267822265625, -1.1287841796875, -0.98974609375, -0.8507080078125, -0.711669921875, -0.5726318359375, -0.43359375, -0.2945556640625, -0.155517578125, -0.0164794921875, 0.12255859375, 0.2615966796875, 0.400634765625, 0.5396728515625, 0.6787109375, 0.8177490234375, 0.956787109375, 1.0958251953125, 1.23486328125, 1.3739013671875, 1.512939453125, 1.6519775390625, 1.791015625, 1.9300537109375, 2.069091796875, 2.2081298828125, 2.34716796875, 2.4862060546875, 2.625244140625, 2.7642822265625, 2.9033203125, 3.0423583984375, 3.181396484375, 3.3204345703125, 3.45947265625, 3.5985107421875, 3.737548828125, 3.8765869140625, 4.015625]}, "gradients/decoder.transformer.h.12.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 1.0, 4.0, 2.0, 2.0, 2.0, 4.0, 3.0, 6.0, 6.0, 15.0, 29.0, 31.0, 48.0, 50.0, 98.0, 164.0, 278.0, 482.0, 826.0, 1337.0, 2475.0, 4267.0, 8017.0, 14574.0, 28096.0, 55175.0, 113030.0, 1380484.0, 275568.0, 104247.0, 51255.0, 25885.0, 13709.0, 7466.0, 4019.0, 2313.0, 1306.0, 756.0, 399.0, 233.0, 194.0, 99.0, 59.0, 47.0, 20.0, 16.0, 18.0, 8.0, 8.0, 6.0, 4.0, 3.0, 2.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-2.203125, -2.12701416015625, -2.0509033203125, -1.97479248046875, -1.898681640625, -1.82257080078125, -1.7464599609375, -1.67034912109375, -1.59423828125, -1.51812744140625, -1.4420166015625, -1.36590576171875, -1.289794921875, -1.21368408203125, -1.1375732421875, -1.06146240234375, -0.9853515625, -0.90924072265625, -0.8331298828125, -0.75701904296875, -0.680908203125, -0.60479736328125, -0.5286865234375, -0.45257568359375, -0.37646484375, -0.30035400390625, -0.2242431640625, -0.14813232421875, -0.072021484375, 0.00408935546875, 0.0802001953125, 0.15631103515625, 0.232421875, 0.30853271484375, 0.3846435546875, 0.46075439453125, 0.536865234375, 0.61297607421875, 0.6890869140625, 0.76519775390625, 0.84130859375, 0.91741943359375, 0.9935302734375, 1.06964111328125, 1.145751953125, 1.22186279296875, 1.2979736328125, 1.37408447265625, 1.4501953125, 1.52630615234375, 1.6024169921875, 1.67852783203125, 1.754638671875, 1.83074951171875, 1.9068603515625, 1.98297119140625, 2.05908203125, 2.13519287109375, 2.2113037109375, 2.28741455078125, 2.363525390625, 2.43963623046875, 2.5157470703125, 2.59185791015625, 2.66796875]}, "gradients/decoder.transformer.h.12.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 4.0, 1.0, 5.0, 2.0, 7.0, 6.0, 6.0, 5.0, 8.0, 13.0, 8.0, 21.0, 23.0, 28.0, 37.0, 45.0, 79.0, 82.0, 123.0, 102.0, 77.0, 60.0, 59.0, 36.0, 27.0, 24.0, 24.0, 20.0, 19.0, 6.0, 5.0, 9.0, 7.0, 7.0, 9.0, 6.0, 4.0, 1.0, 2.0, 2.0, 1.0, 1.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0017757415771484375, -0.001722082495689392, -0.0016684234142303467, -0.0016147643327713013, -0.0015611052513122559, -0.0015074461698532104, -0.001453787088394165, -0.0014001280069351196, -0.0013464689254760742, -0.0012928098440170288, -0.0012391507625579834, -0.001185491681098938, -0.0011318325996398926, -0.0010781735181808472, -0.0010245144367218018, -0.0009708553552627563, -0.0009171962738037109, -0.0008635371923446655, -0.0008098781108856201, -0.0007562190294265747, -0.0007025599479675293, -0.0006489008665084839, -0.0005952417850494385, -0.0005415827035903931, -0.00048792362213134766, -0.00043426454067230225, -0.00038060545921325684, -0.0003269463777542114, -0.000273287296295166, -0.0002196282148361206, -0.0001659691333770752, -0.00011231005191802979, -5.8650970458984375e-05, -4.991888999938965e-06, 4.8667192459106445e-05, 0.00010232627391815186, 0.00015598535537719727, 0.00020964443683624268, 0.0002633035182952881, 0.0003169625997543335, 0.0003706216812133789, 0.0004242807626724243, 0.0004779398441314697, 0.0005315989255905151, 0.0005852580070495605, 0.000638917088508606, 0.0006925761699676514, 0.0007462352514266968, 0.0007998943328857422, 0.0008535534143447876, 0.000907212495803833, 0.0009608715772628784, 0.0010145306587219238, 0.0010681897401809692, 0.0011218488216400146, 0.00117550790309906, 0.0012291669845581055, 0.0012828260660171509, 0.0013364851474761963, 0.0013901442289352417, 0.0014438033103942871, 0.0014974623918533325, 0.001551121473312378, 0.0016047805547714233, 0.0016584396362304688]}, "gradients/decoder.transformer.h.12.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 0.0, 1.0, 4.0, 0.0, 3.0, 5.0, 6.0, 11.0, 8.0, 6.0, 9.0, 8.0, 11.0, 22.0, 28.0, 37.0, 45.0, 62.0, 95.0, 214.0, 753.0, 783790.0, 262269.0, 671.0, 197.0, 90.0, 61.0, 37.0, 27.0, 22.0, 13.0, 14.0, 6.0, 9.0, 5.0, 6.0, 4.0, 5.0, 5.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.043914794921875, -0.042496681213378906, -0.04107856750488281, -0.03966045379638672, -0.038242340087890625, -0.03682422637939453, -0.03540611267089844, -0.033987998962402344, -0.03256988525390625, -0.031151771545410156, -0.029733657836914062, -0.02831554412841797, -0.026897430419921875, -0.02547931671142578, -0.024061203002929688, -0.022643089294433594, -0.0212249755859375, -0.019806861877441406, -0.018388748168945312, -0.01697063446044922, -0.015552520751953125, -0.014134407043457031, -0.012716293334960938, -0.011298179626464844, -0.00988006591796875, -0.008461952209472656, -0.0070438385009765625, -0.005625724792480469, -0.004207611083984375, -0.0027894973754882812, -0.0013713836669921875, 4.673004150390625e-05, 0.00146484375, 0.0028829574584960938, 0.0043010711669921875, 0.005719184875488281, 0.007137298583984375, 0.008555412292480469, 0.009973526000976562, 0.011391639709472656, 0.01280975341796875, 0.014227867126464844, 0.015645980834960938, 0.01706409454345703, 0.018482208251953125, 0.01990032196044922, 0.021318435668945312, 0.022736549377441406, 0.0241546630859375, 0.025572776794433594, 0.026990890502929688, 0.02840900421142578, 0.029827117919921875, 0.03124523162841797, 0.03266334533691406, 0.034081459045410156, 0.03549957275390625, 0.036917686462402344, 0.03833580017089844, 0.03975391387939453, 0.041172027587890625, 0.04259014129638672, 0.04400825500488281, 0.045426368713378906, 0.046844482421875]}, "gradients/decoder.transformer.h.12.ln_cross_attn.weight": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 1.0, 23.0, 228.0, 576.0, 169.0, 15.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0006457806448452175, -0.0005444050184451044, -0.00044302939204499125, -0.0003416537947487086, -0.0002402781683485955, -0.00013890257105231285, -3.7526944652199745e-05, 6.384868174791336e-05, 0.00016522430814802647, 0.00026659993454813957, 0.0003679755609482527, 0.0004693511582445353, 0.0005707268137484789, 0.0006721023819409311, 0.0007734780083410442, 0.0008748536347411573, 0.0009762292611412704, 0.0010776048293337226, 0.0011789804557338357, 0.0012803560821339488, 0.001381731708534062, 0.001483107334934175, 0.0015844829613342881, 0.0016858585877344012, 0.0017872342141345143, 0.0018886098405346274, 0.0019899853505194187, 0.002091360976919532, 0.002192736603319645, 0.002294112229719758, 0.002395487856119871, 0.0024968634825199842, 0.002598239341750741, 0.002699614968150854, 0.002800990594550967, 0.0029023662209510803, 0.0030037418473511934, 0.0031051174737513065, 0.0032064931001514196, 0.0033078687265515327, 0.003409244352951646, 0.003510619979351759, 0.003611995605751872, 0.003713371232151985, 0.0038147468585520983, 0.003916122484952211, 0.004017497878521681, 0.004118873737752438, 0.004220249131321907, 0.0043216245248913765, 0.004423000384122133, 0.004524375777691603, 0.0046257516369223595, 0.004727127030491829, 0.004828502889722586, 0.004929878283292055, 0.005031254142522812, 0.005132629536092281, 0.005234005395323038, 0.0053353807888925076, 0.005436756648123264, 0.005538132041692734, 0.0056395079009234905, 0.00574088329449296, 0.005842259153723717]}, "gradients/decoder.transformer.h.12.ln_cross_attn.bias": {"_type": "histogram", "values": [3.0, 1.0, 1.0, 2.0, 1.0, 0.0, 2.0, 2.0, 3.0, 3.0, 1.0, 6.0, 6.0, 10.0, 11.0, 9.0, 12.0, 10.0, 15.0, 14.0, 22.0, 23.0, 21.0, 28.0, 21.0, 27.0, 36.0, 36.0, 30.0, 31.0, 42.0, 33.0, 39.0, 33.0, 40.0, 31.0, 39.0, 35.0, 36.0, 34.0, 23.0, 35.0, 28.0, 18.0, 14.0, 21.0, 17.0, 24.0, 17.0, 17.0, 6.0, 13.0, 7.0, 11.0, 3.0, 3.0, 6.0, 3.0, 1.0, 4.0, 1.0, 0.0, 2.0, 1.0], "bins": [-0.0006854534149169922, -0.0006651263684034348, -0.0006447993218898773, -0.0006244722753763199, -0.0006041452288627625, -0.000583818182349205, -0.0005634911358356476, -0.0005431640893220901, -0.0005228370428085327, -0.0005025099962949753, -0.00048218294978141785, -0.0004618559032678604, -0.000441528856754303, -0.00042120181024074554, -0.0004008747637271881, -0.0003805477172136307, -0.00036022067070007324, -0.0003398936241865158, -0.0003195665776729584, -0.00029923953115940094, -0.0002789124846458435, -0.00025858543813228607, -0.00023825839161872864, -0.0002179313451051712, -0.00019760429859161377, -0.00017727725207805634, -0.0001569502055644989, -0.00013662315905094147, -0.00011629611253738403, -9.59690660238266e-05, -7.564201951026917e-05, -5.531497299671173e-05, -3.49879264831543e-05, -1.4660879969596863e-05, 5.666166543960571e-06, 2.5993213057518005e-05, 4.632025957107544e-05, 6.664730608463287e-05, 8.697435259819031e-05, 0.00010730139911174774, 0.00012762844562530518, 0.0001479554921388626, 0.00016828253865242004, 0.00018860958516597748, 0.0002089366316795349, 0.00022926367819309235, 0.0002495907247066498, 0.0002699177712202072, 0.00029024481773376465, 0.0003105718642473221, 0.0003308989107608795, 0.00035122595727443695, 0.0003715530037879944, 0.0003918800503015518, 0.00041220709681510925, 0.0004325341433286667, 0.0004528611898422241, 0.00047318823635578156, 0.000493515282869339, 0.0005138423293828964, 0.0005341693758964539, 0.0005544964224100113, 0.0005748234689235687, 0.0005951505154371262, 0.0006154775619506836]}, "gradients/decoder.transformer.h.12.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 5.0, 7.0, 3.0, 2.0, 5.0, 12.0, 19.0, 19.0, 27.0, 26.0, 22.0, 16.0, 34.0, 41.0, 41.0, 41.0, 41.0, 54.0, 50.0, 50.0, 48.0, 49.0, 48.0, 49.0, 51.0, 33.0, 26.0, 33.0, 38.0, 24.0, 24.0, 13.0, 12.0, 12.0, 6.0, 9.0, 4.0, 8.0, 3.0, 3.0, 3.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.484375, -8.2255859375, -7.966796875, -7.7080078125, -7.44921875, -7.1904296875, -6.931640625, -6.6728515625, -6.4140625, -6.1552734375, -5.896484375, -5.6376953125, -5.37890625, -5.1201171875, -4.861328125, -4.6025390625, -4.34375, -4.0849609375, -3.826171875, -3.5673828125, -3.30859375, -3.0498046875, -2.791015625, -2.5322265625, -2.2734375, -2.0146484375, -1.755859375, -1.4970703125, -1.23828125, -0.9794921875, -0.720703125, -0.4619140625, -0.203125, 0.0556640625, 0.314453125, 0.5732421875, 0.83203125, 1.0908203125, 1.349609375, 1.6083984375, 1.8671875, 2.1259765625, 2.384765625, 2.6435546875, 2.90234375, 3.1611328125, 3.419921875, 3.6787109375, 3.9375, 4.1962890625, 4.455078125, 4.7138671875, 4.97265625, 5.2314453125, 5.490234375, 5.7490234375, 6.0078125, 6.2666015625, 6.525390625, 6.7841796875, 7.04296875, 7.3017578125, 7.560546875, 7.8193359375, 8.078125]}, "gradients/decoder.transformer.h.12.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 3.0, 3.0, 5.0, 8.0, 3.0, 4.0, 6.0, 18.0, 27.0, 27.0, 52.0, 62.0, 70.0, 112.0, 212.0, 349.0, 649.0, 1670.0, 4666.0, 15977.0, 83788.0, 600229.0, 283410.0, 41970.0, 9757.0, 2949.0, 1238.0, 526.0, 264.0, 159.0, 127.0, 61.0, 41.0, 32.0, 20.0, 17.0, 16.0, 10.0, 5.0, 12.0, 4.0, 3.0, 3.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.25, -7.99853515625, -7.7470703125, -7.49560546875, -7.244140625, -6.99267578125, -6.7412109375, -6.48974609375, -6.23828125, -5.98681640625, -5.7353515625, -5.48388671875, -5.232421875, -4.98095703125, -4.7294921875, -4.47802734375, -4.2265625, -3.97509765625, -3.7236328125, -3.47216796875, -3.220703125, -2.96923828125, -2.7177734375, -2.46630859375, -2.21484375, -1.96337890625, -1.7119140625, -1.46044921875, -1.208984375, -0.95751953125, -0.7060546875, -0.45458984375, -0.203125, 0.04833984375, 0.2998046875, 0.55126953125, 0.802734375, 1.05419921875, 1.3056640625, 1.55712890625, 1.80859375, 2.06005859375, 2.3115234375, 2.56298828125, 2.814453125, 3.06591796875, 3.3173828125, 3.56884765625, 3.8203125, 4.07177734375, 4.3232421875, 4.57470703125, 4.826171875, 5.07763671875, 5.3291015625, 5.58056640625, 5.83203125, 6.08349609375, 6.3349609375, 6.58642578125, 6.837890625, 7.08935546875, 7.3408203125, 7.59228515625, 7.84375]}, "gradients/decoder.transformer.h.12.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 3.0, 3.0, 0.0, 1.0, 1.0, 2.0, 6.0, 4.0, 9.0, 14.0, 11.0, 11.0, 19.0, 26.0, 27.0, 16.0, 30.0, 33.0, 30.0, 50.0, 41.0, 56.0, 215.0, 1828.0, 143.0, 53.0, 61.0, 53.0, 43.0, 36.0, 36.0, 29.0, 36.0, 23.0, 16.0, 23.0, 13.0, 9.0, 14.0, 11.0, 9.0, 5.0, 3.0, 2.0, 3.0, 1.0, 3.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-24.15625, -23.38671875, -22.6171875, -21.84765625, -21.078125, -20.30859375, -19.5390625, -18.76953125, -18.0, -17.23046875, -16.4609375, -15.69140625, -14.921875, -14.15234375, -13.3828125, -12.61328125, -11.84375, -11.07421875, -10.3046875, -9.53515625, -8.765625, -7.99609375, -7.2265625, -6.45703125, -5.6875, -4.91796875, -4.1484375, -3.37890625, -2.609375, -1.83984375, -1.0703125, -0.30078125, 0.46875, 1.23828125, 2.0078125, 2.77734375, 3.546875, 4.31640625, 5.0859375, 5.85546875, 6.625, 7.39453125, 8.1640625, 8.93359375, 9.703125, 10.47265625, 11.2421875, 12.01171875, 12.78125, 13.55078125, 14.3203125, 15.08984375, 15.859375, 16.62890625, 17.3984375, 18.16796875, 18.9375, 19.70703125, 20.4765625, 21.24609375, 22.015625, 22.78515625, 23.5546875, 24.32421875, 25.09375]}, "gradients/decoder.transformer.h.12.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 5.0, 4.0, 5.0, 5.0, 5.0, 9.0, 8.0, 19.0, 13.0, 19.0, 23.0, 30.0, 37.0, 65.0, 66.0, 104.0, 143.0, 214.0, 308.0, 547.0, 2195.0, 2601392.0, 537355.0, 1796.0, 480.0, 279.0, 167.0, 129.0, 66.0, 58.0, 43.0, 26.0, 24.0, 16.0, 18.0, 12.0, 7.0, 9.0, 3.0, 2.0, 1.0, 1.0, 6.0, 1.0, 1.0, 1.0, 3.0, 0.0, 2.0, 0.0, 1.0], "bins": [-64.6875, -62.8369140625, -60.986328125, -59.1357421875, -57.28515625, -55.4345703125, -53.583984375, -51.7333984375, -49.8828125, -48.0322265625, -46.181640625, -44.3310546875, -42.48046875, -40.6298828125, -38.779296875, -36.9287109375, -35.078125, -33.2275390625, -31.376953125, -29.5263671875, -27.67578125, -25.8251953125, -23.974609375, -22.1240234375, -20.2734375, -18.4228515625, -16.572265625, -14.7216796875, -12.87109375, -11.0205078125, -9.169921875, -7.3193359375, -5.46875, -3.6181640625, -1.767578125, 0.0830078125, 1.93359375, 3.7841796875, 5.634765625, 7.4853515625, 9.3359375, 11.1865234375, 13.037109375, 14.8876953125, 16.73828125, 18.5888671875, 20.439453125, 22.2900390625, 24.140625, 25.9912109375, 27.841796875, 29.6923828125, 31.54296875, 33.3935546875, 35.244140625, 37.0947265625, 38.9453125, 40.7958984375, 42.646484375, 44.4970703125, 46.34765625, 48.1982421875, 50.048828125, 51.8994140625, 53.75]}, "gradients/decoder.transformer.h.12.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 4.0, 154.0, 691.0, 166.0, 4.0, 0.0, 0.0, 3.0], "bins": [-175.4291534423828, -172.42959594726562, -169.43002319335938, -166.4304656982422, -163.430908203125, -160.4313507080078, -157.43179321289062, -154.43222045898438, -151.4326629638672, -148.43310546875, -145.43353271484375, -142.43397521972656, -139.43441772460938, -136.4348602294922, -133.435302734375, -130.43572998046875, -127.43617248535156, -124.43661499023438, -121.43704986572266, -118.43748474121094, -115.43792724609375, -112.43836975097656, -109.43880462646484, -106.43923950195312, -103.43968200683594, -100.44012451171875, -97.44055938720703, -94.44099426269531, -91.44143676757812, -88.44187927246094, -85.44231414794922, -82.4427490234375, -79.44319152832031, -76.44363403320312, -73.4440689086914, -70.44450378417969, -67.4449462890625, -64.44538879394531, -61.445823669433594, -58.44626235961914, -55.44670486450195, -52.4471435546875, -49.44758224487305, -46.448020935058594, -43.44845962524414, -40.44889831542969, -37.449337005615234, -34.44977569580078, -31.450214385986328, -28.450653076171875, -25.451091766357422, -22.45153045654297, -19.451969146728516, -16.452407836914062, -13.45284652709961, -10.453285217285156, -7.453723907470703, -4.45416259765625, -1.4546012878417969, 1.5449600219726562, 4.544521331787109, 7.5440826416015625, 10.543643951416016, 13.543205261230469, 16.542766571044922]}, "gradients/decoder.transformer.h.12.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 4.0, 4.0, 2.0, 4.0, 4.0, 7.0, 8.0, 11.0, 7.0, 13.0, 15.0, 19.0, 16.0, 22.0, 23.0, 24.0, 29.0, 28.0, 35.0, 33.0, 44.0, 40.0, 42.0, 39.0, 48.0, 38.0, 44.0, 40.0, 37.0, 38.0, 34.0, 51.0, 20.0, 28.0, 17.0, 19.0, 20.0, 16.0, 16.0, 15.0, 10.0, 11.0, 10.0, 5.0, 7.0, 6.0, 3.0, 6.0, 2.0, 2.0, 2.0, 1.0, 0.0, 1.0, 1.0, 1.0], "bins": [-54.37822341918945, -52.69472122192383, -51.01121520996094, -49.32771301269531, -47.64421081542969, -45.96070861816406, -44.27720642089844, -42.59370040893555, -40.91019821166992, -39.2266960144043, -37.543190002441406, -35.85968780517578, -34.176185607910156, -32.49268341064453, -30.809179306030273, -29.125675201416016, -27.44217300415039, -25.758670806884766, -24.075166702270508, -22.39166259765625, -20.708160400390625, -19.024658203125, -17.341154098510742, -15.6576509475708, -13.97414779663086, -12.290644645690918, -10.607141494750977, -8.923638343811035, -7.240135192871094, -5.556632041931152, -3.873128890991211, -2.1896257400512695, -0.5061264038085938, 1.1773767471313477, 2.860879898071289, 4.5443830490112305, 6.227886199951172, 7.911389350891113, 9.594892501831055, 11.278395652770996, 12.961898803710938, 14.645401954650879, 16.32890510559082, 18.012409210205078, 19.695911407470703, 21.379413604736328, 23.062917709350586, 24.746421813964844, 26.42992401123047, 28.113426208496094, 29.79693031311035, 31.48043441772461, 33.163936614990234, 34.84743881225586, 36.53094482421875, 38.214447021484375, 39.89794921875, 41.581451416015625, 43.26495361328125, 44.94845962524414, 46.631961822509766, 48.31546401977539, 49.99897003173828, 51.682472229003906, 53.36597442626953]}, "gradients/decoder.transformer.h.11.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 3.0, 0.0, 5.0, 2.0, 7.0, 4.0, 8.0, 9.0, 13.0, 21.0, 13.0, 36.0, 27.0, 18.0, 32.0, 27.0, 32.0, 46.0, 52.0, 45.0, 43.0, 59.0, 43.0, 41.0, 48.0, 52.0, 51.0, 38.0, 34.0, 28.0, 26.0, 35.0, 27.0, 15.0, 13.0, 17.0, 9.0, 7.0, 5.0, 10.0, 4.0, 4.0, 3.0, 2.0, 0.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.3671875, -8.1077880859375, -7.848388671875, -7.5889892578125, -7.32958984375, -7.0701904296875, -6.810791015625, -6.5513916015625, -6.2919921875, -6.0325927734375, -5.773193359375, -5.5137939453125, -5.25439453125, -4.9949951171875, -4.735595703125, -4.4761962890625, -4.216796875, -3.9573974609375, -3.697998046875, -3.4385986328125, -3.17919921875, -2.9197998046875, -2.660400390625, -2.4010009765625, -2.1416015625, -1.8822021484375, -1.622802734375, -1.3634033203125, -1.10400390625, -0.8446044921875, -0.585205078125, -0.3258056640625, -0.06640625, 0.1929931640625, 0.452392578125, 0.7117919921875, 0.97119140625, 1.2305908203125, 1.489990234375, 1.7493896484375, 2.0087890625, 2.2681884765625, 2.527587890625, 2.7869873046875, 3.04638671875, 3.3057861328125, 3.565185546875, 3.8245849609375, 4.083984375, 4.3433837890625, 4.602783203125, 4.8621826171875, 5.12158203125, 5.3809814453125, 5.640380859375, 5.8997802734375, 6.1591796875, 6.4185791015625, 6.677978515625, 6.9373779296875, 7.19677734375, 7.4561767578125, 7.715576171875, 7.9749755859375, 8.234375]}, "gradients/decoder.transformer.h.11.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 4.0, 5.0, 2.0, 5.0, 8.0, 10.0, 27.0, 22.0, 35.0, 43.0, 55.0, 64.0, 59.0, 101.0, 129.0, 211.0, 374.0, 964.0, 5285.0, 127266.0, 2834064.0, 1193684.0, 27977.0, 2337.0, 605.0, 274.0, 180.0, 120.0, 90.0, 48.0, 55.0, 45.0, 40.0, 22.0, 19.0, 12.0, 14.0, 10.0, 13.0, 4.0, 1.0, 2.0, 3.0, 2.0, 1.0, 3.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0], "bins": [-21.875, -21.2080078125, -20.541015625, -19.8740234375, -19.20703125, -18.5400390625, -17.873046875, -17.2060546875, -16.5390625, -15.8720703125, -15.205078125, -14.5380859375, -13.87109375, -13.2041015625, -12.537109375, -11.8701171875, -11.203125, -10.5361328125, -9.869140625, -9.2021484375, -8.53515625, -7.8681640625, -7.201171875, -6.5341796875, -5.8671875, -5.2001953125, -4.533203125, -3.8662109375, -3.19921875, -2.5322265625, -1.865234375, -1.1982421875, -0.53125, 0.1357421875, 0.802734375, 1.4697265625, 2.13671875, 2.8037109375, 3.470703125, 4.1376953125, 4.8046875, 5.4716796875, 6.138671875, 6.8056640625, 7.47265625, 8.1396484375, 8.806640625, 9.4736328125, 10.140625, 10.8076171875, 11.474609375, 12.1416015625, 12.80859375, 13.4755859375, 14.142578125, 14.8095703125, 15.4765625, 16.1435546875, 16.810546875, 17.4775390625, 18.14453125, 18.8115234375, 19.478515625, 20.1455078125, 20.8125]}, "gradients/decoder.transformer.h.11.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 3.0, 2.0, 1.0, 4.0, 3.0, 0.0, 1.0, 5.0, 1.0, 5.0, 10.0, 9.0, 16.0, 20.0, 22.0, 35.0, 38.0, 52.0, 72.0, 121.0, 164.0, 235.0, 293.0, 373.0, 476.0, 525.0, 443.0, 331.0, 227.0, 177.0, 108.0, 78.0, 61.0, 50.0, 35.0, 33.0, 15.0, 12.0, 8.0, 10.0, 5.0, 4.0, 1.0, 0.0, 4.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-16.375, -15.9442138671875, -15.513427734375, -15.0826416015625, -14.65185546875, -14.2210693359375, -13.790283203125, -13.3594970703125, -12.9287109375, -12.4979248046875, -12.067138671875, -11.6363525390625, -11.20556640625, -10.7747802734375, -10.343994140625, -9.9132080078125, -9.482421875, -9.0516357421875, -8.620849609375, -8.1900634765625, -7.75927734375, -7.3284912109375, -6.897705078125, -6.4669189453125, -6.0361328125, -5.6053466796875, -5.174560546875, -4.7437744140625, -4.31298828125, -3.8822021484375, -3.451416015625, -3.0206298828125, -2.58984375, -2.1590576171875, -1.728271484375, -1.2974853515625, -0.86669921875, -0.4359130859375, -0.005126953125, 0.4256591796875, 0.8564453125, 1.2872314453125, 1.718017578125, 2.1488037109375, 2.57958984375, 3.0103759765625, 3.441162109375, 3.8719482421875, 4.302734375, 4.7335205078125, 5.164306640625, 5.5950927734375, 6.02587890625, 6.4566650390625, 6.887451171875, 7.3182373046875, 7.7490234375, 8.1798095703125, 8.610595703125, 9.0413818359375, 9.47216796875, 9.9029541015625, 10.333740234375, 10.7645263671875, 11.1953125]}, "gradients/decoder.transformer.h.11.mlp.c_fc.weight": {"_type": "histogram", "values": [2.0, 2.0, 4.0, 3.0, 4.0, 9.0, 3.0, 7.0, 10.0, 13.0, 14.0, 18.0, 24.0, 39.0, 48.0, 59.0, 66.0, 80.0, 107.0, 139.0, 167.0, 230.0, 363.0, 545.0, 1711.0, 1212441.0, 2974188.0, 2061.0, 552.0, 345.0, 254.0, 176.0, 121.0, 98.0, 77.0, 58.0, 53.0, 36.0, 34.0, 36.0, 17.0, 23.0, 11.0, 9.0, 5.0, 14.0, 8.0, 1.0, 4.0, 0.0, 3.0, 3.0, 1.0, 1.0, 2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-59.84375, -57.54638671875, -55.2490234375, -52.95166015625, -50.654296875, -48.35693359375, -46.0595703125, -43.76220703125, -41.46484375, -39.16748046875, -36.8701171875, -34.57275390625, -32.275390625, -29.97802734375, -27.6806640625, -25.38330078125, -23.0859375, -20.78857421875, -18.4912109375, -16.19384765625, -13.896484375, -11.59912109375, -9.3017578125, -7.00439453125, -4.70703125, -2.40966796875, -0.1123046875, 2.18505859375, 4.482421875, 6.77978515625, 9.0771484375, 11.37451171875, 13.671875, 15.96923828125, 18.2666015625, 20.56396484375, 22.861328125, 25.15869140625, 27.4560546875, 29.75341796875, 32.05078125, 34.34814453125, 36.6455078125, 38.94287109375, 41.240234375, 43.53759765625, 45.8349609375, 48.13232421875, 50.4296875, 52.72705078125, 55.0244140625, 57.32177734375, 59.619140625, 61.91650390625, 64.2138671875, 66.51123046875, 68.80859375, 71.10595703125, 73.4033203125, 75.70068359375, 77.998046875, 80.29541015625, 82.5927734375, 84.89013671875, 87.1875]}, "gradients/decoder.transformer.h.11.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 8.0, 29.0, 131.0, 283.0, 320.0, 178.0, 56.0, 10.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-145.71664428710938, -141.57821655273438, -137.4397735595703, -133.3013458251953, -129.16290283203125, -125.02447509765625, -120.88603973388672, -116.74760437011719, -112.60916900634766, -108.47073364257812, -104.3322982788086, -100.19386291503906, -96.05543518066406, -91.9169921875, -87.778564453125, -83.64012908935547, -79.50169372558594, -75.3632583618164, -71.22482299804688, -67.08638763427734, -62.94795608520508, -58.80952072143555, -54.67108917236328, -50.53265380859375, -46.39421844482422, -42.25578308105469, -38.117347717285156, -33.97891616821289, -29.84048080444336, -25.702045440673828, -21.56361198425293, -17.42517852783203, -13.2867431640625, -9.148308753967285, -5.00987434387207, -0.8714399337768555, 3.2669944763183594, 7.405429840087891, 11.543863296508789, 15.682296752929688, 19.82073211669922, 23.95916748046875, 28.09760093688965, 32.23603439331055, 36.37446975708008, 40.51290512084961, 44.651336669921875, 48.789772033691406, 52.92820739746094, 57.06664276123047, 61.205078125, 65.34351348876953, 69.48194885253906, 73.62037658691406, 77.7588119506836, 81.89724731445312, 86.03568267822266, 90.17411804199219, 94.31255340576172, 98.45098876953125, 102.58941650390625, 106.72785949707031, 110.86628723144531, 115.00472259521484, 119.14315795898438]}, "gradients/decoder.transformer.h.11.ln_2.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 2.0, 1.0, 6.0, 1.0, 9.0, 6.0, 6.0, 18.0, 17.0, 22.0, 17.0, 27.0, 31.0, 35.0, 39.0, 38.0, 42.0, 50.0, 55.0, 74.0, 59.0, 44.0, 58.0, 53.0, 49.0, 44.0, 33.0, 40.0, 24.0, 31.0, 12.0, 15.0, 13.0, 11.0, 9.0, 6.0, 8.0, 7.0, 1.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-51.41609191894531, -49.62321472167969, -47.83034133911133, -46.03746795654297, -44.244590759277344, -42.45171356201172, -40.65884017944336, -38.865966796875, -37.073089599609375, -35.28021240234375, -33.48733901977539, -31.6944637298584, -29.901588439941406, -28.108713150024414, -26.315837860107422, -24.52296257019043, -22.730087280273438, -20.937211990356445, -19.144336700439453, -17.35146141052246, -15.558586120605469, -13.765710830688477, -11.972835540771484, -10.179960250854492, -8.3870849609375, -6.594209671020508, -4.801334381103516, -3.0084590911865234, -1.2155838012695312, 0.5772914886474609, 2.370166778564453, 4.163042068481445, 5.9559173583984375, 7.74879264831543, 9.541667938232422, 11.334543228149414, 13.127418518066406, 14.920293807983398, 16.71316909790039, 18.506044387817383, 20.298919677734375, 22.091794967651367, 23.88467025756836, 25.67754554748535, 27.470420837402344, 29.263296127319336, 31.056171417236328, 32.84904479980469, 34.64192199707031, 36.43479919433594, 38.2276725769043, 40.020545959472656, 41.81342315673828, 43.606300354003906, 45.399173736572266, 47.192047119140625, 48.98492431640625, 50.777801513671875, 52.570674896240234, 54.363548278808594, 56.15642547607422, 57.949302673339844, 59.7421760559082, 61.53504943847656, 63.32792663574219]}, "gradients/decoder.transformer.h.11.crossattention.c_proj.bias": {"_type": "histogram", "values": [3.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 5.0, 5.0, 4.0, 11.0, 3.0, 8.0, 5.0, 10.0, 15.0, 17.0, 19.0, 19.0, 15.0, 27.0, 39.0, 25.0, 34.0, 43.0, 38.0, 33.0, 51.0, 37.0, 39.0, 47.0, 41.0, 33.0, 45.0, 35.0, 41.0, 36.0, 24.0, 27.0, 23.0, 21.0, 21.0, 17.0, 22.0, 12.0, 14.0, 8.0, 7.0, 10.0, 5.0, 7.0, 3.0, 4.0, 0.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-6.5546875, -6.33599853515625, -6.1173095703125, -5.89862060546875, -5.679931640625, -5.46124267578125, -5.2425537109375, -5.02386474609375, -4.80517578125, -4.58648681640625, -4.3677978515625, -4.14910888671875, -3.930419921875, -3.71173095703125, -3.4930419921875, -3.27435302734375, -3.0556640625, -2.83697509765625, -2.6182861328125, -2.39959716796875, -2.180908203125, -1.96221923828125, -1.7435302734375, -1.52484130859375, -1.30615234375, -1.08746337890625, -0.8687744140625, -0.65008544921875, -0.431396484375, -0.21270751953125, 0.0059814453125, 0.22467041015625, 0.443359375, 0.66204833984375, 0.8807373046875, 1.09942626953125, 1.318115234375, 1.53680419921875, 1.7554931640625, 1.97418212890625, 2.19287109375, 2.41156005859375, 2.6302490234375, 2.84893798828125, 3.067626953125, 3.28631591796875, 3.5050048828125, 3.72369384765625, 3.9423828125, 4.16107177734375, 4.3797607421875, 4.59844970703125, 4.817138671875, 5.03582763671875, 5.2545166015625, 5.47320556640625, 5.69189453125, 5.91058349609375, 6.1292724609375, 6.34796142578125, 6.566650390625, 6.78533935546875, 7.0040283203125, 7.22271728515625, 7.44140625]}, "gradients/decoder.transformer.h.11.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 3.0, 4.0, 7.0, 4.0, 9.0, 10.0, 14.0, 18.0, 46.0, 43.0, 80.0, 105.0, 151.0, 196.0, 315.0, 448.0, 620.0, 969.0, 1428.0, 2090.0, 3255.0, 4961.0, 7739.0, 11893.0, 19160.0, 30174.0, 49927.0, 89638.0, 181778.0, 326659.0, 134311.0, 71036.0, 41240.0, 25091.0, 15858.0, 9999.0, 6555.0, 4263.0, 2793.0, 1844.0, 1232.0, 813.0, 560.0, 375.0, 255.0, 173.0, 126.0, 93.0, 63.0, 41.0, 39.0, 13.0, 18.0, 16.0, 6.0, 8.0, 2.0, 2.0, 2.0, 0.0, 1.0, 1.0], "bins": [-1.8115234375, -1.753662109375, -1.69580078125, -1.637939453125, -1.580078125, -1.522216796875, -1.46435546875, -1.406494140625, -1.3486328125, -1.290771484375, -1.23291015625, -1.175048828125, -1.1171875, -1.059326171875, -1.00146484375, -0.943603515625, -0.8857421875, -0.827880859375, -0.77001953125, -0.712158203125, -0.654296875, -0.596435546875, -0.53857421875, -0.480712890625, -0.4228515625, -0.364990234375, -0.30712890625, -0.249267578125, -0.19140625, -0.133544921875, -0.07568359375, -0.017822265625, 0.0400390625, 0.097900390625, 0.15576171875, 0.213623046875, 0.271484375, 0.329345703125, 0.38720703125, 0.445068359375, 0.5029296875, 0.560791015625, 0.61865234375, 0.676513671875, 0.734375, 0.792236328125, 0.85009765625, 0.907958984375, 0.9658203125, 1.023681640625, 1.08154296875, 1.139404296875, 1.197265625, 1.255126953125, 1.31298828125, 1.370849609375, 1.4287109375, 1.486572265625, 1.54443359375, 1.602294921875, 1.66015625, 1.718017578125, 1.77587890625, 1.833740234375, 1.8916015625]}, "gradients/decoder.transformer.h.11.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 2.0, 5.0, 7.0, 6.0, 2.0, 6.0, 9.0, 9.0, 10.0, 15.0, 14.0, 21.0, 15.0, 21.0, 11.0, 24.0, 23.0, 21.0, 37.0, 49.0, 34.0, 32.0, 35.0, 48.0, 40.0, 1064.0, 42.0, 38.0, 34.0, 38.0, 31.0, 24.0, 37.0, 37.0, 25.0, 19.0, 22.0, 30.0, 13.0, 11.0, 12.0, 9.0, 12.0, 9.0, 9.0, 5.0, 8.0, 3.0, 3.0, 3.0, 1.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0], "bins": [-4.40234375, -4.2669677734375, -4.131591796875, -3.9962158203125, -3.86083984375, -3.7254638671875, -3.590087890625, -3.4547119140625, -3.3193359375, -3.1839599609375, -3.048583984375, -2.9132080078125, -2.77783203125, -2.6424560546875, -2.507080078125, -2.3717041015625, -2.236328125, -2.1009521484375, -1.965576171875, -1.8302001953125, -1.69482421875, -1.5594482421875, -1.424072265625, -1.2886962890625, -1.1533203125, -1.0179443359375, -0.882568359375, -0.7471923828125, -0.61181640625, -0.4764404296875, -0.341064453125, -0.2056884765625, -0.0703125, 0.0650634765625, 0.200439453125, 0.3358154296875, 0.47119140625, 0.6065673828125, 0.741943359375, 0.8773193359375, 1.0126953125, 1.1480712890625, 1.283447265625, 1.4188232421875, 1.55419921875, 1.6895751953125, 1.824951171875, 1.9603271484375, 2.095703125, 2.2310791015625, 2.366455078125, 2.5018310546875, 2.63720703125, 2.7725830078125, 2.907958984375, 3.0433349609375, 3.1787109375, 3.3140869140625, 3.449462890625, 3.5848388671875, 3.72021484375, 3.8555908203125, 3.990966796875, 4.1263427734375, 4.26171875]}, "gradients/decoder.transformer.h.11.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 3.0, 1.0, 2.0, 2.0, 5.0, 4.0, 11.0, 12.0, 10.0, 19.0, 34.0, 44.0, 61.0, 100.0, 195.0, 314.0, 514.0, 832.0, 1470.0, 2601.0, 4339.0, 7537.0, 13446.0, 24670.0, 47065.0, 94677.0, 244192.0, 1415915.0, 116798.0, 56688.0, 28995.0, 15695.0, 8798.0, 5062.0, 2851.0, 1680.0, 1033.0, 581.0, 311.0, 215.0, 108.0, 87.0, 56.0, 33.0, 16.0, 21.0, 10.0, 7.0, 7.0, 4.0, 6.0, 5.0, 1.0, 1.0, 3.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.31640625, -2.24200439453125, -2.1676025390625, -2.09320068359375, -2.018798828125, -1.94439697265625, -1.8699951171875, -1.79559326171875, -1.72119140625, -1.64678955078125, -1.5723876953125, -1.49798583984375, -1.423583984375, -1.34918212890625, -1.2747802734375, -1.20037841796875, -1.1259765625, -1.05157470703125, -0.9771728515625, -0.90277099609375, -0.828369140625, -0.75396728515625, -0.6795654296875, -0.60516357421875, -0.53076171875, -0.45635986328125, -0.3819580078125, -0.30755615234375, -0.233154296875, -0.15875244140625, -0.0843505859375, -0.00994873046875, 0.064453125, 0.13885498046875, 0.2132568359375, 0.28765869140625, 0.362060546875, 0.43646240234375, 0.5108642578125, 0.58526611328125, 0.65966796875, 0.73406982421875, 0.8084716796875, 0.88287353515625, 0.957275390625, 1.03167724609375, 1.1060791015625, 1.18048095703125, 1.2548828125, 1.32928466796875, 1.4036865234375, 1.47808837890625, 1.552490234375, 1.62689208984375, 1.7012939453125, 1.77569580078125, 1.85009765625, 1.92449951171875, 1.9989013671875, 2.07330322265625, 2.147705078125, 2.22210693359375, 2.2965087890625, 2.37091064453125, 2.4453125]}, "gradients/decoder.transformer.h.11.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 2.0, 1.0, 3.0, 4.0, 6.0, 6.0, 5.0, 5.0, 8.0, 10.0, 10.0, 12.0, 16.0, 15.0, 19.0, 24.0, 23.0, 33.0, 50.0, 66.0, 66.0, 74.0, 84.0, 64.0, 73.0, 53.0, 54.0, 45.0, 31.0, 26.0, 24.0, 18.0, 19.0, 10.0, 9.0, 7.0, 7.0, 4.0, 3.0, 3.0, 8.0, 1.0, 3.0, 2.0, 1.0, 2.0, 3.0, 0.0, 1.0, 2.0, 0.0, 1.0, 2.0], "bins": [-0.0010089874267578125, -0.000979110598564148, -0.0009492337703704834, -0.0009193569421768188, -0.0008894801139831543, -0.0008596032857894897, -0.0008297264575958252, -0.0007998496294021606, -0.0007699728012084961, -0.0007400959730148315, -0.000710219144821167, -0.0006803423166275024, -0.0006504654884338379, -0.0006205886602401733, -0.0005907118320465088, -0.0005608350038528442, -0.0005309581756591797, -0.0005010813474655151, -0.0004712045192718506, -0.00044132769107818604, -0.0004114508628845215, -0.00038157403469085693, -0.0003516972064971924, -0.00032182037830352783, -0.0002919435501098633, -0.00026206672191619873, -0.00023218989372253418, -0.00020231306552886963, -0.00017243623733520508, -0.00014255940914154053, -0.00011268258094787598, -8.280575275421143e-05, -5.2928924560546875e-05, -2.3052096366882324e-05, 6.8247318267822266e-06, 3.670156002044678e-05, 6.657838821411133e-05, 9.645521640777588e-05, 0.00012633204460144043, 0.00015620887279510498, 0.00018608570098876953, 0.00021596252918243408, 0.00024583935737609863, 0.0002757161855697632, 0.00030559301376342773, 0.0003354698419570923, 0.00036534667015075684, 0.0003952234983444214, 0.00042510032653808594, 0.0004549771547317505, 0.00048485398292541504, 0.0005147308111190796, 0.0005446076393127441, 0.0005744844675064087, 0.0006043612957000732, 0.0006342381238937378, 0.0006641149520874023, 0.0006939917802810669, 0.0007238686084747314, 0.000753745436668396, 0.0007836222648620605, 0.0008134990930557251, 0.0008433759212493896, 0.0008732527494430542, 0.0009031295776367188]}, "gradients/decoder.transformer.h.11.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 3.0, 1.0, 0.0, 1.0, 1.0, 3.0, 3.0, 3.0, 2.0, 3.0, 3.0, 3.0, 3.0, 5.0, 6.0, 6.0, 11.0, 14.0, 18.0, 23.0, 44.0, 39.0, 56.0, 77.0, 142.0, 234.0, 568.0, 7345.0, 1034835.0, 3922.0, 522.0, 225.0, 139.0, 68.0, 47.0, 35.0, 32.0, 26.0, 11.0, 16.0, 15.0, 11.0, 12.0, 8.0, 7.0, 6.0, 1.0, 3.0, 5.0, 3.0, 3.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0], "bins": [-0.0240020751953125, -0.02326226234436035, -0.022522449493408203, -0.021782636642456055, -0.021042823791503906, -0.020303010940551758, -0.01956319808959961, -0.01882338523864746, -0.018083572387695312, -0.017343759536743164, -0.016603946685791016, -0.015864133834838867, -0.015124320983886719, -0.01438450813293457, -0.013644695281982422, -0.012904882431030273, -0.012165069580078125, -0.011425256729125977, -0.010685443878173828, -0.00994563102722168, -0.009205818176269531, -0.008466005325317383, -0.007726192474365234, -0.006986379623413086, -0.0062465667724609375, -0.005506753921508789, -0.004766941070556641, -0.004027128219604492, -0.0032873153686523438, -0.0025475025177001953, -0.0018076896667480469, -0.0010678768157958984, -0.00032806396484375, 0.00041174888610839844, 0.0011515617370605469, 0.0018913745880126953, 0.0026311874389648438, 0.003371000289916992, 0.004110813140869141, 0.004850625991821289, 0.0055904388427734375, 0.006330251693725586, 0.007070064544677734, 0.007809877395629883, 0.008549690246582031, 0.00928950309753418, 0.010029315948486328, 0.010769128799438477, 0.011508941650390625, 0.012248754501342773, 0.012988567352294922, 0.01372838020324707, 0.014468193054199219, 0.015208005905151367, 0.015947818756103516, 0.016687631607055664, 0.017427444458007812, 0.01816725730895996, 0.01890707015991211, 0.019646883010864258, 0.020386695861816406, 0.021126508712768555, 0.021866321563720703, 0.02260613441467285, 0.023345947265625]}, "gradients/decoder.transformer.h.11.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 6.0, 10.0, 22.0, 38.0, 61.0, 101.0, 123.0, 154.0, 123.0, 135.0, 100.0, 59.0, 37.0, 22.0, 13.0, 0.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.000758499139919877, -0.000737445370759815, -0.0007163916015997529, -0.0006953377742320299, -0.0006742840050719678, -0.0006532302359119058, -0.0006321764667518437, -0.0006111226975917816, -0.0005900689284317195, -0.0005690151592716575, -0.0005479613901115954, -0.0005269076209515333, -0.0005058537935838103, -0.00048480002442374825, -0.0004637462552636862, -0.0004426924861036241, -0.0004216386878397316, -0.0004005849186796695, -0.00037953112041577697, -0.0003584773512557149, -0.0003374235820956528, -0.00031636981293559074, -0.0002953160146716982, -0.00027426224551163614, -0.0002532084472477436, -0.0002321546635357663, -0.00021110089437570423, -0.00019004711066372693, -0.00016899334150366485, -0.00014793955779168755, -0.00012688577407971025, -0.00010583200491964817, -8.47782357595861e-05, -6.372445932356641e-05, -4.267067924956791e-05, -2.1616899175569415e-05, -5.631227395497262e-07, 2.0490653696469963e-05, 4.1544437408447266e-05, 6.259820656850934e-05, 8.365199028048664e-05, 0.00010470576671650633, 0.00012575954315252602, 0.00014681332686450332, 0.00016786711057648063, 0.0001889208797365427, 0.00020997466344852, 0.00023102843260858208, 0.0002520822163205594, 0.00027313598548062146, 0.000294189783744514, 0.00031524355290457606, 0.00033629732206463814, 0.0003573510912247002, 0.00037840488948859274, 0.0003994586586486548, 0.00042051245691254735, 0.0004415662260726094, 0.00046262002433650196, 0.00048367379349656403, 0.0005047275917604566, 0.0005257813609205186, 0.0005468351300805807, 0.0005678888992406428, 0.0005889426684007049]}, "gradients/decoder.transformer.h.11.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 2.0, 3.0, 5.0, 3.0, 4.0, 4.0, 2.0, 4.0, 10.0, 8.0, 11.0, 10.0, 23.0, 18.0, 17.0, 12.0, 20.0, 35.0, 34.0, 37.0, 36.0, 38.0, 41.0, 41.0, 34.0, 41.0, 39.0, 43.0, 51.0, 36.0, 43.0, 38.0, 33.0, 33.0, 31.0, 27.0, 17.0, 28.0, 23.0, 13.0, 13.0, 12.0, 9.0, 5.0, 6.0, 5.0, 9.0, 3.0, 2.0, 4.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.00041943788528442383, -0.0004041166976094246, -0.00038879550993442535, -0.0003734743222594261, -0.0003581531345844269, -0.00034283194690942764, -0.0003275107592344284, -0.00031218957155942917, -0.00029686838388442993, -0.0002815471962094307, -0.00026622600853443146, -0.0002509048208594322, -0.00023558363318443298, -0.00022026244550943375, -0.0002049412578344345, -0.00018962007015943527, -0.00017429888248443604, -0.0001589776948094368, -0.00014365650713443756, -0.00012833531945943832, -0.00011301413178443909, -9.769294410943985e-05, -8.237175643444061e-05, -6.705056875944138e-05, -5.172938108444214e-05, -3.64081934094429e-05, -2.1087005734443665e-05, -5.7658180594444275e-06, 9.55536961555481e-06, 2.4876557290554047e-05, 4.0197744965553284e-05, 5.551893264055252e-05, 7.084012031555176e-05, 8.6161307990551e-05, 0.00010148249566555023, 0.00011680368334054947, 0.0001321248710155487, 0.00014744605869054794, 0.00016276724636554718, 0.00017808843404054642, 0.00019340962171554565, 0.0002087308093905449, 0.00022405199706554413, 0.00023937318474054337, 0.0002546943724155426, 0.00027001556009054184, 0.0002853367477655411, 0.0003006579354405403, 0.00031597912311553955, 0.0003313003107905388, 0.000346621498465538, 0.00036194268614053726, 0.0003772638738155365, 0.00039258506149053574, 0.000407906249165535, 0.0004232274368405342, 0.00043854862451553345, 0.0004538698121905327, 0.0004691909998655319, 0.00048451218754053116, 0.0004998333752155304, 0.0005151545628905296, 0.0005304757505655289, 0.0005457969382405281, 0.0005611181259155273]}, "gradients/decoder.transformer.h.11.attn.c_proj.bias": {"_type": "histogram", "values": [3.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 5.0, 5.0, 4.0, 11.0, 3.0, 8.0, 5.0, 10.0, 15.0, 17.0, 19.0, 19.0, 15.0, 27.0, 39.0, 25.0, 34.0, 43.0, 38.0, 33.0, 51.0, 37.0, 39.0, 47.0, 41.0, 33.0, 45.0, 35.0, 41.0, 36.0, 24.0, 27.0, 23.0, 21.0, 21.0, 17.0, 22.0, 12.0, 14.0, 8.0, 7.0, 10.0, 5.0, 7.0, 3.0, 4.0, 0.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-6.5546875, -6.33599853515625, -6.1173095703125, -5.89862060546875, -5.679931640625, -5.46124267578125, -5.2425537109375, -5.02386474609375, -4.80517578125, -4.58648681640625, -4.3677978515625, -4.14910888671875, -3.930419921875, -3.71173095703125, -3.4930419921875, -3.27435302734375, -3.0556640625, -2.83697509765625, -2.6182861328125, -2.39959716796875, -2.180908203125, -1.96221923828125, -1.7435302734375, -1.52484130859375, -1.30615234375, -1.08746337890625, -0.8687744140625, -0.65008544921875, -0.431396484375, -0.21270751953125, 0.0059814453125, 0.22467041015625, 0.443359375, 0.66204833984375, 0.8807373046875, 1.09942626953125, 1.318115234375, 1.53680419921875, 1.7554931640625, 1.97418212890625, 2.19287109375, 2.41156005859375, 2.6302490234375, 2.84893798828125, 3.067626953125, 3.28631591796875, 3.5050048828125, 3.72369384765625, 3.9423828125, 4.16107177734375, 4.3797607421875, 4.59844970703125, 4.817138671875, 5.03582763671875, 5.2545166015625, 5.47320556640625, 5.69189453125, 5.91058349609375, 6.1292724609375, 6.34796142578125, 6.566650390625, 6.78533935546875, 7.0040283203125, 7.22271728515625, 7.44140625]}, "gradients/decoder.transformer.h.11.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 3.0, 1.0, 1.0, 5.0, 5.0, 8.0, 10.0, 20.0, 26.0, 43.0, 79.0, 103.0, 210.0, 326.0, 584.0, 1254.0, 3064.0, 8203.0, 25457.0, 105424.0, 528745.0, 292430.0, 56929.0, 15969.0, 5468.0, 2123.0, 956.0, 450.0, 266.0, 149.0, 104.0, 54.0, 29.0, 19.0, 19.0, 15.0, 7.0, 2.0, 5.0, 2.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-6.4296875, -6.21392822265625, -5.9981689453125, -5.78240966796875, -5.566650390625, -5.35089111328125, -5.1351318359375, -4.91937255859375, -4.70361328125, -4.48785400390625, -4.2720947265625, -4.05633544921875, -3.840576171875, -3.62481689453125, -3.4090576171875, -3.19329833984375, -2.9775390625, -2.76177978515625, -2.5460205078125, -2.33026123046875, -2.114501953125, -1.89874267578125, -1.6829833984375, -1.46722412109375, -1.25146484375, -1.03570556640625, -0.8199462890625, -0.60418701171875, -0.388427734375, -0.17266845703125, 0.0430908203125, 0.25885009765625, 0.474609375, 0.69036865234375, 0.9061279296875, 1.12188720703125, 1.337646484375, 1.55340576171875, 1.7691650390625, 1.98492431640625, 2.20068359375, 2.41644287109375, 2.6322021484375, 2.84796142578125, 3.063720703125, 3.27947998046875, 3.4952392578125, 3.71099853515625, 3.9267578125, 4.14251708984375, 4.3582763671875, 4.57403564453125, 4.789794921875, 5.00555419921875, 5.2213134765625, 5.43707275390625, 5.65283203125, 5.86859130859375, 6.0843505859375, 6.30010986328125, 6.515869140625, 6.73162841796875, 6.9473876953125, 7.16314697265625, 7.37890625]}, "gradients/decoder.transformer.h.11.attn.c_attn.bias": {"_type": "histogram", "values": [3.0, 0.0, 0.0, 2.0, 3.0, 0.0, 4.0, 6.0, 6.0, 6.0, 5.0, 15.0, 9.0, 17.0, 25.0, 23.0, 29.0, 19.0, 28.0, 44.0, 49.0, 34.0, 51.0, 39.0, 66.0, 125.0, 1657.0, 325.0, 89.0, 49.0, 44.0, 34.0, 29.0, 31.0, 39.0, 32.0, 33.0, 21.0, 16.0, 12.0, 13.0, 11.0, 5.0, 6.0, 8.0, 2.0, 2.0, 0.0, 2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-21.265625, -20.472412109375, -19.67919921875, -18.885986328125, -18.0927734375, -17.299560546875, -16.50634765625, -15.713134765625, -14.919921875, -14.126708984375, -13.33349609375, -12.540283203125, -11.7470703125, -10.953857421875, -10.16064453125, -9.367431640625, -8.57421875, -7.781005859375, -6.98779296875, -6.194580078125, -5.4013671875, -4.608154296875, -3.81494140625, -3.021728515625, -2.228515625, -1.435302734375, -0.64208984375, 0.151123046875, 0.9443359375, 1.737548828125, 2.53076171875, 3.323974609375, 4.1171875, 4.910400390625, 5.70361328125, 6.496826171875, 7.2900390625, 8.083251953125, 8.87646484375, 9.669677734375, 10.462890625, 11.256103515625, 12.04931640625, 12.842529296875, 13.6357421875, 14.428955078125, 15.22216796875, 16.015380859375, 16.80859375, 17.601806640625, 18.39501953125, 19.188232421875, 19.9814453125, 20.774658203125, 21.56787109375, 22.361083984375, 23.154296875, 23.947509765625, 24.74072265625, 25.533935546875, 26.3271484375, 27.120361328125, 27.91357421875, 28.706787109375, 29.5]}, "gradients/decoder.transformer.h.11.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 3.0, 0.0, 3.0, 1.0, 9.0, 6.0, 7.0, 14.0, 15.0, 13.0, 24.0, 26.0, 48.0, 61.0, 69.0, 94.0, 116.0, 157.0, 249.0, 375.0, 970.0, 35354.0, 3100479.0, 5764.0, 593.0, 326.0, 219.0, 144.0, 149.0, 119.0, 66.0, 41.0, 46.0, 40.0, 33.0, 23.0, 19.0, 12.0, 7.0, 8.0, 4.0, 3.0, 5.0, 2.0, 2.0, 3.0, 0.0, 1.0, 2.0], "bins": [-70.75, -68.8544921875, -66.958984375, -65.0634765625, -63.16796875, -61.2724609375, -59.376953125, -57.4814453125, -55.5859375, -53.6904296875, -51.794921875, -49.8994140625, -48.00390625, -46.1083984375, -44.212890625, -42.3173828125, -40.421875, -38.5263671875, -36.630859375, -34.7353515625, -32.83984375, -30.9443359375, -29.048828125, -27.1533203125, -25.2578125, -23.3623046875, -21.466796875, -19.5712890625, -17.67578125, -15.7802734375, -13.884765625, -11.9892578125, -10.09375, -8.1982421875, -6.302734375, -4.4072265625, -2.51171875, -0.6162109375, 1.279296875, 3.1748046875, 5.0703125, 6.9658203125, 8.861328125, 10.7568359375, 12.65234375, 14.5478515625, 16.443359375, 18.3388671875, 20.234375, 22.1298828125, 24.025390625, 25.9208984375, 27.81640625, 29.7119140625, 31.607421875, 33.5029296875, 35.3984375, 37.2939453125, 39.189453125, 41.0849609375, 42.98046875, 44.8759765625, 46.771484375, 48.6669921875, 50.5625]}, "gradients/decoder.transformer.h.11.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 3.0, 610.0, 403.0, 1.0, 2.0], "bins": [-400.2997131347656, -393.7281188964844, -387.15655517578125, -380.5849609375, -374.0133972167969, -367.4418029785156, -360.8702087402344, -354.29864501953125, -347.72705078125, -341.15545654296875, -334.5838928222656, -328.0122985839844, -321.44073486328125, -314.869140625, -308.29754638671875, -301.7259826660156, -295.1543884277344, -288.5827941894531, -282.01123046875, -275.43963623046875, -268.8680725097656, -262.2964782714844, -255.7248992919922, -249.1533203125, -242.5817413330078, -236.01016235351562, -229.43858337402344, -222.8669891357422, -216.29541015625, -209.7238311767578, -203.15225219726562, -196.58065795898438, -190.00906372070312, -183.43748474121094, -176.86590576171875, -170.2943115234375, -163.7227325439453, -157.15115356445312, -150.57957458496094, -144.00799560546875, -137.43641662597656, -130.86483764648438, -124.29325103759766, -117.72167205810547, -111.15008544921875, -104.57850646972656, -98.00692749023438, -91.43534088134766, -84.86375427246094, -78.29217529296875, -71.72058868408203, -65.14900970458984, -58.577423095703125, -52.00584411621094, -45.434261322021484, -38.86267852783203, -32.29109573364258, -25.719512939453125, -19.147930145263672, -12.576349258422852, -6.004766464233398, 0.5668144226074219, 7.138397216796875, 13.709980010986328, 20.28156280517578]}, "gradients/decoder.transformer.h.11.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 3.0, 3.0, 5.0, 5.0, 5.0, 11.0, 7.0, 11.0, 16.0, 19.0, 23.0, 29.0, 27.0, 33.0, 32.0, 42.0, 47.0, 58.0, 50.0, 52.0, 52.0, 38.0, 47.0, 47.0, 41.0, 42.0, 42.0, 40.0, 42.0, 25.0, 22.0, 20.0, 22.0, 17.0, 9.0, 10.0, 7.0, 5.0, 2.0, 2.0, 1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-64.42731475830078, -62.367454528808594, -60.307594299316406, -58.24773406982422, -56.18787384033203, -54.128013610839844, -52.06815719604492, -50.008296966552734, -47.94843673706055, -45.88857650756836, -43.82871627807617, -41.768856048583984, -39.70899963378906, -37.649139404296875, -35.58927917480469, -33.5294189453125, -31.469558715820312, -29.409698486328125, -27.349838256835938, -25.289979934692383, -23.230119705200195, -21.170259475708008, -19.110401153564453, -17.050540924072266, -14.990680694580078, -12.93082046508789, -10.87096118927002, -8.811101913452148, -6.751241683959961, -4.691381454467773, -2.6315221786499023, -0.5716629028320312, 1.488189697265625, 3.5480494499206543, 5.607909202575684, 7.667768955230713, 9.727628707885742, 11.78748893737793, 13.8473482131958, 15.907207489013672, 17.96706771850586, 20.026927947998047, 22.086788177490234, 24.14664649963379, 26.206506729125977, 28.266366958618164, 30.32622528076172, 32.386085510253906, 34.445945739746094, 36.50580596923828, 38.56566619873047, 40.625526428222656, 42.685386657714844, 44.74524688720703, 46.80510330200195, 48.86496353149414, 50.92482376098633, 52.984683990478516, 55.0445442199707, 57.10440444946289, 59.16426086425781, 61.22412109375, 63.28398132324219, 65.34384155273438, 67.40370178222656]}, "gradients/decoder.transformer.h.10.mlp.c_proj.bias": {"_type": "histogram", "values": [4.0, 2.0, 1.0, 0.0, 6.0, 2.0, 4.0, 5.0, 3.0, 6.0, 13.0, 7.0, 8.0, 10.0, 17.0, 17.0, 19.0, 14.0, 20.0, 26.0, 29.0, 26.0, 48.0, 34.0, 29.0, 34.0, 46.0, 32.0, 49.0, 38.0, 35.0, 30.0, 44.0, 47.0, 39.0, 27.0, 29.0, 26.0, 30.0, 19.0, 24.0, 22.0, 13.0, 16.0, 12.0, 9.0, 12.0, 6.0, 6.0, 9.0, 6.0, 2.0, 3.0, 0.0, 1.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-6.30078125, -6.08428955078125, -5.8677978515625, -5.65130615234375, -5.434814453125, -5.21832275390625, -5.0018310546875, -4.78533935546875, -4.56884765625, -4.35235595703125, -4.1358642578125, -3.91937255859375, -3.702880859375, -3.48638916015625, -3.2698974609375, -3.05340576171875, -2.8369140625, -2.62042236328125, -2.4039306640625, -2.18743896484375, -1.970947265625, -1.75445556640625, -1.5379638671875, -1.32147216796875, -1.10498046875, -0.88848876953125, -0.6719970703125, -0.45550537109375, -0.239013671875, -0.02252197265625, 0.1939697265625, 0.41046142578125, 0.626953125, 0.84344482421875, 1.0599365234375, 1.27642822265625, 1.492919921875, 1.70941162109375, 1.9259033203125, 2.14239501953125, 2.35888671875, 2.57537841796875, 2.7918701171875, 3.00836181640625, 3.224853515625, 3.44134521484375, 3.6578369140625, 3.87432861328125, 4.0908203125, 4.30731201171875, 4.5238037109375, 4.74029541015625, 4.956787109375, 5.17327880859375, 5.3897705078125, 5.60626220703125, 5.82275390625, 6.03924560546875, 6.2557373046875, 6.47222900390625, 6.688720703125, 6.90521240234375, 7.1217041015625, 7.33819580078125, 7.5546875]}, "gradients/decoder.transformer.h.10.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 2.0, 1.0, 1.0, 4.0, 2.0, 1.0, 10.0, 4.0, 8.0, 7.0, 14.0, 14.0, 15.0, 13.0, 17.0, 15.0, 24.0, 25.0, 30.0, 45.0, 86.0, 116.0, 261.0, 525.0, 1572.0, 10001.0, 437703.0, 3476968.0, 257089.0, 7354.0, 1243.0, 437.0, 227.0, 131.0, 82.0, 49.0, 31.0, 26.0, 28.0, 15.0, 15.0, 12.0, 21.0, 9.0, 8.0, 9.0, 5.0, 7.0, 2.0, 2.0, 4.0, 1.0, 1.0, 3.0, 3.0, 0.0, 0.0, 2.0, 0.0, 1.0, 1.0], "bins": [-22.234375, -21.50390625, -20.7734375, -20.04296875, -19.3125, -18.58203125, -17.8515625, -17.12109375, -16.390625, -15.66015625, -14.9296875, -14.19921875, -13.46875, -12.73828125, -12.0078125, -11.27734375, -10.546875, -9.81640625, -9.0859375, -8.35546875, -7.625, -6.89453125, -6.1640625, -5.43359375, -4.703125, -3.97265625, -3.2421875, -2.51171875, -1.78125, -1.05078125, -0.3203125, 0.41015625, 1.140625, 1.87109375, 2.6015625, 3.33203125, 4.0625, 4.79296875, 5.5234375, 6.25390625, 6.984375, 7.71484375, 8.4453125, 9.17578125, 9.90625, 10.63671875, 11.3671875, 12.09765625, 12.828125, 13.55859375, 14.2890625, 15.01953125, 15.75, 16.48046875, 17.2109375, 17.94140625, 18.671875, 19.40234375, 20.1328125, 20.86328125, 21.59375, 22.32421875, 23.0546875, 23.78515625, 24.515625]}, "gradients/decoder.transformer.h.10.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 2.0, 3.0, 3.0, 2.0, 3.0, 3.0, 5.0, 10.0, 9.0, 10.0, 13.0, 22.0, 31.0, 46.0, 54.0, 77.0, 91.0, 109.0, 201.0, 263.0, 360.0, 473.0, 500.0, 477.0, 376.0, 270.0, 176.0, 128.0, 89.0, 89.0, 54.0, 39.0, 31.0, 18.0, 15.0, 9.0, 6.0, 5.0, 5.0, 3.0, 1.0, 3.0, 2.0, 3.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-11.046875, -10.6142578125, -10.181640625, -9.7490234375, -9.31640625, -8.8837890625, -8.451171875, -8.0185546875, -7.5859375, -7.1533203125, -6.720703125, -6.2880859375, -5.85546875, -5.4228515625, -4.990234375, -4.5576171875, -4.125, -3.6923828125, -3.259765625, -2.8271484375, -2.39453125, -1.9619140625, -1.529296875, -1.0966796875, -0.6640625, -0.2314453125, 0.201171875, 0.6337890625, 1.06640625, 1.4990234375, 1.931640625, 2.3642578125, 2.796875, 3.2294921875, 3.662109375, 4.0947265625, 4.52734375, 4.9599609375, 5.392578125, 5.8251953125, 6.2578125, 6.6904296875, 7.123046875, 7.5556640625, 7.98828125, 8.4208984375, 8.853515625, 9.2861328125, 9.71875, 10.1513671875, 10.583984375, 11.0166015625, 11.44921875, 11.8818359375, 12.314453125, 12.7470703125, 13.1796875, 13.6123046875, 14.044921875, 14.4775390625, 14.91015625, 15.3427734375, 15.775390625, 16.2080078125, 16.640625]}, "gradients/decoder.transformer.h.10.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 1.0, 4.0, 6.0, 5.0, 11.0, 7.0, 13.0, 15.0, 25.0, 38.0, 39.0, 35.0, 54.0, 68.0, 78.0, 106.0, 149.0, 167.0, 226.0, 382.0, 659.0, 4620.0, 4106229.0, 78575.0, 1048.0, 498.0, 309.0, 194.0, 162.0, 125.0, 105.0, 62.0, 65.0, 47.0, 41.0, 28.0, 18.0, 21.0, 10.0, 15.0, 8.0, 13.0, 2.0, 4.0, 5.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-84.5625, -82.1259765625, -79.689453125, -77.2529296875, -74.81640625, -72.3798828125, -69.943359375, -67.5068359375, -65.0703125, -62.6337890625, -60.197265625, -57.7607421875, -55.32421875, -52.8876953125, -50.451171875, -48.0146484375, -45.578125, -43.1416015625, -40.705078125, -38.2685546875, -35.83203125, -33.3955078125, -30.958984375, -28.5224609375, -26.0859375, -23.6494140625, -21.212890625, -18.7763671875, -16.33984375, -13.9033203125, -11.466796875, -9.0302734375, -6.59375, -4.1572265625, -1.720703125, 0.7158203125, 3.15234375, 5.5888671875, 8.025390625, 10.4619140625, 12.8984375, 15.3349609375, 17.771484375, 20.2080078125, 22.64453125, 25.0810546875, 27.517578125, 29.9541015625, 32.390625, 34.8271484375, 37.263671875, 39.7001953125, 42.13671875, 44.5732421875, 47.009765625, 49.4462890625, 51.8828125, 54.3193359375, 56.755859375, 59.1923828125, 61.62890625, 64.0654296875, 66.501953125, 68.9384765625, 71.375]}, "gradients/decoder.transformer.h.10.ln_2.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 4.0, 5.0, 20.0, 45.0, 140.0, 247.0, 273.0, 178.0, 71.0, 26.0, 7.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-32.20957565307617, -29.051116943359375, -25.892656326293945, -22.734195709228516, -19.57573699951172, -16.417278289794922, -13.258817672729492, -10.100357055664062, -6.941898345947266, -3.7834386825561523, -0.6249790191650391, 2.533480644226074, 5.6919403076171875, 8.8503999710083, 12.008859634399414, 15.167320251464844, 18.32577896118164, 21.484237670898438, 24.642698287963867, 27.801158905029297, 30.959617614746094, 34.11807632446289, 37.27653503417969, 40.43499755859375, 43.59345626831055, 46.751914978027344, 49.910377502441406, 53.0688362121582, 56.227294921875, 59.3857536315918, 62.544212341308594, 65.70267486572266, 68.86112976074219, 72.01959228515625, 75.17804718017578, 78.33650970458984, 81.49496459960938, 84.65342712402344, 87.8118896484375, 90.97035217285156, 94.1288070678711, 97.28726959228516, 100.44572448730469, 103.60418701171875, 106.76264953613281, 109.92110443115234, 113.0795669555664, 116.23802185058594, 119.396484375, 122.55494689941406, 125.7134017944336, 128.87185668945312, 132.0303192138672, 135.18878173828125, 138.3472442626953, 141.50570678710938, 144.66415405273438, 147.82261657714844, 150.9810791015625, 154.1395263671875, 157.29798889160156, 160.45645141601562, 163.6149139404297, 166.77337646484375, 169.9318389892578]}, "gradients/decoder.transformer.h.10.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 3.0, 3.0, 2.0, 2.0, 3.0, 4.0, 3.0, 6.0, 11.0, 12.0, 14.0, 15.0, 13.0, 15.0, 20.0, 21.0, 28.0, 43.0, 35.0, 41.0, 40.0, 29.0, 35.0, 44.0, 38.0, 42.0, 45.0, 30.0, 24.0, 37.0, 38.0, 37.0, 29.0, 29.0, 27.0, 26.0, 23.0, 25.0, 20.0, 21.0, 19.0, 9.0, 10.0, 8.0, 9.0, 5.0, 4.0, 4.0, 3.0, 6.0, 3.0, 2.0, 1.0, 1.0, 0.0, 1.0], "bins": [-44.12298583984375, -42.827606201171875, -41.532222747802734, -40.236839294433594, -38.94145965576172, -37.646080017089844, -36.3506965637207, -35.05531311035156, -33.75993347167969, -32.46455383300781, -31.169170379638672, -29.873788833618164, -28.578407287597656, -27.28302574157715, -25.98764419555664, -24.692262649536133, -23.396881103515625, -22.101499557495117, -20.80611801147461, -19.5107364654541, -18.215354919433594, -16.919973373413086, -15.624591827392578, -14.32921028137207, -13.033828735351562, -11.738447189331055, -10.443065643310547, -9.147684097290039, -7.852302551269531, -6.556921005249023, -5.261539459228516, -3.966157913208008, -2.6707763671875, -1.3753948211669922, -0.08001327514648438, 1.2153682708740234, 2.5107498168945312, 3.806131362915039, 5.101512908935547, 6.396894454956055, 7.6922760009765625, 8.98765754699707, 10.283039093017578, 11.578420639038086, 12.873802185058594, 14.169183731079102, 15.46456527709961, 16.759946823120117, 18.055328369140625, 19.350709915161133, 20.64609146118164, 21.94147300720215, 23.236854553222656, 24.532236099243164, 25.827617645263672, 27.12299919128418, 28.418380737304688, 29.713762283325195, 31.009143829345703, 32.304527282714844, 33.59990692138672, 34.895286560058594, 36.190670013427734, 37.486053466796875, 38.78143310546875]}, "gradients/decoder.transformer.h.10.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 2.0, 3.0, 6.0, 4.0, 6.0, 4.0, 5.0, 6.0, 9.0, 10.0, 11.0, 9.0, 16.0, 17.0, 13.0, 29.0, 32.0, 36.0, 36.0, 45.0, 46.0, 40.0, 42.0, 35.0, 44.0, 46.0, 41.0, 41.0, 39.0, 42.0, 36.0, 25.0, 37.0, 16.0, 28.0, 27.0, 16.0, 26.0, 14.0, 17.0, 12.0, 7.0, 11.0, 12.0, 4.0, 3.0, 2.0, 2.0, 0.0, 2.0, 0.0, 2.0, 0.0, 0.0, 4.0], "bins": [-7.80859375, -7.57952880859375, -7.3504638671875, -7.12139892578125, -6.892333984375, -6.66326904296875, -6.4342041015625, -6.20513916015625, -5.97607421875, -5.74700927734375, -5.5179443359375, -5.28887939453125, -5.059814453125, -4.83074951171875, -4.6016845703125, -4.37261962890625, -4.1435546875, -3.91448974609375, -3.6854248046875, -3.45635986328125, -3.227294921875, -2.99822998046875, -2.7691650390625, -2.54010009765625, -2.31103515625, -2.08197021484375, -1.8529052734375, -1.62384033203125, -1.394775390625, -1.16571044921875, -0.9366455078125, -0.70758056640625, -0.478515625, -0.24945068359375, -0.0203857421875, 0.20867919921875, 0.437744140625, 0.66680908203125, 0.8958740234375, 1.12493896484375, 1.35400390625, 1.58306884765625, 1.8121337890625, 2.04119873046875, 2.270263671875, 2.49932861328125, 2.7283935546875, 2.95745849609375, 3.1865234375, 3.41558837890625, 3.6446533203125, 3.87371826171875, 4.102783203125, 4.33184814453125, 4.5609130859375, 4.78997802734375, 5.01904296875, 5.24810791015625, 5.4771728515625, 5.70623779296875, 5.935302734375, 6.16436767578125, 6.3934326171875, 6.62249755859375, 6.8515625]}, "gradients/decoder.transformer.h.10.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 6.0, 10.0, 5.0, 9.0, 13.0, 20.0, 35.0, 54.0, 66.0, 81.0, 141.0, 197.0, 264.0, 383.0, 545.0, 788.0, 1094.0, 1684.0, 2310.0, 3482.0, 4957.0, 7635.0, 11722.0, 18443.0, 29347.0, 48634.0, 83799.0, 160026.0, 335991.0, 139833.0, 75275.0, 44475.0, 27020.0, 17074.0, 11017.0, 7082.0, 4784.0, 3303.0, 2024.0, 1457.0, 1045.0, 724.0, 505.0, 336.0, 267.0, 165.0, 121.0, 100.0, 75.0, 44.0, 35.0, 23.0, 16.0, 12.0, 6.0, 5.0, 3.0, 1.0, 2.0, 2.0, 1.0], "bins": [-1.7197265625, -1.66497802734375, -1.6102294921875, -1.55548095703125, -1.500732421875, -1.44598388671875, -1.3912353515625, -1.33648681640625, -1.28173828125, -1.22698974609375, -1.1722412109375, -1.11749267578125, -1.062744140625, -1.00799560546875, -0.9532470703125, -0.89849853515625, -0.84375, -0.78900146484375, -0.7342529296875, -0.67950439453125, -0.624755859375, -0.57000732421875, -0.5152587890625, -0.46051025390625, -0.40576171875, -0.35101318359375, -0.2962646484375, -0.24151611328125, -0.186767578125, -0.13201904296875, -0.0772705078125, -0.02252197265625, 0.0322265625, 0.08697509765625, 0.1417236328125, 0.19647216796875, 0.251220703125, 0.30596923828125, 0.3607177734375, 0.41546630859375, 0.47021484375, 0.52496337890625, 0.5797119140625, 0.63446044921875, 0.689208984375, 0.74395751953125, 0.7987060546875, 0.85345458984375, 0.908203125, 0.96295166015625, 1.0177001953125, 1.07244873046875, 1.127197265625, 1.18194580078125, 1.2366943359375, 1.29144287109375, 1.34619140625, 1.40093994140625, 1.4556884765625, 1.51043701171875, 1.565185546875, 1.61993408203125, 1.6746826171875, 1.72943115234375, 1.7841796875]}, "gradients/decoder.transformer.h.10.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 5.0, 3.0, 4.0, 12.0, 6.0, 4.0, 9.0, 11.0, 12.0, 15.0, 9.0, 25.0, 37.0, 24.0, 36.0, 38.0, 33.0, 44.0, 56.0, 34.0, 46.0, 1068.0, 45.0, 39.0, 46.0, 51.0, 45.0, 27.0, 26.0, 32.0, 33.0, 24.0, 22.0, 23.0, 20.0, 11.0, 16.0, 10.0, 10.0, 8.0, 4.0, 4.0, 3.0, 4.0, 4.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.50390625, -4.3463134765625, -4.188720703125, -4.0311279296875, -3.87353515625, -3.7159423828125, -3.558349609375, -3.4007568359375, -3.2431640625, -3.0855712890625, -2.927978515625, -2.7703857421875, -2.61279296875, -2.4552001953125, -2.297607421875, -2.1400146484375, -1.982421875, -1.8248291015625, -1.667236328125, -1.5096435546875, -1.35205078125, -1.1944580078125, -1.036865234375, -0.8792724609375, -0.7216796875, -0.5640869140625, -0.406494140625, -0.2489013671875, -0.09130859375, 0.0662841796875, 0.223876953125, 0.3814697265625, 0.5390625, 0.6966552734375, 0.854248046875, 1.0118408203125, 1.16943359375, 1.3270263671875, 1.484619140625, 1.6422119140625, 1.7998046875, 1.9573974609375, 2.114990234375, 2.2725830078125, 2.43017578125, 2.5877685546875, 2.745361328125, 2.9029541015625, 3.060546875, 3.2181396484375, 3.375732421875, 3.5333251953125, 3.69091796875, 3.8485107421875, 4.006103515625, 4.1636962890625, 4.3212890625, 4.4788818359375, 4.636474609375, 4.7940673828125, 4.95166015625, 5.1092529296875, 5.266845703125, 5.4244384765625, 5.58203125]}, "gradients/decoder.transformer.h.10.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 0.0, 1.0, 4.0, 5.0, 2.0, 4.0, 9.0, 10.0, 19.0, 19.0, 37.0, 55.0, 85.0, 139.0, 203.0, 425.0, 682.0, 1234.0, 2266.0, 4169.0, 7856.0, 15075.0, 31167.0, 66230.0, 161641.0, 1493918.0, 173442.0, 70743.0, 33188.0, 16270.0, 8240.0, 4441.0, 2473.0, 1331.0, 726.0, 414.0, 234.0, 150.0, 86.0, 51.0, 26.0, 16.0, 17.0, 16.0, 12.0, 4.0, 3.0, 3.0, 1.0, 2.0, 1.0, 0.0, 1.0, 1.0], "bins": [-3.0625, -2.976318359375, -2.89013671875, -2.803955078125, -2.7177734375, -2.631591796875, -2.54541015625, -2.459228515625, -2.373046875, -2.286865234375, -2.20068359375, -2.114501953125, -2.0283203125, -1.942138671875, -1.85595703125, -1.769775390625, -1.68359375, -1.597412109375, -1.51123046875, -1.425048828125, -1.3388671875, -1.252685546875, -1.16650390625, -1.080322265625, -0.994140625, -0.907958984375, -0.82177734375, -0.735595703125, -0.6494140625, -0.563232421875, -0.47705078125, -0.390869140625, -0.3046875, -0.218505859375, -0.13232421875, -0.046142578125, 0.0400390625, 0.126220703125, 0.21240234375, 0.298583984375, 0.384765625, 0.470947265625, 0.55712890625, 0.643310546875, 0.7294921875, 0.815673828125, 0.90185546875, 0.988037109375, 1.07421875, 1.160400390625, 1.24658203125, 1.332763671875, 1.4189453125, 1.505126953125, 1.59130859375, 1.677490234375, 1.763671875, 1.849853515625, 1.93603515625, 2.022216796875, 2.1083984375, 2.194580078125, 2.28076171875, 2.366943359375, 2.453125]}, "gradients/decoder.transformer.h.10.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 4.0, 3.0, 4.0, 5.0, 7.0, 5.0, 4.0, 8.0, 9.0, 10.0, 5.0, 15.0, 14.0, 17.0, 17.0, 39.0, 51.0, 53.0, 46.0, 64.0, 72.0, 96.0, 77.0, 62.0, 49.0, 54.0, 42.0, 38.0, 18.0, 18.0, 16.0, 18.0, 17.0, 12.0, 9.0, 12.0, 6.0, 5.0, 4.0, 2.0, 2.0, 3.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.0014057159423828125, -0.0013637840747833252, -0.0013218522071838379, -0.0012799203395843506, -0.0012379884719848633, -0.001196056604385376, -0.0011541247367858887, -0.0011121928691864014, -0.001070261001586914, -0.0010283291339874268, -0.0009863972663879395, -0.0009444653987884521, -0.0009025335311889648, -0.0008606016635894775, -0.0008186697959899902, -0.0007767379283905029, -0.0007348060607910156, -0.0006928741931915283, -0.000650942325592041, -0.0006090104579925537, -0.0005670785903930664, -0.0005251467227935791, -0.0004832148551940918, -0.0004412829875946045, -0.0003993511199951172, -0.0003574192523956299, -0.0003154873847961426, -0.0002735555171966553, -0.00023162364959716797, -0.00018969178199768066, -0.00014775991439819336, -0.00010582804679870605, -6.389617919921875e-05, -2.1964311599731445e-05, 1.996755599975586e-05, 6.189942359924316e-05, 0.00010383129119873047, 0.00014576315879821777, 0.00018769502639770508, 0.00022962689399719238, 0.0002715587615966797, 0.000313490629196167, 0.0003554224967956543, 0.0003973543643951416, 0.0004392862319946289, 0.0004812180995941162, 0.0005231499671936035, 0.0005650818347930908, 0.0006070137023925781, 0.0006489455699920654, 0.0006908774375915527, 0.00073280930519104, 0.0007747411727905273, 0.0008166730403900146, 0.000858604907989502, 0.0009005367755889893, 0.0009424686431884766, 0.0009844005107879639, 0.0010263323783874512, 0.0010682642459869385, 0.0011101961135864258, 0.001152127981185913, 0.0011940598487854004, 0.0012359917163848877, 0.001277923583984375]}, "gradients/decoder.transformer.h.10.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 2.0, 0.0, 3.0, 2.0, 3.0, 3.0, 6.0, 6.0, 7.0, 8.0, 14.0, 21.0, 22.0, 23.0, 19.0, 33.0, 49.0, 53.0, 91.0, 152.0, 431.0, 7213.0, 1038501.0, 1174.0, 283.0, 132.0, 69.0, 47.0, 48.0, 30.0, 27.0, 10.0, 19.0, 12.0, 10.0, 8.0, 6.0, 5.0, 5.0, 3.0, 1.0, 6.0, 2.0, 4.0, 3.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0], "bins": [-0.037384033203125, -0.03626203536987305, -0.035140037536621094, -0.03401803970336914, -0.03289604187011719, -0.031774044036865234, -0.03065204620361328, -0.029530048370361328, -0.028408050537109375, -0.027286052703857422, -0.02616405487060547, -0.025042057037353516, -0.023920059204101562, -0.02279806137084961, -0.021676063537597656, -0.020554065704345703, -0.01943206787109375, -0.018310070037841797, -0.017188072204589844, -0.01606607437133789, -0.014944076538085938, -0.013822078704833984, -0.012700080871582031, -0.011578083038330078, -0.010456085205078125, -0.009334087371826172, -0.008212089538574219, -0.007090091705322266, -0.0059680938720703125, -0.004846096038818359, -0.0037240982055664062, -0.002602100372314453, -0.0014801025390625, -0.0003581047058105469, 0.0007638931274414062, 0.0018858909606933594, 0.0030078887939453125, 0.004129886627197266, 0.005251884460449219, 0.006373882293701172, 0.007495880126953125, 0.008617877960205078, 0.009739875793457031, 0.010861873626708984, 0.011983871459960938, 0.01310586929321289, 0.014227867126464844, 0.015349864959716797, 0.01647186279296875, 0.017593860626220703, 0.018715858459472656, 0.01983785629272461, 0.020959854125976562, 0.022081851959228516, 0.02320384979248047, 0.024325847625732422, 0.025447845458984375, 0.026569843292236328, 0.02769184112548828, 0.028813838958740234, 0.029935836791992188, 0.03105783462524414, 0.032179832458496094, 0.03330183029174805, 0.034423828125]}, "gradients/decoder.transformer.h.10.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 5.0, 83.0, 533.0, 356.0, 35.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.00460321269929409, -0.004502732772380114, -0.004402253311127424, -0.004301773384213448, -0.004201293922960758, -0.0041008139960467815, -0.004000334534794092, -0.0038998546078801155, -0.0037993749137967825, -0.0036988952197134495, -0.0035984155256301165, -0.0034979358315467834, -0.0033974561374634504, -0.0032969764433801174, -0.0031964965164661407, -0.0030960168223828077, -0.0029955371282994747, -0.0028950574342161417, -0.0027945777401328087, -0.0026940980460494757, -0.0025936183519661427, -0.002493138425052166, -0.0023926589637994766, -0.0022921790368855, -0.0021916995756328106, -0.0020912198815494776, -0.0019907401874661446, -0.0018902604933828115, -0.0017897806828841567, -0.0016893009888008237, -0.0015888212947174907, -0.0014883414842188358, -0.0013878619065508246, -0.0012873822124674916, -0.0011869025183841586, -0.0010864227078855038, -0.0009859430138021708, -0.0008854633197188377, -0.0007849836256355047, -0.0006845038733445108, -0.0005840241792611778, -0.0004835444560740143, -0.00038306473288685083, -0.0002825850388035178, -0.00018210531561635435, -8.162559242919087e-05, 1.885410165414214e-05, 0.00011933385394513607, 0.00021981354802846909, 0.00032029327121563256, 0.00042077299440279603, 0.000521252688486129, 0.000621732440777123, 0.000722212134860456, 0.000822691828943789, 0.0009231715812347829, 0.0010236513335257769, 0.0011241310276091099, 0.001224610721692443, 0.001325090415775776, 0.0014255702262744308, 0.0015260499203577638, 0.0016265296144410968, 0.0017270094249397516, 0.0018274890026077628]}, "gradients/decoder.transformer.h.10.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 0.0, 2.0, 1.0, 4.0, 0.0, 3.0, 2.0, 2.0, 10.0, 8.0, 15.0, 7.0, 17.0, 11.0, 21.0, 23.0, 32.0, 26.0, 23.0, 33.0, 38.0, 38.0, 32.0, 31.0, 49.0, 47.0, 45.0, 46.0, 47.0, 31.0, 52.0, 43.0, 25.0, 33.0, 28.0, 30.0, 29.0, 18.0, 21.0, 24.0, 10.0, 12.0, 6.0, 9.0, 11.0, 8.0, 4.0, 3.0, 1.0, 4.0, 0.0, 2.0, 1.0, 1.0], "bins": [-0.0007723569869995117, -0.0007513277232646942, -0.0007302984595298767, -0.0007092691957950592, -0.0006882399320602417, -0.0006672106683254242, -0.0006461814045906067, -0.0006251521408557892, -0.0006041228771209717, -0.0005830936133861542, -0.0005620643496513367, -0.0005410350859165192, -0.0005200058221817017, -0.0004989765584468842, -0.00047794729471206665, -0.00045691803097724915, -0.00043588876724243164, -0.00041485950350761414, -0.00039383023977279663, -0.0003728009760379791, -0.0003517717123031616, -0.0003307424485683441, -0.0003097131848335266, -0.0002886839210987091, -0.0002676546573638916, -0.0002466253936290741, -0.0002255961298942566, -0.0002045668661594391, -0.00018353760242462158, -0.00016250833868980408, -0.00014147907495498657, -0.00012044981122016907, -9.942054748535156e-05, -7.839128375053406e-05, -5.736202001571655e-05, -3.633275628089905e-05, -1.5303492546081543e-05, 5.725771188735962e-06, 2.6755034923553467e-05, 4.778429865837097e-05, 6.881356239318848e-05, 8.984282612800598e-05, 0.00011087208986282349, 0.000131901353597641, 0.0001529306173324585, 0.000173959881067276, 0.0001949891448020935, 0.000216018408536911, 0.00023704767227172852, 0.000258076936006546, 0.0002791061997413635, 0.00030013546347618103, 0.00032116472721099854, 0.00034219399094581604, 0.00036322325468063354, 0.00038425251841545105, 0.00040528178215026855, 0.00042631104588508606, 0.00044734030961990356, 0.00046836957335472107, 0.0004893988370895386, 0.0005104281008243561, 0.0005314573645591736, 0.0005524866282939911, 0.0005735158920288086]}, "gradients/decoder.transformer.h.10.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 2.0, 3.0, 6.0, 4.0, 6.0, 4.0, 5.0, 6.0, 9.0, 10.0, 11.0, 9.0, 16.0, 17.0, 13.0, 29.0, 32.0, 36.0, 36.0, 45.0, 46.0, 40.0, 42.0, 35.0, 44.0, 46.0, 41.0, 41.0, 39.0, 42.0, 36.0, 25.0, 37.0, 16.0, 28.0, 27.0, 16.0, 26.0, 14.0, 17.0, 12.0, 7.0, 11.0, 12.0, 4.0, 3.0, 2.0, 2.0, 0.0, 2.0, 0.0, 2.0, 0.0, 0.0, 4.0], "bins": [-7.80859375, -7.57952880859375, -7.3504638671875, -7.12139892578125, -6.892333984375, -6.66326904296875, -6.4342041015625, -6.20513916015625, -5.97607421875, -5.74700927734375, -5.5179443359375, -5.28887939453125, -5.059814453125, -4.83074951171875, -4.6016845703125, -4.37261962890625, -4.1435546875, -3.91448974609375, -3.6854248046875, -3.45635986328125, -3.227294921875, -2.99822998046875, -2.7691650390625, -2.54010009765625, -2.31103515625, -2.08197021484375, -1.8529052734375, -1.62384033203125, -1.394775390625, -1.16571044921875, -0.9366455078125, -0.70758056640625, -0.478515625, -0.24945068359375, -0.0203857421875, 0.20867919921875, 0.437744140625, 0.66680908203125, 0.8958740234375, 1.12493896484375, 1.35400390625, 1.58306884765625, 1.8121337890625, 2.04119873046875, 2.270263671875, 2.49932861328125, 2.7283935546875, 2.95745849609375, 3.1865234375, 3.41558837890625, 3.6446533203125, 3.87371826171875, 4.102783203125, 4.33184814453125, 4.5609130859375, 4.78997802734375, 5.01904296875, 5.24810791015625, 5.4771728515625, 5.70623779296875, 5.935302734375, 6.16436767578125, 6.3934326171875, 6.62249755859375, 6.8515625]}, "gradients/decoder.transformer.h.10.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 3.0, 1.0, 6.0, 6.0, 6.0, 6.0, 7.0, 9.0, 9.0, 23.0, 30.0, 35.0, 35.0, 45.0, 65.0, 103.0, 125.0, 220.0, 306.0, 557.0, 1169.0, 3273.0, 11806.0, 47920.0, 202458.0, 558627.0, 166315.0, 40211.0, 9825.0, 2867.0, 1047.0, 444.0, 265.0, 189.0, 138.0, 103.0, 70.0, 76.0, 39.0, 34.0, 19.0, 18.0, 12.0, 17.0, 9.0, 8.0, 1.0, 3.0, 1.0, 3.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0], "bins": [-8.46875, -8.2154541015625, -7.962158203125, -7.7088623046875, -7.45556640625, -7.2022705078125, -6.948974609375, -6.6956787109375, -6.4423828125, -6.1890869140625, -5.935791015625, -5.6824951171875, -5.42919921875, -5.1759033203125, -4.922607421875, -4.6693115234375, -4.416015625, -4.1627197265625, -3.909423828125, -3.6561279296875, -3.40283203125, -3.1495361328125, -2.896240234375, -2.6429443359375, -2.3896484375, -2.1363525390625, -1.883056640625, -1.6297607421875, -1.37646484375, -1.1231689453125, -0.869873046875, -0.6165771484375, -0.36328125, -0.1099853515625, 0.143310546875, 0.3966064453125, 0.64990234375, 0.9031982421875, 1.156494140625, 1.4097900390625, 1.6630859375, 1.9163818359375, 2.169677734375, 2.4229736328125, 2.67626953125, 2.9295654296875, 3.182861328125, 3.4361572265625, 3.689453125, 3.9427490234375, 4.196044921875, 4.4493408203125, 4.70263671875, 4.9559326171875, 5.209228515625, 5.4625244140625, 5.7158203125, 5.9691162109375, 6.222412109375, 6.4757080078125, 6.72900390625, 6.9822998046875, 7.235595703125, 7.4888916015625, 7.7421875]}, "gradients/decoder.transformer.h.10.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 5.0, 3.0, 2.0, 3.0, 3.0, 4.0, 8.0, 11.0, 10.0, 15.0, 11.0, 15.0, 18.0, 23.0, 34.0, 31.0, 30.0, 32.0, 40.0, 50.0, 47.0, 60.0, 141.0, 1727.0, 235.0, 77.0, 63.0, 53.0, 36.0, 37.0, 26.0, 31.0, 33.0, 23.0, 28.0, 14.0, 16.0, 17.0, 8.0, 8.0, 8.0, 8.0, 3.0, 6.0, 3.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 2.0, 1.0], "bins": [-25.015625, -24.2724609375, -23.529296875, -22.7861328125, -22.04296875, -21.2998046875, -20.556640625, -19.8134765625, -19.0703125, -18.3271484375, -17.583984375, -16.8408203125, -16.09765625, -15.3544921875, -14.611328125, -13.8681640625, -13.125, -12.3818359375, -11.638671875, -10.8955078125, -10.15234375, -9.4091796875, -8.666015625, -7.9228515625, -7.1796875, -6.4365234375, -5.693359375, -4.9501953125, -4.20703125, -3.4638671875, -2.720703125, -1.9775390625, -1.234375, -0.4912109375, 0.251953125, 0.9951171875, 1.73828125, 2.4814453125, 3.224609375, 3.9677734375, 4.7109375, 5.4541015625, 6.197265625, 6.9404296875, 7.68359375, 8.4267578125, 9.169921875, 9.9130859375, 10.65625, 11.3994140625, 12.142578125, 12.8857421875, 13.62890625, 14.3720703125, 15.115234375, 15.8583984375, 16.6015625, 17.3447265625, 18.087890625, 18.8310546875, 19.57421875, 20.3173828125, 21.060546875, 21.8037109375, 22.546875]}, "gradients/decoder.transformer.h.10.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 3.0, 5.0, 7.0, 17.0, 28.0, 36.0, 66.0, 97.0, 161.0, 259.0, 560.0, 3860.0, 3136453.0, 2965.0, 537.0, 267.0, 165.0, 90.0, 44.0, 37.0, 27.0, 10.0, 10.0, 5.0, 4.0, 0.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-120.125, -116.8310546875, -113.537109375, -110.2431640625, -106.94921875, -103.6552734375, -100.361328125, -97.0673828125, -93.7734375, -90.4794921875, -87.185546875, -83.8916015625, -80.59765625, -77.3037109375, -74.009765625, -70.7158203125, -67.421875, -64.1279296875, -60.833984375, -57.5400390625, -54.24609375, -50.9521484375, -47.658203125, -44.3642578125, -41.0703125, -37.7763671875, -34.482421875, -31.1884765625, -27.89453125, -24.6005859375, -21.306640625, -18.0126953125, -14.71875, -11.4248046875, -8.130859375, -4.8369140625, -1.54296875, 1.7509765625, 5.044921875, 8.3388671875, 11.6328125, 14.9267578125, 18.220703125, 21.5146484375, 24.80859375, 28.1025390625, 31.396484375, 34.6904296875, 37.984375, 41.2783203125, 44.572265625, 47.8662109375, 51.16015625, 54.4541015625, 57.748046875, 61.0419921875, 64.3359375, 67.6298828125, 70.923828125, 74.2177734375, 77.51171875, 80.8056640625, 84.099609375, 87.3935546875, 90.6875]}, "gradients/decoder.transformer.h.10.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 4.0, 9.0, 56.0, 219.0, 391.0, 252.0, 71.0, 14.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-42.63893508911133, -40.90486526489258, -39.17079544067383, -37.43672561645508, -35.702659606933594, -33.968589782714844, -32.234519958496094, -30.500450134277344, -28.766380310058594, -27.032310485839844, -25.298240661621094, -23.564172744750977, -21.830102920532227, -20.096033096313477, -18.36196517944336, -16.62789535522461, -14.89382553100586, -13.15975570678711, -11.425686836242676, -9.691617965698242, -7.957548141479492, -6.223478317260742, -4.489409446716309, -2.755340576171875, -1.021270751953125, 0.7127985954284668, 2.4468679428100586, 4.18093729019165, 5.915006637573242, 7.649076461791992, 9.383145332336426, 11.11721420288086, 12.851280212402344, 14.585350036621094, 16.319419860839844, 18.05348777770996, 19.78755760192871, 21.52162742614746, 23.255695343017578, 24.989765167236328, 26.723834991455078, 28.457904815673828, 30.191974639892578, 31.926042556762695, 33.66011047363281, 35.39418029785156, 37.12825012207031, 38.86231994628906, 40.59638977050781, 42.33045959472656, 44.06452941894531, 45.79859924316406, 47.53266906738281, 49.26673889160156, 51.00080490112305, 52.7348747253418, 54.46894454956055, 56.2030143737793, 57.93708419799805, 59.6711540222168, 61.40522003173828, 63.13928985595703, 64.87335968017578, 66.60742950439453, 68.34149932861328]}, "gradients/decoder.transformer.h.10.ln_1.bias": {"_type": "histogram", "values": [2.0, 1.0, 2.0, 0.0, 2.0, 1.0, 3.0, 4.0, 3.0, 4.0, 5.0, 11.0, 9.0, 15.0, 10.0, 10.0, 7.0, 17.0, 17.0, 22.0, 20.0, 20.0, 26.0, 30.0, 39.0, 36.0, 32.0, 31.0, 46.0, 37.0, 36.0, 30.0, 40.0, 42.0, 39.0, 37.0, 34.0, 26.0, 37.0, 26.0, 34.0, 25.0, 18.0, 23.0, 21.0, 24.0, 13.0, 15.0, 9.0, 5.0, 2.0, 3.0, 4.0, 6.0, 4.0, 3.0, 0.0, 0.0, 2.0, 2.0, 0.0, 0.0, 1.0, 1.0], "bins": [-53.49555206298828, -51.79428482055664, -50.093017578125, -48.391754150390625, -46.690486907958984, -44.989219665527344, -43.28795623779297, -41.58668899536133, -39.88542175292969, -38.18415451049805, -36.482887268066406, -34.78162384033203, -33.08035659790039, -31.37908935546875, -29.677824020385742, -27.976558685302734, -26.275291442871094, -24.574024200439453, -22.872758865356445, -21.171493530273438, -19.470226287841797, -17.768959045410156, -16.06769371032715, -14.366427421569824, -12.6651611328125, -10.963894844055176, -9.262628555297852, -7.561362266540527, -5.860095977783203, -4.158829689025879, -2.4575634002685547, -0.7562971115112305, 0.9449691772460938, 2.646235466003418, 4.347501754760742, 6.048768043518066, 7.750034332275391, 9.451300621032715, 11.152566909790039, 12.853833198547363, 14.555099487304688, 16.256366729736328, 17.957632064819336, 19.658897399902344, 21.360164642333984, 23.061431884765625, 24.762697219848633, 26.46396255493164, 28.16522979736328, 29.866497039794922, 31.56776237487793, 33.26902770996094, 34.97029495239258, 36.67156219482422, 38.372825622558594, 40.074092864990234, 41.775360107421875, 43.476627349853516, 45.177894592285156, 46.87915802001953, 48.58042526245117, 50.28169250488281, 51.98295593261719, 53.68422317504883, 55.38549041748047]}, "gradients/decoder.transformer.h.9.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 3.0, 4.0, 5.0, 6.0, 3.0, 4.0, 8.0, 5.0, 7.0, 13.0, 8.0, 12.0, 15.0, 20.0, 22.0, 22.0, 30.0, 43.0, 36.0, 48.0, 47.0, 40.0, 37.0, 47.0, 45.0, 44.0, 40.0, 40.0, 40.0, 41.0, 37.0, 26.0, 28.0, 25.0, 26.0, 22.0, 23.0, 17.0, 13.0, 17.0, 11.0, 11.0, 8.0, 5.0, 5.0, 2.0, 0.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0], "bins": [-8.234375, -7.9898681640625, -7.745361328125, -7.5008544921875, -7.25634765625, -7.0118408203125, -6.767333984375, -6.5228271484375, -6.2783203125, -6.0338134765625, -5.789306640625, -5.5447998046875, -5.30029296875, -5.0557861328125, -4.811279296875, -4.5667724609375, -4.322265625, -4.0777587890625, -3.833251953125, -3.5887451171875, -3.34423828125, -3.0997314453125, -2.855224609375, -2.6107177734375, -2.3662109375, -2.1217041015625, -1.877197265625, -1.6326904296875, -1.38818359375, -1.1436767578125, -0.899169921875, -0.6546630859375, -0.41015625, -0.1656494140625, 0.078857421875, 0.3233642578125, 0.56787109375, 0.8123779296875, 1.056884765625, 1.3013916015625, 1.5458984375, 1.7904052734375, 2.034912109375, 2.2794189453125, 2.52392578125, 2.7684326171875, 3.012939453125, 3.2574462890625, 3.501953125, 3.7464599609375, 3.990966796875, 4.2354736328125, 4.47998046875, 4.7244873046875, 4.968994140625, 5.2135009765625, 5.4580078125, 5.7025146484375, 5.947021484375, 6.1915283203125, 6.43603515625, 6.6805419921875, 6.925048828125, 7.1695556640625, 7.4140625]}, "gradients/decoder.transformer.h.9.mlp.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 2.0, 3.0, 4.0, 3.0, 4.0, 4.0, 5.0, 7.0, 4.0, 6.0, 11.0, 10.0, 13.0, 21.0, 28.0, 23.0, 38.0, 52.0, 51.0, 109.0, 155.0, 205.0, 372.0, 900.0, 5564.0, 209569.0, 3619828.0, 347984.0, 7133.0, 1025.0, 360.0, 218.0, 151.0, 110.0, 71.0, 49.0, 51.0, 30.0, 26.0, 13.0, 13.0, 6.0, 17.0, 19.0, 10.0, 7.0, 5.0, 3.0, 1.0, 1.0, 2.0, 2.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0], "bins": [-25.421875, -24.6162109375, -23.810546875, -23.0048828125, -22.19921875, -21.3935546875, -20.587890625, -19.7822265625, -18.9765625, -18.1708984375, -17.365234375, -16.5595703125, -15.75390625, -14.9482421875, -14.142578125, -13.3369140625, -12.53125, -11.7255859375, -10.919921875, -10.1142578125, -9.30859375, -8.5029296875, -7.697265625, -6.8916015625, -6.0859375, -5.2802734375, -4.474609375, -3.6689453125, -2.86328125, -2.0576171875, -1.251953125, -0.4462890625, 0.359375, 1.1650390625, 1.970703125, 2.7763671875, 3.58203125, 4.3876953125, 5.193359375, 5.9990234375, 6.8046875, 7.6103515625, 8.416015625, 9.2216796875, 10.02734375, 10.8330078125, 11.638671875, 12.4443359375, 13.25, 14.0556640625, 14.861328125, 15.6669921875, 16.47265625, 17.2783203125, 18.083984375, 18.8896484375, 19.6953125, 20.5009765625, 21.306640625, 22.1123046875, 22.91796875, 23.7236328125, 24.529296875, 25.3349609375, 26.140625]}, "gradients/decoder.transformer.h.9.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 3.0, 0.0, 2.0, 3.0, 6.0, 3.0, 9.0, 12.0, 12.0, 17.0, 29.0, 35.0, 39.0, 37.0, 48.0, 75.0, 96.0, 109.0, 136.0, 201.0, 280.0, 353.0, 478.0, 420.0, 391.0, 292.0, 229.0, 179.0, 135.0, 98.0, 82.0, 53.0, 40.0, 47.0, 26.0, 15.0, 22.0, 21.0, 15.0, 10.0, 3.0, 3.0, 5.0, 2.0, 1.0, 5.0, 4.0, 5.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-11.5625, -11.2147216796875, -10.866943359375, -10.5191650390625, -10.17138671875, -9.8236083984375, -9.475830078125, -9.1280517578125, -8.7802734375, -8.4324951171875, -8.084716796875, -7.7369384765625, -7.38916015625, -7.0413818359375, -6.693603515625, -6.3458251953125, -5.998046875, -5.6502685546875, -5.302490234375, -4.9547119140625, -4.60693359375, -4.2591552734375, -3.911376953125, -3.5635986328125, -3.2158203125, -2.8680419921875, -2.520263671875, -2.1724853515625, -1.82470703125, -1.4769287109375, -1.129150390625, -0.7813720703125, -0.43359375, -0.0858154296875, 0.261962890625, 0.6097412109375, 0.95751953125, 1.3052978515625, 1.653076171875, 2.0008544921875, 2.3486328125, 2.6964111328125, 3.044189453125, 3.3919677734375, 3.73974609375, 4.0875244140625, 4.435302734375, 4.7830810546875, 5.130859375, 5.4786376953125, 5.826416015625, 6.1741943359375, 6.52197265625, 6.8697509765625, 7.217529296875, 7.5653076171875, 7.9130859375, 8.2608642578125, 8.608642578125, 8.9564208984375, 9.30419921875, 9.6519775390625, 9.999755859375, 10.3475341796875, 10.6953125]}, "gradients/decoder.transformer.h.9.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 2.0, 5.0, 5.0, 3.0, 6.0, 7.0, 10.0, 11.0, 15.0, 20.0, 17.0, 26.0, 38.0, 42.0, 52.0, 56.0, 56.0, 91.0, 114.0, 158.0, 172.0, 253.0, 341.0, 615.0, 4225.0, 4050930.0, 134213.0, 1059.0, 444.0, 282.0, 204.0, 165.0, 119.0, 103.0, 78.0, 73.0, 61.0, 49.0, 42.0, 32.0, 23.0, 22.0, 14.0, 9.0, 11.0, 5.0, 4.0, 3.0, 3.0, 1.0, 2.0, 2.0, 1.0, 1.0, 0.0, 3.0, 1.0], "bins": [-79.0, -76.587890625, -74.17578125, -71.763671875, -69.3515625, -66.939453125, -64.52734375, -62.115234375, -59.703125, -57.291015625, -54.87890625, -52.466796875, -50.0546875, -47.642578125, -45.23046875, -42.818359375, -40.40625, -37.994140625, -35.58203125, -33.169921875, -30.7578125, -28.345703125, -25.93359375, -23.521484375, -21.109375, -18.697265625, -16.28515625, -13.873046875, -11.4609375, -9.048828125, -6.63671875, -4.224609375, -1.8125, 0.599609375, 3.01171875, 5.423828125, 7.8359375, 10.248046875, 12.66015625, 15.072265625, 17.484375, 19.896484375, 22.30859375, 24.720703125, 27.1328125, 29.544921875, 31.95703125, 34.369140625, 36.78125, 39.193359375, 41.60546875, 44.017578125, 46.4296875, 48.841796875, 51.25390625, 53.666015625, 56.078125, 58.490234375, 60.90234375, 63.314453125, 65.7265625, 68.138671875, 70.55078125, 72.962890625, 75.375]}, "gradients/decoder.transformer.h.9.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 5.0, 33.0, 117.0, 306.0, 326.0, 153.0, 61.0, 14.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-42.84886169433594, -38.99525833129883, -35.14165496826172, -31.288049697875977, -27.434446334838867, -23.580842971801758, -19.727237701416016, -15.873634338378906, -12.020030975341797, -8.166427612304688, -4.312823295593262, -0.45921897888183594, 3.3943843841552734, 7.247987747192383, 11.101593017578125, 14.955196380615234, 18.808799743652344, 22.662403106689453, 26.516006469726562, 30.369611740112305, 34.22321319580078, 38.076820373535156, 41.930423736572266, 45.784027099609375, 49.637630462646484, 53.491233825683594, 57.3448371887207, 61.19844055175781, 65.05204772949219, 68.90564727783203, 72.7592544555664, 76.61285400390625, 80.46646118164062, 84.320068359375, 88.17366790771484, 92.02727508544922, 95.88087463378906, 99.73448181152344, 103.58808898925781, 107.44168853759766, 111.2952880859375, 115.14889526367188, 119.00249481201172, 122.8561019897461, 126.70970153808594, 130.5633087158203, 134.4169158935547, 138.2705078125, 142.12411499023438, 145.97772216796875, 149.83132934570312, 153.68492126464844, 157.5385284423828, 161.3921356201172, 165.24574279785156, 169.09933471679688, 172.9529571533203, 176.8065643310547, 180.66017150878906, 184.51376342773438, 188.36737060546875, 192.22097778320312, 196.0745849609375, 199.92819213867188, 203.7817840576172]}, "gradients/decoder.transformer.h.9.ln_2.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 1.0, 3.0, 5.0, 9.0, 2.0, 7.0, 10.0, 6.0, 7.0, 15.0, 20.0, 12.0, 17.0, 8.0, 19.0, 28.0, 18.0, 26.0, 22.0, 32.0, 28.0, 43.0, 29.0, 36.0, 33.0, 42.0, 40.0, 33.0, 51.0, 48.0, 33.0, 27.0, 27.0, 32.0, 29.0, 26.0, 25.0, 23.0, 19.0, 18.0, 15.0, 20.0, 12.0, 14.0, 10.0, 7.0, 8.0, 8.0, 6.0, 1.0, 0.0, 1.0, 1.0, 3.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-34.623260498046875, -33.453102111816406, -32.28294372558594, -31.112781524658203, -29.942623138427734, -28.772464752197266, -27.602304458618164, -26.432144165039062, -25.261985778808594, -24.091827392578125, -22.921667098999023, -21.751506805419922, -20.581348419189453, -19.411190032958984, -18.241029739379883, -17.07086944580078, -15.900711059570312, -14.730551719665527, -13.560392379760742, -12.390233039855957, -11.220073699951172, -10.049914360046387, -8.879755020141602, -7.709595680236816, -6.539436340332031, -5.369277000427246, -4.199117660522461, -3.028958320617676, -1.8587989807128906, -0.6886396408081055, 0.4815196990966797, 1.6516790390014648, 2.82183837890625, 3.991997718811035, 5.16215705871582, 6.3323163986206055, 7.502475738525391, 8.672635078430176, 9.842794418334961, 11.012953758239746, 12.183113098144531, 13.353272438049316, 14.523431777954102, 15.693591117858887, 16.863750457763672, 18.03390884399414, 19.204069137573242, 20.374229431152344, 21.544387817382812, 22.71454620361328, 23.884706497192383, 25.054866790771484, 26.225025177001953, 27.395183563232422, 28.565343856811523, 29.735504150390625, 30.905662536621094, 32.07582092285156, 33.24597930908203, 34.416141510009766, 35.586299896240234, 36.7564582824707, 37.92662048339844, 39.096778869628906, 40.266937255859375]}, "gradients/decoder.transformer.h.9.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 2.0, 1.0, 3.0, 3.0, 2.0, 4.0, 6.0, 6.0, 6.0, 6.0, 13.0, 13.0, 5.0, 12.0, 12.0, 24.0, 30.0, 26.0, 36.0, 34.0, 41.0, 44.0, 48.0, 34.0, 34.0, 40.0, 42.0, 57.0, 43.0, 30.0, 42.0, 37.0, 28.0, 37.0, 23.0, 25.0, 20.0, 21.0, 23.0, 27.0, 10.0, 10.0, 16.0, 6.0, 12.0, 3.0, 4.0, 5.0, 1.0, 4.0, 0.0, 3.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-7.70703125, -7.47003173828125, -7.2330322265625, -6.99603271484375, -6.759033203125, -6.52203369140625, -6.2850341796875, -6.04803466796875, -5.81103515625, -5.57403564453125, -5.3370361328125, -5.10003662109375, -4.863037109375, -4.62603759765625, -4.3890380859375, -4.15203857421875, -3.9150390625, -3.67803955078125, -3.4410400390625, -3.20404052734375, -2.967041015625, -2.73004150390625, -2.4930419921875, -2.25604248046875, -2.01904296875, -1.78204345703125, -1.5450439453125, -1.30804443359375, -1.071044921875, -0.83404541015625, -0.5970458984375, -0.36004638671875, -0.123046875, 0.11395263671875, 0.3509521484375, 0.58795166015625, 0.824951171875, 1.06195068359375, 1.2989501953125, 1.53594970703125, 1.77294921875, 2.00994873046875, 2.2469482421875, 2.48394775390625, 2.720947265625, 2.95794677734375, 3.1949462890625, 3.43194580078125, 3.6689453125, 3.90594482421875, 4.1429443359375, 4.37994384765625, 4.616943359375, 4.85394287109375, 5.0909423828125, 5.32794189453125, 5.56494140625, 5.80194091796875, 6.0389404296875, 6.27593994140625, 6.512939453125, 6.74993896484375, 6.9869384765625, 7.22393798828125, 7.4609375]}, "gradients/decoder.transformer.h.9.crossattention.c_proj.weight": {"_type": "histogram", "values": [3.0, 3.0, 3.0, 8.0, 16.0, 15.0, 25.0, 27.0, 49.0, 54.0, 91.0, 114.0, 151.0, 216.0, 276.0, 418.0, 551.0, 786.0, 1061.0, 1527.0, 2164.0, 2962.0, 4493.0, 6500.0, 9353.0, 14172.0, 21442.0, 32371.0, 50570.0, 81153.0, 141778.0, 307000.0, 140281.0, 80907.0, 50112.0, 32191.0, 20822.0, 13861.0, 9431.0, 6469.0, 4424.0, 3206.0, 2157.0, 1505.0, 1089.0, 811.0, 549.0, 390.0, 270.0, 228.0, 125.0, 119.0, 78.0, 58.0, 43.0, 32.0, 20.0, 16.0, 14.0, 6.0, 5.0, 2.0, 2.0, 1.0], "bins": [-1.6044921875, -1.5535430908203125, -1.502593994140625, -1.4516448974609375, -1.40069580078125, -1.3497467041015625, -1.298797607421875, -1.2478485107421875, -1.1968994140625, -1.1459503173828125, -1.095001220703125, -1.0440521240234375, -0.99310302734375, -0.9421539306640625, -0.891204833984375, -0.8402557373046875, -0.789306640625, -0.7383575439453125, -0.687408447265625, -0.6364593505859375, -0.58551025390625, -0.5345611572265625, -0.483612060546875, -0.4326629638671875, -0.3817138671875, -0.3307647705078125, -0.279815673828125, -0.2288665771484375, -0.17791748046875, -0.1269683837890625, -0.076019287109375, -0.0250701904296875, 0.02587890625, 0.0768280029296875, 0.127777099609375, 0.1787261962890625, 0.22967529296875, 0.2806243896484375, 0.331573486328125, 0.3825225830078125, 0.4334716796875, 0.4844207763671875, 0.535369873046875, 0.5863189697265625, 0.63726806640625, 0.6882171630859375, 0.739166259765625, 0.7901153564453125, 0.841064453125, 0.8920135498046875, 0.942962646484375, 0.9939117431640625, 1.04486083984375, 1.0958099365234375, 1.146759033203125, 1.1977081298828125, 1.2486572265625, 1.2996063232421875, 1.350555419921875, 1.4015045166015625, 1.45245361328125, 1.5034027099609375, 1.554351806640625, 1.6053009033203125, 1.65625]}, "gradients/decoder.transformer.h.9.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 2.0, 2.0, 3.0, 0.0, 3.0, 3.0, 3.0, 3.0, 4.0, 5.0, 6.0, 4.0, 10.0, 16.0, 13.0, 19.0, 22.0, 20.0, 25.0, 36.0, 22.0, 34.0, 27.0, 32.0, 27.0, 35.0, 34.0, 39.0, 40.0, 1062.0, 34.0, 47.0, 33.0, 39.0, 32.0, 27.0, 33.0, 29.0, 29.0, 19.0, 27.0, 24.0, 19.0, 13.0, 13.0, 10.0, 11.0, 14.0, 13.0, 5.0, 8.0, 5.0, 3.0, 1.0, 0.0, 3.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-4.75390625, -4.60888671875, -4.4638671875, -4.31884765625, -4.173828125, -4.02880859375, -3.8837890625, -3.73876953125, -3.59375, -3.44873046875, -3.3037109375, -3.15869140625, -3.013671875, -2.86865234375, -2.7236328125, -2.57861328125, -2.43359375, -2.28857421875, -2.1435546875, -1.99853515625, -1.853515625, -1.70849609375, -1.5634765625, -1.41845703125, -1.2734375, -1.12841796875, -0.9833984375, -0.83837890625, -0.693359375, -0.54833984375, -0.4033203125, -0.25830078125, -0.11328125, 0.03173828125, 0.1767578125, 0.32177734375, 0.466796875, 0.61181640625, 0.7568359375, 0.90185546875, 1.046875, 1.19189453125, 1.3369140625, 1.48193359375, 1.626953125, 1.77197265625, 1.9169921875, 2.06201171875, 2.20703125, 2.35205078125, 2.4970703125, 2.64208984375, 2.787109375, 2.93212890625, 3.0771484375, 3.22216796875, 3.3671875, 3.51220703125, 3.6572265625, 3.80224609375, 3.947265625, 4.09228515625, 4.2373046875, 4.38232421875, 4.52734375]}, "gradients/decoder.transformer.h.9.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 3.0, 0.0, 1.0, 3.0, 6.0, 12.0, 12.0, 19.0, 27.0, 28.0, 45.0, 76.0, 124.0, 173.0, 288.0, 475.0, 782.0, 1355.0, 2416.0, 4244.0, 7330.0, 13182.0, 24686.0, 46583.0, 91255.0, 209737.0, 1429806.0, 127748.0, 63491.0, 33048.0, 17635.0, 9713.0, 5423.0, 2971.0, 1741.0, 1086.0, 637.0, 382.0, 214.0, 128.0, 86.0, 54.0, 36.0, 22.0, 18.0, 11.0, 8.0, 6.0, 3.0, 4.0, 3.0, 3.0, 0.0, 3.0, 2.0, 2.0, 2.0, 0.0, 1.0], "bins": [-2.474609375, -2.3953857421875, -2.316162109375, -2.2369384765625, -2.15771484375, -2.0784912109375, -1.999267578125, -1.9200439453125, -1.8408203125, -1.7615966796875, -1.682373046875, -1.6031494140625, -1.52392578125, -1.4447021484375, -1.365478515625, -1.2862548828125, -1.20703125, -1.1278076171875, -1.048583984375, -0.9693603515625, -0.89013671875, -0.8109130859375, -0.731689453125, -0.6524658203125, -0.5732421875, -0.4940185546875, -0.414794921875, -0.3355712890625, -0.25634765625, -0.1771240234375, -0.097900390625, -0.0186767578125, 0.060546875, 0.1397705078125, 0.218994140625, 0.2982177734375, 0.37744140625, 0.4566650390625, 0.535888671875, 0.6151123046875, 0.6943359375, 0.7735595703125, 0.852783203125, 0.9320068359375, 1.01123046875, 1.0904541015625, 1.169677734375, 1.2489013671875, 1.328125, 1.4073486328125, 1.486572265625, 1.5657958984375, 1.64501953125, 1.7242431640625, 1.803466796875, 1.8826904296875, 1.9619140625, 2.0411376953125, 2.120361328125, 2.1995849609375, 2.27880859375, 2.3580322265625, 2.437255859375, 2.5164794921875, 2.595703125]}, "gradients/decoder.transformer.h.9.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 0.0, 1.0, 4.0, 0.0, 1.0, 3.0, 1.0, 6.0, 7.0, 12.0, 9.0, 15.0, 23.0, 25.0, 30.0, 24.0, 53.0, 71.0, 101.0, 112.0, 134.0, 102.0, 78.0, 51.0, 45.0, 19.0, 15.0, 18.0, 12.0, 8.0, 5.0, 8.0, 6.0, 2.0, 2.0, 3.0, 1.0, 3.0, 0.0, 3.0, 0.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.0023021697998046875, -0.0022348761558532715, -0.0021675825119018555, -0.0021002888679504395, -0.0020329952239990234, -0.0019657015800476074, -0.0018984079360961914, -0.0018311142921447754, -0.0017638206481933594, -0.0016965270042419434, -0.0016292333602905273, -0.0015619397163391113, -0.0014946460723876953, -0.0014273524284362793, -0.0013600587844848633, -0.0012927651405334473, -0.0012254714965820312, -0.0011581778526306152, -0.0010908842086791992, -0.0010235905647277832, -0.0009562969207763672, -0.0008890032768249512, -0.0008217096328735352, -0.0007544159889221191, -0.0006871223449707031, -0.0006198287010192871, -0.0005525350570678711, -0.0004852414131164551, -0.00041794776916503906, -0.00035065412521362305, -0.00028336048126220703, -0.00021606683731079102, -0.000148773193359375, -8.147954940795898e-05, -1.4185905456542969e-05, 5.310773849487305e-05, 0.00012040138244628906, 0.00018769502639770508, 0.0002549886703491211, 0.0003222823143005371, 0.0003895759582519531, 0.00045686960220336914, 0.0005241632461547852, 0.0005914568901062012, 0.0006587505340576172, 0.0007260441780090332, 0.0007933378219604492, 0.0008606314659118652, 0.0009279251098632812, 0.0009952187538146973, 0.0010625123977661133, 0.0011298060417175293, 0.0011970996856689453, 0.0012643933296203613, 0.0013316869735717773, 0.0013989806175231934, 0.0014662742614746094, 0.0015335679054260254, 0.0016008615493774414, 0.0016681551933288574, 0.0017354488372802734, 0.0018027424812316895, 0.0018700361251831055, 0.0019373297691345215, 0.0020046234130859375]}, "gradients/decoder.transformer.h.9.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 2.0, 3.0, 3.0, 2.0, 2.0, 6.0, 3.0, 7.0, 11.0, 8.0, 20.0, 15.0, 27.0, 20.0, 46.0, 76.0, 132.0, 254.0, 1050.0, 1039616.0, 6313.0, 410.0, 180.0, 103.0, 59.0, 42.0, 33.0, 34.0, 26.0, 8.0, 17.0, 13.0, 9.0, 2.0, 3.0, 1.0, 3.0, 3.0, 0.0, 2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.04486083984375, -0.0433506965637207, -0.041840553283691406, -0.04033041000366211, -0.03882026672363281, -0.037310123443603516, -0.03579998016357422, -0.03428983688354492, -0.032779693603515625, -0.03126955032348633, -0.02975940704345703, -0.028249263763427734, -0.026739120483398438, -0.02522897720336914, -0.023718833923339844, -0.022208690643310547, -0.02069854736328125, -0.019188404083251953, -0.017678260803222656, -0.01616811752319336, -0.014657974243164062, -0.013147830963134766, -0.011637687683105469, -0.010127544403076172, -0.008617401123046875, -0.007107257843017578, -0.005597114562988281, -0.004086971282958984, -0.0025768280029296875, -0.0010666847229003906, 0.00044345855712890625, 0.001953601837158203, 0.0034637451171875, 0.004973888397216797, 0.006484031677246094, 0.00799417495727539, 0.009504318237304688, 0.011014461517333984, 0.012524604797363281, 0.014034748077392578, 0.015544891357421875, 0.017055034637451172, 0.01856517791748047, 0.020075321197509766, 0.021585464477539062, 0.02309560775756836, 0.024605751037597656, 0.026115894317626953, 0.02762603759765625, 0.029136180877685547, 0.030646324157714844, 0.03215646743774414, 0.03366661071777344, 0.035176753997802734, 0.03668689727783203, 0.03819704055786133, 0.039707183837890625, 0.04121732711791992, 0.04272747039794922, 0.044237613677978516, 0.04574775695800781, 0.04725790023803711, 0.048768043518066406, 0.0502781867980957, 0.051788330078125]}, "gradients/decoder.transformer.h.9.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 3.0, 2.0, 19.0, 91.0, 443.0, 385.0, 68.0, 6.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0005329780397005379, -0.00044286041520535946, -0.000352742790710181, -0.000262625195318833, -0.00017250757082365453, -8.238994632847607e-05, 7.727649062871933e-06, 9.78452735580504e-05, 0.00018796289805322886, 0.0002780805225484073, 0.0003681981470435858, 0.0004583157424349338, 0.0005484333960339427, 0.0006385509623214602, 0.0007286685868166387, 0.0008187862113118172, 0.0009089038358069956, 0.000999021460302174, 0.0010891390265896916, 0.001179256709292531, 0.0012693742755800486, 0.001359491958282888, 0.0014496095245704055, 0.0015397272072732449, 0.0016298447735607624, 0.00171996233984828, 0.0018100800225511193, 0.0019001975888386369, 0.0019903152715414762, 0.002080432837828994, 0.0021705504041165113, 0.0022606682032346725, 0.0023507855366915464, 0.002440903102979064, 0.0025310206692665815, 0.0026211384683847427, 0.0027112560346722603, 0.002801373600959778, 0.0028914911672472954, 0.002981608733534813, 0.003071726532652974, 0.0031618440989404917, 0.0032519616652280092, 0.0033420794643461704, 0.003432197030633688, 0.0035223145969212055, 0.003612432163208723, 0.0037025497294962406, 0.003792667295783758, 0.0038827848620712757, 0.003972902428358793, 0.004063019994646311, 0.004153137560933828, 0.004243255592882633, 0.004333373159170151, 0.004423490725457668, 0.004513608291745186, 0.004603725858032703, 0.004693843424320221, 0.0047839609906077385, 0.004874078556895256, 0.004964196588844061, 0.0050543141551315784, 0.005144431721419096, 0.0052345492877066135]}, "gradients/decoder.transformer.h.9.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 0.0, 3.0, 0.0, 2.0, 2.0, 2.0, 5.0, 3.0, 5.0, 5.0, 6.0, 8.0, 7.0, 13.0, 6.0, 15.0, 11.0, 15.0, 26.0, 30.0, 22.0, 38.0, 35.0, 39.0, 36.0, 35.0, 41.0, 37.0, 33.0, 38.0, 40.0, 43.0, 35.0, 24.0, 28.0, 43.0, 39.0, 29.0, 28.0, 24.0, 23.0, 18.0, 19.0, 18.0, 17.0, 13.0, 12.0, 13.0, 5.0, 4.0, 7.0, 6.0, 5.0, 2.0, 0.0, 2.0, 3.0, 1.0, 0.0, 2.0], "bins": [-0.0007742047309875488, -0.0007513789460062981, -0.0007285531610250473, -0.0007057273760437965, -0.0006829015910625458, -0.000660075806081295, -0.0006372500211000443, -0.0006144242361187935, -0.0005915984511375427, -0.000568772666156292, -0.0005459468811750412, -0.0005231210961937904, -0.0005002953112125397, -0.0004774695262312889, -0.00045464374125003815, -0.0004318179562687874, -0.0004089921712875366, -0.00038616638630628586, -0.0003633406013250351, -0.00034051481634378433, -0.00031768903136253357, -0.0002948632463812828, -0.00027203746140003204, -0.0002492116764187813, -0.00022638589143753052, -0.00020356010645627975, -0.000180734321475029, -0.00015790853649377823, -0.00013508275151252747, -0.0001122569665312767, -8.943118155002594e-05, -6.660539656877518e-05, -4.3779611587524414e-05, -2.095382660627365e-05, 1.8719583749771118e-06, 2.4697743356227875e-05, 4.752352833747864e-05, 7.03493133187294e-05, 9.317509829998016e-05, 0.00011600088328123093, 0.0001388266682624817, 0.00016165245324373245, 0.00018447823822498322, 0.00020730402320623398, 0.00023012980818748474, 0.0002529555931687355, 0.00027578137814998627, 0.00029860716313123703, 0.0003214329481124878, 0.00034425873309373856, 0.0003670845180749893, 0.0003899103030562401, 0.00041273608803749084, 0.0004355618730187416, 0.00045838765799999237, 0.00048121344298124313, 0.0005040392279624939, 0.0005268650129437447, 0.0005496907979249954, 0.0005725165829062462, 0.000595342367887497, 0.0006181681528687477, 0.0006409939378499985, 0.0006638197228312492, 0.0006866455078125]}, "gradients/decoder.transformer.h.9.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 2.0, 1.0, 3.0, 3.0, 2.0, 4.0, 6.0, 6.0, 6.0, 6.0, 13.0, 13.0, 5.0, 12.0, 12.0, 24.0, 30.0, 26.0, 37.0, 33.0, 41.0, 44.0, 48.0, 34.0, 34.0, 40.0, 43.0, 56.0, 44.0, 29.0, 42.0, 37.0, 28.0, 37.0, 24.0, 24.0, 21.0, 20.0, 23.0, 27.0, 10.0, 11.0, 15.0, 6.0, 13.0, 2.0, 4.0, 5.0, 1.0, 4.0, 0.0, 3.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-7.70703125, -7.469970703125, -7.23291015625, -6.995849609375, -6.7587890625, -6.521728515625, -6.28466796875, -6.047607421875, -5.810546875, -5.573486328125, -5.33642578125, -5.099365234375, -4.8623046875, -4.625244140625, -4.38818359375, -4.151123046875, -3.9140625, -3.677001953125, -3.43994140625, -3.202880859375, -2.9658203125, -2.728759765625, -2.49169921875, -2.254638671875, -2.017578125, -1.780517578125, -1.54345703125, -1.306396484375, -1.0693359375, -0.832275390625, -0.59521484375, -0.358154296875, -0.12109375, 0.115966796875, 0.35302734375, 0.590087890625, 0.8271484375, 1.064208984375, 1.30126953125, 1.538330078125, 1.775390625, 2.012451171875, 2.24951171875, 2.486572265625, 2.7236328125, 2.960693359375, 3.19775390625, 3.434814453125, 3.671875, 3.908935546875, 4.14599609375, 4.383056640625, 4.6201171875, 4.857177734375, 5.09423828125, 5.331298828125, 5.568359375, 5.805419921875, 6.04248046875, 6.279541015625, 6.5166015625, 6.753662109375, 6.99072265625, 7.227783203125, 7.46484375]}, "gradients/decoder.transformer.h.9.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 3.0, 1.0, 3.0, 3.0, 3.0, 6.0, 8.0, 6.0, 8.0, 12.0, 16.0, 19.0, 20.0, 29.0, 30.0, 59.0, 77.0, 78.0, 140.0, 211.0, 286.0, 480.0, 758.0, 1992.0, 10139.0, 87191.0, 764250.0, 160905.0, 16462.0, 2825.0, 963.0, 499.0, 295.0, 205.0, 142.0, 101.0, 73.0, 57.0, 47.0, 51.0, 25.0, 14.0, 19.0, 13.0, 16.0, 5.0, 3.0, 9.0, 2.0, 5.0, 0.0, 3.0, 2.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-14.125, -13.6920166015625, -13.259033203125, -12.8260498046875, -12.39306640625, -11.9600830078125, -11.527099609375, -11.0941162109375, -10.6611328125, -10.2281494140625, -9.795166015625, -9.3621826171875, -8.92919921875, -8.4962158203125, -8.063232421875, -7.6302490234375, -7.197265625, -6.7642822265625, -6.331298828125, -5.8983154296875, -5.46533203125, -5.0323486328125, -4.599365234375, -4.1663818359375, -3.7333984375, -3.3004150390625, -2.867431640625, -2.4344482421875, -2.00146484375, -1.5684814453125, -1.135498046875, -0.7025146484375, -0.26953125, 0.1634521484375, 0.596435546875, 1.0294189453125, 1.46240234375, 1.8953857421875, 2.328369140625, 2.7613525390625, 3.1943359375, 3.6273193359375, 4.060302734375, 4.4932861328125, 4.92626953125, 5.3592529296875, 5.792236328125, 6.2252197265625, 6.658203125, 7.0911865234375, 7.524169921875, 7.9571533203125, 8.39013671875, 8.8231201171875, 9.256103515625, 9.6890869140625, 10.1220703125, 10.5550537109375, 10.988037109375, 11.4210205078125, 11.85400390625, 12.2869873046875, 12.719970703125, 13.1529541015625, 13.5859375]}, "gradients/decoder.transformer.h.9.attn.c_attn.bias": {"_type": "histogram", "values": [4.0, 2.0, 1.0, 3.0, 1.0, 1.0, 2.0, 3.0, 2.0, 8.0, 8.0, 11.0, 10.0, 13.0, 18.0, 18.0, 30.0, 30.0, 33.0, 32.0, 34.0, 36.0, 41.0, 49.0, 51.0, 93.0, 1600.0, 431.0, 84.0, 45.0, 48.0, 39.0, 39.0, 45.0, 29.0, 24.0, 26.0, 29.0, 24.0, 19.0, 12.0, 6.0, 7.0, 5.0, 7.0, 4.0, 1.0, 4.0, 3.0, 2.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-21.703125, -20.89794921875, -20.0927734375, -19.28759765625, -18.482421875, -17.67724609375, -16.8720703125, -16.06689453125, -15.26171875, -14.45654296875, -13.6513671875, -12.84619140625, -12.041015625, -11.23583984375, -10.4306640625, -9.62548828125, -8.8203125, -8.01513671875, -7.2099609375, -6.40478515625, -5.599609375, -4.79443359375, -3.9892578125, -3.18408203125, -2.37890625, -1.57373046875, -0.7685546875, 0.03662109375, 0.841796875, 1.64697265625, 2.4521484375, 3.25732421875, 4.0625, 4.86767578125, 5.6728515625, 6.47802734375, 7.283203125, 8.08837890625, 8.8935546875, 9.69873046875, 10.50390625, 11.30908203125, 12.1142578125, 12.91943359375, 13.724609375, 14.52978515625, 15.3349609375, 16.14013671875, 16.9453125, 17.75048828125, 18.5556640625, 19.36083984375, 20.166015625, 20.97119140625, 21.7763671875, 22.58154296875, 23.38671875, 24.19189453125, 24.9970703125, 25.80224609375, 26.607421875, 27.41259765625, 28.2177734375, 29.02294921875, 29.828125]}, "gradients/decoder.transformer.h.9.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 3.0, 4.0, 2.0, 3.0, 1.0, 7.0, 4.0, 9.0, 5.0, 15.0, 18.0, 20.0, 29.0, 38.0, 28.0, 39.0, 63.0, 83.0, 96.0, 202.0, 342.0, 1144.0, 463488.0, 2677637.0, 1321.0, 430.0, 200.0, 116.0, 68.0, 58.0, 38.0, 32.0, 35.0, 30.0, 24.0, 22.0, 12.0, 11.0, 11.0, 7.0, 8.0, 4.0, 2.0, 0.0, 3.0, 1.0, 4.0, 1.0, 2.0, 3.0], "bins": [-89.1875, -86.7802734375, -84.373046875, -81.9658203125, -79.55859375, -77.1513671875, -74.744140625, -72.3369140625, -69.9296875, -67.5224609375, -65.115234375, -62.7080078125, -60.30078125, -57.8935546875, -55.486328125, -53.0791015625, -50.671875, -48.2646484375, -45.857421875, -43.4501953125, -41.04296875, -38.6357421875, -36.228515625, -33.8212890625, -31.4140625, -29.0068359375, -26.599609375, -24.1923828125, -21.78515625, -19.3779296875, -16.970703125, -14.5634765625, -12.15625, -9.7490234375, -7.341796875, -4.9345703125, -2.52734375, -0.1201171875, 2.287109375, 4.6943359375, 7.1015625, 9.5087890625, 11.916015625, 14.3232421875, 16.73046875, 19.1376953125, 21.544921875, 23.9521484375, 26.359375, 28.7666015625, 31.173828125, 33.5810546875, 35.98828125, 38.3955078125, 40.802734375, 43.2099609375, 45.6171875, 48.0244140625, 50.431640625, 52.8388671875, 55.24609375, 57.6533203125, 60.060546875, 62.4677734375, 64.875]}, "gradients/decoder.transformer.h.9.ln_1.weight": {"_type": "histogram", "values": [2.0, 2.0, 22.0, 152.0, 444.0, 336.0, 54.0, 8.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-10.117131233215332, -8.01663589477539, -5.916140556335449, -3.815645217895508, -1.7151498794555664, 0.385345458984375, 2.4858407974243164, 4.586336135864258, 6.686831474304199, 8.78732681274414, 10.887822151184082, 12.988317489624023, 15.088812828063965, 17.189308166503906, 19.28980255126953, 21.39029884338379, 23.490795135498047, 25.591289520263672, 27.69178581237793, 29.792282104492188, 31.892776489257812, 33.99327087402344, 36.09376525878906, 38.19426345825195, 40.29475784301758, 42.3952522277832, 44.495750427246094, 46.59624481201172, 48.696739196777344, 50.79723358154297, 52.897727966308594, 54.998226165771484, 57.098724365234375, 59.19921875, 61.299713134765625, 63.400211334228516, 65.50070190429688, 67.60120391845703, 69.70169830322266, 71.80219268798828, 73.9026870727539, 76.00318145751953, 78.10367584228516, 80.20417022705078, 82.30467224121094, 84.40516662597656, 86.50566101074219, 88.60615539550781, 90.70664978027344, 92.80714416503906, 94.90763854980469, 97.00813293457031, 99.10862731933594, 101.2091293334961, 103.30962371826172, 105.41011810302734, 107.51061248779297, 109.6111068725586, 111.71160125732422, 113.81209564208984, 115.91259765625, 118.01309204101562, 120.11358642578125, 122.21408081054688, 124.3145751953125]}, "gradients/decoder.transformer.h.9.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 6.0, 4.0, 9.0, 7.0, 13.0, 13.0, 10.0, 9.0, 20.0, 26.0, 21.0, 24.0, 28.0, 31.0, 30.0, 38.0, 41.0, 43.0, 44.0, 42.0, 45.0, 47.0, 49.0, 40.0, 42.0, 21.0, 51.0, 53.0, 29.0, 25.0, 26.0, 15.0, 21.0, 20.0, 12.0, 5.0, 10.0, 7.0, 12.0, 5.0, 4.0, 2.0, 7.0, 3.0, 0.0, 2.0, 2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-69.00862121582031, -66.81301879882812, -64.6174087524414, -62.42180633544922, -60.226200103759766, -58.03059387207031, -55.834991455078125, -53.63938522338867, -51.44377899169922, -49.248172760009766, -47.05256652832031, -44.856964111328125, -42.66135787963867, -40.46575164794922, -38.27014923095703, -36.07454299926758, -33.878936767578125, -31.683330535888672, -29.48772621154785, -27.29212188720703, -25.096515655517578, -22.900909423828125, -20.705305099487305, -18.509700775146484, -16.31409454345703, -14.118489265441895, -11.922883987426758, -9.727278709411621, -7.531673431396484, -5.336068153381348, -3.140462875366211, -0.9448575973510742, 1.2507400512695312, 3.446345329284668, 5.641950607299805, 7.837555885314941, 10.033161163330078, 12.228766441345215, 14.424371719360352, 16.619976043701172, 18.815582275390625, 21.011188507080078, 23.2067928314209, 25.40239715576172, 27.598003387451172, 29.793609619140625, 31.989213943481445, 34.184818267822266, 36.38042449951172, 38.57603073120117, 40.771636962890625, 42.96723937988281, 45.162845611572266, 47.35845184326172, 49.554054260253906, 51.74966049194336, 53.94526672363281, 56.140872955322266, 58.33647918701172, 60.532081604003906, 62.72768783569336, 64.92329406738281, 67.118896484375, 69.31450653076172, 71.5101089477539]}, "gradients/decoder.transformer.h.8.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 2.0, 4.0, 2.0, 2.0, 4.0, 6.0, 8.0, 10.0, 14.0, 8.0, 7.0, 8.0, 9.0, 22.0, 30.0, 26.0, 32.0, 27.0, 39.0, 26.0, 46.0, 43.0, 43.0, 37.0, 33.0, 35.0, 55.0, 47.0, 39.0, 30.0, 38.0, 32.0, 24.0, 25.0, 27.0, 27.0, 19.0, 19.0, 23.0, 19.0, 11.0, 9.0, 8.0, 11.0, 8.0, 2.0, 7.0, 1.0, 1.0, 3.0, 2.0, 0.0, 2.0, 2.0, 1.0, 0.0, 2.0], "bins": [-7.8828125, -7.63946533203125, -7.3961181640625, -7.15277099609375, -6.909423828125, -6.66607666015625, -6.4227294921875, -6.17938232421875, -5.93603515625, -5.69268798828125, -5.4493408203125, -5.20599365234375, -4.962646484375, -4.71929931640625, -4.4759521484375, -4.23260498046875, -3.9892578125, -3.74591064453125, -3.5025634765625, -3.25921630859375, -3.015869140625, -2.77252197265625, -2.5291748046875, -2.28582763671875, -2.04248046875, -1.79913330078125, -1.5557861328125, -1.31243896484375, -1.069091796875, -0.82574462890625, -0.5823974609375, -0.33905029296875, -0.095703125, 0.14764404296875, 0.3909912109375, 0.63433837890625, 0.877685546875, 1.12103271484375, 1.3643798828125, 1.60772705078125, 1.85107421875, 2.09442138671875, 2.3377685546875, 2.58111572265625, 2.824462890625, 3.06781005859375, 3.3111572265625, 3.55450439453125, 3.7978515625, 4.04119873046875, 4.2845458984375, 4.52789306640625, 4.771240234375, 5.01458740234375, 5.2579345703125, 5.50128173828125, 5.74462890625, 5.98797607421875, 6.2313232421875, 6.47467041015625, 6.718017578125, 6.96136474609375, 7.2047119140625, 7.44805908203125, 7.69140625]}, "gradients/decoder.transformer.h.8.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 3.0, 2.0, 5.0, 8.0, 9.0, 14.0, 9.0, 18.0, 24.0, 22.0, 33.0, 56.0, 51.0, 73.0, 79.0, 139.0, 217.0, 286.0, 475.0, 972.0, 2003.0, 5211.0, 15737.0, 60859.0, 267478.0, 916277.0, 1588448.0, 964583.0, 280282.0, 64495.0, 16556.0, 5245.0, 2099.0, 991.0, 505.0, 283.0, 175.0, 132.0, 107.0, 78.0, 48.0, 57.0, 33.0, 27.0, 20.0, 22.0, 13.0, 8.0, 11.0, 8.0, 4.0, 5.0, 2.0, 0.0, 2.0, 1.0, 0.0, 1.0], "bins": [-9.015625, -8.7384033203125, -8.461181640625, -8.1839599609375, -7.90673828125, -7.6295166015625, -7.352294921875, -7.0750732421875, -6.7978515625, -6.5206298828125, -6.243408203125, -5.9661865234375, -5.68896484375, -5.4117431640625, -5.134521484375, -4.8572998046875, -4.580078125, -4.3028564453125, -4.025634765625, -3.7484130859375, -3.47119140625, -3.1939697265625, -2.916748046875, -2.6395263671875, -2.3623046875, -2.0850830078125, -1.807861328125, -1.5306396484375, -1.25341796875, -0.9761962890625, -0.698974609375, -0.4217529296875, -0.14453125, 0.1326904296875, 0.409912109375, 0.6871337890625, 0.96435546875, 1.2415771484375, 1.518798828125, 1.7960205078125, 2.0732421875, 2.3504638671875, 2.627685546875, 2.9049072265625, 3.18212890625, 3.4593505859375, 3.736572265625, 4.0137939453125, 4.291015625, 4.5682373046875, 4.845458984375, 5.1226806640625, 5.39990234375, 5.6771240234375, 5.954345703125, 6.2315673828125, 6.5087890625, 6.7860107421875, 7.063232421875, 7.3404541015625, 7.61767578125, 7.8948974609375, 8.172119140625, 8.4493408203125, 8.7265625]}, "gradients/decoder.transformer.h.8.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 4.0, 6.0, 10.0, 10.0, 11.0, 12.0, 21.0, 36.0, 35.0, 47.0, 55.0, 56.0, 109.0, 123.0, 135.0, 177.0, 241.0, 347.0, 417.0, 442.0, 401.0, 318.0, 239.0, 156.0, 132.0, 125.0, 92.0, 70.0, 66.0, 35.0, 42.0, 29.0, 24.0, 17.0, 13.0, 10.0, 9.0, 3.0, 2.0, 1.0, 2.0, 0.0, 1.0, 2.0, 0.0, 2.0, 2.0], "bins": [-15.5546875, -15.127197265625, -14.69970703125, -14.272216796875, -13.8447265625, -13.417236328125, -12.98974609375, -12.562255859375, -12.134765625, -11.707275390625, -11.27978515625, -10.852294921875, -10.4248046875, -9.997314453125, -9.56982421875, -9.142333984375, -8.71484375, -8.287353515625, -7.85986328125, -7.432373046875, -7.0048828125, -6.577392578125, -6.14990234375, -5.722412109375, -5.294921875, -4.867431640625, -4.43994140625, -4.012451171875, -3.5849609375, -3.157470703125, -2.72998046875, -2.302490234375, -1.875, -1.447509765625, -1.02001953125, -0.592529296875, -0.1650390625, 0.262451171875, 0.68994140625, 1.117431640625, 1.544921875, 1.972412109375, 2.39990234375, 2.827392578125, 3.2548828125, 3.682373046875, 4.10986328125, 4.537353515625, 4.96484375, 5.392333984375, 5.81982421875, 6.247314453125, 6.6748046875, 7.102294921875, 7.52978515625, 7.957275390625, 8.384765625, 8.812255859375, 9.23974609375, 9.667236328125, 10.0947265625, 10.522216796875, 10.94970703125, 11.377197265625, 11.8046875]}, "gradients/decoder.transformer.h.8.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 5.0, 3.0, 2.0, 5.0, 4.0, 7.0, 13.0, 14.0, 15.0, 27.0, 32.0, 33.0, 48.0, 66.0, 80.0, 92.0, 106.0, 129.0, 161.0, 170.0, 220.0, 283.0, 454.0, 1202.0, 15389.0, 4112405.0, 59593.0, 1613.0, 553.0, 295.0, 244.0, 207.0, 152.0, 126.0, 104.0, 86.0, 85.0, 66.0, 60.0, 26.0, 31.0, 24.0, 20.0, 12.0, 8.0, 7.0, 9.0, 4.0, 1.0, 3.0, 3.0, 2.0, 1.0, 0.0, 1.0], "bins": [-72.8125, -70.70849609375, -68.6044921875, -66.50048828125, -64.396484375, -62.29248046875, -60.1884765625, -58.08447265625, -55.98046875, -53.87646484375, -51.7724609375, -49.66845703125, -47.564453125, -45.46044921875, -43.3564453125, -41.25244140625, -39.1484375, -37.04443359375, -34.9404296875, -32.83642578125, -30.732421875, -28.62841796875, -26.5244140625, -24.42041015625, -22.31640625, -20.21240234375, -18.1083984375, -16.00439453125, -13.900390625, -11.79638671875, -9.6923828125, -7.58837890625, -5.484375, -3.38037109375, -1.2763671875, 0.82763671875, 2.931640625, 5.03564453125, 7.1396484375, 9.24365234375, 11.34765625, 13.45166015625, 15.5556640625, 17.65966796875, 19.763671875, 21.86767578125, 23.9716796875, 26.07568359375, 28.1796875, 30.28369140625, 32.3876953125, 34.49169921875, 36.595703125, 38.69970703125, 40.8037109375, 42.90771484375, 45.01171875, 47.11572265625, 49.2197265625, 51.32373046875, 53.427734375, 55.53173828125, 57.6357421875, 59.73974609375, 61.84375]}, "gradients/decoder.transformer.h.8.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 82.0, 581.0, 336.0, 15.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-274.33038330078125, -265.9565734863281, -257.5827941894531, -249.208984375, -240.83517456054688, -232.4613800048828, -224.08758544921875, -215.71377563476562, -207.3399658203125, -198.96617126464844, -190.5923614501953, -182.21856689453125, -173.84475708007812, -165.47096252441406, -157.09716796875, -148.72335815429688, -140.3495635986328, -131.97576904296875, -123.60195922851562, -115.22816467285156, -106.85435485839844, -98.48056030273438, -90.10675811767578, -81.73295593261719, -73.3591537475586, -64.9853515625, -56.611549377441406, -48.23775100708008, -39.863948822021484, -31.49014663696289, -23.116348266601562, -14.742546081542969, -6.368743896484375, 2.0050573348999023, 10.37885856628418, 18.75265884399414, 27.126461029052734, 35.50026321411133, 43.874061584472656, 52.24786376953125, 60.621665954589844, 68.99546813964844, 77.36927032470703, 85.74307250976562, 94.11686706542969, 102.49067687988281, 110.86447143554688, 119.23827362060547, 127.61207580566406, 135.98587036132812, 144.35968017578125, 152.7334747314453, 161.10728454589844, 169.4810791015625, 177.85488891601562, 186.2286834716797, 194.60247802734375, 202.9762725830078, 211.35008239746094, 219.723876953125, 228.09768676757812, 236.4714813232422, 244.84527587890625, 253.21908569335938, 261.5928955078125]}, "gradients/decoder.transformer.h.8.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 2.0, 1.0, 1.0, 6.0, 10.0, 7.0, 10.0, 8.0, 13.0, 13.0, 10.0, 18.0, 26.0, 20.0, 25.0, 25.0, 33.0, 22.0, 22.0, 38.0, 36.0, 37.0, 33.0, 44.0, 46.0, 38.0, 40.0, 37.0, 36.0, 35.0, 39.0, 31.0, 25.0, 22.0, 30.0, 23.0, 20.0, 20.0, 20.0, 18.0, 13.0, 13.0, 14.0, 1.0, 8.0, 5.0, 5.0, 2.0, 3.0, 3.0, 1.0, 1.0, 3.0, 3.0, 3.0, 1.0], "bins": [-51.42364501953125, -49.89433670043945, -48.36502456665039, -46.835716247558594, -45.30640411376953, -43.777095794677734, -42.24778366088867, -40.718475341796875, -39.18916320800781, -37.659854888916016, -36.13054275512695, -34.601234436035156, -33.071922302246094, -31.542612075805664, -30.013301849365234, -28.483993530273438, -26.954683303833008, -25.425373077392578, -23.89606285095215, -22.36675262451172, -20.83744239807129, -19.30813217163086, -17.778823852539062, -16.24951171875, -14.720202445983887, -13.190892219543457, -11.661581993103027, -10.132272720336914, -8.602962493896484, -7.0736517906188965, -5.544342041015625, -4.015031814575195, -2.4857215881347656, -0.9564114809036255, 0.5728986263275146, 2.1022086143493652, 3.631518840789795, 5.160829067230225, 6.690138816833496, 8.219449043273926, 9.748759269714355, 11.278069496154785, 12.807379722595215, 14.336688995361328, 15.865999221801758, 17.395309448242188, 18.924619674682617, 20.453929901123047, 21.983240127563477, 23.512550354003906, 25.041860580444336, 26.571170806884766, 28.100481033325195, 29.629791259765625, 31.159099578857422, 32.688411712646484, 34.21772003173828, 35.74702835083008, 37.27634048461914, 38.80564880371094, 40.3349609375, 41.8642692565918, 43.39358139038086, 44.922889709472656, 46.45220184326172]}, "gradients/decoder.transformer.h.8.crossattention.c_proj.bias": {"_type": "histogram", "values": [5.0, 2.0, 1.0, 4.0, 1.0, 5.0, 5.0, 3.0, 4.0, 5.0, 4.0, 4.0, 7.0, 7.0, 8.0, 14.0, 8.0, 12.0, 24.0, 18.0, 22.0, 22.0, 21.0, 34.0, 25.0, 33.0, 30.0, 38.0, 49.0, 44.0, 42.0, 26.0, 41.0, 46.0, 40.0, 38.0, 32.0, 30.0, 24.0, 31.0, 23.0, 17.0, 33.0, 17.0, 17.0, 15.0, 13.0, 15.0, 13.0, 11.0, 9.0, 1.0, 5.0, 9.0, 3.0, 2.0, 2.0, 1.0, 2.0, 1.0, 1.0, 2.0, 1.0, 2.0], "bins": [-6.63671875, -6.42572021484375, -6.2147216796875, -6.00372314453125, -5.792724609375, -5.58172607421875, -5.3707275390625, -5.15972900390625, -4.94873046875, -4.73773193359375, -4.5267333984375, -4.31573486328125, -4.104736328125, -3.89373779296875, -3.6827392578125, -3.47174072265625, -3.2607421875, -3.04974365234375, -2.8387451171875, -2.62774658203125, -2.416748046875, -2.20574951171875, -1.9947509765625, -1.78375244140625, -1.57275390625, -1.36175537109375, -1.1507568359375, -0.93975830078125, -0.728759765625, -0.51776123046875, -0.3067626953125, -0.09576416015625, 0.115234375, 0.32623291015625, 0.5372314453125, 0.74822998046875, 0.959228515625, 1.17022705078125, 1.3812255859375, 1.59222412109375, 1.80322265625, 2.01422119140625, 2.2252197265625, 2.43621826171875, 2.647216796875, 2.85821533203125, 3.0692138671875, 3.28021240234375, 3.4912109375, 3.70220947265625, 3.9132080078125, 4.12420654296875, 4.335205078125, 4.54620361328125, 4.7572021484375, 4.96820068359375, 5.17919921875, 5.39019775390625, 5.6011962890625, 5.81219482421875, 6.023193359375, 6.23419189453125, 6.4451904296875, 6.65618896484375, 6.8671875]}, "gradients/decoder.transformer.h.8.crossattention.c_proj.weight": {"_type": "histogram", "values": [5.0, 2.0, 6.0, 13.0, 5.0, 16.0, 13.0, 21.0, 30.0, 33.0, 48.0, 79.0, 127.0, 184.0, 261.0, 325.0, 519.0, 685.0, 935.0, 1343.0, 1910.0, 2746.0, 3933.0, 5871.0, 8534.0, 12790.0, 18987.0, 28832.0, 45174.0, 76976.0, 160881.0, 331865.0, 146180.0, 71991.0, 43002.0, 27339.0, 18069.0, 12004.0, 8241.0, 5630.0, 3963.0, 2680.0, 1922.0, 1244.0, 1006.0, 645.0, 457.0, 340.0, 217.0, 142.0, 109.0, 73.0, 52.0, 34.0, 23.0, 19.0, 13.0, 7.0, 8.0, 7.0, 5.0, 2.0, 1.0, 2.0], "bins": [-1.6875, -1.6338348388671875, -1.580169677734375, -1.5265045166015625, -1.47283935546875, -1.4191741943359375, -1.365509033203125, -1.3118438720703125, -1.2581787109375, -1.2045135498046875, -1.150848388671875, -1.0971832275390625, -1.04351806640625, -0.9898529052734375, -0.936187744140625, -0.8825225830078125, -0.828857421875, -0.7751922607421875, -0.721527099609375, -0.6678619384765625, -0.61419677734375, -0.5605316162109375, -0.506866455078125, -0.4532012939453125, -0.3995361328125, -0.3458709716796875, -0.292205810546875, -0.2385406494140625, -0.18487548828125, -0.1312103271484375, -0.077545166015625, -0.0238800048828125, 0.02978515625, 0.0834503173828125, 0.137115478515625, 0.1907806396484375, 0.24444580078125, 0.2981109619140625, 0.351776123046875, 0.4054412841796875, 0.4591064453125, 0.5127716064453125, 0.566436767578125, 0.6201019287109375, 0.67376708984375, 0.7274322509765625, 0.781097412109375, 0.8347625732421875, 0.888427734375, 0.9420928955078125, 0.995758056640625, 1.0494232177734375, 1.10308837890625, 1.1567535400390625, 1.210418701171875, 1.2640838623046875, 1.3177490234375, 1.3714141845703125, 1.425079345703125, 1.4787445068359375, 1.53240966796875, 1.5860748291015625, 1.639739990234375, 1.6934051513671875, 1.7470703125]}, "gradients/decoder.transformer.h.8.crossattention.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 3.0, 4.0, 3.0, 5.0, 1.0, 5.0, 8.0, 6.0, 4.0, 9.0, 12.0, 17.0, 18.0, 19.0, 18.0, 28.0, 36.0, 34.0, 28.0, 41.0, 34.0, 33.0, 47.0, 47.0, 50.0, 1067.0, 38.0, 42.0, 35.0, 36.0, 33.0, 27.0, 29.0, 28.0, 22.0, 22.0, 27.0, 18.0, 19.0, 15.0, 19.0, 8.0, 14.0, 5.0, 5.0, 4.0, 5.0, 4.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 3.0], "bins": [-4.94140625, -4.79248046875, -4.6435546875, -4.49462890625, -4.345703125, -4.19677734375, -4.0478515625, -3.89892578125, -3.75, -3.60107421875, -3.4521484375, -3.30322265625, -3.154296875, -3.00537109375, -2.8564453125, -2.70751953125, -2.55859375, -2.40966796875, -2.2607421875, -2.11181640625, -1.962890625, -1.81396484375, -1.6650390625, -1.51611328125, -1.3671875, -1.21826171875, -1.0693359375, -0.92041015625, -0.771484375, -0.62255859375, -0.4736328125, -0.32470703125, -0.17578125, -0.02685546875, 0.1220703125, 0.27099609375, 0.419921875, 0.56884765625, 0.7177734375, 0.86669921875, 1.015625, 1.16455078125, 1.3134765625, 1.46240234375, 1.611328125, 1.76025390625, 1.9091796875, 2.05810546875, 2.20703125, 2.35595703125, 2.5048828125, 2.65380859375, 2.802734375, 2.95166015625, 3.1005859375, 3.24951171875, 3.3984375, 3.54736328125, 3.6962890625, 3.84521484375, 3.994140625, 4.14306640625, 4.2919921875, 4.44091796875, 4.58984375]}, "gradients/decoder.transformer.h.8.crossattention.c_attn.weight": {"_type": "histogram", "values": [3.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 2.0, 5.0, 5.0, 13.0, 9.0, 9.0, 32.0, 30.0, 61.0, 77.0, 127.0, 174.0, 351.0, 511.0, 817.0, 1429.0, 2440.0, 4498.0, 7868.0, 14701.0, 28080.0, 56392.0, 119840.0, 1440580.0, 234057.0, 91266.0, 44079.0, 22548.0, 11882.0, 6500.0, 3573.0, 2099.0, 1198.0, 736.0, 443.0, 249.0, 150.0, 99.0, 77.0, 51.0, 19.0, 14.0, 14.0, 9.0, 7.0, 6.0, 3.0, 3.0, 4.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0], "bins": [-2.515625, -2.43402099609375, -2.3524169921875, -2.27081298828125, -2.189208984375, -2.10760498046875, -2.0260009765625, -1.94439697265625, -1.86279296875, -1.78118896484375, -1.6995849609375, -1.61798095703125, -1.536376953125, -1.45477294921875, -1.3731689453125, -1.29156494140625, -1.2099609375, -1.12835693359375, -1.0467529296875, -0.96514892578125, -0.883544921875, -0.80194091796875, -0.7203369140625, -0.63873291015625, -0.55712890625, -0.47552490234375, -0.3939208984375, -0.31231689453125, -0.230712890625, -0.14910888671875, -0.0675048828125, 0.01409912109375, 0.095703125, 0.17730712890625, 0.2589111328125, 0.34051513671875, 0.422119140625, 0.50372314453125, 0.5853271484375, 0.66693115234375, 0.74853515625, 0.83013916015625, 0.9117431640625, 0.99334716796875, 1.074951171875, 1.15655517578125, 1.2381591796875, 1.31976318359375, 1.4013671875, 1.48297119140625, 1.5645751953125, 1.64617919921875, 1.727783203125, 1.80938720703125, 1.8909912109375, 1.97259521484375, 2.05419921875, 2.13580322265625, 2.2174072265625, 2.29901123046875, 2.380615234375, 2.46221923828125, 2.5438232421875, 2.62542724609375, 2.70703125]}, "gradients/decoder.transformer.h.8.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 2.0, 3.0, 4.0, 3.0, 3.0, 6.0, 18.0, 11.0, 12.0, 15.0, 19.0, 13.0, 32.0, 30.0, 36.0, 31.0, 59.0, 52.0, 62.0, 80.0, 70.0, 84.0, 55.0, 48.0, 45.0, 40.0, 35.0, 17.0, 18.0, 17.0, 20.0, 17.0, 6.0, 16.0, 3.0, 6.0, 8.0, 3.0, 1.0, 3.0, 3.0, 2.0, 3.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.0011148452758789062, -0.0010750442743301392, -0.001035243272781372, -0.000995442271232605, -0.0009556412696838379, -0.0009158402681350708, -0.0008760392665863037, -0.0008362382650375366, -0.0007964372634887695, -0.0007566362619400024, -0.0007168352603912354, -0.0006770342588424683, -0.0006372332572937012, -0.0005974322557449341, -0.000557631254196167, -0.0005178302526473999, -0.0004780292510986328, -0.0004382282495498657, -0.00039842724800109863, -0.00035862624645233154, -0.00031882524490356445, -0.00027902424335479736, -0.00023922324180603027, -0.00019942224025726318, -0.0001596212387084961, -0.000119820237159729, -8.001923561096191e-05, -4.0218234062194824e-05, -4.172325134277344e-07, 3.9383769035339355e-05, 7.918477058410645e-05, 0.00011898577213287354, 0.00015878677368164062, 0.00019858777523040771, 0.0002383887767791748, 0.0002781897783279419, 0.000317990779876709, 0.0003577917814254761, 0.00039759278297424316, 0.00043739378452301025, 0.00047719478607177734, 0.0005169957876205444, 0.0005567967891693115, 0.0005965977907180786, 0.0006363987922668457, 0.0006761997938156128, 0.0007160007953643799, 0.000755801796913147, 0.0007956027984619141, 0.0008354038000106812, 0.0008752048015594482, 0.0009150058031082153, 0.0009548068046569824, 0.0009946078062057495, 0.0010344088077545166, 0.0010742098093032837, 0.0011140108108520508, 0.0011538118124008179, 0.001193612813949585, 0.001233413815498352, 0.0012732148170471191, 0.0013130158185958862, 0.0013528168201446533, 0.0013926178216934204, 0.0014324188232421875]}, "gradients/decoder.transformer.h.8.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 5.0, 0.0, 2.0, 5.0, 2.0, 2.0, 6.0, 7.0, 4.0, 6.0, 11.0, 20.0, 15.0, 23.0, 22.0, 24.0, 35.0, 37.0, 71.0, 86.0, 174.0, 301.0, 1018.0, 911604.0, 133594.0, 760.0, 249.0, 139.0, 84.0, 61.0, 39.0, 35.0, 26.0, 19.0, 24.0, 9.0, 10.0, 8.0, 10.0, 7.0, 3.0, 4.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0], "bins": [-0.036529541015625, -0.035539865493774414, -0.03455018997192383, -0.03356051445007324, -0.032570838928222656, -0.03158116340637207, -0.030591487884521484, -0.0296018123626709, -0.028612136840820312, -0.027622461318969727, -0.02663278579711914, -0.025643110275268555, -0.02465343475341797, -0.023663759231567383, -0.022674083709716797, -0.02168440818786621, -0.020694732666015625, -0.01970505714416504, -0.018715381622314453, -0.017725706100463867, -0.01673603057861328, -0.015746355056762695, -0.01475667953491211, -0.013767004013061523, -0.012777328491210938, -0.011787652969360352, -0.010797977447509766, -0.00980830192565918, -0.008818626403808594, -0.007828950881958008, -0.006839275360107422, -0.005849599838256836, -0.00485992431640625, -0.003870248794555664, -0.002880573272705078, -0.0018908977508544922, -0.0009012222290039062, 8.845329284667969e-05, 0.0010781288146972656, 0.0020678043365478516, 0.0030574798583984375, 0.0040471553802490234, 0.005036830902099609, 0.006026506423950195, 0.007016181945800781, 0.008005857467651367, 0.008995532989501953, 0.009985208511352539, 0.010974884033203125, 0.011964559555053711, 0.012954235076904297, 0.013943910598754883, 0.014933586120605469, 0.015923261642456055, 0.01691293716430664, 0.017902612686157227, 0.018892288208007812, 0.0198819637298584, 0.020871639251708984, 0.02186131477355957, 0.022850990295410156, 0.023840665817260742, 0.024830341339111328, 0.025820016860961914, 0.0268096923828125]}, "gradients/decoder.transformer.h.8.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 179.0, 785.0, 51.0, 2.0, 0.0, 0.0, 1.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0006613454315811396, -0.0005097120301797986, -0.00035807868698611856, -0.00020644531468860805, -5.4811942391097546e-05, 9.682145901024342e-05, 0.00024845480220392346, 0.0004000881453976035, 0.0005517215467989445, 0.0007033549482002854, 0.0008549882913939655, 0.0010066216345876455, 0.0011582550359889865, 0.0013098884373903275, 0.0014615217223763466, 0.0016131551237776875, 0.0017647885251790285, 0.0019164219265803695, 0.0020680553279817104, 0.0022196886129677296, 0.0023713218979537487, 0.0025229554157704115, 0.0026745887007564306, 0.0028262222185730934, 0.0029778555035591125, 0.0031294887885451317, 0.0032811223063617945, 0.0034327555913478136, 0.0035843891091644764, 0.0037360223941504955, 0.0038876556791365147, 0.004039288964122534, 0.00419092271476984, 0.004342555999755859, 0.0044941892847418785, 0.004645823035389185, 0.004797456320375204, 0.004949089605361223, 0.005100722890347242, 0.0052523561753332615, 0.005403989925980568, 0.005555623210966587, 0.005707256495952606, 0.005858890246599913, 0.006010523531585932, 0.006162156816571951, 0.00631379010155797, 0.006465423386543989, 0.006617056671530008, 0.0067686899565160275, 0.006920323241502047, 0.007071956992149353, 0.007223590277135372, 0.007375223562121391, 0.00752685684710741, 0.0076784901320934296, 0.007830123417079449, 0.007981756702065468, 0.008133389987051487, 0.008285023272037506, 0.008436656557023525, 0.008588289842009544, 0.008739924058318138, 0.008891557343304157, 0.009043190628290176]}, "gradients/decoder.transformer.h.8.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 4.0, 4.0, 2.0, 3.0, 4.0, 7.0, 10.0, 7.0, 13.0, 8.0, 12.0, 22.0, 15.0, 23.0, 29.0, 29.0, 34.0, 34.0, 29.0, 27.0, 31.0, 41.0, 37.0, 31.0, 38.0, 35.0, 38.0, 50.0, 41.0, 42.0, 33.0, 29.0, 43.0, 28.0, 22.0, 25.0, 23.0, 25.0, 12.0, 15.0, 12.0, 15.0, 4.0, 6.0, 10.0, 2.0, 3.0, 4.0, 2.0, 0.0, 3.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.0006281137466430664, -0.0006086360663175583, -0.0005891583859920502, -0.000569680705666542, -0.0005502030253410339, -0.0005307253450155258, -0.0005112476646900177, -0.0004917699843645096, -0.00047229230403900146, -0.00045281462371349335, -0.00043333694338798523, -0.0004138592630624771, -0.000394381582736969, -0.0003749039024114609, -0.00035542622208595276, -0.00033594854176044464, -0.0003164708614349365, -0.0002969931811094284, -0.0002775155007839203, -0.00025803782045841217, -0.00023856014013290405, -0.00021908245980739594, -0.00019960477948188782, -0.0001801270991563797, -0.00016064941883087158, -0.00014117173850536346, -0.00012169405817985535, -0.00010221637785434723, -8.273869752883911e-05, -6.3261017203331e-05, -4.3783336877822876e-05, -2.4305656552314758e-05, -4.827976226806641e-06, 1.4649704098701477e-05, 3.4127384424209595e-05, 5.360506474971771e-05, 7.308274507522583e-05, 9.256042540073395e-05, 0.00011203810572624207, 0.00013151578605175018, 0.0001509934663772583, 0.00017047114670276642, 0.00018994882702827454, 0.00020942650735378265, 0.00022890418767929077, 0.0002483818680047989, 0.000267859548330307, 0.0002873372286558151, 0.00030681490898132324, 0.00032629258930683136, 0.0003457702696323395, 0.0003652479499578476, 0.0003847256302833557, 0.00040420331060886383, 0.00042368099093437195, 0.00044315867125988007, 0.0004626363515853882, 0.0004821140319108963, 0.0005015917122364044, 0.0005210693925619125, 0.0005405470728874207, 0.0005600247532129288, 0.0005795024335384369, 0.000598980113863945, 0.0006184577941894531]}, "gradients/decoder.transformer.h.8.attn.c_proj.bias": {"_type": "histogram", "values": [5.0, 2.0, 1.0, 4.0, 1.0, 5.0, 5.0, 3.0, 4.0, 5.0, 4.0, 4.0, 7.0, 7.0, 8.0, 14.0, 8.0, 12.0, 24.0, 18.0, 22.0, 22.0, 21.0, 34.0, 25.0, 33.0, 30.0, 38.0, 49.0, 44.0, 42.0, 26.0, 41.0, 46.0, 40.0, 38.0, 32.0, 30.0, 24.0, 31.0, 23.0, 17.0, 33.0, 17.0, 17.0, 15.0, 13.0, 15.0, 13.0, 11.0, 9.0, 1.0, 5.0, 9.0, 3.0, 2.0, 2.0, 1.0, 2.0, 1.0, 1.0, 2.0, 1.0, 2.0], "bins": [-6.63671875, -6.42572021484375, -6.2147216796875, -6.00372314453125, -5.792724609375, -5.58172607421875, -5.3707275390625, -5.15972900390625, -4.94873046875, -4.73773193359375, -4.5267333984375, -4.31573486328125, -4.104736328125, -3.89373779296875, -3.6827392578125, -3.47174072265625, -3.2607421875, -3.04974365234375, -2.8387451171875, -2.62774658203125, -2.416748046875, -2.20574951171875, -1.9947509765625, -1.78375244140625, -1.57275390625, -1.36175537109375, -1.1507568359375, -0.93975830078125, -0.728759765625, -0.51776123046875, -0.3067626953125, -0.09576416015625, 0.115234375, 0.32623291015625, 0.5372314453125, 0.74822998046875, 0.959228515625, 1.17022705078125, 1.3812255859375, 1.59222412109375, 1.80322265625, 2.01422119140625, 2.2252197265625, 2.43621826171875, 2.647216796875, 2.85821533203125, 3.0692138671875, 3.28021240234375, 3.4912109375, 3.70220947265625, 3.9132080078125, 4.12420654296875, 4.335205078125, 4.54620361328125, 4.7572021484375, 4.96820068359375, 5.17919921875, 5.39019775390625, 5.6011962890625, 5.81219482421875, 6.023193359375, 6.23419189453125, 6.4451904296875, 6.65618896484375, 6.8671875]}, "gradients/decoder.transformer.h.8.attn.c_proj.weight": {"_type": "histogram", "values": [3.0, 1.0, 3.0, 5.0, 3.0, 5.0, 12.0, 6.0, 6.0, 7.0, 8.0, 8.0, 9.0, 14.0, 15.0, 24.0, 38.0, 35.0, 60.0, 86.0, 130.0, 208.0, 358.0, 634.0, 1267.0, 2478.0, 5356.0, 11712.0, 25965.0, 59415.0, 150985.0, 353505.0, 261342.0, 98953.0, 41463.0, 18258.0, 8317.0, 3722.0, 1818.0, 917.0, 519.0, 327.0, 161.0, 119.0, 68.0, 57.0, 34.0, 30.0, 20.0, 20.0, 15.0, 10.0, 9.0, 8.0, 5.0, 2.0, 5.0, 2.0, 2.0, 4.0, 3.0, 0.0, 3.0, 2.0], "bins": [-5.421875, -5.25128173828125, -5.0806884765625, -4.91009521484375, -4.739501953125, -4.56890869140625, -4.3983154296875, -4.22772216796875, -4.05712890625, -3.88653564453125, -3.7159423828125, -3.54534912109375, -3.374755859375, -3.20416259765625, -3.0335693359375, -2.86297607421875, -2.6923828125, -2.52178955078125, -2.3511962890625, -2.18060302734375, -2.010009765625, -1.83941650390625, -1.6688232421875, -1.49822998046875, -1.32763671875, -1.15704345703125, -0.9864501953125, -0.81585693359375, -0.645263671875, -0.47467041015625, -0.3040771484375, -0.13348388671875, 0.037109375, 0.20770263671875, 0.3782958984375, 0.54888916015625, 0.719482421875, 0.89007568359375, 1.0606689453125, 1.23126220703125, 1.40185546875, 1.57244873046875, 1.7430419921875, 1.91363525390625, 2.084228515625, 2.25482177734375, 2.4254150390625, 2.59600830078125, 2.7666015625, 2.93719482421875, 3.1077880859375, 3.27838134765625, 3.448974609375, 3.61956787109375, 3.7901611328125, 3.96075439453125, 4.13134765625, 4.30194091796875, 4.4725341796875, 4.64312744140625, 4.813720703125, 4.98431396484375, 5.1549072265625, 5.32550048828125, 5.49609375]}, "gradients/decoder.transformer.h.8.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 2.0, 1.0, 1.0, 2.0, 2.0, 2.0, 4.0, 2.0, 4.0, 6.0, 13.0, 6.0, 6.0, 13.0, 15.0, 18.0, 12.0, 17.0, 19.0, 21.0, 35.0, 26.0, 29.0, 40.0, 28.0, 48.0, 65.0, 92.0, 374.0, 1511.0, 140.0, 71.0, 59.0, 37.0, 31.0, 38.0, 33.0, 29.0, 23.0, 28.0, 35.0, 10.0, 15.0, 15.0, 17.0, 13.0, 14.0, 12.0, 9.0, 5.0, 2.0, 6.0, 2.0, 3.0, 3.0, 2.0, 0.0, 2.0, 0.0, 1.0, 0.0, 1.0], "bins": [-21.140625, -20.461669921875, -19.78271484375, -19.103759765625, -18.4248046875, -17.745849609375, -17.06689453125, -16.387939453125, -15.708984375, -15.030029296875, -14.35107421875, -13.672119140625, -12.9931640625, -12.314208984375, -11.63525390625, -10.956298828125, -10.27734375, -9.598388671875, -8.91943359375, -8.240478515625, -7.5615234375, -6.882568359375, -6.20361328125, -5.524658203125, -4.845703125, -4.166748046875, -3.48779296875, -2.808837890625, -2.1298828125, -1.450927734375, -0.77197265625, -0.093017578125, 0.5859375, 1.264892578125, 1.94384765625, 2.622802734375, 3.3017578125, 3.980712890625, 4.65966796875, 5.338623046875, 6.017578125, 6.696533203125, 7.37548828125, 8.054443359375, 8.7333984375, 9.412353515625, 10.09130859375, 10.770263671875, 11.44921875, 12.128173828125, 12.80712890625, 13.486083984375, 14.1650390625, 14.843994140625, 15.52294921875, 16.201904296875, 16.880859375, 17.559814453125, 18.23876953125, 18.917724609375, 19.5966796875, 20.275634765625, 20.95458984375, 21.633544921875, 22.3125]}, "gradients/decoder.transformer.h.8.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 3.0, 5.0, 5.0, 2.0, 4.0, 10.0, 13.0, 10.0, 18.0, 16.0, 27.0, 18.0, 31.0, 39.0, 51.0, 71.0, 85.0, 88.0, 114.0, 202.0, 247.0, 353.0, 629.0, 1614.0, 54785.0, 3047222.0, 36842.0, 1462.0, 568.0, 315.0, 185.0, 139.0, 114.0, 100.0, 70.0, 51.0, 43.0, 30.0, 32.0, 27.0, 17.0, 12.0, 8.0, 10.0, 8.0, 8.0, 3.0, 3.0, 3.0, 3.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 2.0], "bins": [-36.46875, -35.34521484375, -34.2216796875, -33.09814453125, -31.974609375, -30.85107421875, -29.7275390625, -28.60400390625, -27.48046875, -26.35693359375, -25.2333984375, -24.10986328125, -22.986328125, -21.86279296875, -20.7392578125, -19.61572265625, -18.4921875, -17.36865234375, -16.2451171875, -15.12158203125, -13.998046875, -12.87451171875, -11.7509765625, -10.62744140625, -9.50390625, -8.38037109375, -7.2568359375, -6.13330078125, -5.009765625, -3.88623046875, -2.7626953125, -1.63916015625, -0.515625, 0.60791015625, 1.7314453125, 2.85498046875, 3.978515625, 5.10205078125, 6.2255859375, 7.34912109375, 8.47265625, 9.59619140625, 10.7197265625, 11.84326171875, 12.966796875, 14.09033203125, 15.2138671875, 16.33740234375, 17.4609375, 18.58447265625, 19.7080078125, 20.83154296875, 21.955078125, 23.07861328125, 24.2021484375, 25.32568359375, 26.44921875, 27.57275390625, 28.6962890625, 29.81982421875, 30.943359375, 32.06689453125, 33.1904296875, 34.31396484375, 35.4375]}, "gradients/decoder.transformer.h.8.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 6.0, 501.0, 503.0, 5.0, 2.0, 0.0, 0.0, 1.0], "bins": [-325.35748291015625, -319.746826171875, -314.13616943359375, -308.5254821777344, -302.9148254394531, -297.3041687011719, -291.6935119628906, -286.08282470703125, -280.47216796875, -274.86151123046875, -269.2508544921875, -263.6401672363281, -258.0295104980469, -252.41885375976562, -246.8081817626953, -241.19752502441406, -235.58685302734375, -229.9761962890625, -224.3655242919922, -218.75486755371094, -213.14419555664062, -207.53353881835938, -201.92286682128906, -196.3122100830078, -190.70155334472656, -185.0908966064453, -179.480224609375, -173.86956787109375, -168.25889587402344, -162.6482391357422, -157.03756713867188, -151.42691040039062, -145.8162384033203, -140.20558166503906, -134.59490966796875, -128.9842529296875, -123.37358093261719, -117.7629165649414, -112.15225219726562, -106.54159545898438, -100.93092346191406, -95.32025909423828, -89.7095947265625, -84.09893035888672, -78.48826599121094, -72.87760162353516, -67.26693725585938, -61.65627670288086, -56.045616149902344, -50.43495178222656, -44.82428741455078, -39.213623046875, -33.60295867919922, -27.99229621887207, -22.381633758544922, -16.77096939086914, -11.16030502319336, -5.549641132354736, 0.06102275848388672, 5.671686172485352, 11.282350540161133, 16.893014907836914, 22.503677368164062, 28.114341735839844, 33.725006103515625]}, "gradients/decoder.transformer.h.8.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 2.0, 1.0, 1.0, 4.0, 2.0, 7.0, 7.0, 9.0, 14.0, 8.0, 15.0, 15.0, 16.0, 21.0, 20.0, 32.0, 31.0, 36.0, 40.0, 41.0, 35.0, 41.0, 34.0, 50.0, 46.0, 39.0, 40.0, 40.0, 41.0, 41.0, 38.0, 36.0, 26.0, 20.0, 25.0, 25.0, 17.0, 12.0, 16.0, 12.0, 11.0, 9.0, 8.0, 9.0, 7.0, 4.0, 6.0, 2.0, 2.0, 1.0, 1.0, 1.0, 2.0], "bins": [-68.08250427246094, -66.18128204345703, -64.2800521850586, -62.37882614135742, -60.47760009765625, -58.57637405395508, -56.675148010253906, -54.77392578125, -52.87269592285156, -50.97146987915039, -49.07024383544922, -47.16901779174805, -45.267791748046875, -43.3665657043457, -41.46533966064453, -39.564117431640625, -37.66289138793945, -35.76166534423828, -33.86043930053711, -31.959213256835938, -30.057987213134766, -28.156761169433594, -26.255537033081055, -24.354310989379883, -22.45308494567871, -20.55185890197754, -18.650632858276367, -16.749408721923828, -14.84818172454834, -12.946955680847168, -11.045730590820312, -9.14450454711914, -7.243274688720703, -5.342048645019531, -3.4408230781555176, -1.539597511291504, 0.36162853240966797, 2.26285457611084, 4.164079666137695, 6.065305709838867, 7.966531753540039, 9.867757797241211, 11.768983840942383, 13.670208930969238, 15.57143497467041, 17.472660064697266, 19.373886108398438, 21.27511215209961, 23.17633819580078, 25.077564239501953, 26.978790283203125, 28.880016326904297, 30.78124237060547, 32.68246841430664, 34.58369445800781, 36.48491668701172, 38.386146545410156, 40.28737258911133, 42.1885986328125, 44.08982467651367, 45.991050720214844, 47.892276763916016, 49.79350280761719, 51.694725036621094, 53.595951080322266]}, "gradients/decoder.transformer.h.7.mlp.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 3.0, 2.0, 3.0, 3.0, 5.0, 3.0, 3.0, 7.0, 4.0, 7.0, 6.0, 5.0, 13.0, 11.0, 15.0, 15.0, 19.0, 10.0, 27.0, 26.0, 31.0, 32.0, 36.0, 45.0, 39.0, 49.0, 47.0, 41.0, 46.0, 44.0, 43.0, 31.0, 36.0, 35.0, 35.0, 32.0, 29.0, 24.0, 29.0, 19.0, 21.0, 14.0, 20.0, 8.0, 10.0, 7.0, 7.0, 2.0, 6.0, 5.0, 2.0, 2.0, 3.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-7.140625, -6.903564453125, -6.66650390625, -6.429443359375, -6.1923828125, -5.955322265625, -5.71826171875, -5.481201171875, -5.244140625, -5.007080078125, -4.77001953125, -4.532958984375, -4.2958984375, -4.058837890625, -3.82177734375, -3.584716796875, -3.34765625, -3.110595703125, -2.87353515625, -2.636474609375, -2.3994140625, -2.162353515625, -1.92529296875, -1.688232421875, -1.451171875, -1.214111328125, -0.97705078125, -0.739990234375, -0.5029296875, -0.265869140625, -0.02880859375, 0.208251953125, 0.4453125, 0.682373046875, 0.91943359375, 1.156494140625, 1.3935546875, 1.630615234375, 1.86767578125, 2.104736328125, 2.341796875, 2.578857421875, 2.81591796875, 3.052978515625, 3.2900390625, 3.527099609375, 3.76416015625, 4.001220703125, 4.23828125, 4.475341796875, 4.71240234375, 4.949462890625, 5.1865234375, 5.423583984375, 5.66064453125, 5.897705078125, 6.134765625, 6.371826171875, 6.60888671875, 6.845947265625, 7.0830078125, 7.320068359375, 7.55712890625, 7.794189453125, 8.03125]}, "gradients/decoder.transformer.h.7.mlp.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 4.0, 2.0, 1.0, 0.0, 5.0, 6.0, 2.0, 5.0, 4.0, 5.0, 5.0, 5.0, 11.0, 12.0, 17.0, 14.0, 16.0, 24.0, 19.0, 37.0, 52.0, 56.0, 69.0, 115.0, 203.0, 459.0, 1781.0, 16903.0, 874699.0, 3198204.0, 95244.0, 4642.0, 838.0, 282.0, 140.0, 86.0, 65.0, 52.0, 39.0, 28.0, 30.0, 16.0, 16.0, 17.0, 6.0, 19.0, 8.0, 9.0, 4.0, 1.0, 6.0, 6.0, 1.0, 4.0, 2.0, 0.0, 2.0, 1.0, 0.0, 0.0, 2.0, 1.0], "bins": [-26.078125, -25.243408203125, -24.40869140625, -23.573974609375, -22.7392578125, -21.904541015625, -21.06982421875, -20.235107421875, -19.400390625, -18.565673828125, -17.73095703125, -16.896240234375, -16.0615234375, -15.226806640625, -14.39208984375, -13.557373046875, -12.72265625, -11.887939453125, -11.05322265625, -10.218505859375, -9.3837890625, -8.549072265625, -7.71435546875, -6.879638671875, -6.044921875, -5.210205078125, -4.37548828125, -3.540771484375, -2.7060546875, -1.871337890625, -1.03662109375, -0.201904296875, 0.6328125, 1.467529296875, 2.30224609375, 3.136962890625, 3.9716796875, 4.806396484375, 5.64111328125, 6.475830078125, 7.310546875, 8.145263671875, 8.97998046875, 9.814697265625, 10.6494140625, 11.484130859375, 12.31884765625, 13.153564453125, 13.98828125, 14.822998046875, 15.65771484375, 16.492431640625, 17.3271484375, 18.161865234375, 18.99658203125, 19.831298828125, 20.666015625, 21.500732421875, 22.33544921875, 23.170166015625, 24.0048828125, 24.839599609375, 25.67431640625, 26.509033203125, 27.34375]}, "gradients/decoder.transformer.h.7.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 3.0, 1.0, 1.0, 2.0, 7.0, 10.0, 9.0, 18.0, 14.0, 26.0, 43.0, 55.0, 71.0, 121.0, 155.0, 222.0, 284.0, 437.0, 597.0, 515.0, 427.0, 289.0, 246.0, 150.0, 117.0, 81.0, 53.0, 36.0, 22.0, 29.0, 16.0, 9.0, 15.0, 3.0, 2.0, 0.0, 2.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-11.875, -11.381103515625, -10.88720703125, -10.393310546875, -9.8994140625, -9.405517578125, -8.91162109375, -8.417724609375, -7.923828125, -7.429931640625, -6.93603515625, -6.442138671875, -5.9482421875, -5.454345703125, -4.96044921875, -4.466552734375, -3.97265625, -3.478759765625, -2.98486328125, -2.490966796875, -1.9970703125, -1.503173828125, -1.00927734375, -0.515380859375, -0.021484375, 0.472412109375, 0.96630859375, 1.460205078125, 1.9541015625, 2.447998046875, 2.94189453125, 3.435791015625, 3.9296875, 4.423583984375, 4.91748046875, 5.411376953125, 5.9052734375, 6.399169921875, 6.89306640625, 7.386962890625, 7.880859375, 8.374755859375, 8.86865234375, 9.362548828125, 9.8564453125, 10.350341796875, 10.84423828125, 11.338134765625, 11.83203125, 12.325927734375, 12.81982421875, 13.313720703125, 13.8076171875, 14.301513671875, 14.79541015625, 15.289306640625, 15.783203125, 16.277099609375, 16.77099609375, 17.264892578125, 17.7587890625, 18.252685546875, 18.74658203125, 19.240478515625, 19.734375]}, "gradients/decoder.transformer.h.7.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 2.0, 2.0, 4.0, 4.0, 7.0, 9.0, 5.0, 17.0, 24.0, 24.0, 29.0, 43.0, 49.0, 74.0, 93.0, 115.0, 110.0, 159.0, 233.0, 410.0, 864.0, 17791.0, 4159051.0, 12882.0, 871.0, 389.0, 249.0, 183.0, 141.0, 94.0, 91.0, 50.0, 48.0, 37.0, 30.0, 21.0, 22.0, 16.0, 13.0, 11.0, 4.0, 8.0, 3.0, 6.0, 5.0, 2.0, 0.0, 0.0, 3.0, 1.0], "bins": [-99.625, -96.8935546875, -94.162109375, -91.4306640625, -88.69921875, -85.9677734375, -83.236328125, -80.5048828125, -77.7734375, -75.0419921875, -72.310546875, -69.5791015625, -66.84765625, -64.1162109375, -61.384765625, -58.6533203125, -55.921875, -53.1904296875, -50.458984375, -47.7275390625, -44.99609375, -42.2646484375, -39.533203125, -36.8017578125, -34.0703125, -31.3388671875, -28.607421875, -25.8759765625, -23.14453125, -20.4130859375, -17.681640625, -14.9501953125, -12.21875, -9.4873046875, -6.755859375, -4.0244140625, -1.29296875, 1.4384765625, 4.169921875, 6.9013671875, 9.6328125, 12.3642578125, 15.095703125, 17.8271484375, 20.55859375, 23.2900390625, 26.021484375, 28.7529296875, 31.484375, 34.2158203125, 36.947265625, 39.6787109375, 42.41015625, 45.1416015625, 47.873046875, 50.6044921875, 53.3359375, 56.0673828125, 58.798828125, 61.5302734375, 64.26171875, 66.9931640625, 69.724609375, 72.4560546875, 75.1875]}, "gradients/decoder.transformer.h.7.ln_2.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 1.0, 3.0, 18.0, 456.0, 519.0, 21.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-73.60321807861328, -64.4305648803711, -55.257911682128906, -46.085262298583984, -36.9126091003418, -27.73995590209961, -18.567306518554688, -9.3946533203125, -0.2220001220703125, 8.950652122497559, 18.12330436706543, 27.295955657958984, 36.46860885620117, 45.64126205444336, 54.81391143798828, 63.98656463623047, 73.15921783447266, 82.33187103271484, 91.50452423095703, 100.67716979980469, 109.84982299804688, 119.02247619628906, 128.19512939453125, 137.36778259277344, 146.54043579101562, 155.7130889892578, 164.8857421875, 174.0583953857422, 183.23104858398438, 192.40370178222656, 201.57635498046875, 210.74899291992188, 219.92166137695312, 229.0943145751953, 238.2669677734375, 247.4396209716797, 256.6122741699219, 265.784912109375, 274.95758056640625, 284.1302185058594, 293.3028869628906, 302.47552490234375, 311.648193359375, 320.8208312988281, 329.9934997558594, 339.1661376953125, 348.33880615234375, 357.5114440917969, 366.68408203125, 375.8567199707031, 385.0293884277344, 394.2020263671875, 403.37469482421875, 412.5473327636719, 421.7200012207031, 430.89263916015625, 440.0653076171875, 449.2379455566406, 458.4106140136719, 467.583251953125, 476.75592041015625, 485.9285583496094, 495.1012268066406, 504.27386474609375, 513.446533203125]}, "gradients/decoder.transformer.h.7.ln_2.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 2.0, 3.0, 4.0, 4.0, 4.0, 2.0, 6.0, 3.0, 6.0, 4.0, 5.0, 11.0, 12.0, 13.0, 17.0, 21.0, 33.0, 23.0, 29.0, 27.0, 30.0, 37.0, 36.0, 36.0, 32.0, 38.0, 49.0, 38.0, 50.0, 30.0, 34.0, 39.0, 40.0, 37.0, 35.0, 31.0, 25.0, 28.0, 16.0, 19.0, 19.0, 13.0, 6.0, 15.0, 12.0, 7.0, 9.0, 4.0, 5.0, 3.0, 3.0, 3.0, 3.0, 3.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0, 1.0], "bins": [-45.55470275878906, -44.0551643371582, -42.55562973022461, -41.05609130859375, -39.55655288696289, -38.05701446533203, -36.55747985839844, -35.05794143676758, -33.55840301513672, -32.05886459350586, -30.559328079223633, -29.059791564941406, -27.560253143310547, -26.06071662902832, -24.561180114746094, -23.061641693115234, -21.56210708618164, -20.062570571899414, -18.563032150268555, -17.063495635986328, -15.563958168029785, -14.064420700073242, -12.564884185791016, -11.065346717834473, -9.56580924987793, -8.066271781921387, -6.566734790802002, -5.067197799682617, -3.567660331726074, -2.0681228637695312, -0.5685863494873047, 0.9309511184692383, 2.430492401123047, 3.9300296306610107, 5.429566860198975, 6.929103851318359, 8.428641319274902, 9.928178787231445, 11.427715301513672, 12.927252769470215, 14.426790237426758, 15.9263277053833, 17.425865173339844, 18.92540168762207, 20.424938201904297, 21.924476623535156, 23.424013137817383, 24.92354965209961, 26.42308807373047, 27.922624588012695, 29.422163009643555, 30.92169952392578, 32.42123794555664, 33.9207763671875, 35.420310974121094, 36.91984939575195, 38.41938781738281, 39.91892623901367, 41.418460845947266, 42.917999267578125, 44.417537689208984, 45.917076110839844, 47.41661071777344, 48.9161491394043, 50.41568374633789]}, "gradients/decoder.transformer.h.7.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 3.0, 2.0, 1.0, 3.0, 3.0, 2.0, 4.0, 5.0, 9.0, 8.0, 8.0, 8.0, 4.0, 17.0, 19.0, 18.0, 24.0, 20.0, 19.0, 36.0, 24.0, 31.0, 38.0, 33.0, 43.0, 35.0, 41.0, 42.0, 47.0, 44.0, 32.0, 44.0, 28.0, 29.0, 29.0, 38.0, 24.0, 21.0, 24.0, 26.0, 18.0, 20.0, 16.0, 13.0, 11.0, 9.0, 13.0, 4.0, 3.0, 6.0, 4.0, 5.0, 4.0, 2.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-7.109375, -6.88214111328125, -6.6549072265625, -6.42767333984375, -6.200439453125, -5.97320556640625, -5.7459716796875, -5.51873779296875, -5.29150390625, -5.06427001953125, -4.8370361328125, -4.60980224609375, -4.382568359375, -4.15533447265625, -3.9281005859375, -3.70086669921875, -3.4736328125, -3.24639892578125, -3.0191650390625, -2.79193115234375, -2.564697265625, -2.33746337890625, -2.1102294921875, -1.88299560546875, -1.65576171875, -1.42852783203125, -1.2012939453125, -0.97406005859375, -0.746826171875, -0.51959228515625, -0.2923583984375, -0.06512451171875, 0.162109375, 0.38934326171875, 0.6165771484375, 0.84381103515625, 1.071044921875, 1.29827880859375, 1.5255126953125, 1.75274658203125, 1.97998046875, 2.20721435546875, 2.4344482421875, 2.66168212890625, 2.888916015625, 3.11614990234375, 3.3433837890625, 3.57061767578125, 3.7978515625, 4.02508544921875, 4.2523193359375, 4.47955322265625, 4.706787109375, 4.93402099609375, 5.1612548828125, 5.38848876953125, 5.61572265625, 5.84295654296875, 6.0701904296875, 6.29742431640625, 6.524658203125, 6.75189208984375, 6.9791259765625, 7.20635986328125, 7.43359375]}, "gradients/decoder.transformer.h.7.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 2.0, 3.0, 3.0, 3.0, 8.0, 13.0, 13.0, 25.0, 38.0, 65.0, 90.0, 110.0, 159.0, 255.0, 387.0, 573.0, 802.0, 1240.0, 1780.0, 2773.0, 4100.0, 6096.0, 9318.0, 14214.0, 22046.0, 33904.0, 55003.0, 95039.0, 204974.0, 303001.0, 115584.0, 64435.0, 39293.0, 25130.0, 16329.0, 10650.0, 7071.0, 4601.0, 3108.0, 2023.0, 1444.0, 911.0, 625.0, 425.0, 280.0, 210.0, 136.0, 96.0, 59.0, 38.0, 33.0, 19.0, 16.0, 8.0, 5.0, 3.0, 2.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-1.767578125, -1.710845947265625, -1.65411376953125, -1.597381591796875, -1.5406494140625, -1.483917236328125, -1.42718505859375, -1.370452880859375, -1.313720703125, -1.256988525390625, -1.20025634765625, -1.143524169921875, -1.0867919921875, -1.030059814453125, -0.97332763671875, -0.916595458984375, -0.85986328125, -0.803131103515625, -0.74639892578125, -0.689666748046875, -0.6329345703125, -0.576202392578125, -0.51947021484375, -0.462738037109375, -0.406005859375, -0.349273681640625, -0.29254150390625, -0.235809326171875, -0.1790771484375, -0.122344970703125, -0.06561279296875, -0.008880615234375, 0.0478515625, 0.104583740234375, 0.16131591796875, 0.218048095703125, 0.2747802734375, 0.331512451171875, 0.38824462890625, 0.444976806640625, 0.501708984375, 0.558441162109375, 0.61517333984375, 0.671905517578125, 0.7286376953125, 0.785369873046875, 0.84210205078125, 0.898834228515625, 0.95556640625, 1.012298583984375, 1.06903076171875, 1.125762939453125, 1.1824951171875, 1.239227294921875, 1.29595947265625, 1.352691650390625, 1.409423828125, 1.466156005859375, 1.52288818359375, 1.579620361328125, 1.6363525390625, 1.693084716796875, 1.74981689453125, 1.806549072265625, 1.86328125]}, "gradients/decoder.transformer.h.7.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 2.0, 1.0, 2.0, 2.0, 4.0, 2.0, 8.0, 5.0, 11.0, 8.0, 16.0, 14.0, 16.0, 19.0, 19.0, 17.0, 38.0, 30.0, 31.0, 34.0, 51.0, 45.0, 48.0, 51.0, 1062.0, 35.0, 38.0, 45.0, 36.0, 34.0, 40.0, 27.0, 32.0, 25.0, 39.0, 27.0, 24.0, 13.0, 14.0, 10.0, 10.0, 14.0, 9.0, 8.0, 10.0, 5.0, 5.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-5.1875, -5.0302734375, -4.873046875, -4.7158203125, -4.55859375, -4.4013671875, -4.244140625, -4.0869140625, -3.9296875, -3.7724609375, -3.615234375, -3.4580078125, -3.30078125, -3.1435546875, -2.986328125, -2.8291015625, -2.671875, -2.5146484375, -2.357421875, -2.2001953125, -2.04296875, -1.8857421875, -1.728515625, -1.5712890625, -1.4140625, -1.2568359375, -1.099609375, -0.9423828125, -0.78515625, -0.6279296875, -0.470703125, -0.3134765625, -0.15625, 0.0009765625, 0.158203125, 0.3154296875, 0.47265625, 0.6298828125, 0.787109375, 0.9443359375, 1.1015625, 1.2587890625, 1.416015625, 1.5732421875, 1.73046875, 1.8876953125, 2.044921875, 2.2021484375, 2.359375, 2.5166015625, 2.673828125, 2.8310546875, 2.98828125, 3.1455078125, 3.302734375, 3.4599609375, 3.6171875, 3.7744140625, 3.931640625, 4.0888671875, 4.24609375, 4.4033203125, 4.560546875, 4.7177734375, 4.875]}, "gradients/decoder.transformer.h.7.crossattention.c_attn.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 2.0, 5.0, 7.0, 13.0, 12.0, 20.0, 23.0, 29.0, 47.0, 87.0, 120.0, 237.0, 371.0, 644.0, 1147.0, 2209.0, 3934.0, 6951.0, 13506.0, 26622.0, 54841.0, 121282.0, 1439948.0, 244123.0, 92226.0, 43243.0, 21223.0, 11041.0, 5925.0, 3171.0, 1758.0, 973.0, 597.0, 318.0, 181.0, 109.0, 61.0, 49.0, 31.0, 18.0, 8.0, 12.0, 2.0, 6.0, 4.0, 2.0, 1.0, 4.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.662109375, -2.575775146484375, -2.48944091796875, -2.403106689453125, -2.3167724609375, -2.230438232421875, -2.14410400390625, -2.057769775390625, -1.971435546875, -1.885101318359375, -1.79876708984375, -1.712432861328125, -1.6260986328125, -1.539764404296875, -1.45343017578125, -1.367095947265625, -1.28076171875, -1.194427490234375, -1.10809326171875, -1.021759033203125, -0.9354248046875, -0.849090576171875, -0.76275634765625, -0.676422119140625, -0.590087890625, -0.503753662109375, -0.41741943359375, -0.331085205078125, -0.2447509765625, -0.158416748046875, -0.07208251953125, 0.014251708984375, 0.1005859375, 0.186920166015625, 0.27325439453125, 0.359588623046875, 0.4459228515625, 0.532257080078125, 0.61859130859375, 0.704925537109375, 0.791259765625, 0.877593994140625, 0.96392822265625, 1.050262451171875, 1.1365966796875, 1.222930908203125, 1.30926513671875, 1.395599365234375, 1.48193359375, 1.568267822265625, 1.65460205078125, 1.740936279296875, 1.8272705078125, 1.913604736328125, 1.99993896484375, 2.086273193359375, 2.172607421875, 2.258941650390625, 2.34527587890625, 2.431610107421875, 2.5179443359375, 2.604278564453125, 2.69061279296875, 2.776947021484375, 2.86328125]}, "gradients/decoder.transformer.h.7.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 3.0, 4.0, 4.0, 4.0, 5.0, 12.0, 9.0, 18.0, 15.0, 20.0, 25.0, 30.0, 68.0, 81.0, 81.0, 125.0, 106.0, 93.0, 67.0, 53.0, 46.0, 33.0, 20.0, 24.0, 20.0, 17.0, 10.0, 7.0, 3.0, 3.0, 2.0, 0.0, 2.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.0013942718505859375, -0.001349329948425293, -0.0013043880462646484, -0.001259446144104004, -0.0012145042419433594, -0.0011695623397827148, -0.0011246204376220703, -0.0010796785354614258, -0.0010347366333007812, -0.0009897947311401367, -0.0009448528289794922, -0.0008999109268188477, -0.0008549690246582031, -0.0008100271224975586, -0.0007650852203369141, -0.0007201433181762695, -0.000675201416015625, -0.0006302595138549805, -0.0005853176116943359, -0.0005403757095336914, -0.0004954338073730469, -0.00045049190521240234, -0.0004055500030517578, -0.0003606081008911133, -0.00031566619873046875, -0.0002707242965698242, -0.0002257823944091797, -0.00018084049224853516, -0.00013589859008789062, -9.09566879272461e-05, -4.601478576660156e-05, -1.0728836059570312e-06, 4.38690185546875e-05, 8.881092071533203e-05, 0.00013375282287597656, 0.0001786947250366211, 0.00022363662719726562, 0.00026857852935791016, 0.0003135204315185547, 0.0003584623336791992, 0.00040340423583984375, 0.0004483461380004883, 0.0004932880401611328, 0.0005382299423217773, 0.0005831718444824219, 0.0006281137466430664, 0.0006730556488037109, 0.0007179975509643555, 0.000762939453125, 0.0008078813552856445, 0.0008528232574462891, 0.0008977651596069336, 0.0009427070617675781, 0.0009876489639282227, 0.0010325908660888672, 0.0010775327682495117, 0.0011224746704101562, 0.0011674165725708008, 0.0012123584747314453, 0.0012573003768920898, 0.0013022422790527344, 0.001347184181213379, 0.0013921260833740234, 0.001437067985534668, 0.0014820098876953125]}, "gradients/decoder.transformer.h.7.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 2.0, 1.0, 2.0, 2.0, 0.0, 2.0, 2.0, 3.0, 4.0, 4.0, 5.0, 12.0, 15.0, 19.0, 23.0, 30.0, 35.0, 39.0, 78.0, 97.0, 154.0, 361.0, 3256.0, 1041440.0, 2191.0, 316.0, 181.0, 86.0, 46.0, 28.0, 25.0, 22.0, 18.0, 15.0, 14.0, 8.0, 4.0, 9.0, 6.0, 4.0, 4.0, 5.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.03057861328125, -0.029540538787841797, -0.028502464294433594, -0.02746438980102539, -0.026426315307617188, -0.025388240814208984, -0.02435016632080078, -0.023312091827392578, -0.022274017333984375, -0.021235942840576172, -0.02019786834716797, -0.019159793853759766, -0.018121719360351562, -0.01708364486694336, -0.016045570373535156, -0.015007495880126953, -0.01396942138671875, -0.012931346893310547, -0.011893272399902344, -0.01085519790649414, -0.009817123413085938, -0.008779048919677734, -0.007740974426269531, -0.006702899932861328, -0.005664825439453125, -0.004626750946044922, -0.0035886764526367188, -0.0025506019592285156, -0.0015125274658203125, -0.0004744529724121094, 0.0005636215209960938, 0.0016016960144042969, 0.0026397705078125, 0.003677845001220703, 0.004715919494628906, 0.005753993988037109, 0.0067920684814453125, 0.007830142974853516, 0.008868217468261719, 0.009906291961669922, 0.010944366455078125, 0.011982440948486328, 0.013020515441894531, 0.014058589935302734, 0.015096664428710938, 0.01613473892211914, 0.017172813415527344, 0.018210887908935547, 0.01924896240234375, 0.020287036895751953, 0.021325111389160156, 0.02236318588256836, 0.023401260375976562, 0.024439334869384766, 0.02547740936279297, 0.026515483856201172, 0.027553558349609375, 0.028591632843017578, 0.02962970733642578, 0.030667781829833984, 0.03170585632324219, 0.03274393081665039, 0.033782005310058594, 0.0348200798034668, 0.035858154296875]}, "gradients/decoder.transformer.h.7.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 4.0, 13.0, 45.0, 155.0, 286.0, 294.0, 166.0, 40.0, 11.0, 5.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0011612925445660949, -0.0011128628393635154, -0.0010644332505762577, -0.0010160035453736782, -0.0009675738983787596, -0.000919144251383841, -0.0008707145461812615, -0.000822284899186343, -0.0007738552521914244, -0.0007254256051965058, -0.0006769959582015872, -0.0006285662529990077, -0.0005801366060040891, -0.0005317069590091705, -0.0004832772829104215, -0.00043484760681167245, -0.00038641795981675386, -0.0003379883128218353, -0.00028955863672308624, -0.00024112897517625242, -0.0001926993136294186, -0.0001442696520825848, -9.583999053575099e-05, -4.7410314437001944e-05, 1.0193325579166412e-06, 4.9448994104750454e-05, 9.787865565158427e-05, 0.00014630831719841808, 0.0001947379787452519, 0.0002431676402920857, 0.0002915973018389195, 0.00034002697793766856, 0.00038845662493258715, 0.00043688627192750573, 0.0004853159480262548, 0.0005337456241250038, 0.0005821752711199224, 0.000630604918114841, 0.0006790346233174205, 0.0007274642703123391, 0.0007758939173072577, 0.0008243235643021762, 0.0008727532112970948, 0.0009211829164996743, 0.0009696125634945929, 0.0010180422104895115, 0.001066471915692091, 0.0011149016208946705, 0.0011633312096819282, 0.0012117609148845077, 0.0012601905036717653, 0.0013086202088743448, 0.0013570499140769243, 0.001405479502864182, 0.0014539092080667615, 0.0015023387968540192, 0.0015507685020565987, 0.0015991982072591782, 0.0016476277960464358, 0.0016960575012490153, 0.001744487090036273, 0.0017929167952388525, 0.001841346500441432, 0.0018897762056440115, 0.0019382057944312692]}, "gradients/decoder.transformer.h.7.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 1.0, 1.0, 2.0, 2.0, 4.0, 3.0, 2.0, 7.0, 2.0, 5.0, 7.0, 11.0, 12.0, 11.0, 17.0, 21.0, 20.0, 21.0, 19.0, 23.0, 25.0, 26.0, 31.0, 31.0, 32.0, 37.0, 31.0, 43.0, 42.0, 46.0, 28.0, 37.0, 33.0, 32.0, 42.0, 33.0, 42.0, 32.0, 29.0, 27.0, 21.0, 18.0, 14.0, 19.0, 11.0, 7.0, 15.0, 8.0, 5.0, 8.0, 5.0, 3.0, 4.0, 2.0, 2.0, 0.0, 1.0, 0.0, 5.0, 1.0, 2.0], "bins": [-0.0004977583885192871, -0.00048219598829746246, -0.0004666335880756378, -0.00045107118785381317, -0.0004355087876319885, -0.0004199463874101639, -0.00040438398718833923, -0.0003888215869665146, -0.00037325918674468994, -0.0003576967865228653, -0.00034213438630104065, -0.000326571986079216, -0.00031100958585739136, -0.0002954471856355667, -0.00027988478541374207, -0.0002643223851919174, -0.0002487599849700928, -0.00023319758474826813, -0.00021763518452644348, -0.00020207278430461884, -0.0001865103840827942, -0.00017094798386096954, -0.0001553855836391449, -0.00013982318341732025, -0.0001242607831954956, -0.00010869838297367096, -9.313598275184631e-05, -7.757358253002167e-05, -6.201118230819702e-05, -4.6448782086372375e-05, -3.088638186454773e-05, -1.5323981642723083e-05, 2.384185791015625e-07, 1.580081880092621e-05, 3.1363219022750854e-05, 4.69256192445755e-05, 6.248801946640015e-05, 7.805041968822479e-05, 9.361281991004944e-05, 0.00010917522013187408, 0.00012473762035369873, 0.00014030002057552338, 0.00015586242079734802, 0.00017142482101917267, 0.00018698722124099731, 0.00020254962146282196, 0.0002181120216846466, 0.00023367442190647125, 0.0002492368221282959, 0.00026479922235012054, 0.0002803616225719452, 0.00029592402279376984, 0.0003114864230155945, 0.00032704882323741913, 0.0003426112234592438, 0.0003581736236810684, 0.00037373602390289307, 0.0003892984241247177, 0.00040486082434654236, 0.000420423224568367, 0.00043598562479019165, 0.0004515480250120163, 0.00046711042523384094, 0.0004826728254556656, 0.0004982352256774902]}, "gradients/decoder.transformer.h.7.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 3.0, 2.0, 1.0, 3.0, 3.0, 2.0, 4.0, 5.0, 9.0, 8.0, 8.0, 8.0, 4.0, 17.0, 19.0, 18.0, 24.0, 20.0, 19.0, 36.0, 24.0, 31.0, 38.0, 33.0, 43.0, 35.0, 41.0, 42.0, 47.0, 44.0, 32.0, 44.0, 28.0, 29.0, 29.0, 38.0, 24.0, 21.0, 24.0, 26.0, 18.0, 20.0, 16.0, 13.0, 11.0, 9.0, 13.0, 4.0, 3.0, 6.0, 4.0, 5.0, 4.0, 2.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-7.109375, -6.88214111328125, -6.6549072265625, -6.42767333984375, -6.200439453125, -5.97320556640625, -5.7459716796875, -5.51873779296875, -5.29150390625, -5.06427001953125, -4.8370361328125, -4.60980224609375, -4.382568359375, -4.15533447265625, -3.9281005859375, -3.70086669921875, -3.4736328125, -3.24639892578125, -3.0191650390625, -2.79193115234375, -2.564697265625, -2.33746337890625, -2.1102294921875, -1.88299560546875, -1.65576171875, -1.42852783203125, -1.2012939453125, -0.97406005859375, -0.746826171875, -0.51959228515625, -0.2923583984375, -0.06512451171875, 0.162109375, 0.38934326171875, 0.6165771484375, 0.84381103515625, 1.071044921875, 1.29827880859375, 1.5255126953125, 1.75274658203125, 1.97998046875, 2.20721435546875, 2.4344482421875, 2.66168212890625, 2.888916015625, 3.11614990234375, 3.3433837890625, 3.57061767578125, 3.7978515625, 4.02508544921875, 4.2523193359375, 4.47955322265625, 4.706787109375, 4.93402099609375, 5.1612548828125, 5.38848876953125, 5.61572265625, 5.84295654296875, 6.0701904296875, 6.29742431640625, 6.524658203125, 6.75189208984375, 6.9791259765625, 7.20635986328125, 7.43359375]}, "gradients/decoder.transformer.h.7.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 3.0, 2.0, 0.0, 4.0, 2.0, 3.0, 2.0, 7.0, 10.0, 8.0, 13.0, 9.0, 9.0, 20.0, 27.0, 34.0, 44.0, 45.0, 56.0, 97.0, 110.0, 165.0, 261.0, 409.0, 966.0, 3044.0, 16549.0, 136584.0, 759983.0, 111089.0, 14164.0, 2726.0, 829.0, 407.0, 223.0, 166.0, 118.0, 79.0, 57.0, 60.0, 32.0, 29.0, 27.0, 18.0, 11.0, 16.0, 14.0, 6.0, 6.0, 7.0, 5.0, 3.0, 6.0, 1.0, 1.0, 3.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-14.03125, -13.585205078125, -13.13916015625, -12.693115234375, -12.2470703125, -11.801025390625, -11.35498046875, -10.908935546875, -10.462890625, -10.016845703125, -9.57080078125, -9.124755859375, -8.6787109375, -8.232666015625, -7.78662109375, -7.340576171875, -6.89453125, -6.448486328125, -6.00244140625, -5.556396484375, -5.1103515625, -4.664306640625, -4.21826171875, -3.772216796875, -3.326171875, -2.880126953125, -2.43408203125, -1.988037109375, -1.5419921875, -1.095947265625, -0.64990234375, -0.203857421875, 0.2421875, 0.688232421875, 1.13427734375, 1.580322265625, 2.0263671875, 2.472412109375, 2.91845703125, 3.364501953125, 3.810546875, 4.256591796875, 4.70263671875, 5.148681640625, 5.5947265625, 6.040771484375, 6.48681640625, 6.932861328125, 7.37890625, 7.824951171875, 8.27099609375, 8.717041015625, 9.1630859375, 9.609130859375, 10.05517578125, 10.501220703125, 10.947265625, 11.393310546875, 11.83935546875, 12.285400390625, 12.7314453125, 13.177490234375, 13.62353515625, 14.069580078125, 14.515625]}, "gradients/decoder.transformer.h.7.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 3.0, 4.0, 3.0, 5.0, 7.0, 6.0, 10.0, 10.0, 20.0, 10.0, 15.0, 22.0, 19.0, 29.0, 34.0, 46.0, 29.0, 42.0, 45.0, 53.0, 61.0, 96.0, 1522.0, 386.0, 110.0, 61.0, 57.0, 54.0, 50.0, 39.0, 29.0, 42.0, 23.0, 13.0, 30.0, 16.0, 15.0, 12.0, 5.0, 5.0, 5.0, 3.0, 3.0, 6.0, 1.0, 1.0, 2.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-26.640625, -25.831298828125, -25.02197265625, -24.212646484375, -23.4033203125, -22.593994140625, -21.78466796875, -20.975341796875, -20.166015625, -19.356689453125, -18.54736328125, -17.738037109375, -16.9287109375, -16.119384765625, -15.31005859375, -14.500732421875, -13.69140625, -12.882080078125, -12.07275390625, -11.263427734375, -10.4541015625, -9.644775390625, -8.83544921875, -8.026123046875, -7.216796875, -6.407470703125, -5.59814453125, -4.788818359375, -3.9794921875, -3.170166015625, -2.36083984375, -1.551513671875, -0.7421875, 0.067138671875, 0.87646484375, 1.685791015625, 2.4951171875, 3.304443359375, 4.11376953125, 4.923095703125, 5.732421875, 6.541748046875, 7.35107421875, 8.160400390625, 8.9697265625, 9.779052734375, 10.58837890625, 11.397705078125, 12.20703125, 13.016357421875, 13.82568359375, 14.635009765625, 15.4443359375, 16.253662109375, 17.06298828125, 17.872314453125, 18.681640625, 19.490966796875, 20.30029296875, 21.109619140625, 21.9189453125, 22.728271484375, 23.53759765625, 24.346923828125, 25.15625]}, "gradients/decoder.transformer.h.7.attn.c_attn.weight": {"_type": "histogram", "values": [3.0, 0.0, 0.0, 0.0, 3.0, 5.0, 0.0, 4.0, 6.0, 3.0, 5.0, 7.0, 7.0, 15.0, 18.0, 32.0, 31.0, 31.0, 55.0, 69.0, 103.0, 180.0, 300.0, 715.0, 3793.0, 2876780.0, 259844.0, 2275.0, 602.0, 274.0, 158.0, 90.0, 82.0, 53.0, 37.0, 31.0, 21.0, 21.0, 17.0, 18.0, 9.0, 8.0, 3.0, 6.0, 5.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-58.75, -56.4814453125, -54.212890625, -51.9443359375, -49.67578125, -47.4072265625, -45.138671875, -42.8701171875, -40.6015625, -38.3330078125, -36.064453125, -33.7958984375, -31.52734375, -29.2587890625, -26.990234375, -24.7216796875, -22.453125, -20.1845703125, -17.916015625, -15.6474609375, -13.37890625, -11.1103515625, -8.841796875, -6.5732421875, -4.3046875, -2.0361328125, 0.232421875, 2.5009765625, 4.76953125, 7.0380859375, 9.306640625, 11.5751953125, 13.84375, 16.1123046875, 18.380859375, 20.6494140625, 22.91796875, 25.1865234375, 27.455078125, 29.7236328125, 31.9921875, 34.2607421875, 36.529296875, 38.7978515625, 41.06640625, 43.3349609375, 45.603515625, 47.8720703125, 50.140625, 52.4091796875, 54.677734375, 56.9462890625, 59.21484375, 61.4833984375, 63.751953125, 66.0205078125, 68.2890625, 70.5576171875, 72.826171875, 75.0947265625, 77.36328125, 79.6318359375, 81.900390625, 84.1689453125, 86.4375]}, "gradients/decoder.transformer.h.7.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 3.0, 195.0, 816.0, 4.0], "bins": [-807.1841430664062, -794.1924438476562, -781.20068359375, -768.208984375, -755.21728515625, -742.2255859375, -729.2338256835938, -716.2421264648438, -703.2504272460938, -690.2587280273438, -677.2669677734375, -664.2752685546875, -651.2835693359375, -638.2918701171875, -625.3001098632812, -612.3084106445312, -599.316650390625, -586.324951171875, -573.3331909179688, -560.3414916992188, -547.3497924804688, -534.3580932617188, -521.3663330078125, -508.3746337890625, -495.3829345703125, -482.3912048339844, -469.3995056152344, -456.40777587890625, -443.41607666015625, -430.4243469238281, -417.4326171875, -404.44091796875, -391.4491882324219, -378.45745849609375, -365.46575927734375, -352.4740295410156, -339.4823303222656, -326.4906005859375, -313.4989013671875, -300.5071716308594, -287.5154724121094, -274.52374267578125, -261.53204345703125, -248.54031372070312, -235.54861450195312, -222.556884765625, -209.56517028808594, -196.57345581054688, -183.58172607421875, -170.5900115966797, -157.59829711914062, -144.6065673828125, -131.6148681640625, -118.6231460571289, -105.63142395019531, -92.63970947265625, -79.64799499511719, -66.65628051757812, -53.6645622253418, -40.67284393310547, -27.681129455566406, -14.689414978027344, -1.69769287109375, 11.294021606445312, 24.285734176635742]}, "gradients/decoder.transformer.h.7.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 6.0, 9.0, 5.0, 5.0, 9.0, 3.0, 11.0, 16.0, 17.0, 12.0, 19.0, 29.0, 10.0, 22.0, 23.0, 24.0, 37.0, 35.0, 33.0, 39.0, 40.0, 42.0, 40.0, 47.0, 35.0, 37.0, 35.0, 46.0, 43.0, 35.0, 25.0, 28.0, 21.0, 25.0, 14.0, 23.0, 16.0, 18.0, 10.0, 10.0, 11.0, 10.0, 8.0, 6.0, 9.0, 6.0, 3.0, 4.0, 0.0, 0.0, 2.0, 2.0, 0.0, 2.0, 2.0], "bins": [-62.119842529296875, -60.12122344970703, -58.12260437011719, -56.123985290527344, -54.1253662109375, -52.126747131347656, -50.12812805175781, -48.12950897216797, -46.130889892578125, -44.13227081298828, -42.13365173339844, -40.135032653808594, -38.13641357421875, -36.137794494628906, -34.13917541503906, -32.14055633544922, -30.141935348510742, -28.1433162689209, -26.144697189331055, -24.14607810974121, -22.147459030151367, -20.14883804321289, -18.150218963623047, -16.151599884033203, -14.152981758117676, -12.154362678527832, -10.155743598937988, -8.157123565673828, -6.158504962921143, -4.159885406494141, -2.161266326904297, -0.16264724731445312, 1.8359718322753906, 3.8345909118652344, 5.833209991455078, 7.83182954788208, 9.830448150634766, 11.829068183898926, 13.82768726348877, 15.826306343078613, 17.82492446899414, 19.823543548583984, 21.822162628173828, 23.820781707763672, 25.819400787353516, 27.81801986694336, 29.816638946533203, 31.815258026123047, 33.813880920410156, 35.8125, 37.811119079589844, 39.80973815917969, 41.80835723876953, 43.806976318359375, 45.80559539794922, 47.80421447753906, 49.802833557128906, 51.80145263671875, 53.800071716308594, 55.79869079589844, 57.79730987548828, 59.795928955078125, 61.79454803466797, 63.79316711425781, 65.79178619384766]}, "gradients/decoder.transformer.h.6.mlp.c_proj.bias": {"_type": "histogram", "values": [3.0, 2.0, 0.0, 2.0, 2.0, 2.0, 3.0, 3.0, 5.0, 3.0, 4.0, 11.0, 9.0, 8.0, 15.0, 14.0, 17.0, 18.0, 19.0, 15.0, 20.0, 24.0, 32.0, 32.0, 41.0, 31.0, 41.0, 31.0, 31.0, 44.0, 43.0, 42.0, 37.0, 28.0, 48.0, 36.0, 36.0, 32.0, 32.0, 24.0, 30.0, 17.0, 23.0, 22.0, 13.0, 9.0, 12.0, 10.0, 8.0, 8.0, 4.0, 8.0, 5.0, 3.0, 2.0, 3.0, 0.0, 2.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-7.609375, -7.3603515625, -7.111328125, -6.8623046875, -6.61328125, -6.3642578125, -6.115234375, -5.8662109375, -5.6171875, -5.3681640625, -5.119140625, -4.8701171875, -4.62109375, -4.3720703125, -4.123046875, -3.8740234375, -3.625, -3.3759765625, -3.126953125, -2.8779296875, -2.62890625, -2.3798828125, -2.130859375, -1.8818359375, -1.6328125, -1.3837890625, -1.134765625, -0.8857421875, -0.63671875, -0.3876953125, -0.138671875, 0.1103515625, 0.359375, 0.6083984375, 0.857421875, 1.1064453125, 1.35546875, 1.6044921875, 1.853515625, 2.1025390625, 2.3515625, 2.6005859375, 2.849609375, 3.0986328125, 3.34765625, 3.5966796875, 3.845703125, 4.0947265625, 4.34375, 4.5927734375, 4.841796875, 5.0908203125, 5.33984375, 5.5888671875, 5.837890625, 6.0869140625, 6.3359375, 6.5849609375, 6.833984375, 7.0830078125, 7.33203125, 7.5810546875, 7.830078125, 8.0791015625, 8.328125]}, "gradients/decoder.transformer.h.6.mlp.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 1.0, 5.0, 5.0, 1.0, 6.0, 8.0, 7.0, 8.0, 16.0, 13.0, 18.0, 23.0, 27.0, 31.0, 27.0, 44.0, 59.0, 103.0, 144.0, 227.0, 464.0, 1309.0, 6632.0, 136597.0, 3115938.0, 907448.0, 20802.0, 2614.0, 771.0, 290.0, 187.0, 112.0, 67.0, 64.0, 38.0, 36.0, 31.0, 29.0, 13.0, 7.0, 9.0, 9.0, 15.0, 9.0, 4.0, 9.0, 4.0, 1.0, 0.0, 1.0, 1.0, 1.0, 6.0, 2.0, 2.0], "bins": [-27.65625, -26.83642578125, -26.0166015625, -25.19677734375, -24.376953125, -23.55712890625, -22.7373046875, -21.91748046875, -21.09765625, -20.27783203125, -19.4580078125, -18.63818359375, -17.818359375, -16.99853515625, -16.1787109375, -15.35888671875, -14.5390625, -13.71923828125, -12.8994140625, -12.07958984375, -11.259765625, -10.43994140625, -9.6201171875, -8.80029296875, -7.98046875, -7.16064453125, -6.3408203125, -5.52099609375, -4.701171875, -3.88134765625, -3.0615234375, -2.24169921875, -1.421875, -0.60205078125, 0.2177734375, 1.03759765625, 1.857421875, 2.67724609375, 3.4970703125, 4.31689453125, 5.13671875, 5.95654296875, 6.7763671875, 7.59619140625, 8.416015625, 9.23583984375, 10.0556640625, 10.87548828125, 11.6953125, 12.51513671875, 13.3349609375, 14.15478515625, 14.974609375, 15.79443359375, 16.6142578125, 17.43408203125, 18.25390625, 19.07373046875, 19.8935546875, 20.71337890625, 21.533203125, 22.35302734375, 23.1728515625, 23.99267578125, 24.8125]}, "gradients/decoder.transformer.h.6.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 2.0, 0.0, 1.0, 5.0, 6.0, 8.0, 10.0, 20.0, 24.0, 41.0, 84.0, 168.0, 274.0, 471.0, 767.0, 938.0, 620.0, 303.0, 153.0, 90.0, 42.0, 29.0, 17.0, 7.0, 3.0, 2.0, 0.0, 2.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-23.140625, -22.257568359375, -21.37451171875, -20.491455078125, -19.6083984375, -18.725341796875, -17.84228515625, -16.959228515625, -16.076171875, -15.193115234375, -14.31005859375, -13.427001953125, -12.5439453125, -11.660888671875, -10.77783203125, -9.894775390625, -9.01171875, -8.128662109375, -7.24560546875, -6.362548828125, -5.4794921875, -4.596435546875, -3.71337890625, -2.830322265625, -1.947265625, -1.064208984375, -0.18115234375, 0.701904296875, 1.5849609375, 2.468017578125, 3.35107421875, 4.234130859375, 5.1171875, 6.000244140625, 6.88330078125, 7.766357421875, 8.6494140625, 9.532470703125, 10.41552734375, 11.298583984375, 12.181640625, 13.064697265625, 13.94775390625, 14.830810546875, 15.7138671875, 16.596923828125, 17.47998046875, 18.363037109375, 19.24609375, 20.129150390625, 21.01220703125, 21.895263671875, 22.7783203125, 23.661376953125, 24.54443359375, 25.427490234375, 26.310546875, 27.193603515625, 28.07666015625, 28.959716796875, 29.8427734375, 30.725830078125, 31.60888671875, 32.491943359375, 33.375]}, "gradients/decoder.transformer.h.6.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 2.0, 1.0, 3.0, 5.0, 7.0, 6.0, 12.0, 17.0, 15.0, 16.0, 37.0, 39.0, 52.0, 64.0, 92.0, 123.0, 193.0, 274.0, 648.0, 2735.0, 369515.0, 3813176.0, 5116.0, 874.0, 448.0, 213.0, 151.0, 104.0, 98.0, 50.0, 49.0, 41.0, 33.0, 22.0, 18.0, 16.0, 12.0, 7.0, 1.0, 3.0, 1.0, 2.0, 0.0, 2.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-87.125, -84.4140625, -81.703125, -78.9921875, -76.28125, -73.5703125, -70.859375, -68.1484375, -65.4375, -62.7265625, -60.015625, -57.3046875, -54.59375, -51.8828125, -49.171875, -46.4609375, -43.75, -41.0390625, -38.328125, -35.6171875, -32.90625, -30.1953125, -27.484375, -24.7734375, -22.0625, -19.3515625, -16.640625, -13.9296875, -11.21875, -8.5078125, -5.796875, -3.0859375, -0.375, 2.3359375, 5.046875, 7.7578125, 10.46875, 13.1796875, 15.890625, 18.6015625, 21.3125, 24.0234375, 26.734375, 29.4453125, 32.15625, 34.8671875, 37.578125, 40.2890625, 43.0, 45.7109375, 48.421875, 51.1328125, 53.84375, 56.5546875, 59.265625, 61.9765625, 64.6875, 67.3984375, 70.109375, 72.8203125, 75.53125, 78.2421875, 80.953125, 83.6640625, 86.375]}, "gradients/decoder.transformer.h.6.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 9.0, 85.0, 362.0, 422.0, 125.0, 8.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-247.7996368408203, -241.9252471923828, -236.05084228515625, -230.17645263671875, -224.30206298828125, -218.42767333984375, -212.5532684326172, -206.6788787841797, -200.80447387695312, -194.93008422851562, -189.05567932128906, -183.18128967285156, -177.30690002441406, -171.4324951171875, -165.55810546875, -159.6837158203125, -153.809326171875, -147.9349365234375, -142.06053161621094, -136.18614196777344, -130.31175231933594, -124.4373550415039, -118.56295776367188, -112.68856811523438, -106.81417083740234, -100.93977355957031, -95.06538391113281, -89.19098663330078, -83.31658935546875, -77.44219970703125, -71.56780242919922, -65.69340515136719, -59.81901550292969, -53.94462203979492, -48.070228576660156, -42.195831298828125, -36.32143783569336, -30.447044372558594, -24.572647094726562, -18.698253631591797, -12.823860168457031, -6.949465751647949, -1.0750713348388672, 4.799324035644531, 10.673717498779297, 16.548110961914062, 22.422508239746094, 28.29690170288086, 34.171295166015625, 40.04568862915039, 45.920082092285156, 51.79447937011719, 57.66887283325195, 63.54326629638672, 69.41766357421875, 75.29205322265625, 81.16645050048828, 87.04084777832031, 92.91523742675781, 98.78963470458984, 104.66403198242188, 110.53842163085938, 116.4128189086914, 122.28721618652344, 128.16160583496094]}, "gradients/decoder.transformer.h.6.ln_2.bias": {"_type": "histogram", "values": [3.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 3.0, 2.0, 4.0, 12.0, 6.0, 3.0, 9.0, 11.0, 16.0, 10.0, 14.0, 23.0, 21.0, 22.0, 28.0, 33.0, 34.0, 38.0, 43.0, 45.0, 42.0, 40.0, 37.0, 32.0, 34.0, 55.0, 49.0, 34.0, 30.0, 26.0, 38.0, 27.0, 30.0, 19.0, 21.0, 20.0, 24.0, 23.0, 10.0, 13.0, 8.0, 7.0, 4.0, 4.0, 3.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-52.632354736328125, -50.9722900390625, -49.312225341796875, -47.65216064453125, -45.992095947265625, -44.33203125, -42.671966552734375, -41.01190185546875, -39.351837158203125, -37.6917724609375, -36.031707763671875, -34.37164306640625, -32.711578369140625, -31.051513671875, -29.391448974609375, -27.73138427734375, -26.071321487426758, -24.411256790161133, -22.751192092895508, -21.091127395629883, -19.431062698364258, -17.770999908447266, -16.11093521118164, -14.4508695602417, -12.790804862976074, -11.13074016571045, -9.470675468444824, -7.810611248016357, -6.150546550750732, -4.490482330322266, -2.8304176330566406, -1.1703529357910156, 0.4897117614746094, 2.1497764587402344, 3.8098409175872803, 5.469905376434326, 7.129970073699951, 8.790034294128418, 10.450098991394043, 12.110163688659668, 13.770228385925293, 15.430293083190918, 17.090356826782227, 18.75042152404785, 20.410486221313477, 22.0705509185791, 23.730615615844727, 25.39068031311035, 27.050745010375977, 28.7108097076416, 30.370874404907227, 32.03093719482422, 33.691001892089844, 35.35106658935547, 37.011131286621094, 38.67119598388672, 40.331260681152344, 41.99132537841797, 43.651390075683594, 45.31145477294922, 46.971519470214844, 48.63158416748047, 50.291648864746094, 51.95171356201172, 53.611778259277344]}, "gradients/decoder.transformer.h.6.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 4.0, 2.0, 4.0, 5.0, 6.0, 8.0, 6.0, 10.0, 7.0, 11.0, 7.0, 6.0, 10.0, 12.0, 19.0, 21.0, 22.0, 16.0, 23.0, 44.0, 30.0, 32.0, 44.0, 41.0, 37.0, 36.0, 38.0, 25.0, 34.0, 51.0, 35.0, 33.0, 31.0, 34.0, 33.0, 27.0, 27.0, 29.0, 22.0, 23.0, 14.0, 12.0, 13.0, 14.0, 15.0, 9.0, 10.0, 6.0, 3.0, 3.0, 5.0, 1.0, 5.0, 1.0, 2.0, 0.0, 1.0, 2.0], "bins": [-8.234375, -7.98736572265625, -7.7403564453125, -7.49334716796875, -7.246337890625, -6.99932861328125, -6.7523193359375, -6.50531005859375, -6.25830078125, -6.01129150390625, -5.7642822265625, -5.51727294921875, -5.270263671875, -5.02325439453125, -4.7762451171875, -4.52923583984375, -4.2822265625, -4.03521728515625, -3.7882080078125, -3.54119873046875, -3.294189453125, -3.04718017578125, -2.8001708984375, -2.55316162109375, -2.30615234375, -2.05914306640625, -1.8121337890625, -1.56512451171875, -1.318115234375, -1.07110595703125, -0.8240966796875, -0.57708740234375, -0.330078125, -0.08306884765625, 0.1639404296875, 0.41094970703125, 0.657958984375, 0.90496826171875, 1.1519775390625, 1.39898681640625, 1.64599609375, 1.89300537109375, 2.1400146484375, 2.38702392578125, 2.634033203125, 2.88104248046875, 3.1280517578125, 3.37506103515625, 3.6220703125, 3.86907958984375, 4.1160888671875, 4.36309814453125, 4.610107421875, 4.85711669921875, 5.1041259765625, 5.35113525390625, 5.59814453125, 5.84515380859375, 6.0921630859375, 6.33917236328125, 6.586181640625, 6.83319091796875, 7.0802001953125, 7.32720947265625, 7.57421875]}, "gradients/decoder.transformer.h.6.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 3.0, 0.0, 2.0, 0.0, 5.0, 8.0, 12.0, 17.0, 26.0, 44.0, 54.0, 119.0, 131.0, 219.0, 306.0, 450.0, 666.0, 947.0, 1354.0, 1867.0, 2555.0, 3444.0, 4928.0, 7212.0, 10004.0, 14393.0, 21940.0, 32906.0, 52278.0, 86694.0, 172936.0, 298384.0, 129049.0, 71699.0, 44276.0, 28025.0, 18872.0, 12712.0, 8725.0, 6104.0, 4340.0, 3137.0, 2243.0, 1648.0, 1189.0, 811.0, 615.0, 404.0, 294.0, 186.0, 123.0, 83.0, 38.0, 38.0, 23.0, 12.0, 10.0, 7.0, 2.0, 3.0, 1.0, 2.0], "bins": [-1.9638671875, -1.9049224853515625, -1.845977783203125, -1.7870330810546875, -1.72808837890625, -1.6691436767578125, -1.610198974609375, -1.5512542724609375, -1.4923095703125, -1.4333648681640625, -1.374420166015625, -1.3154754638671875, -1.25653076171875, -1.1975860595703125, -1.138641357421875, -1.0796966552734375, -1.020751953125, -0.9618072509765625, -0.902862548828125, -0.8439178466796875, -0.78497314453125, -0.7260284423828125, -0.667083740234375, -0.6081390380859375, -0.5491943359375, -0.4902496337890625, -0.431304931640625, -0.3723602294921875, -0.31341552734375, -0.2544708251953125, -0.195526123046875, -0.1365814208984375, -0.07763671875, -0.0186920166015625, 0.040252685546875, 0.0991973876953125, 0.15814208984375, 0.2170867919921875, 0.276031494140625, 0.3349761962890625, 0.3939208984375, 0.4528656005859375, 0.511810302734375, 0.5707550048828125, 0.62969970703125, 0.6886444091796875, 0.747589111328125, 0.8065338134765625, 0.865478515625, 0.9244232177734375, 0.983367919921875, 1.0423126220703125, 1.10125732421875, 1.1602020263671875, 1.219146728515625, 1.2780914306640625, 1.3370361328125, 1.3959808349609375, 1.454925537109375, 1.5138702392578125, 1.57281494140625, 1.6317596435546875, 1.690704345703125, 1.7496490478515625, 1.80859375]}, "gradients/decoder.transformer.h.6.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 3.0, 2.0, 6.0, 1.0, 2.0, 5.0, 5.0, 4.0, 8.0, 8.0, 9.0, 10.0, 14.0, 15.0, 18.0, 17.0, 20.0, 15.0, 23.0, 22.0, 31.0, 32.0, 26.0, 30.0, 35.0, 46.0, 37.0, 43.0, 1056.0, 36.0, 33.0, 27.0, 37.0, 35.0, 33.0, 43.0, 37.0, 23.0, 26.0, 16.0, 21.0, 22.0, 14.0, 22.0, 12.0, 7.0, 17.0, 13.0, 4.0, 8.0, 4.0, 3.0, 3.0, 1.0, 1.0, 3.0, 0.0, 1.0, 0.0, 1.0], "bins": [-4.91796875, -4.764892578125, -4.61181640625, -4.458740234375, -4.3056640625, -4.152587890625, -3.99951171875, -3.846435546875, -3.693359375, -3.540283203125, -3.38720703125, -3.234130859375, -3.0810546875, -2.927978515625, -2.77490234375, -2.621826171875, -2.46875, -2.315673828125, -2.16259765625, -2.009521484375, -1.8564453125, -1.703369140625, -1.55029296875, -1.397216796875, -1.244140625, -1.091064453125, -0.93798828125, -0.784912109375, -0.6318359375, -0.478759765625, -0.32568359375, -0.172607421875, -0.01953125, 0.133544921875, 0.28662109375, 0.439697265625, 0.5927734375, 0.745849609375, 0.89892578125, 1.052001953125, 1.205078125, 1.358154296875, 1.51123046875, 1.664306640625, 1.8173828125, 1.970458984375, 2.12353515625, 2.276611328125, 2.4296875, 2.582763671875, 2.73583984375, 2.888916015625, 3.0419921875, 3.195068359375, 3.34814453125, 3.501220703125, 3.654296875, 3.807373046875, 3.96044921875, 4.113525390625, 4.2666015625, 4.419677734375, 4.57275390625, 4.725830078125, 4.87890625]}, "gradients/decoder.transformer.h.6.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 3.0, 1.0, 1.0, 4.0, 1.0, 6.0, 11.0, 10.0, 12.0, 29.0, 21.0, 41.0, 59.0, 102.0, 156.0, 268.0, 369.0, 706.0, 1135.0, 2022.0, 3541.0, 6004.0, 10344.0, 18564.0, 34311.0, 64902.0, 132629.0, 1432323.0, 198833.0, 88331.0, 45971.0, 24408.0, 13560.0, 7700.0, 4421.0, 2590.0, 1496.0, 942.0, 505.0, 295.0, 195.0, 103.0, 81.0, 31.0, 34.0, 21.0, 15.0, 8.0, 7.0, 8.0, 4.0, 2.0, 1.0, 5.0, 3.0, 3.0, 0.0, 1.0, 0.0, 1.0], "bins": [-2.6640625, -2.580078125, -2.49609375, -2.412109375, -2.328125, -2.244140625, -2.16015625, -2.076171875, -1.9921875, -1.908203125, -1.82421875, -1.740234375, -1.65625, -1.572265625, -1.48828125, -1.404296875, -1.3203125, -1.236328125, -1.15234375, -1.068359375, -0.984375, -0.900390625, -0.81640625, -0.732421875, -0.6484375, -0.564453125, -0.48046875, -0.396484375, -0.3125, -0.228515625, -0.14453125, -0.060546875, 0.0234375, 0.107421875, 0.19140625, 0.275390625, 0.359375, 0.443359375, 0.52734375, 0.611328125, 0.6953125, 0.779296875, 0.86328125, 0.947265625, 1.03125, 1.115234375, 1.19921875, 1.283203125, 1.3671875, 1.451171875, 1.53515625, 1.619140625, 1.703125, 1.787109375, 1.87109375, 1.955078125, 2.0390625, 2.123046875, 2.20703125, 2.291015625, 2.375, 2.458984375, 2.54296875, 2.626953125, 2.7109375]}, "gradients/decoder.transformer.h.6.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 3.0, 5.0, 6.0, 5.0, 6.0, 3.0, 7.0, 16.0, 13.0, 14.0, 20.0, 21.0, 20.0, 30.0, 50.0, 51.0, 71.0, 97.0, 91.0, 99.0, 76.0, 56.0, 50.0, 43.0, 37.0, 28.0, 19.0, 15.0, 13.0, 8.0, 6.0, 9.0, 5.0, 8.0, 4.0, 3.0, 1.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0016326904296875, -0.0015811324119567871, -0.0015295743942260742, -0.0014780163764953613, -0.0014264583587646484, -0.0013749003410339355, -0.0013233423233032227, -0.0012717843055725098, -0.0012202262878417969, -0.001168668270111084, -0.001117110252380371, -0.0010655522346496582, -0.0010139942169189453, -0.0009624361991882324, -0.0009108781814575195, -0.0008593201637268066, -0.0008077621459960938, -0.0007562041282653809, -0.000704646110534668, -0.0006530880928039551, -0.0006015300750732422, -0.0005499720573425293, -0.0004984140396118164, -0.0004468560218811035, -0.0003952980041503906, -0.00034373998641967773, -0.00029218196868896484, -0.00024062395095825195, -0.00018906593322753906, -0.00013750791549682617, -8.594989776611328e-05, -3.439188003540039e-05, 1.71661376953125e-05, 6.872415542602539e-05, 0.00012028217315673828, 0.00017184019088745117, 0.00022339820861816406, 0.00027495622634887695, 0.00032651424407958984, 0.00037807226181030273, 0.0004296302795410156, 0.0004811882972717285, 0.0005327463150024414, 0.0005843043327331543, 0.0006358623504638672, 0.0006874203681945801, 0.000738978385925293, 0.0007905364036560059, 0.0008420944213867188, 0.0008936524391174316, 0.0009452104568481445, 0.0009967684745788574, 0.0010483264923095703, 0.0010998845100402832, 0.001151442527770996, 0.001203000545501709, 0.0012545585632324219, 0.0013061165809631348, 0.0013576745986938477, 0.0014092326164245605, 0.0014607906341552734, 0.0015123486518859863, 0.0015639066696166992, 0.0016154646873474121, 0.001667022705078125]}, "gradients/decoder.transformer.h.6.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 2.0, 2.0, 1.0, 0.0, 2.0, 1.0, 3.0, 3.0, 3.0, 3.0, 5.0, 8.0, 10.0, 12.0, 16.0, 16.0, 21.0, 41.0, 42.0, 58.0, 76.0, 143.0, 209.0, 526.0, 3518.0, 1038083.0, 4543.0, 504.0, 233.0, 137.0, 86.0, 60.0, 45.0, 22.0, 28.0, 16.0, 21.0, 13.0, 9.0, 9.0, 9.0, 9.0, 4.0, 5.0, 5.0, 4.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.037384033203125, -0.036269187927246094, -0.03515434265136719, -0.03403949737548828, -0.032924652099609375, -0.03180980682373047, -0.030694961547851562, -0.029580116271972656, -0.02846527099609375, -0.027350425720214844, -0.026235580444335938, -0.02512073516845703, -0.024005889892578125, -0.02289104461669922, -0.021776199340820312, -0.020661354064941406, -0.0195465087890625, -0.018431663513183594, -0.017316818237304688, -0.01620197296142578, -0.015087127685546875, -0.013972282409667969, -0.012857437133789062, -0.011742591857910156, -0.01062774658203125, -0.009512901306152344, -0.008398056030273438, -0.007283210754394531, -0.006168365478515625, -0.005053520202636719, -0.0039386749267578125, -0.0028238296508789062, -0.001708984375, -0.0005941390991210938, 0.0005207061767578125, 0.0016355514526367188, 0.002750396728515625, 0.0038652420043945312, 0.0049800872802734375, 0.006094932556152344, 0.00720977783203125, 0.008324623107910156, 0.009439468383789062, 0.010554313659667969, 0.011669158935546875, 0.012784004211425781, 0.013898849487304688, 0.015013694763183594, 0.0161285400390625, 0.017243385314941406, 0.018358230590820312, 0.01947307586669922, 0.020587921142578125, 0.02170276641845703, 0.022817611694335938, 0.023932456970214844, 0.02504730224609375, 0.026162147521972656, 0.027276992797851562, 0.02839183807373047, 0.029506683349609375, 0.03062152862548828, 0.03173637390136719, 0.032851219177246094, 0.033966064453125]}, "gradients/decoder.transformer.h.6.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 30.0, 632.0, 338.0, 16.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0016195472562685609, -0.001454607816413045, -0.0012896682601422071, -0.0011247288202866912, -0.0009597893222235143, -0.0007948498241603374, -0.0006299103843048215, -0.0004649708280339837, -0.00030003138817846775, -0.0001350919046672061, 2.9847578844055533e-05, 0.00019478704780340195, 0.0003597265458665788, 0.0005246660439297557, 0.0006896054837852716, 0.0008545450400561094, 0.0010194844799116254, 0.0011844239197671413, 0.0013493634760379791, 0.001514302915893495, 0.001679242355749011, 0.0018441819120198488, 0.002009121235460043, 0.0021740607917308807, 0.0023390003480017185, 0.0025039399042725563, 0.0026688792277127504, 0.0028338187839835882, 0.002998758340254426, 0.00316369766369462, 0.003328637219965458, 0.0034935767762362957, 0.003658515866845846, 0.003823455423116684, 0.003988394979387522, 0.0041533345356583595, 0.00431827362626791, 0.004483213182538748, 0.004648152738809586, 0.004813092295080423, 0.004978031851351261, 0.005142971407622099, 0.005307910963892937, 0.005472850054502487, 0.005637789610773325, 0.005802729167044163, 0.0059676687233150005, 0.006132608279585838, 0.006297547370195389, 0.006462486926466227, 0.006627426482737064, 0.006792365573346615, 0.006957305129617453, 0.00712224468588829, 0.007287184242159128, 0.007452123798429966, 0.007617063354700804, 0.0077820029109716415, 0.007946942001581192, 0.008111882023513317, 0.008276821114122868, 0.008441761136054993, 0.008606700226664543, 0.008771639317274094, 0.008936579339206219]}, "gradients/decoder.transformer.h.6.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 3.0, 2.0, 4.0, 6.0, 11.0, 3.0, 7.0, 9.0, 13.0, 8.0, 18.0, 14.0, 17.0, 28.0, 27.0, 22.0, 36.0, 33.0, 47.0, 37.0, 29.0, 32.0, 38.0, 35.0, 38.0, 41.0, 30.0, 29.0, 31.0, 31.0, 35.0, 39.0, 35.0, 26.0, 26.0, 22.0, 25.0, 19.0, 19.0, 18.0, 11.0, 13.0, 7.0, 8.0, 9.0, 8.0, 6.0, 3.0, 3.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0], "bins": [-0.0006828904151916504, -0.0006628455594182014, -0.0006428007036447525, -0.0006227558478713036, -0.0006027109920978546, -0.0005826661363244057, -0.0005626212805509567, -0.0005425764247775078, -0.0005225315690040588, -0.0005024867132306099, -0.00048244185745716095, -0.000462397001683712, -0.00044235214591026306, -0.0004223072901368141, -0.0004022624343633652, -0.00038221757858991623, -0.0003621727228164673, -0.00034212786704301834, -0.0003220830112695694, -0.00030203815549612045, -0.0002819932997226715, -0.00026194844394922256, -0.00024190358817577362, -0.00022185873240232468, -0.00020181387662887573, -0.0001817690208554268, -0.00016172416508197784, -0.0001416793093085289, -0.00012163445353507996, -0.00010158959776163101, -8.154474198818207e-05, -6.149988621473312e-05, -4.145503044128418e-05, -2.1410174667835236e-05, -1.3653188943862915e-06, 1.8679536879062653e-05, 3.87243926525116e-05, 5.876924842596054e-05, 7.881410419940948e-05, 9.885895997285843e-05, 0.00011890381574630737, 0.00013894867151975632, 0.00015899352729320526, 0.0001790383830666542, 0.00019908323884010315, 0.0002191280946135521, 0.00023917295038700104, 0.00025921780616045, 0.0002792626619338989, 0.00029930751770734787, 0.0003193523734807968, 0.00033939722925424576, 0.0003594420850276947, 0.00037948694080114365, 0.0003995317965745926, 0.00041957665234804153, 0.0004396215081214905, 0.0004596663638949394, 0.00047971121966838837, 0.0004997560754418373, 0.0005198009312152863, 0.0005398457869887352, 0.0005598906427621841, 0.0005799354985356331, 0.000599980354309082]}, "gradients/decoder.transformer.h.6.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 4.0, 2.0, 4.0, 5.0, 6.0, 8.0, 6.0, 10.0, 7.0, 11.0, 7.0, 6.0, 10.0, 12.0, 19.0, 21.0, 22.0, 16.0, 23.0, 44.0, 30.0, 32.0, 44.0, 41.0, 37.0, 36.0, 38.0, 25.0, 34.0, 51.0, 35.0, 33.0, 31.0, 34.0, 33.0, 27.0, 27.0, 29.0, 22.0, 23.0, 14.0, 12.0, 13.0, 14.0, 15.0, 9.0, 10.0, 6.0, 3.0, 3.0, 5.0, 1.0, 5.0, 1.0, 2.0, 0.0, 1.0, 2.0], "bins": [-8.234375, -7.98736572265625, -7.7403564453125, -7.49334716796875, -7.246337890625, -6.99932861328125, -6.7523193359375, -6.50531005859375, -6.25830078125, -6.01129150390625, -5.7642822265625, -5.51727294921875, -5.270263671875, -5.02325439453125, -4.7762451171875, -4.52923583984375, -4.2822265625, -4.03521728515625, -3.7882080078125, -3.54119873046875, -3.294189453125, -3.04718017578125, -2.8001708984375, -2.55316162109375, -2.30615234375, -2.05914306640625, -1.8121337890625, -1.56512451171875, -1.318115234375, -1.07110595703125, -0.8240966796875, -0.57708740234375, -0.330078125, -0.08306884765625, 0.1639404296875, 0.41094970703125, 0.657958984375, 0.90496826171875, 1.1519775390625, 1.39898681640625, 1.64599609375, 1.89300537109375, 2.1400146484375, 2.38702392578125, 2.634033203125, 2.88104248046875, 3.1280517578125, 3.37506103515625, 3.6220703125, 3.86907958984375, 4.1160888671875, 4.36309814453125, 4.610107421875, 4.85711669921875, 5.1041259765625, 5.35113525390625, 5.59814453125, 5.84515380859375, 6.0921630859375, 6.33917236328125, 6.586181640625, 6.83319091796875, 7.0802001953125, 7.32720947265625, 7.57421875]}, "gradients/decoder.transformer.h.6.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 3.0, 0.0, 1.0, 1.0, 4.0, 4.0, 8.0, 6.0, 11.0, 15.0, 14.0, 18.0, 20.0, 21.0, 22.0, 18.0, 30.0, 33.0, 77.0, 91.0, 103.0, 135.0, 206.0, 248.0, 337.0, 480.0, 593.0, 1019.0, 2604.0, 14085.0, 135739.0, 825995.0, 54401.0, 7427.0, 1791.0, 807.0, 525.0, 382.0, 268.0, 216.0, 188.0, 133.0, 111.0, 77.0, 68.0, 36.0, 48.0, 31.0, 31.0, 24.0, 18.0, 10.0, 6.0, 8.0, 8.0, 5.0, 6.0, 1.0, 4.0, 1.0, 1.0, 2.0], "bins": [-20.8125, -20.18798828125, -19.5634765625, -18.93896484375, -18.314453125, -17.68994140625, -17.0654296875, -16.44091796875, -15.81640625, -15.19189453125, -14.5673828125, -13.94287109375, -13.318359375, -12.69384765625, -12.0693359375, -11.44482421875, -10.8203125, -10.19580078125, -9.5712890625, -8.94677734375, -8.322265625, -7.69775390625, -7.0732421875, -6.44873046875, -5.82421875, -5.19970703125, -4.5751953125, -3.95068359375, -3.326171875, -2.70166015625, -2.0771484375, -1.45263671875, -0.828125, -0.20361328125, 0.4208984375, 1.04541015625, 1.669921875, 2.29443359375, 2.9189453125, 3.54345703125, 4.16796875, 4.79248046875, 5.4169921875, 6.04150390625, 6.666015625, 7.29052734375, 7.9150390625, 8.53955078125, 9.1640625, 9.78857421875, 10.4130859375, 11.03759765625, 11.662109375, 12.28662109375, 12.9111328125, 13.53564453125, 14.16015625, 14.78466796875, 15.4091796875, 16.03369140625, 16.658203125, 17.28271484375, 17.9072265625, 18.53173828125, 19.15625]}, "gradients/decoder.transformer.h.6.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 2.0, 2.0, 2.0, 3.0, 2.0, 6.0, 8.0, 5.0, 12.0, 16.0, 8.0, 11.0, 14.0, 14.0, 18.0, 22.0, 24.0, 26.0, 27.0, 30.0, 32.0, 36.0, 46.0, 37.0, 76.0, 417.0, 1611.0, 108.0, 61.0, 53.0, 50.0, 37.0, 34.0, 29.0, 21.0, 26.0, 18.0, 16.0, 16.0, 18.0, 18.0, 7.0, 6.0, 5.0, 8.0, 6.0, 5.0, 4.0, 5.0, 3.0, 0.0, 0.0, 0.0, 0.0, 5.0], "bins": [-28.28125, -27.475830078125, -26.67041015625, -25.864990234375, -25.0595703125, -24.254150390625, -23.44873046875, -22.643310546875, -21.837890625, -21.032470703125, -20.22705078125, -19.421630859375, -18.6162109375, -17.810791015625, -17.00537109375, -16.199951171875, -15.39453125, -14.589111328125, -13.78369140625, -12.978271484375, -12.1728515625, -11.367431640625, -10.56201171875, -9.756591796875, -8.951171875, -8.145751953125, -7.34033203125, -6.534912109375, -5.7294921875, -4.924072265625, -4.11865234375, -3.313232421875, -2.5078125, -1.702392578125, -0.89697265625, -0.091552734375, 0.7138671875, 1.519287109375, 2.32470703125, 3.130126953125, 3.935546875, 4.740966796875, 5.54638671875, 6.351806640625, 7.1572265625, 7.962646484375, 8.76806640625, 9.573486328125, 10.37890625, 11.184326171875, 11.98974609375, 12.795166015625, 13.6005859375, 14.406005859375, 15.21142578125, 16.016845703125, 16.822265625, 17.627685546875, 18.43310546875, 19.238525390625, 20.0439453125, 20.849365234375, 21.65478515625, 22.460205078125, 23.265625]}, "gradients/decoder.transformer.h.6.attn.c_attn.weight": {"_type": "histogram", "values": [2.0, 3.0, 0.0, 0.0, 0.0, 4.0, 4.0, 4.0, 5.0, 4.0, 8.0, 7.0, 5.0, 8.0, 13.0, 20.0, 23.0, 16.0, 20.0, 31.0, 28.0, 43.0, 56.0, 76.0, 117.0, 227.0, 511.0, 1902.0, 2962450.0, 177885.0, 1211.0, 375.0, 187.0, 113.0, 68.0, 52.0, 40.0, 27.0, 28.0, 27.0, 15.0, 14.0, 17.0, 10.0, 7.0, 12.0, 18.0, 5.0, 9.0, 4.0, 4.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-65.625, -63.353515625, -61.08203125, -58.810546875, -56.5390625, -54.267578125, -51.99609375, -49.724609375, -47.453125, -45.181640625, -42.91015625, -40.638671875, -38.3671875, -36.095703125, -33.82421875, -31.552734375, -29.28125, -27.009765625, -24.73828125, -22.466796875, -20.1953125, -17.923828125, -15.65234375, -13.380859375, -11.109375, -8.837890625, -6.56640625, -4.294921875, -2.0234375, 0.248046875, 2.51953125, 4.791015625, 7.0625, 9.333984375, 11.60546875, 13.876953125, 16.1484375, 18.419921875, 20.69140625, 22.962890625, 25.234375, 27.505859375, 29.77734375, 32.048828125, 34.3203125, 36.591796875, 38.86328125, 41.134765625, 43.40625, 45.677734375, 47.94921875, 50.220703125, 52.4921875, 54.763671875, 57.03515625, 59.306640625, 61.578125, 63.849609375, 66.12109375, 68.392578125, 70.6640625, 72.935546875, 75.20703125, 77.478515625, 79.75]}, "gradients/decoder.transformer.h.6.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 18.0, 346.0, 547.0, 99.0, 7.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-156.63739013671875, -153.6923370361328, -150.74729919433594, -147.80224609375, -144.85719299316406, -141.9121551513672, -138.96710205078125, -136.02206420898438, -133.07701110839844, -130.1319580078125, -127.1869125366211, -124.24186706542969, -121.29682159423828, -118.35177612304688, -115.40672302246094, -112.46167755126953, -109.51663208007812, -106.57158660888672, -103.62653350830078, -100.68148803710938, -97.73644256591797, -94.79139709472656, -91.84634399414062, -88.90129852294922, -85.95624542236328, -83.01119995117188, -80.06614685058594, -77.12110137939453, -74.17605590820312, -71.23101043701172, -68.28595733642578, -65.34091186523438, -62.39586639404297, -59.4508171081543, -56.50577163696289, -53.56072235107422, -50.61567687988281, -47.67062759399414, -44.72557830810547, -41.78053283691406, -38.83548355102539, -35.89043426513672, -32.94538879394531, -30.00033950805664, -27.055294036865234, -24.110244750976562, -21.165197372436523, -18.220149993896484, -15.275102615356445, -12.330055236816406, -9.385007858276367, -6.439959526062012, -3.4949121475219727, -0.5498647689819336, 2.395183563232422, 5.340230941772461, 8.2852783203125, 11.230325698852539, 14.175373077392578, 17.12042236328125, 20.065467834472656, 23.010517120361328, 25.955564498901367, 28.900611877441406, 31.845659255981445]}, "gradients/decoder.transformer.h.6.ln_1.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 0.0, 0.0, 2.0, 3.0, 4.0, 1.0, 1.0, 4.0, 4.0, 2.0, 6.0, 5.0, 7.0, 12.0, 9.0, 15.0, 12.0, 20.0, 24.0, 13.0, 19.0, 26.0, 24.0, 15.0, 40.0, 38.0, 37.0, 37.0, 52.0, 41.0, 39.0, 54.0, 39.0, 34.0, 33.0, 36.0, 34.0, 43.0, 25.0, 20.0, 20.0, 27.0, 18.0, 20.0, 14.0, 19.0, 16.0, 11.0, 5.0, 11.0, 8.0, 4.0, 5.0, 1.0, 3.0, 2.0, 3.0, 1.0, 1.0, 0.0, 2.0], "bins": [-67.83663177490234, -65.82483673095703, -63.81304168701172, -61.801246643066406, -59.789451599121094, -57.77765655517578, -55.76586151123047, -53.754066467285156, -51.742271423339844, -49.73047637939453, -47.71868133544922, -45.706886291503906, -43.695091247558594, -41.68329620361328, -39.67150115966797, -37.659706115722656, -35.64790725708008, -33.636112213134766, -31.624317169189453, -29.61252212524414, -27.600727081298828, -25.588932037353516, -23.57713508605957, -21.565340042114258, -19.553544998168945, -17.541749954223633, -15.52995491027832, -13.518158912658691, -11.506363868713379, -9.494568824768066, -7.4827728271484375, -5.470977783203125, -3.459186553955078, -1.4473912715911865, 0.5644040107727051, 2.576199531555176, 4.587994575500488, 6.599789619445801, 8.61158561706543, 10.623380661010742, 12.635175704956055, 14.646970748901367, 16.65876579284668, 18.670562744140625, 20.682357788085938, 22.69415283203125, 24.705947875976562, 26.717742919921875, 28.729537963867188, 30.7413330078125, 32.75312805175781, 34.764923095703125, 36.77671813964844, 38.78851318359375, 40.80030822753906, 42.812103271484375, 44.82389831542969, 46.835693359375, 48.84748840332031, 50.859283447265625, 52.87107849121094, 54.88287353515625, 56.89466857910156, 58.906463623046875, 60.91826248168945]}, "gradients/decoder.transformer.h.5.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 4.0, 2.0, 6.0, 7.0, 2.0, 10.0, 9.0, 8.0, 9.0, 8.0, 7.0, 15.0, 22.0, 27.0, 17.0, 16.0, 21.0, 27.0, 30.0, 36.0, 25.0, 54.0, 31.0, 40.0, 36.0, 33.0, 33.0, 45.0, 39.0, 31.0, 36.0, 32.0, 32.0, 30.0, 38.0, 26.0, 22.0, 20.0, 22.0, 13.0, 16.0, 19.0, 9.0, 9.0, 11.0, 5.0, 5.0, 7.0, 4.0, 3.0, 2.0, 1.0, 3.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-8.046875, -7.795166015625, -7.54345703125, -7.291748046875, -7.0400390625, -6.788330078125, -6.53662109375, -6.284912109375, -6.033203125, -5.781494140625, -5.52978515625, -5.278076171875, -5.0263671875, -4.774658203125, -4.52294921875, -4.271240234375, -4.01953125, -3.767822265625, -3.51611328125, -3.264404296875, -3.0126953125, -2.760986328125, -2.50927734375, -2.257568359375, -2.005859375, -1.754150390625, -1.50244140625, -1.250732421875, -0.9990234375, -0.747314453125, -0.49560546875, -0.243896484375, 0.0078125, 0.259521484375, 0.51123046875, 0.762939453125, 1.0146484375, 1.266357421875, 1.51806640625, 1.769775390625, 2.021484375, 2.273193359375, 2.52490234375, 2.776611328125, 3.0283203125, 3.280029296875, 3.53173828125, 3.783447265625, 4.03515625, 4.286865234375, 4.53857421875, 4.790283203125, 5.0419921875, 5.293701171875, 5.54541015625, 5.797119140625, 6.048828125, 6.300537109375, 6.55224609375, 6.803955078125, 7.0556640625, 7.307373046875, 7.55908203125, 7.810791015625, 8.0625]}, "gradients/decoder.transformer.h.5.mlp.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 5.0, 2.0, 6.0, 6.0, 3.0, 5.0, 6.0, 4.0, 9.0, 12.0, 12.0, 20.0, 20.0, 36.0, 40.0, 48.0, 53.0, 88.0, 111.0, 160.0, 186.0, 268.0, 401.0, 774.0, 1935.0, 6569.0, 41341.0, 983586.0, 2886087.0, 248857.0, 16770.0, 3683.0, 1257.0, 606.0, 376.0, 217.0, 181.0, 129.0, 88.0, 69.0, 64.0, 46.0, 38.0, 23.0, 22.0, 19.0, 12.0, 13.0, 8.0, 3.0, 9.0, 5.0, 6.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0], "bins": [-21.90625, -21.2060546875, -20.505859375, -19.8056640625, -19.10546875, -18.4052734375, -17.705078125, -17.0048828125, -16.3046875, -15.6044921875, -14.904296875, -14.2041015625, -13.50390625, -12.8037109375, -12.103515625, -11.4033203125, -10.703125, -10.0029296875, -9.302734375, -8.6025390625, -7.90234375, -7.2021484375, -6.501953125, -5.8017578125, -5.1015625, -4.4013671875, -3.701171875, -3.0009765625, -2.30078125, -1.6005859375, -0.900390625, -0.2001953125, 0.5, 1.2001953125, 1.900390625, 2.6005859375, 3.30078125, 4.0009765625, 4.701171875, 5.4013671875, 6.1015625, 6.8017578125, 7.501953125, 8.2021484375, 8.90234375, 9.6025390625, 10.302734375, 11.0029296875, 11.703125, 12.4033203125, 13.103515625, 13.8037109375, 14.50390625, 15.2041015625, 15.904296875, 16.6044921875, 17.3046875, 18.0048828125, 18.705078125, 19.4052734375, 20.10546875, 20.8056640625, 21.505859375, 22.2060546875, 22.90625]}, "gradients/decoder.transformer.h.5.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 1.0, 2.0, 4.0, 6.0, 10.0, 16.0, 30.0, 45.0, 85.0, 138.0, 249.0, 449.0, 715.0, 891.0, 644.0, 355.0, 183.0, 116.0, 62.0, 32.0, 21.0, 14.0, 3.0, 6.0, 5.0, 0.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-27.3125, -26.518310546875, -25.72412109375, -24.929931640625, -24.1357421875, -23.341552734375, -22.54736328125, -21.753173828125, -20.958984375, -20.164794921875, -19.37060546875, -18.576416015625, -17.7822265625, -16.988037109375, -16.19384765625, -15.399658203125, -14.60546875, -13.811279296875, -13.01708984375, -12.222900390625, -11.4287109375, -10.634521484375, -9.84033203125, -9.046142578125, -8.251953125, -7.457763671875, -6.66357421875, -5.869384765625, -5.0751953125, -4.281005859375, -3.48681640625, -2.692626953125, -1.8984375, -1.104248046875, -0.31005859375, 0.484130859375, 1.2783203125, 2.072509765625, 2.86669921875, 3.660888671875, 4.455078125, 5.249267578125, 6.04345703125, 6.837646484375, 7.6318359375, 8.426025390625, 9.22021484375, 10.014404296875, 10.80859375, 11.602783203125, 12.39697265625, 13.191162109375, 13.9853515625, 14.779541015625, 15.57373046875, 16.367919921875, 17.162109375, 17.956298828125, 18.75048828125, 19.544677734375, 20.3388671875, 21.133056640625, 21.92724609375, 22.721435546875, 23.515625]}, "gradients/decoder.transformer.h.5.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 1.0, 3.0, 3.0, 2.0, 3.0, 2.0, 9.0, 9.0, 14.0, 19.0, 26.0, 37.0, 41.0, 71.0, 78.0, 136.0, 172.0, 312.0, 664.0, 2067.0, 48471.0, 4110100.0, 28571.0, 1946.0, 595.0, 293.0, 174.0, 129.0, 100.0, 70.0, 46.0, 31.0, 24.0, 9.0, 18.0, 10.0, 9.0, 9.0, 9.0, 3.0, 3.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-70.0625, -67.68359375, -65.3046875, -62.92578125, -60.546875, -58.16796875, -55.7890625, -53.41015625, -51.03125, -48.65234375, -46.2734375, -43.89453125, -41.515625, -39.13671875, -36.7578125, -34.37890625, -32.0, -29.62109375, -27.2421875, -24.86328125, -22.484375, -20.10546875, -17.7265625, -15.34765625, -12.96875, -10.58984375, -8.2109375, -5.83203125, -3.453125, -1.07421875, 1.3046875, 3.68359375, 6.0625, 8.44140625, 10.8203125, 13.19921875, 15.578125, 17.95703125, 20.3359375, 22.71484375, 25.09375, 27.47265625, 29.8515625, 32.23046875, 34.609375, 36.98828125, 39.3671875, 41.74609375, 44.125, 46.50390625, 48.8828125, 51.26171875, 53.640625, 56.01953125, 58.3984375, 60.77734375, 63.15625, 65.53515625, 67.9140625, 70.29296875, 72.671875, 75.05078125, 77.4296875, 79.80859375, 82.1875]}, "gradients/decoder.transformer.h.5.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 414.0, 598.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-910.3716430664062, -885.8063354492188, -861.2410888671875, -836.67578125, -812.1104736328125, -787.5452270507812, -762.9799194335938, -738.4146728515625, -713.849365234375, -689.2840576171875, -664.7188110351562, -640.1535034179688, -615.5882568359375, -591.02294921875, -566.4576416015625, -541.892333984375, -517.3270874023438, -492.7618103027344, -468.196533203125, -443.6312255859375, -419.0659484863281, -394.50067138671875, -369.93536376953125, -345.3700866699219, -320.8048095703125, -296.2395324707031, -271.67425537109375, -247.10894775390625, -222.54367065429688, -197.9783935546875, -173.41310119628906, -148.84780883789062, -124.2825927734375, -99.7173080444336, -75.15202331542969, -50.58673858642578, -26.021453857421875, -1.4561691284179688, 23.109115600585938, 47.674407958984375, 72.23968505859375, 96.80496978759766, 121.37025451660156, 145.935546875, 170.50082397460938, 195.06610107421875, 219.6313934326172, 244.19668579101562, 268.761962890625, 293.3272399902344, 317.89251708984375, 342.45782470703125, 367.0231018066406, 391.58837890625, 416.1536865234375, 440.7189636230469, 465.28424072265625, 489.8495178222656, 514.414794921875, 538.9801025390625, 563.54541015625, 588.1106567382812, 612.6759643554688, 637.2412109375, 661.8065185546875]}, "gradients/decoder.transformer.h.5.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 2.0, 0.0, 8.0, 2.0, 6.0, 4.0, 7.0, 4.0, 11.0, 14.0, 22.0, 19.0, 26.0, 26.0, 23.0, 29.0, 35.0, 34.0, 32.0, 33.0, 41.0, 40.0, 46.0, 41.0, 42.0, 34.0, 43.0, 42.0, 47.0, 42.0, 48.0, 30.0, 31.0, 18.0, 19.0, 21.0, 25.0, 25.0, 13.0, 11.0, 4.0, 2.0, 7.0, 2.0, 4.0, 1.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-48.35646057128906, -46.61703109741211, -44.877601623535156, -43.1381721496582, -41.39874267578125, -39.6593132019043, -37.919883728027344, -36.180450439453125, -34.44102478027344, -32.701595306396484, -30.96216583251953, -29.222736358642578, -27.483306884765625, -25.743877410888672, -24.004446029663086, -22.265016555786133, -20.525585174560547, -18.786155700683594, -17.04672622680664, -15.307295799255371, -13.567866325378418, -11.828436851501465, -10.089006423950195, -8.349576950073242, -6.610147476196289, -4.870718002319336, -3.1312880516052246, -1.3918581008911133, 0.34757137298583984, 2.087000846862793, 3.8264312744140625, 5.565860748291016, 7.305290222167969, 9.044719696044922, 10.784149169921875, 12.523579597473145, 14.263009071350098, 16.002437591552734, 17.74186897277832, 19.481298446655273, 21.220727920532227, 22.96015739440918, 24.699586868286133, 26.43901824951172, 28.178447723388672, 29.917877197265625, 31.657306671142578, 33.39673614501953, 35.136165618896484, 36.87559509277344, 38.61502456665039, 40.354454040527344, 42.0938835144043, 43.83331298828125, 45.57274627685547, 47.312171936035156, 49.051605224609375, 50.79103469848633, 52.53046417236328, 54.269893646240234, 56.00932312011719, 57.74875259399414, 59.488182067871094, 61.22761535644531, 62.967041015625]}, "gradients/decoder.transformer.h.5.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 1.0, 3.0, 3.0, 5.0, 4.0, 5.0, 7.0, 7.0, 9.0, 17.0, 11.0, 19.0, 18.0, 26.0, 25.0, 33.0, 16.0, 27.0, 33.0, 35.0, 40.0, 34.0, 48.0, 30.0, 41.0, 33.0, 41.0, 41.0, 41.0, 47.0, 36.0, 34.0, 32.0, 32.0, 20.0, 27.0, 22.0, 15.0, 20.0, 16.0, 11.0, 10.0, 9.0, 11.0, 3.0, 4.0, 3.0, 2.0, 3.0, 2.0, 0.0, 1.0, 1.0], "bins": [-10.2265625, -9.9412841796875, -9.656005859375, -9.3707275390625, -9.08544921875, -8.8001708984375, -8.514892578125, -8.2296142578125, -7.9443359375, -7.6590576171875, -7.373779296875, -7.0885009765625, -6.80322265625, -6.5179443359375, -6.232666015625, -5.9473876953125, -5.662109375, -5.3768310546875, -5.091552734375, -4.8062744140625, -4.52099609375, -4.2357177734375, -3.950439453125, -3.6651611328125, -3.3798828125, -3.0946044921875, -2.809326171875, -2.5240478515625, -2.23876953125, -1.9534912109375, -1.668212890625, -1.3829345703125, -1.09765625, -0.8123779296875, -0.527099609375, -0.2418212890625, 0.04345703125, 0.3287353515625, 0.614013671875, 0.8992919921875, 1.1845703125, 1.4698486328125, 1.755126953125, 2.0404052734375, 2.32568359375, 2.6109619140625, 2.896240234375, 3.1815185546875, 3.466796875, 3.7520751953125, 4.037353515625, 4.3226318359375, 4.60791015625, 4.8931884765625, 5.178466796875, 5.4637451171875, 5.7490234375, 6.0343017578125, 6.319580078125, 6.6048583984375, 6.89013671875, 7.1754150390625, 7.460693359375, 7.7459716796875, 8.03125]}, "gradients/decoder.transformer.h.5.crossattention.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 2.0, 1.0, 6.0, 0.0, 10.0, 8.0, 8.0, 12.0, 16.0, 25.0, 37.0, 47.0, 100.0, 87.0, 160.0, 240.0, 304.0, 497.0, 685.0, 1111.0, 1575.0, 2588.0, 3909.0, 5974.0, 9387.0, 14563.0, 22785.0, 36158.0, 59710.0, 103222.0, 235543.0, 270962.0, 110918.0, 62424.0, 38086.0, 24250.0, 15105.0, 9676.0, 6319.0, 4106.0, 2676.0, 1741.0, 1159.0, 789.0, 494.0, 379.0, 214.0, 178.0, 110.0, 60.0, 49.0, 36.0, 23.0, 20.0, 14.0, 3.0, 1.0, 7.0, 1.0, 2.0, 0.0, 2.0], "bins": [-2.419921875, -2.346710205078125, -2.27349853515625, -2.200286865234375, -2.1270751953125, -2.053863525390625, -1.98065185546875, -1.907440185546875, -1.834228515625, -1.761016845703125, -1.68780517578125, -1.614593505859375, -1.5413818359375, -1.468170166015625, -1.39495849609375, -1.321746826171875, -1.24853515625, -1.175323486328125, -1.10211181640625, -1.028900146484375, -0.9556884765625, -0.882476806640625, -0.80926513671875, -0.736053466796875, -0.662841796875, -0.589630126953125, -0.51641845703125, -0.443206787109375, -0.3699951171875, -0.296783447265625, -0.22357177734375, -0.150360107421875, -0.0771484375, -0.003936767578125, 0.06927490234375, 0.142486572265625, 0.2156982421875, 0.288909912109375, 0.36212158203125, 0.435333251953125, 0.508544921875, 0.581756591796875, 0.65496826171875, 0.728179931640625, 0.8013916015625, 0.874603271484375, 0.94781494140625, 1.021026611328125, 1.09423828125, 1.167449951171875, 1.24066162109375, 1.313873291015625, 1.3870849609375, 1.460296630859375, 1.53350830078125, 1.606719970703125, 1.679931640625, 1.753143310546875, 1.82635498046875, 1.899566650390625, 1.9727783203125, 2.045989990234375, 2.11920166015625, 2.192413330078125, 2.265625]}, "gradients/decoder.transformer.h.5.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 4.0, 4.0, 5.0, 4.0, 11.0, 11.0, 12.0, 12.0, 20.0, 25.0, 23.0, 21.0, 23.0, 22.0, 24.0, 24.0, 31.0, 48.0, 38.0, 50.0, 47.0, 1065.0, 40.0, 47.0, 37.0, 31.0, 48.0, 37.0, 39.0, 29.0, 26.0, 26.0, 31.0, 18.0, 18.0, 19.0, 12.0, 13.0, 15.0, 10.0, 5.0, 6.0, 1.0, 5.0, 2.0, 0.0, 0.0, 2.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-5.109375, -4.91204833984375, -4.7147216796875, -4.51739501953125, -4.320068359375, -4.12274169921875, -3.9254150390625, -3.72808837890625, -3.53076171875, -3.33343505859375, -3.1361083984375, -2.93878173828125, -2.741455078125, -2.54412841796875, -2.3468017578125, -2.14947509765625, -1.9521484375, -1.75482177734375, -1.5574951171875, -1.36016845703125, -1.162841796875, -0.96551513671875, -0.7681884765625, -0.57086181640625, -0.37353515625, -0.17620849609375, 0.0211181640625, 0.21844482421875, 0.415771484375, 0.61309814453125, 0.8104248046875, 1.00775146484375, 1.205078125, 1.40240478515625, 1.5997314453125, 1.79705810546875, 1.994384765625, 2.19171142578125, 2.3890380859375, 2.58636474609375, 2.78369140625, 2.98101806640625, 3.1783447265625, 3.37567138671875, 3.572998046875, 3.77032470703125, 3.9676513671875, 4.16497802734375, 4.3623046875, 4.55963134765625, 4.7569580078125, 4.95428466796875, 5.151611328125, 5.34893798828125, 5.5462646484375, 5.74359130859375, 5.94091796875, 6.13824462890625, 6.3355712890625, 6.53289794921875, 6.730224609375, 6.92755126953125, 7.1248779296875, 7.32220458984375, 7.51953125]}, "gradients/decoder.transformer.h.5.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 3.0, 1.0, 1.0, 1.0, 5.0, 4.0, 7.0, 7.0, 19.0, 25.0, 21.0, 52.0, 65.0, 97.0, 183.0, 319.0, 514.0, 991.0, 1891.0, 3632.0, 6810.0, 12963.0, 25600.0, 51835.0, 110597.0, 1409315.0, 277959.0, 98802.0, 46844.0, 23152.0, 12060.0, 6171.0, 3255.0, 1755.0, 927.0, 498.0, 282.0, 171.0, 104.0, 74.0, 42.0, 23.0, 20.0, 18.0, 10.0, 7.0, 8.0, 3.0, 0.0, 3.0, 0.0, 2.0], "bins": [-4.125, -4.0162353515625, -3.907470703125, -3.7987060546875, -3.68994140625, -3.5811767578125, -3.472412109375, -3.3636474609375, -3.2548828125, -3.1461181640625, -3.037353515625, -2.9285888671875, -2.81982421875, -2.7110595703125, -2.602294921875, -2.4935302734375, -2.384765625, -2.2760009765625, -2.167236328125, -2.0584716796875, -1.94970703125, -1.8409423828125, -1.732177734375, -1.6234130859375, -1.5146484375, -1.4058837890625, -1.297119140625, -1.1883544921875, -1.07958984375, -0.9708251953125, -0.862060546875, -0.7532958984375, -0.64453125, -0.5357666015625, -0.427001953125, -0.3182373046875, -0.20947265625, -0.1007080078125, 0.008056640625, 0.1168212890625, 0.2255859375, 0.3343505859375, 0.443115234375, 0.5518798828125, 0.66064453125, 0.7694091796875, 0.878173828125, 0.9869384765625, 1.095703125, 1.2044677734375, 1.313232421875, 1.4219970703125, 1.53076171875, 1.6395263671875, 1.748291015625, 1.8570556640625, 1.9658203125, 2.0745849609375, 2.183349609375, 2.2921142578125, 2.40087890625, 2.5096435546875, 2.618408203125, 2.7271728515625, 2.8359375]}, "gradients/decoder.transformer.h.5.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 1.0, 4.0, 1.0, 4.0, 7.0, 3.0, 3.0, 8.0, 7.0, 5.0, 4.0, 13.0, 16.0, 21.0, 27.0, 23.0, 43.0, 47.0, 80.0, 75.0, 74.0, 95.0, 91.0, 78.0, 51.0, 47.0, 47.0, 26.0, 20.0, 18.0, 14.0, 13.0, 9.0, 7.0, 6.0, 6.0, 6.0, 0.0, 5.0, 3.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0015811920166015625, -0.0015336424112319946, -0.0014860928058624268, -0.0014385432004928589, -0.001390993595123291, -0.0013434439897537231, -0.0012958943843841553, -0.0012483447790145874, -0.0012007951736450195, -0.0011532455682754517, -0.0011056959629058838, -0.001058146357536316, -0.001010596752166748, -0.0009630471467971802, -0.0009154975414276123, -0.0008679479360580444, -0.0008203983306884766, -0.0007728487253189087, -0.0007252991199493408, -0.000677749514579773, -0.0006301999092102051, -0.0005826503038406372, -0.0005351006984710693, -0.00048755109310150146, -0.0004400014877319336, -0.0003924518823623657, -0.00034490227699279785, -0.00029735267162323, -0.0002498030662536621, -0.00020225346088409424, -0.00015470385551452637, -0.0001071542501449585, -5.9604644775390625e-05, -1.2055039405822754e-05, 3.549456596374512e-05, 8.304417133331299e-05, 0.00013059377670288086, 0.00017814338207244873, 0.0002256929874420166, 0.00027324259281158447, 0.00032079219818115234, 0.0003683418035507202, 0.0004158914089202881, 0.00046344101428985596, 0.0005109906196594238, 0.0005585402250289917, 0.0006060898303985596, 0.0006536394357681274, 0.0007011890411376953, 0.0007487386465072632, 0.0007962882518768311, 0.0008438378572463989, 0.0008913874626159668, 0.0009389370679855347, 0.0009864866733551025, 0.0010340362787246704, 0.0010815858840942383, 0.0011291354894638062, 0.001176685094833374, 0.001224234700202942, 0.0012717843055725098, 0.0013193339109420776, 0.0013668835163116455, 0.0014144331216812134, 0.0014619827270507812]}, "gradients/decoder.transformer.h.5.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 5.0, 2.0, 1.0, 3.0, 0.0, 4.0, 3.0, 6.0, 7.0, 10.0, 9.0, 15.0, 13.0, 25.0, 26.0, 36.0, 50.0, 60.0, 91.0, 143.0, 252.0, 451.0, 1794.0, 991661.0, 52133.0, 800.0, 338.0, 203.0, 111.0, 80.0, 56.0, 32.0, 36.0, 25.0, 15.0, 14.0, 9.0, 11.0, 10.0, 3.0, 6.0, 5.0, 4.0, 5.0, 2.0, 0.0, 2.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-0.0322265625, -0.03124380111694336, -0.03026103973388672, -0.029278278350830078, -0.028295516967773438, -0.027312755584716797, -0.026329994201660156, -0.025347232818603516, -0.024364471435546875, -0.023381710052490234, -0.022398948669433594, -0.021416187286376953, -0.020433425903320312, -0.019450664520263672, -0.01846790313720703, -0.01748514175415039, -0.01650238037109375, -0.01551961898803711, -0.014536857604980469, -0.013554096221923828, -0.012571334838867188, -0.011588573455810547, -0.010605812072753906, -0.009623050689697266, -0.008640289306640625, -0.007657527923583984, -0.006674766540527344, -0.005692005157470703, -0.0047092437744140625, -0.003726482391357422, -0.0027437210083007812, -0.0017609596252441406, -0.0007781982421875, 0.00020456314086914062, 0.0011873245239257812, 0.002170085906982422, 0.0031528472900390625, 0.004135608673095703, 0.005118370056152344, 0.006101131439208984, 0.007083892822265625, 0.008066654205322266, 0.009049415588378906, 0.010032176971435547, 0.011014938354492188, 0.011997699737548828, 0.012980461120605469, 0.01396322250366211, 0.01494598388671875, 0.01592874526977539, 0.01691150665283203, 0.017894268035888672, 0.018877029418945312, 0.019859790802001953, 0.020842552185058594, 0.021825313568115234, 0.022808074951171875, 0.023790836334228516, 0.024773597717285156, 0.025756359100341797, 0.026739120483398438, 0.027721881866455078, 0.02870464324951172, 0.02968740463256836, 0.030670166015625]}, "gradients/decoder.transformer.h.5.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 20.0, 102.0, 285.0, 377.0, 175.0, 38.0, 12.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0034552295692265034, -0.003380466951057315, -0.00330570456571877, -0.0032309419475495815, -0.0031561795622110367, -0.003081416944041848, -0.0030066545587033033, -0.002931891940534115, -0.00285712955519557, -0.0027823669370263815, -0.0027076045516878366, -0.002632841933518648, -0.0025580795481801033, -0.002483316930010915, -0.00240855454467237, -0.0023337919265031815, -0.002259029308333993, -0.0021842666901648045, -0.0021095043048262596, -0.002034741686657071, -0.0019599793013185263, -0.0018852166831493378, -0.001810454181395471, -0.0017356916796416044, -0.0016609291778877378, -0.001586166676133871, -0.0015114041743800044, -0.0014366416726261377, -0.0013618790544569492, -0.0012871166691184044, -0.0012123540509492159, -0.0011375915491953492, -0.0010628288146108389, -0.0009880663128569722, -0.0009133038111031055, -0.000838541251141578, -0.0007637787493877113, -0.0006890162476338446, -0.000614253687672317, -0.0005394911859184504, -0.0004647286841645837, -0.000389966182410717, -0.0003152036515530199, -0.00024044113524723798, -0.00016567861894145608, -9.091611718758941e-05, -1.6153586329892278e-05, 5.860894452780485e-05, 0.00013337144628167152, 0.00020813396258745342, 0.0002828964788932353, 0.00035765900975093246, 0.00043242151150479913, 0.0005071840132586658, 0.0005819465732201934, 0.0006567090749740601, 0.0007314715767279267, 0.0008062340784817934, 0.0008809965802356601, 0.0009557591401971877, 0.0010305217001587152, 0.00110528408549726, 0.0011800467036664486, 0.0012548092054203153, 0.001329571707174182]}, "gradients/decoder.transformer.h.5.ln_cross_attn.bias": {"_type": "histogram", "values": [4.0, 0.0, 1.0, 0.0, 0.0, 4.0, 3.0, 3.0, 3.0, 3.0, 8.0, 13.0, 10.0, 17.0, 14.0, 20.0, 13.0, 21.0, 39.0, 31.0, 23.0, 34.0, 28.0, 37.0, 31.0, 39.0, 42.0, 41.0, 40.0, 53.0, 27.0, 31.0, 42.0, 45.0, 43.0, 22.0, 25.0, 27.0, 30.0, 19.0, 22.0, 20.0, 15.0, 17.0, 13.0, 4.0, 9.0, 6.0, 5.0, 6.0, 5.0, 6.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0006074309349060059, -0.0005867434665560722, -0.0005660559982061386, -0.000545368529856205, -0.0005246810615062714, -0.0005039935931563377, -0.0004833061248064041, -0.0004626186564564705, -0.00044193118810653687, -0.00042124371975660324, -0.0004005562514066696, -0.000379868783056736, -0.00035918131470680237, -0.00033849384635686874, -0.0003178063780069351, -0.0002971189096570015, -0.00027643144130706787, -0.00025574397295713425, -0.00023505650460720062, -0.000214369036257267, -0.00019368156790733337, -0.00017299409955739975, -0.00015230663120746613, -0.0001316191628575325, -0.00011093169450759888, -9.024422615766525e-05, -6.955675780773163e-05, -4.8869289457798004e-05, -2.818182110786438e-05, -7.494352757930756e-06, 1.3193115592002869e-05, 3.388058394193649e-05, 5.456805229187012e-05, 7.525552064180374e-05, 9.594298899173737e-05, 0.00011663045734167099, 0.00013731792569160461, 0.00015800539404153824, 0.00017869286239147186, 0.0001993803307414055, 0.0002200677990913391, 0.00024075526744127274, 0.00026144273579120636, 0.00028213020414114, 0.0003028176724910736, 0.00032350514084100723, 0.00034419260919094086, 0.0003648800775408745, 0.0003855675458908081, 0.00040625501424074173, 0.00042694248259067535, 0.000447629950940609, 0.0004683174192905426, 0.0004890048876404762, 0.0005096923559904099, 0.0005303798243403435, 0.0005510672926902771, 0.0005717547610402107, 0.0005924422293901443, 0.000613129697740078, 0.0006338171660900116, 0.0006545046344399452, 0.0006751921027898788, 0.0006958795711398125, 0.0007165670394897461]}, "gradients/decoder.transformer.h.5.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 1.0, 3.0, 3.0, 5.0, 4.0, 5.0, 7.0, 7.0, 9.0, 17.0, 11.0, 19.0, 18.0, 26.0, 25.0, 33.0, 16.0, 27.0, 33.0, 35.0, 40.0, 34.0, 48.0, 30.0, 41.0, 33.0, 41.0, 41.0, 41.0, 47.0, 36.0, 34.0, 32.0, 32.0, 20.0, 27.0, 22.0, 15.0, 20.0, 16.0, 11.0, 10.0, 9.0, 11.0, 3.0, 4.0, 3.0, 2.0, 3.0, 2.0, 0.0, 1.0, 1.0], "bins": [-10.2265625, -9.9412841796875, -9.656005859375, -9.3707275390625, -9.08544921875, -8.8001708984375, -8.514892578125, -8.2296142578125, -7.9443359375, -7.6590576171875, -7.373779296875, -7.0885009765625, -6.80322265625, -6.5179443359375, -6.232666015625, -5.9473876953125, -5.662109375, -5.3768310546875, -5.091552734375, -4.8062744140625, -4.52099609375, -4.2357177734375, -3.950439453125, -3.6651611328125, -3.3798828125, -3.0946044921875, -2.809326171875, -2.5240478515625, -2.23876953125, -1.9534912109375, -1.668212890625, -1.3829345703125, -1.09765625, -0.8123779296875, -0.527099609375, -0.2418212890625, 0.04345703125, 0.3287353515625, 0.614013671875, 0.8992919921875, 1.1845703125, 1.4698486328125, 1.755126953125, 2.0404052734375, 2.32568359375, 2.6109619140625, 2.896240234375, 3.1815185546875, 3.466796875, 3.7520751953125, 4.037353515625, 4.3226318359375, 4.60791015625, 4.8931884765625, 5.178466796875, 5.4637451171875, 5.7490234375, 6.0343017578125, 6.319580078125, 6.6048583984375, 6.89013671875, 7.1754150390625, 7.460693359375, 7.7459716796875, 8.03125]}, "gradients/decoder.transformer.h.5.attn.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 2.0, 1.0, 2.0, 2.0, 5.0, 2.0, 6.0, 6.0, 7.0, 7.0, 11.0, 19.0, 19.0, 31.0, 32.0, 35.0, 56.0, 72.0, 106.0, 131.0, 174.0, 290.0, 495.0, 1040.0, 2671.0, 7652.0, 23067.0, 76403.0, 328410.0, 455880.0, 105672.0, 29887.0, 9798.0, 3525.0, 1375.0, 612.0, 315.0, 185.0, 134.0, 97.0, 77.0, 54.0, 53.0, 43.0, 27.0, 22.0, 18.0, 16.0, 6.0, 7.0, 5.0, 3.0, 3.0, 1.0, 0.0, 1.0, 1.0], "bins": [-13.6328125, -13.25537109375, -12.8779296875, -12.50048828125, -12.123046875, -11.74560546875, -11.3681640625, -10.99072265625, -10.61328125, -10.23583984375, -9.8583984375, -9.48095703125, -9.103515625, -8.72607421875, -8.3486328125, -7.97119140625, -7.59375, -7.21630859375, -6.8388671875, -6.46142578125, -6.083984375, -5.70654296875, -5.3291015625, -4.95166015625, -4.57421875, -4.19677734375, -3.8193359375, -3.44189453125, -3.064453125, -2.68701171875, -2.3095703125, -1.93212890625, -1.5546875, -1.17724609375, -0.7998046875, -0.42236328125, -0.044921875, 0.33251953125, 0.7099609375, 1.08740234375, 1.46484375, 1.84228515625, 2.2197265625, 2.59716796875, 2.974609375, 3.35205078125, 3.7294921875, 4.10693359375, 4.484375, 4.86181640625, 5.2392578125, 5.61669921875, 5.994140625, 6.37158203125, 6.7490234375, 7.12646484375, 7.50390625, 7.88134765625, 8.2587890625, 8.63623046875, 9.013671875, 9.39111328125, 9.7685546875, 10.14599609375, 10.5234375]}, "gradients/decoder.transformer.h.5.attn.c_attn.bias": {"_type": "histogram", "values": [4.0, 1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 5.0, 5.0, 6.0, 9.0, 14.0, 8.0, 7.0, 13.0, 10.0, 11.0, 21.0, 13.0, 22.0, 18.0, 26.0, 32.0, 37.0, 45.0, 48.0, 60.0, 60.0, 118.0, 294.0, 1540.0, 128.0, 83.0, 46.0, 45.0, 52.0, 34.0, 33.0, 26.0, 35.0, 23.0, 26.0, 19.0, 10.0, 17.0, 9.0, 13.0, 12.0, 7.0, 5.0, 3.0, 2.0, 1.0, 1.0, 4.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 1.0], "bins": [-25.875, -25.072021484375, -24.26904296875, -23.466064453125, -22.6630859375, -21.860107421875, -21.05712890625, -20.254150390625, -19.451171875, -18.648193359375, -17.84521484375, -17.042236328125, -16.2392578125, -15.436279296875, -14.63330078125, -13.830322265625, -13.02734375, -12.224365234375, -11.42138671875, -10.618408203125, -9.8154296875, -9.012451171875, -8.20947265625, -7.406494140625, -6.603515625, -5.800537109375, -4.99755859375, -4.194580078125, -3.3916015625, -2.588623046875, -1.78564453125, -0.982666015625, -0.1796875, 0.623291015625, 1.42626953125, 2.229248046875, 3.0322265625, 3.835205078125, 4.63818359375, 5.441162109375, 6.244140625, 7.047119140625, 7.85009765625, 8.653076171875, 9.4560546875, 10.259033203125, 11.06201171875, 11.864990234375, 12.66796875, 13.470947265625, 14.27392578125, 15.076904296875, 15.8798828125, 16.682861328125, 17.48583984375, 18.288818359375, 19.091796875, 19.894775390625, 20.69775390625, 21.500732421875, 22.3037109375, 23.106689453125, 23.90966796875, 24.712646484375, 25.515625]}, "gradients/decoder.transformer.h.5.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 3.0, 4.0, 4.0, 3.0, 15.0, 7.0, 8.0, 15.0, 19.0, 22.0, 36.0, 39.0, 49.0, 56.0, 84.0, 106.0, 151.0, 216.0, 317.0, 552.0, 1416.0, 16373.0, 1054084.0, 2047775.0, 20917.0, 1649.0, 570.0, 332.0, 224.0, 173.0, 105.0, 81.0, 75.0, 41.0, 40.0, 33.0, 23.0, 31.0, 13.0, 20.0, 12.0, 6.0, 5.0, 5.0, 2.0, 2.0, 4.0, 3.0, 4.0], "bins": [-41.96875, -40.865478515625, -39.76220703125, -38.658935546875, -37.5556640625, -36.452392578125, -35.34912109375, -34.245849609375, -33.142578125, -32.039306640625, -30.93603515625, -29.832763671875, -28.7294921875, -27.626220703125, -26.52294921875, -25.419677734375, -24.31640625, -23.213134765625, -22.10986328125, -21.006591796875, -19.9033203125, -18.800048828125, -17.69677734375, -16.593505859375, -15.490234375, -14.386962890625, -13.28369140625, -12.180419921875, -11.0771484375, -9.973876953125, -8.87060546875, -7.767333984375, -6.6640625, -5.560791015625, -4.45751953125, -3.354248046875, -2.2509765625, -1.147705078125, -0.04443359375, 1.058837890625, 2.162109375, 3.265380859375, 4.36865234375, 5.471923828125, 6.5751953125, 7.678466796875, 8.78173828125, 9.885009765625, 10.98828125, 12.091552734375, 13.19482421875, 14.298095703125, 15.4013671875, 16.504638671875, 17.60791015625, 18.711181640625, 19.814453125, 20.917724609375, 22.02099609375, 23.124267578125, 24.2275390625, 25.330810546875, 26.43408203125, 27.537353515625, 28.640625]}, "gradients/decoder.transformer.h.5.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 12.0, 666.0, 336.0, 5.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-29.333749771118164, -21.752384185791016, -14.1710205078125, -6.589654922485352, 0.9917087554931641, 8.57307243347168, 16.15443992614746, 23.735803604125977, 31.317167282104492, 38.89853286743164, 46.479896545410156, 54.06126403808594, 61.64262390136719, 69.22399139404297, 76.80535888671875, 84.38671875, 91.96808624267578, 99.54945373535156, 107.13081359863281, 114.7121810913086, 122.29354858398438, 129.87490844726562, 137.45626831054688, 145.0376434326172, 152.61900329589844, 160.2003631591797, 167.78173828125, 175.36309814453125, 182.9444580078125, 190.52581787109375, 198.10719299316406, 205.6885528564453, 213.26992797851562, 220.85128784179688, 228.4326629638672, 236.01402282714844, 243.5953826904297, 251.1767578125, 258.75811767578125, 266.3394775390625, 273.92083740234375, 281.502197265625, 289.08355712890625, 296.6649169921875, 304.2463073730469, 311.8276672363281, 319.4090270996094, 326.9903869628906, 334.57177734375, 342.15313720703125, 349.7344970703125, 357.31585693359375, 364.8972473144531, 372.4786071777344, 380.0599670410156, 387.6413269042969, 395.2226867675781, 402.8040466308594, 410.3854064941406, 417.966796875, 425.54815673828125, 433.1295166015625, 440.71087646484375, 448.292236328125, 455.87359619140625]}, "gradients/decoder.transformer.h.5.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 3.0, 1.0, 10.0, 5.0, 4.0, 8.0, 6.0, 10.0, 12.0, 10.0, 18.0, 16.0, 14.0, 24.0, 28.0, 22.0, 25.0, 23.0, 38.0, 33.0, 33.0, 34.0, 33.0, 38.0, 36.0, 45.0, 51.0, 41.0, 37.0, 28.0, 41.0, 40.0, 28.0, 29.0, 25.0, 26.0, 16.0, 19.0, 11.0, 21.0, 10.0, 14.0, 10.0, 6.0, 8.0, 5.0, 5.0, 4.0, 3.0, 5.0, 2.0, 3.0], "bins": [-70.1112060546875, -68.2155532836914, -66.31989288330078, -64.42424011230469, -62.528587341308594, -60.632930755615234, -58.737274169921875, -56.84162139892578, -54.94596481323242, -53.05030822753906, -51.15465545654297, -49.25899887084961, -47.36334228515625, -45.467689514160156, -43.5720329284668, -41.67637634277344, -39.780723571777344, -37.885066986083984, -35.98941421508789, -34.09375762939453, -32.19810485839844, -30.302448272705078, -28.40679168701172, -26.511137008666992, -24.615482330322266, -22.71982765197754, -20.824172973632812, -18.928516387939453, -17.032861709594727, -15.13720703125, -13.241551399230957, -11.345895767211914, -9.450241088867188, -7.554585933685303, -5.658930778503418, -3.763275623321533, -1.8676204681396484, 0.028034210205078125, 1.923689842224121, 3.819345474243164, 5.715000152587891, 7.610655307769775, 9.50631046295166, 11.401966094970703, 13.29762077331543, 15.193275451660156, 17.088932037353516, 18.984586715698242, 20.88024139404297, 22.775896072387695, 24.671550750732422, 26.56720733642578, 28.462862014770508, 30.358516693115234, 32.254173278808594, 34.14982604980469, 36.04548263549805, 37.941139221191406, 39.8367919921875, 41.73244857788086, 43.62810516357422, 45.52375793457031, 47.41941452026367, 49.31507110595703, 51.210723876953125]}, "gradients/decoder.transformer.h.4.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 4.0, 0.0, 0.0, 0.0, 1.0, 3.0, 0.0, 1.0, 6.0, 3.0, 3.0, 4.0, 8.0, 7.0, 5.0, 11.0, 14.0, 18.0, 24.0, 19.0, 24.0, 25.0, 31.0, 21.0, 39.0, 37.0, 36.0, 44.0, 29.0, 60.0, 42.0, 43.0, 44.0, 40.0, 34.0, 37.0, 36.0, 41.0, 31.0, 27.0, 26.0, 19.0, 21.0, 21.0, 17.0, 14.0, 15.0, 7.0, 11.0, 5.0, 6.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0], "bins": [-11.1953125, -10.8909912109375, -10.586669921875, -10.2823486328125, -9.97802734375, -9.6737060546875, -9.369384765625, -9.0650634765625, -8.7607421875, -8.4564208984375, -8.152099609375, -7.8477783203125, -7.54345703125, -7.2391357421875, -6.934814453125, -6.6304931640625, -6.326171875, -6.0218505859375, -5.717529296875, -5.4132080078125, -5.10888671875, -4.8045654296875, -4.500244140625, -4.1959228515625, -3.8916015625, -3.5872802734375, -3.282958984375, -2.9786376953125, -2.67431640625, -2.3699951171875, -2.065673828125, -1.7613525390625, -1.45703125, -1.1527099609375, -0.848388671875, -0.5440673828125, -0.23974609375, 0.0645751953125, 0.368896484375, 0.6732177734375, 0.9775390625, 1.2818603515625, 1.586181640625, 1.8905029296875, 2.19482421875, 2.4991455078125, 2.803466796875, 3.1077880859375, 3.412109375, 3.7164306640625, 4.020751953125, 4.3250732421875, 4.62939453125, 4.9337158203125, 5.238037109375, 5.5423583984375, 5.8466796875, 6.1510009765625, 6.455322265625, 6.7596435546875, 7.06396484375, 7.3682861328125, 7.672607421875, 7.9769287109375, 8.28125]}, "gradients/decoder.transformer.h.4.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 2.0, 4.0, 6.0, 0.0, 0.0, 0.0, 5.0, 1.0, 3.0, 5.0, 5.0, 12.0, 7.0, 9.0, 14.0, 16.0, 28.0, 27.0, 35.0, 38.0, 60.0, 96.0, 173.0, 465.0, 1255.0, 6177.0, 224604.0, 3811362.0, 142767.0, 5147.0, 1067.0, 342.0, 202.0, 96.0, 45.0, 49.0, 28.0, 24.0, 25.0, 19.0, 16.0, 17.0, 8.0, 11.0, 5.0, 7.0, 4.0, 0.0, 2.0, 6.0, 2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-38.59375, -37.36669921875, -36.1396484375, -34.91259765625, -33.685546875, -32.45849609375, -31.2314453125, -30.00439453125, -28.77734375, -27.55029296875, -26.3232421875, -25.09619140625, -23.869140625, -22.64208984375, -21.4150390625, -20.18798828125, -18.9609375, -17.73388671875, -16.5068359375, -15.27978515625, -14.052734375, -12.82568359375, -11.5986328125, -10.37158203125, -9.14453125, -7.91748046875, -6.6904296875, -5.46337890625, -4.236328125, -3.00927734375, -1.7822265625, -0.55517578125, 0.671875, 1.89892578125, 3.1259765625, 4.35302734375, 5.580078125, 6.80712890625, 8.0341796875, 9.26123046875, 10.48828125, 11.71533203125, 12.9423828125, 14.16943359375, 15.396484375, 16.62353515625, 17.8505859375, 19.07763671875, 20.3046875, 21.53173828125, 22.7587890625, 23.98583984375, 25.212890625, 26.43994140625, 27.6669921875, 28.89404296875, 30.12109375, 31.34814453125, 32.5751953125, 33.80224609375, 35.029296875, 36.25634765625, 37.4833984375, 38.71044921875, 39.9375]}, "gradients/decoder.transformer.h.4.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 5.0, 7.0, 5.0, 10.0, 13.0, 21.0, 26.0, 55.0, 75.0, 111.0, 214.0, 430.0, 659.0, 744.0, 687.0, 386.0, 233.0, 147.0, 85.0, 57.0, 32.0, 21.0, 22.0, 10.0, 10.0, 2.0, 6.0, 5.0, 3.0, 1.0, 0.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-17.46875, -16.876708984375, -16.28466796875, -15.692626953125, -15.1005859375, -14.508544921875, -13.91650390625, -13.324462890625, -12.732421875, -12.140380859375, -11.54833984375, -10.956298828125, -10.3642578125, -9.772216796875, -9.18017578125, -8.588134765625, -7.99609375, -7.404052734375, -6.81201171875, -6.219970703125, -5.6279296875, -5.035888671875, -4.44384765625, -3.851806640625, -3.259765625, -2.667724609375, -2.07568359375, -1.483642578125, -0.8916015625, -0.299560546875, 0.29248046875, 0.884521484375, 1.4765625, 2.068603515625, 2.66064453125, 3.252685546875, 3.8447265625, 4.436767578125, 5.02880859375, 5.620849609375, 6.212890625, 6.804931640625, 7.39697265625, 7.989013671875, 8.5810546875, 9.173095703125, 9.76513671875, 10.357177734375, 10.94921875, 11.541259765625, 12.13330078125, 12.725341796875, 13.3173828125, 13.909423828125, 14.50146484375, 15.093505859375, 15.685546875, 16.277587890625, 16.86962890625, 17.461669921875, 18.0537109375, 18.645751953125, 19.23779296875, 19.829833984375, 20.421875]}, "gradients/decoder.transformer.h.4.mlp.c_fc.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 5.0, 6.0, 9.0, 6.0, 6.0, 4.0, 3.0, 15.0, 7.0, 7.0, 27.0, 32.0, 48.0, 48.0, 83.0, 138.0, 260.0, 615.0, 1459.0, 5736.0, 52516.0, 3052603.0, 1046531.0, 28006.0, 3905.0, 1149.0, 471.0, 223.0, 125.0, 77.0, 48.0, 27.0, 22.0, 15.0, 10.0, 2.0, 13.0, 12.0, 7.0, 8.0, 1.0, 2.0, 0.0, 2.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-41.9375, -40.7685546875, -39.599609375, -38.4306640625, -37.26171875, -36.0927734375, -34.923828125, -33.7548828125, -32.5859375, -31.4169921875, -30.248046875, -29.0791015625, -27.91015625, -26.7412109375, -25.572265625, -24.4033203125, -23.234375, -22.0654296875, -20.896484375, -19.7275390625, -18.55859375, -17.3896484375, -16.220703125, -15.0517578125, -13.8828125, -12.7138671875, -11.544921875, -10.3759765625, -9.20703125, -8.0380859375, -6.869140625, -5.7001953125, -4.53125, -3.3623046875, -2.193359375, -1.0244140625, 0.14453125, 1.3134765625, 2.482421875, 3.6513671875, 4.8203125, 5.9892578125, 7.158203125, 8.3271484375, 9.49609375, 10.6650390625, 11.833984375, 13.0029296875, 14.171875, 15.3408203125, 16.509765625, 17.6787109375, 18.84765625, 20.0166015625, 21.185546875, 22.3544921875, 23.5234375, 24.6923828125, 25.861328125, 27.0302734375, 28.19921875, 29.3681640625, 30.537109375, 31.7060546875, 32.875]}, "gradients/decoder.transformer.h.4.ln_2.weight": {"_type": "histogram", "values": [2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 5.0, 15.0, 47.0, 98.0, 262.0, 257.0, 190.0, 101.0, 32.0, 8.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-58.606117248535156, -55.20945739746094, -51.81279754638672, -48.4161376953125, -45.01947784423828, -41.62281799316406, -38.22616195678711, -34.82950210571289, -31.432842254638672, -28.036182403564453, -24.639522552490234, -21.24286460876465, -17.84620475769043, -14.449544906616211, -11.052886962890625, -7.656227111816406, -4.2595672607421875, -0.862907886505127, 2.5337514877319336, 5.930410385131836, 9.327070236206055, 12.723730087280273, 16.12038803100586, 19.517047882080078, 22.913707733154297, 26.310367584228516, 29.707027435302734, 33.10368347167969, 36.500343322753906, 39.897003173828125, 43.293663024902344, 46.69032287597656, 50.08697509765625, 53.48363494873047, 56.88029479980469, 60.276954650878906, 63.673614501953125, 67.07027435302734, 70.46693420410156, 73.86358642578125, 77.26025390625, 80.65691375732422, 84.05357360839844, 87.45023345947266, 90.84689331054688, 94.2435531616211, 97.64021301269531, 101.036865234375, 104.43352508544922, 107.83018493652344, 111.22684478759766, 114.62350463867188, 118.0201644897461, 121.41682434082031, 124.8134765625, 128.21014404296875, 131.60679626464844, 135.00344848632812, 138.40011596679688, 141.79676818847656, 145.1934356689453, 148.590087890625, 151.98675537109375, 155.38340759277344, 158.7800750732422]}, "gradients/decoder.transformer.h.4.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 3.0, 1.0, 2.0, 4.0, 3.0, 4.0, 8.0, 14.0, 9.0, 11.0, 18.0, 19.0, 21.0, 24.0, 29.0, 27.0, 28.0, 30.0, 37.0, 34.0, 36.0, 29.0, 41.0, 39.0, 42.0, 47.0, 42.0, 24.0, 42.0, 33.0, 38.0, 30.0, 35.0, 46.0, 19.0, 21.0, 20.0, 18.0, 20.0, 11.0, 14.0, 9.0, 6.0, 4.0, 7.0, 2.0, 4.0, 5.0, 5.0, 0.0, 3.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-38.5040168762207, -37.20600509643555, -35.907989501953125, -34.60997772216797, -33.31196594238281, -32.01395034790039, -30.715938568115234, -29.417924880981445, -28.119911193847656, -26.821897506713867, -25.523883819580078, -24.225872039794922, -22.927858352661133, -21.629844665527344, -20.331832885742188, -19.0338191986084, -17.73580551147461, -16.43779182434082, -15.139779090881348, -13.841766357421875, -12.543752670288086, -11.245738983154297, -9.947726249694824, -8.649713516235352, -7.3516998291015625, -6.053686618804932, -4.755673408508301, -3.45766019821167, -2.159646987915039, -0.8616337776184082, 0.43637943267822266, 1.7343921661376953, 3.0324020385742188, 4.33041524887085, 5.6284284591674805, 6.926441669464111, 8.224454879760742, 9.522468566894531, 10.820481300354004, 12.118494033813477, 13.416507720947266, 14.714521408081055, 16.012535095214844, 17.310546875, 18.60856056213379, 19.906574249267578, 21.204586029052734, 22.502599716186523, 23.800613403320312, 25.0986270904541, 26.39664077758789, 27.694652557373047, 28.992666244506836, 30.290679931640625, 31.58869171142578, 32.88670349121094, 34.18471908569336, 35.482730865478516, 36.78074645996094, 38.078758239746094, 39.37677001953125, 40.67478561401367, 41.97279739379883, 43.27081298828125, 44.568824768066406]}, "gradients/decoder.transformer.h.4.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 2.0, 0.0, 6.0, 6.0, 5.0, 5.0, 8.0, 16.0, 9.0, 17.0, 8.0, 14.0, 17.0, 25.0, 27.0, 23.0, 30.0, 37.0, 37.0, 31.0, 31.0, 45.0, 40.0, 42.0, 27.0, 50.0, 48.0, 45.0, 39.0, 38.0, 30.0, 40.0, 28.0, 31.0, 24.0, 23.0, 25.0, 13.0, 9.0, 12.0, 11.0, 13.0, 10.0, 2.0, 4.0, 3.0, 2.0, 3.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-9.8359375, -9.5316162109375, -9.227294921875, -8.9229736328125, -8.61865234375, -8.3143310546875, -8.010009765625, -7.7056884765625, -7.4013671875, -7.0970458984375, -6.792724609375, -6.4884033203125, -6.18408203125, -5.8797607421875, -5.575439453125, -5.2711181640625, -4.966796875, -4.6624755859375, -4.358154296875, -4.0538330078125, -3.74951171875, -3.4451904296875, -3.140869140625, -2.8365478515625, -2.5322265625, -2.2279052734375, -1.923583984375, -1.6192626953125, -1.31494140625, -1.0106201171875, -0.706298828125, -0.4019775390625, -0.09765625, 0.2066650390625, 0.510986328125, 0.8153076171875, 1.11962890625, 1.4239501953125, 1.728271484375, 2.0325927734375, 2.3369140625, 2.6412353515625, 2.945556640625, 3.2498779296875, 3.55419921875, 3.8585205078125, 4.162841796875, 4.4671630859375, 4.771484375, 5.0758056640625, 5.380126953125, 5.6844482421875, 5.98876953125, 6.2930908203125, 6.597412109375, 6.9017333984375, 7.2060546875, 7.5103759765625, 7.814697265625, 8.1190185546875, 8.42333984375, 8.7276611328125, 9.031982421875, 9.3363037109375, 9.640625]}, "gradients/decoder.transformer.h.4.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 4.0, 9.0, 12.0, 12.0, 19.0, 35.0, 39.0, 50.0, 99.0, 152.0, 178.0, 299.0, 477.0, 667.0, 1016.0, 1445.0, 2150.0, 3108.0, 4860.0, 7359.0, 10811.0, 16930.0, 26660.0, 44070.0, 77035.0, 150086.0, 330953.0, 161908.0, 81482.0, 46456.0, 28378.0, 17676.0, 11327.0, 7493.0, 5087.0, 3214.0, 2271.0, 1494.0, 1054.0, 708.0, 514.0, 309.0, 210.0, 148.0, 94.0, 76.0, 41.0, 32.0, 21.0, 14.0, 9.0, 12.0, 5.0, 2.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-2.326171875, -2.25244140625, -2.1787109375, -2.10498046875, -2.03125, -1.95751953125, -1.8837890625, -1.81005859375, -1.736328125, -1.66259765625, -1.5888671875, -1.51513671875, -1.44140625, -1.36767578125, -1.2939453125, -1.22021484375, -1.146484375, -1.07275390625, -0.9990234375, -0.92529296875, -0.8515625, -0.77783203125, -0.7041015625, -0.63037109375, -0.556640625, -0.48291015625, -0.4091796875, -0.33544921875, -0.26171875, -0.18798828125, -0.1142578125, -0.04052734375, 0.033203125, 0.10693359375, 0.1806640625, 0.25439453125, 0.328125, 0.40185546875, 0.4755859375, 0.54931640625, 0.623046875, 0.69677734375, 0.7705078125, 0.84423828125, 0.91796875, 0.99169921875, 1.0654296875, 1.13916015625, 1.212890625, 1.28662109375, 1.3603515625, 1.43408203125, 1.5078125, 1.58154296875, 1.6552734375, 1.72900390625, 1.802734375, 1.87646484375, 1.9501953125, 2.02392578125, 2.09765625, 2.17138671875, 2.2451171875, 2.31884765625, 2.392578125]}, "gradients/decoder.transformer.h.4.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 5.0, 2.0, 4.0, 6.0, 1.0, 4.0, 5.0, 6.0, 5.0, 11.0, 17.0, 16.0, 22.0, 31.0, 18.0, 30.0, 23.0, 24.0, 27.0, 42.0, 39.0, 47.0, 28.0, 32.0, 37.0, 1070.0, 37.0, 43.0, 39.0, 36.0, 27.0, 36.0, 33.0, 37.0, 25.0, 29.0, 27.0, 16.0, 14.0, 17.0, 19.0, 11.0, 11.0, 6.0, 6.0, 6.0, 5.0, 4.0, 3.0, 2.0, 2.0, 0.0, 1.0, 0.0, 1.0], "bins": [-6.609375, -6.41741943359375, -6.2254638671875, -6.03350830078125, -5.841552734375, -5.64959716796875, -5.4576416015625, -5.26568603515625, -5.07373046875, -4.88177490234375, -4.6898193359375, -4.49786376953125, -4.305908203125, -4.11395263671875, -3.9219970703125, -3.73004150390625, -3.5380859375, -3.34613037109375, -3.1541748046875, -2.96221923828125, -2.770263671875, -2.57830810546875, -2.3863525390625, -2.19439697265625, -2.00244140625, -1.81048583984375, -1.6185302734375, -1.42657470703125, -1.234619140625, -1.04266357421875, -0.8507080078125, -0.65875244140625, -0.466796875, -0.27484130859375, -0.0828857421875, 0.10906982421875, 0.301025390625, 0.49298095703125, 0.6849365234375, 0.87689208984375, 1.06884765625, 1.26080322265625, 1.4527587890625, 1.64471435546875, 1.836669921875, 2.02862548828125, 2.2205810546875, 2.41253662109375, 2.6044921875, 2.79644775390625, 2.9884033203125, 3.18035888671875, 3.372314453125, 3.56427001953125, 3.7562255859375, 3.94818115234375, 4.14013671875, 4.33209228515625, 4.5240478515625, 4.71600341796875, 4.907958984375, 5.09991455078125, 5.2918701171875, 5.48382568359375, 5.67578125]}, "gradients/decoder.transformer.h.4.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 2.0, 3.0, 3.0, 4.0, 6.0, 6.0, 10.0, 10.0, 21.0, 34.0, 44.0, 53.0, 113.0, 163.0, 292.0, 488.0, 827.0, 1548.0, 2739.0, 4918.0, 8893.0, 17053.0, 32994.0, 66359.0, 141909.0, 1452355.0, 193493.0, 85160.0, 41427.0, 21302.0, 11304.0, 5983.0, 3254.0, 1849.0, 1025.0, 612.0, 331.0, 228.0, 120.0, 65.0, 50.0, 36.0, 16.0, 10.0, 9.0, 4.0, 3.0, 6.0, 6.0, 2.0, 5.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.12109375, -3.015899658203125, -2.91070556640625, -2.805511474609375, -2.7003173828125, -2.595123291015625, -2.48992919921875, -2.384735107421875, -2.279541015625, -2.174346923828125, -2.06915283203125, -1.963958740234375, -1.8587646484375, -1.753570556640625, -1.64837646484375, -1.543182373046875, -1.43798828125, -1.332794189453125, -1.22760009765625, -1.122406005859375, -1.0172119140625, -0.912017822265625, -0.80682373046875, -0.701629638671875, -0.596435546875, -0.491241455078125, -0.38604736328125, -0.280853271484375, -0.1756591796875, -0.070465087890625, 0.03472900390625, 0.139923095703125, 0.2451171875, 0.350311279296875, 0.45550537109375, 0.560699462890625, 0.6658935546875, 0.771087646484375, 0.87628173828125, 0.981475830078125, 1.086669921875, 1.191864013671875, 1.29705810546875, 1.402252197265625, 1.5074462890625, 1.612640380859375, 1.71783447265625, 1.823028564453125, 1.92822265625, 2.033416748046875, 2.13861083984375, 2.243804931640625, 2.3489990234375, 2.454193115234375, 2.55938720703125, 2.664581298828125, 2.769775390625, 2.874969482421875, 2.98016357421875, 3.085357666015625, 3.1905517578125, 3.295745849609375, 3.40093994140625, 3.506134033203125, 3.611328125]}, "gradients/decoder.transformer.h.4.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 8.0, 2.0, 7.0, 4.0, 6.0, 8.0, 6.0, 10.0, 15.0, 17.0, 21.0, 25.0, 38.0, 37.0, 45.0, 63.0, 71.0, 75.0, 76.0, 88.0, 63.0, 58.0, 51.0, 47.0, 32.0, 30.0, 22.0, 20.0, 12.0, 14.0, 9.0, 7.0, 9.0, 3.0, 4.0, 1.0, 2.0, 0.0, 3.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0013780593872070312, -0.0013279765844345093, -0.0012778937816619873, -0.0012278109788894653, -0.0011777281761169434, -0.0011276453733444214, -0.0010775625705718994, -0.0010274797677993774, -0.0009773969650268555, -0.0009273141622543335, -0.0008772313594818115, -0.0008271485567092896, -0.0007770657539367676, -0.0007269829511642456, -0.0006769001483917236, -0.0006268173456192017, -0.0005767345428466797, -0.0005266517400741577, -0.00047656893730163574, -0.00042648613452911377, -0.0003764033317565918, -0.0003263205289840698, -0.00027623772621154785, -0.00022615492343902588, -0.0001760721206665039, -0.00012598931789398193, -7.590651512145996e-05, -2.5823712348937988e-05, 2.4259090423583984e-05, 7.434189319610596e-05, 0.00012442469596862793, 0.0001745074987411499, 0.00022459030151367188, 0.00027467310428619385, 0.0003247559070587158, 0.0003748387098312378, 0.00042492151260375977, 0.00047500431537628174, 0.0005250871181488037, 0.0005751699209213257, 0.0006252527236938477, 0.0006753355264663696, 0.0007254183292388916, 0.0007755011320114136, 0.0008255839347839355, 0.0008756667375564575, 0.0009257495403289795, 0.0009758323431015015, 0.0010259151458740234, 0.0010759979486465454, 0.0011260807514190674, 0.0011761635541915894, 0.0012262463569641113, 0.0012763291597366333, 0.0013264119625091553, 0.0013764947652816772, 0.0014265775680541992, 0.0014766603708267212, 0.0015267431735992432, 0.0015768259763717651, 0.0016269087791442871, 0.001676991581916809, 0.001727074384689331, 0.001777157187461853, 0.001827239990234375]}, "gradients/decoder.transformer.h.4.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 4.0, 3.0, 1.0, 2.0, 2.0, 6.0, 4.0, 10.0, 5.0, 10.0, 13.0, 20.0, 21.0, 38.0, 32.0, 53.0, 75.0, 96.0, 152.0, 231.0, 481.0, 2133.0, 992821.0, 50492.0, 877.0, 363.0, 177.0, 126.0, 76.0, 57.0, 40.0, 34.0, 18.0, 15.0, 17.0, 15.0, 7.0, 8.0, 5.0, 5.0, 4.0, 9.0, 2.0, 3.0, 2.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.0367431640625, -0.03571605682373047, -0.03468894958496094, -0.033661842346191406, -0.032634735107421875, -0.031607627868652344, -0.030580520629882812, -0.02955341339111328, -0.02852630615234375, -0.02749919891357422, -0.026472091674804688, -0.025444984436035156, -0.024417877197265625, -0.023390769958496094, -0.022363662719726562, -0.02133655548095703, -0.0203094482421875, -0.01928234100341797, -0.018255233764648438, -0.017228126525878906, -0.016201019287109375, -0.015173912048339844, -0.014146804809570312, -0.013119697570800781, -0.01209259033203125, -0.011065483093261719, -0.010038375854492188, -0.009011268615722656, -0.007984161376953125, -0.006957054138183594, -0.0059299468994140625, -0.004902839660644531, -0.003875732421875, -0.0028486251831054688, -0.0018215179443359375, -0.0007944107055664062, 0.000232696533203125, 0.0012598037719726562, 0.0022869110107421875, 0.0033140182495117188, 0.00434112548828125, 0.005368232727050781, 0.0063953399658203125, 0.007422447204589844, 0.008449554443359375, 0.009476661682128906, 0.010503768920898438, 0.011530876159667969, 0.0125579833984375, 0.013585090637207031, 0.014612197875976562, 0.015639305114746094, 0.016666412353515625, 0.017693519592285156, 0.018720626831054688, 0.01974773406982422, 0.02077484130859375, 0.02180194854736328, 0.022829055786132812, 0.023856163024902344, 0.024883270263671875, 0.025910377502441406, 0.026937484741210938, 0.02796459197998047, 0.02899169921875]}, "gradients/decoder.transformer.h.4.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 18.0, 142.0, 588.0, 229.0, 36.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0024743997491896152, -0.002341821091249585, -0.0022092426661401987, -0.0020766640082001686, -0.0019440855830907822, -0.001811506925150752, -0.0016789283836260438, -0.0015463498421013355, -0.0014137713005766273, -0.001281192759051919, -0.0011486142175272107, -0.0010160356760025024, -0.0008834570762701333, -0.000750878534745425, -0.0006182999350130558, -0.00048572139348834753, -0.00035314285196363926, -0.00022056429588701576, -8.798573981039226e-05, 4.459283081814647e-05, 0.00017717137234285474, 0.000309749913867563, 0.0004423285135999322, 0.0005749070551246405, 0.0007074855966493487, 0.000840064138174057, 0.0009726426796987653, 0.0011052212212234735, 0.0012377998791635036, 0.00137037830427289, 0.0015029569622129202, 0.0016355355037376285, 0.0017681140452623367, 0.001900692586787045, 0.0020332711283117533, 0.0021658497862517834, 0.00229842821136117, 0.0024310068693012, 0.00256358552724123, 0.0026961639523506165, 0.002828742377460003, 0.002961321035400033, 0.0030938994605094194, 0.0032264781184494495, 0.003359056543558836, 0.003491635201498866, 0.003624213859438896, 0.0037567922845482826, 0.0038893709424883127, 0.004021949600428343, 0.004154528025537729, 0.004287106450647116, 0.0044196853414177895, 0.004552263766527176, 0.004684842191636562, 0.004817420616745949, 0.0049499995075166225, 0.005082577932626009, 0.005215156823396683, 0.005347735248506069, 0.005480313673615456, 0.005612892098724842, 0.005745470989495516, 0.005878049414604902, 0.006010627839714289]}, "gradients/decoder.transformer.h.4.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 2.0, 2.0, 3.0, 1.0, 3.0, 2.0, 3.0, 5.0, 4.0, 1.0, 4.0, 5.0, 7.0, 9.0, 15.0, 19.0, 15.0, 19.0, 22.0, 23.0, 31.0, 28.0, 37.0, 42.0, 37.0, 39.0, 36.0, 37.0, 51.0, 36.0, 40.0, 34.0, 34.0, 36.0, 36.0, 31.0, 43.0, 34.0, 29.0, 26.0, 17.0, 19.0, 20.0, 16.0, 14.0, 10.0, 7.0, 8.0, 3.0, 9.0, 5.0, 2.0, 2.0, 3.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0007639527320861816, -0.0007404424250125885, -0.0007169321179389954, -0.0006934218108654022, -0.0006699115037918091, -0.0006464011967182159, -0.0006228908896446228, -0.0005993805825710297, -0.0005758702754974365, -0.0005523599684238434, -0.0005288496613502502, -0.0005053393542766571, -0.00048182904720306396, -0.0004583187401294708, -0.0004348084330558777, -0.00041129812598228455, -0.0003877878189086914, -0.00036427751183509827, -0.0003407672047615051, -0.000317256897687912, -0.00029374659061431885, -0.0002702362835407257, -0.00024672597646713257, -0.00022321566939353943, -0.0001997053623199463, -0.00017619505524635315, -0.00015268474817276, -0.00012917444109916687, -0.00010566413402557373, -8.215382695198059e-05, -5.864351987838745e-05, -3.513321280479431e-05, -1.1622905731201172e-05, 1.1887401342391968e-05, 3.539770841598511e-05, 5.890801548957825e-05, 8.241832256317139e-05, 0.00010592862963676453, 0.00012943893671035767, 0.0001529492437839508, 0.00017645955085754395, 0.00019996985793113708, 0.00022348016500473022, 0.00024699047207832336, 0.0002705007791519165, 0.00029401108622550964, 0.0003175213932991028, 0.0003410317003726959, 0.00036454200744628906, 0.0003880523145198822, 0.00041156262159347534, 0.0004350729286670685, 0.0004585832357406616, 0.00048209354281425476, 0.0005056038498878479, 0.000529114156961441, 0.0005526244640350342, 0.0005761347711086273, 0.0005996450781822205, 0.0006231553852558136, 0.0006466656923294067, 0.0006701759994029999, 0.000693686306476593, 0.0007171966135501862, 0.0007407069206237793]}, "gradients/decoder.transformer.h.4.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 2.0, 0.0, 6.0, 6.0, 5.0, 5.0, 8.0, 16.0, 9.0, 17.0, 8.0, 14.0, 17.0, 25.0, 27.0, 23.0, 30.0, 37.0, 37.0, 31.0, 31.0, 45.0, 40.0, 42.0, 27.0, 50.0, 48.0, 45.0, 39.0, 38.0, 30.0, 40.0, 28.0, 31.0, 24.0, 23.0, 25.0, 13.0, 9.0, 12.0, 11.0, 13.0, 10.0, 2.0, 4.0, 3.0, 2.0, 3.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-9.8359375, -9.5316162109375, -9.227294921875, -8.9229736328125, -8.61865234375, -8.3143310546875, -8.010009765625, -7.7056884765625, -7.4013671875, -7.0970458984375, -6.792724609375, -6.4884033203125, -6.18408203125, -5.8797607421875, -5.575439453125, -5.2711181640625, -4.966796875, -4.6624755859375, -4.358154296875, -4.0538330078125, -3.74951171875, -3.4451904296875, -3.140869140625, -2.8365478515625, -2.5322265625, -2.2279052734375, -1.923583984375, -1.6192626953125, -1.31494140625, -1.0106201171875, -0.706298828125, -0.4019775390625, -0.09765625, 0.2066650390625, 0.510986328125, 0.8153076171875, 1.11962890625, 1.4239501953125, 1.728271484375, 2.0325927734375, 2.3369140625, 2.6412353515625, 2.945556640625, 3.2498779296875, 3.55419921875, 3.8585205078125, 4.162841796875, 4.4671630859375, 4.771484375, 5.0758056640625, 5.380126953125, 5.6844482421875, 5.98876953125, 6.2930908203125, 6.597412109375, 6.9017333984375, 7.2060546875, 7.5103759765625, 7.814697265625, 8.1190185546875, 8.42333984375, 8.7276611328125, 9.031982421875, 9.3363037109375, 9.640625]}, "gradients/decoder.transformer.h.4.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 4.0, 1.0, 1.0, 3.0, 7.0, 8.0, 8.0, 11.0, 17.0, 28.0, 23.0, 55.0, 58.0, 76.0, 102.0, 140.0, 193.0, 232.0, 378.0, 521.0, 822.0, 1397.0, 2672.0, 5911.0, 16024.0, 54415.0, 204414.0, 457085.0, 216044.0, 57868.0, 16821.0, 6303.0, 2685.0, 1515.0, 827.0, 506.0, 354.0, 271.0, 210.0, 124.0, 119.0, 79.0, 68.0, 51.0, 35.0, 21.0, 14.0, 13.0, 10.0, 10.0, 8.0, 1.0, 1.0, 3.0, 3.0, 1.0, 0.0, 0.0, 3.0], "bins": [-10.203125, -9.889404296875, -9.57568359375, -9.261962890625, -8.9482421875, -8.634521484375, -8.32080078125, -8.007080078125, -7.693359375, -7.379638671875, -7.06591796875, -6.752197265625, -6.4384765625, -6.124755859375, -5.81103515625, -5.497314453125, -5.18359375, -4.869873046875, -4.55615234375, -4.242431640625, -3.9287109375, -3.614990234375, -3.30126953125, -2.987548828125, -2.673828125, -2.360107421875, -2.04638671875, -1.732666015625, -1.4189453125, -1.105224609375, -0.79150390625, -0.477783203125, -0.1640625, 0.149658203125, 0.46337890625, 0.777099609375, 1.0908203125, 1.404541015625, 1.71826171875, 2.031982421875, 2.345703125, 2.659423828125, 2.97314453125, 3.286865234375, 3.6005859375, 3.914306640625, 4.22802734375, 4.541748046875, 4.85546875, 5.169189453125, 5.48291015625, 5.796630859375, 6.1103515625, 6.424072265625, 6.73779296875, 7.051513671875, 7.365234375, 7.678955078125, 7.99267578125, 8.306396484375, 8.6201171875, 8.933837890625, 9.24755859375, 9.561279296875, 9.875]}, "gradients/decoder.transformer.h.4.attn.c_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 2.0, 0.0, 2.0, 2.0, 0.0, 2.0, 5.0, 3.0, 2.0, 5.0, 4.0, 5.0, 9.0, 3.0, 8.0, 10.0, 19.0, 19.0, 26.0, 29.0, 35.0, 48.0, 35.0, 56.0, 61.0, 60.0, 154.0, 1612.0, 331.0, 104.0, 70.0, 53.0, 52.0, 36.0, 41.0, 25.0, 17.0, 26.0, 14.0, 18.0, 17.0, 13.0, 8.0, 7.0, 3.0, 3.0, 2.0, 2.0, 5.0, 3.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-29.765625, -28.798583984375, -27.83154296875, -26.864501953125, -25.8974609375, -24.930419921875, -23.96337890625, -22.996337890625, -22.029296875, -21.062255859375, -20.09521484375, -19.128173828125, -18.1611328125, -17.194091796875, -16.22705078125, -15.260009765625, -14.29296875, -13.325927734375, -12.35888671875, -11.391845703125, -10.4248046875, -9.457763671875, -8.49072265625, -7.523681640625, -6.556640625, -5.589599609375, -4.62255859375, -3.655517578125, -2.6884765625, -1.721435546875, -0.75439453125, 0.212646484375, 1.1796875, 2.146728515625, 3.11376953125, 4.080810546875, 5.0478515625, 6.014892578125, 6.98193359375, 7.948974609375, 8.916015625, 9.883056640625, 10.85009765625, 11.817138671875, 12.7841796875, 13.751220703125, 14.71826171875, 15.685302734375, 16.65234375, 17.619384765625, 18.58642578125, 19.553466796875, 20.5205078125, 21.487548828125, 22.45458984375, 23.421630859375, 24.388671875, 25.355712890625, 26.32275390625, 27.289794921875, 28.2568359375, 29.223876953125, 30.19091796875, 31.157958984375, 32.125]}, "gradients/decoder.transformer.h.4.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 7.0, 2.0, 4.0, 3.0, 2.0, 5.0, 14.0, 15.0, 20.0, 20.0, 37.0, 48.0, 48.0, 56.0, 82.0, 110.0, 163.0, 230.0, 364.0, 653.0, 2025.0, 64995.0, 3036044.0, 37373.0, 1604.0, 600.0, 322.0, 220.0, 152.0, 126.0, 87.0, 70.0, 41.0, 43.0, 32.0, 19.0, 11.0, 18.0, 12.0, 3.0, 11.0, 3.0, 3.0, 9.0, 3.0, 7.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-42.15625, -40.85693359375, -39.5576171875, -38.25830078125, -36.958984375, -35.65966796875, -34.3603515625, -33.06103515625, -31.76171875, -30.46240234375, -29.1630859375, -27.86376953125, -26.564453125, -25.26513671875, -23.9658203125, -22.66650390625, -21.3671875, -20.06787109375, -18.7685546875, -17.46923828125, -16.169921875, -14.87060546875, -13.5712890625, -12.27197265625, -10.97265625, -9.67333984375, -8.3740234375, -7.07470703125, -5.775390625, -4.47607421875, -3.1767578125, -1.87744140625, -0.578125, 0.72119140625, 2.0205078125, 3.31982421875, 4.619140625, 5.91845703125, 7.2177734375, 8.51708984375, 9.81640625, 11.11572265625, 12.4150390625, 13.71435546875, 15.013671875, 16.31298828125, 17.6123046875, 18.91162109375, 20.2109375, 21.51025390625, 22.8095703125, 24.10888671875, 25.408203125, 26.70751953125, 28.0068359375, 29.30615234375, 30.60546875, 31.90478515625, 33.2041015625, 34.50341796875, 35.802734375, 37.10205078125, 38.4013671875, 39.70068359375, 41.0]}, "gradients/decoder.transformer.h.4.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 8.0, 519.0, 487.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-365.8355712890625, -359.1766662597656, -352.5177307128906, -345.85882568359375, -339.19989013671875, -332.5409851074219, -325.882080078125, -319.22314453125, -312.5642395019531, -305.90533447265625, -299.24639892578125, -292.5874938964844, -285.9285583496094, -279.2696533203125, -272.6107482910156, -265.9518127441406, -259.29290771484375, -252.6339874267578, -245.97506713867188, -239.316162109375, -232.65724182128906, -225.99832153320312, -219.3394012451172, -212.68048095703125, -206.0215606689453, -199.36264038085938, -192.70372009277344, -186.04481506347656, -179.38589477539062, -172.7269744873047, -166.06805419921875, -159.40914916992188, -152.750244140625, -146.09132385253906, -139.43240356445312, -132.77349853515625, -126.11457824707031, -119.45565795898438, -112.79673767089844, -106.13782501220703, -99.47889709472656, -92.81997680664062, -86.16106414794922, -79.50214385986328, -72.84323120117188, -66.18431091308594, -59.525394439697266, -52.866477966308594, -46.20756530761719, -39.548648834228516, -32.889732360839844, -26.23081398010254, -19.571897506713867, -12.912979125976562, -6.254062652587891, 0.40485382080078125, 7.063770294189453, 13.722686767578125, 20.381603240966797, 27.0405216217041, 33.699440002441406, 40.35835647583008, 47.01727294921875, 53.67618942260742, 60.335105895996094]}, "gradients/decoder.transformer.h.4.ln_1.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 4.0, 7.0, 5.0, 5.0, 3.0, 4.0, 4.0, 7.0, 7.0, 14.0, 14.0, 19.0, 14.0, 21.0, 14.0, 26.0, 23.0, 16.0, 24.0, 16.0, 28.0, 34.0, 31.0, 38.0, 36.0, 38.0, 34.0, 43.0, 32.0, 36.0, 43.0, 38.0, 39.0, 28.0, 33.0, 28.0, 27.0, 21.0, 18.0, 13.0, 22.0, 18.0, 14.0, 10.0, 5.0, 14.0, 8.0, 9.0, 9.0, 6.0, 7.0, 3.0, 7.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-49.79507827758789, -48.12015151977539, -46.44522476196289, -44.770301818847656, -43.095375061035156, -41.420448303222656, -39.745521545410156, -38.070594787597656, -36.395668029785156, -34.720741271972656, -33.045814514160156, -31.37088966369629, -29.695964813232422, -28.021038055419922, -26.346111297607422, -24.671184539794922, -22.996261596679688, -21.321334838867188, -19.64640998840332, -17.97148323059082, -16.296558380126953, -14.621631622314453, -12.946704864501953, -11.27177906036377, -9.596853256225586, -7.921927452087402, -6.2470011711120605, -4.572074890136719, -2.897149085998535, -1.2222232818603516, 0.45270347595214844, 2.127629280090332, 3.8025588989257812, 5.477484703063965, 7.152410984039307, 8.827337265014648, 10.502263069152832, 12.177188873291016, 13.852115631103516, 15.5270414352417, 17.201967239379883, 18.876893997192383, 20.55181884765625, 22.22674560546875, 23.90167236328125, 25.576597213745117, 27.251523971557617, 28.926448822021484, 30.601375579833984, 32.276302337646484, 33.951229095458984, 35.62615203857422, 37.30107879638672, 38.97600555419922, 40.65093231201172, 42.32585906982422, 44.00078582763672, 45.67571258544922, 47.35063934326172, 49.02556610107422, 50.70048904418945, 52.37541580200195, 54.05034255981445, 55.72526931762695, 57.40019226074219]}, "gradients/decoder.transformer.h.3.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 2.0, 1.0, 2.0, 5.0, 4.0, 4.0, 1.0, 11.0, 17.0, 9.0, 5.0, 17.0, 18.0, 18.0, 26.0, 19.0, 28.0, 28.0, 28.0, 33.0, 33.0, 41.0, 34.0, 41.0, 48.0, 39.0, 39.0, 43.0, 30.0, 41.0, 40.0, 43.0, 36.0, 39.0, 23.0, 24.0, 26.0, 23.0, 18.0, 14.0, 9.0, 12.0, 11.0, 7.0, 6.0, 5.0, 4.0, 3.0, 2.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0], "bins": [-10.328125, -10.021240234375, -9.71435546875, -9.407470703125, -9.1005859375, -8.793701171875, -8.48681640625, -8.179931640625, -7.873046875, -7.566162109375, -7.25927734375, -6.952392578125, -6.6455078125, -6.338623046875, -6.03173828125, -5.724853515625, -5.41796875, -5.111083984375, -4.80419921875, -4.497314453125, -4.1904296875, -3.883544921875, -3.57666015625, -3.269775390625, -2.962890625, -2.656005859375, -2.34912109375, -2.042236328125, -1.7353515625, -1.428466796875, -1.12158203125, -0.814697265625, -0.5078125, -0.200927734375, 0.10595703125, 0.412841796875, 0.7197265625, 1.026611328125, 1.33349609375, 1.640380859375, 1.947265625, 2.254150390625, 2.56103515625, 2.867919921875, 3.1748046875, 3.481689453125, 3.78857421875, 4.095458984375, 4.40234375, 4.709228515625, 5.01611328125, 5.322998046875, 5.6298828125, 5.936767578125, 6.24365234375, 6.550537109375, 6.857421875, 7.164306640625, 7.47119140625, 7.778076171875, 8.0849609375, 8.391845703125, 8.69873046875, 9.005615234375, 9.3125]}, "gradients/decoder.transformer.h.3.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 5.0, 3.0, 0.0, 2.0, 3.0, 1.0, 4.0, 4.0, 13.0, 9.0, 13.0, 11.0, 24.0, 17.0, 29.0, 55.0, 55.0, 72.0, 99.0, 132.0, 169.0, 276.0, 365.0, 485.0, 1323.0, 4187741.0, 1500.0, 461.0, 375.0, 283.0, 207.0, 141.0, 84.0, 91.0, 49.0, 40.0, 31.0, 25.0, 31.0, 17.0, 13.0, 4.0, 9.0, 7.0, 5.0, 2.0, 1.0, 7.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-324.25, -313.96484375, -303.6796875, -293.39453125, -283.109375, -272.82421875, -262.5390625, -252.25390625, -241.96875, -231.68359375, -221.3984375, -211.11328125, -200.828125, -190.54296875, -180.2578125, -169.97265625, -159.6875, -149.40234375, -139.1171875, -128.83203125, -118.546875, -108.26171875, -97.9765625, -87.69140625, -77.40625, -67.12109375, -56.8359375, -46.55078125, -36.265625, -25.98046875, -15.6953125, -5.41015625, 4.875, 15.16015625, 25.4453125, 35.73046875, 46.015625, 56.30078125, 66.5859375, 76.87109375, 87.15625, 97.44140625, 107.7265625, 118.01171875, 128.296875, 138.58203125, 148.8671875, 159.15234375, 169.4375, 179.72265625, 190.0078125, 200.29296875, 210.578125, 220.86328125, 231.1484375, 241.43359375, 251.71875, 262.00390625, 272.2890625, 282.57421875, 292.859375, 303.14453125, 313.4296875, 323.71484375, 334.0]}, "gradients/decoder.transformer.h.3.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 5.0, 1.0, 4.0, 5.0, 5.0, 9.0, 20.0, 29.0, 37.0, 51.0, 75.0, 114.0, 222.0, 378.0, 678.0, 902.0, 665.0, 316.0, 190.0, 128.0, 86.0, 43.0, 31.0, 24.0, 18.0, 19.0, 10.0, 2.0, 8.0, 3.0, 3.0, 1.0, 1.0, 2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-22.5625, -21.888671875, -21.21484375, -20.541015625, -19.8671875, -19.193359375, -18.51953125, -17.845703125, -17.171875, -16.498046875, -15.82421875, -15.150390625, -14.4765625, -13.802734375, -13.12890625, -12.455078125, -11.78125, -11.107421875, -10.43359375, -9.759765625, -9.0859375, -8.412109375, -7.73828125, -7.064453125, -6.390625, -5.716796875, -5.04296875, -4.369140625, -3.6953125, -3.021484375, -2.34765625, -1.673828125, -1.0, -0.326171875, 0.34765625, 1.021484375, 1.6953125, 2.369140625, 3.04296875, 3.716796875, 4.390625, 5.064453125, 5.73828125, 6.412109375, 7.0859375, 7.759765625, 8.43359375, 9.107421875, 9.78125, 10.455078125, 11.12890625, 11.802734375, 12.4765625, 13.150390625, 13.82421875, 14.498046875, 15.171875, 15.845703125, 16.51953125, 17.193359375, 17.8671875, 18.541015625, 19.21484375, 19.888671875, 20.5625]}, "gradients/decoder.transformer.h.3.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 1.0, 2.0, 0.0, 4.0, 6.0, 5.0, 7.0, 21.0, 17.0, 21.0, 51.0, 57.0, 82.0, 124.0, 205.0, 53537.0, 4139447.0, 323.0, 120.0, 82.0, 52.0, 64.0, 19.0, 13.0, 15.0, 10.0, 6.0, 4.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-158.25, -149.00390625, -139.7578125, -130.51171875, -121.265625, -112.01953125, -102.7734375, -93.52734375, -84.28125, -75.03515625, -65.7890625, -56.54296875, -47.296875, -38.05078125, -28.8046875, -19.55859375, -10.3125, -1.06640625, 8.1796875, 17.42578125, 26.671875, 35.91796875, 45.1640625, 54.41015625, 63.65625, 72.90234375, 82.1484375, 91.39453125, 100.640625, 109.88671875, 119.1328125, 128.37890625, 137.625, 146.87109375, 156.1171875, 165.36328125, 174.609375, 183.85546875, 193.1015625, 202.34765625, 211.59375, 220.83984375, 230.0859375, 239.33203125, 248.578125, 257.82421875, 267.0703125, 276.31640625, 285.5625, 294.80859375, 304.0546875, 313.30078125, 322.546875, 331.79296875, 341.0390625, 350.28515625, 359.53125, 368.77734375, 378.0234375, 387.26953125, 396.515625, 405.76171875, 415.0078125, 424.25390625, 433.5]}, "gradients/decoder.transformer.h.3.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 8.0, 177.0, 596.0, 218.0, 15.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-428.4667053222656, -420.61639404296875, -412.7660827636719, -404.9158020019531, -397.06549072265625, -389.2151794433594, -381.3648681640625, -373.51458740234375, -365.6642761230469, -357.81396484375, -349.9636535644531, -342.1133728027344, -334.2630615234375, -326.4127502441406, -318.56243896484375, -310.712158203125, -302.86181640625, -295.0115051269531, -287.16119384765625, -279.3109130859375, -271.4606018066406, -263.61029052734375, -255.75997924804688, -247.90968322753906, -240.05938720703125, -232.20907592773438, -224.35877990722656, -216.5084686279297, -208.65817260742188, -200.807861328125, -192.95755004882812, -185.1072540283203, -177.2569580078125, -169.40664672851562, -161.5563507080078, -153.70603942871094, -145.85574340820312, -138.00543212890625, -130.15512084960938, -122.30482482910156, -114.45452117919922, -106.60421752929688, -98.75391387939453, -90.90361022949219, -83.05329895019531, -75.2030029296875, -67.35269165039062, -59.50238800048828, -51.65208435058594, -43.801780700683594, -35.95147705078125, -28.10116958618164, -20.250865936279297, -12.400562286376953, -4.550254821777344, 3.300048828125, 11.150352478027344, 19.000656127929688, 26.850961685180664, 34.70126724243164, 42.551570892333984, 50.40187454223633, 58.25218200683594, 66.10248565673828, 73.95278930664062]}, "gradients/decoder.transformer.h.3.ln_2.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 1.0, 5.0, 4.0, 6.0, 8.0, 13.0, 9.0, 14.0, 9.0, 16.0, 23.0, 22.0, 27.0, 33.0, 33.0, 32.0, 34.0, 32.0, 39.0, 39.0, 55.0, 41.0, 36.0, 45.0, 38.0, 37.0, 41.0, 36.0, 26.0, 30.0, 39.0, 24.0, 24.0, 26.0, 22.0, 23.0, 14.0, 15.0, 11.0, 8.0, 4.0, 6.0, 3.0, 3.0, 7.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-55.30072021484375, -53.498348236083984, -51.695980072021484, -49.89360809326172, -48.09123992919922, -46.28886795043945, -44.48649597167969, -42.68412780761719, -40.88175964355469, -39.07938766479492, -37.27701950073242, -35.474647521972656, -33.672279357910156, -31.86990737915039, -30.067537307739258, -28.265167236328125, -26.46279525756836, -24.660425186157227, -22.858055114746094, -21.055683135986328, -19.253314971923828, -17.450942993164062, -15.64857292175293, -13.846202850341797, -12.043832778930664, -10.241462707519531, -8.439092636108398, -6.636721611022949, -4.834351539611816, -3.0319814682006836, -1.2296104431152344, 0.5727596282958984, 2.3751296997070312, 4.177499771118164, 5.979870319366455, 7.782240867614746, 9.584610939025879, 11.386981010437012, 13.189352035522461, 14.991722106933594, 16.794092178344727, 18.59646224975586, 20.398832321166992, 22.201202392578125, 24.00357437133789, 25.80594253540039, 27.608314514160156, 29.41068458557129, 31.213054656982422, 33.01542663574219, 34.81779479980469, 36.62016677856445, 38.42253494262695, 40.22490692138672, 42.02727508544922, 43.829647064208984, 45.63201904296875, 47.434391021728516, 49.236759185791016, 51.03913116455078, 52.84149932861328, 54.64387130737305, 56.44624328613281, 58.24861145019531, 60.05097961425781]}, "gradients/decoder.transformer.h.3.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 4.0, 3.0, 4.0, 5.0, 5.0, 7.0, 4.0, 10.0, 8.0, 18.0, 6.0, 6.0, 9.0, 20.0, 25.0, 18.0, 36.0, 27.0, 34.0, 23.0, 28.0, 40.0, 33.0, 43.0, 41.0, 36.0, 40.0, 44.0, 52.0, 38.0, 31.0, 23.0, 40.0, 31.0, 26.0, 30.0, 30.0, 17.0, 23.0, 12.0, 12.0, 15.0, 17.0, 9.0, 11.0, 7.0, 4.0, 5.0, 4.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0], "bins": [-9.453125, -9.1593017578125, -8.865478515625, -8.5716552734375, -8.27783203125, -7.9840087890625, -7.690185546875, -7.3963623046875, -7.1025390625, -6.8087158203125, -6.514892578125, -6.2210693359375, -5.92724609375, -5.6334228515625, -5.339599609375, -5.0457763671875, -4.751953125, -4.4581298828125, -4.164306640625, -3.8704833984375, -3.57666015625, -3.2828369140625, -2.989013671875, -2.6951904296875, -2.4013671875, -2.1075439453125, -1.813720703125, -1.5198974609375, -1.22607421875, -0.9322509765625, -0.638427734375, -0.3446044921875, -0.05078125, 0.2430419921875, 0.536865234375, 0.8306884765625, 1.12451171875, 1.4183349609375, 1.712158203125, 2.0059814453125, 2.2998046875, 2.5936279296875, 2.887451171875, 3.1812744140625, 3.47509765625, 3.7689208984375, 4.062744140625, 4.3565673828125, 4.650390625, 4.9442138671875, 5.238037109375, 5.5318603515625, 5.82568359375, 6.1195068359375, 6.413330078125, 6.7071533203125, 7.0009765625, 7.2947998046875, 7.588623046875, 7.8824462890625, 8.17626953125, 8.4700927734375, 8.763916015625, 9.0577392578125, 9.3515625]}, "gradients/decoder.transformer.h.3.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 2.0, 3.0, 7.0, 7.0, 13.0, 17.0, 30.0, 44.0, 63.0, 101.0, 134.0, 175.0, 239.0, 405.0, 553.0, 867.0, 1209.0, 1793.0, 2887.0, 4136.0, 6066.0, 9138.0, 13876.0, 21583.0, 33830.0, 53851.0, 91331.0, 186123.0, 304730.0, 125368.0, 69038.0, 42705.0, 26648.0, 17311.0, 11456.0, 7746.0, 4979.0, 3295.0, 2203.0, 1494.0, 1022.0, 681.0, 467.0, 330.0, 208.0, 138.0, 100.0, 52.0, 30.0, 27.0, 21.0, 14.0, 13.0, 3.0, 5.0, 1.0, 1.0, 3.0, 1.0, 1.0], "bins": [-2.4140625, -2.339202880859375, -2.26434326171875, -2.189483642578125, -2.1146240234375, -2.039764404296875, -1.96490478515625, -1.890045166015625, -1.815185546875, -1.740325927734375, -1.66546630859375, -1.590606689453125, -1.5157470703125, -1.440887451171875, -1.36602783203125, -1.291168212890625, -1.21630859375, -1.141448974609375, -1.06658935546875, -0.991729736328125, -0.9168701171875, -0.842010498046875, -0.76715087890625, -0.692291259765625, -0.617431640625, -0.542572021484375, -0.46771240234375, -0.392852783203125, -0.3179931640625, -0.243133544921875, -0.16827392578125, -0.093414306640625, -0.0185546875, 0.056304931640625, 0.13116455078125, 0.206024169921875, 0.2808837890625, 0.355743408203125, 0.43060302734375, 0.505462646484375, 0.580322265625, 0.655181884765625, 0.73004150390625, 0.804901123046875, 0.8797607421875, 0.954620361328125, 1.02947998046875, 1.104339599609375, 1.17919921875, 1.254058837890625, 1.32891845703125, 1.403778076171875, 1.4786376953125, 1.553497314453125, 1.62835693359375, 1.703216552734375, 1.778076171875, 1.852935791015625, 1.92779541015625, 2.002655029296875, 2.0775146484375, 2.152374267578125, 2.22723388671875, 2.302093505859375, 2.376953125]}, "gradients/decoder.transformer.h.3.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 1.0, 0.0, 6.0, 6.0, 6.0, 4.0, 9.0, 8.0, 4.0, 10.0, 17.0, 18.0, 21.0, 19.0, 26.0, 23.0, 24.0, 39.0, 50.0, 31.0, 48.0, 45.0, 45.0, 1075.0, 43.0, 36.0, 40.0, 57.0, 51.0, 33.0, 40.0, 38.0, 28.0, 22.0, 15.0, 22.0, 19.0, 7.0, 10.0, 6.0, 7.0, 11.0, 9.0, 3.0, 3.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-7.4375, -7.21832275390625, -6.9991455078125, -6.77996826171875, -6.560791015625, -6.34161376953125, -6.1224365234375, -5.90325927734375, -5.68408203125, -5.46490478515625, -5.2457275390625, -5.02655029296875, -4.807373046875, -4.58819580078125, -4.3690185546875, -4.14984130859375, -3.9306640625, -3.71148681640625, -3.4923095703125, -3.27313232421875, -3.053955078125, -2.83477783203125, -2.6156005859375, -2.39642333984375, -2.17724609375, -1.95806884765625, -1.7388916015625, -1.51971435546875, -1.300537109375, -1.08135986328125, -0.8621826171875, -0.64300537109375, -0.423828125, -0.20465087890625, 0.0145263671875, 0.23370361328125, 0.452880859375, 0.67205810546875, 0.8912353515625, 1.11041259765625, 1.32958984375, 1.54876708984375, 1.7679443359375, 1.98712158203125, 2.206298828125, 2.42547607421875, 2.6446533203125, 2.86383056640625, 3.0830078125, 3.30218505859375, 3.5213623046875, 3.74053955078125, 3.959716796875, 4.17889404296875, 4.3980712890625, 4.61724853515625, 4.83642578125, 5.05560302734375, 5.2747802734375, 5.49395751953125, 5.713134765625, 5.93231201171875, 6.1514892578125, 6.37066650390625, 6.58984375]}, "gradients/decoder.transformer.h.3.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 4.0, 6.0, 14.0, 21.0, 18.0, 24.0, 54.0, 58.0, 111.0, 198.0, 281.0, 508.0, 837.0, 1453.0, 2848.0, 4935.0, 9420.0, 18480.0, 38706.0, 87849.0, 240725.0, 1465572.0, 123203.0, 51904.0, 24023.0, 12046.0, 6061.0, 3412.0, 1792.0, 1066.0, 624.0, 310.0, 205.0, 130.0, 90.0, 53.0, 23.0, 23.0, 10.0, 11.0, 8.0, 10.0, 8.0, 2.0, 0.0, 2.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-3.63671875, -3.5162353515625, -3.395751953125, -3.2752685546875, -3.15478515625, -3.0343017578125, -2.913818359375, -2.7933349609375, -2.6728515625, -2.5523681640625, -2.431884765625, -2.3114013671875, -2.19091796875, -2.0704345703125, -1.949951171875, -1.8294677734375, -1.708984375, -1.5885009765625, -1.468017578125, -1.3475341796875, -1.22705078125, -1.1065673828125, -0.986083984375, -0.8656005859375, -0.7451171875, -0.6246337890625, -0.504150390625, -0.3836669921875, -0.26318359375, -0.1427001953125, -0.022216796875, 0.0982666015625, 0.21875, 0.3392333984375, 0.459716796875, 0.5802001953125, 0.70068359375, 0.8211669921875, 0.941650390625, 1.0621337890625, 1.1826171875, 1.3031005859375, 1.423583984375, 1.5440673828125, 1.66455078125, 1.7850341796875, 1.905517578125, 2.0260009765625, 2.146484375, 2.2669677734375, 2.387451171875, 2.5079345703125, 2.62841796875, 2.7489013671875, 2.869384765625, 2.9898681640625, 3.1103515625, 3.2308349609375, 3.351318359375, 3.4718017578125, 3.59228515625, 3.7127685546875, 3.833251953125, 3.9537353515625, 4.07421875]}, "gradients/decoder.transformer.h.3.crossattention.q_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 3.0, 1.0, 0.0, 3.0, 0.0, 4.0, 0.0, 0.0, 2.0, 6.0, 6.0, 8.0, 9.0, 13.0, 12.0, 16.0, 15.0, 21.0, 27.0, 28.0, 43.0, 42.0, 65.0, 72.0, 88.0, 86.0, 70.0, 69.0, 56.0, 42.0, 39.0, 32.0, 23.0, 13.0, 17.0, 15.0, 18.0, 17.0, 10.0, 5.0, 1.0, 5.0, 5.0, 0.0, 0.0, 3.0, 4.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0014801025390625, -0.0014332085847854614, -0.0013863146305084229, -0.0013394206762313843, -0.0012925267219543457, -0.0012456327676773071, -0.0011987388134002686, -0.00115184485912323, -0.0011049509048461914, -0.0010580569505691528, -0.0010111629962921143, -0.0009642690420150757, -0.0009173750877380371, -0.0008704811334609985, -0.00082358717918396, -0.0007766932249069214, -0.0007297992706298828, -0.0006829053163528442, -0.0006360113620758057, -0.0005891174077987671, -0.0005422234535217285, -0.0004953294992446899, -0.00044843554496765137, -0.0004015415906906128, -0.0003546476364135742, -0.00030775368213653564, -0.00026085972785949707, -0.0002139657735824585, -0.00016707181930541992, -0.00012017786502838135, -7.328391075134277e-05, -2.63899564743042e-05, 2.0503997802734375e-05, 6.739795207977295e-05, 0.00011429190635681152, 0.0001611858606338501, 0.00020807981491088867, 0.00025497376918792725, 0.0003018677234649658, 0.0003487616777420044, 0.00039565563201904297, 0.00044254958629608154, 0.0004894435405731201, 0.0005363374948501587, 0.0005832314491271973, 0.0006301254034042358, 0.0006770193576812744, 0.000723913311958313, 0.0007708072662353516, 0.0008177012205123901, 0.0008645951747894287, 0.0009114891290664673, 0.0009583830833435059, 0.0010052770376205444, 0.001052170991897583, 0.0010990649461746216, 0.0011459589004516602, 0.0011928528547286987, 0.0012397468090057373, 0.0012866407632827759, 0.0013335347175598145, 0.001380428671836853, 0.0014273226261138916, 0.0014742165803909302, 0.0015211105346679688]}, "gradients/decoder.transformer.h.3.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 0.0, 3.0, 6.0, 2.0, 9.0, 7.0, 5.0, 10.0, 11.0, 15.0, 19.0, 35.0, 30.0, 49.0, 58.0, 64.0, 104.0, 152.0, 220.0, 350.0, 564.0, 2374.0, 977931.0, 64017.0, 1057.0, 447.0, 298.0, 203.0, 121.0, 89.0, 71.0, 45.0, 40.0, 39.0, 31.0, 25.0, 17.0, 10.0, 6.0, 6.0, 2.0, 2.0, 6.0, 1.0, 6.0, 2.0, 1.0, 3.0, 0.0, 1.0, 0.0, 1.0, 2.0], "bins": [-0.029144287109375, -0.028281688690185547, -0.027419090270996094, -0.02655649185180664, -0.025693893432617188, -0.024831295013427734, -0.02396869659423828, -0.023106098175048828, -0.022243499755859375, -0.021380901336669922, -0.02051830291748047, -0.019655704498291016, -0.018793106079101562, -0.01793050765991211, -0.017067909240722656, -0.016205310821533203, -0.01534271240234375, -0.014480113983154297, -0.013617515563964844, -0.01275491714477539, -0.011892318725585938, -0.011029720306396484, -0.010167121887207031, -0.009304523468017578, -0.008441925048828125, -0.007579326629638672, -0.006716728210449219, -0.005854129791259766, -0.0049915313720703125, -0.004128932952880859, -0.0032663345336914062, -0.002403736114501953, -0.0015411376953125, -0.0006785392761230469, 0.00018405914306640625, 0.0010466575622558594, 0.0019092559814453125, 0.0027718544006347656, 0.0036344528198242188, 0.004497051239013672, 0.005359649658203125, 0.006222248077392578, 0.007084846496582031, 0.007947444915771484, 0.008810043334960938, 0.00967264175415039, 0.010535240173339844, 0.011397838592529297, 0.01226043701171875, 0.013123035430908203, 0.013985633850097656, 0.01484823226928711, 0.015710830688476562, 0.016573429107666016, 0.01743602752685547, 0.018298625946044922, 0.019161224365234375, 0.020023822784423828, 0.02088642120361328, 0.021749019622802734, 0.022611618041992188, 0.02347421646118164, 0.024336814880371094, 0.025199413299560547, 0.02606201171875]}, "gradients/decoder.transformer.h.3.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0, 7.0, 6.0, 19.0, 52.0, 105.0, 190.0, 229.0, 194.0, 122.0, 56.0, 21.0, 8.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0007859938777983189, -0.0007435671868734062, -0.0007011404377408326, -0.0006587137468159199, -0.0006162870558910072, -0.0005738603649660945, -0.0005314336158335209, -0.0004890069249086082, -0.00044658020487986505, -0.0004041534848511219, -0.0003617267939262092, -0.00031930007389746606, -0.0002768733538687229, -0.00023444666294381022, -0.00019201994291506708, -0.00014959325199015439, -0.00010716653196141124, -6.473982648458332e-05, -2.2313113731797785e-05, 2.011359902098775e-05, 6.254030449781567e-05, 0.00010496700997464359, 0.00014739373000338674, 0.00018982042092829943, 0.00023224714095704257, 0.0002746738609857857, 0.0003171005519106984, 0.00035952727193944156, 0.0004019539919681847, 0.0004443806828930974, 0.00048680740292184055, 0.0005292340647429228, 0.0005716608138754964, 0.0006140875048004091, 0.0006565142539329827, 0.0006989409448578954, 0.0007413676357828081, 0.0007837943267077208, 0.0008262210758402944, 0.000868647766765207, 0.0009110744576901197, 0.0009535011486150324, 0.0009959278395399451, 0.0010383545886725187, 0.0010807813378050923, 0.0011232079705223441, 0.0011656347196549177, 0.0012080613523721695, 0.001250488217920065, 0.0012929149670526385, 0.0013353415997698903, 0.001377768348902464, 0.0014201950980350375, 0.0014626217307522893, 0.001505048479884863, 0.0015474751126021147, 0.0015899018617346883, 0.0016323286108672619, 0.0016747552435845137, 0.0017171819927170873, 0.0017596087418496609, 0.0018020353745669127, 0.0018444621236994863, 0.0018868888728320599, 0.0019293155055493116]}, "gradients/decoder.transformer.h.3.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 2.0, 1.0, 5.0, 3.0, 3.0, 3.0, 4.0, 12.0, 8.0, 5.0, 10.0, 15.0, 20.0, 12.0, 15.0, 13.0, 30.0, 30.0, 26.0, 35.0, 49.0, 41.0, 40.0, 48.0, 40.0, 32.0, 47.0, 44.0, 48.0, 44.0, 50.0, 32.0, 34.0, 28.0, 28.0, 29.0, 23.0, 18.0, 19.0, 12.0, 12.0, 10.0, 5.0, 6.0, 8.0, 7.0, 2.0, 3.0, 2.0, 4.0, 3.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0006769299507141113, -0.0006539030000567436, -0.0006308760493993759, -0.0006078490987420082, -0.0005848221480846405, -0.0005617951974272728, -0.0005387682467699051, -0.0005157412961125374, -0.0004927143454551697, -0.00046968739479780197, -0.00044666044414043427, -0.00042363349348306656, -0.00040060654282569885, -0.00037757959216833115, -0.00035455264151096344, -0.00033152569085359573, -0.00030849874019622803, -0.0002854717895388603, -0.0002624448388814926, -0.0002394178882241249, -0.0002163909375667572, -0.0001933639869093895, -0.0001703370362520218, -0.00014731008559465408, -0.00012428313493728638, -0.00010125618427991867, -7.822923362255096e-05, -5.520228296518326e-05, -3.217533230781555e-05, -9.148381650447845e-06, 1.387856900691986e-05, 3.690551966428757e-05, 5.9932470321655273e-05, 8.295942097902298e-05, 0.00010598637163639069, 0.0001290133222937584, 0.0001520402729511261, 0.0001750672236084938, 0.0001980941742658615, 0.00022112112492322922, 0.0002441480755805969, 0.00026717502623796463, 0.00029020197689533234, 0.00031322892755270004, 0.00033625587821006775, 0.00035928282886743546, 0.00038230977952480316, 0.00040533673018217087, 0.0004283636808395386, 0.0004513906314969063, 0.000474417582154274, 0.0004974445328116417, 0.0005204714834690094, 0.0005434984341263771, 0.0005665253847837448, 0.0005895523354411125, 0.0006125792860984802, 0.0006356062367558479, 0.0006586331874132156, 0.0006816601380705833, 0.000704687088727951, 0.0007277140393853188, 0.0007507409900426865, 0.0007737679407000542, 0.0007967948913574219]}, "gradients/decoder.transformer.h.3.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 4.0, 3.0, 4.0, 5.0, 5.0, 7.0, 4.0, 10.0, 8.0, 18.0, 6.0, 6.0, 9.0, 20.0, 25.0, 18.0, 36.0, 27.0, 34.0, 23.0, 28.0, 40.0, 33.0, 43.0, 41.0, 36.0, 40.0, 44.0, 52.0, 38.0, 31.0, 23.0, 40.0, 31.0, 26.0, 30.0, 30.0, 17.0, 23.0, 12.0, 12.0, 15.0, 17.0, 9.0, 11.0, 7.0, 4.0, 5.0, 4.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0], "bins": [-9.453125, -9.1593017578125, -8.865478515625, -8.5716552734375, -8.27783203125, -7.9840087890625, -7.690185546875, -7.3963623046875, -7.1025390625, -6.8087158203125, -6.514892578125, -6.2210693359375, -5.92724609375, -5.6334228515625, -5.339599609375, -5.0457763671875, -4.751953125, -4.4581298828125, -4.164306640625, -3.8704833984375, -3.57666015625, -3.2828369140625, -2.989013671875, -2.6951904296875, -2.4013671875, -2.1075439453125, -1.813720703125, -1.5198974609375, -1.22607421875, -0.9322509765625, -0.638427734375, -0.3446044921875, -0.05078125, 0.2430419921875, 0.536865234375, 0.8306884765625, 1.12451171875, 1.4183349609375, 1.712158203125, 2.0059814453125, 2.2998046875, 2.5936279296875, 2.887451171875, 3.1812744140625, 3.47509765625, 3.7689208984375, 4.062744140625, 4.3565673828125, 4.650390625, 4.9442138671875, 5.238037109375, 5.5318603515625, 5.82568359375, 6.1195068359375, 6.413330078125, 6.7071533203125, 7.0009765625, 7.2947998046875, 7.588623046875, 7.8824462890625, 8.17626953125, 8.4700927734375, 8.763916015625, 9.0577392578125, 9.3515625]}, "gradients/decoder.transformer.h.3.attn.c_proj.weight": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 5.0, 3.0, 6.0, 6.0, 10.0, 10.0, 19.0, 20.0, 26.0, 40.0, 52.0, 75.0, 86.0, 96.0, 152.0, 174.0, 202.0, 259.0, 348.0, 477.0, 581.0, 783.0, 1044.0, 1381.0, 2123.0, 3897.0, 10864.0, 47063.0, 253745.0, 536405.0, 144157.0, 27160.0, 7165.0, 3031.0, 1816.0, 1227.0, 877.0, 738.0, 539.0, 426.0, 300.0, 260.0, 199.0, 177.0, 135.0, 101.0, 77.0, 67.0, 44.0, 38.0, 25.0, 14.0, 8.0, 10.0, 9.0, 8.0, 5.0, 4.0, 1.0, 1.0, 2.0], "bins": [-13.1484375, -12.74169921875, -12.3349609375, -11.92822265625, -11.521484375, -11.11474609375, -10.7080078125, -10.30126953125, -9.89453125, -9.48779296875, -9.0810546875, -8.67431640625, -8.267578125, -7.86083984375, -7.4541015625, -7.04736328125, -6.640625, -6.23388671875, -5.8271484375, -5.42041015625, -5.013671875, -4.60693359375, -4.2001953125, -3.79345703125, -3.38671875, -2.97998046875, -2.5732421875, -2.16650390625, -1.759765625, -1.35302734375, -0.9462890625, -0.53955078125, -0.1328125, 0.27392578125, 0.6806640625, 1.08740234375, 1.494140625, 1.90087890625, 2.3076171875, 2.71435546875, 3.12109375, 3.52783203125, 3.9345703125, 4.34130859375, 4.748046875, 5.15478515625, 5.5615234375, 5.96826171875, 6.375, 6.78173828125, 7.1884765625, 7.59521484375, 8.001953125, 8.40869140625, 8.8154296875, 9.22216796875, 9.62890625, 10.03564453125, 10.4423828125, 10.84912109375, 11.255859375, 11.66259765625, 12.0693359375, 12.47607421875, 12.8828125]}, "gradients/decoder.transformer.h.3.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 4.0, 4.0, 11.0, 13.0, 32.0, 22.0, 45.0, 33.0, 67.0, 79.0, 143.0, 337.0, 1770.0, 140.0, 89.0, 73.0, 49.0, 52.0, 36.0, 19.0, 17.0, 5.0, 9.0, 4.0, 4.0, 3.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-46.84375, -45.23876953125, -43.6337890625, -42.02880859375, -40.423828125, -38.81884765625, -37.2138671875, -35.60888671875, -34.00390625, -32.39892578125, -30.7939453125, -29.18896484375, -27.583984375, -25.97900390625, -24.3740234375, -22.76904296875, -21.1640625, -19.55908203125, -17.9541015625, -16.34912109375, -14.744140625, -13.13916015625, -11.5341796875, -9.92919921875, -8.32421875, -6.71923828125, -5.1142578125, -3.50927734375, -1.904296875, -0.29931640625, 1.3056640625, 2.91064453125, 4.515625, 6.12060546875, 7.7255859375, 9.33056640625, 10.935546875, 12.54052734375, 14.1455078125, 15.75048828125, 17.35546875, 18.96044921875, 20.5654296875, 22.17041015625, 23.775390625, 25.38037109375, 26.9853515625, 28.59033203125, 30.1953125, 31.80029296875, 33.4052734375, 35.01025390625, 36.615234375, 38.22021484375, 39.8251953125, 41.43017578125, 43.03515625, 44.64013671875, 46.2451171875, 47.85009765625, 49.455078125, 51.06005859375, 52.6650390625, 54.27001953125, 55.875]}, "gradients/decoder.transformer.h.3.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 4.0, 2.0, 4.0, 10.0, 4.0, 3.0, 10.0, 19.0, 25.0, 33.0, 56.0, 107.0, 143.0, 266.0, 664.0, 3033.0, 3136535.0, 3411.0, 704.0, 260.0, 152.0, 96.0, 53.0, 50.0, 25.0, 21.0, 13.0, 3.0, 7.0, 3.0, 1.0, 4.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-141.25, -137.158203125, -133.06640625, -128.974609375, -124.8828125, -120.791015625, -116.69921875, -112.607421875, -108.515625, -104.423828125, -100.33203125, -96.240234375, -92.1484375, -88.056640625, -83.96484375, -79.873046875, -75.78125, -71.689453125, -67.59765625, -63.505859375, -59.4140625, -55.322265625, -51.23046875, -47.138671875, -43.046875, -38.955078125, -34.86328125, -30.771484375, -26.6796875, -22.587890625, -18.49609375, -14.404296875, -10.3125, -6.220703125, -2.12890625, 1.962890625, 6.0546875, 10.146484375, 14.23828125, 18.330078125, 22.421875, 26.513671875, 30.60546875, 34.697265625, 38.7890625, 42.880859375, 46.97265625, 51.064453125, 55.15625, 59.248046875, 63.33984375, 67.431640625, 71.5234375, 75.615234375, 79.70703125, 83.798828125, 87.890625, 91.982421875, 96.07421875, 100.166015625, 104.2578125, 108.349609375, 112.44140625, 116.533203125, 120.625]}, "gradients/decoder.transformer.h.3.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 30.0, 367.0, 518.0, 90.0, 11.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-157.63394165039062, -153.01654052734375, -148.39913940429688, -143.78173828125, -139.16433715820312, -134.54693603515625, -129.92953491210938, -125.31212615966797, -120.6947250366211, -116.07732391357422, -111.45992279052734, -106.84252166748047, -102.22511291503906, -97.60771179199219, -92.99031066894531, -88.37290954589844, -83.75550842285156, -79.13810729980469, -74.52070617675781, -69.90330505371094, -65.28590393066406, -60.66849899291992, -56.05109405517578, -51.433692932128906, -46.81629180908203, -42.198890686035156, -37.58148956298828, -32.96408462524414, -28.346683502197266, -23.72928237915039, -19.111879348754883, -14.494476318359375, -9.8770751953125, -5.259673118591309, -0.6422710418701172, 3.975131034851074, 8.592533111572266, 13.20993423461914, 17.82733726501465, 22.444740295410156, 27.06214141845703, 31.679542541503906, 36.29694366455078, 40.91434860229492, 45.5317497253418, 50.14915084838867, 54.76655578613281, 59.38395690917969, 64.00135803222656, 68.61875915527344, 73.23616027832031, 77.85356140136719, 82.47096252441406, 87.08836364746094, 91.70577239990234, 96.32317352294922, 100.9405746459961, 105.55797576904297, 110.17537689208984, 114.79277801513672, 119.41018676757812, 124.027587890625, 128.64498901367188, 133.26239013671875, 137.87979125976562]}, "gradients/decoder.transformer.h.3.ln_1.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 3.0, 4.0, 4.0, 6.0, 5.0, 6.0, 3.0, 6.0, 6.0, 11.0, 13.0, 16.0, 25.0, 34.0, 37.0, 33.0, 29.0, 40.0, 38.0, 43.0, 38.0, 61.0, 47.0, 46.0, 39.0, 50.0, 32.0, 34.0, 40.0, 30.0, 33.0, 37.0, 22.0, 28.0, 23.0, 18.0, 13.0, 13.0, 8.0, 16.0, 5.0, 6.0, 5.0, 3.0, 2.0, 0.0, 2.0, 2.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-75.12521362304688, -72.54951477050781, -69.97382354736328, -67.39812469482422, -64.82242584228516, -62.246734619140625, -59.67103576660156, -57.095340728759766, -54.51964569091797, -51.94395065307617, -49.36825180053711, -46.79255676269531, -44.216861724853516, -41.64116668701172, -39.065467834472656, -36.48977279663086, -33.9140739440918, -31.338376998901367, -28.76268196105957, -26.18698501586914, -23.611289978027344, -21.035593032836914, -18.459896087646484, -15.884201049804688, -13.308504104614258, -10.732808113098145, -8.157112121582031, -5.581415176391602, -3.0057191848754883, -0.430023193359375, 2.1456737518310547, 4.721368789672852, 7.297065734863281, 9.872761726379395, 12.448457717895508, 15.024154663085938, 17.599849700927734, 20.175546646118164, 22.751243591308594, 25.32693862915039, 27.90263557434082, 30.47833251953125, 33.05402755737305, 35.629722595214844, 38.205421447753906, 40.7811164855957, 43.3568115234375, 45.93251037597656, 48.50820541381836, 51.083900451660156, 53.65959930419922, 56.235294342041016, 58.81098937988281, 61.386688232421875, 63.96238327026367, 66.53807830810547, 69.11377716064453, 71.6894760131836, 74.26516723632812, 76.84086608886719, 79.41656494140625, 81.99225616455078, 84.56795501708984, 87.14364624023438, 89.71934509277344]}, "gradients/decoder.transformer.h.2.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 3.0, 0.0, 1.0, 1.0, 1.0, 2.0, 6.0, 5.0, 4.0, 5.0, 10.0, 6.0, 8.0, 10.0, 13.0, 12.0, 7.0, 13.0, 19.0, 22.0, 18.0, 21.0, 22.0, 26.0, 30.0, 33.0, 33.0, 30.0, 42.0, 37.0, 35.0, 38.0, 47.0, 38.0, 40.0, 29.0, 44.0, 31.0, 32.0, 19.0, 27.0, 26.0, 26.0, 19.0, 23.0, 14.0, 17.0, 11.0, 14.0, 8.0, 9.0, 7.0, 4.0, 5.0, 4.0, 3.0, 1.0, 2.0, 2.0, 1.0, 3.0, 2.0, 2.0], "bins": [-8.9375, -8.6636962890625, -8.389892578125, -8.1160888671875, -7.84228515625, -7.5684814453125, -7.294677734375, -7.0208740234375, -6.7470703125, -6.4732666015625, -6.199462890625, -5.9256591796875, -5.65185546875, -5.3780517578125, -5.104248046875, -4.8304443359375, -4.556640625, -4.2828369140625, -4.009033203125, -3.7352294921875, -3.46142578125, -3.1876220703125, -2.913818359375, -2.6400146484375, -2.3662109375, -2.0924072265625, -1.818603515625, -1.5447998046875, -1.27099609375, -0.9971923828125, -0.723388671875, -0.4495849609375, -0.17578125, 0.0980224609375, 0.371826171875, 0.6456298828125, 0.91943359375, 1.1932373046875, 1.467041015625, 1.7408447265625, 2.0146484375, 2.2884521484375, 2.562255859375, 2.8360595703125, 3.10986328125, 3.3836669921875, 3.657470703125, 3.9312744140625, 4.205078125, 4.4788818359375, 4.752685546875, 5.0264892578125, 5.30029296875, 5.5740966796875, 5.847900390625, 6.1217041015625, 6.3955078125, 6.6693115234375, 6.943115234375, 7.2169189453125, 7.49072265625, 7.7645263671875, 8.038330078125, 8.3121337890625, 8.5859375]}, "gradients/decoder.transformer.h.2.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 0.0, 1.0, 3.0, 0.0, 1.0, 0.0, 13.0, 5.0, 3.0, 9.0, 14.0, 13.0, 18.0, 30.0, 39.0, 36.0, 55.0, 65.0, 105.0, 92.0, 141.0, 196.0, 263.0, 536.0, 1517.0, 8056.0, 403298.0, 3608197.0, 163609.0, 5424.0, 1098.0, 419.0, 248.0, 179.0, 124.0, 105.0, 79.0, 73.0, 50.0, 49.0, 40.0, 20.0, 25.0, 19.0, 6.0, 7.0, 8.0, 4.0, 1.0, 2.0, 2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-37.875, -36.6689453125, -35.462890625, -34.2568359375, -33.05078125, -31.8447265625, -30.638671875, -29.4326171875, -28.2265625, -27.0205078125, -25.814453125, -24.6083984375, -23.40234375, -22.1962890625, -20.990234375, -19.7841796875, -18.578125, -17.3720703125, -16.166015625, -14.9599609375, -13.75390625, -12.5478515625, -11.341796875, -10.1357421875, -8.9296875, -7.7236328125, -6.517578125, -5.3115234375, -4.10546875, -2.8994140625, -1.693359375, -0.4873046875, 0.71875, 1.9248046875, 3.130859375, 4.3369140625, 5.54296875, 6.7490234375, 7.955078125, 9.1611328125, 10.3671875, 11.5732421875, 12.779296875, 13.9853515625, 15.19140625, 16.3974609375, 17.603515625, 18.8095703125, 20.015625, 21.2216796875, 22.427734375, 23.6337890625, 24.83984375, 26.0458984375, 27.251953125, 28.4580078125, 29.6640625, 30.8701171875, 32.076171875, 33.2822265625, 34.48828125, 35.6943359375, 36.900390625, 38.1064453125, 39.3125]}, "gradients/decoder.transformer.h.2.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 7.0, 4.0, 1.0, 3.0, 12.0, 7.0, 10.0, 24.0, 36.0, 55.0, 58.0, 93.0, 152.0, 203.0, 285.0, 457.0, 671.0, 703.0, 433.0, 281.0, 210.0, 123.0, 81.0, 61.0, 37.0, 24.0, 14.0, 7.0, 7.0, 3.0, 6.0, 7.0, 4.0, 4.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-19.296875, -18.632080078125, -17.96728515625, -17.302490234375, -16.6376953125, -15.972900390625, -15.30810546875, -14.643310546875, -13.978515625, -13.313720703125, -12.64892578125, -11.984130859375, -11.3193359375, -10.654541015625, -9.98974609375, -9.324951171875, -8.66015625, -7.995361328125, -7.33056640625, -6.665771484375, -6.0009765625, -5.336181640625, -4.67138671875, -4.006591796875, -3.341796875, -2.677001953125, -2.01220703125, -1.347412109375, -0.6826171875, -0.017822265625, 0.64697265625, 1.311767578125, 1.9765625, 2.641357421875, 3.30615234375, 3.970947265625, 4.6357421875, 5.300537109375, 5.96533203125, 6.630126953125, 7.294921875, 7.959716796875, 8.62451171875, 9.289306640625, 9.9541015625, 10.618896484375, 11.28369140625, 11.948486328125, 12.61328125, 13.278076171875, 13.94287109375, 14.607666015625, 15.2724609375, 15.937255859375, 16.60205078125, 17.266845703125, 17.931640625, 18.596435546875, 19.26123046875, 19.926025390625, 20.5908203125, 21.255615234375, 21.92041015625, 22.585205078125, 23.25]}, "gradients/decoder.transformer.h.2.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0, 2.0, 9.0, 12.0, 25.0, 44.0, 66.0, 128.0, 293.0, 1103.0, 26923.0, 4155552.0, 8756.0, 846.0, 271.0, 121.0, 63.0, 34.0, 19.0, 7.0, 3.0, 6.0, 1.0, 1.0, 2.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-129.5, -126.126953125, -122.75390625, -119.380859375, -116.0078125, -112.634765625, -109.26171875, -105.888671875, -102.515625, -99.142578125, -95.76953125, -92.396484375, -89.0234375, -85.650390625, -82.27734375, -78.904296875, -75.53125, -72.158203125, -68.78515625, -65.412109375, -62.0390625, -58.666015625, -55.29296875, -51.919921875, -48.546875, -45.173828125, -41.80078125, -38.427734375, -35.0546875, -31.681640625, -28.30859375, -24.935546875, -21.5625, -18.189453125, -14.81640625, -11.443359375, -8.0703125, -4.697265625, -1.32421875, 2.048828125, 5.421875, 8.794921875, 12.16796875, 15.541015625, 18.9140625, 22.287109375, 25.66015625, 29.033203125, 32.40625, 35.779296875, 39.15234375, 42.525390625, 45.8984375, 49.271484375, 52.64453125, 56.017578125, 59.390625, 62.763671875, 66.13671875, 69.509765625, 72.8828125, 76.255859375, 79.62890625, 83.001953125, 86.375]}, "gradients/decoder.transformer.h.2.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 5.0, 180.0, 786.0, 43.0, 4.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-678.7581176757812, -662.726318359375, -646.6944580078125, -630.66259765625, -614.6307983398438, -598.5989990234375, -582.567138671875, -566.5352783203125, -550.5034790039062, -534.4716796875, -518.4398193359375, -502.4079895019531, -486.37615966796875, -470.3443298339844, -454.3125, -438.2806701660156, -422.24884033203125, -406.2170104980469, -390.1851806640625, -374.1533508300781, -358.12152099609375, -342.0896911621094, -326.057861328125, -310.0260314941406, -293.99420166015625, -277.9623718261719, -261.9305419921875, -245.89871215820312, -229.86688232421875, -213.83505249023438, -197.80322265625, -181.77139282226562, -165.7396240234375, -149.70779418945312, -133.67596435546875, -117.64413452148438, -101.6123046875, -85.58047485351562, -69.54864501953125, -53.516815185546875, -37.4849853515625, -21.453155517578125, -5.42132568359375, 10.610504150390625, 26.642333984375, 42.674163818359375, 58.70599365234375, 74.73782348632812, 90.7696533203125, 106.80148315429688, 122.83331298828125, 138.86514282226562, 154.89697265625, 170.92880249023438, 186.96063232421875, 202.99246215820312, 219.0242919921875, 235.05612182617188, 251.08795166015625, 267.1197814941406, 283.151611328125, 299.1834411621094, 315.21527099609375, 331.2471008300781, 347.2789306640625]}, "gradients/decoder.transformer.h.2.ln_2.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 3.0, 4.0, 6.0, 6.0, 15.0, 14.0, 8.0, 21.0, 15.0, 23.0, 27.0, 21.0, 31.0, 39.0, 34.0, 49.0, 44.0, 54.0, 54.0, 54.0, 54.0, 41.0, 37.0, 52.0, 44.0, 36.0, 37.0, 31.0, 41.0, 22.0, 24.0, 17.0, 21.0, 5.0, 11.0, 6.0, 4.0, 2.0, 4.0, 2.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-74.73199462890625, -72.3093490600586, -69.88670349121094, -67.46405792236328, -65.04141235351562, -62.6187629699707, -60.19611740112305, -57.773468017578125, -55.35082244873047, -52.92817687988281, -50.505531311035156, -48.0828857421875, -45.66023635864258, -43.23759078979492, -40.814945220947266, -38.392295837402344, -35.96965408325195, -33.5470085144043, -31.124361038208008, -28.70171546936035, -26.279067993164062, -23.856422424316406, -21.43377685546875, -19.01112937927246, -16.588483810424805, -14.165837287902832, -11.74319076538086, -9.320545196533203, -6.8978986740112305, -4.475252151489258, -2.0526065826416016, 0.3700408935546875, 2.7926864624023438, 5.215332984924316, 7.637979030609131, 10.060625076293945, 12.483271598815918, 14.90591812133789, 17.328563690185547, 19.751211166381836, 22.173856735229492, 24.59650230407715, 27.019149780273438, 29.441795349121094, 31.86444091796875, 34.287086486816406, 36.70973205566406, 39.132381439208984, 41.55502700805664, 43.9776725769043, 46.40031814575195, 48.822967529296875, 51.24561309814453, 53.66825866699219, 56.090904235839844, 58.5135498046875, 60.936195373535156, 63.35884094238281, 65.78148651123047, 68.20413208007812, 70.62677764892578, 73.04942321777344, 75.47207641601562, 77.89472198486328, 80.31736755371094]}, "gradients/decoder.transformer.h.2.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 3.0, 4.0, 4.0, 0.0, 0.0, 2.0, 1.0, 10.0, 6.0, 9.0, 10.0, 15.0, 18.0, 26.0, 25.0, 28.0, 25.0, 25.0, 36.0, 23.0, 26.0, 42.0, 46.0, 35.0, 42.0, 44.0, 52.0, 42.0, 43.0, 40.0, 35.0, 33.0, 37.0, 33.0, 29.0, 22.0, 15.0, 23.0, 24.0, 13.0, 14.0, 10.0, 10.0, 10.0, 7.0, 6.0, 3.0, 4.0, 2.0, 1.0, 2.0, 2.0, 2.0], "bins": [-10.609375, -10.321533203125, -10.03369140625, -9.745849609375, -9.4580078125, -9.170166015625, -8.88232421875, -8.594482421875, -8.306640625, -8.018798828125, -7.73095703125, -7.443115234375, -7.1552734375, -6.867431640625, -6.57958984375, -6.291748046875, -6.00390625, -5.716064453125, -5.42822265625, -5.140380859375, -4.8525390625, -4.564697265625, -4.27685546875, -3.989013671875, -3.701171875, -3.413330078125, -3.12548828125, -2.837646484375, -2.5498046875, -2.261962890625, -1.97412109375, -1.686279296875, -1.3984375, -1.110595703125, -0.82275390625, -0.534912109375, -0.2470703125, 0.040771484375, 0.32861328125, 0.616455078125, 0.904296875, 1.192138671875, 1.47998046875, 1.767822265625, 2.0556640625, 2.343505859375, 2.63134765625, 2.919189453125, 3.20703125, 3.494873046875, 3.78271484375, 4.070556640625, 4.3583984375, 4.646240234375, 4.93408203125, 5.221923828125, 5.509765625, 5.797607421875, 6.08544921875, 6.373291015625, 6.6611328125, 6.948974609375, 7.23681640625, 7.524658203125, 7.8125]}, "gradients/decoder.transformer.h.2.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 2.0, 3.0, 8.0, 6.0, 5.0, 8.0, 11.0, 28.0, 44.0, 72.0, 96.0, 129.0, 217.0, 370.0, 545.0, 877.0, 1376.0, 2239.0, 3561.0, 5697.0, 9398.0, 15514.0, 26684.0, 46110.0, 83038.0, 178442.0, 371520.0, 136986.0, 69368.0, 38381.0, 22731.0, 13782.0, 8212.0, 4965.0, 3002.0, 1835.0, 1082.0, 806.0, 511.0, 293.0, 183.0, 141.0, 106.0, 64.0, 41.0, 30.0, 17.0, 6.0, 9.0, 8.0, 4.0, 1.0, 4.0, 2.0], "bins": [-2.994140625, -2.911773681640625, -2.82940673828125, -2.747039794921875, -2.6646728515625, -2.582305908203125, -2.49993896484375, -2.417572021484375, -2.335205078125, -2.252838134765625, -2.17047119140625, -2.088104248046875, -2.0057373046875, -1.923370361328125, -1.84100341796875, -1.758636474609375, -1.67626953125, -1.593902587890625, -1.51153564453125, -1.429168701171875, -1.3468017578125, -1.264434814453125, -1.18206787109375, -1.099700927734375, -1.017333984375, -0.934967041015625, -0.85260009765625, -0.770233154296875, -0.6878662109375, -0.605499267578125, -0.52313232421875, -0.440765380859375, -0.3583984375, -0.276031494140625, -0.19366455078125, -0.111297607421875, -0.0289306640625, 0.053436279296875, 0.13580322265625, 0.218170166015625, 0.300537109375, 0.382904052734375, 0.46527099609375, 0.547637939453125, 0.6300048828125, 0.712371826171875, 0.79473876953125, 0.877105712890625, 0.95947265625, 1.041839599609375, 1.12420654296875, 1.206573486328125, 1.2889404296875, 1.371307373046875, 1.45367431640625, 1.536041259765625, 1.618408203125, 1.700775146484375, 1.78314208984375, 1.865509033203125, 1.9478759765625, 2.030242919921875, 2.11260986328125, 2.194976806640625, 2.27734375]}, "gradients/decoder.transformer.h.2.crossattention.c_attn.bias": {"_type": "histogram", "values": [3.0, 0.0, 5.0, 2.0, 2.0, 4.0, 6.0, 9.0, 3.0, 7.0, 11.0, 10.0, 14.0, 15.0, 22.0, 13.0, 21.0, 23.0, 20.0, 27.0, 24.0, 27.0, 40.0, 35.0, 34.0, 37.0, 21.0, 44.0, 36.0, 1065.0, 28.0, 36.0, 41.0, 28.0, 42.0, 28.0, 29.0, 26.0, 23.0, 15.0, 18.0, 26.0, 23.0, 19.0, 10.0, 15.0, 11.0, 8.0, 8.0, 1.0, 5.0, 9.0, 5.0, 5.0, 3.0, 3.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.609375, -4.45343017578125, -4.2974853515625, -4.14154052734375, -3.985595703125, -3.82965087890625, -3.6737060546875, -3.51776123046875, -3.36181640625, -3.20587158203125, -3.0499267578125, -2.89398193359375, -2.738037109375, -2.58209228515625, -2.4261474609375, -2.27020263671875, -2.1142578125, -1.95831298828125, -1.8023681640625, -1.64642333984375, -1.490478515625, -1.33453369140625, -1.1785888671875, -1.02264404296875, -0.86669921875, -0.71075439453125, -0.5548095703125, -0.39886474609375, -0.242919921875, -0.08697509765625, 0.0689697265625, 0.22491455078125, 0.380859375, 0.53680419921875, 0.6927490234375, 0.84869384765625, 1.004638671875, 1.16058349609375, 1.3165283203125, 1.47247314453125, 1.62841796875, 1.78436279296875, 1.9403076171875, 2.09625244140625, 2.252197265625, 2.40814208984375, 2.5640869140625, 2.72003173828125, 2.8759765625, 3.03192138671875, 3.1878662109375, 3.34381103515625, 3.499755859375, 3.65570068359375, 3.8116455078125, 3.96759033203125, 4.12353515625, 4.27947998046875, 4.4354248046875, 4.59136962890625, 4.747314453125, 4.90325927734375, 5.0592041015625, 5.21514892578125, 5.37109375]}, "gradients/decoder.transformer.h.2.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 2.0, 6.0, 5.0, 12.0, 5.0, 5.0, 11.0, 18.0, 23.0, 51.0, 62.0, 115.0, 174.0, 278.0, 468.0, 784.0, 1363.0, 2372.0, 3853.0, 6739.0, 11821.0, 20996.0, 38074.0, 72108.0, 146911.0, 1440736.0, 172831.0, 80885.0, 42326.0, 23258.0, 12818.0, 7540.0, 4222.0, 2587.0, 1470.0, 841.0, 536.0, 323.0, 161.0, 109.0, 72.0, 56.0, 30.0, 26.0, 14.0, 13.0, 3.0, 8.0, 9.0, 5.0, 1.0, 3.0, 4.0, 1.0, 3.0], "bins": [-2.970703125, -2.88482666015625, -2.7989501953125, -2.71307373046875, -2.627197265625, -2.54132080078125, -2.4554443359375, -2.36956787109375, -2.28369140625, -2.19781494140625, -2.1119384765625, -2.02606201171875, -1.940185546875, -1.85430908203125, -1.7684326171875, -1.68255615234375, -1.5966796875, -1.51080322265625, -1.4249267578125, -1.33905029296875, -1.253173828125, -1.16729736328125, -1.0814208984375, -0.99554443359375, -0.90966796875, -0.82379150390625, -0.7379150390625, -0.65203857421875, -0.566162109375, -0.48028564453125, -0.3944091796875, -0.30853271484375, -0.22265625, -0.13677978515625, -0.0509033203125, 0.03497314453125, 0.120849609375, 0.20672607421875, 0.2926025390625, 0.37847900390625, 0.46435546875, 0.55023193359375, 0.6361083984375, 0.72198486328125, 0.807861328125, 0.89373779296875, 0.9796142578125, 1.06549072265625, 1.1513671875, 1.23724365234375, 1.3231201171875, 1.40899658203125, 1.494873046875, 1.58074951171875, 1.6666259765625, 1.75250244140625, 1.83837890625, 1.92425537109375, 2.0101318359375, 2.09600830078125, 2.181884765625, 2.26776123046875, 2.3536376953125, 2.43951416015625, 2.525390625]}, "gradients/decoder.transformer.h.2.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 2.0, 4.0, 6.0, 7.0, 11.0, 8.0, 11.0, 10.0, 23.0, 15.0, 26.0, 29.0, 52.0, 75.0, 92.0, 129.0, 133.0, 100.0, 68.0, 53.0, 45.0, 27.0, 23.0, 16.0, 8.0, 7.0, 8.0, 5.0, 4.0, 4.0, 1.0, 3.0, 1.0, 2.0, 2.0, 0.0, 1.0, 2.0, 3.0, 1.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.0022792816162109375, -0.0022161900997161865, -0.0021530985832214355, -0.0020900070667266846, -0.0020269155502319336, -0.0019638240337371826, -0.0019007325172424316, -0.0018376410007476807, -0.0017745494842529297, -0.0017114579677581787, -0.0016483664512634277, -0.0015852749347686768, -0.0015221834182739258, -0.0014590919017791748, -0.0013960003852844238, -0.0013329088687896729, -0.0012698173522949219, -0.001206725835800171, -0.00114363431930542, -0.001080542802810669, -0.001017451286315918, -0.000954359769821167, -0.000891268253326416, -0.000828176736831665, -0.0007650852203369141, -0.0007019937038421631, -0.0006389021873474121, -0.0005758106708526611, -0.0005127191543579102, -0.0004496276378631592, -0.0003865361213684082, -0.0003234446048736572, -0.00026035308837890625, -0.00019726157188415527, -0.0001341700553894043, -7.107853889465332e-05, -7.987022399902344e-06, 5.510449409484863e-05, 0.00011819601058959961, 0.00018128752708435059, 0.00024437904357910156, 0.00030747056007385254, 0.0003705620765686035, 0.0004336535930633545, 0.0004967451095581055, 0.0005598366260528564, 0.0006229281425476074, 0.0006860196590423584, 0.0007491111755371094, 0.0008122026920318604, 0.0008752942085266113, 0.0009383857250213623, 0.0010014772415161133, 0.0010645687580108643, 0.0011276602745056152, 0.0011907517910003662, 0.0012538433074951172, 0.0013169348239898682, 0.0013800263404846191, 0.0014431178569793701, 0.001506209373474121, 0.001569300889968872, 0.001632392406463623, 0.001695483922958374, 0.001758575439453125]}, "gradients/decoder.transformer.h.2.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 2.0, 3.0, 3.0, 5.0, 3.0, 5.0, 3.0, 6.0, 4.0, 6.0, 4.0, 7.0, 13.0, 13.0, 26.0, 22.0, 38.0, 55.0, 84.0, 105.0, 193.0, 280.0, 487.0, 1126.0, 523166.0, 520333.0, 1149.0, 520.0, 275.0, 194.0, 133.0, 69.0, 68.0, 32.0, 36.0, 21.0, 29.0, 15.0, 9.0, 11.0, 7.0, 2.0, 2.0, 4.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.028839111328125, -0.02780914306640625, -0.0267791748046875, -0.02574920654296875, -0.02471923828125, -0.02368927001953125, -0.0226593017578125, -0.02162933349609375, -0.020599365234375, -0.01956939697265625, -0.0185394287109375, -0.01750946044921875, -0.0164794921875, -0.01544952392578125, -0.0144195556640625, -0.01338958740234375, -0.012359619140625, -0.01132965087890625, -0.0102996826171875, -0.00926971435546875, -0.00823974609375, -0.00720977783203125, -0.0061798095703125, -0.00514984130859375, -0.004119873046875, -0.00308990478515625, -0.0020599365234375, -0.00102996826171875, 0.0, 0.00102996826171875, 0.0020599365234375, 0.00308990478515625, 0.004119873046875, 0.00514984130859375, 0.0061798095703125, 0.00720977783203125, 0.00823974609375, 0.00926971435546875, 0.0102996826171875, 0.01132965087890625, 0.012359619140625, 0.01338958740234375, 0.0144195556640625, 0.01544952392578125, 0.0164794921875, 0.01750946044921875, 0.0185394287109375, 0.01956939697265625, 0.020599365234375, 0.02162933349609375, 0.0226593017578125, 0.02368927001953125, 0.02471923828125, 0.02574920654296875, 0.0267791748046875, 0.02780914306640625, 0.028839111328125, 0.02986907958984375, 0.0308990478515625, 0.03192901611328125, 0.032958984375, 0.03398895263671875, 0.0350189208984375, 0.03604888916015625, 0.037078857421875]}, "gradients/decoder.transformer.h.2.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 5.0, 106.0, 619.0, 262.0, 22.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.005850363057106733, -0.005727712530642748, -0.00560506246984005, -0.005482411943376064, -0.005359761882573366, -0.005237111356109381, -0.005114461295306683, -0.004991810768842697, -0.004869160708039999, -0.0047465101815760136, -0.004623860120773315, -0.00450120959430933, -0.004378559533506632, -0.004255909007042646, -0.004133258946239948, -0.004010608419775963, -0.003887958126142621, -0.0037653078325092793, -0.0036426575388759375, -0.0035200072452425957, -0.003397356951609254, -0.003274706657975912, -0.0031520561315119267, -0.0030294060707092285, -0.002906755544245243, -0.0027841052506119013, -0.0026614549569785595, -0.0025388046633452177, -0.002416154369711876, -0.002293504076078534, -0.0021708537824451923, -0.002048203255981207, -0.0019255531951785088, -0.001802902901545167, -0.0016802526079118252, -0.0015576023142784834, -0.0014349520206451416, -0.0013123017270117998, -0.0011896513169631362, -0.0010670010233297944, -0.0009443507296964526, -0.0008217004360631108, -0.000699050142429769, -0.0005763997905887663, -0.00045374949695542455, -0.00033109920332208276, -0.00020844885148108006, -8.579855784773827e-05, 3.685173578560352e-05, 0.00015950204397086054, 0.00028215235215611756, 0.0004048026748932898, 0.0005274529685266316, 0.0006501032621599734, 0.0007727536140009761, 0.0008954039076343179, 0.0010180542012676597, 0.0011407044949010015, 0.0012633547885343432, 0.0013860051985830069, 0.0015086554922163486, 0.0016313057858496904, 0.0017539560794830322, 0.001876606373116374, 0.001999256666749716]}, "gradients/decoder.transformer.h.2.ln_cross_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 2.0, 4.0, 3.0, 8.0, 8.0, 10.0, 17.0, 9.0, 19.0, 21.0, 19.0, 17.0, 17.0, 20.0, 23.0, 20.0, 35.0, 37.0, 40.0, 27.0, 55.0, 49.0, 45.0, 43.0, 41.0, 38.0, 44.0, 32.0, 27.0, 33.0, 38.0, 31.0, 19.0, 25.0, 19.0, 15.0, 16.0, 13.0, 19.0, 16.0, 5.0, 7.0, 8.0, 6.0, 2.0, 5.0, 2.0, 2.0, 4.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.0007446408271789551, -0.0007225163280963898, -0.0007003918290138245, -0.0006782673299312592, -0.0006561428308486938, -0.0006340183317661285, -0.0006118938326835632, -0.0005897693336009979, -0.0005676448345184326, -0.0005455203354358673, -0.000523395836353302, -0.0005012713372707367, -0.0004791468381881714, -0.0004570223391056061, -0.00043489784002304077, -0.00041277334094047546, -0.00039064884185791016, -0.00036852434277534485, -0.00034639984369277954, -0.00032427534461021423, -0.0003021508455276489, -0.0002800263464450836, -0.0002579018473625183, -0.000235777348279953, -0.0002136528491973877, -0.0001915283501148224, -0.00016940385103225708, -0.00014727935194969177, -0.00012515485286712646, -0.00010303035378456116, -8.090585470199585e-05, -5.878135561943054e-05, -3.6656856536865234e-05, -1.4532357454299927e-05, 7.592141628265381e-06, 2.971664071083069e-05, 5.1841139793395996e-05, 7.39656388759613e-05, 9.609013795852661e-05, 0.00011821463704109192, 0.00014033913612365723, 0.00016246363520622253, 0.00018458813428878784, 0.00020671263337135315, 0.00022883713245391846, 0.00025096163153648376, 0.00027308613061904907, 0.0002952106297016144, 0.0003173351287841797, 0.000339459627866745, 0.0003615841269493103, 0.0003837086260318756, 0.0004058331251144409, 0.0004279576241970062, 0.00045008212327957153, 0.00047220662236213684, 0.0004943311214447021, 0.0005164556205272675, 0.0005385801196098328, 0.0005607046186923981, 0.0005828291177749634, 0.0006049536168575287, 0.000627078115940094, 0.0006492026150226593, 0.0006713271141052246]}, "gradients/decoder.transformer.h.2.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 3.0, 4.0, 4.0, 0.0, 0.0, 2.0, 1.0, 10.0, 6.0, 9.0, 10.0, 15.0, 18.0, 26.0, 25.0, 28.0, 25.0, 25.0, 36.0, 23.0, 26.0, 42.0, 46.0, 35.0, 42.0, 44.0, 52.0, 42.0, 43.0, 40.0, 35.0, 33.0, 37.0, 33.0, 29.0, 22.0, 15.0, 23.0, 24.0, 13.0, 14.0, 10.0, 10.0, 10.0, 7.0, 6.0, 3.0, 4.0, 2.0, 1.0, 2.0, 2.0, 2.0], "bins": [-10.609375, -10.321533203125, -10.03369140625, -9.745849609375, -9.4580078125, -9.170166015625, -8.88232421875, -8.594482421875, -8.306640625, -8.018798828125, -7.73095703125, -7.443115234375, -7.1552734375, -6.867431640625, -6.57958984375, -6.291748046875, -6.00390625, -5.716064453125, -5.42822265625, -5.140380859375, -4.8525390625, -4.564697265625, -4.27685546875, -3.989013671875, -3.701171875, -3.413330078125, -3.12548828125, -2.837646484375, -2.5498046875, -2.261962890625, -1.97412109375, -1.686279296875, -1.3984375, -1.110595703125, -0.82275390625, -0.534912109375, -0.2470703125, 0.040771484375, 0.32861328125, 0.616455078125, 0.904296875, 1.192138671875, 1.47998046875, 1.767822265625, 2.0556640625, 2.343505859375, 2.63134765625, 2.919189453125, 3.20703125, 3.494873046875, 3.78271484375, 4.070556640625, 4.3583984375, 4.646240234375, 4.93408203125, 5.221923828125, 5.509765625, 5.797607421875, 6.08544921875, 6.373291015625, 6.6611328125, 6.948974609375, 7.23681640625, 7.524658203125, 7.8125]}, "gradients/decoder.transformer.h.2.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 2.0, 2.0, 1.0, 1.0, 3.0, 6.0, 13.0, 7.0, 14.0, 19.0, 24.0, 36.0, 45.0, 62.0, 109.0, 162.0, 229.0, 357.0, 531.0, 764.0, 1190.0, 1882.0, 3457.0, 8451.0, 51917.0, 697923.0, 248455.0, 20488.0, 5368.0, 2559.0, 1519.0, 982.0, 677.0, 431.0, 256.0, 171.0, 138.0, 97.0, 63.0, 41.0, 36.0, 26.0, 22.0, 8.0, 6.0, 6.0, 2.0, 4.0, 4.0, 0.0, 2.0, 2.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-21.71875, -20.93701171875, -20.1552734375, -19.37353515625, -18.591796875, -17.81005859375, -17.0283203125, -16.24658203125, -15.46484375, -14.68310546875, -13.9013671875, -13.11962890625, -12.337890625, -11.55615234375, -10.7744140625, -9.99267578125, -9.2109375, -8.42919921875, -7.6474609375, -6.86572265625, -6.083984375, -5.30224609375, -4.5205078125, -3.73876953125, -2.95703125, -2.17529296875, -1.3935546875, -0.61181640625, 0.169921875, 0.95166015625, 1.7333984375, 2.51513671875, 3.296875, 4.07861328125, 4.8603515625, 5.64208984375, 6.423828125, 7.20556640625, 7.9873046875, 8.76904296875, 9.55078125, 10.33251953125, 11.1142578125, 11.89599609375, 12.677734375, 13.45947265625, 14.2412109375, 15.02294921875, 15.8046875, 16.58642578125, 17.3681640625, 18.14990234375, 18.931640625, 19.71337890625, 20.4951171875, 21.27685546875, 22.05859375, 22.84033203125, 23.6220703125, 24.40380859375, 25.185546875, 25.96728515625, 26.7490234375, 27.53076171875, 28.3125]}, "gradients/decoder.transformer.h.2.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 2.0, 2.0, 6.0, 7.0, 18.0, 15.0, 21.0, 47.0, 67.0, 79.0, 98.0, 146.0, 2069.0, 134.0, 82.0, 77.0, 55.0, 37.0, 35.0, 20.0, 20.0, 7.0, 7.0, 4.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-62.375, -60.61572265625, -58.8564453125, -57.09716796875, -55.337890625, -53.57861328125, -51.8193359375, -50.06005859375, -48.30078125, -46.54150390625, -44.7822265625, -43.02294921875, -41.263671875, -39.50439453125, -37.7451171875, -35.98583984375, -34.2265625, -32.46728515625, -30.7080078125, -28.94873046875, -27.189453125, -25.43017578125, -23.6708984375, -21.91162109375, -20.15234375, -18.39306640625, -16.6337890625, -14.87451171875, -13.115234375, -11.35595703125, -9.5966796875, -7.83740234375, -6.078125, -4.31884765625, -2.5595703125, -0.80029296875, 0.958984375, 2.71826171875, 4.4775390625, 6.23681640625, 7.99609375, 9.75537109375, 11.5146484375, 13.27392578125, 15.033203125, 16.79248046875, 18.5517578125, 20.31103515625, 22.0703125, 23.82958984375, 25.5888671875, 27.34814453125, 29.107421875, 30.86669921875, 32.6259765625, 34.38525390625, 36.14453125, 37.90380859375, 39.6630859375, 41.42236328125, 43.181640625, 44.94091796875, 46.7001953125, 48.45947265625, 50.21875]}, "gradients/decoder.transformer.h.2.attn.c_attn.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 4.0, 3.0, 11.0, 14.0, 24.0, 44.0, 51.0, 96.0, 209.0, 403.0, 2138.0, 3139139.0, 2660.0, 458.0, 204.0, 119.0, 50.0, 27.0, 30.0, 16.0, 6.0, 3.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-159.375, -153.791015625, -148.20703125, -142.623046875, -137.0390625, -131.455078125, -125.87109375, -120.287109375, -114.703125, -109.119140625, -103.53515625, -97.951171875, -92.3671875, -86.783203125, -81.19921875, -75.615234375, -70.03125, -64.447265625, -58.86328125, -53.279296875, -47.6953125, -42.111328125, -36.52734375, -30.943359375, -25.359375, -19.775390625, -14.19140625, -8.607421875, -3.0234375, 2.560546875, 8.14453125, 13.728515625, 19.3125, 24.896484375, 30.48046875, 36.064453125, 41.6484375, 47.232421875, 52.81640625, 58.400390625, 63.984375, 69.568359375, 75.15234375, 80.736328125, 86.3203125, 91.904296875, 97.48828125, 103.072265625, 108.65625, 114.240234375, 119.82421875, 125.408203125, 130.9921875, 136.576171875, 142.16015625, 147.744140625, 153.328125, 158.912109375, 164.49609375, 170.080078125, 175.6640625, 181.248046875, 186.83203125, 192.416015625, 198.0]}, "gradients/decoder.transformer.h.2.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 10.0, 41.0, 163.0, 423.0, 293.0, 61.0, 15.0, 7.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-59.99898147583008, -55.90753936767578, -51.816097259521484, -47.72465515136719, -43.633216857910156, -39.541770935058594, -35.45033264160156, -31.358890533447266, -27.26744842529297, -23.176006317138672, -19.084564208984375, -14.993124008178711, -10.901681900024414, -6.810239791870117, -2.718799591064453, 1.3726425170898438, 5.464084625244141, 9.555526733398438, 13.646967887878418, 17.7384090423584, 21.829851150512695, 25.921293258666992, 30.012733459472656, 34.10417556762695, 38.19561767578125, 42.28705978393555, 46.378501892089844, 50.469940185546875, 54.56138610839844, 58.65282440185547, 62.744266510009766, 66.83570861816406, 70.92715454101562, 75.01859283447266, 79.11003875732422, 83.20147705078125, 87.29292297363281, 91.38436126708984, 95.47579956054688, 99.56724548339844, 103.65869140625, 107.75012969970703, 111.8415756225586, 115.93301391601562, 120.02445983886719, 124.11589813232422, 128.20733642578125, 132.2987823486328, 136.39022827148438, 140.48167419433594, 144.57310485839844, 148.66455078125, 152.75599670410156, 156.84744262695312, 160.93887329101562, 165.0303192138672, 169.1217498779297, 173.21319580078125, 177.30462646484375, 181.3960723876953, 185.48751831054688, 189.57896423339844, 193.67039489746094, 197.7618408203125, 201.85328674316406]}, "gradients/decoder.transformer.h.2.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 7.0, 2.0, 2.0, 7.0, 11.0, 8.0, 12.0, 3.0, 17.0, 16.0, 19.0, 21.0, 17.0, 23.0, 30.0, 24.0, 37.0, 46.0, 43.0, 42.0, 36.0, 42.0, 47.0, 34.0, 52.0, 52.0, 52.0, 47.0, 41.0, 30.0, 29.0, 25.0, 20.0, 15.0, 24.0, 18.0, 11.0, 9.0, 12.0, 7.0, 5.0, 3.0, 3.0, 2.0, 3.0, 4.0, 2.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-82.27857971191406, -79.4348373413086, -76.59109497070312, -73.74734497070312, -70.90360260009766, -68.05986022949219, -65.21611785888672, -62.37237548828125, -59.52863311767578, -56.68489074707031, -53.84114456176758, -50.99740219116211, -48.15365982055664, -45.309913635253906, -42.46617126464844, -39.62242889404297, -36.778682708740234, -33.934940338134766, -31.091196060180664, -28.247451782226562, -25.403709411621094, -22.559965133666992, -19.71622085571289, -16.872478485107422, -14.02873420715332, -11.184990882873535, -8.34124755859375, -5.497503280639648, -2.6537599563598633, 0.18998336791992188, 3.0337276458740234, 5.877470016479492, 8.721214294433594, 11.564957618713379, 14.408700942993164, 17.252445220947266, 20.096187591552734, 22.939931869506836, 25.783676147460938, 28.627418518066406, 31.471162796020508, 34.31490707397461, 37.15864944458008, 40.00239562988281, 42.84613800048828, 45.68988037109375, 48.53362274169922, 51.37736511230469, 54.22111129760742, 57.06485366821289, 59.908599853515625, 62.752342224121094, 65.59608459472656, 68.43982696533203, 71.2835693359375, 74.1273193359375, 76.97106170654297, 79.81480407714844, 82.6585464477539, 85.50228881835938, 88.34603881835938, 91.18978118896484, 94.03352355957031, 96.87726593017578, 99.72100830078125]}, "gradients/decoder.transformer.h.1.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 1.0, 0.0, 1.0, 4.0, 4.0, 5.0, 5.0, 8.0, 8.0, 8.0, 22.0, 16.0, 12.0, 26.0, 28.0, 15.0, 32.0, 38.0, 35.0, 34.0, 44.0, 41.0, 35.0, 36.0, 47.0, 44.0, 42.0, 50.0, 42.0, 40.0, 40.0, 35.0, 17.0, 25.0, 16.0, 20.0, 21.0, 20.0, 12.0, 12.0, 15.0, 11.0, 10.0, 9.0, 6.0, 6.0, 5.0, 3.0, 3.0, 3.0, 1.0, 0.0, 1.0, 2.0], "bins": [-10.3984375, -10.1014404296875, -9.804443359375, -9.5074462890625, -9.21044921875, -8.9134521484375, -8.616455078125, -8.3194580078125, -8.0224609375, -7.7254638671875, -7.428466796875, -7.1314697265625, -6.83447265625, -6.5374755859375, -6.240478515625, -5.9434814453125, -5.646484375, -5.3494873046875, -5.052490234375, -4.7554931640625, -4.45849609375, -4.1614990234375, -3.864501953125, -3.5675048828125, -3.2705078125, -2.9735107421875, -2.676513671875, -2.3795166015625, -2.08251953125, -1.7855224609375, -1.488525390625, -1.1915283203125, -0.89453125, -0.5975341796875, -0.300537109375, -0.0035400390625, 0.29345703125, 0.5904541015625, 0.887451171875, 1.1844482421875, 1.4814453125, 1.7784423828125, 2.075439453125, 2.3724365234375, 2.66943359375, 2.9664306640625, 3.263427734375, 3.5604248046875, 3.857421875, 4.1544189453125, 4.451416015625, 4.7484130859375, 5.04541015625, 5.3424072265625, 5.639404296875, 5.9364013671875, 6.2333984375, 6.5303955078125, 6.827392578125, 7.1243896484375, 7.42138671875, 7.7183837890625, 8.015380859375, 8.3123779296875, 8.609375]}, "gradients/decoder.transformer.h.1.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 4.0, 0.0, 1.0, 4.0, 2.0, 8.0, 4.0, 10.0, 9.0, 9.0, 19.0, 22.0, 36.0, 55.0, 65.0, 100.0, 130.0, 182.0, 260.0, 373.0, 580.0, 837.0, 1510.0, 2701.0, 5712.0, 13599.0, 40985.0, 188425.0, 868882.0, 1925738.0, 882824.0, 189501.0, 44176.0, 14526.0, 5859.0, 2897.0, 1533.0, 924.0, 551.0, 379.0, 255.0, 182.0, 111.0, 83.0, 64.0, 45.0, 36.0, 25.0, 18.0, 12.0, 12.0, 7.0, 4.0, 5.0, 1.0, 3.0, 2.0, 2.0, 1.0, 1.0, 2.0], "bins": [-13.53125, -13.114990234375, -12.69873046875, -12.282470703125, -11.8662109375, -11.449951171875, -11.03369140625, -10.617431640625, -10.201171875, -9.784912109375, -9.36865234375, -8.952392578125, -8.5361328125, -8.119873046875, -7.70361328125, -7.287353515625, -6.87109375, -6.454833984375, -6.03857421875, -5.622314453125, -5.2060546875, -4.789794921875, -4.37353515625, -3.957275390625, -3.541015625, -3.124755859375, -2.70849609375, -2.292236328125, -1.8759765625, -1.459716796875, -1.04345703125, -0.627197265625, -0.2109375, 0.205322265625, 0.62158203125, 1.037841796875, 1.4541015625, 1.870361328125, 2.28662109375, 2.702880859375, 3.119140625, 3.535400390625, 3.95166015625, 4.367919921875, 4.7841796875, 5.200439453125, 5.61669921875, 6.032958984375, 6.44921875, 6.865478515625, 7.28173828125, 7.697998046875, 8.1142578125, 8.530517578125, 8.94677734375, 9.363037109375, 9.779296875, 10.195556640625, 10.61181640625, 11.028076171875, 11.4443359375, 11.860595703125, 12.27685546875, 12.693115234375, 13.109375]}, "gradients/decoder.transformer.h.1.mlp.c_fc.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 4.0, 0.0, 5.0, 8.0, 7.0, 8.0, 10.0, 16.0, 17.0, 34.0, 55.0, 87.0, 135.0, 165.0, 230.0, 308.0, 479.0, 625.0, 609.0, 424.0, 247.0, 190.0, 141.0, 72.0, 61.0, 32.0, 24.0, 20.0, 18.0, 16.0, 5.0, 6.0, 6.0, 4.0, 5.0, 3.0, 3.0, 2.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-17.328125, -16.6865234375, -16.044921875, -15.4033203125, -14.76171875, -14.1201171875, -13.478515625, -12.8369140625, -12.1953125, -11.5537109375, -10.912109375, -10.2705078125, -9.62890625, -8.9873046875, -8.345703125, -7.7041015625, -7.0625, -6.4208984375, -5.779296875, -5.1376953125, -4.49609375, -3.8544921875, -3.212890625, -2.5712890625, -1.9296875, -1.2880859375, -0.646484375, -0.0048828125, 0.63671875, 1.2783203125, 1.919921875, 2.5615234375, 3.203125, 3.8447265625, 4.486328125, 5.1279296875, 5.76953125, 6.4111328125, 7.052734375, 7.6943359375, 8.3359375, 8.9775390625, 9.619140625, 10.2607421875, 10.90234375, 11.5439453125, 12.185546875, 12.8271484375, 13.46875, 14.1103515625, 14.751953125, 15.3935546875, 16.03515625, 16.6767578125, 17.318359375, 17.9599609375, 18.6015625, 19.2431640625, 19.884765625, 20.5263671875, 21.16796875, 21.8095703125, 22.451171875, 23.0927734375, 23.734375]}, "gradients/decoder.transformer.h.1.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 6.0, 0.0, 4.0, 1.0, 2.0, 1.0, 1.0, 3.0, 8.0, 2.0, 10.0, 12.0, 28.0, 32.0, 54.0, 94.0, 175.0, 385.0, 1039.0, 4075.0, 3819862.0, 364066.0, 2782.0, 834.0, 349.0, 189.0, 104.0, 50.0, 44.0, 26.0, 19.0, 8.0, 5.0, 6.0, 8.0, 5.0, 3.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-98.375, -95.6337890625, -92.892578125, -90.1513671875, -87.41015625, -84.6689453125, -81.927734375, -79.1865234375, -76.4453125, -73.7041015625, -70.962890625, -68.2216796875, -65.48046875, -62.7392578125, -59.998046875, -57.2568359375, -54.515625, -51.7744140625, -49.033203125, -46.2919921875, -43.55078125, -40.8095703125, -38.068359375, -35.3271484375, -32.5859375, -29.8447265625, -27.103515625, -24.3623046875, -21.62109375, -18.8798828125, -16.138671875, -13.3974609375, -10.65625, -7.9150390625, -5.173828125, -2.4326171875, 0.30859375, 3.0498046875, 5.791015625, 8.5322265625, 11.2734375, 14.0146484375, 16.755859375, 19.4970703125, 22.23828125, 24.9794921875, 27.720703125, 30.4619140625, 33.203125, 35.9443359375, 38.685546875, 41.4267578125, 44.16796875, 46.9091796875, 49.650390625, 52.3916015625, 55.1328125, 57.8740234375, 60.615234375, 63.3564453125, 66.09765625, 68.8388671875, 71.580078125, 74.3212890625, 77.0625]}, "gradients/decoder.transformer.h.1.ln_2.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 13.0, 300.0, 608.0, 92.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-264.47296142578125, -254.75198364257812, -245.031005859375, -235.3100128173828, -225.5890350341797, -215.86805725097656, -206.14706420898438, -196.42608642578125, -186.70510864257812, -176.984130859375, -167.26315307617188, -157.5421600341797, -147.82118225097656, -138.10020446777344, -128.37921142578125, -118.65823364257812, -108.937255859375, -99.21627807617188, -89.49529266357422, -79.77430725097656, -70.05332946777344, -60.33234786987305, -50.611366271972656, -40.890380859375, -31.169403076171875, -21.448421478271484, -11.727439880371094, -2.006458282470703, 7.7145233154296875, 17.435504913330078, 27.15648651123047, 36.877471923828125, 46.598480224609375, 56.319461822509766, 66.04044342041016, 75.76142883300781, 85.48240661621094, 95.20338439941406, 104.92436981201172, 114.64535522460938, 124.3663330078125, 134.08731079101562, 143.80828857421875, 153.52928161621094, 163.25025939941406, 172.9712371826172, 182.69223022460938, 192.4132080078125, 202.13418579101562, 211.85516357421875, 221.57614135742188, 231.29713439941406, 241.0181121826172, 250.7390899658203, 260.4600830078125, 270.1810607910156, 279.90203857421875, 289.6230163574219, 299.343994140625, 309.0649719238281, 318.78594970703125, 328.5069580078125, 338.2279357910156, 347.94891357421875, 357.6698913574219]}, "gradients/decoder.transformer.h.1.ln_2.bias": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 5.0, 6.0, 2.0, 1.0, 1.0, 6.0, 10.0, 5.0, 13.0, 13.0, 12.0, 18.0, 17.0, 20.0, 24.0, 20.0, 27.0, 33.0, 36.0, 36.0, 36.0, 31.0, 45.0, 48.0, 56.0, 43.0, 36.0, 39.0, 49.0, 34.0, 49.0, 35.0, 16.0, 28.0, 28.0, 19.0, 14.0, 20.0, 12.0, 16.0, 10.0, 12.0, 7.0, 8.0, 3.0, 3.0, 5.0, 1.0, 0.0, 0.0, 3.0, 1.0, 1.0, 2.0, 0.0, 0.0, 2.0], "bins": [-64.44195556640625, -62.45415115356445, -60.46634292602539, -58.478538513183594, -56.4907341003418, -54.5029296875, -52.51512145996094, -50.52731704711914, -48.539512634277344, -46.55170822143555, -44.563899993896484, -42.57609558105469, -40.58829116821289, -38.600486755371094, -36.61267852783203, -34.624874114990234, -32.63706588745117, -30.649259567260742, -28.661455154418945, -26.673648834228516, -24.68584442138672, -22.69803810119629, -20.71023178100586, -18.722427368164062, -16.734621047973633, -14.74681568145752, -12.759010314941406, -10.771203994750977, -8.783398628234863, -6.79559326171875, -4.80778694152832, -2.819981575012207, -0.8321762084960938, 1.1556293964385986, 3.143435001373291, 5.1312408447265625, 7.119046211242676, 9.106851577758789, 11.094657897949219, 13.082463264465332, 15.070268630981445, 17.058074951171875, 19.045879364013672, 21.0336856842041, 23.02149200439453, 25.009296417236328, 26.997102737426758, 28.984909057617188, 30.972713470458984, 32.96051788330078, 34.948326110839844, 36.93613052368164, 38.92393493652344, 40.9117431640625, 42.8995475769043, 44.887351989746094, 46.875160217285156, 48.86296463012695, 50.850772857666016, 52.83857727050781, 54.82638168334961, 56.814186096191406, 58.80199432373047, 60.789798736572266, 62.77760314941406]}, "gradients/decoder.transformer.h.1.crossattention.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 3.0, 3.0, 2.0, 3.0, 8.0, 8.0, 2.0, 2.0, 11.0, 11.0, 13.0, 11.0, 12.0, 17.0, 16.0, 24.0, 38.0, 34.0, 32.0, 33.0, 48.0, 35.0, 57.0, 42.0, 44.0, 41.0, 45.0, 36.0, 37.0, 38.0, 37.0, 31.0, 38.0, 24.0, 28.0, 31.0, 17.0, 13.0, 14.0, 13.0, 5.0, 10.0, 10.0, 9.0, 6.0, 7.0, 3.0, 2.0, 4.0, 1.0, 1.0, 3.0, 2.0, 0.0, 0.0, 2.0, 1.0], "bins": [-8.7890625, -8.5164794921875, -8.243896484375, -7.9713134765625, -7.69873046875, -7.4261474609375, -7.153564453125, -6.8809814453125, -6.6083984375, -6.3358154296875, -6.063232421875, -5.7906494140625, -5.51806640625, -5.2454833984375, -4.972900390625, -4.7003173828125, -4.427734375, -4.1551513671875, -3.882568359375, -3.6099853515625, -3.33740234375, -3.0648193359375, -2.792236328125, -2.5196533203125, -2.2470703125, -1.9744873046875, -1.701904296875, -1.4293212890625, -1.15673828125, -0.8841552734375, -0.611572265625, -0.3389892578125, -0.06640625, 0.2061767578125, 0.478759765625, 0.7513427734375, 1.02392578125, 1.2965087890625, 1.569091796875, 1.8416748046875, 2.1142578125, 2.3868408203125, 2.659423828125, 2.9320068359375, 3.20458984375, 3.4771728515625, 3.749755859375, 4.0223388671875, 4.294921875, 4.5675048828125, 4.840087890625, 5.1126708984375, 5.38525390625, 5.6578369140625, 5.930419921875, 6.2030029296875, 6.4755859375, 6.7481689453125, 7.020751953125, 7.2933349609375, 7.56591796875, 7.8385009765625, 8.111083984375, 8.3836669921875, 8.65625]}, "gradients/decoder.transformer.h.1.crossattention.c_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 3.0, 5.0, 3.0, 5.0, 14.0, 17.0, 25.0, 47.0, 55.0, 71.0, 120.0, 174.0, 240.0, 327.0, 525.0, 750.0, 981.0, 1489.0, 2020.0, 3014.0, 4233.0, 6227.0, 8739.0, 13588.0, 20327.0, 32056.0, 52173.0, 90075.0, 192675.0, 314154.0, 122881.0, 66152.0, 39748.0, 25036.0, 16036.0, 10772.0, 7291.0, 4865.0, 3557.0, 2360.0, 1787.0, 1204.0, 810.0, 606.0, 401.0, 282.0, 208.0, 146.0, 101.0, 60.0, 41.0, 38.0, 20.0, 12.0, 6.0, 10.0, 4.0, 2.0, 3.0, 1.0, 1.0], "bins": [-2.076171875, -2.011749267578125, -1.94732666015625, -1.882904052734375, -1.8184814453125, -1.754058837890625, -1.68963623046875, -1.625213623046875, -1.560791015625, -1.496368408203125, -1.43194580078125, -1.367523193359375, -1.3031005859375, -1.238677978515625, -1.17425537109375, -1.109832763671875, -1.04541015625, -0.980987548828125, -0.91656494140625, -0.852142333984375, -0.7877197265625, -0.723297119140625, -0.65887451171875, -0.594451904296875, -0.530029296875, -0.465606689453125, -0.40118408203125, -0.336761474609375, -0.2723388671875, -0.207916259765625, -0.14349365234375, -0.079071044921875, -0.0146484375, 0.049774169921875, 0.11419677734375, 0.178619384765625, 0.2430419921875, 0.307464599609375, 0.37188720703125, 0.436309814453125, 0.500732421875, 0.565155029296875, 0.62957763671875, 0.694000244140625, 0.7584228515625, 0.822845458984375, 0.88726806640625, 0.951690673828125, 1.01611328125, 1.080535888671875, 1.14495849609375, 1.209381103515625, 1.2738037109375, 1.338226318359375, 1.40264892578125, 1.467071533203125, 1.531494140625, 1.595916748046875, 1.66033935546875, 1.724761962890625, 1.7891845703125, 1.853607177734375, 1.91802978515625, 1.982452392578125, 2.046875]}, "gradients/decoder.transformer.h.1.crossattention.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 2.0, 6.0, 6.0, 7.0, 10.0, 12.0, 17.0, 17.0, 19.0, 19.0, 17.0, 20.0, 25.0, 40.0, 22.0, 34.0, 44.0, 42.0, 39.0, 47.0, 35.0, 49.0, 1057.0, 45.0, 29.0, 40.0, 37.0, 30.0, 35.0, 34.0, 29.0, 30.0, 18.0, 19.0, 28.0, 16.0, 14.0, 8.0, 11.0, 6.0, 3.0, 5.0, 4.0, 3.0, 4.0, 1.0, 5.0], "bins": [-7.08984375, -6.90850830078125, -6.7271728515625, -6.54583740234375, -6.364501953125, -6.18316650390625, -6.0018310546875, -5.82049560546875, -5.63916015625, -5.45782470703125, -5.2764892578125, -5.09515380859375, -4.913818359375, -4.73248291015625, -4.5511474609375, -4.36981201171875, -4.1884765625, -4.00714111328125, -3.8258056640625, -3.64447021484375, -3.463134765625, -3.28179931640625, -3.1004638671875, -2.91912841796875, -2.73779296875, -2.55645751953125, -2.3751220703125, -2.19378662109375, -2.012451171875, -1.83111572265625, -1.6497802734375, -1.46844482421875, -1.287109375, -1.10577392578125, -0.9244384765625, -0.74310302734375, -0.561767578125, -0.38043212890625, -0.1990966796875, -0.01776123046875, 0.16357421875, 0.34490966796875, 0.5262451171875, 0.70758056640625, 0.888916015625, 1.07025146484375, 1.2515869140625, 1.43292236328125, 1.6142578125, 1.79559326171875, 1.9769287109375, 2.15826416015625, 2.339599609375, 2.52093505859375, 2.7022705078125, 2.88360595703125, 3.06494140625, 3.24627685546875, 3.4276123046875, 3.60894775390625, 3.790283203125, 3.97161865234375, 4.1529541015625, 4.33428955078125, 4.515625]}, "gradients/decoder.transformer.h.1.crossattention.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 6.0, 0.0, 5.0, 4.0, 8.0, 7.0, 11.0, 12.0, 22.0, 35.0, 50.0, 94.0, 131.0, 255.0, 441.0, 835.0, 1553.0, 2873.0, 5477.0, 10800.0, 21132.0, 43990.0, 94520.0, 241702.0, 1440655.0, 122565.0, 55625.0, 26664.0, 13124.0, 6908.0, 3563.0, 1866.0, 960.0, 490.0, 292.0, 172.0, 103.0, 58.0, 50.0, 26.0, 20.0, 15.0, 10.0, 6.0, 6.0, 1.0, 3.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.708984375, -2.605438232421875, -2.50189208984375, -2.398345947265625, -2.2947998046875, -2.191253662109375, -2.08770751953125, -1.984161376953125, -1.880615234375, -1.777069091796875, -1.67352294921875, -1.569976806640625, -1.4664306640625, -1.362884521484375, -1.25933837890625, -1.155792236328125, -1.05224609375, -0.948699951171875, -0.84515380859375, -0.741607666015625, -0.6380615234375, -0.534515380859375, -0.43096923828125, -0.327423095703125, -0.223876953125, -0.120330810546875, -0.01678466796875, 0.086761474609375, 0.1903076171875, 0.293853759765625, 0.39739990234375, 0.500946044921875, 0.6044921875, 0.708038330078125, 0.81158447265625, 0.915130615234375, 1.0186767578125, 1.122222900390625, 1.22576904296875, 1.329315185546875, 1.432861328125, 1.536407470703125, 1.63995361328125, 1.743499755859375, 1.8470458984375, 1.950592041015625, 2.05413818359375, 2.157684326171875, 2.26123046875, 2.364776611328125, 2.46832275390625, 2.571868896484375, 2.6754150390625, 2.778961181640625, 2.88250732421875, 2.986053466796875, 3.089599609375, 3.193145751953125, 3.29669189453125, 3.400238037109375, 3.5037841796875, 3.607330322265625, 3.71087646484375, 3.814422607421875, 3.91796875]}, "gradients/decoder.transformer.h.1.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 4.0, 0.0, 3.0, 5.0, 5.0, 8.0, 9.0, 15.0, 19.0, 36.0, 43.0, 61.0, 111.0, 162.0, 173.0, 124.0, 81.0, 38.0, 30.0, 27.0, 16.0, 7.0, 9.0, 8.0, 3.0, 2.0, 1.0, 5.0, 2.0, 3.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.002452850341796875, -0.0023684799671173096, -0.002284109592437744, -0.0021997392177581787, -0.0021153688430786133, -0.002030998468399048, -0.0019466280937194824, -0.001862257719039917, -0.0017778873443603516, -0.0016935169696807861, -0.0016091465950012207, -0.0015247762203216553, -0.0014404058456420898, -0.0013560354709625244, -0.001271665096282959, -0.0011872947216033936, -0.0011029243469238281, -0.0010185539722442627, -0.0009341835975646973, -0.0008498132228851318, -0.0007654428482055664, -0.000681072473526001, -0.0005967020988464355, -0.0005123317241668701, -0.0004279613494873047, -0.00034359097480773926, -0.00025922060012817383, -0.0001748502254486084, -9.047985076904297e-05, -6.109476089477539e-06, 7.826089859008789e-05, 0.00016263127326965332, 0.00024700164794921875, 0.0003313720226287842, 0.0004157423973083496, 0.000500112771987915, 0.0005844831466674805, 0.0006688535213470459, 0.0007532238960266113, 0.0008375942707061768, 0.0009219646453857422, 0.0010063350200653076, 0.001090705394744873, 0.0011750757694244385, 0.001259446144104004, 0.0013438165187835693, 0.0014281868934631348, 0.0015125572681427002, 0.0015969276428222656, 0.001681298017501831, 0.0017656683921813965, 0.001850038766860962, 0.0019344091415405273, 0.0020187795162200928, 0.002103149890899658, 0.0021875202655792236, 0.002271890640258789, 0.0023562610149383545, 0.00244063138961792, 0.0025250017642974854, 0.0026093721389770508, 0.002693742513656616, 0.0027781128883361816, 0.002862483263015747, 0.0029468536376953125]}, "gradients/decoder.transformer.h.1.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 0.0, 2.0, 5.0, 0.0, 3.0, 6.0, 3.0, 8.0, 9.0, 13.0, 12.0, 19.0, 28.0, 31.0, 46.0, 78.0, 121.0, 228.0, 412.0, 1120.0, 1026064.0, 18773.0, 766.0, 315.0, 191.0, 91.0, 68.0, 39.0, 32.0, 19.0, 17.0, 10.0, 7.0, 10.0, 3.0, 3.0, 5.0, 2.0, 2.0, 2.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.04351806640625, -0.04226875305175781, -0.041019439697265625, -0.03977012634277344, -0.03852081298828125, -0.03727149963378906, -0.036022186279296875, -0.03477287292480469, -0.0335235595703125, -0.03227424621582031, -0.031024932861328125, -0.029775619506835938, -0.02852630615234375, -0.027276992797851562, -0.026027679443359375, -0.024778366088867188, -0.023529052734375, -0.022279739379882812, -0.021030426025390625, -0.019781112670898438, -0.01853179931640625, -0.017282485961914062, -0.016033172607421875, -0.014783859252929688, -0.0135345458984375, -0.012285232543945312, -0.011035919189453125, -0.009786605834960938, -0.00853729248046875, -0.0072879791259765625, -0.006038665771484375, -0.0047893524169921875, -0.0035400390625, -0.0022907257080078125, -0.001041412353515625, 0.0002079010009765625, 0.00145721435546875, 0.0027065277099609375, 0.003955841064453125, 0.0052051544189453125, 0.0064544677734375, 0.0077037811279296875, 0.008953094482421875, 0.010202407836914062, 0.01145172119140625, 0.012701034545898438, 0.013950347900390625, 0.015199661254882812, 0.016448974609375, 0.017698287963867188, 0.018947601318359375, 0.020196914672851562, 0.02144622802734375, 0.022695541381835938, 0.023944854736328125, 0.025194168090820312, 0.0264434814453125, 0.027692794799804688, 0.028942108154296875, 0.030191421508789062, 0.03144073486328125, 0.03269004821777344, 0.033939361572265625, 0.03518867492675781, 0.03643798828125]}, "gradients/decoder.transformer.h.1.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 1.0, 0.0, 2.0, 2.0, 3.0, 26.0, 140.0, 406.0, 334.0, 78.0, 22.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0005947514437139034, -0.0005399063811637461, -0.0004850612604059279, -0.0004302161978557706, -0.0003753711062017828, -0.00032052601454779506, -0.00026568095199763775, -0.00021083586034364998, -0.00015599076868966222, -0.00010114568431163207, -4.6300599933601916e-05, 8.544477168470621e-06, 6.338956882245839e-05, 0.00011823466047644615, 0.00017307972302660346, 0.00022792481468059123, 0.000282769906334579, 0.00033761499798856676, 0.0003924600896425545, 0.00044730515219271183, 0.00050215027295053, 0.0005569953355006874, 0.0006118403980508447, 0.0006666855188086629, 0.0007215305813588202, 0.0007763756439089775, 0.0008312207646667957, 0.000886065827216953, 0.0009409108897671103, 0.0009957560105249286, 0.001050601014867425, 0.0011054461356252432, 0.0011602912563830614, 0.0012151363771408796, 0.001269981381483376, 0.0013248265022411942, 0.0013796716229990125, 0.0014345166273415089, 0.001489361748099327, 0.0015442068688571453, 0.0015990519896149635, 0.0016538971103727818, 0.0017087421147152781, 0.0017635872354730964, 0.0018184323562309146, 0.001873277360573411, 0.0019281224813312292, 0.0019829676020890474, 0.002037812490016222, 0.0020926576107740402, 0.0021475027315318584, 0.0022023478522896767, 0.0022571927402168512, 0.0023120378609746695, 0.0023668829817324877, 0.002421728102490306, 0.002476573223248124, 0.0025314183440059423, 0.0025862634647637606, 0.002641108352690935, 0.0026959534734487534, 0.0027507985942065716, 0.00280564371496439, 0.002860488835722208, 0.0029153339564800262]}, "gradients/decoder.transformer.h.1.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 2.0, 1.0, 0.0, 1.0, 2.0, 1.0, 4.0, 3.0, 6.0, 5.0, 5.0, 9.0, 10.0, 4.0, 13.0, 16.0, 10.0, 16.0, 16.0, 20.0, 29.0, 23.0, 26.0, 26.0, 24.0, 28.0, 35.0, 29.0, 37.0, 32.0, 35.0, 37.0, 40.0, 42.0, 40.0, 41.0, 27.0, 25.0, 31.0, 33.0, 18.0, 25.0, 14.0, 26.0, 20.0, 22.0, 19.0, 9.0, 6.0, 17.0, 9.0, 11.0, 9.0, 9.0, 8.0, 3.0, 6.0, 1.0, 1.0, 1.0, 0.0, 3.0], "bins": [-0.0007615089416503906, -0.0007392885163426399, -0.0007170680910348892, -0.0006948476657271385, -0.0006726272404193878, -0.0006504068151116371, -0.0006281863898038864, -0.0006059659644961357, -0.000583745539188385, -0.0005615251138806343, -0.0005393046885728836, -0.0005170842632651329, -0.0004948638379573822, -0.0004726434126496315, -0.0004504229873418808, -0.0004282025620341301, -0.0004059821367263794, -0.0003837617114186287, -0.000361541286110878, -0.0003393208608031273, -0.0003171004354953766, -0.0002948800101876259, -0.0002726595848798752, -0.0002504391595721245, -0.00022821873426437378, -0.00020599830895662308, -0.00018377788364887238, -0.00016155745834112167, -0.00013933703303337097, -0.00011711660772562027, -9.489618241786957e-05, -7.267575711011887e-05, -5.0455331802368164e-05, -2.8234906494617462e-05, -6.01448118686676e-06, 1.620594412088394e-05, 3.8426369428634644e-05, 6.0646794736385345e-05, 8.286722004413605e-05, 0.00010508764535188675, 0.00012730807065963745, 0.00014952849596738815, 0.00017174892127513885, 0.00019396934658288956, 0.00021618977189064026, 0.00023841019719839096, 0.00026063062250614166, 0.00028285104781389236, 0.00030507147312164307, 0.00032729189842939377, 0.00034951232373714447, 0.00037173274904489517, 0.0003939531743526459, 0.0004161735996603966, 0.0004383940249681473, 0.000460614450275898, 0.0004828348755836487, 0.0005050553008913994, 0.0005272757261991501, 0.0005494961515069008, 0.0005717165768146515, 0.0005939370021224022, 0.0006161574274301529, 0.0006383778527379036, 0.0006605982780456543]}, "gradients/decoder.transformer.h.1.attn.c_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 3.0, 3.0, 2.0, 3.0, 8.0, 8.0, 2.0, 2.0, 11.0, 11.0, 13.0, 11.0, 12.0, 17.0, 16.0, 24.0, 38.0, 34.0, 32.0, 33.0, 48.0, 35.0, 57.0, 42.0, 44.0, 41.0, 45.0, 36.0, 37.0, 38.0, 37.0, 31.0, 38.0, 24.0, 28.0, 31.0, 17.0, 13.0, 14.0, 13.0, 5.0, 10.0, 10.0, 9.0, 6.0, 7.0, 3.0, 2.0, 4.0, 1.0, 1.0, 3.0, 2.0, 0.0, 0.0, 2.0, 1.0], "bins": [-8.7890625, -8.5164794921875, -8.243896484375, -7.9713134765625, -7.69873046875, -7.4261474609375, -7.153564453125, -6.8809814453125, -6.6083984375, -6.3358154296875, -6.063232421875, -5.7906494140625, -5.51806640625, -5.2454833984375, -4.972900390625, -4.7003173828125, -4.427734375, -4.1551513671875, -3.882568359375, -3.6099853515625, -3.33740234375, -3.0648193359375, -2.792236328125, -2.5196533203125, -2.2470703125, -1.9744873046875, -1.701904296875, -1.4293212890625, -1.15673828125, -0.8841552734375, -0.611572265625, -0.3389892578125, -0.06640625, 0.2061767578125, 0.478759765625, 0.7513427734375, 1.02392578125, 1.2965087890625, 1.569091796875, 1.8416748046875, 2.1142578125, 2.3868408203125, 2.659423828125, 2.9320068359375, 3.20458984375, 3.4771728515625, 3.749755859375, 4.0223388671875, 4.294921875, 4.5675048828125, 4.840087890625, 5.1126708984375, 5.38525390625, 5.6578369140625, 5.930419921875, 6.2030029296875, 6.4755859375, 6.7481689453125, 7.020751953125, 7.2933349609375, 7.56591796875, 7.8385009765625, 8.111083984375, 8.3836669921875, 8.65625]}, "gradients/decoder.transformer.h.1.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 1.0, 0.0, 1.0, 4.0, 3.0, 2.0, 4.0, 3.0, 8.0, 9.0, 13.0, 14.0, 14.0, 22.0, 37.0, 35.0, 61.0, 83.0, 106.0, 161.0, 240.0, 351.0, 523.0, 939.0, 1654.0, 3657.0, 8639.0, 27392.0, 137541.0, 695045.0, 129227.0, 26572.0, 8483.0, 3536.0, 1599.0, 881.0, 565.0, 330.0, 224.0, 162.0, 116.0, 77.0, 57.0, 43.0, 27.0, 28.0, 19.0, 15.0, 8.0, 10.0, 8.0, 3.0, 6.0, 3.0, 2.0, 4.0, 3.0, 1.0, 0.0, 0.0, 2.0], "bins": [-18.296875, -17.733642578125, -17.17041015625, -16.607177734375, -16.0439453125, -15.480712890625, -14.91748046875, -14.354248046875, -13.791015625, -13.227783203125, -12.66455078125, -12.101318359375, -11.5380859375, -10.974853515625, -10.41162109375, -9.848388671875, -9.28515625, -8.721923828125, -8.15869140625, -7.595458984375, -7.0322265625, -6.468994140625, -5.90576171875, -5.342529296875, -4.779296875, -4.216064453125, -3.65283203125, -3.089599609375, -2.5263671875, -1.963134765625, -1.39990234375, -0.836669921875, -0.2734375, 0.289794921875, 0.85302734375, 1.416259765625, 1.9794921875, 2.542724609375, 3.10595703125, 3.669189453125, 4.232421875, 4.795654296875, 5.35888671875, 5.922119140625, 6.4853515625, 7.048583984375, 7.61181640625, 8.175048828125, 8.73828125, 9.301513671875, 9.86474609375, 10.427978515625, 10.9912109375, 11.554443359375, 12.11767578125, 12.680908203125, 13.244140625, 13.807373046875, 14.37060546875, 14.933837890625, 15.4970703125, 16.060302734375, 16.62353515625, 17.186767578125, 17.75]}, "gradients/decoder.transformer.h.1.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 2.0, 5.0, 5.0, 12.0, 8.0, 4.0, 11.0, 8.0, 16.0, 31.0, 28.0, 26.0, 45.0, 51.0, 45.0, 57.0, 69.0, 161.0, 1980.0, 101.0, 58.0, 60.0, 63.0, 35.0, 40.0, 29.0, 24.0, 11.0, 16.0, 11.0, 14.0, 5.0, 8.0, 3.0, 2.0, 4.0, 2.0, 3.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-32.28125, -31.34326171875, -30.4052734375, -29.46728515625, -28.529296875, -27.59130859375, -26.6533203125, -25.71533203125, -24.77734375, -23.83935546875, -22.9013671875, -21.96337890625, -21.025390625, -20.08740234375, -19.1494140625, -18.21142578125, -17.2734375, -16.33544921875, -15.3974609375, -14.45947265625, -13.521484375, -12.58349609375, -11.6455078125, -10.70751953125, -9.76953125, -8.83154296875, -7.8935546875, -6.95556640625, -6.017578125, -5.07958984375, -4.1416015625, -3.20361328125, -2.265625, -1.32763671875, -0.3896484375, 0.54833984375, 1.486328125, 2.42431640625, 3.3623046875, 4.30029296875, 5.23828125, 6.17626953125, 7.1142578125, 8.05224609375, 8.990234375, 9.92822265625, 10.8662109375, 11.80419921875, 12.7421875, 13.68017578125, 14.6181640625, 15.55615234375, 16.494140625, 17.43212890625, 18.3701171875, 19.30810546875, 20.24609375, 21.18408203125, 22.1220703125, 23.06005859375, 23.998046875, 24.93603515625, 25.8740234375, 26.81201171875, 27.75]}, "gradients/decoder.transformer.h.1.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 2.0, 0.0, 1.0, 5.0, 2.0, 4.0, 5.0, 4.0, 11.0, 10.0, 7.0, 16.0, 22.0, 24.0, 47.0, 67.0, 119.0, 187.0, 376.0, 954.0, 45461.0, 3095662.0, 1564.0, 515.0, 244.0, 130.0, 70.0, 53.0, 35.0, 23.0, 26.0, 18.0, 12.0, 10.0, 7.0, 4.0, 6.0, 6.0, 2.0, 1.0, 3.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-87.625, -84.7197265625, -81.814453125, -78.9091796875, -76.00390625, -73.0986328125, -70.193359375, -67.2880859375, -64.3828125, -61.4775390625, -58.572265625, -55.6669921875, -52.76171875, -49.8564453125, -46.951171875, -44.0458984375, -41.140625, -38.2353515625, -35.330078125, -32.4248046875, -29.51953125, -26.6142578125, -23.708984375, -20.8037109375, -17.8984375, -14.9931640625, -12.087890625, -9.1826171875, -6.27734375, -3.3720703125, -0.466796875, 2.4384765625, 5.34375, 8.2490234375, 11.154296875, 14.0595703125, 16.96484375, 19.8701171875, 22.775390625, 25.6806640625, 28.5859375, 31.4912109375, 34.396484375, 37.3017578125, 40.20703125, 43.1123046875, 46.017578125, 48.9228515625, 51.828125, 54.7333984375, 57.638671875, 60.5439453125, 63.44921875, 66.3544921875, 69.259765625, 72.1650390625, 75.0703125, 77.9755859375, 80.880859375, 83.7861328125, 86.69140625, 89.5966796875, 92.501953125, 95.4072265625, 98.3125]}, "gradients/decoder.transformer.h.1.ln_1.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 3.0, 6.0, 79.0, 815.0, 107.0, 8.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-38.51272201538086, -33.41378402709961, -28.314842224121094, -23.215904235839844, -18.11696434020996, -13.018024444580078, -7.919086456298828, -2.8201446533203125, 2.2787933349609375, 7.377732753753662, 12.476672172546387, 17.575611114501953, 22.674551010131836, 27.77349090576172, 32.87242889404297, 37.971370697021484, 43.070308685302734, 48.169246673583984, 53.2681884765625, 58.36712646484375, 63.466064453125, 68.56500244140625, 73.6639404296875, 78.76288604736328, 83.86182403564453, 88.96076202392578, 94.05970001220703, 99.15864562988281, 104.25758361816406, 109.35652160644531, 114.45545959472656, 119.55439758300781, 124.65333557128906, 129.7522735595703, 134.85121154785156, 139.9501495361328, 145.04908752441406, 150.14804077148438, 155.24697875976562, 160.34591674804688, 165.44485473632812, 170.54379272460938, 175.64273071289062, 180.74166870117188, 185.84060668945312, 190.93954467773438, 196.03848266601562, 201.13743591308594, 206.23635864257812, 211.33529663085938, 216.43423461914062, 221.53317260742188, 226.63211059570312, 231.73104858398438, 236.82998657226562, 241.92893981933594, 247.0278778076172, 252.12681579589844, 257.22576904296875, 262.32470703125, 267.42364501953125, 272.5225830078125, 277.62152099609375, 282.720458984375, 287.81939697265625]}, "gradients/decoder.transformer.h.1.ln_1.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 2.0, 1.0, 3.0, 4.0, 7.0, 5.0, 5.0, 7.0, 9.0, 10.0, 12.0, 20.0, 14.0, 19.0, 26.0, 34.0, 37.0, 37.0, 41.0, 46.0, 56.0, 54.0, 35.0, 55.0, 46.0, 49.0, 40.0, 47.0, 45.0, 36.0, 25.0, 26.0, 26.0, 21.0, 20.0, 21.0, 19.0, 12.0, 17.0, 6.0, 3.0, 3.0, 2.0, 3.0, 5.0, 2.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-57.683353424072266, -55.71160125732422, -53.73984909057617, -51.768096923828125, -49.79634475708008, -47.82459259033203, -45.85283660888672, -43.88108825683594, -41.909332275390625, -39.93758010864258, -37.96582794189453, -35.994075775146484, -34.02232360839844, -32.05057144165039, -30.07881736755371, -28.107065200805664, -26.13531494140625, -24.163562774658203, -22.191810607910156, -20.22005844116211, -18.248306274414062, -16.276554107666016, -14.304800033569336, -12.333047866821289, -10.361295700073242, -8.389543533325195, -6.41779088973999, -4.446038246154785, -2.4742860794067383, -0.5025339126586914, 1.4692192077636719, 3.4409713745117188, 5.412727355957031, 7.384479522705078, 9.356231689453125, 11.327984809875488, 13.299736976623535, 15.271489143371582, 17.243242263793945, 19.214994430541992, 21.18674659729004, 23.158498764038086, 25.130250930786133, 27.102005004882812, 29.07375717163086, 31.045509338378906, 33.01726150512695, 34.989013671875, 36.96076583862305, 38.932518005371094, 40.90427017211914, 42.87602233886719, 44.847774505615234, 46.81952667236328, 48.791282653808594, 50.763031005859375, 52.73478698730469, 54.706539154052734, 56.67829132080078, 58.65004348754883, 60.621795654296875, 62.59354782104492, 64.56529998779297, 66.53705596923828, 68.50880432128906]}, "gradients/decoder.transformer.h.0.mlp.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 2.0, 8.0, 7.0, 3.0, 8.0, 2.0, 4.0, 13.0, 12.0, 10.0, 17.0, 11.0, 12.0, 22.0, 28.0, 27.0, 34.0, 25.0, 35.0, 42.0, 42.0, 36.0, 42.0, 53.0, 47.0, 49.0, 45.0, 31.0, 40.0, 42.0, 27.0, 32.0, 31.0, 26.0, 16.0, 20.0, 20.0, 14.0, 6.0, 16.0, 8.0, 8.0, 6.0, 7.0, 6.0, 6.0, 5.0, 3.0, 1.0, 3.0, 0.0, 2.0, 2.0, 0.0, 0.0, 2.0], "bins": [-10.34375, -10.0228271484375, -9.701904296875, -9.3809814453125, -9.06005859375, -8.7391357421875, -8.418212890625, -8.0972900390625, -7.7763671875, -7.4554443359375, -7.134521484375, -6.8135986328125, -6.49267578125, -6.1717529296875, -5.850830078125, -5.5299072265625, -5.208984375, -4.8880615234375, -4.567138671875, -4.2462158203125, -3.92529296875, -3.6043701171875, -3.283447265625, -2.9625244140625, -2.6416015625, -2.3206787109375, -1.999755859375, -1.6788330078125, -1.35791015625, -1.0369873046875, -0.716064453125, -0.3951416015625, -0.07421875, 0.2467041015625, 0.567626953125, 0.8885498046875, 1.20947265625, 1.5303955078125, 1.851318359375, 2.1722412109375, 2.4931640625, 2.8140869140625, 3.135009765625, 3.4559326171875, 3.77685546875, 4.0977783203125, 4.418701171875, 4.7396240234375, 5.060546875, 5.3814697265625, 5.702392578125, 6.0233154296875, 6.34423828125, 6.6651611328125, 6.986083984375, 7.3070068359375, 7.6279296875, 7.9488525390625, 8.269775390625, 8.5906982421875, 8.91162109375, 9.2325439453125, 9.553466796875, 9.8743896484375, 10.1953125]}, "gradients/decoder.transformer.h.0.mlp.c_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 1.0, 3.0, 3.0, 2.0, 8.0, 3.0, 6.0, 10.0, 8.0, 12.0, 17.0, 16.0, 29.0, 26.0, 52.0, 42.0, 62.0, 76.0, 75.0, 89.0, 117.0, 168.0, 241.0, 428.0, 634.0, 1268.0, 4259.0, 776877.0, 3397063.0, 8525.0, 1761.0, 776.0, 434.0, 309.0, 197.0, 141.0, 100.0, 94.0, 75.0, 39.0, 42.0, 42.0, 31.0, 20.0, 26.0, 18.0, 18.0, 12.0, 10.0, 8.0, 9.0, 5.0, 4.0, 2.0, 3.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-73.0, -70.6572265625, -68.314453125, -65.9716796875, -63.62890625, -61.2861328125, -58.943359375, -56.6005859375, -54.2578125, -51.9150390625, -49.572265625, -47.2294921875, -44.88671875, -42.5439453125, -40.201171875, -37.8583984375, -35.515625, -33.1728515625, -30.830078125, -28.4873046875, -26.14453125, -23.8017578125, -21.458984375, -19.1162109375, -16.7734375, -14.4306640625, -12.087890625, -9.7451171875, -7.40234375, -5.0595703125, -2.716796875, -0.3740234375, 1.96875, 4.3115234375, 6.654296875, 8.9970703125, 11.33984375, 13.6826171875, 16.025390625, 18.3681640625, 20.7109375, 23.0537109375, 25.396484375, 27.7392578125, 30.08203125, 32.4248046875, 34.767578125, 37.1103515625, 39.453125, 41.7958984375, 44.138671875, 46.4814453125, 48.82421875, 51.1669921875, 53.509765625, 55.8525390625, 58.1953125, 60.5380859375, 62.880859375, 65.2236328125, 67.56640625, 69.9091796875, 72.251953125, 74.5947265625, 76.9375]}, "gradients/decoder.transformer.h.0.mlp.c_fc.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 3.0, 1.0, 4.0, 5.0, 6.0, 31.0, 68.0, 232.0, 953.0, 1853.0, 711.0, 130.0, 56.0, 16.0, 5.0, 4.0, 3.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-33.5, -31.861328125, -30.22265625, -28.583984375, -26.9453125, -25.306640625, -23.66796875, -22.029296875, -20.390625, -18.751953125, -17.11328125, -15.474609375, -13.8359375, -12.197265625, -10.55859375, -8.919921875, -7.28125, -5.642578125, -4.00390625, -2.365234375, -0.7265625, 0.912109375, 2.55078125, 4.189453125, 5.828125, 7.466796875, 9.10546875, 10.744140625, 12.3828125, 14.021484375, 15.66015625, 17.298828125, 18.9375, 20.576171875, 22.21484375, 23.853515625, 25.4921875, 27.130859375, 28.76953125, 30.408203125, 32.046875, 33.685546875, 35.32421875, 36.962890625, 38.6015625, 40.240234375, 41.87890625, 43.517578125, 45.15625, 46.794921875, 48.43359375, 50.072265625, 51.7109375, 53.349609375, 54.98828125, 56.626953125, 58.265625, 59.904296875, 61.54296875, 63.181640625, 64.8203125, 66.458984375, 68.09765625, 69.736328125, 71.375]}, "gradients/decoder.transformer.h.0.mlp.c_fc.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 3.0, 2.0, 1.0, 7.0, 6.0, 12.0, 23.0, 48.0, 103.0, 220.0, 703.0, 4801.0, 4095550.0, 90486.0, 1611.0, 394.0, 158.0, 72.0, 43.0, 27.0, 7.0, 5.0, 2.0, 1.0, 2.0, 4.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-79.75, -76.9775390625, -74.205078125, -71.4326171875, -68.66015625, -65.8876953125, -63.115234375, -60.3427734375, -57.5703125, -54.7978515625, -52.025390625, -49.2529296875, -46.48046875, -43.7080078125, -40.935546875, -38.1630859375, -35.390625, -32.6181640625, -29.845703125, -27.0732421875, -24.30078125, -21.5283203125, -18.755859375, -15.9833984375, -13.2109375, -10.4384765625, -7.666015625, -4.8935546875, -2.12109375, 0.6513671875, 3.423828125, 6.1962890625, 8.96875, 11.7412109375, 14.513671875, 17.2861328125, 20.05859375, 22.8310546875, 25.603515625, 28.3759765625, 31.1484375, 33.9208984375, 36.693359375, 39.4658203125, 42.23828125, 45.0107421875, 47.783203125, 50.5556640625, 53.328125, 56.1005859375, 58.873046875, 61.6455078125, 64.41796875, 67.1904296875, 69.962890625, 72.7353515625, 75.5078125, 78.2802734375, 81.052734375, 83.8251953125, 86.59765625, 89.3701171875, 92.142578125, 94.9150390625, 97.6875]}, "gradients/decoder.transformer.h.0.ln_2.weight": {"_type": "histogram", "values": [3.0, 2.0, 6.0, 7.0, 39.0, 110.0, 322.0, 377.0, 117.0, 25.0, 10.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-79.99321746826172, -68.5245132446289, -57.05580520629883, -45.58709716796875, -34.11839294433594, -22.649688720703125, -11.180976867675781, 0.28772735595703125, 11.756431579589844, 23.22513771057129, 34.693843841552734, 46.16255187988281, 57.631256103515625, 69.09996032714844, 80.56867218017578, 92.0373764038086, 103.5060806274414, 114.97478485107422, 126.44349670410156, 137.91220092773438, 149.3809051513672, 160.849609375, 172.31832885742188, 183.78701782226562, 195.2557373046875, 206.7244415283203, 218.19314575195312, 229.661865234375, 241.13055419921875, 252.59927368164062, 264.0679931640625, 275.53668212890625, 287.00537109375, 298.4740905761719, 309.9427795410156, 321.4114990234375, 332.88018798828125, 344.3489074707031, 355.817626953125, 367.28631591796875, 378.7550048828125, 390.2237243652344, 401.6924133300781, 413.1611328125, 424.62982177734375, 436.0985412597656, 447.5672607421875, 459.03594970703125, 470.5046691894531, 481.973388671875, 493.44207763671875, 504.9107971191406, 516.3795166015625, 527.8482055664062, 539.31689453125, 550.78564453125, 562.2543334960938, 573.7230224609375, 585.1917724609375, 596.6604614257812, 608.129150390625, 619.5978393554688, 631.0665893554688, 642.5352783203125, 654.0039672851562]}, "gradients/decoder.transformer.h.0.ln_2.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 0.0, 3.0, 9.0, 9.0, 10.0, 10.0, 12.0, 14.0, 11.0, 17.0, 29.0, 24.0, 28.0, 35.0, 36.0, 32.0, 40.0, 46.0, 49.0, 46.0, 44.0, 39.0, 43.0, 46.0, 52.0, 57.0, 44.0, 32.0, 30.0, 26.0, 26.0, 16.0, 18.0, 12.0, 13.0, 6.0, 8.0, 8.0, 11.0, 5.0, 4.0, 3.0, 3.0, 1.0, 1.0, 2.0, 1.0, 3.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-70.95883178710938, -68.74300384521484, -66.52718353271484, -64.31135559082031, -62.09553146362305, -59.87970733642578, -57.66387939453125, -55.448055267333984, -53.23223114013672, -51.01640701293945, -48.80058288574219, -46.584754943847656, -44.36893081665039, -42.153106689453125, -39.937278747558594, -37.72145462036133, -35.50563049316406, -33.2898063659668, -31.0739803314209, -28.858154296875, -26.642330169677734, -24.42650604248047, -22.21068000793457, -19.994853973388672, -17.779029846191406, -15.563204765319824, -13.347379684448242, -11.13155460357666, -8.915729522705078, -6.699904441833496, -4.484079360961914, -2.268254280090332, -0.05242919921875, 2.163395881652832, 4.379220962524414, 6.595046043395996, 8.810871124267578, 11.02669620513916, 13.242521286010742, 15.458346366882324, 17.674171447753906, 19.889995574951172, 22.10582160949707, 24.32164764404297, 26.537471771240234, 28.7532958984375, 30.9691219329834, 33.1849479675293, 35.40077209472656, 37.61659622192383, 39.832420349121094, 42.048248291015625, 44.26407241821289, 46.479896545410156, 48.69572448730469, 50.91154861450195, 53.12737274169922, 55.343196868896484, 57.55902099609375, 59.77484893798828, 61.99067306518555, 64.20649719238281, 66.42232513427734, 68.63814544677734, 70.85397338867188]}, "gradients/decoder.transformer.h.0.crossattention.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 3.0, 2.0, 1.0, 6.0, 12.0, 7.0, 15.0, 12.0, 12.0, 18.0, 17.0, 28.0, 27.0, 26.0, 32.0, 38.0, 42.0, 37.0, 38.0, 41.0, 48.0, 45.0, 46.0, 44.0, 43.0, 49.0, 53.0, 38.0, 45.0, 25.0, 23.0, 30.0, 22.0, 15.0, 13.0, 9.0, 12.0, 5.0, 6.0, 8.0, 9.0, 3.0, 4.0, 0.0, 2.0, 2.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-90.5625, -87.5234375, -84.484375, -81.4453125, -78.40625, -75.3671875, -72.328125, -69.2890625, -66.25, -63.2109375, -60.171875, -57.1328125, -54.09375, -51.0546875, -48.015625, -44.9765625, -41.9375, -38.8984375, -35.859375, -32.8203125, -29.78125, -26.7421875, -23.703125, -20.6640625, -17.625, -14.5859375, -11.546875, -8.5078125, -5.46875, -2.4296875, 0.609375, 3.6484375, 6.6875, 9.7265625, 12.765625, 15.8046875, 18.84375, 21.8828125, 24.921875, 27.9609375, 31.0, 34.0390625, 37.078125, 40.1171875, 43.15625, 46.1953125, 49.234375, 52.2734375, 55.3125, 58.3515625, 61.390625, 64.4296875, 67.46875, 70.5078125, 73.546875, 76.5859375, 79.625, 82.6640625, 85.703125, 88.7421875, 91.78125, 94.8203125, 97.859375, 100.8984375, 103.9375]}, "gradients/decoder.transformer.h.0.crossattention.c_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 3.0, 1.0, 5.0, 4.0, 6.0, 5.0, 6.0, 16.0, 30.0, 36.0, 56.0, 91.0, 119.0, 169.0, 284.0, 426.0, 622.0, 972.0, 1591.0, 2381.0, 3692.0, 6063.0, 9886.0, 16349.0, 27946.0, 50019.0, 94867.0, 205586.0, 336765.0, 133163.0, 67148.0, 36428.0, 20844.0, 12421.0, 7593.0, 4726.0, 2912.0, 1794.0, 1203.0, 800.0, 509.0, 344.0, 231.0, 143.0, 108.0, 69.0, 42.0, 29.0, 19.0, 17.0, 13.0, 7.0, 7.0, 4.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0], "bins": [-24.640625, -23.876220703125, -23.11181640625, -22.347412109375, -21.5830078125, -20.818603515625, -20.05419921875, -19.289794921875, -18.525390625, -17.760986328125, -16.99658203125, -16.232177734375, -15.4677734375, -14.703369140625, -13.93896484375, -13.174560546875, -12.41015625, -11.645751953125, -10.88134765625, -10.116943359375, -9.3525390625, -8.588134765625, -7.82373046875, -7.059326171875, -6.294921875, -5.530517578125, -4.76611328125, -4.001708984375, -3.2373046875, -2.472900390625, -1.70849609375, -0.944091796875, -0.1796875, 0.584716796875, 1.34912109375, 2.113525390625, 2.8779296875, 3.642333984375, 4.40673828125, 5.171142578125, 5.935546875, 6.699951171875, 7.46435546875, 8.228759765625, 8.9931640625, 9.757568359375, 10.52197265625, 11.286376953125, 12.05078125, 12.815185546875, 13.57958984375, 14.343994140625, 15.1083984375, 15.872802734375, 16.63720703125, 17.401611328125, 18.166015625, 18.930419921875, 19.69482421875, 20.459228515625, 21.2236328125, 21.988037109375, 22.75244140625, 23.516845703125, 24.28125]}, "gradients/decoder.transformer.h.0.crossattention.c_attn.bias": {"_type": "histogram", "values": [2.0, 1.0, 2.0, 2.0, 6.0, 4.0, 3.0, 4.0, 8.0, 6.0, 10.0, 7.0, 9.0, 19.0, 10.0, 19.0, 30.0, 15.0, 19.0, 22.0, 22.0, 29.0, 29.0, 34.0, 34.0, 36.0, 30.0, 31.0, 33.0, 1051.0, 39.0, 29.0, 38.0, 34.0, 37.0, 40.0, 31.0, 34.0, 33.0, 23.0, 34.0, 22.0, 15.0, 21.0, 16.0, 10.0, 7.0, 9.0, 6.0, 8.0, 9.0, 4.0, 2.0, 3.0, 4.0, 6.0, 3.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-45.25, -43.716796875, -42.18359375, -40.650390625, -39.1171875, -37.583984375, -36.05078125, -34.517578125, -32.984375, -31.451171875, -29.91796875, -28.384765625, -26.8515625, -25.318359375, -23.78515625, -22.251953125, -20.71875, -19.185546875, -17.65234375, -16.119140625, -14.5859375, -13.052734375, -11.51953125, -9.986328125, -8.453125, -6.919921875, -5.38671875, -3.853515625, -2.3203125, -0.787109375, 0.74609375, 2.279296875, 3.8125, 5.345703125, 6.87890625, 8.412109375, 9.9453125, 11.478515625, 13.01171875, 14.544921875, 16.078125, 17.611328125, 19.14453125, 20.677734375, 22.2109375, 23.744140625, 25.27734375, 26.810546875, 28.34375, 29.876953125, 31.41015625, 32.943359375, 34.4765625, 36.009765625, 37.54296875, 39.076171875, 40.609375, 42.142578125, 43.67578125, 45.208984375, 46.7421875, 48.275390625, 49.80859375, 51.341796875, 52.875]}, "gradients/decoder.transformer.h.0.crossattention.c_attn.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 3.0, 1.0, 6.0, 4.0, 3.0, 3.0, 8.0, 12.0, 10.0, 17.0, 18.0, 40.0, 53.0, 66.0, 146.0, 222.0, 364.0, 529.0, 1004.0, 1575.0, 2660.0, 4393.0, 7623.0, 13362.0, 24134.0, 44868.0, 86549.0, 186122.0, 1421294.0, 145610.0, 71409.0, 37204.0, 20311.0, 11420.0, 6494.0, 3826.0, 2248.0, 1351.0, 840.0, 483.0, 295.0, 207.0, 113.0, 64.0, 64.0, 32.0, 25.0, 18.0, 9.0, 9.0, 4.0, 4.0, 6.0, 6.0, 2.0, 2.0, 1.0, 2.0], "bins": [-28.734375, -27.89794921875, -27.0615234375, -26.22509765625, -25.388671875, -24.55224609375, -23.7158203125, -22.87939453125, -22.04296875, -21.20654296875, -20.3701171875, -19.53369140625, -18.697265625, -17.86083984375, -17.0244140625, -16.18798828125, -15.3515625, -14.51513671875, -13.6787109375, -12.84228515625, -12.005859375, -11.16943359375, -10.3330078125, -9.49658203125, -8.66015625, -7.82373046875, -6.9873046875, -6.15087890625, -5.314453125, -4.47802734375, -3.6416015625, -2.80517578125, -1.96875, -1.13232421875, -0.2958984375, 0.54052734375, 1.376953125, 2.21337890625, 3.0498046875, 3.88623046875, 4.72265625, 5.55908203125, 6.3955078125, 7.23193359375, 8.068359375, 8.90478515625, 9.7412109375, 10.57763671875, 11.4140625, 12.25048828125, 13.0869140625, 13.92333984375, 14.759765625, 15.59619140625, 16.4326171875, 17.26904296875, 18.10546875, 18.94189453125, 19.7783203125, 20.61474609375, 21.451171875, 22.28759765625, 23.1240234375, 23.96044921875, 24.796875]}, "gradients/decoder.transformer.h.0.crossattention.q_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 3.0, 1.0, 0.0, 2.0, 4.0, 5.0, 3.0, 7.0, 11.0, 10.0, 20.0, 15.0, 10.0, 13.0, 26.0, 43.0, 53.0, 61.0, 84.0, 95.0, 94.0, 110.0, 80.0, 53.0, 37.0, 32.0, 35.0, 27.0, 15.0, 19.0, 11.0, 11.0, 4.0, 8.0, 4.0, 4.0, 2.0, 1.0, 0.0, 0.0, 2.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0196533203125, -0.01908278465270996, -0.018512248992919922, -0.017941713333129883, -0.017371177673339844, -0.016800642013549805, -0.016230106353759766, -0.015659570693969727, -0.015089035034179688, -0.014518499374389648, -0.01394796371459961, -0.01337742805480957, -0.012806892395019531, -0.012236356735229492, -0.011665821075439453, -0.011095285415649414, -0.010524749755859375, -0.009954214096069336, -0.009383678436279297, -0.008813142776489258, -0.008242607116699219, -0.00767207145690918, -0.007101535797119141, -0.0065310001373291016, -0.0059604644775390625, -0.0053899288177490234, -0.004819393157958984, -0.004248857498168945, -0.0036783218383789062, -0.003107786178588867, -0.002537250518798828, -0.001966714859008789, -0.00139617919921875, -0.0008256435394287109, -0.0002551078796386719, 0.0003154277801513672, 0.0008859634399414062, 0.0014564990997314453, 0.0020270347595214844, 0.0025975704193115234, 0.0031681060791015625, 0.0037386417388916016, 0.004309177398681641, 0.00487971305847168, 0.005450248718261719, 0.006020784378051758, 0.006591320037841797, 0.007161855697631836, 0.007732391357421875, 0.008302927017211914, 0.008873462677001953, 0.009443998336791992, 0.010014533996582031, 0.01058506965637207, 0.01115560531616211, 0.011726140975952148, 0.012296676635742188, 0.012867212295532227, 0.013437747955322266, 0.014008283615112305, 0.014578819274902344, 0.015149354934692383, 0.015719890594482422, 0.01629042625427246, 0.0168609619140625]}, "gradients/decoder.transformer.h.0.crossattention.q_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 3.0, 3.0, 2.0, 4.0, 4.0, 6.0, 8.0, 8.0, 14.0, 13.0, 23.0, 30.0, 53.0, 70.0, 87.0, 136.0, 241.0, 452.0, 1132.0, 6794.0, 295753.0, 729150.0, 11816.0, 1487.0, 508.0, 262.0, 156.0, 107.0, 61.0, 45.0, 45.0, 21.0, 19.0, 12.0, 9.0, 9.0, 9.0, 2.0, 6.0, 2.0, 1.0, 3.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.28759765625, -0.2791633605957031, -0.27072906494140625, -0.2622947692871094, -0.2538604736328125, -0.24542617797851562, -0.23699188232421875, -0.22855758666992188, -0.220123291015625, -0.21168899536132812, -0.20325469970703125, -0.19482040405273438, -0.1863861083984375, -0.17795181274414062, -0.16951751708984375, -0.16108322143554688, -0.15264892578125, -0.14421463012695312, -0.13578033447265625, -0.12734603881835938, -0.1189117431640625, -0.11047744750976562, -0.10204315185546875, -0.09360885620117188, -0.085174560546875, -0.07674026489257812, -0.06830596923828125, -0.059871673583984375, -0.0514373779296875, -0.043003082275390625, -0.03456878662109375, -0.026134490966796875, -0.0177001953125, -0.009265899658203125, -0.00083160400390625, 0.007602691650390625, 0.0160369873046875, 0.024471282958984375, 0.03290557861328125, 0.041339874267578125, 0.049774169921875, 0.058208465576171875, 0.06664276123046875, 0.07507705688476562, 0.0835113525390625, 0.09194564819335938, 0.10037994384765625, 0.10881423950195312, 0.11724853515625, 0.12568283081054688, 0.13411712646484375, 0.14255142211914062, 0.1509857177734375, 0.15942001342773438, 0.16785430908203125, 0.17628860473632812, 0.184722900390625, 0.19315719604492188, 0.20159149169921875, 0.21002578735351562, 0.2184600830078125, 0.22689437866210938, 0.23532867431640625, 0.24376296997070312, 0.252197265625]}, "gradients/decoder.transformer.h.0.ln_cross_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 3.0, 4.0, 10.0, 16.0, 149.0, 429.0, 315.0, 68.0, 16.0, 5.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.07982425391674042, -0.0781906321644783, -0.07655701786279678, -0.07492339611053467, -0.07328978180885315, -0.07165616005659103, -0.07002254575490952, -0.0683889240026474, -0.06675530970096588, -0.06512168794870377, -0.06348807364702225, -0.06185445562005043, -0.06022083759307861, -0.058587219566106796, -0.05695360153913498, -0.05531998351216316, -0.053686365485191345, -0.05205274745821953, -0.05041912943124771, -0.048785511404275894, -0.04715189337730408, -0.04551827535033226, -0.04388465732336044, -0.042251039296388626, -0.04061741754412651, -0.038983799517154694, -0.03735018149018288, -0.03571656346321106, -0.03408294543623924, -0.032449327409267426, -0.03081570938229561, -0.02918209135532379, -0.027548471465706825, -0.025914853438735008, -0.02428123541176319, -0.022647617384791374, -0.021013999357819557, -0.01938037946820259, -0.017746761441230774, -0.016113143414258957, -0.014479526318609715, -0.012845908291637897, -0.01121229026466608, -0.009578671306371689, -0.007945053279399872, -0.006311435252428055, -0.004677817225456238, -0.0030441991984844208, -0.0014105811715126038, 0.00022303697187453508, 0.001856655115261674, 0.0034902733750641346, 0.005123891402035952, 0.006757509894669056, 0.008391127921640873, 0.01002474594861269, 0.011658363975584507, 0.013291982002556324, 0.014925600029528141, 0.016559218987822533, 0.01819283701479435, 0.019826455041766167, 0.021460073068737984, 0.0230936910957098, 0.024727309122681618]}, "gradients/decoder.transformer.h.0.ln_cross_attn.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 3.0, 3.0, 1.0, 9.0, 7.0, 6.0, 18.0, 15.0, 19.0, 28.0, 22.0, 27.0, 36.0, 37.0, 39.0, 49.0, 65.0, 43.0, 44.0, 59.0, 68.0, 58.0, 47.0, 46.0, 34.0, 43.0, 43.0, 24.0, 22.0, 20.0, 14.0, 15.0, 18.0, 10.0, 4.0, 4.0, 3.0, 5.0, 4.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.008314907550811768, -0.008013147860765457, -0.007711388170719147, -0.007409628480672836, -0.007107868790626526, -0.0068061091005802155, -0.006504349410533905, -0.006202589720487595, -0.005900830030441284, -0.005599070340394974, -0.005297310650348663, -0.004995550960302353, -0.0046937912702560425, -0.004392031580209732, -0.004090271890163422, -0.003788512200117111, -0.0034867525100708008, -0.0031849928200244904, -0.00288323312997818, -0.0025814734399318695, -0.002279713749885559, -0.0019779540598392487, -0.0016761943697929382, -0.0013744346797466278, -0.0010726749897003174, -0.000770915299654007, -0.00046915560960769653, -0.0001673959195613861, 0.00013436377048492432, 0.00043612346053123474, 0.0007378831505775452, 0.0010396428406238556, 0.001341402530670166, 0.0016431622207164764, 0.0019449219107627869, 0.0022466816008090973, 0.0025484412908554077, 0.002850200980901718, 0.0031519606709480286, 0.003453720360994339, 0.0037554800510406494, 0.00405723974108696, 0.00435899943113327, 0.004660759121179581, 0.004962518811225891, 0.0052642785012722015, 0.005566038191318512, 0.005867797881364822, 0.006169557571411133, 0.006471317261457443, 0.006773076951503754, 0.007074836641550064, 0.0073765963315963745, 0.007678356021642685, 0.007980115711688995, 0.008281875401735306, 0.008583635091781616, 0.008885394781827927, 0.009187154471874237, 0.009488914161920547, 0.009790673851966858, 0.010092433542013168, 0.010394193232059479, 0.01069595292210579, 0.0109977126121521]}, "gradients/decoder.transformer.h.0.attn.c_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 3.0, 2.0, 1.0, 6.0, 12.0, 7.0, 16.0, 11.0, 12.0, 18.0, 20.0, 25.0, 27.0, 27.0, 32.0, 38.0, 41.0, 38.0, 37.0, 41.0, 48.0, 46.0, 45.0, 44.0, 43.0, 51.0, 51.0, 39.0, 44.0, 25.0, 26.0, 27.0, 22.0, 15.0, 13.0, 9.0, 12.0, 6.0, 5.0, 8.0, 8.0, 4.0, 4.0, 0.0, 2.0, 2.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-90.5, -87.4619140625, -84.423828125, -81.3857421875, -78.34765625, -75.3095703125, -72.271484375, -69.2333984375, -66.1953125, -63.1572265625, -60.119140625, -57.0810546875, -54.04296875, -51.0048828125, -47.966796875, -44.9287109375, -41.890625, -38.8525390625, -35.814453125, -32.7763671875, -29.73828125, -26.7001953125, -23.662109375, -20.6240234375, -17.5859375, -14.5478515625, -11.509765625, -8.4716796875, -5.43359375, -2.3955078125, 0.642578125, 3.6806640625, 6.71875, 9.7568359375, 12.794921875, 15.8330078125, 18.87109375, 21.9091796875, 24.947265625, 27.9853515625, 31.0234375, 34.0615234375, 37.099609375, 40.1376953125, 43.17578125, 46.2138671875, 49.251953125, 52.2900390625, 55.328125, 58.3662109375, 61.404296875, 64.4423828125, 67.48046875, 70.5185546875, 73.556640625, 76.5947265625, 79.6328125, 82.6708984375, 85.708984375, 88.7470703125, 91.78515625, 94.8232421875, 97.861328125, 100.8994140625, 103.9375]}, "gradients/decoder.transformer.h.0.attn.c_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 1.0, 3.0, 2.0, 5.0, 12.0, 4.0, 23.0, 9.0, 30.0, 33.0, 46.0, 82.0, 83.0, 146.0, 202.0, 296.0, 496.0, 916.0, 1449.0, 2578.0, 5565.0, 12980.0, 44497.0, 278002.0, 577639.0, 87306.0, 20610.0, 7307.0, 3617.0, 1899.0, 1016.0, 592.0, 374.0, 238.0, 155.0, 94.0, 68.0, 47.0, 29.0, 23.0, 29.0, 20.0, 9.0, 11.0, 4.0, 7.0, 7.0, 3.0, 3.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-24.21875, -23.360107421875, -22.50146484375, -21.642822265625, -20.7841796875, -19.925537109375, -19.06689453125, -18.208251953125, -17.349609375, -16.490966796875, -15.63232421875, -14.773681640625, -13.9150390625, -13.056396484375, -12.19775390625, -11.339111328125, -10.48046875, -9.621826171875, -8.76318359375, -7.904541015625, -7.0458984375, -6.187255859375, -5.32861328125, -4.469970703125, -3.611328125, -2.752685546875, -1.89404296875, -1.035400390625, -0.1767578125, 0.681884765625, 1.54052734375, 2.399169921875, 3.2578125, 4.116455078125, 4.97509765625, 5.833740234375, 6.6923828125, 7.551025390625, 8.40966796875, 9.268310546875, 10.126953125, 10.985595703125, 11.84423828125, 12.702880859375, 13.5615234375, 14.420166015625, 15.27880859375, 16.137451171875, 16.99609375, 17.854736328125, 18.71337890625, 19.572021484375, 20.4306640625, 21.289306640625, 22.14794921875, 23.006591796875, 23.865234375, 24.723876953125, 25.58251953125, 26.441162109375, 27.2998046875, 28.158447265625, 29.01708984375, 29.875732421875, 30.734375]}, "gradients/decoder.transformer.h.0.attn.c_attn.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 3.0, 6.0, 4.0, 13.0, 9.0, 12.0, 26.0, 43.0, 42.0, 66.0, 72.0, 67.0, 99.0, 2135.0, 96.0, 78.0, 78.0, 40.0, 42.0, 42.0, 26.0, 12.0, 17.0, 8.0, 5.0, 2.0, 4.0, 5.0, 1.0, 2.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-257.75, -249.8046875, -241.859375, -233.9140625, -225.96875, -218.0234375, -210.078125, -202.1328125, -194.1875, -186.2421875, -178.296875, -170.3515625, -162.40625, -154.4609375, -146.515625, -138.5703125, -130.625, -122.6796875, -114.734375, -106.7890625, -98.84375, -90.8984375, -82.953125, -75.0078125, -67.0625, -59.1171875, -51.171875, -43.2265625, -35.28125, -27.3359375, -19.390625, -11.4453125, -3.5, 4.4453125, 12.390625, 20.3359375, 28.28125, 36.2265625, 44.171875, 52.1171875, 60.0625, 68.0078125, 75.953125, 83.8984375, 91.84375, 99.7890625, 107.734375, 115.6796875, 123.625, 131.5703125, 139.515625, 147.4609375, 155.40625, 163.3515625, 171.296875, 179.2421875, 187.1875, 195.1328125, 203.078125, 211.0234375, 218.96875, 226.9140625, 234.859375, 242.8046875, 250.75]}, "gradients/decoder.transformer.h.0.attn.c_attn.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 3.0, 3.0, 11.0, 7.0, 15.0, 24.0, 11.0, 36.0, 52.0, 65.0, 132.0, 195.0, 342.0, 615.0, 1828.0, 10886.0, 2399638.0, 717892.0, 10693.0, 1787.0, 613.0, 323.0, 207.0, 115.0, 76.0, 32.0, 38.0, 26.0, 13.0, 8.0, 10.0, 6.0, 2.0, 6.0, 3.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-62.125, -60.0537109375, -57.982421875, -55.9111328125, -53.83984375, -51.7685546875, -49.697265625, -47.6259765625, -45.5546875, -43.4833984375, -41.412109375, -39.3408203125, -37.26953125, -35.1982421875, -33.126953125, -31.0556640625, -28.984375, -26.9130859375, -24.841796875, -22.7705078125, -20.69921875, -18.6279296875, -16.556640625, -14.4853515625, -12.4140625, -10.3427734375, -8.271484375, -6.2001953125, -4.12890625, -2.0576171875, 0.013671875, 2.0849609375, 4.15625, 6.2275390625, 8.298828125, 10.3701171875, 12.44140625, 14.5126953125, 16.583984375, 18.6552734375, 20.7265625, 22.7978515625, 24.869140625, 26.9404296875, 29.01171875, 31.0830078125, 33.154296875, 35.2255859375, 37.296875, 39.3681640625, 41.439453125, 43.5107421875, 45.58203125, 47.6533203125, 49.724609375, 51.7958984375, 53.8671875, 55.9384765625, 58.009765625, 60.0810546875, 62.15234375, 64.2236328125, 66.294921875, 68.3662109375, 70.4375]}, "gradients/decoder.transformer.h.0.ln_1.weight": {"_type": "histogram", "values": [2.0, 1.0, 2.0, 1.0, 15.0, 51.0, 491.0, 375.0, 64.0, 8.0, 3.0, 7.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-200.3923797607422, -171.44140625, -142.4904327392578, -113.53946685791016, -84.58849334716797, -55.63752746582031, -26.686553955078125, 2.2644195556640625, 31.21539306640625, 60.16636657714844, 89.11734008789062, 118.06830596923828, 147.019287109375, 175.97024536132812, 204.9212188720703, 233.8721923828125, 262.82318115234375, 291.7741394042969, 320.7251281738281, 349.67608642578125, 378.6270751953125, 407.5780334472656, 436.52899169921875, 465.47998046875, 494.4309387207031, 523.3818969726562, 552.3328857421875, 581.2838745117188, 610.2348022460938, 639.185791015625, 668.1367797851562, 697.0877685546875, 726.0387573242188, 754.98974609375, 783.940673828125, 812.8916625976562, 841.8426513671875, 870.7935791015625, 899.7445678710938, 928.695556640625, 957.6465454101562, 986.5975341796875, 1015.5484619140625, 1044.49951171875, 1073.450439453125, 1102.4013671875, 1131.3524169921875, 1160.3033447265625, 1189.2542724609375, 1218.2052001953125, 1247.15625, 1276.107177734375, 1305.05810546875, 1334.0091552734375, 1362.9600830078125, 1391.9111328125, 1420.862060546875, 1449.81298828125, 1478.7640380859375, 1507.7149658203125, 1536.6658935546875, 1565.616943359375, 1594.56787109375, 1623.518798828125, 1652.4698486328125]}, "gradients/decoder.transformer.h.0.ln_1.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 4.0, 3.0, 1.0, 0.0, 7.0, 3.0, 4.0, 13.0, 9.0, 11.0, 8.0, 16.0, 9.0, 19.0, 22.0, 26.0, 26.0, 29.0, 34.0, 31.0, 47.0, 44.0, 53.0, 47.0, 48.0, 45.0, 46.0, 52.0, 35.0, 41.0, 37.0, 36.0, 29.0, 30.0, 16.0, 22.0, 23.0, 11.0, 13.0, 11.0, 11.0, 4.0, 10.0, 6.0, 7.0, 3.0, 3.0, 7.0, 1.0, 4.0, 1.0, 0.0, 1.0, 1.0, 1.0], "bins": [-189.0828094482422, -183.53890991210938, -177.9949951171875, -172.4510955810547, -166.90719604492188, -161.36329650878906, -155.8193817138672, -150.27548217773438, -144.73158264160156, -139.18768310546875, -133.64376831054688, -128.09986877441406, -122.55596923828125, -117.0120620727539, -111.46815490722656, -105.92425537109375, -100.3803482055664, -94.83644104003906, -89.29254150390625, -83.7486343383789, -78.2047348022461, -72.66082763671875, -67.11692810058594, -61.573020935058594, -56.029117584228516, -50.48521423339844, -44.94131088256836, -39.39740753173828, -33.85350036621094, -28.309598922729492, -22.76569366455078, -17.221790313720703, -11.677886962890625, -6.133983135223389, -0.5900793075561523, 4.953824996948242, 10.49772834777832, 16.0416316986084, 21.58553695678711, 27.129440307617188, 32.673343658447266, 38.217247009277344, 43.76115036010742, 49.3050537109375, 54.848960876464844, 60.392860412597656, 65.936767578125, 71.48066711425781, 77.02457427978516, 82.5684814453125, 88.11238098144531, 93.65628814697266, 99.20018768310547, 104.74409484863281, 110.28799438476562, 115.83190155029297, 121.37580871582031, 126.91971588134766, 132.463623046875, 138.0075225830078, 143.55142211914062, 149.09532165527344, 154.6392364501953, 160.18313598632812, 165.72703552246094]}, "gradients/decoder.transformer.wpe.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 4.0, 3.0, 0.0, 5.0, 2.0, 7.0, 7.0, 9.0, 12.0, 20.0, 21.0, 31.0, 33.0, 51.0, 59.0, 91.0, 124.0, 203.0, 263.0, 411.0, 615.0, 828.0, 987.0, 1041438.0, 893.0, 747.0, 544.0, 354.0, 240.0, 157.0, 107.0, 86.0, 53.0, 39.0, 18.0, 15.0, 17.0, 20.0, 7.0, 9.0, 7.0, 8.0, 4.0, 3.0, 6.0, 4.0, 3.0, 0.0, 2.0, 2.0, 0.0, 3.0], "bins": [-76.24835205078125, -74.08812713623047, -71.92790222167969, -69.76766967773438, -67.6074447631836, -65.44721984863281, -63.286991119384766, -61.12676239013672, -58.96653747558594, -56.806312561035156, -54.64608383178711, -52.48585510253906, -50.32563018798828, -48.1654052734375, -46.00517654418945, -43.844947814941406, -41.684722900390625, -39.524497985839844, -37.3642692565918, -35.20404052734375, -33.04381561279297, -30.883588790893555, -28.72336196899414, -26.563135147094727, -24.402908325195312, -22.2426815032959, -20.082454681396484, -17.92222785949707, -15.762001037597656, -13.601774215698242, -11.441547393798828, -9.281320571899414, -7.121086120605469, -4.960859298706055, -2.8006324768066406, -0.6404056549072266, 1.5198211669921875, 3.6800479888916016, 5.840274810791016, 8.00050163269043, 10.160728454589844, 12.320955276489258, 14.481182098388672, 16.641408920288086, 18.8016357421875, 20.961862564086914, 23.122089385986328, 25.282316207885742, 27.442543029785156, 29.60276985168457, 31.762996673583984, 33.92322540283203, 36.08345031738281, 38.243675231933594, 40.40390396118164, 42.56413269042969, 44.72435760498047, 46.88458251953125, 49.0448112487793, 51.205039978027344, 53.365264892578125, 55.525489807128906, 57.68571853637695, 59.845947265625, 62.00617218017578]}, "gradients/decoder.transformer.wte.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 6.0, 5.0, 7.0, 4.0, 12.0, 9.0, 16.0, 18.0, 23.0, 27.0, 103.0, 543.0, 51462052.0, 164.0, 49.0, 29.0, 26.0, 10.0, 3.0, 4.0, 0.0, 4.0, 5.0, 2.0, 4.0, 7.0, 4.0, 8.0, 4.0, 8.0, 4.0, 3.0, 2.0, 4.0, 2.0, 3.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-6036.0, -5777.8173828125, -5519.63427734375, -5261.45166015625, -5003.2685546875, -4745.0859375, -4486.9033203125, -4228.72021484375, -3970.537353515625, -3712.3544921875, -3454.171630859375, -3195.98876953125, -2937.80615234375, -2679.623046875, -2421.4404296875, -2163.257568359375, -1905.07470703125, -1646.891845703125, -1388.708984375, -1130.5262451171875, -872.3433837890625, -614.1605224609375, -355.977783203125, -97.794921875, 160.387939453125, 418.5707702636719, 676.7536010742188, 934.9364013671875, 1193.1192626953125, 1451.3021240234375, 1709.48486328125, 1967.667724609375, 2225.8505859375, 2484.033447265625, 2742.21630859375, 3000.39892578125, 3258.58203125, 3516.7646484375, 3774.947509765625, 4033.13037109375, 4291.3134765625, 4549.49609375, 4807.67919921875, 5065.86181640625, 5324.044921875, 5582.2275390625, 5840.41015625, 6098.59326171875, 6356.77587890625, 6614.95849609375, 6873.1416015625, 7131.32421875, 7389.50732421875, 7647.68994140625, 7905.873046875, 8164.0556640625, 8422.23828125, 8680.4208984375, 8938.603515625, 9196.787109375, 9454.9697265625, 9713.15234375, 9971.3349609375, 10229.517578125, 10487.701171875]}, "gradients/encoder.adapter.layers.2.conv.weight": {"_type": "histogram", "values": [2.0, 6.0, 6.0, 3.0, 7.0, 6.0, 16.0, 16.0, 18.0, 51.0, 58.0, 83.0, 129.0, 194.0, 284.0, 388.0, 519.0, 848.0, 1355.0, 2005.0, 2934.0, 4700.0, 6876.0, 10834.0, 16956.0, 27609.0, 44119.0, 73617.0, 127699.0, 232614.0, 507470.0, 3991997.0, 626517.0, 260131.0, 140669.0, 80718.0, 48035.0, 29454.0, 18509.0, 11969.0, 7701.0, 4892.0, 3048.0, 2071.0, 1412.0, 938.0, 624.0, 430.0, 290.0, 175.0, 162.0, 91.0, 66.0, 42.0, 28.0, 19.0, 14.0, 6.0, 8.0, 6.0, 6.0, 3.0, 2.0, 1.0], "bins": [-9.78125, -9.4715576171875, -9.161865234375, -8.8521728515625, -8.54248046875, -8.2327880859375, -7.923095703125, -7.6134033203125, -7.3037109375, -6.9940185546875, -6.684326171875, -6.3746337890625, -6.06494140625, -5.7552490234375, -5.445556640625, -5.1358642578125, -4.826171875, -4.5164794921875, -4.206787109375, -3.8970947265625, -3.58740234375, -3.2777099609375, -2.968017578125, -2.6583251953125, -2.3486328125, -2.0389404296875, -1.729248046875, -1.4195556640625, -1.10986328125, -0.8001708984375, -0.490478515625, -0.1807861328125, 0.12890625, 0.4385986328125, 0.748291015625, 1.0579833984375, 1.36767578125, 1.6773681640625, 1.987060546875, 2.2967529296875, 2.6064453125, 2.9161376953125, 3.225830078125, 3.5355224609375, 3.84521484375, 4.1549072265625, 4.464599609375, 4.7742919921875, 5.083984375, 5.3936767578125, 5.703369140625, 6.0130615234375, 6.32275390625, 6.6324462890625, 6.942138671875, 7.2518310546875, 7.5615234375, 7.8712158203125, 8.180908203125, 8.4906005859375, 8.80029296875, 9.1099853515625, 9.419677734375, 9.7293701171875, 10.0390625]}, "gradients/encoder.adapter.layers.2.conv.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 2.0, 6.0, 3.0, 5.0, 4.0, 5.0, 11.0, 8.0, 6.0, 14.0, 12.0, 24.0, 14.0, 32.0, 21.0, 32.0, 37.0, 52.0, 37.0, 43.0, 55.0, 71.0, 191.0, 646.0, 213.0, 80.0, 49.0, 46.0, 44.0, 38.0, 41.0, 27.0, 28.0, 22.0, 27.0, 15.0, 12.0, 15.0, 14.0, 8.0, 9.0, 6.0, 5.0, 1.0, 4.0, 0.0, 3.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-20.265625, -19.645263671875, -19.02490234375, -18.404541015625, -17.7841796875, -17.163818359375, -16.54345703125, -15.923095703125, -15.302734375, -14.682373046875, -14.06201171875, -13.441650390625, -12.8212890625, -12.200927734375, -11.58056640625, -10.960205078125, -10.33984375, -9.719482421875, -9.09912109375, -8.478759765625, -7.8583984375, -7.238037109375, -6.61767578125, -5.997314453125, -5.376953125, -4.756591796875, -4.13623046875, -3.515869140625, -2.8955078125, -2.275146484375, -1.65478515625, -1.034423828125, -0.4140625, 0.206298828125, 0.82666015625, 1.447021484375, 2.0673828125, 2.687744140625, 3.30810546875, 3.928466796875, 4.548828125, 5.169189453125, 5.78955078125, 6.409912109375, 7.0302734375, 7.650634765625, 8.27099609375, 8.891357421875, 9.51171875, 10.132080078125, 10.75244140625, 11.372802734375, 11.9931640625, 12.613525390625, 13.23388671875, 13.854248046875, 14.474609375, 15.094970703125, 15.71533203125, 16.335693359375, 16.9560546875, 17.576416015625, 18.19677734375, 18.817138671875, 19.4375]}, "gradients/encoder.adapter.layers.1.conv.weight": {"_type": "histogram", "values": [3.0, 3.0, 6.0, 9.0, 3.0, 6.0, 8.0, 24.0, 34.0, 63.0, 78.0, 75.0, 134.0, 156.0, 222.0, 316.0, 469.0, 674.0, 952.0, 1501.0, 2205.0, 3290.0, 5176.0, 7886.0, 12432.0, 19511.0, 31707.0, 52588.0, 89597.0, 162112.0, 342199.0, 2764134.0, 2072645.0, 335145.0, 159920.0, 88327.0, 51990.0, 31540.0, 19313.0, 12145.0, 7853.0, 4808.0, 3235.0, 2347.0, 1457.0, 996.0, 634.0, 509.0, 271.0, 238.0, 163.0, 91.0, 67.0, 62.0, 27.0, 32.0, 10.0, 17.0, 6.0, 13.0, 8.0, 8.0, 3.0, 3.0], "bins": [-9.7265625, -9.4224853515625, -9.118408203125, -8.8143310546875, -8.51025390625, -8.2061767578125, -7.902099609375, -7.5980224609375, -7.2939453125, -6.9898681640625, -6.685791015625, -6.3817138671875, -6.07763671875, -5.7735595703125, -5.469482421875, -5.1654052734375, -4.861328125, -4.5572509765625, -4.253173828125, -3.9490966796875, -3.64501953125, -3.3409423828125, -3.036865234375, -2.7327880859375, -2.4287109375, -2.1246337890625, -1.820556640625, -1.5164794921875, -1.21240234375, -0.9083251953125, -0.604248046875, -0.3001708984375, 0.00390625, 0.3079833984375, 0.612060546875, 0.9161376953125, 1.22021484375, 1.5242919921875, 1.828369140625, 2.1324462890625, 2.4365234375, 2.7406005859375, 3.044677734375, 3.3487548828125, 3.65283203125, 3.9569091796875, 4.260986328125, 4.5650634765625, 4.869140625, 5.1732177734375, 5.477294921875, 5.7813720703125, 6.08544921875, 6.3895263671875, 6.693603515625, 6.9976806640625, 7.3017578125, 7.6058349609375, 7.909912109375, 8.2139892578125, 8.51806640625, 8.8221435546875, 9.126220703125, 9.4302978515625, 9.734375]}, "gradients/encoder.adapter.layers.1.conv.bias": {"_type": "histogram", "values": [2.0, 2.0, 0.0, 1.0, 1.0, 3.0, 2.0, 2.0, 6.0, 3.0, 9.0, 8.0, 8.0, 12.0, 20.0, 17.0, 18.0, 19.0, 30.0, 32.0, 34.0, 47.0, 40.0, 41.0, 64.0, 90.0, 178.0, 520.0, 309.0, 95.0, 66.0, 44.0, 30.0, 40.0, 36.0, 30.0, 31.0, 24.0, 21.0, 15.0, 16.0, 17.0, 14.0, 14.0, 8.0, 6.0, 3.0, 4.0, 1.0, 3.0, 2.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-11.8046875, -11.380615234375, -10.95654296875, -10.532470703125, -10.1083984375, -9.684326171875, -9.26025390625, -8.836181640625, -8.412109375, -7.988037109375, -7.56396484375, -7.139892578125, -6.7158203125, -6.291748046875, -5.86767578125, -5.443603515625, -5.01953125, -4.595458984375, -4.17138671875, -3.747314453125, -3.3232421875, -2.899169921875, -2.47509765625, -2.051025390625, -1.626953125, -1.202880859375, -0.77880859375, -0.354736328125, 0.0693359375, 0.493408203125, 0.91748046875, 1.341552734375, 1.765625, 2.189697265625, 2.61376953125, 3.037841796875, 3.4619140625, 3.885986328125, 4.31005859375, 4.734130859375, 5.158203125, 5.582275390625, 6.00634765625, 6.430419921875, 6.8544921875, 7.278564453125, 7.70263671875, 8.126708984375, 8.55078125, 8.974853515625, 9.39892578125, 9.822998046875, 10.2470703125, 10.671142578125, 11.09521484375, 11.519287109375, 11.943359375, 12.367431640625, 12.79150390625, 13.215576171875, 13.6396484375, 14.063720703125, 14.48779296875, 14.911865234375, 15.3359375]}, "gradients/encoder.adapter.layers.0.conv.weight": {"_type": "histogram", "values": [5.0, 4.0, 3.0, 7.0, 5.0, 0.0, 6.0, 1.0, 10.0, 20.0, 19.0, 32.0, 26.0, 43.0, 69.0, 79.0, 103.0, 157.0, 188.0, 248.0, 289.0, 464.0, 655.0, 832.0, 1157.0, 1852.0, 2832.0, 4815.0, 9100.0, 20385.0, 56520.0, 538635.0, 5549320.0, 59237.0, 20954.0, 9308.0, 4911.0, 2855.0, 1863.0, 1170.0, 889.0, 629.0, 435.0, 301.0, 259.0, 184.0, 118.0, 132.0, 47.0, 65.0, 52.0, 61.0, 25.0, 4.0, 18.0, 14.0, 5.0, 13.0, 8.0, 9.0, 6.0, 0.0, 0.0, 3.0], "bins": [-28.328125, -27.443359375, -26.55859375, -25.673828125, -24.7890625, -23.904296875, -23.01953125, -22.134765625, -21.25, -20.365234375, -19.48046875, -18.595703125, -17.7109375, -16.826171875, -15.94140625, -15.056640625, -14.171875, -13.287109375, -12.40234375, -11.517578125, -10.6328125, -9.748046875, -8.86328125, -7.978515625, -7.09375, -6.208984375, -5.32421875, -4.439453125, -3.5546875, -2.669921875, -1.78515625, -0.900390625, -0.015625, 0.869140625, 1.75390625, 2.638671875, 3.5234375, 4.408203125, 5.29296875, 6.177734375, 7.0625, 7.947265625, 8.83203125, 9.716796875, 10.6015625, 11.486328125, 12.37109375, 13.255859375, 14.140625, 15.025390625, 15.91015625, 16.794921875, 17.6796875, 18.564453125, 19.44921875, 20.333984375, 21.21875, 22.103515625, 22.98828125, 23.873046875, 24.7578125, 25.642578125, 26.52734375, 27.412109375, 28.296875]}, "gradients/encoder.adapter.layers.0.conv.bias": {"_type": "histogram", "values": [2.0, 0.0, 3.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 6.0, 3.0, 4.0, 3.0, 2.0, 8.0, 10.0, 10.0, 14.0, 12.0, 22.0, 14.0, 30.0, 27.0, 31.0, 32.0, 36.0, 39.0, 49.0, 56.0, 96.0, 160.0, 266.0, 438.0, 152.0, 93.0, 63.0, 46.0, 34.0, 37.0, 30.0, 37.0, 28.0, 28.0, 23.0, 14.0, 17.0, 9.0, 16.0, 8.0, 11.0, 3.0, 6.0, 4.0, 2.0, 2.0, 4.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-9.9375, -9.6290283203125, -9.320556640625, -9.0120849609375, -8.70361328125, -8.3951416015625, -8.086669921875, -7.7781982421875, -7.4697265625, -7.1612548828125, -6.852783203125, -6.5443115234375, -6.23583984375, -5.9273681640625, -5.618896484375, -5.3104248046875, -5.001953125, -4.6934814453125, -4.385009765625, -4.0765380859375, -3.76806640625, -3.4595947265625, -3.151123046875, -2.8426513671875, -2.5341796875, -2.2257080078125, -1.917236328125, -1.6087646484375, -1.30029296875, -0.9918212890625, -0.683349609375, -0.3748779296875, -0.06640625, 0.2420654296875, 0.550537109375, 0.8590087890625, 1.16748046875, 1.4759521484375, 1.784423828125, 2.0928955078125, 2.4013671875, 2.7098388671875, 3.018310546875, 3.3267822265625, 3.63525390625, 3.9437255859375, 4.252197265625, 4.5606689453125, 4.869140625, 5.1776123046875, 5.486083984375, 5.7945556640625, 6.10302734375, 6.4114990234375, 6.719970703125, 7.0284423828125, 7.3369140625, 7.6453857421875, 7.953857421875, 8.2623291015625, 8.57080078125, 8.8792724609375, 9.187744140625, 9.4962158203125, 9.8046875]}, "gradients/encoder.encoder.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 4.0, 3.0, 4.0, 12.0, 10.0, 12.0, 52.0, 109.0, 408.0, 281.0, 61.0, 32.0, 9.0, 6.0, 4.0, 2.0, 1.0, 1.0, 2.0], "bins": [-97.36585998535156, -95.55780792236328, -93.749755859375, -91.94171142578125, -90.13365936279297, -88.32560729980469, -86.5175552368164, -84.70951080322266, -82.90145874023438, -81.0934066772461, -79.28535461425781, -77.47731018066406, -75.66925811767578, -73.8612060546875, -72.05315399169922, -70.24510955810547, -68.43705749511719, -66.6290054321289, -64.82095336914062, -63.01290512084961, -61.204856872558594, -59.39680480957031, -57.5887565612793, -55.780704498291016, -53.972652435302734, -52.16460037231445, -50.35655212402344, -48.548500061035156, -46.74045181274414, -44.93239974975586, -43.124351501464844, -41.31629943847656, -39.50825500488281, -37.70020294189453, -35.892154693603516, -34.084102630615234, -32.27605438232422, -30.468002319335938, -28.659954071044922, -26.85190200805664, -25.04384994506836, -23.23579978942871, -21.427749633789062, -19.619699478149414, -17.811649322509766, -16.003597259521484, -14.195548057556152, -12.387497901916504, -10.579448699951172, -8.771398544311523, -6.963348388671875, -5.155297756195068, -3.34724760055542, -1.5391969680786133, 0.26885318756103516, 2.0769033432006836, 3.884953498840332, 5.6930036544799805, 7.501053810119629, 9.309104919433594, 11.117155075073242, 12.92520523071289, 14.733255386352539, 16.541305541992188, 18.349355697631836]}, "gradients/encoder.encoder.layer_norm.bias": {"_type": "histogram", "values": [1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 6.0, 8.0, 7.0, 4.0, 6.0, 2.0, 8.0, 10.0, 10.0, 16.0, 20.0, 19.0, 18.0, 26.0, 27.0, 21.0, 31.0, 32.0, 28.0, 34.0, 42.0, 38.0, 33.0, 37.0, 39.0, 36.0, 25.0, 37.0, 35.0, 30.0, 25.0, 31.0, 36.0, 27.0, 25.0, 35.0, 18.0, 22.0, 16.0, 13.0, 13.0, 9.0, 12.0, 7.0, 8.0, 8.0, 7.0, 5.0, 7.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-12.808023452758789, -12.405165672302246, -12.002306938171387, -11.599449157714844, -11.196590423583984, -10.793732643127441, -10.390874862670898, -9.988016128540039, -9.585158348083496, -9.182300567626953, -8.779441833496094, -8.37658405303955, -7.97372579574585, -7.570867538452148, -7.1680097579956055, -6.765151500701904, -6.362293243408203, -5.959434986114502, -5.556576728820801, -5.153718948364258, -4.750860691070557, -4.3480024337768555, -3.9451444149017334, -3.5422863960266113, -3.13942813873291, -2.736569881439209, -2.333711862564087, -1.9308537244796753, -1.5279955863952637, -1.1251373291015625, -0.7222793102264404, -0.31942129135131836, 0.08343791961669922, 0.48629605770111084, 0.8891541957855225, 1.292012333869934, 1.6948704719543457, 2.097728729248047, 2.500586748123169, 2.903444766998291, 3.306303024291992, 3.7091612815856934, 4.1120195388793945, 4.5148773193359375, 4.917735576629639, 5.32059383392334, 5.723451614379883, 6.126309871673584, 6.529168128967285, 6.932026386260986, 7.3348846435546875, 7.7377424240112305, 8.140600204467773, 8.543458938598633, 8.946316719055176, 9.349174499511719, 9.752033233642578, 10.154891014099121, 10.55774974822998, 10.960607528686523, 11.363466262817383, 11.766324043273926, 12.169181823730469, 12.572040557861328, 12.974898338317871]}, "gradients/encoder.encoder.layers.23.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 1.0, 1.0, 4.0, 3.0, 2.0, 5.0, 0.0, 9.0, 7.0, 8.0, 10.0, 8.0, 10.0, 17.0, 21.0, 27.0, 45.0, 48.0, 86.0, 86.0, 119.0, 189.0, 250.0, 334.0, 531.0, 755.0, 1150.0, 1970.0, 3235.0, 6076.0, 14326.0, 72974.0, 4032292.0, 36278.0, 10578.0, 4884.0, 2810.0, 1676.0, 1063.0, 710.0, 516.0, 310.0, 284.0, 146.0, 118.0, 79.0, 61.0, 38.0, 37.0, 22.0, 19.0, 12.0, 12.0, 16.0, 10.0, 4.0, 5.0, 4.0, 3.0, 4.0, 3.0], "bins": [-0.0377197265625, -0.036652565002441406, -0.03558540344238281, -0.03451824188232422, -0.033451080322265625, -0.03238391876220703, -0.03131675720214844, -0.030249595642089844, -0.02918243408203125, -0.028115272521972656, -0.027048110961914062, -0.02598094940185547, -0.024913787841796875, -0.02384662628173828, -0.022779464721679688, -0.021712303161621094, -0.0206451416015625, -0.019577980041503906, -0.018510818481445312, -0.01744365692138672, -0.016376495361328125, -0.015309333801269531, -0.014242172241210938, -0.013175010681152344, -0.01210784912109375, -0.011040687561035156, -0.009973526000976562, -0.008906364440917969, -0.007839202880859375, -0.006772041320800781, -0.0057048797607421875, -0.004637718200683594, -0.003570556640625, -0.0025033950805664062, -0.0014362335205078125, -0.00036907196044921875, 0.000698089599609375, 0.0017652511596679688, 0.0028324127197265625, 0.0038995742797851562, 0.00496673583984375, 0.006033897399902344, 0.0071010589599609375, 0.008168220520019531, 0.009235382080078125, 0.010302543640136719, 0.011369705200195312, 0.012436866760253906, 0.0135040283203125, 0.014571189880371094, 0.015638351440429688, 0.01670551300048828, 0.017772674560546875, 0.01883983612060547, 0.019906997680664062, 0.020974159240722656, 0.02204132080078125, 0.023108482360839844, 0.024175643920898438, 0.02524280548095703, 0.026309967041015625, 0.02737712860107422, 0.028444290161132812, 0.029511451721191406, 0.03057861328125]}, "gradients/encoder.encoder.layers.23.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 3.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 3.0, 2.0, 4.0, 0.0, 4.0, 3.0, 9.0, 4.0, 7.0, 2.0, 5.0, 3.0, 13.0, 16.0, 26.0, 290.0, 457.0, 66.0, 16.0, 9.0, 12.0, 9.0, 7.0, 9.0, 4.0, 5.0, 4.0, 4.0, 1.0, 3.0, 1.0, 4.0, 1.0, 1.0, 2.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.002956390380859375, -0.0028515756130218506, -0.002746760845184326, -0.0026419460773468018, -0.0025371313095092773, -0.002432316541671753, -0.0023275017738342285, -0.002222687005996704, -0.0021178722381591797, -0.0020130574703216553, -0.0019082427024841309, -0.0018034279346466064, -0.001698613166809082, -0.0015937983989715576, -0.0014889836311340332, -0.0013841688632965088, -0.0012793540954589844, -0.00117453932762146, -0.0010697245597839355, -0.0009649097919464111, -0.0008600950241088867, -0.0007552802562713623, -0.0006504654884338379, -0.0005456507205963135, -0.00044083595275878906, -0.00033602118492126465, -0.00023120641708374023, -0.00012639164924621582, -2.1576881408691406e-05, 8.323788642883301e-05, 0.00018805265426635742, 0.00029286742210388184, 0.00039768218994140625, 0.0005024969577789307, 0.0006073117256164551, 0.0007121264934539795, 0.0008169412612915039, 0.0009217560291290283, 0.0010265707969665527, 0.0011313855648040771, 0.0012362003326416016, 0.001341015100479126, 0.0014458298683166504, 0.0015506446361541748, 0.0016554594039916992, 0.0017602741718292236, 0.001865088939666748, 0.0019699037075042725, 0.002074718475341797, 0.0021795332431793213, 0.0022843480110168457, 0.00238916277885437, 0.0024939775466918945, 0.002598792314529419, 0.0027036070823669434, 0.0028084218502044678, 0.002913236618041992, 0.0030180513858795166, 0.003122866153717041, 0.0032276809215545654, 0.00333249568939209, 0.0034373104572296143, 0.0035421252250671387, 0.003646939992904663, 0.0037517547607421875]}, "gradients/encoder.encoder.layers.23.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 2.0, 1.0, 0.0, 2.0, 1.0, 1.0, 3.0, 2.0, 4.0, 7.0, 1.0, 5.0, 9.0, 13.0, 23.0, 23.0, 42.0, 62.0, 99.0, 152.0, 224.0, 377.0, 708.0, 1522.0, 4200.0, 19221.0, 978924.0, 3159763.0, 21178.0, 4285.0, 1647.0, 705.0, 393.0, 219.0, 151.0, 110.0, 74.0, 46.0, 35.0, 21.0, 14.0, 4.0, 3.0, 5.0, 2.0, 6.0, 3.0, 2.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0638427734375, -0.06178474426269531, -0.059726715087890625, -0.05766868591308594, -0.05561065673828125, -0.05355262756347656, -0.051494598388671875, -0.04943656921386719, -0.0473785400390625, -0.04532051086425781, -0.043262481689453125, -0.04120445251464844, -0.03914642333984375, -0.03708839416503906, -0.035030364990234375, -0.03297233581542969, -0.030914306640625, -0.028856277465820312, -0.026798248291015625, -0.024740219116210938, -0.02268218994140625, -0.020624160766601562, -0.018566131591796875, -0.016508102416992188, -0.0144500732421875, -0.012392044067382812, -0.010334014892578125, -0.008275985717773438, -0.00621795654296875, -0.0041599273681640625, -0.002101898193359375, -4.38690185546875e-05, 0.00201416015625, 0.0040721893310546875, 0.006130218505859375, 0.008188247680664062, 0.01024627685546875, 0.012304306030273438, 0.014362335205078125, 0.016420364379882812, 0.0184783935546875, 0.020536422729492188, 0.022594451904296875, 0.024652481079101562, 0.02671051025390625, 0.028768539428710938, 0.030826568603515625, 0.03288459777832031, 0.034942626953125, 0.03700065612792969, 0.039058685302734375, 0.04111671447753906, 0.04317474365234375, 0.04523277282714844, 0.047290802001953125, 0.04934883117675781, 0.0514068603515625, 0.05346488952636719, 0.055522918701171875, 0.05758094787597656, 0.05963897705078125, 0.06169700622558594, 0.06375503540039062, 0.06581306457519531, 0.06787109375]}, "gradients/encoder.encoder.layers.23.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 7.0, 3.0, 11.0, 8.0, 13.0, 9.0, 22.0, 27.0, 26.0, 42.0, 55.0, 58.0, 86.0, 119.0, 170.0, 429.0, 2025.0, 387.0, 178.0, 91.0, 62.0, 57.0, 44.0, 35.0, 23.0, 21.0, 25.0, 9.0, 10.0, 8.0, 7.0, 6.0, 4.0, 4.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.006290435791015625, -0.0060964226722717285, -0.005902409553527832, -0.0057083964347839355, -0.005514383316040039, -0.005320370197296143, -0.005126357078552246, -0.00493234395980835, -0.004738330841064453, -0.004544317722320557, -0.00435030460357666, -0.004156291484832764, -0.003962278366088867, -0.0037682652473449707, -0.0035742521286010742, -0.0033802390098571777, -0.0031862258911132812, -0.0029922127723693848, -0.0027981996536254883, -0.002604186534881592, -0.0024101734161376953, -0.002216160297393799, -0.0020221471786499023, -0.0018281340599060059, -0.0016341209411621094, -0.0014401078224182129, -0.0012460947036743164, -0.00105208158493042, -0.0008580684661865234, -0.000664055347442627, -0.00047004222869873047, -0.000276029109954834, -8.20159912109375e-05, 0.00011199712753295898, 0.00030601024627685547, 0.000500023365020752, 0.0006940364837646484, 0.0008880496025085449, 0.0010820627212524414, 0.0012760758399963379, 0.0014700889587402344, 0.0016641020774841309, 0.0018581151962280273, 0.002052128314971924, 0.0022461414337158203, 0.002440154552459717, 0.0026341676712036133, 0.0028281807899475098, 0.0030221939086914062, 0.0032162070274353027, 0.0034102201461791992, 0.0036042332649230957, 0.003798246383666992, 0.003992259502410889, 0.004186272621154785, 0.004380285739898682, 0.004574298858642578, 0.004768311977386475, 0.004962325096130371, 0.005156338214874268, 0.005350351333618164, 0.0055443644523620605, 0.005738377571105957, 0.0059323906898498535, 0.00612640380859375]}, "gradients/encoder.encoder.layers.23.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 3.0, 14.0, 21.0, 93.0, 662.0, 170.0, 25.0, 12.0, 6.0, 3.0, 2.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.21538019180297852, -0.21094518899917603, -0.20651017129421234, -0.20207516849040985, -0.19764015078544617, -0.19320514798164368, -0.1887701451778412, -0.1843351274728775, -0.179900124669075, -0.17546512186527252, -0.17103010416030884, -0.16659510135650635, -0.16216008365154266, -0.15772508084774017, -0.1532900631427765, -0.148855060338974, -0.1444200575351715, -0.13998505473136902, -0.13555003702640533, -0.13111503422260284, -0.12668001651763916, -0.12224501371383667, -0.11781000345945358, -0.1133749932050705, -0.10893997550010681, -0.10450496524572372, -0.10006995499134064, -0.09563495218753815, -0.09119994193315506, -0.08676493167877197, -0.08232992142438889, -0.0778949111700058, -0.07345990091562271, -0.06902489066123962, -0.06458988040685654, -0.06015487387776375, -0.05571986734867096, -0.05128485709428787, -0.046849846839904785, -0.0424148365855217, -0.03797983005642891, -0.03354481980204582, -0.029109813272953033, -0.024674803018569946, -0.02023979462683201, -0.01580478623509407, -0.011369775980710983, -0.006934767588973045, -0.0024997591972351074, 0.0019352496601641178, 0.006370258517563343, 0.010805267840623856, 0.015240276232361794, 0.01967528462409973, 0.02411029487848282, 0.028545303270220757, 0.032980311661958694, 0.03741532191634178, 0.04185032844543457, 0.04628533869981766, 0.050720348954200745, 0.05515535548329353, 0.05959036573767662, 0.06402537226676941, 0.0684603825211525]}, "gradients/encoder.encoder.layers.23.final_layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 6.0, 1.0, 3.0, 2.0, 5.0, 3.0, 7.0, 11.0, 4.0, 14.0, 13.0, 14.0, 14.0, 26.0, 26.0, 38.0, 41.0, 29.0, 44.0, 42.0, 45.0, 62.0, 68.0, 57.0, 65.0, 58.0, 47.0, 40.0, 48.0, 29.0, 30.0, 22.0, 23.0, 14.0, 9.0, 10.0, 13.0, 11.0, 2.0, 3.0, 0.0, 4.0, 1.0, 1.0, 2.0, 3.0, 3.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.020285427570343018, -0.0196320042014122, -0.018978580832481384, -0.018325157463550568, -0.01767173409461975, -0.017018310725688934, -0.016364887356758118, -0.0157114639878273, -0.015058040618896484, -0.014404617249965668, -0.013751193881034851, -0.013097770512104034, -0.012444347143173218, -0.011790923774242401, -0.011137500405311584, -0.010484077036380768, -0.009830653667449951, -0.009177230298519135, -0.008523806929588318, -0.007870383560657501, -0.007216960191726685, -0.006563536822795868, -0.005910113453865051, -0.005256690084934235, -0.004603266716003418, -0.003949843347072601, -0.0032964199781417847, -0.002642996609210968, -0.0019895732402801514, -0.0013361498713493347, -0.0006827265024185181, -2.9303133487701416e-05, 0.0006241202354431152, 0.0012775436043739319, 0.0019309669733047485, 0.002584390342235565, 0.003237813711166382, 0.0038912370800971985, 0.004544660449028015, 0.005198083817958832, 0.0058515071868896484, 0.006504930555820465, 0.007158353924751282, 0.007811777293682098, 0.008465200662612915, 0.009118624031543732, 0.009772047400474548, 0.010425470769405365, 0.011078894138336182, 0.011732317507266998, 0.012385740876197815, 0.013039164245128632, 0.013692587614059448, 0.014346010982990265, 0.014999434351921082, 0.015652857720851898, 0.016306281089782715, 0.01695970445871353, 0.017613127827644348, 0.018266551196575165, 0.01891997456550598, 0.019573397934436798, 0.020226821303367615, 0.02088024467229843, 0.021533668041229248]}, "gradients/encoder.encoder.layers.23.attention.out_proj.weight": {"_type": "histogram", "values": [3.0, 0.0, 0.0, 6.0, 4.0, 4.0, 7.0, 16.0, 15.0, 12.0, 23.0, 35.0, 33.0, 43.0, 70.0, 56.0, 113.0, 139.0, 158.0, 212.0, 322.0, 470.0, 669.0, 847.0, 1203.0, 1719.0, 2547.0, 3927.0, 6316.0, 10899.0, 24390.0, 730179.0, 216044.0, 20462.0, 9979.0, 5828.0, 3690.0, 2404.0, 1570.0, 1112.0, 802.0, 612.0, 401.0, 304.0, 230.0, 140.0, 124.0, 110.0, 63.0, 59.0, 49.0, 33.0, 32.0, 27.0, 14.0, 17.0, 12.0, 7.0, 7.0, 1.0, 3.0, 2.0, 0.0, 1.0], "bins": [-0.033294677734375, -0.03225135803222656, -0.031208038330078125, -0.030164718627929688, -0.02912139892578125, -0.028078079223632812, -0.027034759521484375, -0.025991439819335938, -0.0249481201171875, -0.023904800415039062, -0.022861480712890625, -0.021818161010742188, -0.02077484130859375, -0.019731521606445312, -0.018688201904296875, -0.017644882202148438, -0.0166015625, -0.015558242797851562, -0.014514923095703125, -0.013471603393554688, -0.01242828369140625, -0.011384963989257812, -0.010341644287109375, -0.009298324584960938, -0.0082550048828125, -0.0072116851806640625, -0.006168365478515625, -0.0051250457763671875, -0.00408172607421875, -0.0030384063720703125, -0.001995086669921875, -0.0009517669677734375, 9.1552734375e-05, 0.0011348724365234375, 0.002178192138671875, 0.0032215118408203125, 0.00426483154296875, 0.0053081512451171875, 0.006351470947265625, 0.0073947906494140625, 0.0084381103515625, 0.009481430053710938, 0.010524749755859375, 0.011568069458007812, 0.01261138916015625, 0.013654708862304688, 0.014698028564453125, 0.015741348266601562, 0.01678466796875, 0.017827987670898438, 0.018871307373046875, 0.019914627075195312, 0.02095794677734375, 0.022001266479492188, 0.023044586181640625, 0.024087905883789062, 0.0251312255859375, 0.026174545288085938, 0.027217864990234375, 0.028261184692382812, 0.02930450439453125, 0.030347824096679688, 0.031391143798828125, 0.03243446350097656, 0.033477783203125]}, "gradients/encoder.encoder.layers.23.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 1.0, 5.0, 3.0, 0.0, 2.0, 3.0, 6.0, 5.0, 5.0, 5.0, 5.0, 3.0, 6.0, 9.0, 18.0, 36.0, 299.0, 404.0, 92.0, 24.0, 6.0, 11.0, 10.0, 4.0, 8.0, 8.0, 5.0, 3.0, 5.0, 4.0, 1.0, 2.0, 2.0, 2.0, 1.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0029125213623046875, -0.0028128623962402344, -0.0027132034301757812, -0.002613544464111328, -0.002513885498046875, -0.002414226531982422, -0.0023145675659179688, -0.0022149085998535156, -0.0021152496337890625, -0.0020155906677246094, -0.0019159317016601562, -0.0018162727355957031, -0.00171661376953125, -0.0016169548034667969, -0.0015172958374023438, -0.0014176368713378906, -0.0013179779052734375, -0.0012183189392089844, -0.0011186599731445312, -0.0010190010070800781, -0.000919342041015625, -0.0008196830749511719, -0.0007200241088867188, -0.0006203651428222656, -0.0005207061767578125, -0.0004210472106933594, -0.00032138824462890625, -0.00022172927856445312, -0.0001220703125, -2.2411346435546875e-05, 7.724761962890625e-05, 0.00017690658569335938, 0.0002765655517578125, 0.0003762245178222656, 0.00047588348388671875, 0.0005755424499511719, 0.000675201416015625, 0.0007748603820800781, 0.0008745193481445312, 0.0009741783142089844, 0.0010738372802734375, 0.0011734962463378906, 0.0012731552124023438, 0.0013728141784667969, 0.00147247314453125, 0.0015721321105957031, 0.0016717910766601562, 0.0017714500427246094, 0.0018711090087890625, 0.0019707679748535156, 0.0020704269409179688, 0.002170085906982422, 0.002269744873046875, 0.002369403839111328, 0.0024690628051757812, 0.0025687217712402344, 0.0026683807373046875, 0.0027680397033691406, 0.0028676986694335938, 0.002967357635498047, 0.0030670166015625, 0.003166675567626953, 0.0032663345336914062, 0.0033659934997558594, 0.0034656524658203125]}, "gradients/encoder.encoder.layers.23.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 3.0, 4.0, 6.0, 19.0, 14.0, 25.0, 27.0, 40.0, 42.0, 55.0, 86.0, 126.0, 167.0, 284.0, 360.0, 487.0, 783.0, 1251.0, 2173.0, 4344.0, 9579.0, 34829.0, 756174.0, 201702.0, 20440.0, 7062.0, 3245.0, 1861.0, 1129.0, 697.0, 443.0, 329.0, 230.0, 148.0, 104.0, 72.0, 64.0, 31.0, 35.0, 26.0, 17.0, 15.0, 11.0, 6.0, 3.0, 2.0, 2.0, 3.0, 1.0, 2.0, 4.0, 2.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.0511474609375, -0.04948854446411133, -0.047829627990722656, -0.046170711517333984, -0.04451179504394531, -0.04285287857055664, -0.04119396209716797, -0.0395350456237793, -0.037876129150390625, -0.03621721267700195, -0.03455829620361328, -0.03289937973022461, -0.031240463256835938, -0.029581546783447266, -0.027922630310058594, -0.026263713836669922, -0.02460479736328125, -0.022945880889892578, -0.021286964416503906, -0.019628047943115234, -0.017969131469726562, -0.01631021499633789, -0.014651298522949219, -0.012992382049560547, -0.011333465576171875, -0.009674549102783203, -0.008015632629394531, -0.006356716156005859, -0.0046977996826171875, -0.0030388832092285156, -0.0013799667358398438, 0.0002789497375488281, 0.0019378662109375, 0.003596782684326172, 0.005255699157714844, 0.006914615631103516, 0.008573532104492188, 0.01023244857788086, 0.011891365051269531, 0.013550281524658203, 0.015209197998046875, 0.016868114471435547, 0.01852703094482422, 0.02018594741821289, 0.021844863891601562, 0.023503780364990234, 0.025162696838378906, 0.026821613311767578, 0.02848052978515625, 0.030139446258544922, 0.031798362731933594, 0.033457279205322266, 0.03511619567871094, 0.03677511215209961, 0.03843402862548828, 0.04009294509887695, 0.041751861572265625, 0.0434107780456543, 0.04506969451904297, 0.04672861099243164, 0.04838752746582031, 0.050046443939208984, 0.051705360412597656, 0.05336427688598633, 0.055023193359375]}, "gradients/encoder.encoder.layers.23.attention.v_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 2.0, 1.0, 1.0, 1.0, 3.0, 2.0, 1.0, 3.0, 6.0, 12.0, 6.0, 14.0, 9.0, 13.0, 21.0, 12.0, 20.0, 22.0, 28.0, 41.0, 29.0, 31.0, 23.0, 25.0, 26.0, 23.0, 29.0, 37.0, 26.0, 56.0, 32.0, 35.0, 23.0, 54.0, 48.0, 29.0, 34.0, 19.0, 29.0, 25.0, 20.0, 17.0, 20.0, 19.0, 17.0, 15.0, 8.0, 8.0, 9.0, 6.0, 5.0, 8.0, 3.0, 1.0, 2.0, 1.0, 3.0, 2.0, 3.0, 2.0, 1.0, 1.0], "bins": [-0.0107421875, -0.010399460792541504, -0.010056734085083008, -0.009714007377624512, -0.009371280670166016, -0.00902855396270752, -0.008685827255249023, -0.008343100547790527, -0.008000373840332031, -0.007657647132873535, -0.007314920425415039, -0.006972193717956543, -0.006629467010498047, -0.006286740303039551, -0.005944013595581055, -0.005601286888122559, -0.0052585601806640625, -0.004915833473205566, -0.00457310676574707, -0.004230380058288574, -0.003887653350830078, -0.003544926643371582, -0.003202199935913086, -0.00285947322845459, -0.0025167465209960938, -0.0021740198135375977, -0.0018312931060791016, -0.0014885663986206055, -0.0011458396911621094, -0.0008031129837036133, -0.0004603862762451172, -0.0001176595687866211, 0.000225067138671875, 0.0005677938461303711, 0.0009105205535888672, 0.0012532472610473633, 0.0015959739685058594, 0.0019387006759643555, 0.0022814273834228516, 0.0026241540908813477, 0.0029668807983398438, 0.00330960750579834, 0.003652334213256836, 0.003995060920715332, 0.004337787628173828, 0.004680514335632324, 0.00502324104309082, 0.005365967750549316, 0.0057086944580078125, 0.006051421165466309, 0.006394147872924805, 0.006736874580383301, 0.007079601287841797, 0.007422327995300293, 0.007765054702758789, 0.008107781410217285, 0.008450508117675781, 0.008793234825134277, 0.009135961532592773, 0.00947868824005127, 0.009821414947509766, 0.010164141654968262, 0.010506868362426758, 0.010849595069885254, 0.01119232177734375]}, "gradients/encoder.encoder.layers.23.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 2.0, 1.0, 3.0, 4.0, 5.0, 1.0, 1.0, 1.0, 5.0, 13.0, 16.0, 23.0, 31.0, 50.0, 110.0, 276.0, 581.0, 2483.0, 64058.0, 974817.0, 4633.0, 874.0, 265.0, 139.0, 66.0, 43.0, 16.0, 14.0, 13.0, 6.0, 5.0, 3.0, 4.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.060791015625, -0.059061527252197266, -0.05733203887939453, -0.0556025505065918, -0.05387306213378906, -0.05214357376098633, -0.050414085388183594, -0.04868459701538086, -0.046955108642578125, -0.04522562026977539, -0.043496131896972656, -0.04176664352416992, -0.04003715515136719, -0.03830766677856445, -0.03657817840576172, -0.034848690032958984, -0.03311920166015625, -0.031389713287353516, -0.02966022491455078, -0.027930736541748047, -0.026201248168945312, -0.024471759796142578, -0.022742271423339844, -0.02101278305053711, -0.019283294677734375, -0.01755380630493164, -0.015824317932128906, -0.014094829559326172, -0.012365341186523438, -0.010635852813720703, -0.008906364440917969, -0.007176876068115234, -0.0054473876953125, -0.0037178993225097656, -0.0019884109497070312, -0.0002589225769042969, 0.0014705657958984375, 0.003200054168701172, 0.004929542541503906, 0.006659030914306641, 0.008388519287109375, 0.01011800765991211, 0.011847496032714844, 0.013576984405517578, 0.015306472778320312, 0.017035961151123047, 0.01876544952392578, 0.020494937896728516, 0.02222442626953125, 0.023953914642333984, 0.02568340301513672, 0.027412891387939453, 0.029142379760742188, 0.030871868133544922, 0.032601356506347656, 0.03433084487915039, 0.036060333251953125, 0.03778982162475586, 0.039519309997558594, 0.04124879837036133, 0.04297828674316406, 0.0447077751159668, 0.04643726348876953, 0.048166751861572266, 0.049896240234375]}, "gradients/encoder.encoder.layers.23.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 3.0, 5.0, 3.0, 4.0, 7.0, 5.0, 8.0, 10.0, 14.0, 19.0, 28.0, 50.0, 85.0, 146.0, 177.0, 157.0, 102.0, 70.0, 36.0, 19.0, 18.0, 7.0, 8.0, 7.0, 5.0, 7.0, 5.0, 2.0, 5.0, 1.0, 1.0, 0.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.4139881134033203e-05, -2.3443251848220825e-05, -2.2746622562408447e-05, -2.204999327659607e-05, -2.135336399078369e-05, -2.0656734704971313e-05, -1.9960105419158936e-05, -1.9263476133346558e-05, -1.856684684753418e-05, -1.7870217561721802e-05, -1.7173588275909424e-05, -1.6476958990097046e-05, -1.5780329704284668e-05, -1.508370041847229e-05, -1.4387071132659912e-05, -1.3690441846847534e-05, -1.2993812561035156e-05, -1.2297183275222778e-05, -1.16005539894104e-05, -1.0903924703598022e-05, -1.0207295417785645e-05, -9.510666131973267e-06, -8.814036846160889e-06, -8.11740756034851e-06, -7.420778274536133e-06, -6.724148988723755e-06, -6.027519702911377e-06, -5.330890417098999e-06, -4.634261131286621e-06, -3.937631845474243e-06, -3.2410025596618652e-06, -2.5443732738494873e-06, -1.8477439880371094e-06, -1.1511147022247314e-06, -4.544854164123535e-07, 2.421438694000244e-07, 9.387731552124023e-07, 1.6354024410247803e-06, 2.332031726837158e-06, 3.028661012649536e-06, 3.725290298461914e-06, 4.421919584274292e-06, 5.11854887008667e-06, 5.815178155899048e-06, 6.511807441711426e-06, 7.208436727523804e-06, 7.905066013336182e-06, 8.60169529914856e-06, 9.298324584960938e-06, 9.994953870773315e-06, 1.0691583156585693e-05, 1.1388212442398071e-05, 1.208484172821045e-05, 1.2781471014022827e-05, 1.3478100299835205e-05, 1.4174729585647583e-05, 1.4871358871459961e-05, 1.556798815727234e-05, 1.6264617443084717e-05, 1.6961246728897095e-05, 1.7657876014709473e-05, 1.835450530052185e-05, 1.905113458633423e-05, 1.9747763872146606e-05, 2.0444393157958984e-05]}, "gradients/encoder.encoder.layers.23.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 4.0, 3.0, 4.0, 7.0, 9.0, 8.0, 20.0, 12.0, 18.0, 24.0, 34.0, 68.0, 96.0, 169.0, 395.0, 986.0, 4488.0, 312368.0, 723187.0, 4671.0, 1048.0, 379.0, 214.0, 104.0, 82.0, 53.0, 40.0, 20.0, 8.0, 10.0, 11.0, 7.0, 3.0, 7.0, 1.0, 1.0, 1.0, 3.0, 2.0, 0.0, 0.0, 3.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0859375, -0.08263397216796875, -0.0793304443359375, -0.07602691650390625, -0.072723388671875, -0.06941986083984375, -0.0661163330078125, -0.06281280517578125, -0.05950927734375, -0.05620574951171875, -0.0529022216796875, -0.04959869384765625, -0.046295166015625, -0.04299163818359375, -0.0396881103515625, -0.03638458251953125, -0.0330810546875, -0.02977752685546875, -0.0264739990234375, -0.02317047119140625, -0.019866943359375, -0.01656341552734375, -0.0132598876953125, -0.00995635986328125, -0.00665283203125, -0.00334930419921875, -4.57763671875e-05, 0.00325775146484375, 0.006561279296875, 0.00986480712890625, 0.0131683349609375, 0.01647186279296875, 0.019775390625, 0.02307891845703125, 0.0263824462890625, 0.02968597412109375, 0.032989501953125, 0.03629302978515625, 0.0395965576171875, 0.04290008544921875, 0.04620361328125, 0.04950714111328125, 0.0528106689453125, 0.05611419677734375, 0.059417724609375, 0.06272125244140625, 0.0660247802734375, 0.06932830810546875, 0.0726318359375, 0.07593536376953125, 0.0792388916015625, 0.08254241943359375, 0.085845947265625, 0.08914947509765625, 0.0924530029296875, 0.09575653076171875, 0.09906005859375, 0.10236358642578125, 0.1056671142578125, 0.10897064208984375, 0.112274169921875, 0.11557769775390625, 0.1188812255859375, 0.12218475341796875, 0.12548828125]}, "gradients/encoder.encoder.layers.23.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 2.0, 1.0, 8.0, 5.0, 7.0, 6.0, 7.0, 9.0, 9.0, 10.0, 27.0, 35.0, 39.0, 82.0, 342.0, 184.0, 81.0, 55.0, 24.0, 25.0, 10.0, 7.0, 5.0, 4.0, 3.0, 4.0, 2.0, 1.0, 3.0, 4.0, 1.0, 1.0, 4.0, 1.0, 0.0, 2.0, 3.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.015411376953125, -0.01473546028137207, -0.01405954360961914, -0.013383626937866211, -0.012707710266113281, -0.012031793594360352, -0.011355876922607422, -0.010679960250854492, -0.010004043579101562, -0.009328126907348633, -0.008652210235595703, -0.007976293563842773, -0.007300376892089844, -0.006624460220336914, -0.005948543548583984, -0.005272626876831055, -0.004596710205078125, -0.003920793533325195, -0.0032448768615722656, -0.002568960189819336, -0.0018930435180664062, -0.0012171268463134766, -0.0005412101745605469, 0.0001347064971923828, 0.0008106231689453125, 0.0014865398406982422, 0.002162456512451172, 0.0028383731842041016, 0.0035142898559570312, 0.004190206527709961, 0.004866123199462891, 0.00554203987121582, 0.00621795654296875, 0.00689387321472168, 0.007569789886474609, 0.008245706558227539, 0.008921623229980469, 0.009597539901733398, 0.010273456573486328, 0.010949373245239258, 0.011625289916992188, 0.012301206588745117, 0.012977123260498047, 0.013653039932250977, 0.014328956604003906, 0.015004873275756836, 0.015680789947509766, 0.016356706619262695, 0.017032623291015625, 0.017708539962768555, 0.018384456634521484, 0.019060373306274414, 0.019736289978027344, 0.020412206649780273, 0.021088123321533203, 0.021764039993286133, 0.022439956665039062, 0.023115873336791992, 0.023791790008544922, 0.02446770668029785, 0.02514362335205078, 0.02581954002380371, 0.02649545669555664, 0.02717137336730957, 0.0278472900390625]}, "gradients/encoder.encoder.layers.23.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 5.0, 3.0, 6.0, 3.0, 8.0, 10.0, 17.0, 66.0, 364.0, 425.0, 49.0, 13.0, 10.0, 7.0, 6.0, 2.0, 1.0, 4.0, 5.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.40105634927749634, -0.3836613595485687, -0.3662663698196411, -0.3488713800907135, -0.3314763903617859, -0.3140813708305359, -0.2966863811016083, -0.27929139137268066, -0.26189640164375305, -0.24450141191482544, -0.22710642218589783, -0.20971141755580902, -0.1923164278268814, -0.1749214380979538, -0.157526433467865, -0.14013144373893738, -0.12273645401000977, -0.10534146428108215, -0.08794646710157394, -0.07055146992206573, -0.05315648019313812, -0.03576149046421051, -0.0183664932847023, -0.0009714961051940918, 0.01642349362373352, 0.03381848707795143, 0.05121348053216934, 0.06860847771167755, 0.08600346744060516, 0.10339845716953278, 0.12079345434904099, 0.1381884515285492, 0.15558350086212158, 0.1729784905910492, 0.1903734803199768, 0.2077684849500656, 0.22516347467899323, 0.24255846440792084, 0.25995346903800964, 0.27734845876693726, 0.29474344849586487, 0.3121384382247925, 0.3295334279537201, 0.3469284176826477, 0.3643234372138977, 0.38171839714050293, 0.39911341667175293, 0.41650840640068054, 0.43390339612960815, 0.45129838585853577, 0.4686933755874634, 0.486088365316391, 0.5034833550453186, 0.5208783745765686, 0.5382733345031738, 0.5556683540344238, 0.5730633735656738, 0.5904583930969238, 0.607853353023529, 0.625248372554779, 0.6426433324813843, 0.6600383520126343, 0.6774333119392395, 0.6948283314704895, 0.7122232913970947]}, "gradients/encoder.encoder.layers.23.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 3.0, 0.0, 3.0, 4.0, 7.0, 9.0, 3.0, 6.0, 11.0, 9.0, 17.0, 35.0, 45.0, 76.0, 111.0, 127.0, 130.0, 99.0, 106.0, 66.0, 33.0, 27.0, 9.0, 20.0, 8.0, 6.0, 6.0, 4.0, 8.0, 2.0, 3.0, 2.0, 1.0, 1.0, 5.0, 2.0, 0.0, 3.0, 1.0, 1.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.2344350814819336, -0.22719219326972961, -0.21994929015636444, -0.21270640194416046, -0.20546351373195648, -0.1982206106185913, -0.19097772240638733, -0.18373483419418335, -0.17649194598197937, -0.1692490577697754, -0.16200615465641022, -0.15476326644420624, -0.14752037823200226, -0.14027747511863708, -0.1330345869064331, -0.12579169869422913, -0.11854879558086395, -0.11130589991807938, -0.1040630117058754, -0.09682011604309082, -0.08957722783088684, -0.08233433216810226, -0.07509143650531769, -0.06784854829311371, -0.06060565263032913, -0.053362760692834854, -0.046119868755340576, -0.038876973092556, -0.03163408115506172, -0.024391189217567444, -0.017148293554782867, -0.00990540161728859, -0.0026625096797943115, 0.004580383189022541, 0.011823276057839394, 0.01906616985797882, 0.0263090617954731, 0.03355195373296738, 0.04079484939575195, 0.04803774133324623, 0.05528063327074051, 0.06252352893352509, 0.06976641714572906, 0.07700931280851364, 0.08425220847129822, 0.0914950966835022, 0.09873799234628677, 0.10598088800907135, 0.11322377622127533, 0.1204666718840599, 0.12770956754684448, 0.13495245575904846, 0.14219534397125244, 0.14943823218345642, 0.1566811352968216, 0.16392402350902557, 0.17116692662239075, 0.17840981483459473, 0.1856527179479599, 0.19289560616016388, 0.20013849437236786, 0.20738139748573303, 0.214624285697937, 0.221867173910141, 0.22911006212234497]}, "gradients/encoder.encoder.layers.22.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 3.0, 3.0, 4.0, 6.0, 5.0, 2.0, 9.0, 6.0, 9.0, 11.0, 13.0, 18.0, 22.0, 29.0, 75.0, 242.0, 2997.0, 4111681.0, 77355.0, 1472.0, 139.0, 49.0, 30.0, 18.0, 14.0, 15.0, 8.0, 9.0, 6.0, 12.0, 5.0, 6.0, 1.0, 2.0, 3.0, 1.0, 0.0, 1.0, 3.0, 3.0, 2.0, 1.0, 1.0, 0.0, 1.0, 2.0], "bins": [-4.375, -4.253082275390625, -4.13116455078125, -4.009246826171875, -3.8873291015625, -3.765411376953125, -3.64349365234375, -3.521575927734375, -3.399658203125, -3.277740478515625, -3.15582275390625, -3.033905029296875, -2.9119873046875, -2.790069580078125, -2.66815185546875, -2.546234130859375, -2.42431640625, -2.302398681640625, -2.18048095703125, -2.058563232421875, -1.9366455078125, -1.814727783203125, -1.69281005859375, -1.570892333984375, -1.448974609375, -1.327056884765625, -1.20513916015625, -1.083221435546875, -0.9613037109375, -0.839385986328125, -0.71746826171875, -0.595550537109375, -0.4736328125, -0.351715087890625, -0.22979736328125, -0.107879638671875, 0.0140380859375, 0.135955810546875, 0.25787353515625, 0.379791259765625, 0.501708984375, 0.623626708984375, 0.74554443359375, 0.867462158203125, 0.9893798828125, 1.111297607421875, 1.23321533203125, 1.355133056640625, 1.47705078125, 1.598968505859375, 1.72088623046875, 1.842803955078125, 1.9647216796875, 2.086639404296875, 2.20855712890625, 2.330474853515625, 2.452392578125, 2.574310302734375, 2.69622802734375, 2.818145751953125, 2.9400634765625, 3.061981201171875, 3.18389892578125, 3.305816650390625, 3.427734375]}, "gradients/encoder.encoder.layers.22.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 4.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 1.0, 0.0, 2.0, 4.0, 1.0, 4.0, 7.0, 2.0, 5.0, 0.0, 1.0, 2.0, 6.0, 10.0, 9.0, 7.0, 8.0, 13.0, 75.0, 297.0, 333.0, 112.0, 27.0, 9.0, 13.0, 7.0, 9.0, 0.0, 5.0, 7.0, 6.0, 7.0, 3.0, 3.0, 4.0, 0.0, 1.0, 0.0, 4.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.0027332305908203125, -0.0026482045650482178, -0.002563178539276123, -0.0024781525135040283, -0.0023931264877319336, -0.002308100461959839, -0.002223074436187744, -0.0021380484104156494, -0.0020530223846435547, -0.00196799635887146, -0.0018829703330993652, -0.0017979443073272705, -0.0017129182815551758, -0.001627892255783081, -0.0015428662300109863, -0.0014578402042388916, -0.0013728141784667969, -0.0012877881526947021, -0.0012027621269226074, -0.0011177361011505127, -0.001032710075378418, -0.0009476840496063232, -0.0008626580238342285, -0.0007776319980621338, -0.0006926059722900391, -0.0006075799465179443, -0.0005225539207458496, -0.0004375278949737549, -0.00035250186920166016, -0.00026747584342956543, -0.0001824498176574707, -9.742379188537598e-05, -1.239776611328125e-05, 7.262825965881348e-05, 0.0001576542854309082, 0.00024268031120300293, 0.00032770633697509766, 0.0004127323627471924, 0.0004977583885192871, 0.0005827844142913818, 0.0006678104400634766, 0.0007528364658355713, 0.000837862491607666, 0.0009228885173797607, 0.0010079145431518555, 0.0010929405689239502, 0.001177966594696045, 0.0012629926204681396, 0.0013480186462402344, 0.001433044672012329, 0.0015180706977844238, 0.0016030967235565186, 0.0016881227493286133, 0.001773148775100708, 0.0018581748008728027, 0.0019432008266448975, 0.002028226852416992, 0.002113252878189087, 0.0021982789039611816, 0.0022833049297332764, 0.002368330955505371, 0.002453356981277466, 0.0025383830070495605, 0.0026234090328216553, 0.00270843505859375]}, "gradients/encoder.encoder.layers.22.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 1.0, 2.0, 6.0, 5.0, 1.0, 7.0, 9.0, 18.0, 12.0, 20.0, 20.0, 19.0, 23.0, 41.0, 43.0, 54.0, 67.0, 73.0, 89.0, 100.0, 137.0, 380.0, 5378.0, 4171286.0, 15190.0, 497.0, 153.0, 138.0, 93.0, 93.0, 60.0, 62.0, 50.0, 37.0, 37.0, 19.0, 19.0, 12.0, 9.0, 11.0, 6.0, 2.0, 6.0, 1.0, 1.0, 1.0, 2.0, 0.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.69189453125, -0.6706771850585938, -0.6494598388671875, -0.6282424926757812, -0.607025146484375, -0.5858078002929688, -0.5645904541015625, -0.5433731079101562, -0.52215576171875, -0.5009384155273438, -0.4797210693359375, -0.45850372314453125, -0.437286376953125, -0.41606903076171875, -0.3948516845703125, -0.37363433837890625, -0.3524169921875, -0.33119964599609375, -0.3099822998046875, -0.28876495361328125, -0.267547607421875, -0.24633026123046875, -0.2251129150390625, -0.20389556884765625, -0.18267822265625, -0.16146087646484375, -0.1402435302734375, -0.11902618408203125, -0.097808837890625, -0.07659149169921875, -0.0553741455078125, -0.03415679931640625, -0.012939453125, 0.00827789306640625, 0.0294952392578125, 0.05071258544921875, 0.071929931640625, 0.09314727783203125, 0.1143646240234375, 0.13558197021484375, 0.15679931640625, 0.17801666259765625, 0.1992340087890625, 0.22045135498046875, 0.241668701171875, 0.26288604736328125, 0.2841033935546875, 0.30532073974609375, 0.3265380859375, 0.34775543212890625, 0.3689727783203125, 0.39019012451171875, 0.411407470703125, 0.43262481689453125, 0.4538421630859375, 0.47505950927734375, 0.49627685546875, 0.5174942016601562, 0.5387115478515625, 0.5599288940429688, 0.581146240234375, 0.6023635864257812, 0.6235809326171875, 0.6447982788085938, 0.666015625]}, "gradients/encoder.encoder.layers.22.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0, 0.0, 3.0, 5.0, 5.0, 2.0, 5.0, 13.0, 15.0, 19.0, 16.0, 24.0, 17.0, 27.0, 36.0, 45.0, 76.0, 76.0, 93.0, 93.0, 126.0, 126.0, 219.0, 286.0, 1114.0, 414.0, 228.0, 171.0, 158.0, 144.0, 96.0, 75.0, 72.0, 55.0, 45.0, 51.0, 34.0, 19.0, 20.0, 9.0, 12.0, 12.0, 5.0, 3.0, 10.0, 1.0, 2.0, 1.0, 4.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.005359649658203125, -0.005196750164031982, -0.00503385066986084, -0.004870951175689697, -0.004708051681518555, -0.004545152187347412, -0.0043822526931762695, -0.004219353199005127, -0.004056453704833984, -0.003893554210662842, -0.0037306547164916992, -0.0035677552223205566, -0.003404855728149414, -0.0032419562339782715, -0.003079056739807129, -0.0029161572456359863, -0.0027532577514648438, -0.002590358257293701, -0.0024274587631225586, -0.002264559268951416, -0.0021016597747802734, -0.0019387602806091309, -0.0017758607864379883, -0.0016129612922668457, -0.0014500617980957031, -0.0012871623039245605, -0.001124262809753418, -0.0009613633155822754, -0.0007984638214111328, -0.0006355643272399902, -0.00047266483306884766, -0.0003097653388977051, -0.0001468658447265625, 1.6033649444580078e-05, 0.00017893314361572266, 0.00034183263778686523, 0.0005047321319580078, 0.0006676316261291504, 0.000830531120300293, 0.0009934306144714355, 0.0011563301086425781, 0.0013192296028137207, 0.0014821290969848633, 0.0016450285911560059, 0.0018079280853271484, 0.001970827579498291, 0.0021337270736694336, 0.002296626567840576, 0.0024595260620117188, 0.0026224255561828613, 0.002785325050354004, 0.0029482245445251465, 0.003111124038696289, 0.0032740235328674316, 0.0034369230270385742, 0.003599822521209717, 0.0037627220153808594, 0.003925621509552002, 0.0040885210037231445, 0.004251420497894287, 0.00441431999206543, 0.004577219486236572, 0.004740118980407715, 0.004903018474578857, 0.00506591796875]}, "gradients/encoder.encoder.layers.22.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 4.0, 16.0, 32.0, 706.0, 227.0, 21.0, 9.0, 2.0, 0.0, 1.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.13565918803215027, -0.12024404853582382, -0.10482890903949738, -0.08941377699375153, -0.07399863749742508, -0.05858349800109863, -0.04316836595535278, -0.027753226459026337, -0.01233808696269989, 0.003077050670981407, 0.018492188304662704, 0.03390732407569885, 0.0493224635720253, 0.06473760306835175, 0.0801527351140976, 0.09556787461042404, 0.11098301410675049, 0.12639814615249634, 0.14181329309940338, 0.15722842514514923, 0.17264357209205627, 0.18805870413780212, 0.20347383618354797, 0.21888896822929382, 0.23430411517620087, 0.24971924722194672, 0.26513439416885376, 0.2805495262145996, 0.29596465826034546, 0.3113797903060913, 0.32679492235183716, 0.3422100841999054, 0.35762524604797363, 0.3730403780937195, 0.38845551013946533, 0.4038706421852112, 0.4192858040332794, 0.43470093607902527, 0.4501160681247711, 0.46553120017051697, 0.4809463620185852, 0.49636149406433105, 0.5117766261100769, 0.5271917581558228, 0.5426068902015686, 0.5580220222473145, 0.5734372138977051, 0.5888523459434509, 0.6042674779891968, 0.6196826100349426, 0.6350977420806885, 0.6505128741264343, 0.6659280061721802, 0.6813431978225708, 0.6967582702636719, 0.7121734619140625, 0.7275885343551636, 0.7430036664009094, 0.7584187984466553, 0.7738339304924011, 0.789249062538147, 0.8046642541885376, 0.8200793266296387, 0.8354945182800293, 0.8509096503257751]}, "gradients/encoder.encoder.layers.22.final_layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 4.0, 5.0, 4.0, 6.0, 4.0, 5.0, 10.0, 12.0, 13.0, 19.0, 17.0, 28.0, 27.0, 35.0, 43.0, 56.0, 49.0, 52.0, 50.0, 63.0, 69.0, 69.0, 56.0, 53.0, 46.0, 41.0, 35.0, 28.0, 28.0, 17.0, 20.0, 9.0, 5.0, 9.0, 3.0, 6.0, 4.0, 2.0, 4.0, 5.0, 4.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.03824424743652344, -0.036715056747198105, -0.03518586605787277, -0.03365667909383774, -0.032127488404512405, -0.030598297715187073, -0.02906910888850689, -0.027539920061826706, -0.026010729372501373, -0.02448153868317604, -0.022952349856495857, -0.021423161029815674, -0.01989397034049034, -0.01836477965116501, -0.016835590824484825, -0.015306401066482067, -0.013777211308479309, -0.012248021550476551, -0.010718831792473793, -0.009189642034471035, -0.007660452276468277, -0.006131262518465519, -0.004602072760462761, -0.003072883002460003, -0.0015436932444572449, -1.4503486454486847e-05, 0.0015146862715482712, 0.003043876029551029, 0.004573065787553787, 0.006102255545556545, 0.007631445303559303, 0.009160635061562061, 0.01068982481956482, 0.012219014577567577, 0.013748204335570335, 0.015277394093573093, 0.01680658385157585, 0.018335774540901184, 0.019864963367581367, 0.02139415219426155, 0.022923342883586884, 0.024452533572912216, 0.0259817223995924, 0.027510911226272583, 0.029040101915597916, 0.03056929260492325, 0.03209847956895828, 0.033627670258283615, 0.03515686094760895, 0.03668605163693428, 0.03821524232625961, 0.03974442929029465, 0.04127361997961998, 0.04280281066894531, 0.04433199763298035, 0.04586118832230568, 0.04739037901163101, 0.048919569700956345, 0.05044876039028168, 0.05197794735431671, 0.053507138043642044, 0.05503632873296738, 0.05656551569700241, 0.058094706386327744, 0.059623897075653076]}, "gradients/encoder.encoder.layers.22.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 5.0, 6.0, 12.0, 7.0, 20.0, 18.0, 42.0, 62.0, 75.0, 117.0, 197.0, 267.0, 391.0, 617.0, 976.0, 1590.0, 2589.0, 4635.0, 8533.0, 18540.0, 142369.0, 812738.0, 29142.0, 11224.0, 5895.0, 3252.0, 1874.0, 1173.0, 722.0, 510.0, 306.0, 221.0, 148.0, 86.0, 77.0, 51.0, 31.0, 19.0, 16.0, 3.0, 6.0, 4.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.05938720703125, -0.05735301971435547, -0.05531883239746094, -0.053284645080566406, -0.051250457763671875, -0.049216270446777344, -0.04718208312988281, -0.04514789581298828, -0.04311370849609375, -0.04107952117919922, -0.03904533386230469, -0.037011146545410156, -0.034976959228515625, -0.032942771911621094, -0.030908584594726562, -0.02887439727783203, -0.0268402099609375, -0.02480602264404297, -0.022771835327148438, -0.020737648010253906, -0.018703460693359375, -0.016669273376464844, -0.014635086059570312, -0.012600898742675781, -0.01056671142578125, -0.008532524108886719, -0.0064983367919921875, -0.004464149475097656, -0.002429962158203125, -0.00039577484130859375, 0.0016384124755859375, 0.0036725997924804688, 0.005706787109375, 0.007740974426269531, 0.009775161743164062, 0.011809349060058594, 0.013843536376953125, 0.015877723693847656, 0.017911911010742188, 0.01994609832763672, 0.02198028564453125, 0.02401447296142578, 0.026048660278320312, 0.028082847595214844, 0.030117034912109375, 0.032151222229003906, 0.03418540954589844, 0.03621959686279297, 0.0382537841796875, 0.04028797149658203, 0.04232215881347656, 0.044356346130371094, 0.046390533447265625, 0.048424720764160156, 0.05045890808105469, 0.05249309539794922, 0.05452728271484375, 0.05656147003173828, 0.05859565734863281, 0.060629844665527344, 0.06266403198242188, 0.0646982192993164, 0.06673240661621094, 0.06876659393310547, 0.07080078125]}, "gradients/encoder.encoder.layers.22.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 2.0, 1.0, 1.0, 1.0, 3.0, 3.0, 1.0, 4.0, 2.0, 10.0, 7.0, 8.0, 9.0, 17.0, 23.0, 128.0, 323.0, 271.0, 93.0, 25.0, 18.0, 9.0, 8.0, 9.0, 9.0, 5.0, 5.0, 5.0, 2.0, 2.0, 3.0, 2.0, 0.0, 0.0, 2.0, 0.0, 2.0, 2.0], "bins": [-0.004489898681640625, -0.004385203123092651, -0.004280507564544678, -0.004175812005996704, -0.0040711164474487305, -0.003966420888900757, -0.003861725330352783, -0.0037570297718048096, -0.003652334213256836, -0.0035476386547088623, -0.0034429430961608887, -0.003338247537612915, -0.0032335519790649414, -0.0031288564205169678, -0.003024160861968994, -0.0029194653034210205, -0.002814769744873047, -0.0027100741863250732, -0.0026053786277770996, -0.002500683069229126, -0.0023959875106811523, -0.0022912919521331787, -0.002186596393585205, -0.0020819008350372314, -0.001977205276489258, -0.0018725097179412842, -0.0017678141593933105, -0.001663118600845337, -0.0015584230422973633, -0.0014537274837493896, -0.001349031925201416, -0.0012443363666534424, -0.0011396408081054688, -0.0010349452495574951, -0.0009302496910095215, -0.0008255541324615479, -0.0007208585739135742, -0.0006161630153656006, -0.000511467456817627, -0.0004067718982696533, -0.0003020763397216797, -0.00019738078117370605, -9.268522262573242e-05, 1.2010335922241211e-05, 0.00011670589447021484, 0.00022140145301818848, 0.0003260970115661621, 0.00043079257011413574, 0.0005354881286621094, 0.000640183687210083, 0.0007448792457580566, 0.0008495748043060303, 0.0009542703628540039, 0.0010589659214019775, 0.0011636614799499512, 0.0012683570384979248, 0.0013730525970458984, 0.001477748155593872, 0.0015824437141418457, 0.0016871392726898193, 0.001791834831237793, 0.0018965303897857666, 0.0020012259483337402, 0.002105921506881714, 0.0022106170654296875]}, "gradients/encoder.encoder.layers.22.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 2.0, 5.0, 4.0, 4.0, 7.0, 7.0, 10.0, 8.0, 13.0, 16.0, 23.0, 12.0, 25.0, 32.0, 26.0, 38.0, 44.0, 39.0, 75.0, 253.0, 1805.0, 38056.0, 996901.0, 9828.0, 801.0, 158.0, 60.0, 54.0, 36.0, 32.0, 33.0, 26.0, 22.0, 16.0, 18.0, 15.0, 14.0, 6.0, 15.0, 10.0, 4.0, 1.0, 3.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 2.0, 1.0, 2.0], "bins": [-0.17333984375, -0.1682891845703125, -0.163238525390625, -0.1581878662109375, -0.15313720703125, -0.1480865478515625, -0.143035888671875, -0.1379852294921875, -0.1329345703125, -0.1278839111328125, -0.122833251953125, -0.1177825927734375, -0.11273193359375, -0.1076812744140625, -0.102630615234375, -0.0975799560546875, -0.092529296875, -0.0874786376953125, -0.082427978515625, -0.0773773193359375, -0.07232666015625, -0.0672760009765625, -0.062225341796875, -0.0571746826171875, -0.0521240234375, -0.0470733642578125, -0.042022705078125, -0.0369720458984375, -0.03192138671875, -0.0268707275390625, -0.021820068359375, -0.0167694091796875, -0.01171875, -0.0066680908203125, -0.001617431640625, 0.0034332275390625, 0.00848388671875, 0.0135345458984375, 0.018585205078125, 0.0236358642578125, 0.0286865234375, 0.0337371826171875, 0.038787841796875, 0.0438385009765625, 0.04888916015625, 0.0539398193359375, 0.058990478515625, 0.0640411376953125, 0.069091796875, 0.0741424560546875, 0.079193115234375, 0.0842437744140625, 0.08929443359375, 0.0943450927734375, 0.099395751953125, 0.1044464111328125, 0.1094970703125, 0.1145477294921875, 0.119598388671875, 0.1246490478515625, 0.12969970703125, 0.1347503662109375, 0.139801025390625, 0.1448516845703125, 0.14990234375]}, "gradients/encoder.encoder.layers.22.attention.v_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 1.0, 3.0, 7.0, 6.0, 3.0, 9.0, 6.0, 6.0, 11.0, 14.0, 13.0, 22.0, 19.0, 16.0, 33.0, 23.0, 36.0, 25.0, 29.0, 34.0, 24.0, 30.0, 39.0, 49.0, 27.0, 55.0, 42.0, 38.0, 43.0, 44.0, 37.0, 36.0, 26.0, 35.0, 31.0, 12.0, 25.0, 19.0, 13.0, 7.0, 14.0, 15.0, 5.0, 8.0, 4.0, 5.0, 2.0, 2.0, 1.0, 1.0, 2.0, 2.0, 2.0, 6.0], "bins": [-0.0087127685546875, -0.00846177339553833, -0.00821077823638916, -0.00795978307723999, -0.00770878791809082, -0.00745779275894165, -0.0072067975997924805, -0.0069558024406433105, -0.006704807281494141, -0.006453812122344971, -0.006202816963195801, -0.005951821804046631, -0.005700826644897461, -0.005449831485748291, -0.005198836326599121, -0.004947841167449951, -0.004696846008300781, -0.004445850849151611, -0.004194855690002441, -0.0039438605308532715, -0.0036928653717041016, -0.0034418702125549316, -0.0031908750534057617, -0.002939879894256592, -0.002688884735107422, -0.002437889575958252, -0.002186894416809082, -0.0019358992576599121, -0.0016849040985107422, -0.0014339089393615723, -0.0011829137802124023, -0.0009319186210632324, -0.0006809234619140625, -0.0004299283027648926, -0.00017893314361572266, 7.206201553344727e-05, 0.0003230571746826172, 0.0005740523338317871, 0.000825047492980957, 0.001076042652130127, 0.0013270378112792969, 0.0015780329704284668, 0.0018290281295776367, 0.0020800232887268066, 0.0023310184478759766, 0.0025820136070251465, 0.0028330087661743164, 0.0030840039253234863, 0.0033349990844726562, 0.003585994243621826, 0.003836989402770996, 0.004087984561920166, 0.004338979721069336, 0.004589974880218506, 0.004840970039367676, 0.005091965198516846, 0.005342960357666016, 0.0055939555168151855, 0.0058449506759643555, 0.006095945835113525, 0.006346940994262695, 0.006597936153411865, 0.006848931312561035, 0.007099926471710205, 0.007350921630859375]}, "gradients/encoder.encoder.layers.22.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 2.0, 0.0, 1.0, 2.0, 3.0, 6.0, 0.0, 5.0, 4.0, 3.0, 1.0, 7.0, 8.0, 6.0, 13.0, 17.0, 42.0, 83.0, 334.0, 3488.0, 1035439.0, 8233.0, 580.0, 141.0, 50.0, 24.0, 14.0, 10.0, 7.0, 3.0, 7.0, 6.0, 2.0, 4.0, 7.0, 1.0, 3.0, 3.0, 3.0, 1.0, 1.0, 0.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0855712890625, -0.08194732666015625, -0.0783233642578125, -0.07469940185546875, -0.071075439453125, -0.06745147705078125, -0.0638275146484375, -0.06020355224609375, -0.05657958984375, -0.05295562744140625, -0.0493316650390625, -0.04570770263671875, -0.042083740234375, -0.03845977783203125, -0.0348358154296875, -0.03121185302734375, -0.027587890625, -0.02396392822265625, -0.0203399658203125, -0.01671600341796875, -0.013092041015625, -0.00946807861328125, -0.0058441162109375, -0.00222015380859375, 0.00140380859375, 0.00502777099609375, 0.0086517333984375, 0.01227569580078125, 0.015899658203125, 0.01952362060546875, 0.0231475830078125, 0.02677154541015625, 0.0303955078125, 0.03401947021484375, 0.0376434326171875, 0.04126739501953125, 0.044891357421875, 0.04851531982421875, 0.0521392822265625, 0.05576324462890625, 0.05938720703125, 0.06301116943359375, 0.0666351318359375, 0.07025909423828125, 0.073883056640625, 0.07750701904296875, 0.0811309814453125, 0.08475494384765625, 0.08837890625, 0.09200286865234375, 0.0956268310546875, 0.09925079345703125, 0.102874755859375, 0.10649871826171875, 0.1101226806640625, 0.11374664306640625, 0.11737060546875, 0.12099456787109375, 0.1246185302734375, 0.12824249267578125, 0.131866455078125, 0.13549041748046875, 0.1391143798828125, 0.14273834228515625, 0.1463623046875]}, "gradients/encoder.encoder.layers.22.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 4.0, 6.0, 1.0, 5.0, 6.0, 9.0, 4.0, 3.0, 22.0, 18.0, 34.0, 41.0, 120.0, 237.0, 235.0, 121.0, 63.0, 27.0, 15.0, 9.0, 5.0, 2.0, 4.0, 4.0, 4.0, 5.0, 2.0, 3.0, 3.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.0003268718719482422, -0.0003190934658050537, -0.00031131505966186523, -0.00030353665351867676, -0.0002957582473754883, -0.0002879798412322998, -0.00028020143508911133, -0.00027242302894592285, -0.0002646446228027344, -0.0002568662166595459, -0.0002490878105163574, -0.00024130940437316895, -0.00023353099822998047, -0.000225752592086792, -0.00021797418594360352, -0.00021019577980041504, -0.00020241737365722656, -0.00019463896751403809, -0.0001868605613708496, -0.00017908215522766113, -0.00017130374908447266, -0.00016352534294128418, -0.0001557469367980957, -0.00014796853065490723, -0.00014019012451171875, -0.00013241171836853027, -0.0001246333122253418, -0.00011685490608215332, -0.00010907649993896484, -0.00010129809379577637, -9.351968765258789e-05, -8.574128150939941e-05, -7.796287536621094e-05, -7.018446922302246e-05, -6.240606307983398e-05, -5.462765693664551e-05, -4.684925079345703e-05, -3.9070844650268555e-05, -3.129243850708008e-05, -2.35140323638916e-05, -1.5735626220703125e-05, -7.957220077514648e-06, -1.7881393432617188e-07, 7.599592208862305e-06, 1.537799835205078e-05, 2.3156404495239258e-05, 3.0934810638427734e-05, 3.871321678161621e-05, 4.649162292480469e-05, 5.4270029067993164e-05, 6.204843521118164e-05, 6.982684135437012e-05, 7.76052474975586e-05, 8.538365364074707e-05, 9.316205978393555e-05, 0.00010094046592712402, 0.0001087188720703125, 0.00011649727821350098, 0.00012427568435668945, 0.00013205409049987793, 0.0001398324966430664, 0.00014761090278625488, 0.00015538930892944336, 0.00016316771507263184, 0.0001709461212158203]}, "gradients/encoder.encoder.layers.22.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 5.0, 3.0, 2.0, 8.0, 5.0, 15.0, 18.0, 27.0, 48.0, 74.0, 219.0, 691.0, 4162.0, 955525.0, 84403.0, 2613.0, 441.0, 145.0, 62.0, 29.0, 12.0, 12.0, 7.0, 11.0, 5.0, 3.0, 4.0, 4.0, 1.0, 1.0, 2.0, 2.0, 2.0, 1.0, 2.0], "bins": [-0.1651611328125, -0.16131114959716797, -0.15746116638183594, -0.1536111831665039, -0.14976119995117188, -0.14591121673583984, -0.1420612335205078, -0.13821125030517578, -0.13436126708984375, -0.13051128387451172, -0.1266613006591797, -0.12281131744384766, -0.11896133422851562, -0.1151113510131836, -0.11126136779785156, -0.10741138458251953, -0.1035614013671875, -0.09971141815185547, -0.09586143493652344, -0.0920114517211914, -0.08816146850585938, -0.08431148529052734, -0.08046150207519531, -0.07661151885986328, -0.07276153564453125, -0.06891155242919922, -0.06506156921386719, -0.061211585998535156, -0.057361602783203125, -0.053511619567871094, -0.04966163635253906, -0.04581165313720703, -0.041961669921875, -0.03811168670654297, -0.03426170349121094, -0.030411720275878906, -0.026561737060546875, -0.022711753845214844, -0.018861770629882812, -0.015011787414550781, -0.01116180419921875, -0.007311820983886719, -0.0034618377685546875, 0.00038814544677734375, 0.004238128662109375, 0.008088111877441406, 0.011938095092773438, 0.01578807830810547, 0.0196380615234375, 0.02348804473876953, 0.027338027954101562, 0.031188011169433594, 0.035037994384765625, 0.038887977600097656, 0.04273796081542969, 0.04658794403076172, 0.05043792724609375, 0.05428791046142578, 0.05813789367675781, 0.061987876892089844, 0.06583786010742188, 0.0696878433227539, 0.07353782653808594, 0.07738780975341797, 0.08123779296875]}, "gradients/encoder.encoder.layers.22.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 4.0, 2.0, 0.0, 3.0, 3.0, 11.0, 7.0, 6.0, 14.0, 16.0, 18.0, 49.0, 68.0, 473.0, 182.0, 51.0, 38.0, 16.0, 12.0, 7.0, 6.0, 1.0, 5.0, 4.0, 3.0, 1.0, 3.0, 4.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 4.0], "bins": [-0.038604736328125, -0.0377042293548584, -0.0368037223815918, -0.035903215408325195, -0.035002708435058594, -0.03410220146179199, -0.03320169448852539, -0.03230118751525879, -0.03140068054199219, -0.030500173568725586, -0.029599666595458984, -0.028699159622192383, -0.02779865264892578, -0.02689814567565918, -0.025997638702392578, -0.025097131729125977, -0.024196624755859375, -0.023296117782592773, -0.022395610809326172, -0.02149510383605957, -0.02059459686279297, -0.019694089889526367, -0.018793582916259766, -0.017893075942993164, -0.016992568969726562, -0.01609206199645996, -0.01519155502319336, -0.014291048049926758, -0.013390541076660156, -0.012490034103393555, -0.011589527130126953, -0.010689020156860352, -0.00978851318359375, -0.008888006210327148, -0.007987499237060547, -0.007086992263793945, -0.006186485290527344, -0.005285978317260742, -0.004385471343994141, -0.003484964370727539, -0.0025844573974609375, -0.001683950424194336, -0.0007834434509277344, 0.00011706352233886719, 0.0010175704956054688, 0.0019180774688720703, 0.002818584442138672, 0.0037190914154052734, 0.004619598388671875, 0.0055201053619384766, 0.006420612335205078, 0.00732111930847168, 0.008221626281738281, 0.009122133255004883, 0.010022640228271484, 0.010923147201538086, 0.011823654174804688, 0.012724161148071289, 0.01362466812133789, 0.014525175094604492, 0.015425682067871094, 0.016326189041137695, 0.017226696014404297, 0.0181272029876709, 0.0190277099609375]}, "gradients/encoder.encoder.layers.22.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 3.0, 4.0, 11.0, 20.0, 86.0, 703.0, 147.0, 21.0, 10.0, 4.0, 4.0, 2.0, 1.0, 1.0], "bins": [-0.9651861786842346, -0.947821855545044, -0.9304575324058533, -0.9130932092666626, -0.8957288265228271, -0.8783645033836365, -0.8610001802444458, -0.8436358571052551, -0.8262715339660645, -0.8089072108268738, -0.7915428876876831, -0.7741785049438477, -0.756814181804657, -0.7394498586654663, -0.7220855355262756, -0.704721212387085, -0.6873568296432495, -0.6699925065040588, -0.6526281833648682, -0.6352638006210327, -0.617899477481842, -0.6005351543426514, -0.5831708312034607, -0.56580650806427, -0.5484421849250793, -0.5310778617858887, -0.513713538646698, -0.49634918570518494, -0.47898486256599426, -0.4616205096244812, -0.4442561864852905, -0.42689186334609985, -0.4095275104045868, -0.3921631872653961, -0.37479883432388306, -0.3574345111846924, -0.3400701880455017, -0.32270586490631104, -0.305341511964798, -0.2879771888256073, -0.27061283588409424, -0.25324851274490356, -0.2358841747045517, -0.21851983666419983, -0.20115551352500916, -0.1837911754846573, -0.16642683744430542, -0.14906251430511475, -0.13169819116592407, -0.1143338605761528, -0.09696952998638153, -0.07960519194602966, -0.06224086135625839, -0.04487653076648712, -0.027512192726135254, -0.010147862136363983, 0.007216468453407288, 0.024580800905823708, 0.04194513335824013, 0.0593094676733017, 0.07667379826307297, 0.09403812885284424, 0.1114024668931961, 0.12876680493354797, 0.14613112807273865]}, "gradients/encoder.encoder.layers.22.layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 3.0, 5.0, 4.0, 4.0, 6.0, 5.0, 8.0, 12.0, 11.0, 18.0, 18.0, 28.0, 32.0, 51.0, 52.0, 62.0, 71.0, 69.0, 74.0, 73.0, 78.0, 61.0, 55.0, 45.0, 38.0, 21.0, 21.0, 11.0, 13.0, 10.0, 8.0, 9.0, 4.0, 2.0, 4.0, 5.0, 6.0, 4.0, 1.0, 4.0, 1.0, 0.0, 3.0, 1.0, 2.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.0941152572631836, -0.09138787537813187, -0.08866048604249954, -0.08593310415744781, -0.08320571482181549, -0.08047833293676376, -0.07775095105171204, -0.07502356171607971, -0.07229617983102798, -0.06956879794597626, -0.06684140861034393, -0.0641140267252922, -0.06138664111495018, -0.058659255504608154, -0.05593187361955643, -0.0532044880092144, -0.050477102398872375, -0.04774971678853035, -0.045022331178188324, -0.0422949492931366, -0.03956756368279457, -0.036840178072452545, -0.03411279618740082, -0.03138541057705879, -0.028658024966716766, -0.02593063935637474, -0.023203255608677864, -0.020475871860980988, -0.017748486250638962, -0.01502110157161951, -0.01229371689260006, -0.009566333144903183, -0.006838947534561157, -0.004111562855541706, -0.001384178176522255, 0.0013432065024971962, 0.004070591181516647, 0.0067979758605360985, 0.00952536053955555, 0.012252744287252426, 0.014980129897594452, 0.017707515507936478, 0.020434899255633354, 0.02316228300333023, 0.025889668613672256, 0.028617054224014282, 0.03134443610906601, 0.034071821719408035, 0.03679920732975006, 0.03952659294009209, 0.04225397855043411, 0.04498136043548584, 0.047708746045827866, 0.05043613165616989, 0.05316351354122162, 0.055890899151563644, 0.05861828476190567, 0.061345670372247696, 0.06407305598258972, 0.06680043786764145, 0.06952781975269318, 0.0722552090883255, 0.07498259097337723, 0.07770997285842896, 0.08043736219406128]}, "gradients/encoder.encoder.layers.21.feed_forward.output_dense.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 2.0, 2.0, 0.0, 2.0, 0.0, 4.0, 0.0, 0.0, 6.0, 2.0, 2.0, 0.0, 10.0, 8.0, 8.0, 8.0, 6.0, 10.0, 4.0, 18.0, 16.0, 10.0, 30.0, 38.0, 66.0, 416.0, 4192941.0, 398.0, 97.0, 30.0, 26.0, 28.0, 18.0, 2.0, 8.0, 8.0, 10.0, 8.0, 8.0, 6.0, 4.0, 0.0, 0.0, 6.0, 2.0, 2.0, 8.0, 4.0, 2.0, 2.0, 4.0, 0.0, 2.0, 4.0], "bins": [-3.16796875, -3.08123779296875, -2.9945068359375, -2.90777587890625, -2.821044921875, -2.73431396484375, -2.6475830078125, -2.56085205078125, -2.47412109375, -2.38739013671875, -2.3006591796875, -2.21392822265625, -2.127197265625, -2.04046630859375, -1.9537353515625, -1.86700439453125, -1.7802734375, -1.69354248046875, -1.6068115234375, -1.52008056640625, -1.433349609375, -1.34661865234375, -1.2598876953125, -1.17315673828125, -1.08642578125, -0.99969482421875, -0.9129638671875, -0.82623291015625, -0.739501953125, -0.65277099609375, -0.5660400390625, -0.47930908203125, -0.392578125, -0.30584716796875, -0.2191162109375, -0.13238525390625, -0.045654296875, 0.04107666015625, 0.1278076171875, 0.21453857421875, 0.30126953125, 0.38800048828125, 0.4747314453125, 0.56146240234375, 0.648193359375, 0.73492431640625, 0.8216552734375, 0.90838623046875, 0.9951171875, 1.08184814453125, 1.1685791015625, 1.25531005859375, 1.342041015625, 1.42877197265625, 1.5155029296875, 1.60223388671875, 1.68896484375, 1.77569580078125, 1.8624267578125, 1.94915771484375, 2.035888671875, 2.12261962890625, 2.2093505859375, 2.29608154296875, 2.3828125]}, "gradients/encoder.encoder.layers.21.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 1.0, 1.0, 4.0, 1.0, 1.0, 0.0, 4.0, 3.0, 4.0, 3.0, 2.0, 2.0, 12.0, 5.0, 7.0, 12.0, 12.0, 24.0, 81.0, 170.0, 257.0, 193.0, 81.0, 42.0, 24.0, 4.0, 5.0, 9.0, 8.0, 10.0, 7.0, 2.0, 6.0, 4.0, 4.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 2.0, 0.0, 2.0], "bins": [-0.00286102294921875, -0.002782970666885376, -0.002704918384552002, -0.002626866102218628, -0.002548813819885254, -0.00247076153755188, -0.002392709255218506, -0.002314656972885132, -0.002236604690551758, -0.002158552408218384, -0.0020805001258850098, -0.0020024478435516357, -0.0019243955612182617, -0.0018463432788848877, -0.0017682909965515137, -0.0016902387142181396, -0.0016121864318847656, -0.0015341341495513916, -0.0014560818672180176, -0.0013780295848846436, -0.0012999773025512695, -0.0012219250202178955, -0.0011438727378845215, -0.0010658204555511475, -0.0009877681732177734, -0.0009097158908843994, -0.0008316636085510254, -0.0007536113262176514, -0.0006755590438842773, -0.0005975067615509033, -0.0005194544792175293, -0.0004414021968841553, -0.00036334991455078125, -0.0002852976322174072, -0.0002072453498840332, -0.00012919306755065918, -5.1140785217285156e-05, 2.6911497116088867e-05, 0.00010496377944946289, 0.00018301606178283691, 0.00026106834411621094, 0.00033912062644958496, 0.000417172908782959, 0.000495225191116333, 0.000573277473449707, 0.0006513297557830811, 0.0007293820381164551, 0.0008074343204498291, 0.0008854866027832031, 0.0009635388851165771, 0.0010415911674499512, 0.0011196434497833252, 0.0011976957321166992, 0.0012757480144500732, 0.0013538002967834473, 0.0014318525791168213, 0.0015099048614501953, 0.0015879571437835693, 0.0016660094261169434, 0.0017440617084503174, 0.0018221139907836914, 0.0019001662731170654, 0.0019782185554504395, 0.0020562708377838135, 0.0021343231201171875]}, "gradients/encoder.encoder.layers.21.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 2.0, 4.0, 4.0, 4.0, 4.0, 4.0, 9.0, 9.0, 11.0, 22.0, 38.0, 51.0, 69.0, 106.0, 277.0, 1360.0, 4191067.0, 766.0, 182.0, 80.0, 71.0, 38.0, 31.0, 28.0, 16.0, 13.0, 7.0, 6.0, 5.0, 2.0, 0.0, 1.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.69873046875, -0.674102783203125, -0.64947509765625, -0.624847412109375, -0.6002197265625, -0.575592041015625, -0.55096435546875, -0.526336669921875, -0.501708984375, -0.477081298828125, -0.45245361328125, -0.427825927734375, -0.4031982421875, -0.378570556640625, -0.35394287109375, -0.329315185546875, -0.3046875, -0.280059814453125, -0.25543212890625, -0.230804443359375, -0.2061767578125, -0.181549072265625, -0.15692138671875, -0.132293701171875, -0.107666015625, -0.083038330078125, -0.05841064453125, -0.033782958984375, -0.0091552734375, 0.015472412109375, 0.04010009765625, 0.064727783203125, 0.08935546875, 0.113983154296875, 0.13861083984375, 0.163238525390625, 0.1878662109375, 0.212493896484375, 0.23712158203125, 0.261749267578125, 0.286376953125, 0.311004638671875, 0.33563232421875, 0.360260009765625, 0.3848876953125, 0.409515380859375, 0.43414306640625, 0.458770751953125, 0.4833984375, 0.508026123046875, 0.53265380859375, 0.557281494140625, 0.5819091796875, 0.606536865234375, 0.63116455078125, 0.655792236328125, 0.680419921875, 0.705047607421875, 0.72967529296875, 0.754302978515625, 0.7789306640625, 0.803558349609375, 0.82818603515625, 0.852813720703125, 0.87744140625]}, "gradients/encoder.encoder.layers.21.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 5.0, 7.0, 10.0, 26.0, 77.0, 247.0, 1631.0, 1713.0, 242.0, 77.0, 34.0, 10.0, 3.0, 3.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.0275421142578125, -0.02701246738433838, -0.026482820510864258, -0.025953173637390137, -0.025423526763916016, -0.024893879890441895, -0.024364233016967773, -0.023834586143493652, -0.02330493927001953, -0.02277529239654541, -0.02224564552307129, -0.021715998649597168, -0.021186351776123047, -0.020656704902648926, -0.020127058029174805, -0.019597411155700684, -0.019067764282226562, -0.01853811740875244, -0.01800847053527832, -0.0174788236618042, -0.016949176788330078, -0.016419529914855957, -0.015889883041381836, -0.015360236167907715, -0.014830589294433594, -0.014300942420959473, -0.013771295547485352, -0.01324164867401123, -0.01271200180053711, -0.012182354927062988, -0.011652708053588867, -0.011123061180114746, -0.010593414306640625, -0.010063767433166504, -0.009534120559692383, -0.009004473686218262, -0.00847482681274414, -0.00794517993927002, -0.0074155330657958984, -0.006885886192321777, -0.006356239318847656, -0.005826592445373535, -0.005296945571899414, -0.004767298698425293, -0.004237651824951172, -0.0037080049514770508, -0.0031783580780029297, -0.0026487112045288086, -0.0021190643310546875, -0.0015894174575805664, -0.0010597705841064453, -0.0005301237106323242, -4.76837158203125e-07, 0.000529170036315918, 0.001058816909790039, 0.0015884637832641602, 0.0021181106567382812, 0.0026477575302124023, 0.0031774044036865234, 0.0037070512771606445, 0.004236698150634766, 0.004766345024108887, 0.005295991897583008, 0.005825638771057129, 0.00635528564453125]}, "gradients/encoder.encoder.layers.21.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 153.0, 847.0, 16.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.3899803161621094, -1.3622641563415527, -1.3345481157302856, -1.306831955909729, -1.279115915298462, -1.2513997554779053, -1.2236835956573486, -1.1959675550460815, -1.168251395225525, -1.1405352354049683, -1.1128191947937012, -1.0851030349731445, -1.0573869943618774, -1.0296708345413208, -1.0019547939300537, -0.9742386341094971, -0.9465225338935852, -0.9188064336776733, -0.8910903334617615, -0.8633742332458496, -0.835658073425293, -0.8079419732093811, -0.7802258729934692, -0.7525097727775574, -0.7247936725616455, -0.6970775723457336, -0.6693614721298218, -0.6416453123092651, -0.6139292120933533, -0.5862131118774414, -0.5584970116615295, -0.5307809114456177, -0.503064751625061, -0.47534865140914917, -0.4476325213909149, -0.41991642117500305, -0.3922002911567688, -0.36448419094085693, -0.33676809072494507, -0.3090519905090332, -0.28133586049079895, -0.2536197602748871, -0.22590363025665283, -0.19818753004074097, -0.1704714149236679, -0.14275529980659485, -0.11503919959068298, -0.08732308447360992, -0.059606969356536865, -0.031890857964754105, -0.004174746572971344, 0.023541361093521118, 0.05125747621059418, 0.07897359132766724, 0.1066896915435791, 0.13440580666065216, 0.16212192177772522, 0.18983803689479828, 0.21755415201187134, 0.2452702522277832, 0.27298635244369507, 0.3007024824619293, 0.3284185826778412, 0.35613471269607544, 0.3838508129119873]}, "gradients/encoder.encoder.layers.21.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 6.0, 16.0, 40.0, 113.0, 184.0, 252.0, 213.0, 116.0, 47.0, 20.0, 7.0, 1.0, 2.0, 0.0, 0.0, 2.0], "bins": [-0.57295823097229, -0.5622075796127319, -0.5514569282531738, -0.540706217288971, -0.5299555659294128, -0.5192049145698547, -0.5084542632102966, -0.49770358204841614, -0.48695290088653564, -0.47620224952697754, -0.46545156836509705, -0.45470091700553894, -0.44395023584365845, -0.43319958448410034, -0.42244890332221985, -0.41169825196266174, -0.40094757080078125, -0.39019691944122314, -0.37944623827934265, -0.36869558691978455, -0.35794490575790405, -0.34719425439834595, -0.33644357323646545, -0.32569292187690735, -0.31494227051734924, -0.30419161915779114, -0.29344093799591064, -0.28269028663635254, -0.27193960547447205, -0.26118895411491394, -0.25043827295303345, -0.23968762159347534, -0.22893694043159485, -0.21818627417087555, -0.20743560791015625, -0.19668494164943695, -0.18593427538871765, -0.17518360912799835, -0.16443294286727905, -0.15368229150772095, -0.14293161034584045, -0.13218094408512115, -0.12143027782440186, -0.11067961156368256, -0.09992894530296326, -0.08917827904224396, -0.07842762023210526, -0.06767695397138596, -0.05692629516124725, -0.046175628900527954, -0.035424962639808655, -0.024674300104379654, -0.013923633843660355, -0.0031729675829410553, 0.0075776949524879456, 0.018328361213207245, 0.029079027473926544, 0.039829693734645844, 0.05058035999536514, 0.061331022530794144, 0.07208168506622314, 0.08283235132694244, 0.09358301758766174, 0.10433368384838104, 0.11508435010910034]}, "gradients/encoder.encoder.layers.21.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 3.0, 0.0, 2.0, 3.0, 0.0, 0.0, 2.0, 2.0, 6.0, 10.0, 10.0, 11.0, 20.0, 31.0, 16.0, 20.0, 38.0, 48.0, 49.0, 41.0, 66.0, 59.0, 4660.0, 1043013.0, 74.0, 57.0, 65.0, 52.0, 47.0, 37.0, 20.0, 21.0, 18.0, 13.0, 10.0, 6.0, 6.0, 7.0, 8.0, 2.0, 1.0, 2.0, 2.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 5.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.6826171875, -0.6620712280273438, -0.6415252685546875, -0.6209793090820312, -0.600433349609375, -0.5798873901367188, -0.5593414306640625, -0.5387954711914062, -0.51824951171875, -0.49770355224609375, -0.4771575927734375, -0.45661163330078125, -0.436065673828125, -0.41551971435546875, -0.3949737548828125, -0.37442779541015625, -0.3538818359375, -0.33333587646484375, -0.3127899169921875, -0.29224395751953125, -0.271697998046875, -0.25115203857421875, -0.2306060791015625, -0.21006011962890625, -0.18951416015625, -0.16896820068359375, -0.1484222412109375, -0.12787628173828125, -0.107330322265625, -0.08678436279296875, -0.0662384033203125, -0.04569244384765625, -0.025146484375, -0.00460052490234375, 0.0159454345703125, 0.03649139404296875, 0.057037353515625, 0.07758331298828125, 0.0981292724609375, 0.11867523193359375, 0.13922119140625, 0.15976715087890625, 0.1803131103515625, 0.20085906982421875, 0.221405029296875, 0.24195098876953125, 0.2624969482421875, 0.28304290771484375, 0.3035888671875, 0.32413482666015625, 0.3446807861328125, 0.36522674560546875, 0.385772705078125, 0.40631866455078125, 0.4268646240234375, 0.44741058349609375, 0.46795654296875, 0.48850250244140625, 0.5090484619140625, 0.5295944213867188, 0.550140380859375, 0.5706863403320312, 0.5912322998046875, 0.6117782592773438, 0.63232421875]}, "gradients/encoder.encoder.layers.21.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 14.0, 395.0, 581.0, 33.0], "bins": [-0.136474609375, -0.13427501916885376, -0.13207542896270752, -0.12987583875656128, -0.12767624855041504, -0.1254766583442688, -0.12327706813812256, -0.12107747793197632, -0.11887788772583008, -0.11667829751968384, -0.1144787073135376, -0.11227911710739136, -0.11007952690124512, -0.10787993669509888, -0.10568034648895264, -0.1034807562828064, -0.10128116607666016, -0.09908157587051392, -0.09688198566436768, -0.09468239545822144, -0.0924828052520752, -0.09028321504592896, -0.08808362483978271, -0.08588403463363647, -0.08368444442749023, -0.081484854221344, -0.07928526401519775, -0.07708567380905151, -0.07488608360290527, -0.07268649339675903, -0.07048690319061279, -0.06828731298446655, -0.06608772277832031, -0.06388813257217407, -0.06168854236602783, -0.05948895215988159, -0.05728936195373535, -0.05508977174758911, -0.05289018154144287, -0.05069059133529663, -0.04849100112915039, -0.04629141092300415, -0.04409182071685791, -0.04189223051071167, -0.03969264030456543, -0.03749305009841919, -0.03529345989227295, -0.03309386968612671, -0.03089427947998047, -0.02869468927383423, -0.02649509906768799, -0.024295508861541748, -0.022095918655395508, -0.019896328449249268, -0.017696738243103027, -0.015497148036956787, -0.013297557830810547, -0.011097967624664307, -0.008898377418518066, -0.006698787212371826, -0.004499197006225586, -0.0022996068000793457, -0.00010001659393310547, 0.0020995736122131348, 0.004299163818359375]}, "gradients/encoder.encoder.layers.21.attention.v_proj.weight": {"_type": "histogram", "values": [2.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 3.0, 3.0, 6.0, 3.0, 3.0, 7.0, 12.0, 8.0, 10.0, 16.0, 27.0, 40.0, 49.0, 71.0, 120.0, 208.0, 582.0, 4103.0, 176144.0, 857135.0, 8426.0, 909.0, 275.0, 136.0, 81.0, 47.0, 35.0, 28.0, 16.0, 14.0, 14.0, 7.0, 5.0, 5.0, 5.0, 4.0, 0.0, 3.0, 2.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.264892578125, -0.2547569274902344, -0.24462127685546875, -0.23448562622070312, -0.2243499755859375, -0.21421432495117188, -0.20407867431640625, -0.19394302368164062, -0.183807373046875, -0.17367172241210938, -0.16353607177734375, -0.15340042114257812, -0.1432647705078125, -0.13312911987304688, -0.12299346923828125, -0.11285781860351562, -0.10272216796875, -0.09258651733398438, -0.08245086669921875, -0.07231521606445312, -0.0621795654296875, -0.052043914794921875, -0.04190826416015625, -0.031772613525390625, -0.021636962890625, -0.011501312255859375, -0.00136566162109375, 0.008769989013671875, 0.0189056396484375, 0.029041290283203125, 0.03917694091796875, 0.049312591552734375, 0.0594482421875, 0.06958389282226562, 0.07971954345703125, 0.08985519409179688, 0.0999908447265625, 0.11012649536132812, 0.12026214599609375, 0.13039779663085938, 0.140533447265625, 0.15066909790039062, 0.16080474853515625, 0.17094039916992188, 0.1810760498046875, 0.19121170043945312, 0.20134735107421875, 0.21148300170898438, 0.22161865234375, 0.23175430297851562, 0.24188995361328125, 0.2520256042480469, 0.2621612548828125, 0.2722969055175781, 0.28243255615234375, 0.2925682067871094, 0.302703857421875, 0.3128395080566406, 0.32297515869140625, 0.3331108093261719, 0.3432464599609375, 0.3533821105957031, 0.36351776123046875, 0.3736534118652344, 0.3837890625]}, "gradients/encoder.encoder.layers.21.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 3.0, 1.0, 2.0, 1.0, 0.0, 0.0, 2.0, 3.0, 3.0, 7.0, 1.0, 7.0, 12.0, 11.0, 14.0, 13.0, 15.0, 28.0, 40.0, 34.0, 56.0, 60.0, 47.0, 79.0, 71.0, 68.0, 72.0, 62.0, 66.0, 53.0, 31.0, 34.0, 27.0, 21.0, 20.0, 17.0, 4.0, 9.0, 6.0, 2.0, 7.0, 2.0, 2.0, 1.0, 3.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09210205078125, -0.08881282806396484, -0.08552360534667969, -0.08223438262939453, -0.07894515991210938, -0.07565593719482422, -0.07236671447753906, -0.0690774917602539, -0.06578826904296875, -0.062499046325683594, -0.05920982360839844, -0.05592060089111328, -0.052631378173828125, -0.04934215545654297, -0.04605293273925781, -0.042763710021972656, -0.0394744873046875, -0.036185264587402344, -0.03289604187011719, -0.02960681915283203, -0.026317596435546875, -0.02302837371826172, -0.019739151000976562, -0.016449928283691406, -0.01316070556640625, -0.009871482849121094, -0.0065822601318359375, -0.0032930374145507812, -3.814697265625e-06, 0.0032854080200195312, 0.0065746307373046875, 0.009863853454589844, 0.013153076171875, 0.016442298889160156, 0.019731521606445312, 0.02302074432373047, 0.026309967041015625, 0.02959918975830078, 0.03288841247558594, 0.036177635192871094, 0.03946685791015625, 0.042756080627441406, 0.04604530334472656, 0.04933452606201172, 0.052623748779296875, 0.05591297149658203, 0.05920219421386719, 0.062491416931152344, 0.0657806396484375, 0.06906986236572266, 0.07235908508300781, 0.07564830780029297, 0.07893753051757812, 0.08222675323486328, 0.08551597595214844, 0.0888051986694336, 0.09209442138671875, 0.0953836441040039, 0.09867286682128906, 0.10196208953857422, 0.10525131225585938, 0.10854053497314453, 0.11182975769042969, 0.11511898040771484, 0.118408203125]}, "gradients/encoder.encoder.layers.21.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 2.0, 4.0, 2.0, 1.0, 4.0, 6.0, 8.0, 13.0, 9.0, 8.0, 20.0, 17.0, 21.0, 41.0, 41.0, 80.0, 120.0, 241.0, 502.0, 1982.0, 11072.0, 920945.0, 105607.0, 5768.0, 1219.0, 338.0, 146.0, 96.0, 67.0, 39.0, 28.0, 28.0, 15.0, 17.0, 10.0, 10.0, 7.0, 8.0, 0.0, 5.0, 4.0, 6.0, 2.0, 3.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.304443359375, -0.2954444885253906, -0.28644561767578125, -0.2774467468261719, -0.2684478759765625, -0.2594490051269531, -0.25045013427734375, -0.24145126342773438, -0.232452392578125, -0.22345352172851562, -0.21445465087890625, -0.20545578002929688, -0.1964569091796875, -0.18745803833007812, -0.17845916748046875, -0.16946029663085938, -0.16046142578125, -0.15146255493164062, -0.14246368408203125, -0.13346481323242188, -0.1244659423828125, -0.11546707153320312, -0.10646820068359375, -0.09746932983398438, -0.088470458984375, -0.07947158813476562, -0.07047271728515625, -0.061473846435546875, -0.0524749755859375, -0.043476104736328125, -0.03447723388671875, -0.025478363037109375, -0.0164794921875, -0.007480621337890625, 0.00151824951171875, 0.010517120361328125, 0.0195159912109375, 0.028514862060546875, 0.03751373291015625, 0.046512603759765625, 0.055511474609375, 0.06451034545898438, 0.07350921630859375, 0.08250808715820312, 0.0915069580078125, 0.10050582885742188, 0.10950469970703125, 0.11850357055664062, 0.12750244140625, 0.13650131225585938, 0.14550018310546875, 0.15449905395507812, 0.1634979248046875, 0.17249679565429688, 0.18149566650390625, 0.19049453735351562, 0.199493408203125, 0.20849227905273438, 0.21749114990234375, 0.22649002075195312, 0.2354888916015625, 0.24448776245117188, 0.25348663330078125, 0.2624855041503906, 0.271484375]}, "gradients/encoder.encoder.layers.21.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 5.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 3.0, 2.0, 2.0, 8.0, 0.0, 6.0, 4.0, 7.0, 9.0, 10.0, 19.0, 31.0, 71.0, 86.0, 130.0, 158.0, 158.0, 109.0, 65.0, 29.0, 16.0, 15.0, 10.0, 8.0, 13.0, 9.0, 7.0, 4.0, 2.0, 2.0, 1.0, 3.0, 3.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-2.1636486053466797e-05, -2.0976178348064423e-05, -2.031587064266205e-05, -1.9655562937259674e-05, -1.89952552318573e-05, -1.8334947526454926e-05, -1.767463982105255e-05, -1.7014332115650177e-05, -1.6354024410247803e-05, -1.569371670484543e-05, -1.5033408999443054e-05, -1.437310129404068e-05, -1.3712793588638306e-05, -1.3052485883235931e-05, -1.2392178177833557e-05, -1.1731870472431183e-05, -1.1071562767028809e-05, -1.0411255061626434e-05, -9.75094735622406e-06, -9.090639650821686e-06, -8.430331945419312e-06, -7.770024240016937e-06, -7.109716534614563e-06, -6.449408829212189e-06, -5.7891011238098145e-06, -5.12879341840744e-06, -4.468485713005066e-06, -3.8081780076026917e-06, -3.1478703022003174e-06, -2.487562596797943e-06, -1.8272548913955688e-06, -1.1669471859931946e-06, -5.066394805908203e-07, 1.5366822481155396e-07, 8.139759302139282e-07, 1.4742836356163025e-06, 2.1345913410186768e-06, 2.794899046421051e-06, 3.4552067518234253e-06, 4.1155144572257996e-06, 4.775822162628174e-06, 5.436129868030548e-06, 6.096437573432922e-06, 6.756745278835297e-06, 7.417052984237671e-06, 8.077360689640045e-06, 8.73766839504242e-06, 9.397976100444794e-06, 1.0058283805847168e-05, 1.0718591511249542e-05, 1.1378899216651917e-05, 1.203920692205429e-05, 1.2699514627456665e-05, 1.335982233285904e-05, 1.4020130038261414e-05, 1.4680437743663788e-05, 1.5340745449066162e-05, 1.6001053154468536e-05, 1.666136085987091e-05, 1.7321668565273285e-05, 1.798197627067566e-05, 1.8642283976078033e-05, 1.9302591681480408e-05, 1.9962899386882782e-05, 2.0623207092285156e-05]}, "gradients/encoder.encoder.layers.21.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 5.0, 2.0, 0.0, 6.0, 11.0, 5.0, 9.0, 17.0, 19.0, 33.0, 80.0, 117.0, 320.0, 1234.0, 5705.0, 954041.0, 81197.0, 4239.0, 957.0, 273.0, 109.0, 77.0, 31.0, 17.0, 20.0, 10.0, 12.0, 3.0, 3.0, 5.0, 3.0, 4.0, 3.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.9013671875, -0.87310791015625, -0.8448486328125, -0.81658935546875, -0.788330078125, -0.76007080078125, -0.7318115234375, -0.70355224609375, -0.67529296875, -0.64703369140625, -0.6187744140625, -0.59051513671875, -0.562255859375, -0.53399658203125, -0.5057373046875, -0.47747802734375, -0.44921875, -0.42095947265625, -0.3927001953125, -0.36444091796875, -0.336181640625, -0.30792236328125, -0.2796630859375, -0.25140380859375, -0.22314453125, -0.19488525390625, -0.1666259765625, -0.13836669921875, -0.110107421875, -0.08184814453125, -0.0535888671875, -0.02532958984375, 0.0029296875, 0.03118896484375, 0.0594482421875, 0.08770751953125, 0.115966796875, 0.14422607421875, 0.1724853515625, 0.20074462890625, 0.22900390625, 0.25726318359375, 0.2855224609375, 0.31378173828125, 0.342041015625, 0.37030029296875, 0.3985595703125, 0.42681884765625, 0.455078125, 0.48333740234375, 0.5115966796875, 0.53985595703125, 0.568115234375, 0.59637451171875, 0.6246337890625, 0.65289306640625, 0.68115234375, 0.70941162109375, 0.7376708984375, 0.76593017578125, 0.794189453125, 0.82244873046875, 0.8507080078125, 0.87896728515625, 0.9072265625]}, "gradients/encoder.encoder.layers.21.attention.q_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 5.0, 1.0, 0.0, 0.0, 3.0, 2.0, 3.0, 3.0, 3.0, 8.0, 11.0, 19.0, 57.0, 533.0, 261.0, 44.0, 22.0, 13.0, 5.0, 5.0, 0.0, 2.0, 3.0, 2.0, 2.0, 3.0, 0.0, 0.0, 3.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.260498046875, -0.24856948852539062, -0.23664093017578125, -0.22471237182617188, -0.2127838134765625, -0.20085525512695312, -0.18892669677734375, -0.17699813842773438, -0.165069580078125, -0.15314102172851562, -0.14121246337890625, -0.12928390502929688, -0.1173553466796875, -0.10542678833007812, -0.09349822998046875, -0.08156967163085938, -0.06964111328125, -0.057712554931640625, -0.04578399658203125, -0.033855438232421875, -0.0219268798828125, -0.009998321533203125, 0.00193023681640625, 0.013858795166015625, 0.025787353515625, 0.037715911865234375, 0.04964447021484375, 0.061573028564453125, 0.0735015869140625, 0.08543014526367188, 0.09735870361328125, 0.10928726196289062, 0.1212158203125, 0.13314437866210938, 0.14507293701171875, 0.15700149536132812, 0.1689300537109375, 0.18085861206054688, 0.19278717041015625, 0.20471572875976562, 0.216644287109375, 0.22857284545898438, 0.24050140380859375, 0.2524299621582031, 0.2643585205078125, 0.2762870788574219, 0.28821563720703125, 0.3001441955566406, 0.31207275390625, 0.3240013122558594, 0.33592987060546875, 0.3478584289550781, 0.3597869873046875, 0.3717155456542969, 0.38364410400390625, 0.3955726623535156, 0.407501220703125, 0.4194297790527344, 0.43135833740234375, 0.4432868957519531, 0.4552154541015625, 0.4671440124511719, 0.47907257080078125, 0.4910011291503906, 0.5029296875]}, "gradients/encoder.encoder.layers.21.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 5.0, 61.0, 871.0, 67.0, 9.0, 4.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.73531723022461, -8.561802864074707, -8.388288497924805, -8.214775085449219, -8.041260719299316, -7.867746353149414, -7.694231986999512, -7.520717620849609, -7.347203254699707, -7.173688888549805, -7.0001749992370605, -6.826660633087158, -6.653146266937256, -6.479632377624512, -6.306118011474609, -6.132603645324707, -5.959089756011963, -5.7855753898620605, -5.612061500549316, -5.438547134399414, -5.265032768249512, -5.091518402099609, -4.918004512786865, -4.744490146636963, -4.570976257324219, -4.397461891174316, -4.223948001861572, -4.05043363571167, -3.8769192695617676, -3.7034051418304443, -3.529891014099121, -3.3563766479492188, -3.1828627586364746, -3.0093486309051514, -2.835834264755249, -2.662320137023926, -2.4888057708740234, -2.3152916431427, -2.141777515411377, -1.9682632684707642, -1.7947490215301514, -1.6212347745895386, -1.4477205276489258, -1.2742063999176025, -1.1006921529769897, -0.927177906036377, -0.7536637783050537, -0.5801495313644409, -0.4066352844238281, -0.23312106728553772, -0.059606850147247314, 0.1139073371887207, 0.2874215841293335, 0.4609358310699463, 0.6344499588012695, 0.8079642057418823, 0.9814784526824951, 1.154992699623108, 1.3285069465637207, 1.502021074295044, 1.6755353212356567, 1.8490495681762695, 2.0225636959075928, 2.196077823638916, 2.3695921897888184]}, "gradients/encoder.encoder.layers.21.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 0.0, 3.0, 8.0, 9.0, 16.0, 54.0, 59.0, 94.0, 119.0, 138.0, 142.0, 116.0, 99.0, 73.0, 44.0, 20.0, 14.0, 6.0, 2.0, 0.0, 3.0], "bins": [-2.9110710620880127, -2.858119487762451, -2.8051681518554688, -2.7522165775299072, -2.699265241622925, -2.6463136672973633, -2.593362331390381, -2.5404107570648193, -2.487459421157837, -2.4345078468322754, -2.381556510925293, -2.3286049365997314, -2.275653600692749, -2.2227020263671875, -2.169750690460205, -2.1167991161346436, -2.063847541809082, -2.0108959674835205, -1.957944631576538, -1.9049931764602661, -1.8520417213439941, -1.7990902662277222, -1.7461388111114502, -1.6931872367858887, -1.6402359008789062, -1.5872844457626343, -1.5343329906463623, -1.4813815355300903, -1.4284300804138184, -1.3754786252975464, -1.3225271701812744, -1.269575595855713, -1.216624140739441, -1.163672685623169, -1.110721230506897, -1.057769775390625, -1.004818320274353, -0.951866865158081, -0.8989153504371643, -0.8459638953208923, -0.7930124402046204, -0.7400609850883484, -0.6871095299720764, -0.6341580152511597, -0.5812065601348877, -0.5282551050186157, -0.47530364990234375, -0.4223521947860718, -0.3694007396697998, -0.31644928455352783, -0.26349782943725586, -0.2105463445186615, -0.15759488940238953, -0.10464343428611755, -0.05169194936752319, 0.0012595057487487793, 0.05421096086502075, 0.10716242343187332, 0.1601138859987259, 0.21306535601615906, 0.26601681113243103, 0.318968266248703, 0.37191975116729736, 0.42487120628356934, 0.4778226613998413]}, "gradients/encoder.encoder.layers.20.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 13.0, 7.0, 11.0, 19.0, 26.0, 48.0, 67.0, 120.0, 239.0, 811.0, 4192047.0, 746.0, 122.0, 19.0, 5.0], "bins": [-4.48046875, -4.405242919921875, -4.33001708984375, -4.254791259765625, -4.1795654296875, -4.104339599609375, -4.02911376953125, -3.953887939453125, -3.878662109375, -3.803436279296875, -3.72821044921875, -3.652984619140625, -3.5777587890625, -3.502532958984375, -3.42730712890625, -3.352081298828125, -3.27685546875, -3.201629638671875, -3.12640380859375, -3.051177978515625, -2.9759521484375, -2.900726318359375, -2.82550048828125, -2.750274658203125, -2.675048828125, -2.599822998046875, -2.52459716796875, -2.449371337890625, -2.3741455078125, -2.298919677734375, -2.22369384765625, -2.148468017578125, -2.0732421875, -1.998016357421875, -1.92279052734375, -1.847564697265625, -1.7723388671875, -1.697113037109375, -1.62188720703125, -1.546661376953125, -1.471435546875, -1.396209716796875, -1.32098388671875, -1.245758056640625, -1.1705322265625, -1.095306396484375, -1.02008056640625, -0.944854736328125, -0.86962890625, -0.794403076171875, -0.71917724609375, -0.643951416015625, -0.5687255859375, -0.493499755859375, -0.41827392578125, -0.343048095703125, -0.267822265625, -0.192596435546875, -0.11737060546875, -0.042144775390625, 0.0330810546875, 0.108306884765625, 0.18353271484375, 0.258758544921875, 0.333984375]}, "gradients/encoder.encoder.layers.20.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 4.0, 33.0, 121.0, 328.0, 342.0, 155.0, 32.0, 7.0], "bins": [-0.1578369140625, -0.15520596504211426, -0.15257501602172852, -0.14994406700134277, -0.14731311798095703, -0.1446821689605713, -0.14205121994018555, -0.1394202709197998, -0.13678932189941406, -0.13415837287902832, -0.13152742385864258, -0.12889647483825684, -0.1262655258178711, -0.12363457679748535, -0.12100362777709961, -0.11837267875671387, -0.11574172973632812, -0.11311078071594238, -0.11047983169555664, -0.1078488826751709, -0.10521793365478516, -0.10258698463439941, -0.09995603561401367, -0.09732508659362793, -0.09469413757324219, -0.09206318855285645, -0.0894322395324707, -0.08680129051208496, -0.08417034149169922, -0.08153939247131348, -0.07890844345092773, -0.07627749443054199, -0.07364654541015625, -0.07101559638977051, -0.06838464736938477, -0.06575369834899902, -0.06312274932861328, -0.06049180030822754, -0.0578608512878418, -0.055229902267456055, -0.05259895324707031, -0.04996800422668457, -0.04733705520629883, -0.044706106185913086, -0.042075157165527344, -0.0394442081451416, -0.03681325912475586, -0.03418231010437012, -0.031551361083984375, -0.028920412063598633, -0.02628946304321289, -0.02365851402282715, -0.021027565002441406, -0.018396615982055664, -0.015765666961669922, -0.01313471794128418, -0.010503768920898438, -0.007872819900512695, -0.005241870880126953, -0.002610921859741211, 2.002716064453125e-05, 0.0026509761810302734, 0.005281925201416016, 0.007912874221801758, 0.0105438232421875]}, "gradients/encoder.encoder.layers.20.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 9.0, 30.0, 53.0, 100.0, 317.0, 1557.0, 4184059.0, 7489.0, 480.0, 133.0, 39.0, 12.0, 7.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.69921875, -2.62353515625, -2.5478515625, -2.47216796875, -2.396484375, -2.32080078125, -2.2451171875, -2.16943359375, -2.09375, -2.01806640625, -1.9423828125, -1.86669921875, -1.791015625, -1.71533203125, -1.6396484375, -1.56396484375, -1.48828125, -1.41259765625, -1.3369140625, -1.26123046875, -1.185546875, -1.10986328125, -1.0341796875, -0.95849609375, -0.8828125, -0.80712890625, -0.7314453125, -0.65576171875, -0.580078125, -0.50439453125, -0.4287109375, -0.35302734375, -0.27734375, -0.20166015625, -0.1259765625, -0.05029296875, 0.025390625, 0.10107421875, 0.1767578125, 0.25244140625, 0.328125, 0.40380859375, 0.4794921875, 0.55517578125, 0.630859375, 0.70654296875, 0.7822265625, 0.85791015625, 0.93359375, 1.00927734375, 1.0849609375, 1.16064453125, 1.236328125, 1.31201171875, 1.3876953125, 1.46337890625, 1.5390625, 1.61474609375, 1.6904296875, 1.76611328125, 1.841796875, 1.91748046875, 1.9931640625, 2.06884765625, 2.14453125]}, "gradients/encoder.encoder.layers.20.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 7.0, 30.0, 789.0, 3138.0, 92.0, 22.0, 5.0, 4.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.44970703125, -0.4395008087158203, -0.4292945861816406, -0.41908836364746094, -0.40888214111328125, -0.39867591857910156, -0.3884696960449219, -0.3782634735107422, -0.3680572509765625, -0.3578510284423828, -0.3476448059082031, -0.33743858337402344, -0.32723236083984375, -0.31702613830566406, -0.3068199157714844, -0.2966136932373047, -0.286407470703125, -0.2762012481689453, -0.2659950256347656, -0.25578880310058594, -0.24558258056640625, -0.23537635803222656, -0.22517013549804688, -0.2149639129638672, -0.2047576904296875, -0.1945514678955078, -0.18434524536132812, -0.17413902282714844, -0.16393280029296875, -0.15372657775878906, -0.14352035522460938, -0.1333141326904297, -0.12310791015625, -0.11290168762207031, -0.10269546508789062, -0.09248924255371094, -0.08228302001953125, -0.07207679748535156, -0.061870574951171875, -0.05166435241699219, -0.0414581298828125, -0.03125190734863281, -0.021045684814453125, -0.010839462280273438, -0.00063323974609375, 0.009572982788085938, 0.019779205322265625, 0.029985427856445312, 0.040191650390625, 0.05039787292480469, 0.060604095458984375, 0.07081031799316406, 0.08101654052734375, 0.09122276306152344, 0.10142898559570312, 0.11163520812988281, 0.1218414306640625, 0.1320476531982422, 0.14225387573242188, 0.15246009826660156, 0.16266632080078125, 0.17287254333496094, 0.18307876586914062, 0.1932849884033203, 0.2034912109375]}, "gradients/encoder.encoder.layers.20.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 5.0, 107.0, 878.0, 22.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-6.116090297698975, -5.97467565536499, -5.833261013031006, -5.691845893859863, -5.550431251525879, -5.4090166091918945, -5.26760196685791, -5.126187324523926, -4.984772682189941, -4.843358039855957, -4.701943397521973, -4.56052827835083, -4.419113636016846, -4.277698993682861, -4.136284351348877, -3.9948697090148926, -3.85345458984375, -3.7120399475097656, -3.570625066757202, -3.4292104244232178, -3.2877955436706543, -3.14638090133667, -3.0049662590026855, -2.863551616668701, -2.7221367359161377, -2.5807220935821533, -2.43930721282959, -2.2978925704956055, -2.156477928161621, -2.0150630474090576, -1.8736484050750732, -1.7322336435317993, -1.5908193588256836, -1.4494045972824097, -1.3079898357391357, -1.1665751934051514, -1.0251604318618774, -0.8837456703186035, -0.7423309683799744, -0.6009162664413452, -0.4595015048980713, -0.31808677315711975, -0.1766720414161682, -0.035257309675216675, 0.10615742206573486, 0.2475721836090088, 0.38898688554763794, 0.5304015874862671, 0.671816349029541, 0.8132311105728149, 0.9546458125114441, 1.0960605144500732, 1.2374752759933472, 1.378890037536621, 1.5203046798706055, 1.6617194414138794, 1.8031342029571533, 1.9445489645004272, 2.085963726043701, 2.2273783683776855, 2.36879301071167, 2.5102078914642334, 2.6516225337982178, 2.7930374145507812, 2.9344520568847656]}, "gradients/encoder.encoder.layers.20.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 3.0, 5.0, 3.0, 13.0, 18.0, 24.0, 60.0, 84.0, 99.0, 128.0, 134.0, 119.0, 106.0, 91.0, 52.0, 36.0, 19.0, 9.0, 4.0, 2.0, 4.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.1245499849319458, -1.0871564149856567, -1.0497628450393677, -1.012369155883789, -0.9749756455421448, -0.9375820159912109, -0.9001884460449219, -0.8627948760986328, -0.8254013061523438, -0.7880077362060547, -0.7506141066551208, -0.7132205367088318, -0.6758269667625427, -0.6384333372116089, -0.6010397672653198, -0.5636461973190308, -0.5262525677680969, -0.4888589680194855, -0.4514653980731964, -0.41407179832458496, -0.3766782283782959, -0.33928462862968445, -0.301891028881073, -0.26449745893478394, -0.22710385918617249, -0.18971027433872223, -0.15231668949127197, -0.11492308974266052, -0.07752950489521027, -0.04013592004776001, -0.0027423202991485596, 0.0346512496471405, 0.07204484939575195, 0.10943843424320221, 0.14683201909065247, 0.18422561883926392, 0.22161920368671417, 0.25901278853416443, 0.2964063882827759, 0.33379995822906494, 0.3711935579776764, 0.40858715772628784, 0.4459807276725769, 0.48337432742118835, 0.5207679271697998, 0.5581614971160889, 0.5955550670623779, 0.632948637008667, 0.6703422665596008, 0.7077358365058899, 0.7451294660568237, 0.7825230360031128, 0.8199166059494019, 0.8573101758956909, 0.8947038054466248, 0.9320973753929138, 0.9694910049438477, 1.0068845748901367, 1.0442781448364258, 1.0816717147827148, 1.1190654039382935, 1.1564589738845825, 1.1938525438308716, 1.2312461137771606, 1.2686396837234497]}, "gradients/encoder.encoder.layers.20.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 4.0, 2.0, 2.0, 11.0, 12.0, 11.0, 18.0, 22.0, 41.0, 52.0, 78.0, 188.0, 1200.0, 142323.0, 901911.0, 2154.0, 263.0, 91.0, 62.0, 31.0, 27.0, 20.0, 17.0, 9.0, 2.0, 2.0, 2.0, 4.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.962890625, -0.9362258911132812, -0.9095611572265625, -0.8828964233398438, -0.856231689453125, -0.8295669555664062, -0.8029022216796875, -0.7762374877929688, -0.74957275390625, -0.7229080200195312, -0.6962432861328125, -0.6695785522460938, -0.642913818359375, -0.6162490844726562, -0.5895843505859375, -0.5629196166992188, -0.5362548828125, -0.5095901489257812, -0.4829254150390625, -0.45626068115234375, -0.429595947265625, -0.40293121337890625, -0.3762664794921875, -0.34960174560546875, -0.32293701171875, -0.29627227783203125, -0.2696075439453125, -0.24294281005859375, -0.216278076171875, -0.18961334228515625, -0.1629486083984375, -0.13628387451171875, -0.109619140625, -0.08295440673828125, -0.0562896728515625, -0.02962493896484375, -0.002960205078125, 0.02370452880859375, 0.0503692626953125, 0.07703399658203125, 0.10369873046875, 0.13036346435546875, 0.1570281982421875, 0.18369293212890625, 0.210357666015625, 0.23702239990234375, 0.2636871337890625, 0.29035186767578125, 0.3170166015625, 0.34368133544921875, 0.3703460693359375, 0.39701080322265625, 0.423675537109375, 0.45034027099609375, 0.4770050048828125, 0.5036697387695312, 0.53033447265625, 0.5569992065429688, 0.5836639404296875, 0.6103286743164062, 0.636993408203125, 0.6636581420898438, 0.6903228759765625, 0.7169876098632812, 0.74365234375]}, "gradients/encoder.encoder.layers.20.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 1.0, 4.0, 17.0, 73.0, 147.0, 252.0, 287.0, 149.0, 54.0, 22.0, 5.0, 1.0, 2.0, 1.0, 1.0, 1.0], "bins": [-0.17431640625, -0.17108488082885742, -0.16785335540771484, -0.16462182998657227, -0.1613903045654297, -0.1581587791442871, -0.15492725372314453, -0.15169572830200195, -0.14846420288085938, -0.1452326774597168, -0.14200115203857422, -0.13876962661743164, -0.13553810119628906, -0.13230657577514648, -0.1290750503540039, -0.12584352493286133, -0.12261199951171875, -0.11938047409057617, -0.1161489486694336, -0.11291742324829102, -0.10968589782714844, -0.10645437240600586, -0.10322284698486328, -0.0999913215637207, -0.09675979614257812, -0.09352827072143555, -0.09029674530029297, -0.08706521987915039, -0.08383369445800781, -0.08060216903686523, -0.07737064361572266, -0.07413911819458008, -0.0709075927734375, -0.06767606735229492, -0.06444454193115234, -0.061213016510009766, -0.05798149108886719, -0.05474996566772461, -0.05151844024658203, -0.04828691482543945, -0.045055389404296875, -0.0418238639831543, -0.03859233856201172, -0.03536081314086914, -0.03212928771972656, -0.028897762298583984, -0.025666236877441406, -0.022434711456298828, -0.01920318603515625, -0.015971660614013672, -0.012740135192871094, -0.009508609771728516, -0.0062770843505859375, -0.0030455589294433594, 0.00018596649169921875, 0.003417491912841797, 0.006649017333984375, 0.009880542755126953, 0.013112068176269531, 0.01634359359741211, 0.019575119018554688, 0.022806644439697266, 0.026038169860839844, 0.029269695281982422, 0.032501220703125]}, "gradients/encoder.encoder.layers.20.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 5.0, 2.0, 7.0, 9.0, 12.0, 25.0, 29.0, 43.0, 54.0, 79.0, 191.0, 457.0, 2102.0, 84070.0, 952133.0, 7928.0, 814.0, 267.0, 128.0, 63.0, 40.0, 33.0, 23.0, 14.0, 9.0, 4.0, 11.0, 7.0, 2.0, 0.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.50390625, -0.4896087646484375, -0.475311279296875, -0.4610137939453125, -0.44671630859375, -0.4324188232421875, -0.418121337890625, -0.4038238525390625, -0.3895263671875, -0.3752288818359375, -0.360931396484375, -0.3466339111328125, -0.33233642578125, -0.3180389404296875, -0.303741455078125, -0.2894439697265625, -0.275146484375, -0.2608489990234375, -0.246551513671875, -0.2322540283203125, -0.21795654296875, -0.2036590576171875, -0.189361572265625, -0.1750640869140625, -0.1607666015625, -0.1464691162109375, -0.132171630859375, -0.1178741455078125, -0.10357666015625, -0.0892791748046875, -0.074981689453125, -0.0606842041015625, -0.04638671875, -0.0320892333984375, -0.017791748046875, -0.0034942626953125, 0.01080322265625, 0.0251007080078125, 0.039398193359375, 0.0536956787109375, 0.0679931640625, 0.0822906494140625, 0.096588134765625, 0.1108856201171875, 0.12518310546875, 0.1394805908203125, 0.153778076171875, 0.1680755615234375, 0.182373046875, 0.1966705322265625, 0.210968017578125, 0.2252655029296875, 0.23956298828125, 0.2538604736328125, 0.268157958984375, 0.2824554443359375, 0.2967529296875, 0.3110504150390625, 0.325347900390625, 0.3396453857421875, 0.35394287109375, 0.3682403564453125, 0.382537841796875, 0.3968353271484375, 0.4111328125]}, "gradients/encoder.encoder.layers.20.attention.v_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 3.0, 3.0, 3.0, 2.0, 13.0, 12.0, 11.0, 21.0, 25.0, 25.0, 50.0, 65.0, 39.0, 70.0, 72.0, 88.0, 76.0, 79.0, 79.0, 62.0, 47.0, 39.0, 27.0, 23.0, 22.0, 12.0, 13.0, 10.0, 8.0, 8.0, 4.0, 1.0, 1.0, 0.0, 2.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.182373046875, -0.17718505859375, -0.1719970703125, -0.16680908203125, -0.16162109375, -0.15643310546875, -0.1512451171875, -0.14605712890625, -0.140869140625, -0.13568115234375, -0.1304931640625, -0.12530517578125, -0.1201171875, -0.11492919921875, -0.1097412109375, -0.10455322265625, -0.099365234375, -0.09417724609375, -0.0889892578125, -0.08380126953125, -0.07861328125, -0.07342529296875, -0.0682373046875, -0.06304931640625, -0.057861328125, -0.05267333984375, -0.0474853515625, -0.04229736328125, -0.037109375, -0.03192138671875, -0.0267333984375, -0.02154541015625, -0.016357421875, -0.01116943359375, -0.0059814453125, -0.00079345703125, 0.00439453125, 0.00958251953125, 0.0147705078125, 0.01995849609375, 0.025146484375, 0.03033447265625, 0.0355224609375, 0.04071044921875, 0.0458984375, 0.05108642578125, 0.0562744140625, 0.06146240234375, 0.066650390625, 0.07183837890625, 0.0770263671875, 0.08221435546875, 0.08740234375, 0.09259033203125, 0.0977783203125, 0.10296630859375, 0.108154296875, 0.11334228515625, 0.1185302734375, 0.12371826171875, 0.12890625, 0.13409423828125, 0.1392822265625, 0.14447021484375, 0.149658203125]}, "gradients/encoder.encoder.layers.20.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 2.0, 2.0, 0.0, 3.0, 2.0, 7.0, 4.0, 6.0, 7.0, 8.0, 15.0, 16.0, 20.0, 40.0, 43.0, 91.0, 178.0, 307.0, 652.0, 1810.0, 8664.0, 167548.0, 838782.0, 25079.0, 3353.0, 954.0, 413.0, 210.0, 116.0, 76.0, 47.0, 29.0, 13.0, 13.0, 12.0, 12.0, 4.0, 8.0, 3.0, 6.0, 6.0, 2.0, 3.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.129150390625, -0.12514305114746094, -0.12113571166992188, -0.11712837219238281, -0.11312103271484375, -0.10911369323730469, -0.10510635375976562, -0.10109901428222656, -0.0970916748046875, -0.09308433532714844, -0.08907699584960938, -0.08506965637207031, -0.08106231689453125, -0.07705497741699219, -0.07304763793945312, -0.06904029846191406, -0.065032958984375, -0.06102561950683594, -0.057018280029296875, -0.05301094055175781, -0.04900360107421875, -0.04499626159667969, -0.040988922119140625, -0.03698158264160156, -0.0329742431640625, -0.028966903686523438, -0.024959564208984375, -0.020952224731445312, -0.01694488525390625, -0.012937545776367188, -0.008930206298828125, -0.0049228668212890625, -0.00091552734375, 0.0030918121337890625, 0.007099151611328125, 0.011106491088867188, 0.01511383056640625, 0.019121170043945312, 0.023128509521484375, 0.027135848999023438, 0.0311431884765625, 0.03515052795410156, 0.039157867431640625, 0.04316520690917969, 0.04717254638671875, 0.05117988586425781, 0.055187225341796875, 0.05919456481933594, 0.063201904296875, 0.06720924377441406, 0.07121658325195312, 0.07522392272949219, 0.07923126220703125, 0.08323860168457031, 0.08724594116210938, 0.09125328063964844, 0.0952606201171875, 0.09926795959472656, 0.10327529907226562, 0.10728263854980469, 0.11128997802734375, 0.11529731750488281, 0.11930465698242188, 0.12331199645996094, 0.1273193359375]}, "gradients/encoder.encoder.layers.20.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 5.0, 1.0, 0.0, 1.0, 1.0, 4.0, 6.0, 8.0, 5.0, 6.0, 22.0, 13.0, 47.0, 48.0, 56.0, 81.0, 99.0, 112.0, 106.0, 115.0, 83.0, 57.0, 58.0, 35.0, 11.0, 13.0, 9.0, 4.0, 2.0, 3.0, 0.0, 0.0, 3.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.424551010131836e-05, -1.3824552297592163e-05, -1.3403594493865967e-05, -1.298263669013977e-05, -1.2561678886413574e-05, -1.2140721082687378e-05, -1.1719763278961182e-05, -1.1298805475234985e-05, -1.0877847671508789e-05, -1.0456889867782593e-05, -1.0035932064056396e-05, -9.6149742603302e-06, -9.194016456604004e-06, -8.773058652877808e-06, -8.352100849151611e-06, -7.931143045425415e-06, -7.510185241699219e-06, -7.0892274379730225e-06, -6.668269634246826e-06, -6.24731183052063e-06, -5.826354026794434e-06, -5.405396223068237e-06, -4.984438419342041e-06, -4.563480615615845e-06, -4.1425228118896484e-06, -3.721565008163452e-06, -3.300607204437256e-06, -2.8796494007110596e-06, -2.4586915969848633e-06, -2.037733793258667e-06, -1.6167759895324707e-06, -1.1958181858062744e-06, -7.748603820800781e-07, -3.5390257835388184e-07, 6.705522537231445e-08, 4.880130290985107e-07, 9.08970832824707e-07, 1.3299286365509033e-06, 1.7508864402770996e-06, 2.171844244003296e-06, 2.592802047729492e-06, 3.0137598514556885e-06, 3.4347176551818848e-06, 3.855675458908081e-06, 4.276633262634277e-06, 4.697591066360474e-06, 5.11854887008667e-06, 5.539506673812866e-06, 5.9604644775390625e-06, 6.381422281265259e-06, 6.802380084991455e-06, 7.223337888717651e-06, 7.644295692443848e-06, 8.065253496170044e-06, 8.48621129989624e-06, 8.907169103622437e-06, 9.328126907348633e-06, 9.749084711074829e-06, 1.0170042514801025e-05, 1.0591000318527222e-05, 1.1011958122253418e-05, 1.1432915925979614e-05, 1.185387372970581e-05, 1.2274831533432007e-05, 1.2695789337158203e-05]}, "gradients/encoder.encoder.layers.20.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 2.0, 1.0, 3.0, 5.0, 17.0, 38.0, 76.0, 294.0, 1247.0, 20280.0, 1018394.0, 7148.0, 786.0, 173.0, 52.0, 25.0, 11.0, 6.0, 4.0, 2.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.42626953125, -0.41268157958984375, -0.3990936279296875, -0.38550567626953125, -0.371917724609375, -0.35832977294921875, -0.3447418212890625, -0.33115386962890625, -0.31756591796875, -0.30397796630859375, -0.2903900146484375, -0.27680206298828125, -0.263214111328125, -0.24962615966796875, -0.2360382080078125, -0.22245025634765625, -0.2088623046875, -0.19527435302734375, -0.1816864013671875, -0.16809844970703125, -0.154510498046875, -0.14092254638671875, -0.1273345947265625, -0.11374664306640625, -0.10015869140625, -0.08657073974609375, -0.0729827880859375, -0.05939483642578125, -0.045806884765625, -0.03221893310546875, -0.0186309814453125, -0.00504302978515625, 0.008544921875, 0.02213287353515625, 0.0357208251953125, 0.04930877685546875, 0.062896728515625, 0.07648468017578125, 0.0900726318359375, 0.10366058349609375, 0.11724853515625, 0.13083648681640625, 0.1444244384765625, 0.15801239013671875, 0.171600341796875, 0.18518829345703125, 0.1987762451171875, 0.21236419677734375, 0.2259521484375, 0.23954010009765625, 0.2531280517578125, 0.26671600341796875, 0.280303955078125, 0.29389190673828125, 0.3074798583984375, 0.32106781005859375, 0.33465576171875, 0.34824371337890625, 0.3618316650390625, 0.37541961669921875, 0.389007568359375, 0.40259552001953125, 0.4161834716796875, 0.42977142333984375, 0.443359375]}, "gradients/encoder.encoder.layers.20.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 3.0, 4.0, 10.0, 13.0, 12.0, 29.0, 41.0, 90.0, 151.0, 213.0, 209.0, 115.0, 50.0, 34.0, 18.0, 4.0, 9.0, 2.0, 2.0, 0.0, 2.0, 0.0, 1.0, 1.0, 2.0, 2.0], "bins": [-0.188232421875, -0.18428802490234375, -0.1803436279296875, -0.17639923095703125, -0.172454833984375, -0.16851043701171875, -0.1645660400390625, -0.16062164306640625, -0.15667724609375, -0.15273284912109375, -0.1487884521484375, -0.14484405517578125, -0.140899658203125, -0.13695526123046875, -0.1330108642578125, -0.12906646728515625, -0.1251220703125, -0.12117767333984375, -0.1172332763671875, -0.11328887939453125, -0.109344482421875, -0.10540008544921875, -0.1014556884765625, -0.09751129150390625, -0.09356689453125, -0.08962249755859375, -0.0856781005859375, -0.08173370361328125, -0.077789306640625, -0.07384490966796875, -0.0699005126953125, -0.06595611572265625, -0.06201171875, -0.05806732177734375, -0.0541229248046875, -0.05017852783203125, -0.046234130859375, -0.04228973388671875, -0.0383453369140625, -0.03440093994140625, -0.03045654296875, -0.02651214599609375, -0.0225677490234375, -0.01862335205078125, -0.014678955078125, -0.01073455810546875, -0.0067901611328125, -0.00284576416015625, 0.0010986328125, 0.00504302978515625, 0.0089874267578125, 0.01293182373046875, 0.016876220703125, 0.02082061767578125, 0.0247650146484375, 0.02870941162109375, 0.03265380859375, 0.03659820556640625, 0.0405426025390625, 0.04448699951171875, 0.048431396484375, 0.05237579345703125, 0.0563201904296875, 0.06026458740234375, 0.064208984375]}, "gradients/encoder.encoder.layers.20.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 4.0, 72.0, 894.0, 43.0, 3.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.2823108434677124, -1.1305437088012695, -0.9787766933441162, -0.8270095586776733, -0.6752424836158752, -0.5234754085540771, -0.3717082738876343, -0.21994125843048096, -0.06817412376403809, 0.0835929661989212, 0.2353600561618805, 0.387127161026001, 0.5388942360877991, 0.6906613111495972, 0.84242844581604, 0.9941954612731934, 1.1459625959396362, 1.297729730606079, 1.4494967460632324, 1.6012638807296753, 1.7530310153961182, 1.9047980308532715, 2.056565284729004, 2.208332061767578, 2.3600993156433105, 2.511866331100464, 2.6636335849761963, 2.8154006004333496, 2.967167615890503, 3.1189346313476562, 3.2707018852233887, 3.422468900680542, 3.5742363929748535, 3.726003408432007, 3.8777706623077393, 4.029537677764893, 4.181304931640625, 4.333071708679199, 4.484838962554932, 4.636606216430664, 4.788372993469238, 4.940140247344971, 5.091907024383545, 5.243674278259277, 5.39544153213501, 5.547208309173584, 5.698975563049316, 5.850742340087891, 6.002510070800781, 6.154277324676514, 6.306044101715088, 6.45781135559082, 6.609578609466553, 6.761345386505127, 6.913112640380859, 7.064879417419434, 7.216646671295166, 7.368413925170898, 7.520180702209473, 7.671947956085205, 7.8237152099609375, 7.975481986999512, 8.127248764038086, 8.279016494750977, 8.43078327178955]}, "gradients/encoder.encoder.layers.20.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 4.0, 2.0, 4.0, 4.0, 6.0, 7.0, 11.0, 15.0, 14.0, 24.0, 39.0, 26.0, 41.0, 42.0, 51.0, 71.0, 63.0, 69.0, 65.0, 75.0, 57.0, 58.0, 52.0, 51.0, 37.0, 23.0, 25.0, 22.0, 13.0, 22.0, 8.0, 2.0, 2.0, 3.0, 3.0, 3.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6482570767402649, -0.6215471625328064, -0.5948372483253479, -0.5681272745132446, -0.5414173603057861, -0.5147074460983276, -0.48799753189086914, -0.46128761768341064, -0.43457767367362976, -0.40786775946617126, -0.3811578154563904, -0.3544479012489319, -0.3277379870414734, -0.3010280430316925, -0.274318128824234, -0.24760819971561432, -0.22089827060699463, -0.19418834149837494, -0.16747841238975525, -0.14076849818229675, -0.11405856907367706, -0.08734863996505737, -0.06063872575759888, -0.03392879664897919, -0.007218867540359497, 0.019491057842969894, 0.046200983226299286, 0.07291090488433838, 0.09962083399295807, 0.12633076310157776, 0.15304067730903625, 0.17975060641765594, 0.20646047592163086, 0.23317040503025055, 0.25988033413887024, 0.28659024834632874, 0.3133001923561096, 0.3400101065635681, 0.3667200207710266, 0.3934299349784851, 0.420139878988266, 0.4468497931957245, 0.47355973720550537, 0.5002696514129639, 0.5269795656204224, 0.5536894798278809, 0.5803993940353394, 0.6071093678474426, 0.6338192820549011, 0.6605291962623596, 0.6872391104698181, 0.7139490842819214, 0.7406589984893799, 0.7673689126968384, 0.7940788269042969, 0.8207887411117554, 0.8474986553192139, 0.8742085695266724, 0.9009184837341309, 0.9276283979415894, 0.9543383717536926, 0.9810482859611511, 1.0077581405639648, 1.034468173980713, 1.0611780881881714]}, "gradients/encoder.encoder.layers.19.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 2.0, 3.0, 2.0, 2.0, 4.0, 1.0, 5.0, 4.0, 4.0, 3.0, 6.0, 5.0, 5.0, 6.0, 8.0, 17.0, 15.0, 11.0, 23.0, 26.0, 28.0, 25.0, 51.0, 55.0, 62.0, 114.0, 236.0, 946.0, 11983.0, 4080819.0, 96755.0, 2589.0, 337.0, 88.0, 36.0, 14.0, 6.0, 0.0, 1.0, 0.0, 3.0], "bins": [-1.0390625, -1.0193595886230469, -0.9996566772460938, -0.9799537658691406, -0.9602508544921875, -0.9405479431152344, -0.9208450317382812, -0.9011421203613281, -0.881439208984375, -0.8617362976074219, -0.8420333862304688, -0.8223304748535156, -0.8026275634765625, -0.7829246520996094, -0.7632217407226562, -0.7435188293457031, -0.72381591796875, -0.7041130065917969, -0.6844100952148438, -0.6647071838378906, -0.6450042724609375, -0.6253013610839844, -0.6055984497070312, -0.5858955383300781, -0.566192626953125, -0.5464897155761719, -0.5267868041992188, -0.5070838928222656, -0.4873809814453125, -0.4676780700683594, -0.44797515869140625, -0.4282722473144531, -0.4085693359375, -0.3888664245605469, -0.36916351318359375, -0.3494606018066406, -0.3297576904296875, -0.3100547790527344, -0.29035186767578125, -0.2706489562988281, -0.250946044921875, -0.23124313354492188, -0.21154022216796875, -0.19183731079101562, -0.1721343994140625, -0.15243148803710938, -0.13272857666015625, -0.11302566528320312, -0.09332275390625, -0.07361984252929688, -0.05391693115234375, -0.034214019775390625, -0.0145111083984375, 0.005191802978515625, 0.02489471435546875, 0.044597625732421875, 0.064300537109375, 0.08400344848632812, 0.10370635986328125, 0.12340927124023438, 0.1431121826171875, 0.16281509399414062, 0.18251800537109375, 0.20222091674804688, 0.221923828125]}, "gradients/encoder.encoder.layers.19.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 2.0, 2.0, 13.0, 34.0, 91.0, 175.0, 254.0, 226.0, 134.0, 55.0, 23.0, 8.0, 0.0, 1.0, 1.0], "bins": [-0.165771484375, -0.1627941131591797, -0.15981674194335938, -0.15683937072753906, -0.15386199951171875, -0.15088462829589844, -0.14790725708007812, -0.1449298858642578, -0.1419525146484375, -0.1389751434326172, -0.13599777221679688, -0.13302040100097656, -0.13004302978515625, -0.12706565856933594, -0.12408828735351562, -0.12111091613769531, -0.118133544921875, -0.11515617370605469, -0.11217880249023438, -0.10920143127441406, -0.10622406005859375, -0.10324668884277344, -0.10026931762695312, -0.09729194641113281, -0.0943145751953125, -0.09133720397949219, -0.08835983276367188, -0.08538246154785156, -0.08240509033203125, -0.07942771911621094, -0.07645034790039062, -0.07347297668457031, -0.07049560546875, -0.06751823425292969, -0.06454086303710938, -0.06156349182128906, -0.05858612060546875, -0.05560874938964844, -0.052631378173828125, -0.04965400695800781, -0.0466766357421875, -0.04369926452636719, -0.040721893310546875, -0.03774452209472656, -0.03476715087890625, -0.03178977966308594, -0.028812408447265625, -0.025835037231445312, -0.022857666015625, -0.019880294799804688, -0.016902923583984375, -0.013925552368164062, -0.01094818115234375, -0.007970809936523438, -0.004993438720703125, -0.0020160675048828125, 0.0009613037109375, 0.0039386749267578125, 0.006916046142578125, 0.009893417358398438, 0.01287078857421875, 0.015848159790039062, 0.018825531005859375, 0.021802902221679688, 0.0247802734375]}, "gradients/encoder.encoder.layers.19.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 2.0, 3.0, 2.0, 6.0, 12.0, 15.0, 42.0, 95.0, 175.0, 409.0, 1315.0, 326442.0, 3864130.0, 1101.0, 296.0, 145.0, 58.0, 26.0, 10.0, 7.0, 3.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-2.322265625, -2.2758560180664062, -2.2294464111328125, -2.1830368041992188, -2.136627197265625, -2.0902175903320312, -2.0438079833984375, -1.9973983764648438, -1.95098876953125, -1.9045791625976562, -1.8581695556640625, -1.8117599487304688, -1.765350341796875, -1.7189407348632812, -1.6725311279296875, -1.6261215209960938, -1.5797119140625, -1.5333023071289062, -1.4868927001953125, -1.4404830932617188, -1.394073486328125, -1.3476638793945312, -1.3012542724609375, -1.2548446655273438, -1.20843505859375, -1.1620254516601562, -1.1156158447265625, -1.0692062377929688, -1.022796630859375, -0.9763870239257812, -0.9299774169921875, -0.8835678100585938, -0.837158203125, -0.7907485961914062, -0.7443389892578125, -0.6979293823242188, -0.651519775390625, -0.6051101684570312, -0.5587005615234375, -0.5122909545898438, -0.46588134765625, -0.41947174072265625, -0.3730621337890625, -0.32665252685546875, -0.280242919921875, -0.23383331298828125, -0.1874237060546875, -0.14101409912109375, -0.0946044921875, -0.04819488525390625, -0.0017852783203125, 0.04462432861328125, 0.091033935546875, 0.13744354248046875, 0.1838531494140625, 0.23026275634765625, 0.27667236328125, 0.32308197021484375, 0.3694915771484375, 0.41590118408203125, 0.462310791015625, 0.5087203979492188, 0.5551300048828125, 0.6015396118164062, 0.64794921875]}, "gradients/encoder.encoder.layers.19.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 3.0, 1.0, 5.0, 7.0, 15.0, 30.0, 89.0, 255.0, 3033.0, 434.0, 96.0, 60.0, 34.0, 9.0, 9.0, 4.0, 1.0, 2.0, 1.0], "bins": [-0.235595703125, -0.23120403289794922, -0.22681236267089844, -0.22242069244384766, -0.21802902221679688, -0.2136373519897461, -0.2092456817626953, -0.20485401153564453, -0.20046234130859375, -0.19607067108154297, -0.1916790008544922, -0.1872873306274414, -0.18289566040039062, -0.17850399017333984, -0.17411231994628906, -0.16972064971923828, -0.1653289794921875, -0.16093730926513672, -0.15654563903808594, -0.15215396881103516, -0.14776229858398438, -0.1433706283569336, -0.1389789581298828, -0.13458728790283203, -0.13019561767578125, -0.12580394744873047, -0.12141227722167969, -0.1170206069946289, -0.11262893676757812, -0.10823726654052734, -0.10384559631347656, -0.09945392608642578, -0.095062255859375, -0.09067058563232422, -0.08627891540527344, -0.08188724517822266, -0.07749557495117188, -0.0731039047241211, -0.06871223449707031, -0.06432056427001953, -0.05992889404296875, -0.05553722381591797, -0.05114555358886719, -0.046753883361816406, -0.042362213134765625, -0.037970542907714844, -0.03357887268066406, -0.02918720245361328, -0.0247955322265625, -0.02040386199951172, -0.016012191772460938, -0.011620521545410156, -0.007228851318359375, -0.0028371810913085938, 0.0015544891357421875, 0.005946159362792969, 0.01033782958984375, 0.014729499816894531, 0.019121170043945312, 0.023512840270996094, 0.027904510498046875, 0.032296180725097656, 0.03668785095214844, 0.04107952117919922, 0.04547119140625]}, "gradients/encoder.encoder.layers.19.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 3.0, 3.0, 2.0, 28.0, 350.0, 589.0, 34.0, 3.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.0609214305877686, -0.995305061340332, -0.9296887516975403, -0.8640724420547485, -0.798456072807312, -0.7328397035598755, -0.6672233939170837, -0.601607084274292, -0.5359907150268555, -0.47037437558174133, -0.4047580361366272, -0.33914169669151306, -0.2735253572463989, -0.2079090178012848, -0.14229267835617065, -0.07667633891105652, -0.011059999465942383, 0.05455633997917175, 0.12017267942428589, 0.18578901886940002, 0.25140535831451416, 0.3170216977596283, 0.38263803720474243, 0.44825437664985657, 0.5138707160949707, 0.5794870853424072, 0.645103394985199, 0.7107197046279907, 0.7763360738754272, 0.8419524431228638, 0.9075687527656555, 0.9731850624084473, 1.0388011932373047, 1.1044175624847412, 1.1700339317321777, 1.2356501817703247, 1.3012665510177612, 1.3668829202651978, 1.4324991703033447, 1.4981155395507812, 1.5637319087982178, 1.6293482780456543, 1.6949646472930908, 1.7605808973312378, 1.8261972665786743, 1.8918136358261108, 1.9574298858642578, 2.0230462551116943, 2.088662624359131, 2.1542789936065674, 2.219895362854004, 2.2855117321014404, 2.351128101348877, 2.4167442321777344, 2.482360601425171, 2.5479769706726074, 2.613593339920044, 2.6792097091674805, 2.744826078414917, 2.8104424476623535, 2.876058578491211, 2.9416749477386475, 3.007291316986084, 3.0729076862335205, 3.138524055480957]}, "gradients/encoder.encoder.layers.19.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 1.0, 1.0, 4.0, 2.0, 4.0, 7.0, 11.0, 14.0, 29.0, 40.0, 68.0, 98.0, 86.0, 85.0, 96.0, 101.0, 95.0, 89.0, 58.0, 38.0, 27.0, 28.0, 13.0, 6.0, 2.0, 3.0, 1.0, 3.0, 3.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6604729294776917, -0.6416794657707214, -0.622886061668396, -0.6040925979614258, -0.5852991938591003, -0.5665057301521301, -0.5477123260498047, -0.5289188623428345, -0.5101253986358643, -0.49133196473121643, -0.4725385308265686, -0.4537450969219208, -0.43495166301727295, -0.41615819931030273, -0.3973647654056549, -0.3785713315010071, -0.35977792739868164, -0.3409844934940338, -0.322191059589386, -0.30339762568473816, -0.28460419178009033, -0.2658107280731201, -0.2470172941684723, -0.22822386026382446, -0.20943042635917664, -0.1906369924545288, -0.17184355854988098, -0.15305010974407196, -0.13425667583942413, -0.1154632419347763, -0.09666980057954788, -0.07787635922431946, -0.059082865715026855, -0.04028942808508873, -0.021495990455150604, -0.0027025528252124786, 0.016090884804725647, 0.034884318709373474, 0.0536777600646019, 0.07247120141983032, 0.09126463532447815, 0.11005806922912598, 0.1288515031337738, 0.14764495193958282, 0.16643838584423065, 0.18523181974887848, 0.2040252685546875, 0.22281870245933533, 0.24161213636398315, 0.260405570268631, 0.2791990041732788, 0.29799243807792664, 0.31678587198257446, 0.3355793356895447, 0.3543727695941925, 0.37316620349884033, 0.39195963740348816, 0.410753071308136, 0.4295465052127838, 0.44833993911743164, 0.46713340282440186, 0.4859268069267273, 0.5047202706336975, 0.523513674736023, 0.5423071384429932]}, "gradients/encoder.encoder.layers.19.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 3.0, 0.0, 3.0, 2.0, 3.0, 4.0, 3.0, 5.0, 4.0, 12.0, 15.0, 13.0, 12.0, 18.0, 22.0, 29.0, 37.0, 48.0, 56.0, 97.0, 208.0, 595.0, 2331.0, 20825.0, 638686.0, 368628.0, 13944.0, 1907.0, 499.0, 194.0, 95.0, 50.0, 41.0, 41.0, 28.0, 19.0, 21.0, 14.0, 15.0, 8.0, 5.0, 7.0, 5.0, 4.0, 1.0, 5.0, 0.0, 0.0, 1.0, 3.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-0.356689453125, -0.345855712890625, -0.33502197265625, -0.324188232421875, -0.3133544921875, -0.302520751953125, -0.29168701171875, -0.280853271484375, -0.27001953125, -0.259185791015625, -0.24835205078125, -0.237518310546875, -0.2266845703125, -0.215850830078125, -0.20501708984375, -0.194183349609375, -0.183349609375, -0.172515869140625, -0.16168212890625, -0.150848388671875, -0.1400146484375, -0.129180908203125, -0.11834716796875, -0.107513427734375, -0.0966796875, -0.085845947265625, -0.07501220703125, -0.064178466796875, -0.0533447265625, -0.042510986328125, -0.03167724609375, -0.020843505859375, -0.010009765625, 0.000823974609375, 0.01165771484375, 0.022491455078125, 0.0333251953125, 0.044158935546875, 0.05499267578125, 0.065826416015625, 0.07666015625, 0.087493896484375, 0.09832763671875, 0.109161376953125, 0.1199951171875, 0.130828857421875, 0.14166259765625, 0.152496337890625, 0.163330078125, 0.174163818359375, 0.18499755859375, 0.195831298828125, 0.2066650390625, 0.217498779296875, 0.22833251953125, 0.239166259765625, 0.25, 0.260833740234375, 0.27166748046875, 0.282501220703125, 0.2933349609375, 0.304168701171875, 0.31500244140625, 0.325836181640625, 0.336669921875]}, "gradients/encoder.encoder.layers.19.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 2.0, 1.0, 2.0, 1.0, 0.0, 1.0, 15.0, 14.0, 55.0, 110.0, 147.0, 201.0, 191.0, 120.0, 80.0, 42.0, 22.0, 9.0, 4.0, 1.0, 2.0, 0.0, 1.0], "bins": [-0.1663818359375, -0.16322898864746094, -0.16007614135742188, -0.1569232940673828, -0.15377044677734375, -0.1506175994873047, -0.14746475219726562, -0.14431190490722656, -0.1411590576171875, -0.13800621032714844, -0.13485336303710938, -0.1317005157470703, -0.12854766845703125, -0.1253948211669922, -0.12224197387695312, -0.11908912658691406, -0.115936279296875, -0.11278343200683594, -0.10963058471679688, -0.10647773742675781, -0.10332489013671875, -0.10017204284667969, -0.09701919555664062, -0.09386634826660156, -0.0907135009765625, -0.08756065368652344, -0.08440780639648438, -0.08125495910644531, -0.07810211181640625, -0.07494926452636719, -0.07179641723632812, -0.06864356994628906, -0.06549072265625, -0.06233787536621094, -0.059185028076171875, -0.05603218078613281, -0.05287933349609375, -0.04972648620605469, -0.046573638916015625, -0.04342079162597656, -0.0402679443359375, -0.03711509704589844, -0.033962249755859375, -0.030809402465820312, -0.02765655517578125, -0.024503707885742188, -0.021350860595703125, -0.018198013305664062, -0.015045166015625, -0.011892318725585938, -0.008739471435546875, -0.0055866241455078125, -0.00243377685546875, 0.0007190704345703125, 0.003871917724609375, 0.0070247650146484375, 0.0101776123046875, 0.013330459594726562, 0.016483306884765625, 0.019636154174804688, 0.02278900146484375, 0.025941848754882812, 0.029094696044921875, 0.03224754333496094, 0.035400390625]}, "gradients/encoder.encoder.layers.19.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 2.0, 2.0, 4.0, 8.0, 7.0, 15.0, 11.0, 19.0, 19.0, 35.0, 43.0, 66.0, 90.0, 103.0, 188.0, 358.0, 824.0, 2456.0, 11586.0, 115874.0, 832645.0, 71975.0, 8714.0, 1961.0, 660.0, 307.0, 166.0, 97.0, 74.0, 48.0, 43.0, 37.0, 28.0, 19.0, 18.0, 16.0, 10.0, 7.0, 8.0, 2.0, 2.0, 2.0, 5.0, 3.0, 2.0, 2.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.233642578125, -0.22620582580566406, -0.21876907348632812, -0.2113323211669922, -0.20389556884765625, -0.1964588165283203, -0.18902206420898438, -0.18158531188964844, -0.1741485595703125, -0.16671180725097656, -0.15927505493164062, -0.1518383026123047, -0.14440155029296875, -0.1369647979736328, -0.12952804565429688, -0.12209129333496094, -0.114654541015625, -0.10721778869628906, -0.09978103637695312, -0.09234428405761719, -0.08490753173828125, -0.07747077941894531, -0.07003402709960938, -0.06259727478027344, -0.0551605224609375, -0.04772377014160156, -0.040287017822265625, -0.03285026550292969, -0.02541351318359375, -0.017976760864257812, -0.010540008544921875, -0.0031032562255859375, 0.00433349609375, 0.011770248413085938, 0.019207000732421875, 0.026643753051757812, 0.03408050537109375, 0.04151725769042969, 0.048954010009765625, 0.05639076232910156, 0.0638275146484375, 0.07126426696777344, 0.07870101928710938, 0.08613777160644531, 0.09357452392578125, 0.10101127624511719, 0.10844802856445312, 0.11588478088378906, 0.123321533203125, 0.13075828552246094, 0.13819503784179688, 0.1456317901611328, 0.15306854248046875, 0.1605052947998047, 0.16794204711914062, 0.17537879943847656, 0.1828155517578125, 0.19025230407714844, 0.19768905639648438, 0.2051258087158203, 0.21256256103515625, 0.2199993133544922, 0.22743606567382812, 0.23487281799316406, 0.2423095703125]}, "gradients/encoder.encoder.layers.19.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 4.0, 1.0, 3.0, 4.0, 5.0, 9.0, 13.0, 13.0, 10.0, 23.0, 25.0, 26.0, 30.0, 37.0, 31.0, 39.0, 43.0, 44.0, 50.0, 62.0, 58.0, 66.0, 51.0, 43.0, 43.0, 44.0, 44.0, 28.0, 24.0, 29.0, 22.0, 12.0, 16.0, 13.0, 9.0, 9.0, 3.0, 8.0, 5.0, 3.0, 6.0, 1.0, 2.0, 0.0, 1.0, 0.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.1475830078125, -0.14284706115722656, -0.13811111450195312, -0.1333751678466797, -0.12863922119140625, -0.12390327453613281, -0.11916732788085938, -0.11443138122558594, -0.1096954345703125, -0.10495948791503906, -0.10022354125976562, -0.09548759460449219, -0.09075164794921875, -0.08601570129394531, -0.08127975463867188, -0.07654380798339844, -0.071807861328125, -0.06707191467285156, -0.062335968017578125, -0.05760002136230469, -0.05286407470703125, -0.04812812805175781, -0.043392181396484375, -0.03865623474121094, -0.0339202880859375, -0.029184341430664062, -0.024448394775390625, -0.019712448120117188, -0.01497650146484375, -0.010240554809570312, -0.005504608154296875, -0.0007686614990234375, 0.00396728515625, 0.008703231811523438, 0.013439178466796875, 0.018175125122070312, 0.02291107177734375, 0.027647018432617188, 0.032382965087890625, 0.03711891174316406, 0.0418548583984375, 0.04659080505371094, 0.051326751708984375, 0.05606269836425781, 0.06079864501953125, 0.06553459167480469, 0.07027053833007812, 0.07500648498535156, 0.079742431640625, 0.08447837829589844, 0.08921432495117188, 0.09395027160644531, 0.09868621826171875, 0.10342216491699219, 0.10815811157226562, 0.11289405822753906, 0.1176300048828125, 0.12236595153808594, 0.12710189819335938, 0.1318378448486328, 0.13657379150390625, 0.1413097381591797, 0.14604568481445312, 0.15078163146972656, 0.155517578125]}, "gradients/encoder.encoder.layers.19.attention.k_proj.weight": {"_type": "histogram", "values": [3.0, 0.0, 0.0, 1.0, 1.0, 3.0, 1.0, 2.0, 1.0, 2.0, 1.0, 4.0, 3.0, 6.0, 3.0, 4.0, 8.0, 13.0, 31.0, 24.0, 52.0, 65.0, 109.0, 166.0, 240.0, 418.0, 806.0, 1667.0, 3936.0, 11868.0, 66263.0, 757063.0, 175169.0, 20412.0, 5577.0, 2212.0, 1071.0, 539.0, 302.0, 169.0, 109.0, 66.0, 48.0, 45.0, 22.0, 22.0, 15.0, 5.0, 7.0, 3.0, 3.0, 2.0, 2.0, 2.0, 2.0, 0.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.0914306640625, -0.08854484558105469, -0.08565902709960938, -0.08277320861816406, -0.07988739013671875, -0.07700157165527344, -0.07411575317382812, -0.07122993469238281, -0.0683441162109375, -0.06545829772949219, -0.06257247924804688, -0.05968666076660156, -0.05680084228515625, -0.05391502380371094, -0.051029205322265625, -0.04814338684082031, -0.045257568359375, -0.04237174987792969, -0.039485931396484375, -0.03660011291503906, -0.03371429443359375, -0.030828475952148438, -0.027942657470703125, -0.025056838989257812, -0.0221710205078125, -0.019285202026367188, -0.016399383544921875, -0.013513565063476562, -0.01062774658203125, -0.0077419281005859375, -0.004856109619140625, -0.0019702911376953125, 0.00091552734375, 0.0038013458251953125, 0.006687164306640625, 0.009572982788085938, 0.01245880126953125, 0.015344619750976562, 0.018230438232421875, 0.021116256713867188, 0.0240020751953125, 0.026887893676757812, 0.029773712158203125, 0.03265953063964844, 0.03554534912109375, 0.03843116760253906, 0.041316986083984375, 0.04420280456542969, 0.047088623046875, 0.04997444152832031, 0.052860260009765625, 0.05574607849121094, 0.05863189697265625, 0.06151771545410156, 0.06440353393554688, 0.06728935241699219, 0.0701751708984375, 0.07306098937988281, 0.07594680786132812, 0.07883262634277344, 0.08171844482421875, 0.08460426330566406, 0.08749008178710938, 0.09037590026855469, 0.09326171875]}, "gradients/encoder.encoder.layers.19.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 4.0, 0.0, 1.0, 2.0, 5.0, 6.0, 12.0, 7.0, 24.0, 50.0, 79.0, 129.0, 176.0, 175.0, 116.0, 80.0, 60.0, 30.0, 23.0, 11.0, 13.0, 5.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.6047229766845703e-05, -2.530217170715332e-05, -2.4557113647460938e-05, -2.3812055587768555e-05, -2.3066997528076172e-05, -2.232193946838379e-05, -2.1576881408691406e-05, -2.0831823348999023e-05, -2.008676528930664e-05, -1.9341707229614258e-05, -1.8596649169921875e-05, -1.7851591110229492e-05, -1.710653305053711e-05, -1.6361474990844727e-05, -1.5616416931152344e-05, -1.4871358871459961e-05, -1.4126300811767578e-05, -1.3381242752075195e-05, -1.2636184692382812e-05, -1.189112663269043e-05, -1.1146068572998047e-05, -1.0401010513305664e-05, -9.655952453613281e-06, -8.910894393920898e-06, -8.165836334228516e-06, -7.420778274536133e-06, -6.67572021484375e-06, -5.930662155151367e-06, -5.185604095458984e-06, -4.4405460357666016e-06, -3.6954879760742188e-06, -2.950429916381836e-06, -2.205371856689453e-06, -1.4603137969970703e-06, -7.152557373046875e-07, 2.9802322387695312e-08, 7.748603820800781e-07, 1.519918441772461e-06, 2.2649765014648438e-06, 3.0100345611572266e-06, 3.7550926208496094e-06, 4.500150680541992e-06, 5.245208740234375e-06, 5.990266799926758e-06, 6.735324859619141e-06, 7.4803829193115234e-06, 8.225440979003906e-06, 8.970499038696289e-06, 9.715557098388672e-06, 1.0460615158081055e-05, 1.1205673217773438e-05, 1.195073127746582e-05, 1.2695789337158203e-05, 1.3440847396850586e-05, 1.4185905456542969e-05, 1.4930963516235352e-05, 1.5676021575927734e-05, 1.6421079635620117e-05, 1.71661376953125e-05, 1.7911195755004883e-05, 1.8656253814697266e-05, 1.940131187438965e-05, 2.014636993408203e-05, 2.0891427993774414e-05, 2.1636486053466797e-05]}, "gradients/encoder.encoder.layers.19.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 2.0, 3.0, 2.0, 7.0, 7.0, 4.0, 15.0, 19.0, 31.0, 54.0, 117.0, 188.0, 501.0, 1291.0, 4567.0, 27680.0, 714172.0, 278869.0, 16027.0, 3287.0, 957.0, 405.0, 158.0, 86.0, 52.0, 19.0, 13.0, 10.0, 9.0, 3.0, 5.0, 3.0, 0.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.17236328125, -0.16768836975097656, -0.16301345825195312, -0.1583385467529297, -0.15366363525390625, -0.1489887237548828, -0.14431381225585938, -0.13963890075683594, -0.1349639892578125, -0.13028907775878906, -0.12561416625976562, -0.12093925476074219, -0.11626434326171875, -0.11158943176269531, -0.10691452026367188, -0.10223960876464844, -0.097564697265625, -0.09288978576660156, -0.08821487426757812, -0.08353996276855469, -0.07886505126953125, -0.07419013977050781, -0.06951522827148438, -0.06484031677246094, -0.0601654052734375, -0.05549049377441406, -0.050815582275390625, -0.04614067077636719, -0.04146575927734375, -0.03679084777832031, -0.032115936279296875, -0.027441024780273438, -0.02276611328125, -0.018091201782226562, -0.013416290283203125, -0.008741378784179688, -0.00406646728515625, 0.0006084442138671875, 0.005283355712890625, 0.009958267211914062, 0.0146331787109375, 0.019308090209960938, 0.023983001708984375, 0.028657913208007812, 0.03333282470703125, 0.03800773620605469, 0.042682647705078125, 0.04735755920410156, 0.052032470703125, 0.05670738220214844, 0.061382293701171875, 0.06605720520019531, 0.07073211669921875, 0.07540702819824219, 0.08008193969726562, 0.08475685119628906, 0.0894317626953125, 0.09410667419433594, 0.09878158569335938, 0.10345649719238281, 0.10813140869140625, 0.11280632019042969, 0.11748123168945312, 0.12215614318847656, 0.1268310546875]}, "gradients/encoder.encoder.layers.19.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 2.0, 0.0, 1.0, 1.0, 6.0, 3.0, 7.0, 13.0, 15.0, 12.0, 25.0, 26.0, 44.0, 70.0, 78.0, 108.0, 140.0, 98.0, 100.0, 87.0, 52.0, 41.0, 17.0, 24.0, 10.0, 8.0, 1.0, 7.0, 5.0, 8.0, 2.0, 3.0, 0.0, 0.0, 1.0, 0.0, 3.0], "bins": [-0.0982666015625, -0.09602117538452148, -0.09377574920654297, -0.09153032302856445, -0.08928489685058594, -0.08703947067260742, -0.0847940444946289, -0.08254861831665039, -0.08030319213867188, -0.07805776596069336, -0.07581233978271484, -0.07356691360473633, -0.07132148742675781, -0.0690760612487793, -0.06683063507080078, -0.06458520889282227, -0.06233978271484375, -0.060094356536865234, -0.05784893035888672, -0.0556035041809082, -0.05335807800292969, -0.05111265182495117, -0.048867225646972656, -0.04662179946899414, -0.044376373291015625, -0.04213094711303711, -0.039885520935058594, -0.03764009475708008, -0.03539466857910156, -0.03314924240112305, -0.03090381622314453, -0.028658390045166016, -0.0264129638671875, -0.024167537689208984, -0.02192211151123047, -0.019676685333251953, -0.017431259155273438, -0.015185832977294922, -0.012940406799316406, -0.01069498062133789, -0.008449554443359375, -0.006204128265380859, -0.003958702087402344, -0.0017132759094238281, 0.0005321502685546875, 0.002777576446533203, 0.005023002624511719, 0.007268428802490234, 0.00951385498046875, 0.011759281158447266, 0.014004707336425781, 0.016250133514404297, 0.018495559692382812, 0.020740985870361328, 0.022986412048339844, 0.02523183822631836, 0.027477264404296875, 0.02972269058227539, 0.031968116760253906, 0.03421354293823242, 0.03645896911621094, 0.03870439529418945, 0.04094982147216797, 0.043195247650146484, 0.045440673828125]}, "gradients/encoder.encoder.layers.19.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 5.0, 40.0, 802.0, 162.0, 7.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.3697757720947266, -3.1988203525543213, -3.027864694595337, -2.8569092750549316, -2.6859536170959473, -2.514998197555542, -2.3440427780151367, -2.1730871200561523, -2.002131462097168, -1.8311759233474731, -1.6602203845977783, -1.489264965057373, -1.3183093070983887, -1.1473538875579834, -0.9763983488082886, -0.8054428100585938, -0.6344873905181885, -0.46353185176849365, -0.2925763428211212, -0.12162083387374878, 0.049334704875946045, 0.22029024362564087, 0.3912457227706909, 0.5622012615203857, 0.7331568002700806, 0.9041123390197754, 1.0750678777694702, 1.246023416519165, 1.4169788360595703, 1.5879344940185547, 1.75888991355896, 1.9298454523086548, 2.1008009910583496, 2.271756410598755, 2.4427120685577393, 2.6136674880981445, 2.784623146057129, 2.955578565597534, 3.1265339851379395, 3.297489643096924, 3.468445301055908, 3.6394007205963135, 3.810356378555298, 3.981311798095703, 4.1522674560546875, 4.323223114013672, 4.494178295135498, 4.665133953094482, 4.836089134216309, 5.007044792175293, 5.177999973297119, 5.3489556312561035, 5.519911289215088, 5.690866947174072, 5.861822128295898, 6.032777786254883, 6.203733444213867, 6.374689102172852, 6.545644283294678, 6.716599941253662, 6.8875555992126465, 7.058511257171631, 7.229466438293457, 7.400422096252441, 7.571377754211426]}, "gradients/encoder.encoder.layers.19.layer_norm.bias": {"_type": "histogram", "values": [2.0, 2.0, 0.0, 1.0, 2.0, 3.0, 2.0, 5.0, 2.0, 10.0, 7.0, 16.0, 13.0, 26.0, 22.0, 21.0, 22.0, 35.0, 46.0, 47.0, 46.0, 53.0, 50.0, 61.0, 51.0, 50.0, 60.0, 57.0, 45.0, 43.0, 31.0, 33.0, 32.0, 18.0, 18.0, 23.0, 18.0, 20.0, 4.0, 4.0, 4.0, 7.0, 1.0, 4.0, 0.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7007409930229187, -0.6725161075592041, -0.6442911624908447, -0.6160662770271301, -0.5878413319587708, -0.5596164464950562, -0.5313915014266968, -0.5031666159629822, -0.4749417006969452, -0.4467167854309082, -0.4184918701648712, -0.39026695489883423, -0.36204206943511963, -0.33381712436676025, -0.30559223890304565, -0.27736732363700867, -0.24914240837097168, -0.2209174931049347, -0.1926925778388977, -0.1644676774740219, -0.13624276220798492, -0.10801784694194794, -0.07979294657707214, -0.051568031311035156, -0.02334311604499817, 0.00488179549574852, 0.03310670703649521, 0.0613316148519516, 0.08955653011798859, 0.11778144538402557, 0.14600634574890137, 0.17423126101493835, 0.20245611667633057, 0.23068103194236755, 0.25890594720840454, 0.28713083267211914, 0.3153557777404785, 0.3435806632041931, 0.3718055784702301, 0.4000304937362671, 0.4282554090023041, 0.45648032426834106, 0.48470523953437805, 0.512930154800415, 0.5411550402641296, 0.569379985332489, 0.5976048707962036, 0.625829815864563, 0.6540547013282776, 0.6822795867919922, 0.7105045318603516, 0.7387294173240662, 0.7669543623924255, 0.7951792478561401, 0.8234041929244995, 0.8516290783882141, 0.8798539638519287, 0.9080788493156433, 0.9363037943840027, 0.9645286798477173, 0.9927536249160767, 1.020978569984436, 1.0492033958435059, 1.0774283409118652, 1.1056532859802246]}, "gradients/encoder.encoder.layers.18.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 4.0, 2.0, 6.0, 2.0, 7.0, 4.0, 3.0, 7.0, 6.0, 10.0, 5.0, 7.0, 9.0, 16.0, 15.0, 13.0, 32.0, 32.0, 53.0, 75.0, 144.0, 242.0, 751.0, 2559.0, 16687.0, 3857683.0, 301126.0, 11875.0, 2156.0, 507.0, 161.0, 57.0, 16.0, 9.0, 5.0, 1.0, 2.0, 1.0, 1.0], "bins": [-0.7353515625, -0.7209014892578125, -0.706451416015625, -0.6920013427734375, -0.67755126953125, -0.6631011962890625, -0.648651123046875, -0.6342010498046875, -0.6197509765625, -0.6053009033203125, -0.590850830078125, -0.5764007568359375, -0.56195068359375, -0.5475006103515625, -0.533050537109375, -0.5186004638671875, -0.504150390625, -0.4897003173828125, -0.475250244140625, -0.4608001708984375, -0.44635009765625, -0.4319000244140625, -0.417449951171875, -0.4029998779296875, -0.3885498046875, -0.3740997314453125, -0.359649658203125, -0.3451995849609375, -0.33074951171875, -0.3162994384765625, -0.301849365234375, -0.2873992919921875, -0.27294921875, -0.2584991455078125, -0.244049072265625, -0.2295989990234375, -0.21514892578125, -0.2006988525390625, -0.186248779296875, -0.1717987060546875, -0.1573486328125, -0.1428985595703125, -0.128448486328125, -0.1139984130859375, -0.09954833984375, -0.0850982666015625, -0.070648193359375, -0.0561981201171875, -0.041748046875, -0.0272979736328125, -0.012847900390625, 0.0016021728515625, 0.01605224609375, 0.0305023193359375, 0.044952392578125, 0.0594024658203125, 0.0738525390625, 0.0883026123046875, 0.102752685546875, 0.1172027587890625, 0.13165283203125, 0.1461029052734375, 0.160552978515625, 0.1750030517578125, 0.189453125]}, "gradients/encoder.encoder.layers.18.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 1.0, 2.0, 17.0, 25.0, 57.0, 112.0, 184.0, 163.0, 176.0, 117.0, 82.0, 40.0, 20.0, 8.0, 3.0, 4.0, 3.0, 0.0, 1.0], "bins": [-0.1654052734375, -0.1622624397277832, -0.1591196060180664, -0.1559767723083496, -0.1528339385986328, -0.14969110488891602, -0.14654827117919922, -0.14340543746948242, -0.14026260375976562, -0.13711977005004883, -0.13397693634033203, -0.13083410263061523, -0.12769126892089844, -0.12454843521118164, -0.12140560150146484, -0.11826276779174805, -0.11511993408203125, -0.11197710037231445, -0.10883426666259766, -0.10569143295288086, -0.10254859924316406, -0.09940576553344727, -0.09626293182373047, -0.09312009811401367, -0.08997726440429688, -0.08683443069458008, -0.08369159698486328, -0.08054876327514648, -0.07740592956542969, -0.07426309585571289, -0.0711202621459961, -0.0679774284362793, -0.0648345947265625, -0.0616917610168457, -0.058548927307128906, -0.05540609359741211, -0.05226325988769531, -0.049120426177978516, -0.04597759246826172, -0.04283475875854492, -0.039691925048828125, -0.03654909133911133, -0.03340625762939453, -0.030263423919677734, -0.027120590209960938, -0.02397775650024414, -0.020834922790527344, -0.017692089080810547, -0.01454925537109375, -0.011406421661376953, -0.008263587951660156, -0.005120754241943359, -0.0019779205322265625, 0.0011649131774902344, 0.004307746887207031, 0.007450580596923828, 0.010593414306640625, 0.013736248016357422, 0.01687908172607422, 0.020021915435791016, 0.023164749145507812, 0.02630758285522461, 0.029450416564941406, 0.0325932502746582, 0.035736083984375]}, "gradients/encoder.encoder.layers.18.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 1.0, 6.0, 11.0, 15.0, 28.0, 60.0, 151.0, 420.0, 1389.0, 16149.0, 4160205.0, 13446.0, 1558.0, 432.0, 205.0, 92.0, 51.0, 33.0, 19.0, 7.0, 4.0, 5.0, 0.0, 1.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.70849609375, -0.6795425415039062, -0.6505889892578125, -0.6216354370117188, -0.592681884765625, -0.5637283325195312, -0.5347747802734375, -0.5058212280273438, -0.47686767578125, -0.44791412353515625, -0.4189605712890625, -0.39000701904296875, -0.361053466796875, -0.33209991455078125, -0.3031463623046875, -0.27419281005859375, -0.2452392578125, -0.21628570556640625, -0.1873321533203125, -0.15837860107421875, -0.129425048828125, -0.10047149658203125, -0.0715179443359375, -0.04256439208984375, -0.01361083984375, 0.01534271240234375, 0.0442962646484375, 0.07324981689453125, 0.102203369140625, 0.13115692138671875, 0.1601104736328125, 0.18906402587890625, 0.218017578125, 0.24697113037109375, 0.2759246826171875, 0.30487823486328125, 0.333831787109375, 0.36278533935546875, 0.3917388916015625, 0.42069244384765625, 0.44964599609375, 0.47859954833984375, 0.5075531005859375, 0.5365066528320312, 0.565460205078125, 0.5944137573242188, 0.6233673095703125, 0.6523208618164062, 0.6812744140625, 0.7102279663085938, 0.7391815185546875, 0.7681350708007812, 0.797088623046875, 0.8260421752929688, 0.8549957275390625, 0.8839492797851562, 0.91290283203125, 0.9418563842773438, 0.9708099365234375, 0.9997634887695312, 1.028717041015625, 1.0576705932617188, 1.0866241455078125, 1.1155776977539062, 1.14453125]}, "gradients/encoder.encoder.layers.18.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 3.0, 11.0, 22.0, 47.0, 175.0, 3084.0, 598.0, 86.0, 28.0, 13.0, 6.0, 7.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.108154296875, -0.10207366943359375, -0.0959930419921875, -0.08991241455078125, -0.083831787109375, -0.07775115966796875, -0.0716705322265625, -0.06558990478515625, -0.05950927734375, -0.05342864990234375, -0.0473480224609375, -0.04126739501953125, -0.035186767578125, -0.02910614013671875, -0.0230255126953125, -0.01694488525390625, -0.0108642578125, -0.00478363037109375, 0.0012969970703125, 0.00737762451171875, 0.013458251953125, 0.01953887939453125, 0.0256195068359375, 0.03170013427734375, 0.03778076171875, 0.04386138916015625, 0.0499420166015625, 0.05602264404296875, 0.062103271484375, 0.06818389892578125, 0.0742645263671875, 0.08034515380859375, 0.08642578125, 0.09250640869140625, 0.0985870361328125, 0.10466766357421875, 0.110748291015625, 0.11682891845703125, 0.1229095458984375, 0.12899017333984375, 0.13507080078125, 0.14115142822265625, 0.1472320556640625, 0.15331268310546875, 0.159393310546875, 0.16547393798828125, 0.1715545654296875, 0.17763519287109375, 0.1837158203125, 0.18979644775390625, 0.1958770751953125, 0.20195770263671875, 0.208038330078125, 0.21411895751953125, 0.2201995849609375, 0.22628021240234375, 0.23236083984375, 0.23844146728515625, 0.2445220947265625, 0.25060272216796875, 0.256683349609375, 0.26276397705078125, 0.2688446044921875, 0.27492523193359375, 0.281005859375]}, "gradients/encoder.encoder.layers.18.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 23.0, 978.0, 12.0, 2.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.666889190673828, -8.507675170898438, -8.34846019744873, -8.18924617767334, -8.030031204223633, -7.870817184448242, -7.711602687835693, -7.5523881912231445, -7.393174171447754, -7.233959674835205, -7.074745178222656, -6.915531158447266, -6.756316661834717, -6.597102165222168, -6.437887668609619, -6.27867317199707, -6.1194586753845215, -5.960244178771973, -5.801029682159424, -5.641815662384033, -5.482601165771484, -5.3233866691589355, -5.164172172546387, -5.004957675933838, -4.845743179321289, -4.68652868270874, -4.527314186096191, -4.368100166320801, -4.208885669708252, -4.049671173095703, -3.8904566764831543, -3.7312421798706055, -3.572028636932373, -3.412814140319824, -3.2535998821258545, -3.0943853855133057, -2.935171127319336, -2.775956630706787, -2.6167421340942383, -2.4575276374816895, -2.2983131408691406, -2.139098644256592, -1.979884386062622, -1.8206698894500732, -1.661455512046814, -1.5022411346435547, -1.3430266380310059, -1.1838122606277466, -1.0245980024337769, -0.8653836250305176, -0.7061691880226135, -0.5469547510147095, -0.3877403736114502, -0.22852599620819092, -0.06931155920028687, 0.08990287780761719, 0.24911725521087646, 0.40833166241645813, 0.5675460696220398, 0.7267605066299438, 0.8859748840332031, 1.0451892614364624, 1.2044036388397217, 1.3636181354522705, 1.5228325128555298]}, "gradients/encoder.encoder.layers.18.final_layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 5.0, 8.0, 17.0, 17.0, 45.0, 58.0, 69.0, 101.0, 116.0, 92.0, 130.0, 109.0, 79.0, 61.0, 39.0, 23.0, 13.0, 7.0, 9.0, 6.0, 2.0, 1.0, 4.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.4563528299331665, -0.4409220218658447, -0.42549121379852295, -0.41006040573120117, -0.3946295976638794, -0.3791987895965576, -0.36376798152923584, -0.34833720326423645, -0.3329063951969147, -0.3174755871295929, -0.3020447790622711, -0.28661397099494934, -0.27118316292762756, -0.2557523846626282, -0.2403215616941452, -0.22489076852798462, -0.20945994555950165, -0.19402913749217987, -0.1785983294248581, -0.1631675362586975, -0.14773672819137573, -0.13230592012405396, -0.11687511205673218, -0.101444311439991, -0.08601350337266922, -0.07058269530534744, -0.05515189468860626, -0.039721086621284485, -0.024290282279253006, -0.008859477937221527, 0.00657133013010025, 0.02200213074684143, 0.03743293881416321, 0.05286374315619469, 0.06829454749822617, 0.08372535556554794, 0.09915615618228912, 0.1145869642496109, 0.13001777231693268, 0.14544856548309326, 0.16087937355041504, 0.17631018161773682, 0.1917409896850586, 0.20717179775238037, 0.22260259091854095, 0.23803339898586273, 0.2534642219543457, 0.2688950002193451, 0.28432583808898926, 0.29975664615631104, 0.3151874542236328, 0.3306182622909546, 0.34604907035827637, 0.36147987842559814, 0.3769106864929199, 0.3923414647579193, 0.4077722728252411, 0.42320308089256287, 0.43863388895988464, 0.4540646970272064, 0.4694955050945282, 0.4849262833595276, 0.5003570914268494, 0.5157878994941711, 0.5312187075614929]}, "gradients/encoder.encoder.layers.18.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 4.0, 6.0, 4.0, 4.0, 6.0, 10.0, 21.0, 26.0, 26.0, 42.0, 57.0, 88.0, 204.0, 664.0, 3186.0, 56962.0, 939518.0, 43774.0, 2831.0, 614.0, 222.0, 90.0, 51.0, 47.0, 32.0, 18.0, 17.0, 13.0, 8.0, 5.0, 5.0, 3.0, 3.0, 2.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.415771484375, -0.4046745300292969, -0.39357757568359375, -0.3824806213378906, -0.3713836669921875, -0.3602867126464844, -0.34918975830078125, -0.3380928039550781, -0.326995849609375, -0.3158988952636719, -0.30480194091796875, -0.2937049865722656, -0.2826080322265625, -0.2715110778808594, -0.26041412353515625, -0.24931716918945312, -0.23822021484375, -0.22712326049804688, -0.21602630615234375, -0.20492935180664062, -0.1938323974609375, -0.18273544311523438, -0.17163848876953125, -0.16054153442382812, -0.149444580078125, -0.13834762573242188, -0.12725067138671875, -0.11615371704101562, -0.1050567626953125, -0.09395980834960938, -0.08286285400390625, -0.07176589965820312, -0.0606689453125, -0.049571990966796875, -0.03847503662109375, -0.027378082275390625, -0.0162811279296875, -0.005184173583984375, 0.00591278076171875, 0.017009735107421875, 0.028106689453125, 0.039203643798828125, 0.05030059814453125, 0.061397552490234375, 0.0724945068359375, 0.08359146118164062, 0.09468841552734375, 0.10578536987304688, 0.11688232421875, 0.12797927856445312, 0.13907623291015625, 0.15017318725585938, 0.1612701416015625, 0.17236709594726562, 0.18346405029296875, 0.19456100463867188, 0.205657958984375, 0.21675491333007812, 0.22785186767578125, 0.23894882202148438, 0.2500457763671875, 0.2611427307128906, 0.27223968505859375, 0.2833366394042969, 0.29443359375]}, "gradients/encoder.encoder.layers.18.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 3.0, 7.0, 26.0, 56.0, 95.0, 155.0, 198.0, 169.0, 141.0, 79.0, 45.0, 17.0, 8.0, 2.0, 4.0, 5.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.1605224609375, -0.1573619842529297, -0.15420150756835938, -0.15104103088378906, -0.14788055419921875, -0.14472007751464844, -0.14155960083007812, -0.1383991241455078, -0.1352386474609375, -0.1320781707763672, -0.12891769409179688, -0.12575721740722656, -0.12259674072265625, -0.11943626403808594, -0.11627578735351562, -0.11311531066894531, -0.109954833984375, -0.10679435729980469, -0.10363388061523438, -0.10047340393066406, -0.09731292724609375, -0.09415245056152344, -0.09099197387695312, -0.08783149719238281, -0.0846710205078125, -0.08151054382324219, -0.07835006713867188, -0.07518959045410156, -0.07202911376953125, -0.06886863708496094, -0.06570816040039062, -0.06254768371582031, -0.05938720703125, -0.05622673034667969, -0.053066253662109375, -0.04990577697753906, -0.04674530029296875, -0.04358482360839844, -0.040424346923828125, -0.03726387023925781, -0.0341033935546875, -0.030942916870117188, -0.027782440185546875, -0.024621963500976562, -0.02146148681640625, -0.018301010131835938, -0.015140533447265625, -0.011980056762695312, -0.008819580078125, -0.0056591033935546875, -0.002498626708984375, 0.0006618499755859375, 0.00382232666015625, 0.0069828033447265625, 0.010143280029296875, 0.013303756713867188, 0.0164642333984375, 0.019624710083007812, 0.022785186767578125, 0.025945663452148438, 0.02910614013671875, 0.03226661682128906, 0.035427093505859375, 0.03858757019042969, 0.041748046875]}, "gradients/encoder.encoder.layers.18.attention.v_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 5.0, 5.0, 4.0, 2.0, 9.0, 6.0, 7.0, 17.0, 15.0, 29.0, 34.0, 73.0, 83.0, 155.0, 331.0, 918.0, 3846.0, 33546.0, 841253.0, 156466.0, 8981.0, 1644.0, 521.0, 237.0, 128.0, 87.0, 56.0, 22.0, 26.0, 19.0, 13.0, 9.0, 6.0, 4.0, 3.0, 4.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.3046875, -0.2959136962890625, -0.287139892578125, -0.2783660888671875, -0.26959228515625, -0.2608184814453125, -0.252044677734375, -0.2432708740234375, -0.2344970703125, -0.2257232666015625, -0.216949462890625, -0.2081756591796875, -0.19940185546875, -0.1906280517578125, -0.181854248046875, -0.1730804443359375, -0.164306640625, -0.1555328369140625, -0.146759033203125, -0.1379852294921875, -0.12921142578125, -0.1204376220703125, -0.111663818359375, -0.1028900146484375, -0.0941162109375, -0.0853424072265625, -0.076568603515625, -0.0677947998046875, -0.05902099609375, -0.0502471923828125, -0.041473388671875, -0.0326995849609375, -0.02392578125, -0.0151519775390625, -0.006378173828125, 0.0023956298828125, 0.01116943359375, 0.0199432373046875, 0.028717041015625, 0.0374908447265625, 0.0462646484375, 0.0550384521484375, 0.063812255859375, 0.0725860595703125, 0.08135986328125, 0.0901336669921875, 0.098907470703125, 0.1076812744140625, 0.116455078125, 0.1252288818359375, 0.134002685546875, 0.1427764892578125, 0.15155029296875, 0.1603240966796875, 0.169097900390625, 0.1778717041015625, 0.1866455078125, 0.1954193115234375, 0.204193115234375, 0.2129669189453125, 0.22174072265625, 0.2305145263671875, 0.239288330078125, 0.2480621337890625, 0.2568359375]}, "gradients/encoder.encoder.layers.18.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 1.0, 1.0, 3.0, 4.0, 1.0, 5.0, 6.0, 7.0, 18.0, 11.0, 20.0, 18.0, 28.0, 34.0, 38.0, 54.0, 64.0, 86.0, 70.0, 82.0, 63.0, 78.0, 59.0, 50.0, 49.0, 39.0, 37.0, 22.0, 15.0, 10.0, 12.0, 11.0, 2.0, 2.0, 2.0, 8.0, 2.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.246337890625, -0.23956680297851562, -0.23279571533203125, -0.22602462768554688, -0.2192535400390625, -0.21248245239257812, -0.20571136474609375, -0.19894027709960938, -0.192169189453125, -0.18539810180664062, -0.17862701416015625, -0.17185592651367188, -0.1650848388671875, -0.15831375122070312, -0.15154266357421875, -0.14477157592773438, -0.13800048828125, -0.13122940063476562, -0.12445831298828125, -0.11768722534179688, -0.1109161376953125, -0.10414505004882812, -0.09737396240234375, -0.09060287475585938, -0.083831787109375, -0.07706069946289062, -0.07028961181640625, -0.06351852416992188, -0.0567474365234375, -0.049976348876953125, -0.04320526123046875, -0.036434173583984375, -0.0296630859375, -0.022891998291015625, -0.01612091064453125, -0.009349822998046875, -0.0025787353515625, 0.004192352294921875, 0.01096343994140625, 0.017734527587890625, 0.024505615234375, 0.031276702880859375, 0.03804779052734375, 0.044818878173828125, 0.0515899658203125, 0.058361053466796875, 0.06513214111328125, 0.07190322875976562, 0.07867431640625, 0.08544540405273438, 0.09221649169921875, 0.09898757934570312, 0.1057586669921875, 0.11252975463867188, 0.11930084228515625, 0.12607192993164062, 0.132843017578125, 0.13961410522460938, 0.14638519287109375, 0.15315628051757812, 0.1599273681640625, 0.16669845581054688, 0.17346954345703125, 0.18024063110351562, 0.18701171875]}, "gradients/encoder.encoder.layers.18.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 3.0, 4.0, 4.0, 6.0, 9.0, 7.0, 8.0, 8.0, 16.0, 23.0, 38.0, 48.0, 89.0, 125.0, 194.0, 371.0, 670.0, 1534.0, 3955.0, 14475.0, 87959.0, 660417.0, 239910.0, 27873.0, 6519.0, 2162.0, 939.0, 493.0, 268.0, 147.0, 88.0, 51.0, 30.0, 42.0, 17.0, 11.0, 11.0, 9.0, 13.0, 3.0, 6.0, 2.0, 1.0, 3.0, 0.0, 0.0, 3.0, 1.0, 0.0, 0.0, 3.0, 1.0, 1.0], "bins": [-0.057037353515625, -0.05529356002807617, -0.053549766540527344, -0.051805973052978516, -0.05006217956542969, -0.04831838607788086, -0.04657459259033203, -0.0448307991027832, -0.043087005615234375, -0.04134321212768555, -0.03959941864013672, -0.03785562515258789, -0.03611183166503906, -0.034368038177490234, -0.032624244689941406, -0.030880451202392578, -0.02913665771484375, -0.027392864227294922, -0.025649070739746094, -0.023905277252197266, -0.022161483764648438, -0.02041769027709961, -0.01867389678955078, -0.016930103302001953, -0.015186309814453125, -0.013442516326904297, -0.011698722839355469, -0.00995492935180664, -0.008211135864257812, -0.006467342376708984, -0.004723548889160156, -0.002979755401611328, -0.0012359619140625, 0.0005078315734863281, 0.0022516250610351562, 0.003995418548583984, 0.0057392120361328125, 0.007483005523681641, 0.009226799011230469, 0.010970592498779297, 0.012714385986328125, 0.014458179473876953, 0.01620197296142578, 0.01794576644897461, 0.019689559936523438, 0.021433353424072266, 0.023177146911621094, 0.024920940399169922, 0.02666473388671875, 0.028408527374267578, 0.030152320861816406, 0.031896114349365234, 0.03363990783691406, 0.03538370132446289, 0.03712749481201172, 0.03887128829956055, 0.040615081787109375, 0.0423588752746582, 0.04410266876220703, 0.04584646224975586, 0.04759025573730469, 0.049334049224853516, 0.051077842712402344, 0.05282163619995117, 0.0545654296875]}, "gradients/encoder.encoder.layers.18.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 3.0, 3.0, 0.0, 1.0, 2.0, 4.0, 11.0, 12.0, 20.0, 18.0, 44.0, 39.0, 57.0, 50.0, 93.0, 95.0, 82.0, 67.0, 78.0, 81.0, 53.0, 50.0, 56.0, 34.0, 11.0, 18.0, 14.0, 5.0, 5.0, 3.0, 0.0, 2.0, 1.0, 2.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-1.3172626495361328e-05, -1.2831762433052063e-05, -1.2490898370742798e-05, -1.2150034308433533e-05, -1.1809170246124268e-05, -1.1468306183815002e-05, -1.1127442121505737e-05, -1.0786578059196472e-05, -1.0445713996887207e-05, -1.0104849934577942e-05, -9.763985872268677e-06, -9.423121809959412e-06, -9.082257747650146e-06, -8.741393685340881e-06, -8.400529623031616e-06, -8.059665560722351e-06, -7.718801498413086e-06, -7.377937436103821e-06, -7.037073373794556e-06, -6.6962093114852905e-06, -6.355345249176025e-06, -6.01448118686676e-06, -5.673617124557495e-06, -5.33275306224823e-06, -4.991888999938965e-06, -4.6510249376297e-06, -4.3101608753204346e-06, -3.9692968130111694e-06, -3.6284327507019043e-06, -3.287568688392639e-06, -2.946704626083374e-06, -2.605840563774109e-06, -2.2649765014648438e-06, -1.9241124391555786e-06, -1.5832483768463135e-06, -1.2423843145370483e-06, -9.015202522277832e-07, -5.606561899185181e-07, -2.1979212760925293e-07, 1.210719347000122e-07, 4.6193599700927734e-07, 8.028000593185425e-07, 1.1436641216278076e-06, 1.4845281839370728e-06, 1.8253922462463379e-06, 2.166256308555603e-06, 2.507120370864868e-06, 2.8479844331741333e-06, 3.1888484954833984e-06, 3.5297125577926636e-06, 3.870576620101929e-06, 4.211440682411194e-06, 4.552304744720459e-06, 4.893168807029724e-06, 5.234032869338989e-06, 5.574896931648254e-06, 5.9157609939575195e-06, 6.256625056266785e-06, 6.59748911857605e-06, 6.938353180885315e-06, 7.27921724319458e-06, 7.620081305503845e-06, 7.96094536781311e-06, 8.301809430122375e-06, 8.64267349243164e-06]}, "gradients/encoder.encoder.layers.18.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 3.0, 1.0, 3.0, 5.0, 3.0, 5.0, 12.0, 15.0, 21.0, 34.0, 57.0, 125.0, 219.0, 544.0, 1259.0, 4696.0, 31743.0, 716349.0, 273958.0, 14719.0, 3006.0, 983.0, 391.0, 172.0, 114.0, 38.0, 34.0, 19.0, 12.0, 8.0, 6.0, 4.0, 1.0, 3.0, 3.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0941162109375, -0.09096145629882812, -0.08780670166015625, -0.08465194702148438, -0.0814971923828125, -0.07834243774414062, -0.07518768310546875, -0.07203292846679688, -0.068878173828125, -0.06572341918945312, -0.06256866455078125, -0.059413909912109375, -0.0562591552734375, -0.053104400634765625, -0.04994964599609375, -0.046794891357421875, -0.04364013671875, -0.040485382080078125, -0.03733062744140625, -0.034175872802734375, -0.0310211181640625, -0.027866363525390625, -0.02471160888671875, -0.021556854248046875, -0.018402099609375, -0.015247344970703125, -0.01209259033203125, -0.008937835693359375, -0.0057830810546875, -0.002628326416015625, 0.00052642822265625, 0.003681182861328125, 0.0068359375, 0.009990692138671875, 0.01314544677734375, 0.016300201416015625, 0.0194549560546875, 0.022609710693359375, 0.02576446533203125, 0.028919219970703125, 0.032073974609375, 0.035228729248046875, 0.03838348388671875, 0.041538238525390625, 0.0446929931640625, 0.047847747802734375, 0.05100250244140625, 0.054157257080078125, 0.05731201171875, 0.060466766357421875, 0.06362152099609375, 0.06677627563476562, 0.0699310302734375, 0.07308578491210938, 0.07624053955078125, 0.07939529418945312, 0.082550048828125, 0.08570480346679688, 0.08885955810546875, 0.09201431274414062, 0.0951690673828125, 0.09832382202148438, 0.10147857666015625, 0.10463333129882812, 0.1077880859375]}, "gradients/encoder.encoder.layers.18.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 9.0, 12.0, 11.0, 28.0, 39.0, 71.0, 122.0, 133.0, 159.0, 153.0, 102.0, 55.0, 45.0, 27.0, 11.0, 12.0, 9.0, 2.0, 2.0, 4.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09808349609375, -0.09552478790283203, -0.09296607971191406, -0.0904073715209961, -0.08784866333007812, -0.08528995513916016, -0.08273124694824219, -0.08017253875732422, -0.07761383056640625, -0.07505512237548828, -0.07249641418457031, -0.06993770599365234, -0.06737899780273438, -0.0648202896118164, -0.06226158142089844, -0.05970287322998047, -0.0571441650390625, -0.05458545684814453, -0.05202674865722656, -0.049468040466308594, -0.046909332275390625, -0.044350624084472656, -0.04179191589355469, -0.03923320770263672, -0.03667449951171875, -0.03411579132080078, -0.03155708312988281, -0.028998374938964844, -0.026439666748046875, -0.023880958557128906, -0.021322250366210938, -0.01876354217529297, -0.016204833984375, -0.013646125793457031, -0.011087417602539062, -0.008528709411621094, -0.005970001220703125, -0.0034112930297851562, -0.0008525848388671875, 0.0017061233520507812, 0.00426483154296875, 0.006823539733886719, 0.009382247924804688, 0.011940956115722656, 0.014499664306640625, 0.017058372497558594, 0.019617080688476562, 0.02217578887939453, 0.0247344970703125, 0.02729320526123047, 0.029851913452148438, 0.032410621643066406, 0.034969329833984375, 0.037528038024902344, 0.04008674621582031, 0.04264545440673828, 0.04520416259765625, 0.04776287078857422, 0.05032157897949219, 0.052880287170410156, 0.055438995361328125, 0.057997703552246094, 0.06055641174316406, 0.06311511993408203, 0.065673828125]}, "gradients/encoder.encoder.layers.18.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 3.0, 8.0, 26.0, 186.0, 593.0, 159.0, 34.0, 5.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-4.482228755950928, -4.398406982421875, -4.3145856857299805, -4.230763912200928, -4.146942138671875, -4.063120365142822, -3.9792988300323486, -3.895477294921875, -3.8116555213928223, -3.7278337478637695, -3.644012212753296, -3.5601906776428223, -3.4763689041137695, -3.392547130584717, -3.308725595474243, -3.2249040603637695, -3.141082286834717, -3.057260513305664, -2.9734389781951904, -2.889617443084717, -2.805795669555664, -2.7219738960266113, -2.6381523609161377, -2.554330825805664, -2.4705090522766113, -2.3866872787475586, -2.302865743637085, -2.2190442085266113, -2.1352224349975586, -2.051400661468506, -1.9675791263580322, -1.883757472038269, -1.7999355792999268, -1.7161139249801636, -1.6322922706604004, -1.5484706163406372, -1.464648962020874, -1.3808273077011108, -1.2970056533813477, -1.2131839990615845, -1.1293623447418213, -1.045540690422058, -0.9617190361022949, -0.8778973817825317, -0.7940757274627686, -0.7102540731430054, -0.6264324188232422, -0.542610764503479, -0.4587891101837158, -0.37496745586395264, -0.29114580154418945, -0.20732414722442627, -0.12350249290466309, -0.0396808385848999, 0.04414081573486328, 0.12796247005462646, 0.21178412437438965, 0.29560577869415283, 0.379427433013916, 0.4632490873336792, 0.5470707416534424, 0.6308923959732056, 0.7147140502929688, 0.7985357046127319, 0.8823573589324951]}, "gradients/encoder.encoder.layers.18.layer_norm.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 3.0, 1.0, 3.0, 2.0, 0.0, 3.0, 6.0, 3.0, 6.0, 13.0, 15.0, 10.0, 16.0, 19.0, 30.0, 23.0, 30.0, 34.0, 42.0, 50.0, 36.0, 48.0, 54.0, 42.0, 50.0, 46.0, 51.0, 54.0, 49.0, 33.0, 36.0, 29.0, 26.0, 27.0, 26.0, 25.0, 13.0, 5.0, 9.0, 9.0, 5.0, 3.0, 4.0, 8.0, 8.0, 2.0, 1.0, 3.0, 1.0, 0.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.8090323209762573, -0.7833037376403809, -0.7575752139091492, -0.7318466305732727, -0.706118106842041, -0.6803895235061646, -0.6546609997749329, -0.6289324164390564, -0.6032038927078247, -0.5774753093719482, -0.5517467856407166, -0.5260182023048401, -0.5002896785736084, -0.47456109523773193, -0.44883257150650024, -0.4231039881706238, -0.3973754346370697, -0.3716468811035156, -0.34591832756996155, -0.32018977403640747, -0.2944612205028534, -0.2687326669692993, -0.24300409853458405, -0.21727554500102997, -0.1915469914674759, -0.16581843793392181, -0.14008988440036774, -0.11436132341623306, -0.08863276988267899, -0.06290420889854431, -0.037175655364990234, -0.011447101831436157, 0.01428145170211792, 0.040010005235672, 0.06573855876922607, 0.09146711975336075, 0.11719567328691483, 0.1429242342710495, 0.16865278780460358, 0.19438134133815765, 0.22010989487171173, 0.2458384484052658, 0.2715670168399811, 0.29729557037353516, 0.32302412390708923, 0.3487526774406433, 0.3744812309741974, 0.40020978450775146, 0.42593833804130554, 0.4516668915748596, 0.4773954451084137, 0.5031239986419678, 0.5288525819778442, 0.5545811057090759, 0.5803096890449524, 0.6060382127761841, 0.6317667961120605, 0.657495379447937, 0.6832239031791687, 0.7089524865150452, 0.7346810102462769, 0.7604095935821533, 0.786138117313385, 0.8118667006492615, 0.8375952243804932]}, "gradients/encoder.encoder.layers.17.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 2.0, 0.0, 0.0, 3.0, 3.0, 6.0, 3.0, 5.0, 5.0, 5.0, 8.0, 10.0, 12.0, 15.0, 13.0, 22.0, 16.0, 32.0, 33.0, 71.0, 110.0, 172.0, 419.0, 1057.0, 4405.0, 41249.0, 4062647.0, 75647.0, 6343.0, 1355.0, 374.0, 148.0, 61.0, 15.0, 16.0, 7.0, 0.0, 0.0, 4.0, 1.0, 1.0], "bins": [-0.7412109375, -0.7262611389160156, -0.7113113403320312, -0.6963615417480469, -0.6814117431640625, -0.6664619445800781, -0.6515121459960938, -0.6365623474121094, -0.621612548828125, -0.6066627502441406, -0.5917129516601562, -0.5767631530761719, -0.5618133544921875, -0.5468635559082031, -0.5319137573242188, -0.5169639587402344, -0.50201416015625, -0.4870643615722656, -0.47211456298828125, -0.4571647644042969, -0.4422149658203125, -0.4272651672363281, -0.41231536865234375, -0.3973655700683594, -0.382415771484375, -0.3674659729003906, -0.35251617431640625, -0.3375663757324219, -0.3226165771484375, -0.3076667785644531, -0.29271697998046875, -0.2777671813964844, -0.2628173828125, -0.24786758422851562, -0.23291778564453125, -0.21796798706054688, -0.2030181884765625, -0.18806838989257812, -0.17311859130859375, -0.15816879272460938, -0.143218994140625, -0.12826919555664062, -0.11331939697265625, -0.09836959838867188, -0.0834197998046875, -0.06847000122070312, -0.05352020263671875, -0.038570404052734375, -0.02362060546875, -0.008670806884765625, 0.00627899169921875, 0.021228790283203125, 0.0361785888671875, 0.051128387451171875, 0.06607818603515625, 0.08102798461914062, 0.095977783203125, 0.11092758178710938, 0.12587738037109375, 0.14082717895507812, 0.1557769775390625, 0.17072677612304688, 0.18567657470703125, 0.20062637329101562, 0.215576171875]}, "gradients/encoder.encoder.layers.17.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 3.0, 6.0, 14.0, 30.0, 66.0, 94.0, 141.0, 177.0, 186.0, 121.0, 85.0, 38.0, 27.0, 22.0, 0.0, 5.0, 3.0], "bins": [-0.1629638671875, -0.1599884033203125, -0.157012939453125, -0.1540374755859375, -0.15106201171875, -0.1480865478515625, -0.145111083984375, -0.1421356201171875, -0.13916015625, -0.1361846923828125, -0.133209228515625, -0.1302337646484375, -0.12725830078125, -0.1242828369140625, -0.121307373046875, -0.1183319091796875, -0.1153564453125, -0.1123809814453125, -0.109405517578125, -0.1064300537109375, -0.10345458984375, -0.1004791259765625, -0.097503662109375, -0.0945281982421875, -0.091552734375, -0.0885772705078125, -0.085601806640625, -0.0826263427734375, -0.07965087890625, -0.0766754150390625, -0.073699951171875, -0.0707244873046875, -0.0677490234375, -0.0647735595703125, -0.061798095703125, -0.0588226318359375, -0.05584716796875, -0.0528717041015625, -0.049896240234375, -0.0469207763671875, -0.0439453125, -0.0409698486328125, -0.037994384765625, -0.0350189208984375, -0.03204345703125, -0.0290679931640625, -0.026092529296875, -0.0231170654296875, -0.0201416015625, -0.0171661376953125, -0.014190673828125, -0.0112152099609375, -0.00823974609375, -0.0052642822265625, -0.002288818359375, 0.0006866455078125, 0.003662109375, 0.0066375732421875, 0.009613037109375, 0.0125885009765625, 0.01556396484375, 0.0185394287109375, 0.021514892578125, 0.0244903564453125, 0.0274658203125]}, "gradients/encoder.encoder.layers.17.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 3.0, 3.0, 17.0, 16.0, 29.0, 56.0, 119.0, 237.0, 904.0, 16117.0, 4169253.0, 6611.0, 571.0, 197.0, 81.0, 41.0, 22.0, 11.0, 9.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.72265625, -1.6753387451171875, -1.628021240234375, -1.5807037353515625, -1.53338623046875, -1.4860687255859375, -1.438751220703125, -1.3914337158203125, -1.3441162109375, -1.2967987060546875, -1.249481201171875, -1.2021636962890625, -1.15484619140625, -1.1075286865234375, -1.060211181640625, -1.0128936767578125, -0.965576171875, -0.9182586669921875, -0.870941162109375, -0.8236236572265625, -0.77630615234375, -0.7289886474609375, -0.681671142578125, -0.6343536376953125, -0.5870361328125, -0.5397186279296875, -0.492401123046875, -0.4450836181640625, -0.39776611328125, -0.3504486083984375, -0.303131103515625, -0.2558135986328125, -0.20849609375, -0.1611785888671875, -0.113861083984375, -0.0665435791015625, -0.01922607421875, 0.0280914306640625, 0.075408935546875, 0.1227264404296875, 0.1700439453125, 0.2173614501953125, 0.264678955078125, 0.3119964599609375, 0.35931396484375, 0.4066314697265625, 0.453948974609375, 0.5012664794921875, 0.548583984375, 0.5959014892578125, 0.643218994140625, 0.6905364990234375, 0.73785400390625, 0.7851715087890625, 0.832489013671875, 0.8798065185546875, 0.9271240234375, 0.9744415283203125, 1.021759033203125, 1.0690765380859375, 1.11639404296875, 1.1637115478515625, 1.211029052734375, 1.2583465576171875, 1.3056640625]}, "gradients/encoder.encoder.layers.17.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 2.0, 0.0, 3.0, 1.0, 11.0, 4.0, 5.0, 4.0, 17.0, 18.0, 88.0, 454.0, 3173.0, 223.0, 46.0, 21.0, 10.0, 3.0, 2.0, 5.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.400390625, -0.3899497985839844, -0.37950897216796875, -0.3690681457519531, -0.3586273193359375, -0.3481864929199219, -0.33774566650390625, -0.3273048400878906, -0.316864013671875, -0.3064231872558594, -0.29598236083984375, -0.2855415344238281, -0.2751007080078125, -0.2646598815917969, -0.25421905517578125, -0.24377822875976562, -0.23333740234375, -0.22289657592773438, -0.21245574951171875, -0.20201492309570312, -0.1915740966796875, -0.18113327026367188, -0.17069244384765625, -0.16025161743164062, -0.149810791015625, -0.13936996459960938, -0.12892913818359375, -0.11848831176757812, -0.1080474853515625, -0.09760665893554688, -0.08716583251953125, -0.07672500610351562, -0.0662841796875, -0.055843353271484375, -0.04540252685546875, -0.034961700439453125, -0.0245208740234375, -0.014080047607421875, -0.00363922119140625, 0.006801605224609375, 0.017242431640625, 0.027683258056640625, 0.03812408447265625, 0.048564910888671875, 0.0590057373046875, 0.06944656372070312, 0.07988739013671875, 0.09032821655273438, 0.10076904296875, 0.11120986938476562, 0.12165069580078125, 0.13209152221679688, 0.1425323486328125, 0.15297317504882812, 0.16341400146484375, 0.17385482788085938, 0.184295654296875, 0.19473648071289062, 0.20517730712890625, 0.21561813354492188, 0.2260589599609375, 0.23649978637695312, 0.24694061279296875, 0.2573814392089844, 0.267822265625]}, "gradients/encoder.encoder.layers.17.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 11.0, 113.0, 862.0, 24.0, 3.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-8.687887191772461, -8.524981498718262, -8.362075805664062, -8.199170112609863, -8.036263465881348, -7.873357772827148, -7.710452079772949, -7.54754638671875, -7.384640693664551, -7.221735000610352, -7.058828830718994, -6.895923137664795, -6.733017444610596, -6.570111274719238, -6.407205581665039, -6.24429988861084, -6.081393718719482, -5.918488025665283, -5.755581855773926, -5.592676162719727, -5.429770469665527, -5.266864776611328, -5.103958606719971, -4.9410529136657715, -4.778146743774414, -4.615241050720215, -4.452334880828857, -4.289429187774658, -4.126523494720459, -3.9636175632476807, -3.8007116317749023, -3.637805938720703, -3.474900245666504, -3.3119943141937256, -3.1490886211395264, -2.986182689666748, -2.823276996612549, -2.6603710651397705, -2.497465133666992, -2.334559440612793, -2.1716535091400146, -2.0087475776672363, -1.845841884613037, -1.6829359531402588, -1.52003014087677, -1.3571243286132812, -1.194218397140503, -1.0313125848770142, -0.8684067726135254, -0.7055009603500366, -0.5425950884819031, -0.3796892464160919, -0.21678340435028076, -0.05387759208679199, 0.10902827978134155, 0.2719341516494751, 0.43483996391296387, 0.5977457761764526, 0.7606516480445862, 0.9235575199127197, 1.0864633321762085, 1.2493691444396973, 1.4122750759124756, 1.5751808881759644, 1.7380867004394531]}, "gradients/encoder.encoder.layers.17.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 6.0, 15.0, 28.0, 32.0, 58.0, 84.0, 92.0, 160.0, 136.0, 131.0, 108.0, 64.0, 46.0, 27.0, 13.0, 5.0, 3.0, 4.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.0474042892456055, -1.012473464012146, -0.9775426387786865, -0.9426117539405823, -0.9076809287071228, -0.8727501034736633, -0.8378192186355591, -0.8028883934020996, -0.7679575681686401, -0.7330267429351807, -0.6980959177017212, -0.6631650328636169, -0.6282342076301575, -0.593303382396698, -0.5583724975585938, -0.5234416723251343, -0.4885108470916748, -0.45358002185821533, -0.41864916682243347, -0.3837183117866516, -0.34878748655319214, -0.31385666131973267, -0.2789258062839508, -0.24399496614933014, -0.20906412601470947, -0.1741332858800888, -0.13920244574546814, -0.10427160561084747, -0.0693407654762268, -0.03440992534160614, 0.0005209147930145264, 0.03545175492763519, 0.07038271427154541, 0.10531355440616608, 0.14024439454078674, 0.1751752346754074, 0.21010607481002808, 0.24503691494464874, 0.2799677550792694, 0.31489861011505127, 0.34982943534851074, 0.3847602605819702, 0.4196911156177521, 0.45462197065353394, 0.4895527958869934, 0.5244836211204529, 0.5594145059585571, 0.5943453311920166, 0.6292761564254761, 0.6642069816589355, 0.699137806892395, 0.7340686917304993, 0.7689995169639587, 0.8039303421974182, 0.8388612270355225, 0.8737920522689819, 0.9087228775024414, 0.9436537027359009, 0.9785845279693604, 1.0135153532028198, 1.0484461784362793, 1.0833771228790283, 1.1183079481124878, 1.1532387733459473, 1.1881695985794067]}, "gradients/encoder.encoder.layers.17.attention.out_proj.weight": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 0.0, 0.0, 5.0, 2.0, 2.0, 2.0, 5.0, 5.0, 6.0, 10.0, 16.0, 20.0, 16.0, 22.0, 27.0, 52.0, 60.0, 79.0, 118.0, 161.0, 268.0, 507.0, 1148.0, 3181.0, 15170.0, 148908.0, 760570.0, 101258.0, 11938.0, 2806.0, 954.0, 462.0, 235.0, 152.0, 99.0, 72.0, 50.0, 48.0, 27.0, 20.0, 19.0, 12.0, 8.0, 10.0, 7.0, 8.0, 5.0, 5.0, 4.0, 3.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0], "bins": [-0.2054443359375, -0.19846343994140625, -0.1914825439453125, -0.18450164794921875, -0.177520751953125, -0.17053985595703125, -0.1635589599609375, -0.15657806396484375, -0.14959716796875, -0.14261627197265625, -0.1356353759765625, -0.12865447998046875, -0.121673583984375, -0.11469268798828125, -0.1077117919921875, -0.10073089599609375, -0.09375, -0.08676910400390625, -0.0797882080078125, -0.07280731201171875, -0.065826416015625, -0.05884552001953125, -0.0518646240234375, -0.04488372802734375, -0.03790283203125, -0.03092193603515625, -0.0239410400390625, -0.01696014404296875, -0.009979248046875, -0.00299835205078125, 0.0039825439453125, 0.01096343994140625, 0.0179443359375, 0.02492523193359375, 0.0319061279296875, 0.03888702392578125, 0.045867919921875, 0.05284881591796875, 0.0598297119140625, 0.06681060791015625, 0.07379150390625, 0.08077239990234375, 0.0877532958984375, 0.09473419189453125, 0.101715087890625, 0.10869598388671875, 0.1156768798828125, 0.12265777587890625, 0.129638671875, 0.13661956787109375, 0.1436004638671875, 0.15058135986328125, 0.157562255859375, 0.16454315185546875, 0.1715240478515625, 0.17850494384765625, 0.18548583984375, 0.19246673583984375, 0.1994476318359375, 0.20642852783203125, 0.213409423828125, 0.22039031982421875, 0.2273712158203125, 0.23435211181640625, 0.2413330078125]}, "gradients/encoder.encoder.layers.17.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 4.0, 3.0, 6.0, 19.0, 37.0, 62.0, 95.0, 173.0, 145.0, 169.0, 132.0, 78.0, 44.0, 24.0, 9.0, 6.0, 5.0, 3.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.134765625, -0.13181447982788086, -0.12886333465576172, -0.12591218948364258, -0.12296104431152344, -0.1200098991394043, -0.11705875396728516, -0.11410760879516602, -0.11115646362304688, -0.10820531845092773, -0.1052541732788086, -0.10230302810668945, -0.09935188293457031, -0.09640073776245117, -0.09344959259033203, -0.09049844741821289, -0.08754730224609375, -0.08459615707397461, -0.08164501190185547, -0.07869386672973633, -0.07574272155761719, -0.07279157638549805, -0.0698404312133789, -0.06688928604125977, -0.06393814086914062, -0.060986995697021484, -0.058035850524902344, -0.0550847053527832, -0.05213356018066406, -0.04918241500854492, -0.04623126983642578, -0.04328012466430664, -0.0403289794921875, -0.03737783432006836, -0.03442668914794922, -0.03147554397583008, -0.028524398803710938, -0.025573253631591797, -0.022622108459472656, -0.019670963287353516, -0.016719818115234375, -0.013768672943115234, -0.010817527770996094, -0.007866382598876953, -0.0049152374267578125, -0.001964092254638672, 0.0009870529174804688, 0.003938198089599609, 0.00688934326171875, 0.00984048843383789, 0.012791633605957031, 0.015742778778076172, 0.018693923950195312, 0.021645069122314453, 0.024596214294433594, 0.027547359466552734, 0.030498504638671875, 0.033449649810791016, 0.036400794982910156, 0.0393519401550293, 0.04230308532714844, 0.04525423049926758, 0.04820537567138672, 0.05115652084350586, 0.054107666015625]}, "gradients/encoder.encoder.layers.17.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 5.0, 3.0, 3.0, 1.0, 1.0, 3.0, 11.0, 6.0, 5.0, 10.0, 14.0, 10.0, 15.0, 31.0, 49.0, 59.0, 96.0, 223.0, 474.0, 1280.0, 3927.0, 15323.0, 76988.0, 483765.0, 389842.0, 59194.0, 12111.0, 3120.0, 1100.0, 411.0, 184.0, 91.0, 63.0, 31.0, 29.0, 24.0, 15.0, 9.0, 8.0, 4.0, 5.0, 5.0, 6.0, 4.0, 4.0, 4.0, 2.0, 1.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1229248046875, -0.11894989013671875, -0.1149749755859375, -0.11100006103515625, -0.107025146484375, -0.10305023193359375, -0.0990753173828125, -0.09510040283203125, -0.09112548828125, -0.08715057373046875, -0.0831756591796875, -0.07920074462890625, -0.075225830078125, -0.07125091552734375, -0.0672760009765625, -0.06330108642578125, -0.059326171875, -0.05535125732421875, -0.0513763427734375, -0.04740142822265625, -0.043426513671875, -0.03945159912109375, -0.0354766845703125, -0.03150177001953125, -0.02752685546875, -0.02355194091796875, -0.0195770263671875, -0.01560211181640625, -0.011627197265625, -0.00765228271484375, -0.0036773681640625, 0.00029754638671875, 0.0042724609375, 0.00824737548828125, 0.0122222900390625, 0.01619720458984375, 0.020172119140625, 0.02414703369140625, 0.0281219482421875, 0.03209686279296875, 0.03607177734375, 0.04004669189453125, 0.0440216064453125, 0.04799652099609375, 0.051971435546875, 0.05594635009765625, 0.0599212646484375, 0.06389617919921875, 0.06787109375, 0.07184600830078125, 0.0758209228515625, 0.07979583740234375, 0.083770751953125, 0.08774566650390625, 0.0917205810546875, 0.09569549560546875, 0.09967041015625, 0.10364532470703125, 0.1076202392578125, 0.11159515380859375, 0.115570068359375, 0.11954498291015625, 0.1235198974609375, 0.12749481201171875, 0.1314697265625]}, "gradients/encoder.encoder.layers.17.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 1.0, 3.0, 4.0, 4.0, 3.0, 7.0, 7.0, 7.0, 13.0, 19.0, 12.0, 17.0, 33.0, 27.0, 30.0, 26.0, 42.0, 36.0, 43.0, 35.0, 45.0, 44.0, 49.0, 42.0, 52.0, 55.0, 28.0, 40.0, 32.0, 43.0, 35.0, 26.0, 27.0, 25.0, 19.0, 18.0, 7.0, 12.0, 11.0, 8.0, 5.0, 7.0, 4.0, 4.0, 2.0, 1.0, 3.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.11383056640625, -0.1100454330444336, -0.10626029968261719, -0.10247516632080078, -0.09869003295898438, -0.09490489959716797, -0.09111976623535156, -0.08733463287353516, -0.08354949951171875, -0.07976436614990234, -0.07597923278808594, -0.07219409942626953, -0.06840896606445312, -0.06462383270263672, -0.06083869934082031, -0.057053565979003906, -0.0532684326171875, -0.049483299255371094, -0.04569816589355469, -0.04191303253173828, -0.038127899169921875, -0.03434276580810547, -0.030557632446289062, -0.026772499084472656, -0.02298736572265625, -0.019202232360839844, -0.015417098999023438, -0.011631965637207031, -0.007846832275390625, -0.004061698913574219, -0.0002765655517578125, 0.0035085678100585938, 0.007293701171875, 0.011078834533691406, 0.014863967895507812, 0.01864910125732422, 0.022434234619140625, 0.02621936798095703, 0.030004501342773438, 0.033789634704589844, 0.03757476806640625, 0.041359901428222656, 0.04514503479003906, 0.04893016815185547, 0.052715301513671875, 0.05650043487548828, 0.06028556823730469, 0.0640707015991211, 0.0678558349609375, 0.0716409683227539, 0.07542610168457031, 0.07921123504638672, 0.08299636840820312, 0.08678150177001953, 0.09056663513183594, 0.09435176849365234, 0.09813690185546875, 0.10192203521728516, 0.10570716857910156, 0.10949230194091797, 0.11327743530273438, 0.11706256866455078, 0.12084770202636719, 0.1246328353881836, 0.12841796875]}, "gradients/encoder.encoder.layers.17.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 2.0, 6.0, 6.0, 7.0, 4.0, 13.0, 27.0, 59.0, 98.0, 201.0, 673.0, 2945.0, 28424.0, 795015.0, 210088.0, 8889.0, 1405.0, 383.0, 157.0, 69.0, 34.0, 21.0, 11.0, 6.0, 7.0, 2.0, 7.0, 1.0, 3.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.1217041015625, -0.1183023452758789, -0.11490058898925781, -0.11149883270263672, -0.10809707641601562, -0.10469532012939453, -0.10129356384277344, -0.09789180755615234, -0.09449005126953125, -0.09108829498291016, -0.08768653869628906, -0.08428478240966797, -0.08088302612304688, -0.07748126983642578, -0.07407951354980469, -0.0706777572631836, -0.0672760009765625, -0.0638742446899414, -0.06047248840332031, -0.05707073211669922, -0.053668975830078125, -0.05026721954345703, -0.04686546325683594, -0.043463706970214844, -0.04006195068359375, -0.036660194396972656, -0.03325843811035156, -0.02985668182373047, -0.026454925537109375, -0.02305316925048828, -0.019651412963867188, -0.016249656677246094, -0.012847900390625, -0.009446144104003906, -0.0060443878173828125, -0.0026426315307617188, 0.000759124755859375, 0.004160881042480469, 0.0075626373291015625, 0.010964393615722656, 0.01436614990234375, 0.017767906188964844, 0.021169662475585938, 0.02457141876220703, 0.027973175048828125, 0.03137493133544922, 0.03477668762207031, 0.038178443908691406, 0.0415802001953125, 0.044981956481933594, 0.04838371276855469, 0.05178546905517578, 0.055187225341796875, 0.05858898162841797, 0.06199073791503906, 0.06539249420166016, 0.06879425048828125, 0.07219600677490234, 0.07559776306152344, 0.07899951934814453, 0.08240127563476562, 0.08580303192138672, 0.08920478820800781, 0.0926065444946289, 0.09600830078125]}, "gradients/encoder.encoder.layers.17.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 5.0, 6.0, 2.0, 6.0, 2.0, 11.0, 7.0, 8.0, 20.0, 13.0, 23.0, 31.0, 56.0, 47.0, 58.0, 65.0, 90.0, 77.0, 66.0, 82.0, 56.0, 71.0, 43.0, 37.0, 33.0, 21.0, 22.0, 12.0, 8.0, 8.0, 4.0, 3.0, 5.0, 3.0, 3.0, 1.0, 1.0, 1.0, 4.0, 3.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-1.0073184967041016e-05, -9.7593292593956e-06, -9.445473551750183e-06, -9.131617844104767e-06, -8.81776213645935e-06, -8.503906428813934e-06, -8.190050721168518e-06, -7.876195013523102e-06, -7.5623393058776855e-06, -7.248483598232269e-06, -6.934627890586853e-06, -6.620772182941437e-06, -6.3069164752960205e-06, -5.993060767650604e-06, -5.679205060005188e-06, -5.365349352359772e-06, -5.0514936447143555e-06, -4.737637937068939e-06, -4.423782229423523e-06, -4.109926521778107e-06, -3.7960708141326904e-06, -3.482215106487274e-06, -3.168359398841858e-06, -2.8545036911964417e-06, -2.5406479835510254e-06, -2.226792275905609e-06, -1.912936568260193e-06, -1.5990808606147766e-06, -1.2852251529693604e-06, -9.71369445323944e-07, -6.575137376785278e-07, -3.4365803003311157e-07, -2.9802322387695312e-08, 2.8405338525772095e-07, 5.979090929031372e-07, 9.117648005485535e-07, 1.2256205081939697e-06, 1.539476215839386e-06, 1.8533319234848022e-06, 2.1671876311302185e-06, 2.4810433387756348e-06, 2.794899046421051e-06, 3.1087547540664673e-06, 3.4226104617118835e-06, 3.7364661693573e-06, 4.050321877002716e-06, 4.364177584648132e-06, 4.678033292293549e-06, 4.991888999938965e-06, 5.305744707584381e-06, 5.619600415229797e-06, 5.933456122875214e-06, 6.24731183052063e-06, 6.561167538166046e-06, 6.875023245811462e-06, 7.188878953456879e-06, 7.502734661102295e-06, 7.816590368747711e-06, 8.130446076393127e-06, 8.444301784038544e-06, 8.75815749168396e-06, 9.072013199329376e-06, 9.385868906974792e-06, 9.699724614620209e-06, 1.0013580322265625e-05]}, "gradients/encoder.encoder.layers.17.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 1.0, 2.0, 2.0, 4.0, 3.0, 1.0, 5.0, 9.0, 9.0, 15.0, 39.0, 56.0, 87.0, 177.0, 392.0, 951.0, 2411.0, 8221.0, 36957.0, 315443.0, 592320.0, 72155.0, 13100.0, 3752.0, 1300.0, 586.0, 246.0, 131.0, 64.0, 42.0, 28.0, 17.0, 10.0, 2.0, 7.0, 10.0, 3.0, 2.0, 2.0, 4.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.08172607421875, -0.07963991165161133, -0.07755374908447266, -0.07546758651733398, -0.07338142395019531, -0.07129526138305664, -0.06920909881591797, -0.0671229362487793, -0.06503677368164062, -0.06295061111450195, -0.06086444854736328, -0.05877828598022461, -0.05669212341308594, -0.054605960845947266, -0.052519798278808594, -0.05043363571166992, -0.04834747314453125, -0.04626131057739258, -0.044175148010253906, -0.042088985443115234, -0.04000282287597656, -0.03791666030883789, -0.03583049774169922, -0.03374433517456055, -0.031658172607421875, -0.029572010040283203, -0.02748584747314453, -0.02539968490600586, -0.023313522338867188, -0.021227359771728516, -0.019141197204589844, -0.017055034637451172, -0.0149688720703125, -0.012882709503173828, -0.010796546936035156, -0.008710384368896484, -0.0066242218017578125, -0.004538059234619141, -0.0024518966674804688, -0.0003657341003417969, 0.001720428466796875, 0.003806591033935547, 0.005892753601074219, 0.00797891616821289, 0.010065078735351562, 0.012151241302490234, 0.014237403869628906, 0.016323566436767578, 0.01840972900390625, 0.020495891571044922, 0.022582054138183594, 0.024668216705322266, 0.026754379272460938, 0.02884054183959961, 0.03092670440673828, 0.03301286697387695, 0.035099029541015625, 0.0371851921081543, 0.03927135467529297, 0.04135751724243164, 0.04344367980957031, 0.045529842376708984, 0.047616004943847656, 0.04970216751098633, 0.051788330078125]}, "gradients/encoder.encoder.layers.17.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 3.0, 1.0, 1.0, 1.0, 5.0, 3.0, 6.0, 13.0, 14.0, 21.0, 18.0, 31.0, 41.0, 56.0, 75.0, 73.0, 102.0, 111.0, 92.0, 74.0, 64.0, 57.0, 31.0, 35.0, 22.0, 16.0, 13.0, 4.0, 8.0, 8.0, 2.0, 2.0, 5.0, 3.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.05865478515625, -0.05712747573852539, -0.05560016632080078, -0.05407285690307617, -0.05254554748535156, -0.05101823806762695, -0.049490928649902344, -0.047963619232177734, -0.046436309814453125, -0.044909000396728516, -0.043381690979003906, -0.0418543815612793, -0.04032707214355469, -0.03879976272583008, -0.03727245330810547, -0.03574514389038086, -0.03421783447265625, -0.03269052505493164, -0.03116321563720703, -0.029635906219482422, -0.028108596801757812, -0.026581287384033203, -0.025053977966308594, -0.023526668548583984, -0.021999359130859375, -0.020472049713134766, -0.018944740295410156, -0.017417430877685547, -0.015890121459960938, -0.014362812042236328, -0.012835502624511719, -0.01130819320678711, -0.0097808837890625, -0.00825357437133789, -0.006726264953613281, -0.005198955535888672, -0.0036716461181640625, -0.002144336700439453, -0.0006170272827148438, 0.0009102821350097656, 0.002437591552734375, 0.003964900970458984, 0.005492210388183594, 0.007019519805908203, 0.008546829223632812, 0.010074138641357422, 0.011601448059082031, 0.01312875747680664, 0.01465606689453125, 0.01618337631225586, 0.01771068572998047, 0.019237995147705078, 0.020765304565429688, 0.022292613983154297, 0.023819923400878906, 0.025347232818603516, 0.026874542236328125, 0.028401851654052734, 0.029929161071777344, 0.03145647048950195, 0.03298377990722656, 0.03451108932495117, 0.03603839874267578, 0.03756570816040039, 0.039093017578125]}, "gradients/encoder.encoder.layers.17.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 8.0, 54.0, 407.0, 452.0, 78.0, 15.0, 4.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-4.123210906982422, -4.035541534423828, -3.9478721618652344, -3.8602027893066406, -3.7725331783294678, -3.684863805770874, -3.5971944332122803, -3.5095250606536865, -3.4218554496765137, -3.33418607711792, -3.246516704559326, -3.1588473320007324, -3.0711777210235596, -2.983508348464966, -2.895838975906372, -2.8081696033477783, -2.7205002307891846, -2.632830858230591, -2.545161485671997, -2.457491874694824, -2.3698225021362305, -2.2821531295776367, -2.194483757019043, -2.106814384460449, -2.0191450119018555, -1.9314756393432617, -1.8438061475753784, -1.7561367750167847, -1.6684672832489014, -1.5807979106903076, -1.4931285381317139, -1.4054591655731201, -1.3177893161773682, -1.2301199436187744, -1.1424504518508911, -1.0547810792922974, -0.9671116471290588, -0.8794422149658203, -0.7917728424072266, -0.704103410243988, -0.6164339780807495, -0.528764545917511, -0.44109514355659485, -0.3534257411956787, -0.2657563090324402, -0.17808687686920166, -0.09041750431060791, -0.0027480721473693848, 0.08492136001586914, 0.17259077727794647, 0.2602601945400238, 0.34792959690093994, 0.43559902906417847, 0.523268461227417, 0.6109378337860107, 0.6986072659492493, 0.7862766981124878, 0.8739461302757263, 0.9616155624389648, 1.0492849349975586, 1.1369543075561523, 1.2246237993240356, 1.3122931718826294, 1.3999626636505127, 1.4876320362091064]}, "gradients/encoder.encoder.layers.17.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 2.0, 5.0, 11.0, 8.0, 13.0, 24.0, 26.0, 44.0, 37.0, 54.0, 62.0, 73.0, 84.0, 80.0, 98.0, 65.0, 73.0, 57.0, 57.0, 43.0, 24.0, 17.0, 23.0, 14.0, 10.0, 5.0, 5.0, 4.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.0030444860458374, -0.9652947187423706, -0.927544891834259, -0.8897950649261475, -0.8520452976226807, -0.8142955303192139, -0.7765457034111023, -0.7387958765029907, -0.7010461091995239, -0.6632963418960571, -0.6255465149879456, -0.587796688079834, -0.5500469207763672, -0.5122971534729004, -0.4745473265647888, -0.43679752945899963, -0.39904773235321045, -0.36129793524742126, -0.3235481381416321, -0.2857983410358429, -0.2480485439300537, -0.21029874682426453, -0.17254894971847534, -0.13479915261268616, -0.09704935550689697, -0.05929955840110779, -0.021549761295318604, 0.01620003581047058, 0.053949832916259766, 0.09169963002204895, 0.12944942712783813, 0.16719922423362732, 0.20494914054870605, 0.24269893765449524, 0.2804487347602844, 0.3181985318660736, 0.3559483289718628, 0.393698126077652, 0.43144792318344116, 0.46919772028923035, 0.5069475173950195, 0.5446972846984863, 0.5824471116065979, 0.6201969385147095, 0.6579467058181763, 0.6956964731216431, 0.7334463000297546, 0.7711961269378662, 0.808945894241333, 0.8466956615447998, 0.8844454884529114, 0.922195315361023, 0.9599450826644897, 0.9976948499679565, 1.035444736480713, 1.0731945037841797, 1.1109442710876465, 1.1486940383911133, 1.18644380569458, 1.2241936922073364, 1.2619434595108032, 1.29969322681427, 1.3374431133270264, 1.3751928806304932, 1.41294264793396]}, "gradients/encoder.encoder.layers.16.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 3.0, 4.0, 3.0, 1.0, 4.0, 3.0, 3.0, 1.0, 3.0, 3.0, 2.0, 3.0, 10.0, 7.0, 7.0, 17.0, 17.0, 27.0, 14.0, 27.0, 33.0, 44.0, 86.0, 111.0, 175.0, 256.0, 530.0, 1156.0, 3026.0, 10634.0, 104576.0, 4018401.0, 43734.0, 7308.0, 2196.0, 863.0, 415.0, 213.0, 136.0, 80.0, 59.0, 27.0, 29.0, 12.0, 2.0, 11.0, 7.0, 11.0, 3.0, 5.0, 0.0, 2.0], "bins": [-0.35693359375, -0.34869956970214844, -0.3404655456542969, -0.3322315216064453, -0.32399749755859375, -0.3157634735107422, -0.3075294494628906, -0.29929542541503906, -0.2910614013671875, -0.28282737731933594, -0.2745933532714844, -0.2663593292236328, -0.25812530517578125, -0.2498912811279297, -0.24165725708007812, -0.23342323303222656, -0.225189208984375, -0.21695518493652344, -0.20872116088867188, -0.2004871368408203, -0.19225311279296875, -0.1840190887451172, -0.17578506469726562, -0.16755104064941406, -0.1593170166015625, -0.15108299255371094, -0.14284896850585938, -0.1346149444580078, -0.12638092041015625, -0.11814689636230469, -0.10991287231445312, -0.10167884826660156, -0.09344482421875, -0.08521080017089844, -0.07697677612304688, -0.06874275207519531, -0.06050872802734375, -0.05227470397949219, -0.044040679931640625, -0.03580665588378906, -0.0275726318359375, -0.019338607788085938, -0.011104583740234375, -0.0028705596923828125, 0.00536346435546875, 0.013597488403320312, 0.021831512451171875, 0.030065536499023438, 0.038299560546875, 0.04653358459472656, 0.054767608642578125, 0.06300163269042969, 0.07123565673828125, 0.07946968078613281, 0.08770370483398438, 0.09593772888183594, 0.1041717529296875, 0.11240577697753906, 0.12063980102539062, 0.1288738250732422, 0.13710784912109375, 0.1453418731689453, 0.15357589721679688, 0.16180992126464844, 0.1700439453125]}, "gradients/encoder.encoder.layers.16.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 3.0, 1.0, 6.0, 7.0, 33.0, 27.0, 64.0, 81.0, 120.0, 128.0, 134.0, 131.0, 84.0, 73.0, 64.0, 27.0, 9.0, 5.0, 5.0, 6.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.10137939453125, -0.09891510009765625, -0.0964508056640625, -0.09398651123046875, -0.091522216796875, -0.08905792236328125, -0.0865936279296875, -0.08412933349609375, -0.0816650390625, -0.07920074462890625, -0.0767364501953125, -0.07427215576171875, -0.071807861328125, -0.06934356689453125, -0.0668792724609375, -0.06441497802734375, -0.06195068359375, -0.05948638916015625, -0.0570220947265625, -0.05455780029296875, -0.052093505859375, -0.04962921142578125, -0.0471649169921875, -0.04470062255859375, -0.042236328125, -0.03977203369140625, -0.0373077392578125, -0.03484344482421875, -0.032379150390625, -0.02991485595703125, -0.0274505615234375, -0.02498626708984375, -0.02252197265625, -0.02005767822265625, -0.0175933837890625, -0.01512908935546875, -0.012664794921875, -0.01020050048828125, -0.0077362060546875, -0.00527191162109375, -0.0028076171875, -0.00034332275390625, 0.0021209716796875, 0.00458526611328125, 0.007049560546875, 0.00951385498046875, 0.0119781494140625, 0.01444244384765625, 0.01690673828125, 0.01937103271484375, 0.0218353271484375, 0.02429962158203125, 0.026763916015625, 0.02922821044921875, 0.0316925048828125, 0.03415679931640625, 0.03662109375, 0.03908538818359375, 0.0415496826171875, 0.04401397705078125, 0.046478271484375, 0.04894256591796875, 0.0514068603515625, 0.05387115478515625, 0.05633544921875]}, "gradients/encoder.encoder.layers.16.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 3.0, 1.0, 1.0, 3.0, 7.0, 11.0, 10.0, 24.0, 35.0, 45.0, 52.0, 91.0, 142.0, 1502.0, 3781819.0, 408919.0, 1173.0, 150.0, 89.0, 53.0, 58.0, 40.0, 32.0, 15.0, 8.0, 8.0, 3.0, 6.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.57958984375, -0.5490188598632812, -0.5184478759765625, -0.48787689208984375, -0.457305908203125, -0.42673492431640625, -0.3961639404296875, -0.36559295654296875, -0.33502197265625, -0.30445098876953125, -0.2738800048828125, -0.24330902099609375, -0.212738037109375, -0.18216705322265625, -0.1515960693359375, -0.12102508544921875, -0.0904541015625, -0.05988311767578125, -0.0293121337890625, 0.00125885009765625, 0.031829833984375, 0.06240081787109375, 0.0929718017578125, 0.12354278564453125, 0.15411376953125, 0.18468475341796875, 0.2152557373046875, 0.24582672119140625, 0.276397705078125, 0.30696868896484375, 0.3375396728515625, 0.36811065673828125, 0.398681640625, 0.42925262451171875, 0.4598236083984375, 0.49039459228515625, 0.520965576171875, 0.5515365600585938, 0.5821075439453125, 0.6126785278320312, 0.64324951171875, 0.6738204956054688, 0.7043914794921875, 0.7349624633789062, 0.765533447265625, 0.7961044311523438, 0.8266754150390625, 0.8572463989257812, 0.8878173828125, 0.9183883666992188, 0.9489593505859375, 0.9795303344726562, 1.010101318359375, 1.0406723022460938, 1.0712432861328125, 1.1018142700195312, 1.13238525390625, 1.1629562377929688, 1.1935272216796875, 1.2240982055664062, 1.254669189453125, 1.2852401733398438, 1.3158111572265625, 1.3463821411132812, 1.376953125]}, "gradients/encoder.encoder.layers.16.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [2.0, 3.0, 2.0, 6.0, 8.0, 18.0, 38.0, 97.0, 423.0, 3093.0, 259.0, 75.0, 40.0, 13.0, 9.0, 5.0, 3.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.050384521484375, -0.04501962661743164, -0.03965473175048828, -0.03428983688354492, -0.028924942016601562, -0.023560047149658203, -0.018195152282714844, -0.012830257415771484, -0.007465362548828125, -0.0021004676818847656, 0.0032644271850585938, 0.008629322052001953, 0.013994216918945312, 0.019359111785888672, 0.02472400665283203, 0.03008890151977539, 0.03545379638671875, 0.04081869125366211, 0.04618358612060547, 0.05154848098754883, 0.05691337585449219, 0.06227827072143555, 0.0676431655883789, 0.07300806045532227, 0.07837295532226562, 0.08373785018920898, 0.08910274505615234, 0.0944676399230957, 0.09983253479003906, 0.10519742965698242, 0.11056232452392578, 0.11592721939086914, 0.1212921142578125, 0.12665700912475586, 0.13202190399169922, 0.13738679885864258, 0.14275169372558594, 0.1481165885925293, 0.15348148345947266, 0.15884637832641602, 0.16421127319335938, 0.16957616806030273, 0.1749410629272461, 0.18030595779418945, 0.1856708526611328, 0.19103574752807617, 0.19640064239501953, 0.2017655372619629, 0.20713043212890625, 0.2124953269958496, 0.21786022186279297, 0.22322511672973633, 0.2285900115966797, 0.23395490646362305, 0.2393198013305664, 0.24468469619750977, 0.2500495910644531, 0.2554144859313965, 0.26077938079833984, 0.2661442756652832, 0.27150917053222656, 0.2768740653991699, 0.2822389602661133, 0.28760385513305664, 0.29296875]}, "gradients/encoder.encoder.layers.16.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 3.0, 3.0, 6.0, 8.0, 12.0, 57.0, 183.0, 384.0, 232.0, 77.0, 21.0, 9.0, 6.0, 2.0, 1.0, 2.0, 0.0, 2.0, 0.0, 4.0, 1.0, 0.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.8283349275588989, -0.7994133234024048, -0.7704916596412659, -0.7415700554847717, -0.7126483917236328, -0.6837267875671387, -0.6548051834106445, -0.6258835196495056, -0.5969619154930115, -0.5680403113365173, -0.5391186475753784, -0.5101970434188843, -0.48127540946006775, -0.4523537755012512, -0.4234321415424347, -0.39451050758361816, -0.36558887362480164, -0.3366672396659851, -0.3077456057071686, -0.27882397174835205, -0.2499023675918579, -0.22098073363304138, -0.19205909967422485, -0.16313748061656952, -0.134215846657753, -0.10529422014951706, -0.07637259364128113, -0.0474509596824646, -0.018529333174228668, 0.010392293334007263, 0.03931392729282379, 0.06823554635047913, 0.09715718030929565, 0.12607881426811218, 0.15500043332576752, 0.18392206728458405, 0.21284368634223938, 0.2417653203010559, 0.27068695425987244, 0.29960858821868896, 0.3285301923751831, 0.35745182633399963, 0.38637346029281616, 0.4152950644493103, 0.44421669840812683, 0.47313833236694336, 0.5020599365234375, 0.5309816002845764, 0.5599032640457153, 0.5888248682022095, 0.6177465319633484, 0.6466681361198425, 0.6755897998809814, 0.7045114040374756, 0.7334330081939697, 0.7623546719551086, 0.7912762761116028, 0.8201978802680969, 0.8491195440292358, 0.87804114818573, 0.9069628119468689, 0.935884416103363, 0.964806079864502, 0.9937276840209961, 1.0226492881774902]}, "gradients/encoder.encoder.layers.16.final_layer_norm.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 4.0, 3.0, 9.0, 14.0, 22.0, 23.0, 71.0, 66.0, 92.0, 105.0, 121.0, 125.0, 104.0, 89.0, 53.0, 34.0, 30.0, 16.0, 11.0, 7.0, 4.0, 5.0, 2.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.31133735179901123, -0.29565608501434326, -0.2799748182296753, -0.26429352164268494, -0.24861225485801697, -0.232930988073349, -0.21724970638751984, -0.20156842470169067, -0.1858871579170227, -0.17020589113235474, -0.15452460944652557, -0.1388433277606964, -0.12316206097602844, -0.10748078674077988, -0.09179951250553131, -0.07611823827028275, -0.06043696403503418, -0.044755689799785614, -0.02907441556453705, -0.013393141329288483, 0.002288132905960083, 0.01796940714120865, 0.033650681376457214, 0.04933195561170578, 0.06501322984695435, 0.08069450408220291, 0.09637577831745148, 0.11205705255270004, 0.1277383267879486, 0.14341959357261658, 0.15910087525844574, 0.1747821569442749, 0.19046348333358765, 0.20614475011825562, 0.22182603180408478, 0.23750731348991394, 0.2531885802745819, 0.2688698470592499, 0.28455114364624023, 0.3002324104309082, 0.31591367721557617, 0.33159494400024414, 0.3472762107849121, 0.36295750737190247, 0.37863877415657043, 0.3943200409412384, 0.41000133752822876, 0.42568260431289673, 0.4413638710975647, 0.45704513788223267, 0.47272640466690063, 0.488407701253891, 0.5040889978408813, 0.5197702646255493, 0.5354515314102173, 0.5511327981948853, 0.5668140649795532, 0.5824953317642212, 0.5981765985488892, 0.6138578653335571, 0.6295391321182251, 0.6452204585075378, 0.6609017252922058, 0.6765829920768738, 0.6922642588615417]}, "gradients/encoder.encoder.layers.16.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 4.0, 4.0, 8.0, 7.0, 6.0, 9.0, 9.0, 21.0, 18.0, 37.0, 49.0, 53.0, 88.0, 113.0, 181.0, 299.0, 499.0, 984.0, 2718.0, 10997.0, 86169.0, 719441.0, 200331.0, 19668.0, 3910.0, 1325.0, 598.0, 341.0, 212.0, 158.0, 77.0, 74.0, 45.0, 33.0, 23.0, 18.0, 12.0, 11.0, 4.0, 10.0, 4.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.2259521484375, -0.21961593627929688, -0.21327972412109375, -0.20694351196289062, -0.2006072998046875, -0.19427108764648438, -0.18793487548828125, -0.18159866333007812, -0.175262451171875, -0.16892623901367188, -0.16259002685546875, -0.15625381469726562, -0.1499176025390625, -0.14358139038085938, -0.13724517822265625, -0.13090896606445312, -0.12457275390625, -0.11823654174804688, -0.11190032958984375, -0.10556411743164062, -0.0992279052734375, -0.09289169311523438, -0.08655548095703125, -0.08021926879882812, -0.073883056640625, -0.06754684448242188, -0.06121063232421875, -0.054874420166015625, -0.0485382080078125, -0.042201995849609375, -0.03586578369140625, -0.029529571533203125, -0.023193359375, -0.016857147216796875, -0.01052093505859375, -0.004184722900390625, 0.0021514892578125, 0.008487701416015625, 0.01482391357421875, 0.021160125732421875, 0.027496337890625, 0.033832550048828125, 0.04016876220703125, 0.046504974365234375, 0.0528411865234375, 0.059177398681640625, 0.06551361083984375, 0.07184982299804688, 0.07818603515625, 0.08452224731445312, 0.09085845947265625, 0.09719467163085938, 0.1035308837890625, 0.10986709594726562, 0.11620330810546875, 0.12253952026367188, 0.128875732421875, 0.13521194458007812, 0.14154815673828125, 0.14788436889648438, 0.1542205810546875, 0.16055679321289062, 0.16689300537109375, 0.17322921752929688, 0.1795654296875]}, "gradients/encoder.encoder.layers.16.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 3.0, 9.0, 16.0, 26.0, 38.0, 65.0, 87.0, 134.0, 149.0, 149.0, 99.0, 95.0, 62.0, 30.0, 28.0, 6.0, 6.0, 3.0, 1.0, 0.0, 3.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.104248046875, -0.10175132751464844, -0.09925460815429688, -0.09675788879394531, -0.09426116943359375, -0.09176445007324219, -0.08926773071289062, -0.08677101135253906, -0.0842742919921875, -0.08177757263183594, -0.07928085327148438, -0.07678413391113281, -0.07428741455078125, -0.07179069519042969, -0.06929397583007812, -0.06679725646972656, -0.064300537109375, -0.06180381774902344, -0.059307098388671875, -0.05681037902832031, -0.05431365966796875, -0.05181694030761719, -0.049320220947265625, -0.04682350158691406, -0.0443267822265625, -0.04183006286621094, -0.039333343505859375, -0.03683662414550781, -0.03433990478515625, -0.03184318542480469, -0.029346466064453125, -0.026849746704101562, -0.02435302734375, -0.021856307983398438, -0.019359588623046875, -0.016862869262695312, -0.01436614990234375, -0.011869430541992188, -0.009372711181640625, -0.0068759918212890625, -0.0043792724609375, -0.0018825531005859375, 0.000614166259765625, 0.0031108856201171875, 0.00560760498046875, 0.008104324340820312, 0.010601043701171875, 0.013097763061523438, 0.015594482421875, 0.018091201782226562, 0.020587921142578125, 0.023084640502929688, 0.02558135986328125, 0.028078079223632812, 0.030574798583984375, 0.03307151794433594, 0.0355682373046875, 0.03806495666503906, 0.040561676025390625, 0.04305839538574219, 0.04555511474609375, 0.04805183410644531, 0.050548553466796875, 0.05304527282714844, 0.0555419921875]}, "gradients/encoder.encoder.layers.16.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 2.0, 1.0, 1.0, 2.0, 1.0, 10.0, 6.0, 10.0, 14.0, 25.0, 18.0, 34.0, 65.0, 117.0, 259.0, 469.0, 1228.0, 3634.0, 14661.0, 89644.0, 618623.0, 275065.0, 33975.0, 7056.0, 2085.0, 821.0, 347.0, 165.0, 78.0, 41.0, 34.0, 24.0, 13.0, 8.0, 5.0, 6.0, 5.0, 5.0, 5.0, 1.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.151123046875, -0.1467742919921875, -0.142425537109375, -0.1380767822265625, -0.13372802734375, -0.1293792724609375, -0.125030517578125, -0.1206817626953125, -0.1163330078125, -0.1119842529296875, -0.107635498046875, -0.1032867431640625, -0.09893798828125, -0.0945892333984375, -0.090240478515625, -0.0858917236328125, -0.08154296875, -0.0771942138671875, -0.072845458984375, -0.0684967041015625, -0.06414794921875, -0.0597991943359375, -0.055450439453125, -0.0511016845703125, -0.0467529296875, -0.0424041748046875, -0.038055419921875, -0.0337066650390625, -0.02935791015625, -0.0250091552734375, -0.020660400390625, -0.0163116455078125, -0.011962890625, -0.0076141357421875, -0.003265380859375, 0.0010833740234375, 0.00543212890625, 0.0097808837890625, 0.014129638671875, 0.0184783935546875, 0.0228271484375, 0.0271759033203125, 0.031524658203125, 0.0358734130859375, 0.04022216796875, 0.0445709228515625, 0.048919677734375, 0.0532684326171875, 0.0576171875, 0.0619659423828125, 0.066314697265625, 0.0706634521484375, 0.07501220703125, 0.0793609619140625, 0.083709716796875, 0.0880584716796875, 0.0924072265625, 0.0967559814453125, 0.101104736328125, 0.1054534912109375, 0.10980224609375, 0.1141510009765625, 0.118499755859375, 0.1228485107421875, 0.127197265625]}, "gradients/encoder.encoder.layers.16.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 1.0, 3.0, 1.0, 1.0, 3.0, 4.0, 5.0, 10.0, 7.0, 7.0, 11.0, 20.0, 24.0, 18.0, 23.0, 26.0, 38.0, 31.0, 33.0, 34.0, 36.0, 40.0, 27.0, 53.0, 42.0, 50.0, 38.0, 46.0, 29.0, 48.0, 39.0, 35.0, 35.0, 28.0, 31.0, 30.0, 19.0, 16.0, 16.0, 10.0, 7.0, 5.0, 9.0, 4.0, 5.0, 4.0, 2.0, 2.0, 3.0, 4.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.11407470703125, -0.11045169830322266, -0.10682868957519531, -0.10320568084716797, -0.09958267211914062, -0.09595966339111328, -0.09233665466308594, -0.0887136459350586, -0.08509063720703125, -0.0814676284790039, -0.07784461975097656, -0.07422161102294922, -0.07059860229492188, -0.06697559356689453, -0.06335258483886719, -0.059729576110839844, -0.0561065673828125, -0.052483558654785156, -0.04886054992675781, -0.04523754119873047, -0.041614532470703125, -0.03799152374267578, -0.03436851501464844, -0.030745506286621094, -0.02712249755859375, -0.023499488830566406, -0.019876480102539062, -0.01625347137451172, -0.012630462646484375, -0.009007453918457031, -0.0053844451904296875, -0.0017614364624023438, 0.001861572265625, 0.005484580993652344, 0.009107589721679688, 0.012730598449707031, 0.016353607177734375, 0.01997661590576172, 0.023599624633789062, 0.027222633361816406, 0.03084564208984375, 0.034468650817871094, 0.03809165954589844, 0.04171466827392578, 0.045337677001953125, 0.04896068572998047, 0.05258369445800781, 0.056206703186035156, 0.0598297119140625, 0.06345272064208984, 0.06707572937011719, 0.07069873809814453, 0.07432174682617188, 0.07794475555419922, 0.08156776428222656, 0.0851907730102539, 0.08881378173828125, 0.0924367904663086, 0.09605979919433594, 0.09968280792236328, 0.10330581665039062, 0.10692882537841797, 0.11055183410644531, 0.11417484283447266, 0.1177978515625]}, "gradients/encoder.encoder.layers.16.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 1.0, 0.0, 3.0, 4.0, 5.0, 4.0, 9.0, 9.0, 14.0, 12.0, 26.0, 34.0, 43.0, 66.0, 98.0, 171.0, 291.0, 526.0, 1002.0, 2137.0, 4783.0, 12362.0, 42265.0, 211249.0, 552128.0, 166751.0, 35382.0, 10868.0, 4298.0, 1872.0, 935.0, 467.0, 280.0, 160.0, 105.0, 62.0, 39.0, 25.0, 17.0, 20.0, 10.0, 10.0, 5.0, 4.0, 2.0, 5.0, 2.0, 3.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0], "bins": [-0.041656494140625, -0.040410518646240234, -0.03916454315185547, -0.0379185676574707, -0.03667259216308594, -0.03542661666870117, -0.034180641174316406, -0.03293466567993164, -0.031688690185546875, -0.03044271469116211, -0.029196739196777344, -0.027950763702392578, -0.026704788208007812, -0.025458812713623047, -0.02421283721923828, -0.022966861724853516, -0.02172088623046875, -0.020474910736083984, -0.01922893524169922, -0.017982959747314453, -0.016736984252929688, -0.015491008758544922, -0.014245033264160156, -0.01299905776977539, -0.011753082275390625, -0.01050710678100586, -0.009261131286621094, -0.008015155792236328, -0.0067691802978515625, -0.005523204803466797, -0.004277229309082031, -0.0030312538146972656, -0.0017852783203125, -0.0005393028259277344, 0.0007066726684570312, 0.0019526481628417969, 0.0031986236572265625, 0.004444599151611328, 0.005690574645996094, 0.006936550140380859, 0.008182525634765625, 0.00942850112915039, 0.010674476623535156, 0.011920452117919922, 0.013166427612304688, 0.014412403106689453, 0.01565837860107422, 0.016904354095458984, 0.01815032958984375, 0.019396305084228516, 0.02064228057861328, 0.021888256072998047, 0.023134231567382812, 0.024380207061767578, 0.025626182556152344, 0.02687215805053711, 0.028118133544921875, 0.02936410903930664, 0.030610084533691406, 0.03185606002807617, 0.03310203552246094, 0.0343480110168457, 0.03559398651123047, 0.036839962005615234, 0.0380859375]}, "gradients/encoder.encoder.layers.16.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 2.0, 0.0, 1.0, 2.0, 2.0, 2.0, 4.0, 4.0, 7.0, 10.0, 12.0, 17.0, 25.0, 39.0, 64.0, 84.0, 113.0, 122.0, 135.0, 114.0, 79.0, 62.0, 36.0, 26.0, 17.0, 12.0, 8.0, 1.0, 5.0, 2.0, 3.0, 0.0, 4.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-2.199411392211914e-05, -2.145674079656601e-05, -2.091936767101288e-05, -2.0381994545459747e-05, -1.9844621419906616e-05, -1.9307248294353485e-05, -1.8769875168800354e-05, -1.8232502043247223e-05, -1.7695128917694092e-05, -1.715775579214096e-05, -1.662038266658783e-05, -1.60830095410347e-05, -1.5545636415481567e-05, -1.5008263289928436e-05, -1.4470890164375305e-05, -1.3933517038822174e-05, -1.3396143913269043e-05, -1.2858770787715912e-05, -1.232139766216278e-05, -1.178402453660965e-05, -1.1246651411056519e-05, -1.0709278285503387e-05, -1.0171905159950256e-05, -9.634532034397125e-06, -9.097158908843994e-06, -8.559785783290863e-06, -8.022412657737732e-06, -7.485039532184601e-06, -6.94766640663147e-06, -6.410293281078339e-06, -5.8729201555252075e-06, -5.335547029972076e-06, -4.798173904418945e-06, -4.260800778865814e-06, -3.723427653312683e-06, -3.186054527759552e-06, -2.648681402206421e-06, -2.11130827665329e-06, -1.5739351511001587e-06, -1.0365620255470276e-06, -4.991888999938965e-07, 3.818422555923462e-08, 5.755573511123657e-07, 1.1129304766654968e-06, 1.650303602218628e-06, 2.187676727771759e-06, 2.72504985332489e-06, 3.2624229788780212e-06, 3.7997961044311523e-06, 4.3371692299842834e-06, 4.8745423555374146e-06, 5.411915481090546e-06, 5.949288606643677e-06, 6.486661732196808e-06, 7.024034857749939e-06, 7.56140798330307e-06, 8.098781108856201e-06, 8.636154234409332e-06, 9.173527359962463e-06, 9.710900485515594e-06, 1.0248273611068726e-05, 1.0785646736621857e-05, 1.1323019862174988e-05, 1.1860392987728119e-05, 1.239776611328125e-05]}, "gradients/encoder.encoder.layers.16.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 4.0, 2.0, 6.0, 10.0, 5.0, 16.0, 25.0, 30.0, 48.0, 96.0, 133.0, 219.0, 491.0, 1077.0, 2484.0, 7420.0, 28082.0, 182186.0, 668629.0, 125866.0, 21499.0, 6177.0, 2217.0, 912.0, 433.0, 194.0, 117.0, 70.0, 37.0, 30.0, 17.0, 5.0, 5.0, 9.0, 4.0, 2.0, 2.0, 1.0, 2.0, 3.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.053955078125, -0.052056312561035156, -0.05015754699707031, -0.04825878143310547, -0.046360015869140625, -0.04446125030517578, -0.04256248474121094, -0.040663719177246094, -0.03876495361328125, -0.036866188049316406, -0.03496742248535156, -0.03306865692138672, -0.031169891357421875, -0.02927112579345703, -0.027372360229492188, -0.025473594665527344, -0.0235748291015625, -0.021676063537597656, -0.019777297973632812, -0.01787853240966797, -0.015979766845703125, -0.014081001281738281, -0.012182235717773438, -0.010283470153808594, -0.00838470458984375, -0.006485939025878906, -0.0045871734619140625, -0.0026884078979492188, -0.000789642333984375, 0.0011091232299804688, 0.0030078887939453125, 0.004906654357910156, 0.006805419921875, 0.008704185485839844, 0.010602951049804688, 0.012501716613769531, 0.014400482177734375, 0.01629924774169922, 0.018198013305664062, 0.020096778869628906, 0.02199554443359375, 0.023894309997558594, 0.025793075561523438, 0.02769184112548828, 0.029590606689453125, 0.03148937225341797, 0.03338813781738281, 0.035286903381347656, 0.0371856689453125, 0.039084434509277344, 0.04098320007324219, 0.04288196563720703, 0.044780731201171875, 0.04667949676513672, 0.04857826232910156, 0.050477027893066406, 0.05237579345703125, 0.054274559020996094, 0.05617332458496094, 0.05807209014892578, 0.059970855712890625, 0.06186962127685547, 0.06376838684082031, 0.06566715240478516, 0.06756591796875]}, "gradients/encoder.encoder.layers.16.attention.q_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 1.0, 0.0, 1.0, 4.0, 2.0, 0.0, 5.0, 6.0, 4.0, 2.0, 7.0, 8.0, 9.0, 11.0, 7.0, 18.0, 17.0, 25.0, 29.0, 33.0, 52.0, 56.0, 66.0, 70.0, 60.0, 67.0, 61.0, 63.0, 56.0, 41.0, 35.0, 45.0, 26.0, 23.0, 19.0, 14.0, 15.0, 10.0, 9.0, 6.0, 3.0, 5.0, 2.0, 5.0, 1.0, 3.0, 4.0, 2.0, 2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 2.0, 0.0, 1.0], "bins": [-0.0296783447265625, -0.02862238883972168, -0.02756643295288086, -0.02651047706604004, -0.02545452117919922, -0.0243985652923584, -0.023342609405517578, -0.022286653518676758, -0.021230697631835938, -0.020174741744995117, -0.019118785858154297, -0.018062829971313477, -0.017006874084472656, -0.015950918197631836, -0.014894962310791016, -0.013839006423950195, -0.012783050537109375, -0.011727094650268555, -0.010671138763427734, -0.009615182876586914, -0.008559226989746094, -0.0075032711029052734, -0.006447315216064453, -0.005391359329223633, -0.0043354034423828125, -0.003279447555541992, -0.002223491668701172, -0.0011675357818603516, -0.00011157989501953125, 0.0009443759918212891, 0.0020003318786621094, 0.0030562877655029297, 0.00411224365234375, 0.00516819953918457, 0.006224155426025391, 0.007280111312866211, 0.008336067199707031, 0.009392023086547852, 0.010447978973388672, 0.011503934860229492, 0.012559890747070312, 0.013615846633911133, 0.014671802520751953, 0.015727758407592773, 0.016783714294433594, 0.017839670181274414, 0.018895626068115234, 0.019951581954956055, 0.021007537841796875, 0.022063493728637695, 0.023119449615478516, 0.024175405502319336, 0.025231361389160156, 0.026287317276000977, 0.027343273162841797, 0.028399229049682617, 0.029455184936523438, 0.030511140823364258, 0.03156709671020508, 0.0326230525970459, 0.03367900848388672, 0.03473496437072754, 0.03579092025756836, 0.03684687614440918, 0.03790283203125]}, "gradients/encoder.encoder.layers.16.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 10.0, 27.0, 87.0, 208.0, 298.0, 232.0, 101.0, 27.0, 13.0, 2.0, 2.0, 0.0, 2.0, 1.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.8476474285125732, -1.8043205738067627, -1.7609935998916626, -1.7176666259765625, -1.674339771270752, -1.6310129165649414, -1.5876859426498413, -1.5443589687347412, -1.5010321140289307, -1.4577052593231201, -1.41437828540802, -1.37105131149292, -1.3277244567871094, -1.2843976020812988, -1.2410706281661987, -1.1977436542510986, -1.154416799545288, -1.1110899448394775, -1.0677629709243774, -1.0244359970092773, -0.9811091423034668, -0.9377822279930115, -0.8944553136825562, -0.8511283993721008, -0.8078014850616455, -0.7644745707511902, -0.7211476564407349, -0.6778207421302795, -0.6344938278198242, -0.5911669135093689, -0.5478399991989136, -0.5045130848884583, -0.4611862897872925, -0.41785937547683716, -0.37453246116638184, -0.3312055468559265, -0.2878786325454712, -0.24455171823501587, -0.20122480392456055, -0.15789788961410522, -0.1145709753036499, -0.07124406099319458, -0.027917146682739258, 0.015409767627716064, 0.05873668193817139, 0.10206359624862671, 0.14539051055908203, 0.18871742486953735, 0.23204433917999268, 0.275371253490448, 0.3186981678009033, 0.36202508211135864, 0.40535199642181396, 0.4486789107322693, 0.4920058250427246, 0.5353327393531799, 0.5786596536636353, 0.6219865679740906, 0.6653134822845459, 0.7086403965950012, 0.7519673109054565, 0.7952942252159119, 0.8386211395263672, 0.8819480538368225, 0.9252749681472778]}, "gradients/encoder.encoder.layers.16.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 2.0, 3.0, 6.0, 5.0, 15.0, 13.0, 10.0, 21.0, 36.0, 34.0, 36.0, 43.0, 59.0, 52.0, 70.0, 72.0, 80.0, 67.0, 55.0, 44.0, 47.0, 45.0, 34.0, 40.0, 21.0, 35.0, 24.0, 11.0, 9.0, 9.0, 6.0, 7.0, 5.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.8110131025314331, -0.7816190719604492, -0.7522250413894653, -0.7228310704231262, -0.6934370398521423, -0.6640430092811584, -0.6346489787101746, -0.6052550077438354, -0.5758609771728516, -0.5464669466018677, -0.5170729160308838, -0.4876789152622223, -0.4582849144935608, -0.4288908839225769, -0.399496853351593, -0.3701028525829315, -0.34070882201194763, -0.31131479144096375, -0.28192079067230225, -0.25252676010131836, -0.22313275933265686, -0.19373872876167297, -0.16434471309185028, -0.1349506974220276, -0.1055566817522049, -0.0761626660823822, -0.04676864668726921, -0.01737462729215622, 0.012019388377666473, 0.04141341149806976, 0.07080742716789246, 0.10020144283771515, 0.12959545850753784, 0.15898947417736053, 0.18838348984718323, 0.21777752041816711, 0.2471715211868286, 0.2765655517578125, 0.3059595823287964, 0.3353535830974579, 0.3647475838661194, 0.39414161443710327, 0.42353561520576477, 0.45292964577674866, 0.48232364654541016, 0.511717677116394, 0.5411117076873779, 0.5705057382583618, 0.5998997688293457, 0.6292937994003296, 0.6586878299713135, 0.6880818009376526, 0.7174758315086365, 0.7468698620796204, 0.7762638926506042, 0.8056578636169434, 0.8350518941879272, 0.8644459247589111, 0.893839955329895, 0.9232339262962341, 0.952627956867218, 0.9820219874382019, 1.011415958404541, 1.040809988975525, 1.0702040195465088]}, "gradients/encoder.encoder.layers.15.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 3.0, 1.0, 0.0, 4.0, 4.0, 8.0, 22.0, 65.0, 166.0, 586.0, 9580.0, 4179122.0, 4145.0, 416.0, 120.0, 27.0, 10.0, 6.0, 5.0, 4.0, 1.0, 0.0, 1.0, 1.0], "bins": [-1.5615234375, -1.5311431884765625, -1.500762939453125, -1.4703826904296875, -1.44000244140625, -1.4096221923828125, -1.379241943359375, -1.3488616943359375, -1.3184814453125, -1.2881011962890625, -1.257720947265625, -1.2273406982421875, -1.19696044921875, -1.1665802001953125, -1.136199951171875, -1.1058197021484375, -1.075439453125, -1.0450592041015625, -1.014678955078125, -0.9842987060546875, -0.95391845703125, -0.9235382080078125, -0.893157958984375, -0.8627777099609375, -0.8323974609375, -0.8020172119140625, -0.771636962890625, -0.7412567138671875, -0.71087646484375, -0.6804962158203125, -0.650115966796875, -0.6197357177734375, -0.58935546875, -0.5589752197265625, -0.528594970703125, -0.4982147216796875, -0.46783447265625, -0.4374542236328125, -0.407073974609375, -0.3766937255859375, -0.3463134765625, -0.3159332275390625, -0.285552978515625, -0.2551727294921875, -0.22479248046875, -0.1944122314453125, -0.164031982421875, -0.1336517333984375, -0.103271484375, -0.0728912353515625, -0.042510986328125, -0.0121307373046875, 0.01824951171875, 0.0486297607421875, 0.079010009765625, 0.1093902587890625, 0.1397705078125, 0.1701507568359375, 0.200531005859375, 0.2309112548828125, 0.26129150390625, 0.2916717529296875, 0.322052001953125, 0.3524322509765625, 0.3828125]}, "gradients/encoder.encoder.layers.15.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 1.0, 2.0, 2.0, 15.0, 13.0, 20.0, 46.0, 85.0, 82.0, 128.0, 126.0, 140.0, 118.0, 89.0, 54.0, 47.0, 20.0, 9.0, 5.0, 3.0, 4.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.10101318359375, -0.09853029251098633, -0.09604740142822266, -0.09356451034545898, -0.09108161926269531, -0.08859872817993164, -0.08611583709716797, -0.0836329460144043, -0.08115005493164062, -0.07866716384887695, -0.07618427276611328, -0.07370138168334961, -0.07121849060058594, -0.06873559951782227, -0.0662527084350586, -0.06376981735229492, -0.06128692626953125, -0.05880403518676758, -0.056321144104003906, -0.053838253021240234, -0.05135536193847656, -0.04887247085571289, -0.04638957977294922, -0.04390668869018555, -0.041423797607421875, -0.0389409065246582, -0.03645801544189453, -0.03397512435913086, -0.03149223327636719, -0.029009342193603516, -0.026526451110839844, -0.024043560028076172, -0.0215606689453125, -0.019077777862548828, -0.016594886779785156, -0.014111995697021484, -0.011629104614257812, -0.00914621353149414, -0.006663322448730469, -0.004180431365966797, -0.001697540283203125, 0.0007853507995605469, 0.0032682418823242188, 0.005751132965087891, 0.008234024047851562, 0.010716915130615234, 0.013199806213378906, 0.015682697296142578, 0.01816558837890625, 0.020648479461669922, 0.023131370544433594, 0.025614261627197266, 0.028097152709960938, 0.03058004379272461, 0.03306293487548828, 0.03554582595825195, 0.038028717041015625, 0.0405116081237793, 0.04299449920654297, 0.04547739028930664, 0.04796028137207031, 0.050443172454833984, 0.052926063537597656, 0.05540895462036133, 0.057891845703125]}, "gradients/encoder.encoder.layers.15.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 4.0, 3.0, 7.0, 6.0, 8.0, 15.0, 23.0, 43.0, 60.0, 83.0, 213.0, 1144.0, 9877.0, 2835204.0, 1336989.0, 9052.0, 1054.0, 224.0, 107.0, 68.0, 37.0, 27.0, 19.0, 13.0, 9.0, 6.0, 4.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.289306640625, -0.2748298645019531, -0.26035308837890625, -0.24587631225585938, -0.2313995361328125, -0.21692276000976562, -0.20244598388671875, -0.18796920776367188, -0.173492431640625, -0.15901565551757812, -0.14453887939453125, -0.13006210327148438, -0.1155853271484375, -0.10110855102539062, -0.08663177490234375, -0.07215499877929688, -0.05767822265625, -0.043201446533203125, -0.02872467041015625, -0.014247894287109375, 0.0002288818359375, 0.014705657958984375, 0.02918243408203125, 0.043659210205078125, 0.058135986328125, 0.07261276245117188, 0.08708953857421875, 0.10156631469726562, 0.1160430908203125, 0.13051986694335938, 0.14499664306640625, 0.15947341918945312, 0.1739501953125, 0.18842697143554688, 0.20290374755859375, 0.21738052368164062, 0.2318572998046875, 0.24633407592773438, 0.26081085205078125, 0.2752876281738281, 0.289764404296875, 0.3042411804199219, 0.31871795654296875, 0.3331947326660156, 0.3476715087890625, 0.3621482849121094, 0.37662506103515625, 0.3911018371582031, 0.40557861328125, 0.4200553894042969, 0.43453216552734375, 0.4490089416503906, 0.4634857177734375, 0.4779624938964844, 0.49243927001953125, 0.5069160461425781, 0.521392822265625, 0.5358695983886719, 0.5503463745117188, 0.5648231506347656, 0.5792999267578125, 0.5937767028808594, 0.6082534790039062, 0.6227302551269531, 0.63720703125]}, "gradients/encoder.encoder.layers.15.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 1.0, 2.0, 4.0, 4.0, 8.0, 10.0, 13.0, 27.0, 40.0, 94.0, 278.0, 2359.0, 906.0, 174.0, 76.0, 46.0, 20.0, 8.0, 5.0, 5.0, 5.0, 2.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0633544921875, -0.059078216552734375, -0.05480194091796875, -0.050525665283203125, -0.0462493896484375, -0.041973114013671875, -0.03769683837890625, -0.033420562744140625, -0.029144287109375, -0.024868011474609375, -0.02059173583984375, -0.016315460205078125, -0.0120391845703125, -0.007762908935546875, -0.00348663330078125, 0.000789642333984375, 0.00506591796875, 0.009342193603515625, 0.01361846923828125, 0.017894744873046875, 0.0221710205078125, 0.026447296142578125, 0.03072357177734375, 0.034999847412109375, 0.039276123046875, 0.043552398681640625, 0.04782867431640625, 0.052104949951171875, 0.0563812255859375, 0.060657501220703125, 0.06493377685546875, 0.06921005249023438, 0.073486328125, 0.07776260375976562, 0.08203887939453125, 0.08631515502929688, 0.0905914306640625, 0.09486770629882812, 0.09914398193359375, 0.10342025756835938, 0.107696533203125, 0.11197280883789062, 0.11624908447265625, 0.12052536010742188, 0.1248016357421875, 0.12907791137695312, 0.13335418701171875, 0.13763046264648438, 0.14190673828125, 0.14618301391601562, 0.15045928955078125, 0.15473556518554688, 0.1590118408203125, 0.16328811645507812, 0.16756439208984375, 0.17184066772460938, 0.176116943359375, 0.18039321899414062, 0.18466949462890625, 0.18894577026367188, 0.1932220458984375, 0.19749832153320312, 0.20177459716796875, 0.20605087280273438, 0.2103271484375]}, "gradients/encoder.encoder.layers.15.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 115.0, 891.0, 11.0, 1.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.4221875667572021, -1.2685809135437012, -1.1149742603302002, -0.9613674879074097, -0.8077608346939087, -0.6541541814804077, -0.500547468662262, -0.3469407558441162, -0.19333410263061523, -0.03972741961479187, 0.1138792634010315, 0.26748594641685486, 0.4210926294326782, 0.5746992826461792, 0.728305995464325, 0.8819127082824707, 1.0355193614959717, 1.1891260147094727, 1.3427326679229736, 1.4963394403457642, 1.6499460935592651, 1.8035527467727661, 1.9571595191955566, 2.1107661724090576, 2.2643728256225586, 2.4179794788360596, 2.5715861320495605, 2.7251927852630615, 2.8787994384765625, 3.0324063301086426, 3.1860129833221436, 3.3396196365356445, 3.4932260513305664, 3.6468327045440674, 3.8004393577575684, 3.9540460109710693, 4.10765266418457, 4.26125955581665, 4.414865970611572, 4.568472862243652, 4.722079277038574, 4.875686168670654, 5.029292583465576, 5.182899475097656, 5.336505889892578, 5.490112781524658, 5.64371919631958, 5.79732608795166, 5.95093297958374, 6.10453987121582, 6.258146286010742, 6.411753177642822, 6.565359592437744, 6.718966484069824, 6.872572898864746, 7.026179790496826, 7.179786682128906, 7.333393573760986, 7.486999988555908, 7.640606880187988, 7.79421329498291, 7.94782018661499, 8.10142707824707, 8.255033493041992, 8.408639907836914]}, "gradients/encoder.encoder.layers.15.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 1.0, 3.0, 7.0, 6.0, 10.0, 12.0, 18.0, 29.0, 42.0, 43.0, 60.0, 65.0, 64.0, 79.0, 66.0, 98.0, 66.0, 79.0, 52.0, 52.0, 45.0, 33.0, 23.0, 20.0, 12.0, 10.0, 8.0, 4.0, 4.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.3687801957130432, -0.35645145177841187, -0.34412267804145813, -0.3317939341068268, -0.31946519017219543, -0.3071364164352417, -0.29480767250061035, -0.282478928565979, -0.27015018463134766, -0.2578214406967163, -0.24549268186092377, -0.23316392302513123, -0.22083517909049988, -0.20850642025470734, -0.1961776614189148, -0.18384891748428345, -0.1715201437473297, -0.15919138491153717, -0.14686264097690582, -0.13453388214111328, -0.12220513075590134, -0.10987637937068939, -0.09754762053489685, -0.0852188691496849, -0.07289011776447296, -0.06056136637926102, -0.048232611268758774, -0.03590385615825653, -0.023575104773044586, -0.011246353387832642, 0.0010824054479599, 0.013411156833171844, 0.02573990821838379, 0.038068659603595734, 0.05039741471409798, 0.06272616982460022, 0.07505492120981216, 0.08738367259502411, 0.09971243143081665, 0.1120411828160286, 0.12436993420124054, 0.13669869303703308, 0.14902743697166443, 0.16135619580745697, 0.1736849546432495, 0.18601369857788086, 0.1983424574136734, 0.21067121624946594, 0.2229999601840973, 0.23532871901988983, 0.24765746295452118, 0.2599862217903137, 0.27231496572494507, 0.2846437096595764, 0.29697248339653015, 0.3093012273311615, 0.32163000106811523, 0.3339587450027466, 0.3462875187397003, 0.35861626267433167, 0.370945006608963, 0.38327378034591675, 0.3956025242805481, 0.40793126821517944, 0.4202600121498108]}, "gradients/encoder.encoder.layers.15.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 4.0, 2.0, 4.0, 1.0, 3.0, 1.0, 9.0, 5.0, 7.0, 13.0, 26.0, 34.0, 47.0, 68.0, 107.0, 202.0, 348.0, 748.0, 1795.0, 6234.0, 43119.0, 635473.0, 330491.0, 22889.0, 4269.0, 1356.0, 547.0, 302.0, 150.0, 100.0, 59.0, 41.0, 36.0, 24.0, 19.0, 3.0, 6.0, 5.0, 8.0, 3.0, 4.0, 3.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0], "bins": [-0.21435546875, -0.2080249786376953, -0.20169448852539062, -0.19536399841308594, -0.18903350830078125, -0.18270301818847656, -0.17637252807617188, -0.1700420379638672, -0.1637115478515625, -0.1573810577392578, -0.15105056762695312, -0.14472007751464844, -0.13838958740234375, -0.13205909729003906, -0.12572860717773438, -0.11939811706542969, -0.113067626953125, -0.10673713684082031, -0.10040664672851562, -0.09407615661621094, -0.08774566650390625, -0.08141517639160156, -0.07508468627929688, -0.06875419616699219, -0.0624237060546875, -0.05609321594238281, -0.049762725830078125, -0.04343223571777344, -0.03710174560546875, -0.030771255493164062, -0.024440765380859375, -0.018110275268554688, -0.01177978515625, -0.0054492950439453125, 0.000881195068359375, 0.0072116851806640625, 0.01354217529296875, 0.019872665405273438, 0.026203155517578125, 0.03253364562988281, 0.0388641357421875, 0.04519462585449219, 0.051525115966796875, 0.05785560607910156, 0.06418609619140625, 0.07051658630371094, 0.07684707641601562, 0.08317756652832031, 0.089508056640625, 0.09583854675292969, 0.10216903686523438, 0.10849952697753906, 0.11483001708984375, 0.12116050720214844, 0.12749099731445312, 0.1338214874267578, 0.1401519775390625, 0.1464824676513672, 0.15281295776367188, 0.15914344787597656, 0.16547393798828125, 0.17180442810058594, 0.17813491821289062, 0.1844654083251953, 0.1907958984375]}, "gradients/encoder.encoder.layers.15.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 3.0, 6.0, 8.0, 19.0, 33.0, 43.0, 61.0, 96.0, 119.0, 110.0, 123.0, 117.0, 89.0, 71.0, 46.0, 26.0, 15.0, 11.0, 5.0, 4.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09368896484375, -0.09134244918823242, -0.08899593353271484, -0.08664941787719727, -0.08430290222167969, -0.08195638656616211, -0.07960987091064453, -0.07726335525512695, -0.07491683959960938, -0.0725703239440918, -0.07022380828857422, -0.06787729263305664, -0.06553077697753906, -0.06318426132202148, -0.060837745666503906, -0.05849123001098633, -0.05614471435546875, -0.05379819869995117, -0.051451683044433594, -0.049105167388916016, -0.04675865173339844, -0.04441213607788086, -0.04206562042236328, -0.0397191047668457, -0.037372589111328125, -0.03502607345581055, -0.03267955780029297, -0.03033304214477539, -0.027986526489257812, -0.025640010833740234, -0.023293495178222656, -0.020946979522705078, -0.0186004638671875, -0.016253948211669922, -0.013907432556152344, -0.011560916900634766, -0.009214401245117188, -0.006867885589599609, -0.004521369934082031, -0.002174854278564453, 0.000171661376953125, 0.002518177032470703, 0.004864692687988281, 0.007211208343505859, 0.009557723999023438, 0.011904239654541016, 0.014250755310058594, 0.016597270965576172, 0.01894378662109375, 0.021290302276611328, 0.023636817932128906, 0.025983333587646484, 0.028329849243164062, 0.03067636489868164, 0.03302288055419922, 0.0353693962097168, 0.037715911865234375, 0.04006242752075195, 0.04240894317626953, 0.04475545883178711, 0.04710197448730469, 0.049448490142822266, 0.051795005798339844, 0.05414152145385742, 0.056488037109375]}, "gradients/encoder.encoder.layers.15.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 2.0, 1.0, 2.0, 5.0, 5.0, 5.0, 6.0, 7.0, 12.0, 21.0, 18.0, 36.0, 42.0, 79.0, 121.0, 210.0, 437.0, 960.0, 2647.0, 11056.0, 73564.0, 588381.0, 326316.0, 35064.0, 6181.0, 1869.0, 692.0, 345.0, 190.0, 95.0, 47.0, 48.0, 25.0, 24.0, 6.0, 11.0, 7.0, 6.0, 3.0, 4.0, 4.0, 2.0, 1.0, 2.0, 1.0, 1.0, 0.0, 2.0, 2.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.1246337890625, -0.1205902099609375, -0.116546630859375, -0.1125030517578125, -0.10845947265625, -0.1044158935546875, -0.100372314453125, -0.0963287353515625, -0.09228515625, -0.0882415771484375, -0.084197998046875, -0.0801544189453125, -0.07611083984375, -0.0720672607421875, -0.068023681640625, -0.0639801025390625, -0.0599365234375, -0.0558929443359375, -0.051849365234375, -0.0478057861328125, -0.04376220703125, -0.0397186279296875, -0.035675048828125, -0.0316314697265625, -0.027587890625, -0.0235443115234375, -0.019500732421875, -0.0154571533203125, -0.01141357421875, -0.0073699951171875, -0.003326416015625, 0.0007171630859375, 0.0047607421875, 0.0088043212890625, 0.012847900390625, 0.0168914794921875, 0.02093505859375, 0.0249786376953125, 0.029022216796875, 0.0330657958984375, 0.037109375, 0.0411529541015625, 0.045196533203125, 0.0492401123046875, 0.05328369140625, 0.0573272705078125, 0.061370849609375, 0.0654144287109375, 0.0694580078125, 0.0735015869140625, 0.077545166015625, 0.0815887451171875, 0.08563232421875, 0.0896759033203125, 0.093719482421875, 0.0977630615234375, 0.101806640625, 0.1058502197265625, 0.109893798828125, 0.1139373779296875, 0.11798095703125, 0.1220245361328125, 0.126068115234375, 0.1301116943359375, 0.1341552734375]}, "gradients/encoder.encoder.layers.15.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 2.0, 7.0, 6.0, 7.0, 7.0, 14.0, 7.0, 13.0, 15.0, 22.0, 22.0, 26.0, 30.0, 26.0, 34.0, 33.0, 39.0, 35.0, 52.0, 46.0, 37.0, 44.0, 42.0, 41.0, 53.0, 44.0, 29.0, 31.0, 27.0, 22.0, 28.0, 22.0, 39.0, 22.0, 15.0, 15.0, 10.0, 13.0, 4.0, 7.0, 5.0, 3.0, 6.0, 4.0, 1.0, 1.0, 0.0, 0.0, 4.0], "bins": [-0.1319580078125, -0.12833118438720703, -0.12470436096191406, -0.1210775375366211, -0.11745071411132812, -0.11382389068603516, -0.11019706726074219, -0.10657024383544922, -0.10294342041015625, -0.09931659698486328, -0.09568977355957031, -0.09206295013427734, -0.08843612670898438, -0.0848093032836914, -0.08118247985839844, -0.07755565643310547, -0.0739288330078125, -0.07030200958251953, -0.06667518615722656, -0.0630483627319336, -0.059421539306640625, -0.055794715881347656, -0.05216789245605469, -0.04854106903076172, -0.04491424560546875, -0.04128742218017578, -0.03766059875488281, -0.034033775329589844, -0.030406951904296875, -0.026780128479003906, -0.023153305053710938, -0.01952648162841797, -0.015899658203125, -0.012272834777832031, -0.008646011352539062, -0.005019187927246094, -0.001392364501953125, 0.0022344589233398438, 0.0058612823486328125, 0.009488105773925781, 0.01311492919921875, 0.01674175262451172, 0.020368576049804688, 0.023995399475097656, 0.027622222900390625, 0.031249046325683594, 0.03487586975097656, 0.03850269317626953, 0.0421295166015625, 0.04575634002685547, 0.04938316345214844, 0.053009986877441406, 0.056636810302734375, 0.060263633728027344, 0.06389045715332031, 0.06751728057861328, 0.07114410400390625, 0.07477092742919922, 0.07839775085449219, 0.08202457427978516, 0.08565139770507812, 0.0892782211303711, 0.09290504455566406, 0.09653186798095703, 0.10015869140625]}, "gradients/encoder.encoder.layers.15.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 2.0, 4.0, 1.0, 7.0, 10.0, 14.0, 22.0, 31.0, 74.0, 193.0, 571.0, 2425.0, 18064.0, 798628.0, 219036.0, 7552.0, 1354.0, 314.0, 138.0, 52.0, 19.0, 20.0, 11.0, 3.0, 6.0, 3.0, 2.0, 5.0, 2.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.06561279296875, -0.06285762786865234, -0.06010246276855469, -0.05734729766845703, -0.054592132568359375, -0.05183696746826172, -0.04908180236816406, -0.046326637268066406, -0.04357147216796875, -0.040816307067871094, -0.03806114196777344, -0.03530597686767578, -0.032550811767578125, -0.02979564666748047, -0.027040481567382812, -0.024285316467285156, -0.0215301513671875, -0.018774986267089844, -0.016019821166992188, -0.013264656066894531, -0.010509490966796875, -0.007754325866699219, -0.0049991607666015625, -0.0022439956665039062, 0.00051116943359375, 0.0032663345336914062, 0.0060214996337890625, 0.008776664733886719, 0.011531829833984375, 0.014286994934082031, 0.017042160034179688, 0.019797325134277344, 0.022552490234375, 0.025307655334472656, 0.028062820434570312, 0.03081798553466797, 0.033573150634765625, 0.03632831573486328, 0.03908348083496094, 0.041838645935058594, 0.04459381103515625, 0.047348976135253906, 0.05010414123535156, 0.05285930633544922, 0.055614471435546875, 0.05836963653564453, 0.06112480163574219, 0.06387996673583984, 0.0666351318359375, 0.06939029693603516, 0.07214546203613281, 0.07490062713623047, 0.07765579223632812, 0.08041095733642578, 0.08316612243652344, 0.0859212875366211, 0.08867645263671875, 0.0914316177368164, 0.09418678283691406, 0.09694194793701172, 0.09969711303710938, 0.10245227813720703, 0.10520744323730469, 0.10796260833740234, 0.1107177734375]}, "gradients/encoder.encoder.layers.15.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 3.0, 6.0, 8.0, 4.0, 10.0, 15.0, 16.0, 27.0, 52.0, 49.0, 64.0, 81.0, 88.0, 111.0, 98.0, 75.0, 73.0, 67.0, 43.0, 38.0, 32.0, 15.0, 16.0, 10.0, 3.0, 5.0, 3.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0], "bins": [-1.5556812286376953e-05, -1.520942896604538e-05, -1.4862045645713806e-05, -1.4514662325382233e-05, -1.416727900505066e-05, -1.3819895684719086e-05, -1.3472512364387512e-05, -1.3125129044055939e-05, -1.2777745723724365e-05, -1.2430362403392792e-05, -1.2082979083061218e-05, -1.1735595762729645e-05, -1.1388212442398071e-05, -1.1040829122066498e-05, -1.0693445801734924e-05, -1.034606248140335e-05, -9.998679161071777e-06, -9.651295840740204e-06, -9.30391252040863e-06, -8.956529200077057e-06, -8.609145879745483e-06, -8.26176255941391e-06, -7.914379239082336e-06, -7.566995918750763e-06, -7.2196125984191895e-06, -6.872229278087616e-06, -6.5248459577560425e-06, -6.177462637424469e-06, -5.8300793170928955e-06, -5.482695996761322e-06, -5.1353126764297485e-06, -4.787929356098175e-06, -4.4405460357666016e-06, -4.093162715435028e-06, -3.7457793951034546e-06, -3.398396074771881e-06, -3.0510127544403076e-06, -2.703629434108734e-06, -2.3562461137771606e-06, -2.008862793445587e-06, -1.6614794731140137e-06, -1.3140961527824402e-06, -9.667128324508667e-07, -6.193295121192932e-07, -2.7194619178771973e-07, 7.543712854385376e-08, 4.2282044887542725e-07, 7.702037692070007e-07, 1.1175870895385742e-06, 1.4649704098701477e-06, 1.8123537302017212e-06, 2.1597370505332947e-06, 2.507120370864868e-06, 2.8545036911964417e-06, 3.201887011528015e-06, 3.5492703318595886e-06, 3.896653652191162e-06, 4.244036972522736e-06, 4.591420292854309e-06, 4.9388036131858826e-06, 5.286186933517456e-06, 5.6335702538490295e-06, 5.980953574180603e-06, 6.3283368945121765e-06, 6.67572021484375e-06]}, "gradients/encoder.encoder.layers.15.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 5.0, 4.0, 5.0, 6.0, 22.0, 51.0, 116.0, 244.0, 730.0, 2698.0, 17264.0, 791486.0, 224950.0, 8450.0, 1699.0, 494.0, 180.0, 81.0, 40.0, 19.0, 4.0, 8.0, 5.0, 2.0, 2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1192626953125, -0.1159353256225586, -0.11260795593261719, -0.10928058624267578, -0.10595321655273438, -0.10262584686279297, -0.09929847717285156, -0.09597110748291016, -0.09264373779296875, -0.08931636810302734, -0.08598899841308594, -0.08266162872314453, -0.07933425903320312, -0.07600688934326172, -0.07267951965332031, -0.0693521499633789, -0.0660247802734375, -0.0626974105834961, -0.05937004089355469, -0.05604267120361328, -0.052715301513671875, -0.04938793182373047, -0.04606056213378906, -0.042733192443847656, -0.03940582275390625, -0.036078453063964844, -0.03275108337402344, -0.02942371368408203, -0.026096343994140625, -0.02276897430419922, -0.019441604614257812, -0.016114234924316406, -0.012786865234375, -0.009459495544433594, -0.0061321258544921875, -0.0028047561645507812, 0.000522613525390625, 0.0038499832153320312, 0.0071773529052734375, 0.010504722595214844, 0.01383209228515625, 0.017159461975097656, 0.020486831665039062, 0.02381420135498047, 0.027141571044921875, 0.03046894073486328, 0.03379631042480469, 0.037123680114746094, 0.0404510498046875, 0.043778419494628906, 0.04710578918457031, 0.05043315887451172, 0.053760528564453125, 0.05708789825439453, 0.06041526794433594, 0.06374263763427734, 0.06707000732421875, 0.07039737701416016, 0.07372474670410156, 0.07705211639404297, 0.08037948608398438, 0.08370685577392578, 0.08703422546386719, 0.0903615951538086, 0.09368896484375]}, "gradients/encoder.encoder.layers.15.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 4.0, 3.0, 0.0, 1.0, 4.0, 6.0, 11.0, 17.0, 18.0, 29.0, 39.0, 72.0, 100.0, 143.0, 155.0, 127.0, 93.0, 60.0, 55.0, 22.0, 22.0, 17.0, 7.0, 2.0, 0.0, 4.0, 0.0, 5.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.057281494140625, -0.055567264556884766, -0.05385303497314453, -0.0521388053894043, -0.05042457580566406, -0.04871034622192383, -0.046996116638183594, -0.04528188705444336, -0.043567657470703125, -0.04185342788696289, -0.040139198303222656, -0.03842496871948242, -0.03671073913574219, -0.03499650955200195, -0.03328227996826172, -0.031568050384521484, -0.02985382080078125, -0.028139591217041016, -0.02642536163330078, -0.024711132049560547, -0.022996902465820312, -0.021282672882080078, -0.019568443298339844, -0.01785421371459961, -0.016139984130859375, -0.01442575454711914, -0.012711524963378906, -0.010997295379638672, -0.009283065795898438, -0.007568836212158203, -0.005854606628417969, -0.004140377044677734, -0.0024261474609375, -0.0007119178771972656, 0.0010023117065429688, 0.002716541290283203, 0.0044307708740234375, 0.006145000457763672, 0.007859230041503906, 0.00957345962524414, 0.011287689208984375, 0.01300191879272461, 0.014716148376464844, 0.016430377960205078, 0.018144607543945312, 0.019858837127685547, 0.02157306671142578, 0.023287296295166016, 0.02500152587890625, 0.026715755462646484, 0.02842998504638672, 0.030144214630126953, 0.03185844421386719, 0.03357267379760742, 0.035286903381347656, 0.03700113296508789, 0.038715362548828125, 0.04042959213256836, 0.042143821716308594, 0.04385805130004883, 0.04557228088378906, 0.0472865104675293, 0.04900074005126953, 0.050714969635009766, 0.05242919921875]}, "gradients/encoder.encoder.layers.15.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 3.0, 3.0, 14.0, 43.0, 178.0, 390.0, 257.0, 89.0, 27.0, 7.0, 3.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.8725656270980835, -1.8241504430770874, -1.7757352590560913, -1.7273200750350952, -1.6789050102233887, -1.6304898262023926, -1.5820746421813965, -1.5336594581604004, -1.4852442741394043, -1.4368290901184082, -1.388413906097412, -1.339998722076416, -1.29158353805542, -1.2431684732437134, -1.1947532892227173, -1.1463381052017212, -1.097922921180725, -1.049507737159729, -1.001092553138733, -0.9526774287223816, -0.9042622447013855, -0.8558470606803894, -0.8074319362640381, -0.759016752243042, -0.7106015682220459, -0.6621863842010498, -0.6137712001800537, -0.5653560757637024, -0.5169408917427063, -0.4685257077217102, -0.4201105535030365, -0.3716953992843628, -0.32328009605407715, -0.27486491203308105, -0.22644975781440735, -0.17803458869457245, -0.12961941957473755, -0.08120425045490265, -0.03278908133506775, 0.015626072883605957, 0.06404125690460205, 0.11245642602443695, 0.16087159514427185, 0.20928676426410675, 0.25770193338394165, 0.30611711740493774, 0.35453227162361145, 0.40294742584228516, 0.45136260986328125, 0.49977779388427734, 0.5481929779052734, 0.5966081023216248, 0.6450232863426208, 0.6934384703636169, 0.7418535947799683, 0.7902687788009644, 0.8386839628219604, 0.8870991468429565, 0.9355143308639526, 0.983929455280304, 1.0323445796966553, 1.0807597637176514, 1.1291749477386475, 1.1775901317596436, 1.2260053157806396]}, "gradients/encoder.encoder.layers.15.layer_norm.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 5.0, 0.0, 5.0, 2.0, 6.0, 6.0, 10.0, 16.0, 17.0, 18.0, 17.0, 24.0, 29.0, 36.0, 32.0, 37.0, 40.0, 44.0, 48.0, 37.0, 38.0, 44.0, 52.0, 44.0, 57.0, 37.0, 40.0, 46.0, 32.0, 38.0, 30.0, 18.0, 18.0, 12.0, 17.0, 16.0, 10.0, 10.0, 8.0, 5.0, 4.0, 4.0, 1.0, 1.0, 0.0, 5.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.44711625576019287, -0.4291492700576782, -0.4111822545528412, -0.39321526885032654, -0.3752482533454895, -0.35728126764297485, -0.3393142819404602, -0.32134726643562317, -0.30338025093078613, -0.2854132652282715, -0.26744624972343445, -0.2494792640209198, -0.23151224851608276, -0.21354526281356812, -0.19557826220989227, -0.17761126160621643, -0.15964427590370178, -0.14167727530002594, -0.1237102746963501, -0.10574328154325485, -0.08777628093957901, -0.06980928033590317, -0.05184228718280792, -0.03387528657913208, -0.015908285975456238, 0.0020587127655744553, 0.02002571150660515, 0.03799270838499069, 0.055959708988666534, 0.07392670959234238, 0.09189370274543762, 0.10986070334911346, 0.1278277039527893, 0.14579470455646515, 0.163761705160141, 0.18172869086265564, 0.19969570636749268, 0.21766269207000732, 0.23562969267368317, 0.253596693277359, 0.27156370878219604, 0.2895306944847107, 0.30749770998954773, 0.3254646956920624, 0.3434317111968994, 0.36139869689941406, 0.3793656826019287, 0.39733269810676575, 0.4152996838092804, 0.43326666951179504, 0.4512336850166321, 0.46920067071914673, 0.48716768622398376, 0.5051347017288208, 0.5231016874313354, 0.5410686731338501, 0.5590356588363647, 0.5770026445388794, 0.594969630241394, 0.6129366755485535, 0.6309036612510681, 0.6488706469535828, 0.6668376326560974, 0.6848046779632568, 0.7027716636657715]}, "gradients/encoder.encoder.layers.14.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 1.0, 0.0, 1.0, 0.0, 4.0, 1.0, 7.0, 7.0, 3.0, 9.0, 7.0, 11.0, 13.0, 20.0, 24.0, 39.0, 35.0, 96.0, 195.0, 426.0, 1109.0, 3609.0, 27341.0, 4129954.0, 25741.0, 3638.0, 1084.0, 449.0, 181.0, 110.0, 53.0, 34.0, 36.0, 19.0, 12.0, 9.0, 3.0, 4.0, 1.0, 4.0, 1.0, 2.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.4345703125, -0.42409515380859375, -0.4136199951171875, -0.40314483642578125, -0.392669677734375, -0.38219451904296875, -0.3717193603515625, -0.36124420166015625, -0.35076904296875, -0.34029388427734375, -0.3298187255859375, -0.31934356689453125, -0.308868408203125, -0.29839324951171875, -0.2879180908203125, -0.27744293212890625, -0.2669677734375, -0.25649261474609375, -0.2460174560546875, -0.23554229736328125, -0.225067138671875, -0.21459197998046875, -0.2041168212890625, -0.19364166259765625, -0.18316650390625, -0.17269134521484375, -0.1622161865234375, -0.15174102783203125, -0.141265869140625, -0.13079071044921875, -0.1203155517578125, -0.10984039306640625, -0.099365234375, -0.08889007568359375, -0.0784149169921875, -0.06793975830078125, -0.057464599609375, -0.04698944091796875, -0.0365142822265625, -0.02603912353515625, -0.01556396484375, -0.00508880615234375, 0.0053863525390625, 0.01586151123046875, 0.026336669921875, 0.03681182861328125, 0.0472869873046875, 0.05776214599609375, 0.0682373046875, 0.07871246337890625, 0.0891876220703125, 0.09966278076171875, 0.110137939453125, 0.12061309814453125, 0.1310882568359375, 0.14156341552734375, 0.15203857421875, 0.16251373291015625, 0.1729888916015625, 0.18346405029296875, 0.193939208984375, 0.20441436767578125, 0.2148895263671875, 0.22536468505859375, 0.23583984375]}, "gradients/encoder.encoder.layers.14.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 3.0, 7.0, 10.0, 17.0, 41.0, 44.0, 68.0, 92.0, 125.0, 113.0, 122.0, 122.0, 87.0, 61.0, 39.0, 29.0, 10.0, 11.0, 3.0, 3.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0987548828125, -0.0963296890258789, -0.09390449523925781, -0.09147930145263672, -0.08905410766601562, -0.08662891387939453, -0.08420372009277344, -0.08177852630615234, -0.07935333251953125, -0.07692813873291016, -0.07450294494628906, -0.07207775115966797, -0.06965255737304688, -0.06722736358642578, -0.06480216979980469, -0.062376976013183594, -0.0599517822265625, -0.057526588439941406, -0.05510139465332031, -0.05267620086669922, -0.050251007080078125, -0.04782581329345703, -0.04540061950683594, -0.042975425720214844, -0.04055023193359375, -0.038125038146972656, -0.03569984436035156, -0.03327465057373047, -0.030849456787109375, -0.02842426300048828, -0.025999069213867188, -0.023573875427246094, -0.021148681640625, -0.018723487854003906, -0.016298294067382812, -0.013873100280761719, -0.011447906494140625, -0.009022712707519531, -0.0065975189208984375, -0.004172325134277344, -0.00174713134765625, 0.0006780624389648438, 0.0031032562255859375, 0.005528450012207031, 0.007953643798828125, 0.010378837585449219, 0.012804031372070312, 0.015229225158691406, 0.0176544189453125, 0.020079612731933594, 0.022504806518554688, 0.02493000030517578, 0.027355194091796875, 0.02978038787841797, 0.03220558166503906, 0.034630775451660156, 0.03705596923828125, 0.039481163024902344, 0.04190635681152344, 0.04433155059814453, 0.046756744384765625, 0.04918193817138672, 0.05160713195800781, 0.054032325744628906, 0.05645751953125]}, "gradients/encoder.encoder.layers.14.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 2.0, 2.0, 2.0, 3.0, 5.0, 5.0, 9.0, 18.0, 24.0, 33.0, 49.0, 70.0, 88.0, 139.0, 252.0, 564.0, 4460.0, 4161607.0, 24992.0, 1031.0, 356.0, 179.0, 113.0, 99.0, 60.0, 44.0, 38.0, 19.0, 11.0, 10.0, 4.0, 6.0, 4.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.53125, -0.502838134765625, -0.47442626953125, -0.446014404296875, -0.4176025390625, -0.389190673828125, -0.36077880859375, -0.332366943359375, -0.303955078125, -0.275543212890625, -0.24713134765625, -0.218719482421875, -0.1903076171875, -0.161895751953125, -0.13348388671875, -0.105072021484375, -0.07666015625, -0.048248291015625, -0.01983642578125, 0.008575439453125, 0.0369873046875, 0.065399169921875, 0.09381103515625, 0.122222900390625, 0.150634765625, 0.179046630859375, 0.20745849609375, 0.235870361328125, 0.2642822265625, 0.292694091796875, 0.32110595703125, 0.349517822265625, 0.3779296875, 0.406341552734375, 0.43475341796875, 0.463165283203125, 0.4915771484375, 0.519989013671875, 0.54840087890625, 0.576812744140625, 0.605224609375, 0.633636474609375, 0.66204833984375, 0.690460205078125, 0.7188720703125, 0.747283935546875, 0.77569580078125, 0.804107666015625, 0.83251953125, 0.860931396484375, 0.88934326171875, 0.917755126953125, 0.9461669921875, 0.974578857421875, 1.00299072265625, 1.031402587890625, 1.059814453125, 1.088226318359375, 1.11663818359375, 1.145050048828125, 1.1734619140625, 1.201873779296875, 1.23028564453125, 1.258697509765625, 1.287109375]}, "gradients/encoder.encoder.layers.14.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 5.0, 11.0, 33.0, 97.0, 3053.0, 777.0, 68.0, 20.0, 10.0, 6.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.10101318359375, -0.09421253204345703, -0.08741188049316406, -0.0806112289428711, -0.07381057739257812, -0.06700992584228516, -0.06020927429199219, -0.05340862274169922, -0.04660797119140625, -0.03980731964111328, -0.03300666809082031, -0.026206016540527344, -0.019405364990234375, -0.012604713439941406, -0.0058040618896484375, 0.0009965896606445312, 0.0077972412109375, 0.014597892761230469, 0.021398544311523438, 0.028199195861816406, 0.034999847412109375, 0.041800498962402344, 0.04860115051269531, 0.05540180206298828, 0.06220245361328125, 0.06900310516357422, 0.07580375671386719, 0.08260440826416016, 0.08940505981445312, 0.0962057113647461, 0.10300636291503906, 0.10980701446533203, 0.116607666015625, 0.12340831756591797, 0.13020896911621094, 0.1370096206665039, 0.14381027221679688, 0.15061092376708984, 0.1574115753173828, 0.16421222686767578, 0.17101287841796875, 0.17781352996826172, 0.1846141815185547, 0.19141483306884766, 0.19821548461914062, 0.2050161361694336, 0.21181678771972656, 0.21861743927001953, 0.2254180908203125, 0.23221874237060547, 0.23901939392089844, 0.2458200454711914, 0.2526206970214844, 0.25942134857177734, 0.2662220001220703, 0.2730226516723633, 0.27982330322265625, 0.2866239547729492, 0.2934246063232422, 0.30022525787353516, 0.3070259094238281, 0.3138265609741211, 0.32062721252441406, 0.32742786407470703, 0.334228515625]}, "gradients/encoder.encoder.layers.14.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 2.0, 9.0, 18.0, 70.0, 290.0, 439.0, 117.0, 29.0, 9.0, 13.0, 5.0, 3.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7024028897285461, -0.6638566255569458, -0.6253104209899902, -0.5867641568183899, -0.5482178926467896, -0.509671688079834, -0.47112542390823364, -0.4325791597366333, -0.39403292536735535, -0.3554866909980774, -0.31694042682647705, -0.2783941924571991, -0.23984794318675995, -0.2013016939163208, -0.16275545954704285, -0.1242091953754425, -0.08566296100616455, -0.0471167154610157, -0.008570469915866852, 0.0299757719039917, 0.06852202117443085, 0.10706827044487, 0.14561450481414795, 0.1841607689857483, 0.22270700335502625, 0.2612532377243042, 0.29979950189590454, 0.3383457362651825, 0.37689197063446045, 0.4154382348060608, 0.45398446917533875, 0.4925307333469391, 0.5310769081115723, 0.5696231722831726, 0.6081693768501282, 0.6467156410217285, 0.6852619051933289, 0.7238081693649292, 0.7623543739318848, 0.8009006381034851, 0.8394469022750854, 0.8779931664466858, 0.9165393710136414, 0.9550856351852417, 0.993631899356842, 1.0321781635284424, 1.070724368095398, 1.1092705726623535, 1.1478168964385986, 1.1863631010055542, 1.2249094247817993, 1.2634556293487549, 1.3020018339157104, 1.3405481576919556, 1.3790943622589111, 1.4176406860351562, 1.4561867713928223, 1.4947329759597778, 1.533279299736023, 1.5718255043029785, 1.610371708869934, 1.6489180326461792, 1.6874642372131348, 1.7260105609893799, 1.7645567655563354]}, "gradients/encoder.encoder.layers.14.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 3.0, 4.0, 3.0, 3.0, 3.0, 15.0, 33.0, 56.0, 65.0, 108.0, 123.0, 150.0, 127.0, 107.0, 86.0, 45.0, 36.0, 13.0, 13.0, 3.0, 3.0, 5.0, 4.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 5.0], "bins": [-0.9069772362709045, -0.8871030807495117, -0.8672289252281189, -0.8473547697067261, -0.8274805545806885, -0.8076063990592957, -0.7877322435379028, -0.76785808801651, -0.7479839324951172, -0.7281097769737244, -0.7082356214523315, -0.6883614659309387, -0.6684873104095459, -0.6486130952835083, -0.6287389397621155, -0.6088647842407227, -0.5889906287193298, -0.569116473197937, -0.5492423176765442, -0.5293681621551514, -0.5094939470291138, -0.48961982131004333, -0.4697456359863281, -0.4498714804649353, -0.4299973249435425, -0.41012316942214966, -0.39024901390075684, -0.3703748285770416, -0.3505006730556488, -0.330626517534256, -0.31075233221054077, -0.29087817668914795, -0.2710040211677551, -0.2511298656463623, -0.2312556952238083, -0.21138152480125427, -0.19150736927986145, -0.17163321375846863, -0.1517590433359146, -0.1318848729133606, -0.11201071739196777, -0.09213655441999435, -0.07226239144802094, -0.052388228476047516, -0.0325140655040741, -0.012639902532100677, 0.007234260439872742, 0.027108430862426758, 0.04698258638381958, 0.066856749355793, 0.08673091232776642, 0.10660507529973984, 0.12647923827171326, 0.14635339379310608, 0.1662275642156601, 0.1861017346382141, 0.20597589015960693, 0.22585004568099976, 0.24572421610355377, 0.2655983865261078, 0.2854725420475006, 0.30534669756889343, 0.32522088289260864, 0.34509503841400146, 0.3649691939353943]}, "gradients/encoder.encoder.layers.14.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 3.0, 3.0, 3.0, 7.0, 5.0, 5.0, 5.0, 9.0, 21.0, 20.0, 39.0, 45.0, 54.0, 72.0, 122.0, 191.0, 265.0, 445.0, 781.0, 1480.0, 3692.0, 12103.0, 62781.0, 425486.0, 452147.0, 68457.0, 12816.0, 3739.0, 1587.0, 837.0, 462.0, 286.0, 182.0, 114.0, 97.0, 47.0, 35.0, 41.0, 18.0, 15.0, 13.0, 16.0, 4.0, 6.0, 1.0, 4.0, 1.0, 5.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.133544921875, -0.1292400360107422, -0.12493515014648438, -0.12063026428222656, -0.11632537841796875, -0.11202049255371094, -0.10771560668945312, -0.10341072082519531, -0.0991058349609375, -0.09480094909667969, -0.09049606323242188, -0.08619117736816406, -0.08188629150390625, -0.07758140563964844, -0.07327651977539062, -0.06897163391113281, -0.064666748046875, -0.06036186218261719, -0.056056976318359375, -0.05175209045410156, -0.04744720458984375, -0.04314231872558594, -0.038837432861328125, -0.03453254699707031, -0.0302276611328125, -0.025922775268554688, -0.021617889404296875, -0.017313003540039062, -0.01300811767578125, -0.008703231811523438, -0.004398345947265625, -9.34600830078125e-05, 0.00421142578125, 0.008516311645507812, 0.012821197509765625, 0.017126083374023438, 0.02143096923828125, 0.025735855102539062, 0.030040740966796875, 0.03434562683105469, 0.0386505126953125, 0.04295539855957031, 0.047260284423828125, 0.05156517028808594, 0.05587005615234375, 0.06017494201660156, 0.06447982788085938, 0.06878471374511719, 0.073089599609375, 0.07739448547363281, 0.08169937133789062, 0.08600425720214844, 0.09030914306640625, 0.09461402893066406, 0.09891891479492188, 0.10322380065917969, 0.1075286865234375, 0.11183357238769531, 0.11613845825195312, 0.12044334411621094, 0.12474822998046875, 0.12905311584472656, 0.13335800170898438, 0.1376628875732422, 0.1419677734375]}, "gradients/encoder.encoder.layers.14.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 4.0, 3.0, 1.0, 5.0, 8.0, 17.0, 25.0, 47.0, 58.0, 94.0, 97.0, 105.0, 104.0, 120.0, 93.0, 76.0, 61.0, 33.0, 24.0, 13.0, 7.0, 7.0, 2.0, 2.0, 3.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0], "bins": [-0.0997314453125, -0.09737300872802734, -0.09501457214355469, -0.09265613555908203, -0.09029769897460938, -0.08793926239013672, -0.08558082580566406, -0.0832223892211914, -0.08086395263671875, -0.0785055160522461, -0.07614707946777344, -0.07378864288330078, -0.07143020629882812, -0.06907176971435547, -0.06671333312988281, -0.06435489654541016, -0.0619964599609375, -0.059638023376464844, -0.05727958679199219, -0.05492115020751953, -0.052562713623046875, -0.05020427703857422, -0.04784584045410156, -0.045487403869628906, -0.04312896728515625, -0.040770530700683594, -0.03841209411621094, -0.03605365753173828, -0.033695220947265625, -0.03133678436279297, -0.028978347778320312, -0.026619911193847656, -0.024261474609375, -0.021903038024902344, -0.019544601440429688, -0.01718616485595703, -0.014827728271484375, -0.012469291687011719, -0.010110855102539062, -0.007752418518066406, -0.00539398193359375, -0.0030355453491210938, -0.0006771087646484375, 0.0016813278198242188, 0.004039764404296875, 0.006398200988769531, 0.008756637573242188, 0.011115074157714844, 0.0134735107421875, 0.015831947326660156, 0.018190383911132812, 0.02054882049560547, 0.022907257080078125, 0.02526569366455078, 0.027624130249023438, 0.029982566833496094, 0.03234100341796875, 0.034699440002441406, 0.03705787658691406, 0.03941631317138672, 0.041774749755859375, 0.04413318634033203, 0.04649162292480469, 0.048850059509277344, 0.05120849609375]}, "gradients/encoder.encoder.layers.14.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 3.0, 0.0, 1.0, 0.0, 3.0, 4.0, 1.0, 4.0, 3.0, 7.0, 7.0, 8.0, 9.0, 19.0, 14.0, 31.0, 33.0, 46.0, 70.0, 146.0, 299.0, 690.0, 2181.0, 9294.0, 59891.0, 468716.0, 440144.0, 54854.0, 8740.0, 1973.0, 659.0, 294.0, 145.0, 80.0, 54.0, 38.0, 22.0, 17.0, 8.0, 18.0, 11.0, 8.0, 5.0, 5.0, 3.0, 1.0, 3.0, 2.0, 2.0, 0.0, 3.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0], "bins": [-0.1304931640625, -0.12653636932373047, -0.12257957458496094, -0.1186227798461914, -0.11466598510742188, -0.11070919036865234, -0.10675239562988281, -0.10279560089111328, -0.09883880615234375, -0.09488201141357422, -0.09092521667480469, -0.08696842193603516, -0.08301162719726562, -0.0790548324584961, -0.07509803771972656, -0.07114124298095703, -0.0671844482421875, -0.06322765350341797, -0.05927085876464844, -0.055314064025878906, -0.051357269287109375, -0.047400474548339844, -0.04344367980957031, -0.03948688507080078, -0.03553009033203125, -0.03157329559326172, -0.027616500854492188, -0.023659706115722656, -0.019702911376953125, -0.015746116638183594, -0.011789321899414062, -0.007832527160644531, -0.003875732421875, 8.106231689453125e-05, 0.0040378570556640625, 0.007994651794433594, 0.011951446533203125, 0.015908241271972656, 0.019865036010742188, 0.02382183074951172, 0.02777862548828125, 0.03173542022705078, 0.03569221496582031, 0.039649009704589844, 0.043605804443359375, 0.047562599182128906, 0.05151939392089844, 0.05547618865966797, 0.0594329833984375, 0.06338977813720703, 0.06734657287597656, 0.0713033676147461, 0.07526016235351562, 0.07921695709228516, 0.08317375183105469, 0.08713054656982422, 0.09108734130859375, 0.09504413604736328, 0.09900093078613281, 0.10295772552490234, 0.10691452026367188, 0.1108713150024414, 0.11482810974121094, 0.11878490447998047, 0.12274169921875]}, "gradients/encoder.encoder.layers.14.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 0.0, 3.0, 1.0, 3.0, 3.0, 7.0, 5.0, 6.0, 5.0, 4.0, 8.0, 7.0, 16.0, 17.0, 17.0, 17.0, 27.0, 22.0, 29.0, 24.0, 34.0, 35.0, 29.0, 37.0, 45.0, 42.0, 43.0, 31.0, 42.0, 35.0, 49.0, 39.0, 35.0, 25.0, 24.0, 28.0, 30.0, 18.0, 16.0, 28.0, 14.0, 15.0, 12.0, 18.0, 17.0, 12.0, 9.0, 6.0, 6.0, 6.0, 5.0, 4.0, 0.0, 1.0, 4.0, 0.0, 2.0, 1.0], "bins": [-0.10675048828125, -0.10358428955078125, -0.1004180908203125, -0.09725189208984375, -0.094085693359375, -0.09091949462890625, -0.0877532958984375, -0.08458709716796875, -0.0814208984375, -0.07825469970703125, -0.0750885009765625, -0.07192230224609375, -0.068756103515625, -0.06558990478515625, -0.0624237060546875, -0.05925750732421875, -0.05609130859375, -0.05292510986328125, -0.0497589111328125, -0.04659271240234375, -0.043426513671875, -0.04026031494140625, -0.0370941162109375, -0.03392791748046875, -0.03076171875, -0.02759552001953125, -0.0244293212890625, -0.02126312255859375, -0.018096923828125, -0.01493072509765625, -0.0117645263671875, -0.00859832763671875, -0.00543212890625, -0.00226593017578125, 0.0009002685546875, 0.00406646728515625, 0.007232666015625, 0.01039886474609375, 0.0135650634765625, 0.01673126220703125, 0.0198974609375, 0.02306365966796875, 0.0262298583984375, 0.02939605712890625, 0.032562255859375, 0.03572845458984375, 0.0388946533203125, 0.04206085205078125, 0.04522705078125, 0.04839324951171875, 0.0515594482421875, 0.05472564697265625, 0.057891845703125, 0.06105804443359375, 0.0642242431640625, 0.06739044189453125, 0.070556640625, 0.07372283935546875, 0.0768890380859375, 0.08005523681640625, 0.083221435546875, 0.08638763427734375, 0.0895538330078125, 0.09272003173828125, 0.09588623046875]}, "gradients/encoder.encoder.layers.14.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 5.0, 2.0, 9.0, 4.0, 7.0, 11.0, 19.0, 27.0, 53.0, 76.0, 95.0, 223.0, 418.0, 935.0, 2735.0, 11028.0, 93259.0, 730031.0, 186029.0, 17532.0, 3698.0, 1237.0, 524.0, 245.0, 138.0, 73.0, 49.0, 34.0, 21.0, 17.0, 11.0, 8.0, 2.0, 3.0, 2.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.049835205078125, -0.04803133010864258, -0.046227455139160156, -0.044423580169677734, -0.04261970520019531, -0.04081583023071289, -0.03901195526123047, -0.03720808029174805, -0.035404205322265625, -0.0336003303527832, -0.03179645538330078, -0.02999258041381836, -0.028188705444335938, -0.026384830474853516, -0.024580955505371094, -0.022777080535888672, -0.02097320556640625, -0.019169330596923828, -0.017365455627441406, -0.015561580657958984, -0.013757705688476562, -0.01195383071899414, -0.010149955749511719, -0.008346080780029297, -0.006542205810546875, -0.004738330841064453, -0.0029344558715820312, -0.0011305809020996094, 0.0006732940673828125, 0.0024771690368652344, 0.004281044006347656, 0.006084918975830078, 0.0078887939453125, 0.009692668914794922, 0.011496543884277344, 0.013300418853759766, 0.015104293823242188, 0.01690816879272461, 0.01871204376220703, 0.020515918731689453, 0.022319793701171875, 0.024123668670654297, 0.02592754364013672, 0.02773141860961914, 0.029535293579101562, 0.031339168548583984, 0.033143043518066406, 0.03494691848754883, 0.03675079345703125, 0.03855466842651367, 0.040358543395996094, 0.042162418365478516, 0.04396629333496094, 0.04577016830444336, 0.04757404327392578, 0.0493779182434082, 0.051181793212890625, 0.05298566818237305, 0.05478954315185547, 0.05659341812133789, 0.05839729309082031, 0.060201168060302734, 0.062005043029785156, 0.06380891799926758, 0.06561279296875]}, "gradients/encoder.encoder.layers.14.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 3.0, 2.0, 2.0, 5.0, 4.0, 7.0, 11.0, 8.0, 6.0, 5.0, 27.0, 18.0, 15.0, 26.0, 25.0, 30.0, 59.0, 40.0, 53.0, 45.0, 52.0, 68.0, 45.0, 49.0, 36.0, 51.0, 48.0, 38.0, 34.0, 34.0, 24.0, 26.0, 26.0, 19.0, 17.0, 10.0, 8.0, 3.0, 6.0, 10.0, 5.0, 5.0, 3.0, 2.0, 1.0, 3.0, 1.0, 2.0], "bins": [-7.212162017822266e-06, -7.022172212600708e-06, -6.83218240737915e-06, -6.642192602157593e-06, -6.452202796936035e-06, -6.2622129917144775e-06, -6.07222318649292e-06, -5.882233381271362e-06, -5.692243576049805e-06, -5.502253770828247e-06, -5.3122639656066895e-06, -5.122274160385132e-06, -4.932284355163574e-06, -4.742294549942017e-06, -4.552304744720459e-06, -4.362314939498901e-06, -4.172325134277344e-06, -3.982335329055786e-06, -3.7923455238342285e-06, -3.602355718612671e-06, -3.4123659133911133e-06, -3.2223761081695557e-06, -3.032386302947998e-06, -2.8423964977264404e-06, -2.652406692504883e-06, -2.462416887283325e-06, -2.2724270820617676e-06, -2.08243727684021e-06, -1.8924474716186523e-06, -1.7024576663970947e-06, -1.5124678611755371e-06, -1.3224780559539795e-06, -1.1324882507324219e-06, -9.424984455108643e-07, -7.525086402893066e-07, -5.62518835067749e-07, -3.725290298461914e-07, -1.825392246246338e-07, 7.450580596923828e-09, 1.9744038581848145e-07, 3.8743019104003906e-07, 5.774199962615967e-07, 7.674098014831543e-07, 9.57399606704712e-07, 1.1473894119262695e-06, 1.3373792171478271e-06, 1.5273690223693848e-06, 1.7173588275909424e-06, 1.9073486328125e-06, 2.0973384380340576e-06, 2.2873282432556152e-06, 2.477318048477173e-06, 2.6673078536987305e-06, 2.857297658920288e-06, 3.0472874641418457e-06, 3.2372772693634033e-06, 3.427267074584961e-06, 3.6172568798065186e-06, 3.807246685028076e-06, 3.997236490249634e-06, 4.187226295471191e-06, 4.377216100692749e-06, 4.567205905914307e-06, 4.757195711135864e-06, 4.947185516357422e-06]}, "gradients/encoder.encoder.layers.14.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 2.0, 2.0, 2.0, 2.0, 3.0, 2.0, 3.0, 7.0, 13.0, 14.0, 28.0, 42.0, 55.0, 121.0, 255.0, 437.0, 995.0, 2721.0, 11601.0, 96792.0, 755483.0, 158756.0, 15631.0, 3309.0, 1155.0, 537.0, 243.0, 131.0, 88.0, 43.0, 30.0, 25.0, 1.0, 9.0, 5.0, 5.0, 4.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.055267333984375, -0.05326414108276367, -0.051260948181152344, -0.049257755279541016, -0.04725456237792969, -0.04525136947631836, -0.04324817657470703, -0.0412449836730957, -0.039241790771484375, -0.03723859786987305, -0.03523540496826172, -0.03323221206665039, -0.031229019165039062, -0.029225826263427734, -0.027222633361816406, -0.025219440460205078, -0.02321624755859375, -0.021213054656982422, -0.019209861755371094, -0.017206668853759766, -0.015203475952148438, -0.01320028305053711, -0.011197090148925781, -0.009193897247314453, -0.007190704345703125, -0.005187511444091797, -0.0031843185424804688, -0.0011811256408691406, 0.0008220672607421875, 0.0028252601623535156, 0.004828453063964844, 0.006831645965576172, 0.0088348388671875, 0.010838031768798828, 0.012841224670410156, 0.014844417572021484, 0.016847610473632812, 0.01885080337524414, 0.02085399627685547, 0.022857189178466797, 0.024860382080078125, 0.026863574981689453, 0.02886676788330078, 0.03086996078491211, 0.03287315368652344, 0.034876346588134766, 0.036879539489746094, 0.03888273239135742, 0.04088592529296875, 0.04288911819458008, 0.044892311096191406, 0.046895503997802734, 0.04889869689941406, 0.05090188980102539, 0.05290508270263672, 0.05490827560424805, 0.056911468505859375, 0.0589146614074707, 0.06091785430908203, 0.06292104721069336, 0.06492424011230469, 0.06692743301391602, 0.06893062591552734, 0.07093381881713867, 0.07293701171875]}, "gradients/encoder.encoder.layers.14.attention.q_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 7.0, 2.0, 7.0, 6.0, 6.0, 12.0, 15.0, 20.0, 26.0, 41.0, 43.0, 71.0, 82.0, 78.0, 95.0, 108.0, 83.0, 68.0, 48.0, 39.0, 38.0, 33.0, 18.0, 21.0, 8.0, 9.0, 4.0, 7.0, 1.0, 2.0, 4.0, 0.0, 3.0, 0.0, 1.0, 3.0, 2.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.04656982421875, -0.04525136947631836, -0.04393291473388672, -0.04261445999145508, -0.04129600524902344, -0.0399775505065918, -0.038659095764160156, -0.037340641021728516, -0.036022186279296875, -0.034703731536865234, -0.033385276794433594, -0.03206682205200195, -0.030748367309570312, -0.029429912567138672, -0.02811145782470703, -0.02679300308227539, -0.02547454833984375, -0.02415609359741211, -0.02283763885498047, -0.021519184112548828, -0.020200729370117188, -0.018882274627685547, -0.017563819885253906, -0.016245365142822266, -0.014926910400390625, -0.013608455657958984, -0.012290000915527344, -0.010971546173095703, -0.009653091430664062, -0.008334636688232422, -0.007016181945800781, -0.005697727203369141, -0.0043792724609375, -0.0030608177185058594, -0.0017423629760742188, -0.0004239082336425781, 0.0008945465087890625, 0.002213001251220703, 0.0035314559936523438, 0.004849910736083984, 0.006168365478515625, 0.007486820220947266, 0.008805274963378906, 0.010123729705810547, 0.011442184448242188, 0.012760639190673828, 0.014079093933105469, 0.01539754867553711, 0.01671600341796875, 0.01803445816040039, 0.01935291290283203, 0.020671367645263672, 0.021989822387695312, 0.023308277130126953, 0.024626731872558594, 0.025945186614990234, 0.027263641357421875, 0.028582096099853516, 0.029900550842285156, 0.031219005584716797, 0.03253746032714844, 0.03385591506958008, 0.03517436981201172, 0.03649282455444336, 0.037811279296875]}, "gradients/encoder.encoder.layers.14.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 4.0, 15.0, 39.0, 154.0, 301.0, 336.0, 126.0, 25.0, 8.0, 5.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-1.8469196557998657, -1.7996481657028198, -1.752376675605774, -1.705105185508728, -1.6578335762023926, -1.6105620861053467, -1.5632905960083008, -1.5160191059112549, -1.468747615814209, -1.421476125717163, -1.3742046356201172, -1.3269331455230713, -1.2796616554260254, -1.23239004611969, -1.185118556022644, -1.1378470659255981, -1.0905755758285522, -1.0433040857315063, -0.9960325956344604, -0.9487610459327698, -0.9014895558357239, -0.854218065738678, -0.8069465160369873, -0.7596750259399414, -0.7124035358428955, -0.6651320457458496, -0.6178605556488037, -0.570589005947113, -0.5233175158500671, -0.47604602575302124, -0.42877450585365295, -0.38150298595428467, -0.3342313766479492, -0.2869598865509033, -0.23968836665153503, -0.19241686165332794, -0.14514535665512085, -0.09787385165691376, -0.050602346658706665, -0.003330826759338379, 0.04394066333770752, 0.09121216833591461, 0.1384836733341217, 0.1857551783323288, 0.2330266833305359, 0.2802981734275818, 0.3275696933269501, 0.37484121322631836, 0.42211270332336426, 0.46938419342041016, 0.516655683517456, 0.5639272332191467, 0.6111987233161926, 0.6584702134132385, 0.7057417631149292, 0.7530132532119751, 0.800284743309021, 0.8475562334060669, 0.8948277235031128, 0.9420992732048035, 0.9893707633018494, 1.03664231300354, 1.083913803100586, 1.1311852931976318, 1.1784567832946777]}, "gradients/encoder.encoder.layers.14.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 2.0, 4.0, 0.0, 3.0, 6.0, 6.0, 7.0, 7.0, 7.0, 8.0, 13.0, 21.0, 16.0, 22.0, 32.0, 33.0, 41.0, 40.0, 46.0, 37.0, 49.0, 34.0, 58.0, 47.0, 39.0, 40.0, 45.0, 37.0, 30.0, 33.0, 33.0, 35.0, 33.0, 23.0, 20.0, 18.0, 9.0, 16.0, 9.0, 11.0, 11.0, 7.0, 3.0, 10.0, 3.0, 2.0, 0.0, 3.0, 3.0, 0.0, 1.0, 4.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.5267287492752075, -0.5095049738883972, -0.4922812581062317, -0.4750574827194214, -0.45783373713493347, -0.44060999155044556, -0.42338621616363525, -0.40616247057914734, -0.3889387249946594, -0.3717149794101715, -0.3544912338256836, -0.3372674584388733, -0.3200437128543854, -0.30281996726989746, -0.28559619188308716, -0.26837244629859924, -0.25114870071411133, -0.2339249551296234, -0.2167011946439743, -0.1994774341583252, -0.18225368857383728, -0.16502994298934937, -0.14780618250370026, -0.13058242201805115, -0.11335867643356323, -0.09613492339849472, -0.07891117036342621, -0.0616874173283577, -0.044463664293289185, -0.027239911258220673, -0.01001615822315216, 0.007207594811916351, 0.024431288242340088, 0.0416550412774086, 0.05887879431247711, 0.07610254734754562, 0.09332630038261414, 0.11055005341768265, 0.12777380645275116, 0.14499756693840027, 0.16222131252288818, 0.1794450581073761, 0.1966688185930252, 0.21389257907867432, 0.23111632466316223, 0.24834007024765015, 0.26556384563446045, 0.28278759121894836, 0.3000113368034363, 0.3172350823879242, 0.3344588279724121, 0.3516826033592224, 0.3689063489437103, 0.38613009452819824, 0.40335386991500854, 0.42057761549949646, 0.4378013610839844, 0.4550251066684723, 0.4722488522529602, 0.4894726276397705, 0.506696343421936, 0.5239201188087463, 0.5411438941955566, 0.5583676099777222, 0.5755913853645325]}, "gradients/encoder.encoder.layers.13.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 2.0, 1.0, 1.0, 2.0, 3.0, 6.0, 3.0, 1.0, 7.0, 13.0, 11.0, 13.0, 19.0, 29.0, 36.0, 47.0, 61.0, 96.0, 205.0, 391.0, 1039.0, 3865.0, 33127.0, 4124014.0, 25837.0, 3602.0, 1008.0, 412.0, 188.0, 111.0, 38.0, 37.0, 16.0, 22.0, 10.0, 9.0, 5.0, 2.0, 0.0, 2.0, 0.0, 3.0, 1.0, 0.0, 2.0], "bins": [-0.5849609375, -0.5711822509765625, -0.557403564453125, -0.5436248779296875, -0.52984619140625, -0.5160675048828125, -0.502288818359375, -0.4885101318359375, -0.4747314453125, -0.4609527587890625, -0.447174072265625, -0.4333953857421875, -0.41961669921875, -0.4058380126953125, -0.392059326171875, -0.3782806396484375, -0.364501953125, -0.3507232666015625, -0.336944580078125, -0.3231658935546875, -0.30938720703125, -0.2956085205078125, -0.281829833984375, -0.2680511474609375, -0.2542724609375, -0.2404937744140625, -0.226715087890625, -0.2129364013671875, -0.19915771484375, -0.1853790283203125, -0.171600341796875, -0.1578216552734375, -0.14404296875, -0.1302642822265625, -0.116485595703125, -0.1027069091796875, -0.08892822265625, -0.0751495361328125, -0.061370849609375, -0.0475921630859375, -0.0338134765625, -0.0200347900390625, -0.006256103515625, 0.0075225830078125, 0.02130126953125, 0.0350799560546875, 0.048858642578125, 0.0626373291015625, 0.076416015625, 0.0901947021484375, 0.103973388671875, 0.1177520751953125, 0.13153076171875, 0.1453094482421875, 0.159088134765625, 0.1728668212890625, 0.1866455078125, 0.2004241943359375, 0.214202880859375, 0.2279815673828125, 0.24176025390625, 0.2555389404296875, 0.269317626953125, 0.2830963134765625, 0.296875]}, "gradients/encoder.encoder.layers.13.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 2.0, 8.0, 12.0, 9.0, 15.0, 26.0, 52.0, 53.0, 74.0, 85.0, 102.0, 97.0, 117.0, 94.0, 72.0, 63.0, 34.0, 39.0, 24.0, 8.0, 7.0, 4.0, 3.0, 4.0, 2.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.09161376953125, -0.08940792083740234, -0.08720207214355469, -0.08499622344970703, -0.08279037475585938, -0.08058452606201172, -0.07837867736816406, -0.0761728286743164, -0.07396697998046875, -0.0717611312866211, -0.06955528259277344, -0.06734943389892578, -0.06514358520507812, -0.06293773651123047, -0.06073188781738281, -0.058526039123535156, -0.0563201904296875, -0.054114341735839844, -0.05190849304199219, -0.04970264434814453, -0.047496795654296875, -0.04529094696044922, -0.04308509826660156, -0.040879249572753906, -0.03867340087890625, -0.036467552185058594, -0.03426170349121094, -0.03205585479736328, -0.029850006103515625, -0.02764415740966797, -0.025438308715820312, -0.023232460021972656, -0.021026611328125, -0.018820762634277344, -0.016614913940429688, -0.014409065246582031, -0.012203216552734375, -0.009997367858886719, -0.0077915191650390625, -0.005585670471191406, -0.00337982177734375, -0.0011739730834960938, 0.0010318756103515625, 0.0032377243041992188, 0.005443572998046875, 0.007649421691894531, 0.009855270385742188, 0.012061119079589844, 0.0142669677734375, 0.016472816467285156, 0.018678665161132812, 0.02088451385498047, 0.023090362548828125, 0.02529621124267578, 0.027502059936523438, 0.029707908630371094, 0.03191375732421875, 0.034119606018066406, 0.03632545471191406, 0.03853130340576172, 0.040737152099609375, 0.04294300079345703, 0.04514884948730469, 0.047354698181152344, 0.049560546875]}, "gradients/encoder.encoder.layers.13.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 3.0, 13.0, 8.0, 14.0, 33.0, 36.0, 42.0, 88.0, 131.0, 314.0, 824.0, 3268.0, 21349.0, 4071321.0, 87886.0, 6502.0, 1475.0, 485.0, 203.0, 109.0, 68.0, 41.0, 33.0, 20.0, 13.0, 10.0, 7.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.4462890625, -0.4317588806152344, -0.41722869873046875, -0.4026985168457031, -0.3881683349609375, -0.3736381530761719, -0.35910797119140625, -0.3445777893066406, -0.330047607421875, -0.3155174255371094, -0.30098724365234375, -0.2864570617675781, -0.2719268798828125, -0.2573966979980469, -0.24286651611328125, -0.22833633422851562, -0.21380615234375, -0.19927597045898438, -0.18474578857421875, -0.17021560668945312, -0.1556854248046875, -0.14115524291992188, -0.12662506103515625, -0.11209487915039062, -0.097564697265625, -0.08303451538085938, -0.06850433349609375, -0.053974151611328125, -0.0394439697265625, -0.024913787841796875, -0.01038360595703125, 0.004146575927734375, 0.0186767578125, 0.033206939697265625, 0.04773712158203125, 0.062267303466796875, 0.0767974853515625, 0.09132766723632812, 0.10585784912109375, 0.12038803100585938, 0.134918212890625, 0.14944839477539062, 0.16397857666015625, 0.17850875854492188, 0.1930389404296875, 0.20756912231445312, 0.22209930419921875, 0.23662948608398438, 0.25115966796875, 0.2656898498535156, 0.28022003173828125, 0.2947502136230469, 0.3092803955078125, 0.3238105773925781, 0.33834075927734375, 0.3528709411621094, 0.367401123046875, 0.3819313049316406, 0.39646148681640625, 0.4109916687011719, 0.4255218505859375, 0.4400520324707031, 0.45458221435546875, 0.4691123962402344, 0.483642578125]}, "gradients/encoder.encoder.layers.13.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 5.0, 4.0, 5.0, 2.0, 11.0, 8.0, 14.0, 35.0, 70.0, 206.0, 2841.0, 646.0, 94.0, 53.0, 26.0, 19.0, 16.0, 8.0, 7.0, 2.0, 4.0, 5.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09722900390625, -0.09385013580322266, -0.09047126770019531, -0.08709239959716797, -0.08371353149414062, -0.08033466339111328, -0.07695579528808594, -0.0735769271850586, -0.07019805908203125, -0.0668191909790039, -0.06344032287597656, -0.06006145477294922, -0.056682586669921875, -0.05330371856689453, -0.04992485046386719, -0.046545982360839844, -0.0431671142578125, -0.039788246154785156, -0.03640937805175781, -0.03303050994873047, -0.029651641845703125, -0.02627277374267578, -0.022893905639648438, -0.019515037536621094, -0.01613616943359375, -0.012757301330566406, -0.009378433227539062, -0.005999565124511719, -0.002620697021484375, 0.0007581710815429688, 0.0041370391845703125, 0.007515907287597656, 0.010894775390625, 0.014273643493652344, 0.017652511596679688, 0.02103137969970703, 0.024410247802734375, 0.02778911590576172, 0.031167984008789062, 0.034546852111816406, 0.03792572021484375, 0.041304588317871094, 0.04468345642089844, 0.04806232452392578, 0.051441192626953125, 0.05482006072998047, 0.05819892883300781, 0.061577796936035156, 0.0649566650390625, 0.06833553314208984, 0.07171440124511719, 0.07509326934814453, 0.07847213745117188, 0.08185100555419922, 0.08522987365722656, 0.0886087417602539, 0.09198760986328125, 0.0953664779663086, 0.09874534606933594, 0.10212421417236328, 0.10550308227539062, 0.10888195037841797, 0.11226081848144531, 0.11563968658447266, 0.1190185546875]}, "gradients/encoder.encoder.layers.13.final_layer_norm.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 2.0, 2.0, 0.0, 1.0, 3.0, 3.0, 13.0, 35.0, 112.0, 349.0, 324.0, 105.0, 38.0, 12.0, 3.0, 4.0, 3.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.31402260065078735, -0.28955355286598206, -0.26508450508117676, -0.24061548709869385, -0.21614643931388855, -0.19167739152908325, -0.16720835864543915, -0.14273932576179504, -0.11827027797698975, -0.09380123764276505, -0.06933219730854034, -0.04486315697431564, -0.020394116640090942, 0.0040749236941337585, 0.02854396402835846, 0.053012996912002563, 0.07748204469680786, 0.10195108503103256, 0.12642012536525726, 0.15088915824890137, 0.17535820603370667, 0.19982725381851196, 0.22429628670215607, 0.24876531958580017, 0.27323436737060547, 0.29770341515541077, 0.32217246294021606, 0.346641480922699, 0.3711105287075043, 0.39557957649230957, 0.4200485944747925, 0.4445176422595978, 0.4689866304397583, 0.4934556782245636, 0.5179247260093689, 0.5423937439918518, 0.5668628215789795, 0.5913318395614624, 0.6158008575439453, 0.6402698755264282, 0.6647389531135559, 0.6892079710960388, 0.7136770486831665, 0.7381460666656494, 0.7626150846481323, 0.78708416223526, 0.8115531802177429, 0.8360222578048706, 0.8604912757873535, 0.8849602937698364, 0.9094293713569641, 0.933898389339447, 0.9583674669265747, 0.9828364849090576, 1.0073055028915405, 1.0317745208740234, 1.056243658065796, 1.0807126760482788, 1.1051816940307617, 1.1296508312225342, 1.154119849205017, 1.1785888671875, 1.203057885169983, 1.2275269031524658, 1.2519959211349487]}, "gradients/encoder.encoder.layers.13.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 3.0, 0.0, 1.0, 2.0, 0.0, 2.0, 3.0, 6.0, 12.0, 14.0, 16.0, 20.0, 22.0, 31.0, 50.0, 49.0, 82.0, 68.0, 67.0, 92.0, 91.0, 76.0, 61.0, 49.0, 55.0, 39.0, 30.0, 24.0, 20.0, 9.0, 10.0, 7.0, 2.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.1919897198677063, -0.18325240910053253, -0.17451509833335876, -0.165777787566185, -0.15704047679901123, -0.14830316603183746, -0.1395658552646637, -0.13082855939865112, -0.12209124118089676, -0.11335393041372299, -0.10461661964654922, -0.09587931632995605, -0.08714200556278229, -0.07840469479560852, -0.06966738402843475, -0.060930073261260986, -0.05219276249408722, -0.04345545172691345, -0.034718140959739685, -0.025980833917856216, -0.01724352315068245, -0.008506212383508682, 0.00023109465837478638, 0.008968405425548553, 0.01770571619272232, 0.026443026959896088, 0.035180337727069855, 0.04391764476895332, 0.05265495553612709, 0.06139226630330086, 0.07012957334518433, 0.0788668841123581, 0.08760419487953186, 0.09634150564670563, 0.1050788164138794, 0.11381612718105316, 0.12255343794822693, 0.1312907487154007, 0.14002805948257446, 0.14876535534858704, 0.157502681016922, 0.16623999178409576, 0.17497730255126953, 0.1837146133184433, 0.19245192408561707, 0.20118923485279083, 0.2099265456199646, 0.21866384148597717, 0.22740115225315094, 0.2361384630203247, 0.24487577378749847, 0.25361308455467224, 0.2623503804206848, 0.2710877060890198, 0.27982500195503235, 0.2885623276233673, 0.2972996234893799, 0.30603691935539246, 0.3147742450237274, 0.32351154088974, 0.33224886655807495, 0.3409861624240875, 0.3497234880924225, 0.35846078395843506, 0.36719810962677]}, "gradients/encoder.encoder.layers.13.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 3.0, 2.0, 0.0, 2.0, 6.0, 4.0, 4.0, 9.0, 10.0, 12.0, 11.0, 24.0, 27.0, 37.0, 57.0, 95.0, 124.0, 191.0, 323.0, 516.0, 1101.0, 2515.0, 8301.0, 41675.0, 322785.0, 567290.0, 82327.0, 14111.0, 3828.0, 1405.0, 670.0, 363.0, 231.0, 168.0, 98.0, 67.0, 48.0, 35.0, 30.0, 14.0, 9.0, 12.0, 15.0, 1.0, 6.0, 2.0, 4.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.182861328125, -0.17751121520996094, -0.17216110229492188, -0.1668109893798828, -0.16146087646484375, -0.1561107635498047, -0.15076065063476562, -0.14541053771972656, -0.1400604248046875, -0.13471031188964844, -0.12936019897460938, -0.12401008605957031, -0.11865997314453125, -0.11330986022949219, -0.10795974731445312, -0.10260963439941406, -0.097259521484375, -0.09190940856933594, -0.08655929565429688, -0.08120918273925781, -0.07585906982421875, -0.07050895690917969, -0.06515884399414062, -0.05980873107910156, -0.0544586181640625, -0.04910850524902344, -0.043758392333984375, -0.03840827941894531, -0.03305816650390625, -0.027708053588867188, -0.022357940673828125, -0.017007827758789062, -0.01165771484375, -0.0063076019287109375, -0.000957489013671875, 0.0043926239013671875, 0.00974273681640625, 0.015092849731445312, 0.020442962646484375, 0.025793075561523438, 0.0311431884765625, 0.03649330139160156, 0.041843414306640625, 0.04719352722167969, 0.05254364013671875, 0.05789375305175781, 0.06324386596679688, 0.06859397888183594, 0.073944091796875, 0.07929420471191406, 0.08464431762695312, 0.08999443054199219, 0.09534454345703125, 0.10069465637207031, 0.10604476928710938, 0.11139488220214844, 0.1167449951171875, 0.12209510803222656, 0.12744522094726562, 0.1327953338623047, 0.13814544677734375, 0.1434955596923828, 0.14884567260742188, 0.15419578552246094, 0.1595458984375]}, "gradients/encoder.encoder.layers.13.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 2.0, 1.0, 7.0, 5.0, 12.0, 25.0, 30.0, 49.0, 56.0, 75.0, 80.0, 102.0, 111.0, 109.0, 96.0, 79.0, 53.0, 47.0, 23.0, 19.0, 9.0, 8.0, 1.0, 6.0, 0.0, 5.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.0955810546875, -0.09327125549316406, -0.09096145629882812, -0.08865165710449219, -0.08634185791015625, -0.08403205871582031, -0.08172225952148438, -0.07941246032714844, -0.0771026611328125, -0.07479286193847656, -0.07248306274414062, -0.07017326354980469, -0.06786346435546875, -0.06555366516113281, -0.06324386596679688, -0.06093406677246094, -0.058624267578125, -0.05631446838378906, -0.054004669189453125, -0.05169486999511719, -0.04938507080078125, -0.04707527160644531, -0.044765472412109375, -0.04245567321777344, -0.0401458740234375, -0.03783607482910156, -0.035526275634765625, -0.03321647644042969, -0.03090667724609375, -0.028596878051757812, -0.026287078857421875, -0.023977279663085938, -0.02166748046875, -0.019357681274414062, -0.017047882080078125, -0.014738082885742188, -0.01242828369140625, -0.010118484497070312, -0.007808685302734375, -0.0054988861083984375, -0.0031890869140625, -0.0008792877197265625, 0.001430511474609375, 0.0037403106689453125, 0.00605010986328125, 0.008359909057617188, 0.010669708251953125, 0.012979507446289062, 0.015289306640625, 0.017599105834960938, 0.019908905029296875, 0.022218704223632812, 0.02452850341796875, 0.026838302612304688, 0.029148101806640625, 0.03145790100097656, 0.0337677001953125, 0.03607749938964844, 0.038387298583984375, 0.04069709777832031, 0.04300689697265625, 0.04531669616699219, 0.047626495361328125, 0.04993629455566406, 0.05224609375]}, "gradients/encoder.encoder.layers.13.attention.v_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 2.0, 2.0, 6.0, 8.0, 9.0, 8.0, 10.0, 21.0, 21.0, 41.0, 57.0, 72.0, 117.0, 220.0, 441.0, 1037.0, 2908.0, 11424.0, 62530.0, 455906.0, 438195.0, 59639.0, 11125.0, 2783.0, 970.0, 414.0, 211.0, 122.0, 76.0, 53.0, 41.0, 21.0, 11.0, 15.0, 13.0, 7.0, 3.0, 5.0, 3.0, 3.0, 4.0, 3.0, 3.0, 0.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.150634765625, -0.14644908905029297, -0.14226341247558594, -0.1380777359008789, -0.13389205932617188, -0.12970638275146484, -0.1255207061767578, -0.12133502960205078, -0.11714935302734375, -0.11296367645263672, -0.10877799987792969, -0.10459232330322266, -0.10040664672851562, -0.0962209701538086, -0.09203529357910156, -0.08784961700439453, -0.0836639404296875, -0.07947826385498047, -0.07529258728027344, -0.0711069107055664, -0.06692123413085938, -0.06273555755615234, -0.05854988098144531, -0.05436420440673828, -0.05017852783203125, -0.04599285125732422, -0.04180717468261719, -0.037621498107910156, -0.033435821533203125, -0.029250144958496094, -0.025064468383789062, -0.02087879180908203, -0.016693115234375, -0.012507438659667969, -0.008321762084960938, -0.004136085510253906, 4.9591064453125e-05, 0.004235267639160156, 0.008420944213867188, 0.012606620788574219, 0.01679229736328125, 0.02097797393798828, 0.025163650512695312, 0.029349327087402344, 0.033535003662109375, 0.037720680236816406, 0.04190635681152344, 0.04609203338623047, 0.0502777099609375, 0.05446338653564453, 0.05864906311035156, 0.0628347396850586, 0.06702041625976562, 0.07120609283447266, 0.07539176940917969, 0.07957744598388672, 0.08376312255859375, 0.08794879913330078, 0.09213447570800781, 0.09632015228271484, 0.10050582885742188, 0.1046915054321289, 0.10887718200683594, 0.11306285858154297, 0.11724853515625]}, "gradients/encoder.encoder.layers.13.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 5.0, 5.0, 2.0, 0.0, 3.0, 6.0, 9.0, 11.0, 11.0, 7.0, 14.0, 14.0, 11.0, 18.0, 19.0, 20.0, 28.0, 28.0, 34.0, 36.0, 38.0, 38.0, 29.0, 31.0, 42.0, 42.0, 55.0, 38.0, 39.0, 49.0, 26.0, 31.0, 41.0, 23.0, 36.0, 24.0, 22.0, 16.0, 19.0, 16.0, 16.0, 12.0, 14.0, 7.0, 7.0, 3.0, 6.0, 2.0, 5.0, 2.0, 2.0, 3.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-0.1060791015625, -0.10232353210449219, -0.09856796264648438, -0.09481239318847656, -0.09105682373046875, -0.08730125427246094, -0.08354568481445312, -0.07979011535644531, -0.0760345458984375, -0.07227897644042969, -0.06852340698242188, -0.06476783752441406, -0.06101226806640625, -0.05725669860839844, -0.053501129150390625, -0.04974555969238281, -0.045989990234375, -0.04223442077636719, -0.038478851318359375, -0.03472328186035156, -0.03096771240234375, -0.027212142944335938, -0.023456573486328125, -0.019701004028320312, -0.0159454345703125, -0.012189865112304688, -0.008434295654296875, -0.0046787261962890625, -0.00092315673828125, 0.0028324127197265625, 0.006587982177734375, 0.010343551635742188, 0.01409912109375, 0.017854690551757812, 0.021610260009765625, 0.025365829467773438, 0.02912139892578125, 0.03287696838378906, 0.036632537841796875, 0.04038810729980469, 0.0441436767578125, 0.04789924621582031, 0.051654815673828125, 0.05541038513183594, 0.05916595458984375, 0.06292152404785156, 0.06667709350585938, 0.07043266296386719, 0.074188232421875, 0.07794380187988281, 0.08169937133789062, 0.08545494079589844, 0.08921051025390625, 0.09296607971191406, 0.09672164916992188, 0.10047721862792969, 0.1042327880859375, 0.10798835754394531, 0.11174392700195312, 0.11549949645996094, 0.11925506591796875, 0.12301063537597656, 0.12676620483398438, 0.1305217742919922, 0.13427734375]}, "gradients/encoder.encoder.layers.13.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 1.0, 3.0, 3.0, 5.0, 4.0, 12.0, 16.0, 22.0, 39.0, 80.0, 98.0, 229.0, 563.0, 1557.0, 7882.0, 103805.0, 822123.0, 101557.0, 7943.0, 1567.0, 507.0, 252.0, 134.0, 71.0, 41.0, 20.0, 13.0, 7.0, 5.0, 3.0, 3.0, 0.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.051361083984375, -0.04885530471801758, -0.046349525451660156, -0.043843746185302734, -0.04133796691894531, -0.03883218765258789, -0.03632640838623047, -0.03382062911987305, -0.031314849853515625, -0.028809070587158203, -0.02630329132080078, -0.02379751205444336, -0.021291732788085938, -0.018785953521728516, -0.016280174255371094, -0.013774394989013672, -0.01126861572265625, -0.008762836456298828, -0.006257057189941406, -0.0037512779235839844, -0.0012454986572265625, 0.0012602806091308594, 0.0037660598754882812, 0.006271839141845703, 0.008777618408203125, 0.011283397674560547, 0.013789176940917969, 0.01629495620727539, 0.018800735473632812, 0.021306514739990234, 0.023812294006347656, 0.026318073272705078, 0.0288238525390625, 0.03132963180541992, 0.033835411071777344, 0.036341190338134766, 0.03884696960449219, 0.04135274887084961, 0.04385852813720703, 0.04636430740356445, 0.048870086669921875, 0.0513758659362793, 0.05388164520263672, 0.05638742446899414, 0.05889320373535156, 0.061398983001708984, 0.0639047622680664, 0.06641054153442383, 0.06891632080078125, 0.07142210006713867, 0.0739278793334961, 0.07643365859985352, 0.07893943786621094, 0.08144521713256836, 0.08395099639892578, 0.0864567756652832, 0.08896255493164062, 0.09146833419799805, 0.09397411346435547, 0.09647989273071289, 0.09898567199707031, 0.10149145126342773, 0.10399723052978516, 0.10650300979614258, 0.1090087890625]}, "gradients/encoder.encoder.layers.13.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 3.0, 2.0, 7.0, 5.0, 5.0, 9.0, 12.0, 20.0, 24.0, 56.0, 62.0, 62.0, 65.0, 94.0, 101.0, 114.0, 93.0, 72.0, 58.0, 44.0, 38.0, 20.0, 11.0, 18.0, 4.0, 4.0, 4.0, 3.0, 3.0, 1.0, 0.0, 1.0, 3.0], "bins": [-1.9073486328125e-05, -1.8654391169548035e-05, -1.823529601097107e-05, -1.7816200852394104e-05, -1.739710569381714e-05, -1.6978010535240173e-05, -1.6558915376663208e-05, -1.6139820218086243e-05, -1.5720725059509277e-05, -1.5301629900932312e-05, -1.4882534742355347e-05, -1.4463439583778381e-05, -1.4044344425201416e-05, -1.362524926662445e-05, -1.3206154108047485e-05, -1.278705894947052e-05, -1.2367963790893555e-05, -1.194886863231659e-05, -1.1529773473739624e-05, -1.1110678315162659e-05, -1.0691583156585693e-05, -1.0272487998008728e-05, -9.853392839431763e-06, -9.434297680854797e-06, -9.015202522277832e-06, -8.596107363700867e-06, -8.177012205123901e-06, -7.757917046546936e-06, -7.338821887969971e-06, -6.919726729393005e-06, -6.50063157081604e-06, -6.081536412239075e-06, -5.662441253662109e-06, -5.243346095085144e-06, -4.824250936508179e-06, -4.405155777931213e-06, -3.986060619354248e-06, -3.5669654607772827e-06, -3.1478703022003174e-06, -2.728775143623352e-06, -2.3096799850463867e-06, -1.8905848264694214e-06, -1.471489667892456e-06, -1.0523945093154907e-06, -6.332993507385254e-07, -2.1420419216156006e-07, 2.0489096641540527e-07, 6.239861249923706e-07, 1.043081283569336e-06, 1.4621764421463013e-06, 1.8812716007232666e-06, 2.300366759300232e-06, 2.7194619178771973e-06, 3.1385570764541626e-06, 3.557652235031128e-06, 3.976747393608093e-06, 4.395842552185059e-06, 4.814937710762024e-06, 5.234032869338989e-06, 5.653128027915955e-06, 6.07222318649292e-06, 6.491318345069885e-06, 6.910413503646851e-06, 7.329508662223816e-06, 7.748603820800781e-06]}, "gradients/encoder.encoder.layers.13.attention.q_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 3.0, 3.0, 2.0, 5.0, 3.0, 3.0, 5.0, 14.0, 21.0, 18.0, 28.0, 28.0, 44.0, 51.0, 78.0, 118.0, 269.0, 501.0, 1710.0, 8812.0, 99867.0, 788279.0, 134739.0, 10762.0, 2022.0, 562.0, 210.0, 112.0, 61.0, 51.0, 33.0, 30.0, 38.0, 23.0, 17.0, 8.0, 7.0, 7.0, 6.0, 3.0, 5.0, 3.0, 2.0, 1.0, 0.0, 2.0, 1.0, 1.0, 0.0, 2.0], "bins": [-0.09625244140625, -0.09354496002197266, -0.09083747863769531, -0.08812999725341797, -0.08542251586914062, -0.08271503448486328, -0.08000755310058594, -0.0773000717163086, -0.07459259033203125, -0.0718851089477539, -0.06917762756347656, -0.06647014617919922, -0.06376266479492188, -0.06105518341064453, -0.05834770202636719, -0.055640220642089844, -0.0529327392578125, -0.050225257873535156, -0.04751777648925781, -0.04481029510498047, -0.042102813720703125, -0.03939533233642578, -0.03668785095214844, -0.033980369567871094, -0.03127288818359375, -0.028565406799316406, -0.025857925415039062, -0.02315044403076172, -0.020442962646484375, -0.01773548126220703, -0.015027999877929688, -0.012320518493652344, -0.009613037109375, -0.006905555725097656, -0.0041980743408203125, -0.0014905929565429688, 0.001216888427734375, 0.003924369812011719, 0.0066318511962890625, 0.009339332580566406, 0.01204681396484375, 0.014754295349121094, 0.017461776733398438, 0.02016925811767578, 0.022876739501953125, 0.02558422088623047, 0.028291702270507812, 0.030999183654785156, 0.0337066650390625, 0.036414146423339844, 0.03912162780761719, 0.04182910919189453, 0.044536590576171875, 0.04724407196044922, 0.04995155334472656, 0.052659034729003906, 0.05536651611328125, 0.058073997497558594, 0.06078147888183594, 0.06348896026611328, 0.06619644165039062, 0.06890392303466797, 0.07161140441894531, 0.07431888580322266, 0.0770263671875]}, "gradients/encoder.encoder.layers.13.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 3.0, 1.0, 6.0, 8.0, 10.0, 15.0, 16.0, 22.0, 24.0, 28.0, 40.0, 55.0, 73.0, 69.0, 98.0, 96.0, 77.0, 70.0, 72.0, 52.0, 33.0, 41.0, 22.0, 18.0, 22.0, 7.0, 8.0, 9.0, 8.0, 4.0, 0.0, 2.0, 1.0, 4.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.053497314453125, -0.051906585693359375, -0.05031585693359375, -0.048725128173828125, -0.0471343994140625, -0.045543670654296875, -0.04395294189453125, -0.042362213134765625, -0.040771484375, -0.039180755615234375, -0.03759002685546875, -0.035999298095703125, -0.0344085693359375, -0.032817840576171875, -0.03122711181640625, -0.029636383056640625, -0.028045654296875, -0.026454925537109375, -0.02486419677734375, -0.023273468017578125, -0.0216827392578125, -0.020092010498046875, -0.01850128173828125, -0.016910552978515625, -0.01531982421875, -0.013729095458984375, -0.01213836669921875, -0.010547637939453125, -0.0089569091796875, -0.007366180419921875, -0.00577545166015625, -0.004184722900390625, -0.002593994140625, -0.001003265380859375, 0.00058746337890625, 0.002178192138671875, 0.0037689208984375, 0.005359649658203125, 0.00695037841796875, 0.008541107177734375, 0.0101318359375, 0.011722564697265625, 0.01331329345703125, 0.014904022216796875, 0.0164947509765625, 0.018085479736328125, 0.01967620849609375, 0.021266937255859375, 0.022857666015625, 0.024448394775390625, 0.02603912353515625, 0.027629852294921875, 0.0292205810546875, 0.030811309814453125, 0.03240203857421875, 0.033992767333984375, 0.03558349609375, 0.037174224853515625, 0.03876495361328125, 0.040355682373046875, 0.0419464111328125, 0.043537139892578125, 0.04512786865234375, 0.046718597412109375, 0.048309326171875]}, "gradients/encoder.encoder.layers.13.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 6.0, 8.0, 29.0, 26.0, 96.0, 175.0, 257.0, 232.0, 104.0, 51.0, 16.0, 8.0, 5.0, 3.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.8727773427963257, -1.8301141262054443, -1.7874507904052734, -1.744787573814392, -1.7021243572235107, -1.6594610214233398, -1.6167978048324585, -1.5741345882415771, -1.5314712524414062, -1.488808035850525, -1.446144700050354, -1.4034814834594727, -1.3608182668685913, -1.31815505027771, -1.275491714477539, -1.2328284978866577, -1.1901652812957764, -1.147502064704895, -1.1048387289047241, -1.0621755123138428, -1.0195122957229614, -0.9768490195274353, -0.9341857433319092, -0.8915225267410278, -0.8488592505455017, -0.8061959743499756, -0.7635327577590942, -0.7208694815635681, -0.678206205368042, -0.6355429887771606, -0.5928797125816345, -0.5502164363861084, -0.5075533390045166, -0.46489009261131287, -0.42222684621810913, -0.379563570022583, -0.3369003236293793, -0.29423707723617554, -0.2515738010406494, -0.20891055464744568, -0.16624730825424194, -0.12358405441045761, -0.08092080056667328, -0.03825753927230835, 0.004405707120895386, 0.04706895351409912, 0.08973222970962524, 0.13239547610282898, 0.17505872249603271, 0.21772196888923645, 0.2603852152824402, 0.3030484914779663, 0.34571173787117004, 0.3883749842643738, 0.4310382604598999, 0.47370150685310364, 0.5163647532463074, 0.5590280294418335, 0.6016912460327148, 0.644354522228241, 0.6870177984237671, 0.7296810150146484, 0.7723442912101746, 0.8150075674057007, 0.857670783996582]}, "gradients/encoder.encoder.layers.13.layer_norm.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0, 6.0, 3.0, 6.0, 3.0, 7.0, 11.0, 10.0, 12.0, 20.0, 21.0, 26.0, 33.0, 23.0, 34.0, 39.0, 35.0, 43.0, 44.0, 46.0, 49.0, 52.0, 41.0, 48.0, 40.0, 36.0, 27.0, 34.0, 24.0, 36.0, 27.0, 23.0, 29.0, 20.0, 15.0, 11.0, 18.0, 10.0, 6.0, 7.0, 7.0, 8.0, 5.0, 2.0, 3.0, 3.0, 7.0, 3.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6110549569129944, -0.5900056958198547, -0.5689563751220703, -0.5479071140289307, -0.526857852935791, -0.5058085918426514, -0.48475927114486694, -0.4637100100517273, -0.44266071915626526, -0.4216114282608032, -0.4005621671676636, -0.37951287627220154, -0.3584635853767395, -0.33741432428359985, -0.3163650333881378, -0.2953157424926758, -0.27426648139953613, -0.2532171905040741, -0.23216792941093445, -0.2111186385154724, -0.19006936252117157, -0.16902008652687073, -0.1479707956314087, -0.12692151963710785, -0.105872243642807, -0.08482296764850616, -0.06377368420362473, -0.042724400758743286, -0.021675124764442444, -0.0006258487701416016, 0.020423442125320435, 0.04147271811962128, 0.06252199411392212, 0.08357127010822296, 0.1046205535531044, 0.12566983699798584, 0.14671911299228668, 0.16776838898658752, 0.18881767988204956, 0.2098669558763504, 0.23091623187065125, 0.2519655227661133, 0.27301478385925293, 0.29406407475471497, 0.315113365650177, 0.33616262674331665, 0.3572119176387787, 0.3782612085342407, 0.39931046962738037, 0.4203597605228424, 0.44140902161598206, 0.4624583125114441, 0.48350757360458374, 0.5045568943023682, 0.5256061553955078, 0.5466554164886475, 0.5677046775817871, 0.5887539386749268, 0.6098032593727112, 0.6308525204658508, 0.6519017815589905, 0.6729511022567749, 0.6940003633499146, 0.7150496244430542, 0.7360989451408386]}, "gradients/encoder.encoder.layers.12.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 4.0, 2.0, 5.0, 11.0, 12.0, 20.0, 23.0, 50.0, 79.0, 128.0, 238.0, 563.0, 3044.0, 53623.0, 4123969.0, 10378.0, 1403.0, 400.0, 164.0, 85.0, 40.0, 20.0, 7.0, 6.0, 8.0, 3.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.72607421875, -0.708892822265625, -0.69171142578125, -0.674530029296875, -0.6573486328125, -0.640167236328125, -0.62298583984375, -0.605804443359375, -0.588623046875, -0.571441650390625, -0.55426025390625, -0.537078857421875, -0.5198974609375, -0.502716064453125, -0.48553466796875, -0.468353271484375, -0.451171875, -0.433990478515625, -0.41680908203125, -0.399627685546875, -0.3824462890625, -0.365264892578125, -0.34808349609375, -0.330902099609375, -0.313720703125, -0.296539306640625, -0.27935791015625, -0.262176513671875, -0.2449951171875, -0.227813720703125, -0.21063232421875, -0.193450927734375, -0.17626953125, -0.159088134765625, -0.14190673828125, -0.124725341796875, -0.1075439453125, -0.090362548828125, -0.07318115234375, -0.055999755859375, -0.038818359375, -0.021636962890625, -0.00445556640625, 0.012725830078125, 0.0299072265625, 0.047088623046875, 0.06427001953125, 0.081451416015625, 0.0986328125, 0.115814208984375, 0.13299560546875, 0.150177001953125, 0.1673583984375, 0.184539794921875, 0.20172119140625, 0.218902587890625, 0.236083984375, 0.253265380859375, 0.27044677734375, 0.287628173828125, 0.3048095703125, 0.321990966796875, 0.33917236328125, 0.356353759765625, 0.37353515625]}, "gradients/encoder.encoder.layers.12.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 3.0, 3.0, 4.0, 18.0, 23.0, 35.0, 56.0, 75.0, 75.0, 88.0, 95.0, 125.0, 92.0, 90.0, 63.0, 56.0, 35.0, 24.0, 19.0, 11.0, 5.0, 4.0, 6.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.0968017578125, -0.09450149536132812, -0.09220123291015625, -0.08990097045898438, -0.0876007080078125, -0.08530044555664062, -0.08300018310546875, -0.08069992065429688, -0.078399658203125, -0.07609939575195312, -0.07379913330078125, -0.07149887084960938, -0.0691986083984375, -0.06689834594726562, -0.06459808349609375, -0.062297821044921875, -0.05999755859375, -0.057697296142578125, -0.05539703369140625, -0.053096771240234375, -0.0507965087890625, -0.048496246337890625, -0.04619598388671875, -0.043895721435546875, -0.041595458984375, -0.039295196533203125, -0.03699493408203125, -0.034694671630859375, -0.0323944091796875, -0.030094146728515625, -0.02779388427734375, -0.025493621826171875, -0.023193359375, -0.020893096923828125, -0.01859283447265625, -0.016292572021484375, -0.0139923095703125, -0.011692047119140625, -0.00939178466796875, -0.007091522216796875, -0.004791259765625, -0.002490997314453125, -0.00019073486328125, 0.002109527587890625, 0.0044097900390625, 0.006710052490234375, 0.00901031494140625, 0.011310577392578125, 0.01361083984375, 0.015911102294921875, 0.01821136474609375, 0.020511627197265625, 0.0228118896484375, 0.025112152099609375, 0.02741241455078125, 0.029712677001953125, 0.032012939453125, 0.034313201904296875, 0.03661346435546875, 0.038913726806640625, 0.0412139892578125, 0.043514251708984375, 0.04581451416015625, 0.048114776611328125, 0.0504150390625]}, "gradients/encoder.encoder.layers.12.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 2.0, 2.0, 5.0, 6.0, 10.0, 8.0, 15.0, 25.0, 37.0, 67.0, 117.0, 166.0, 288.0, 471.0, 936.0, 1905.0, 4515.0, 15938.0, 312389.0, 3821712.0, 24947.0, 5844.0, 2327.0, 1107.0, 561.0, 348.0, 216.0, 111.0, 81.0, 54.0, 30.0, 25.0, 12.0, 5.0, 6.0, 5.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.313232421875, -0.3034820556640625, -0.293731689453125, -0.2839813232421875, -0.27423095703125, -0.2644805908203125, -0.254730224609375, -0.2449798583984375, -0.2352294921875, -0.2254791259765625, -0.215728759765625, -0.2059783935546875, -0.19622802734375, -0.1864776611328125, -0.176727294921875, -0.1669769287109375, -0.1572265625, -0.1474761962890625, -0.137725830078125, -0.1279754638671875, -0.11822509765625, -0.1084747314453125, -0.098724365234375, -0.0889739990234375, -0.0792236328125, -0.0694732666015625, -0.059722900390625, -0.0499725341796875, -0.04022216796875, -0.0304718017578125, -0.020721435546875, -0.0109710693359375, -0.001220703125, 0.0085296630859375, 0.018280029296875, 0.0280303955078125, 0.03778076171875, 0.0475311279296875, 0.057281494140625, 0.0670318603515625, 0.0767822265625, 0.0865325927734375, 0.096282958984375, 0.1060333251953125, 0.11578369140625, 0.1255340576171875, 0.135284423828125, 0.1450347900390625, 0.15478515625, 0.1645355224609375, 0.174285888671875, 0.1840362548828125, 0.19378662109375, 0.2035369873046875, 0.213287353515625, 0.2230377197265625, 0.2327880859375, 0.2425384521484375, 0.252288818359375, 0.2620391845703125, 0.27178955078125, 0.2815399169921875, 0.291290283203125, 0.3010406494140625, 0.310791015625]}, "gradients/encoder.encoder.layers.12.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 2.0, 3.0, 4.0, 2.0, 2.0, 4.0, 2.0, 7.0, 11.0, 9.0, 23.0, 40.0, 69.0, 227.0, 2899.0, 544.0, 103.0, 52.0, 27.0, 14.0, 7.0, 10.0, 3.0, 3.0, 4.0, 6.0, 3.0, 2.0, 0.0, 2.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.11248779296875, -0.10882568359375, -0.10516357421875, -0.10150146484375, -0.09783935546875, -0.09417724609375, -0.09051513671875, -0.08685302734375, -0.08319091796875, -0.07952880859375, -0.07586669921875, -0.07220458984375, -0.06854248046875, -0.06488037109375, -0.06121826171875, -0.05755615234375, -0.05389404296875, -0.05023193359375, -0.04656982421875, -0.04290771484375, -0.03924560546875, -0.03558349609375, -0.03192138671875, -0.02825927734375, -0.02459716796875, -0.02093505859375, -0.01727294921875, -0.01361083984375, -0.00994873046875, -0.00628662109375, -0.00262451171875, 0.00103759765625, 0.00469970703125, 0.00836181640625, 0.01202392578125, 0.01568603515625, 0.01934814453125, 0.02301025390625, 0.02667236328125, 0.03033447265625, 0.03399658203125, 0.03765869140625, 0.04132080078125, 0.04498291015625, 0.04864501953125, 0.05230712890625, 0.05596923828125, 0.05963134765625, 0.06329345703125, 0.06695556640625, 0.07061767578125, 0.07427978515625, 0.07794189453125, 0.08160400390625, 0.08526611328125, 0.08892822265625, 0.09259033203125, 0.09625244140625, 0.09991455078125, 0.10357666015625, 0.10723876953125, 0.11090087890625, 0.11456298828125, 0.11822509765625, 0.12188720703125]}, "gradients/encoder.encoder.layers.12.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 0.0, 2.0, 3.0, 2.0, 11.0, 33.0, 82.0, 264.0, 384.0, 164.0, 36.0, 14.0, 5.0, 4.0, 1.0, 1.0, 4.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6126707792282104, -0.5808740258216858, -0.5490772128105164, -0.5172804594039917, -0.48548367619514465, -0.4536868929862976, -0.42189013957977295, -0.3900933563709259, -0.35829657316207886, -0.3264997899532318, -0.29470300674438477, -0.2629062533378601, -0.23110947012901306, -0.19931268692016602, -0.16751591861248016, -0.1357191503047943, -0.10392236709594727, -0.07212559133768082, -0.04032881557941437, -0.008532039821147919, 0.02326473593711853, 0.055061519145965576, 0.08685828745365143, 0.11865505576133728, 0.15045183897018433, 0.18224862217903137, 0.21404539048671722, 0.24584215879440308, 0.2776389420032501, 0.30943572521209717, 0.3412324786186218, 0.37302926182746887, 0.40482592582702637, 0.4366227090358734, 0.46841949224472046, 0.5002162456512451, 0.5320130586624146, 0.5638098120689392, 0.5956065654754639, 0.6274033784866333, 0.659200131893158, 0.6909968852996826, 0.722793698310852, 0.7545904517173767, 0.7863872051239014, 0.8181840181350708, 0.8499807715415955, 0.8817775249481201, 0.9135743379592896, 0.9453710913658142, 0.9771679043769836, 1.0089646577835083, 1.0407614707946777, 1.0725581645965576, 1.104354977607727, 1.1361517906188965, 1.1679484844207764, 1.1997452974319458, 1.2315419912338257, 1.2633388042449951, 1.2951356172561646, 1.326932430267334, 1.3587291240692139, 1.3905259370803833, 1.4223227500915527]}, "gradients/encoder.encoder.layers.12.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 1.0, 0.0, 2.0, 6.0, 1.0, 6.0, 11.0, 23.0, 30.0, 49.0, 62.0, 82.0, 88.0, 101.0, 107.0, 100.0, 74.0, 81.0, 71.0, 49.0, 24.0, 13.0, 9.0, 10.0, 6.0, 3.0, 2.0, 2.0, 0.0, 1.0, 1.0, 0.0, 2.0, 1.0], "bins": [-0.6076111197471619, -0.5939531326293945, -0.580295205116272, -0.5666372179985046, -0.5529792308807373, -0.5393213033676147, -0.5256633162498474, -0.5120053291320801, -0.49834737181663513, -0.4846894145011902, -0.47103142738342285, -0.4573734700679779, -0.44371551275253296, -0.4300575256347656, -0.4163995683193207, -0.40274161100387573, -0.3890836238861084, -0.37542566657066345, -0.3617676794528961, -0.34810972213745117, -0.33445173501968384, -0.3207937777042389, -0.30713582038879395, -0.2934778332710266, -0.27981987595558167, -0.2661619186401367, -0.2525039315223694, -0.23884597420692444, -0.2251880019903183, -0.21153002977371216, -0.1978720724582672, -0.18421410024166107, -0.17055612802505493, -0.1568981558084488, -0.14324018359184265, -0.1295822262763977, -0.11592425405979156, -0.10226628184318542, -0.08860831707715988, -0.07495035231113434, -0.0612923800945282, -0.04763441160321236, -0.033976443111896515, -0.020318474620580673, -0.0066605061292648315, 0.006997466087341309, 0.020655430853366852, 0.034313395619392395, 0.047971367835998535, 0.06162933632731438, 0.07528730481863022, 0.08894526958465576, 0.1026032418012619, 0.11626121401786804, 0.129919171333313, 0.14357714354991913, 0.15723511576652527, 0.1708930879831314, 0.18455106019973755, 0.1982090175151825, 0.21186698973178864, 0.22552496194839478, 0.23918291926383972, 0.25284087657928467, 0.266498863697052]}, "gradients/encoder.encoder.layers.12.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 4.0, 0.0, 0.0, 5.0, 1.0, 7.0, 4.0, 9.0, 10.0, 12.0, 15.0, 26.0, 37.0, 55.0, 77.0, 100.0, 179.0, 291.0, 542.0, 1193.0, 3166.0, 12695.0, 106800.0, 803124.0, 102413.0, 12149.0, 3096.0, 1179.0, 527.0, 305.0, 173.0, 106.0, 74.0, 56.0, 35.0, 24.0, 22.0, 14.0, 13.0, 5.0, 7.0, 2.0, 5.0, 4.0, 2.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.2548828125, -0.24727249145507812, -0.23966217041015625, -0.23205184936523438, -0.2244415283203125, -0.21683120727539062, -0.20922088623046875, -0.20161056518554688, -0.194000244140625, -0.18638992309570312, -0.17877960205078125, -0.17116928100585938, -0.1635589599609375, -0.15594863891601562, -0.14833831787109375, -0.14072799682617188, -0.13311767578125, -0.12550735473632812, -0.11789703369140625, -0.11028671264648438, -0.1026763916015625, -0.09506607055664062, -0.08745574951171875, -0.07984542846679688, -0.072235107421875, -0.06462478637695312, -0.05701446533203125, -0.049404144287109375, -0.0417938232421875, -0.034183502197265625, -0.02657318115234375, -0.018962860107421875, -0.0113525390625, -0.003742218017578125, 0.00386810302734375, 0.011478424072265625, 0.0190887451171875, 0.026699066162109375, 0.03430938720703125, 0.041919708251953125, 0.049530029296875, 0.057140350341796875, 0.06475067138671875, 0.07236099243164062, 0.0799713134765625, 0.08758163452148438, 0.09519195556640625, 0.10280227661132812, 0.11041259765625, 0.11802291870117188, 0.12563323974609375, 0.13324356079101562, 0.1408538818359375, 0.14846420288085938, 0.15607452392578125, 0.16368484497070312, 0.171295166015625, 0.17890548706054688, 0.18651580810546875, 0.19412612915039062, 0.2017364501953125, 0.20934677124023438, 0.21695709228515625, 0.22456741333007812, 0.232177734375]}, "gradients/encoder.encoder.layers.12.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 1.0, 4.0, 7.0, 11.0, 14.0, 36.0, 58.0, 67.0, 92.0, 95.0, 104.0, 131.0, 94.0, 86.0, 73.0, 46.0, 29.0, 18.0, 20.0, 11.0, 3.0, 6.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0], "bins": [-0.10955810546875, -0.10701227188110352, -0.10446643829345703, -0.10192060470581055, -0.09937477111816406, -0.09682893753051758, -0.0942831039428711, -0.09173727035522461, -0.08919143676757812, -0.08664560317993164, -0.08409976959228516, -0.08155393600463867, -0.07900810241699219, -0.0764622688293457, -0.07391643524169922, -0.07137060165405273, -0.06882476806640625, -0.06627893447875977, -0.06373310089111328, -0.0611872673034668, -0.05864143371582031, -0.05609560012817383, -0.053549766540527344, -0.05100393295288086, -0.048458099365234375, -0.04591226577758789, -0.043366432189941406, -0.04082059860229492, -0.03827476501464844, -0.03572893142700195, -0.03318309783935547, -0.030637264251708984, -0.0280914306640625, -0.025545597076416016, -0.02299976348876953, -0.020453929901123047, -0.017908096313476562, -0.015362262725830078, -0.012816429138183594, -0.01027059555053711, -0.007724761962890625, -0.005178928375244141, -0.0026330947875976562, -8.726119995117188e-05, 0.0024585723876953125, 0.005004405975341797, 0.007550239562988281, 0.010096073150634766, 0.01264190673828125, 0.015187740325927734, 0.01773357391357422, 0.020279407501220703, 0.022825241088867188, 0.025371074676513672, 0.027916908264160156, 0.03046274185180664, 0.033008575439453125, 0.03555440902709961, 0.038100242614746094, 0.04064607620239258, 0.04319190979003906, 0.04573774337768555, 0.04828357696533203, 0.050829410552978516, 0.053375244140625]}, "gradients/encoder.encoder.layers.12.attention.v_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 7.0, 4.0, 3.0, 13.0, 8.0, 20.0, 28.0, 51.0, 93.0, 137.0, 277.0, 542.0, 1375.0, 4000.0, 18642.0, 224283.0, 730598.0, 56302.0, 8199.0, 2273.0, 807.0, 415.0, 221.0, 99.0, 58.0, 36.0, 27.0, 11.0, 11.0, 6.0, 6.0, 5.0, 4.0, 1.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.16455078125, -0.15892601013183594, -0.15330123901367188, -0.1476764678955078, -0.14205169677734375, -0.1364269256591797, -0.13080215454101562, -0.12517738342285156, -0.1195526123046875, -0.11392784118652344, -0.10830307006835938, -0.10267829895019531, -0.09705352783203125, -0.09142875671386719, -0.08580398559570312, -0.08017921447753906, -0.074554443359375, -0.06892967224121094, -0.06330490112304688, -0.05768013000488281, -0.05205535888671875, -0.04643058776855469, -0.040805816650390625, -0.03518104553222656, -0.0295562744140625, -0.023931503295898438, -0.018306732177734375, -0.012681961059570312, -0.00705718994140625, -0.0014324188232421875, 0.004192352294921875, 0.009817123413085938, 0.01544189453125, 0.021066665649414062, 0.026691436767578125, 0.03231620788574219, 0.03794097900390625, 0.04356575012207031, 0.049190521240234375, 0.05481529235839844, 0.0604400634765625, 0.06606483459472656, 0.07168960571289062, 0.07731437683105469, 0.08293914794921875, 0.08856391906738281, 0.09418869018554688, 0.09981346130371094, 0.105438232421875, 0.11106300354003906, 0.11668777465820312, 0.12231254577636719, 0.12793731689453125, 0.1335620880126953, 0.13918685913085938, 0.14481163024902344, 0.1504364013671875, 0.15606117248535156, 0.16168594360351562, 0.1673107147216797, 0.17293548583984375, 0.1785602569580078, 0.18418502807617188, 0.18980979919433594, 0.1954345703125]}, "gradients/encoder.encoder.layers.12.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 3.0, 3.0, 1.0, 4.0, 2.0, 5.0, 6.0, 7.0, 6.0, 10.0, 17.0, 21.0, 26.0, 25.0, 16.0, 24.0, 36.0, 29.0, 35.0, 39.0, 35.0, 51.0, 36.0, 51.0, 42.0, 54.0, 49.0, 50.0, 42.0, 35.0, 45.0, 32.0, 24.0, 23.0, 23.0, 24.0, 19.0, 7.0, 7.0, 12.0, 9.0, 12.0, 5.0, 4.0, 3.0, 3.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.125732421875, -0.12141799926757812, -0.11710357666015625, -0.11278915405273438, -0.1084747314453125, -0.10416030883789062, -0.09984588623046875, -0.09553146362304688, -0.091217041015625, -0.08690261840820312, -0.08258819580078125, -0.07827377319335938, -0.0739593505859375, -0.06964492797851562, -0.06533050537109375, -0.061016082763671875, -0.05670166015625, -0.052387237548828125, -0.04807281494140625, -0.043758392333984375, -0.0394439697265625, -0.035129547119140625, -0.03081512451171875, -0.026500701904296875, -0.022186279296875, -0.017871856689453125, -0.01355743408203125, -0.009243011474609375, -0.0049285888671875, -0.000614166259765625, 0.00370025634765625, 0.008014678955078125, 0.0123291015625, 0.016643524169921875, 0.02095794677734375, 0.025272369384765625, 0.0295867919921875, 0.033901214599609375, 0.03821563720703125, 0.042530059814453125, 0.046844482421875, 0.051158905029296875, 0.05547332763671875, 0.059787750244140625, 0.0641021728515625, 0.06841659545898438, 0.07273101806640625, 0.07704544067382812, 0.08135986328125, 0.08567428588867188, 0.08998870849609375, 0.09430313110351562, 0.0986175537109375, 0.10293197631835938, 0.10724639892578125, 0.11156082153320312, 0.115875244140625, 0.12018966674804688, 0.12450408935546875, 0.12881851196289062, 0.1331329345703125, 0.13744735717773438, 0.14176177978515625, 0.14607620239257812, 0.150390625]}, "gradients/encoder.encoder.layers.12.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 5.0, 0.0, 3.0, 2.0, 2.0, 0.0, 2.0, 11.0, 8.0, 17.0, 21.0, 38.0, 55.0, 86.0, 123.0, 220.0, 424.0, 860.0, 2264.0, 6451.0, 24510.0, 119851.0, 572298.0, 258673.0, 45550.0, 10997.0, 3468.0, 1286.0, 590.0, 298.0, 160.0, 104.0, 86.0, 43.0, 14.0, 13.0, 8.0, 10.0, 3.0, 6.0, 1.0, 3.0, 2.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.03741455078125, -0.036196231842041016, -0.03497791290283203, -0.03375959396362305, -0.03254127502441406, -0.03132295608520508, -0.030104637145996094, -0.02888631820678711, -0.027667999267578125, -0.02644968032836914, -0.025231361389160156, -0.024013042449951172, -0.022794723510742188, -0.021576404571533203, -0.02035808563232422, -0.019139766693115234, -0.01792144775390625, -0.016703128814697266, -0.015484809875488281, -0.014266490936279297, -0.013048171997070312, -0.011829853057861328, -0.010611534118652344, -0.00939321517944336, -0.008174896240234375, -0.006956577301025391, -0.005738258361816406, -0.004519939422607422, -0.0033016204833984375, -0.002083301544189453, -0.0008649826049804688, 0.0003533363342285156, 0.0015716552734375, 0.0027899742126464844, 0.004008293151855469, 0.005226612091064453, 0.0064449310302734375, 0.007663249969482422, 0.008881568908691406, 0.01009988784790039, 0.011318206787109375, 0.01253652572631836, 0.013754844665527344, 0.014973163604736328, 0.016191482543945312, 0.017409801483154297, 0.01862812042236328, 0.019846439361572266, 0.02106475830078125, 0.022283077239990234, 0.02350139617919922, 0.024719715118408203, 0.025938034057617188, 0.027156352996826172, 0.028374671936035156, 0.02959299087524414, 0.030811309814453125, 0.03202962875366211, 0.033247947692871094, 0.03446626663208008, 0.03568458557128906, 0.03690290451049805, 0.03812122344970703, 0.039339542388916016, 0.040557861328125]}, "gradients/encoder.encoder.layers.12.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 1.0, 2.0, 2.0, 7.0, 4.0, 14.0, 24.0, 27.0, 49.0, 66.0, 79.0, 104.0, 99.0, 130.0, 102.0, 72.0, 78.0, 53.0, 27.0, 22.0, 15.0, 16.0, 6.0, 7.0, 4.0, 2.0, 1.0, 0.0, 3.0], "bins": [-2.181529998779297e-05, -2.1360814571380615e-05, -2.0906329154968262e-05, -2.0451843738555908e-05, -1.9997358322143555e-05, -1.95428729057312e-05, -1.9088387489318848e-05, -1.8633902072906494e-05, -1.817941665649414e-05, -1.7724931240081787e-05, -1.7270445823669434e-05, -1.681596040725708e-05, -1.6361474990844727e-05, -1.5906989574432373e-05, -1.545250415802002e-05, -1.4998018741607666e-05, -1.4543533325195312e-05, -1.4089047908782959e-05, -1.3634562492370605e-05, -1.3180077075958252e-05, -1.2725591659545898e-05, -1.2271106243133545e-05, -1.1816620826721191e-05, -1.1362135410308838e-05, -1.0907649993896484e-05, -1.0453164577484131e-05, -9.998679161071777e-06, -9.544193744659424e-06, -9.08970832824707e-06, -8.635222911834717e-06, -8.180737495422363e-06, -7.72625207901001e-06, -7.271766662597656e-06, -6.817281246185303e-06, -6.362795829772949e-06, -5.908310413360596e-06, -5.453824996948242e-06, -4.999339580535889e-06, -4.544854164123535e-06, -4.090368747711182e-06, -3.635883331298828e-06, -3.1813979148864746e-06, -2.726912498474121e-06, -2.2724270820617676e-06, -1.817941665649414e-06, -1.3634562492370605e-06, -9.08970832824707e-07, -4.544854164123535e-07, 0.0, 4.544854164123535e-07, 9.08970832824707e-07, 1.3634562492370605e-06, 1.817941665649414e-06, 2.2724270820617676e-06, 2.726912498474121e-06, 3.1813979148864746e-06, 3.635883331298828e-06, 4.090368747711182e-06, 4.544854164123535e-06, 4.999339580535889e-06, 5.453824996948242e-06, 5.908310413360596e-06, 6.362795829772949e-06, 6.817281246185303e-06, 7.271766662597656e-06]}, "gradients/encoder.encoder.layers.12.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 4.0, 2.0, 8.0, 3.0, 8.0, 13.0, 20.0, 14.0, 32.0, 78.0, 129.0, 254.0, 555.0, 1280.0, 3296.0, 10152.0, 40380.0, 232205.0, 591124.0, 131914.0, 25555.0, 7180.0, 2454.0, 1016.0, 413.0, 222.0, 96.0, 55.0, 28.0, 18.0, 15.0, 9.0, 6.0, 7.0, 4.0, 5.0, 1.0, 4.0, 2.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.043731689453125, -0.042290687561035156, -0.04084968566894531, -0.03940868377685547, -0.037967681884765625, -0.03652667999267578, -0.03508567810058594, -0.033644676208496094, -0.03220367431640625, -0.030762672424316406, -0.029321670532226562, -0.02788066864013672, -0.026439666748046875, -0.02499866485595703, -0.023557662963867188, -0.022116661071777344, -0.0206756591796875, -0.019234657287597656, -0.017793655395507812, -0.01635265350341797, -0.014911651611328125, -0.013470649719238281, -0.012029647827148438, -0.010588645935058594, -0.00914764404296875, -0.007706642150878906, -0.0062656402587890625, -0.004824638366699219, -0.003383636474609375, -0.0019426345825195312, -0.0005016326904296875, 0.0009393692016601562, 0.00238037109375, 0.0038213729858398438, 0.0052623748779296875, 0.006703376770019531, 0.008144378662109375, 0.009585380554199219, 0.011026382446289062, 0.012467384338378906, 0.01390838623046875, 0.015349388122558594, 0.016790390014648438, 0.01823139190673828, 0.019672393798828125, 0.02111339569091797, 0.022554397583007812, 0.023995399475097656, 0.0254364013671875, 0.026877403259277344, 0.028318405151367188, 0.02975940704345703, 0.031200408935546875, 0.03264141082763672, 0.03408241271972656, 0.035523414611816406, 0.03696441650390625, 0.038405418395996094, 0.03984642028808594, 0.04128742218017578, 0.042728424072265625, 0.04416942596435547, 0.04561042785644531, 0.047051429748535156, 0.048492431640625]}, "gradients/encoder.encoder.layers.12.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 2.0, 1.0, 1.0, 1.0, 1.0, 4.0, 5.0, 4.0, 6.0, 8.0, 8.0, 11.0, 15.0, 16.0, 26.0, 18.0, 24.0, 25.0, 42.0, 47.0, 55.0, 65.0, 60.0, 73.0, 71.0, 60.0, 61.0, 49.0, 36.0, 35.0, 47.0, 24.0, 29.0, 8.0, 15.0, 17.0, 7.0, 10.0, 7.0, 8.0, 2.0, 2.0, 1.0, 0.0, 3.0, 3.0, 0.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.029449462890625, -0.02852487564086914, -0.02760028839111328, -0.026675701141357422, -0.025751113891601562, -0.024826526641845703, -0.023901939392089844, -0.022977352142333984, -0.022052764892578125, -0.021128177642822266, -0.020203590393066406, -0.019279003143310547, -0.018354415893554688, -0.017429828643798828, -0.01650524139404297, -0.01558065414428711, -0.01465606689453125, -0.01373147964477539, -0.012806892395019531, -0.011882305145263672, -0.010957717895507812, -0.010033130645751953, -0.009108543395996094, -0.008183956146240234, -0.007259368896484375, -0.006334781646728516, -0.005410194396972656, -0.004485607147216797, -0.0035610198974609375, -0.002636432647705078, -0.0017118453979492188, -0.0007872581481933594, 0.0001373291015625, 0.0010619163513183594, 0.0019865036010742188, 0.002911090850830078, 0.0038356781005859375, 0.004760265350341797, 0.005684852600097656, 0.006609439849853516, 0.007534027099609375, 0.008458614349365234, 0.009383201599121094, 0.010307788848876953, 0.011232376098632812, 0.012156963348388672, 0.013081550598144531, 0.01400613784790039, 0.01493072509765625, 0.01585531234741211, 0.01677989959716797, 0.017704486846923828, 0.018629074096679688, 0.019553661346435547, 0.020478248596191406, 0.021402835845947266, 0.022327423095703125, 0.023252010345458984, 0.024176597595214844, 0.025101184844970703, 0.026025772094726562, 0.026950359344482422, 0.02787494659423828, 0.02879953384399414, 0.02972412109375]}, "gradients/encoder.encoder.layers.12.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 0.0, 1.0, 6.0, 3.0, 10.0, 29.0, 86.0, 199.0, 314.0, 242.0, 69.0, 33.0, 13.0, 5.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-2.5178558826446533, -2.4670417308807373, -2.416227340698242, -2.365413188934326, -2.31459903717041, -2.263784646987915, -2.212970495223999, -2.162156105041504, -2.111341953277588, -2.060527801513672, -2.0097134113311768, -1.9588992595672607, -1.9080849885940552, -1.8572707176208496, -1.8064565658569336, -1.755642294883728, -1.7048280239105225, -1.654013752937317, -1.6031994819641113, -1.5523853302001953, -1.5015710592269897, -1.4507567882537842, -1.3999426364898682, -1.3491283655166626, -1.298314094543457, -1.2474998235702515, -1.196685552597046, -1.1458714008331299, -1.0950571298599243, -1.0442428588867188, -0.993428647518158, -0.9426144361495972, -0.8918001651763916, -0.840985894203186, -0.7901716828346252, -0.7393574714660645, -0.6885432004928589, -0.6377289295196533, -0.5869147181510925, -0.5361005067825317, -0.48528623580932617, -0.434471994638443, -0.3836577534675598, -0.33284351229667664, -0.28202927112579346, -0.23121502995491028, -0.1804007887840271, -0.12958654761314392, -0.07877230644226074, -0.027958065271377563, 0.022856175899505615, 0.0736704170703888, 0.12448465824127197, 0.17529889941215515, 0.22611314058303833, 0.2769273817539215, 0.3277416229248047, 0.37855586409568787, 0.42937010526657104, 0.4801843464374542, 0.5309985876083374, 0.581812858581543, 0.6326270699501038, 0.6834412813186646, 0.7342555522918701]}, "gradients/encoder.encoder.layers.12.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 2.0, 3.0, 1.0, 2.0, 4.0, 7.0, 12.0, 11.0, 14.0, 14.0, 8.0, 22.0, 22.0, 25.0, 24.0, 24.0, 20.0, 34.0, 22.0, 26.0, 28.0, 37.0, 31.0, 39.0, 33.0, 39.0, 46.0, 47.0, 30.0, 37.0, 34.0, 42.0, 26.0, 22.0, 32.0, 25.0, 19.0, 34.0, 11.0, 15.0, 13.0, 14.0, 12.0, 11.0, 6.0, 5.0, 10.0, 5.0, 4.0, 6.0, 0.0, 2.0, 1.0, 1.0, 0.0, 1.0, 2.0, 3.0], "bins": [-0.5584851503372192, -0.5410619378089905, -0.5236387848854065, -0.5062155723571777, -0.48879238963127136, -0.471369206905365, -0.45394599437713623, -0.43652281165122986, -0.4190996289253235, -0.4016764461994171, -0.38425326347351074, -0.366830050945282, -0.3494068682193756, -0.33198368549346924, -0.3145604729652405, -0.2971372902393341, -0.27971410751342773, -0.26229092478752136, -0.2448677271604538, -0.22744452953338623, -0.21002134680747986, -0.1925981640815735, -0.17517496645450592, -0.15775176882743835, -0.14032858610153198, -0.12290539592504501, -0.10548220574855804, -0.08805901557207108, -0.0706358253955841, -0.05321263521909714, -0.03578944504261017, -0.0183662548661232, -0.0009430646896362305, 0.01648012548685074, 0.03390331566333771, 0.051326505839824677, 0.06874969601631165, 0.08617288619279861, 0.10359607636928558, 0.12101926654577255, 0.13844245672225952, 0.1558656394481659, 0.17328883707523346, 0.19071203470230103, 0.2081352174282074, 0.22555840015411377, 0.24298159778118134, 0.2604047954082489, 0.2778279781341553, 0.29525116086006165, 0.312674343585968, 0.3300975561141968, 0.34752073884010315, 0.3649439215660095, 0.3823671340942383, 0.39979031682014465, 0.417213499546051, 0.4346366822719574, 0.45205986499786377, 0.46948307752609253, 0.4869062602519989, 0.5043294429779053, 0.521752655506134, 0.539175808429718, 0.5565990209579468]}, "gradients/encoder.encoder.layers.11.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 1.0, 1.0, 1.0, 8.0, 6.0, 10.0, 13.0, 21.0, 41.0, 80.0, 141.0, 409.0, 5845.0, 4186398.0, 897.0, 195.0, 96.0, 59.0, 31.0, 14.0, 5.0, 5.0, 6.0, 1.0, 3.0, 0.0, 1.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-2.4375, -2.3797454833984375, -2.321990966796875, -2.2642364501953125, -2.20648193359375, -2.1487274169921875, -2.090972900390625, -2.0332183837890625, -1.9754638671875, -1.9177093505859375, -1.859954833984375, -1.8022003173828125, -1.74444580078125, -1.6866912841796875, -1.628936767578125, -1.5711822509765625, -1.513427734375, -1.4556732177734375, -1.397918701171875, -1.3401641845703125, -1.28240966796875, -1.2246551513671875, -1.166900634765625, -1.1091461181640625, -1.0513916015625, -0.9936370849609375, -0.935882568359375, -0.8781280517578125, -0.82037353515625, -0.7626190185546875, -0.704864501953125, -0.6471099853515625, -0.58935546875, -0.5316009521484375, -0.473846435546875, -0.4160919189453125, -0.35833740234375, -0.3005828857421875, -0.242828369140625, -0.1850738525390625, -0.1273193359375, -0.0695648193359375, -0.011810302734375, 0.0459442138671875, 0.10369873046875, 0.1614532470703125, 0.219207763671875, 0.2769622802734375, 0.334716796875, 0.3924713134765625, 0.450225830078125, 0.5079803466796875, 0.56573486328125, 0.6234893798828125, 0.681243896484375, 0.7389984130859375, 0.7967529296875, 0.8545074462890625, 0.912261962890625, 0.9700164794921875, 1.02777099609375, 1.0855255126953125, 1.143280029296875, 1.2010345458984375, 1.2587890625]}, "gradients/encoder.encoder.layers.11.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0, 4.0, 6.0, 8.0, 15.0, 37.0, 43.0, 60.0, 94.0, 79.0, 96.0, 121.0, 110.0, 96.0, 75.0, 57.0, 28.0, 32.0, 18.0, 15.0, 6.0, 5.0, 1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 3.0], "bins": [-0.112060546875, -0.10947990417480469, -0.10689926147460938, -0.10431861877441406, -0.10173797607421875, -0.09915733337402344, -0.09657669067382812, -0.09399604797363281, -0.0914154052734375, -0.08883476257324219, -0.08625411987304688, -0.08367347717285156, -0.08109283447265625, -0.07851219177246094, -0.07593154907226562, -0.07335090637207031, -0.070770263671875, -0.06818962097167969, -0.06560897827148438, -0.06302833557128906, -0.06044769287109375, -0.05786705017089844, -0.055286407470703125, -0.05270576477050781, -0.0501251220703125, -0.04754447937011719, -0.044963836669921875, -0.04238319396972656, -0.03980255126953125, -0.03722190856933594, -0.034641265869140625, -0.03206062316894531, -0.02947998046875, -0.026899337768554688, -0.024318695068359375, -0.021738052368164062, -0.01915740966796875, -0.016576766967773438, -0.013996124267578125, -0.011415481567382812, -0.0088348388671875, -0.0062541961669921875, -0.003673553466796875, -0.0010929107666015625, 0.00148773193359375, 0.0040683746337890625, 0.006649017333984375, 0.009229660034179688, 0.011810302734375, 0.014390945434570312, 0.016971588134765625, 0.019552230834960938, 0.02213287353515625, 0.024713516235351562, 0.027294158935546875, 0.029874801635742188, 0.0324554443359375, 0.03503608703613281, 0.037616729736328125, 0.04019737243652344, 0.04277801513671875, 0.04535865783691406, 0.047939300537109375, 0.05051994323730469, 0.0531005859375]}, "gradients/encoder.encoder.layers.11.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 4.0, 0.0, 1.0, 2.0, 3.0, 4.0, 8.0, 7.0, 12.0, 17.0, 27.0, 25.0, 38.0, 54.0, 75.0, 110.0, 111.0, 183.0, 228.0, 373.0, 540.0, 858.0, 1472.0, 2728.0, 7561.0, 109521.0, 4051027.0, 11129.0, 3365.0, 1765.0, 1001.0, 592.0, 413.0, 306.0, 190.0, 138.0, 111.0, 66.0, 71.0, 56.0, 35.0, 25.0, 12.0, 5.0, 7.0, 5.0, 7.0, 6.0, 4.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.2113037109375, -0.204925537109375, -0.19854736328125, -0.192169189453125, -0.185791015625, -0.179412841796875, -0.17303466796875, -0.166656494140625, -0.1602783203125, -0.153900146484375, -0.14752197265625, -0.141143798828125, -0.134765625, -0.128387451171875, -0.12200927734375, -0.115631103515625, -0.1092529296875, -0.102874755859375, -0.09649658203125, -0.090118408203125, -0.083740234375, -0.077362060546875, -0.07098388671875, -0.064605712890625, -0.0582275390625, -0.051849365234375, -0.04547119140625, -0.039093017578125, -0.03271484375, -0.026336669921875, -0.01995849609375, -0.013580322265625, -0.0072021484375, -0.000823974609375, 0.00555419921875, 0.011932373046875, 0.018310546875, 0.024688720703125, 0.03106689453125, 0.037445068359375, 0.0438232421875, 0.050201416015625, 0.05657958984375, 0.062957763671875, 0.0693359375, 0.075714111328125, 0.08209228515625, 0.088470458984375, 0.0948486328125, 0.101226806640625, 0.10760498046875, 0.113983154296875, 0.120361328125, 0.126739501953125, 0.13311767578125, 0.139495849609375, 0.1458740234375, 0.152252197265625, 0.15863037109375, 0.165008544921875, 0.17138671875, 0.177764892578125, 0.18414306640625, 0.190521240234375, 0.1968994140625]}, "gradients/encoder.encoder.layers.11.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 4.0, 1.0, 4.0, 2.0, 6.0, 5.0, 16.0, 36.0, 423.0, 3516.0, 40.0, 19.0, 5.0, 3.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0447998046875, -0.04330873489379883, -0.041817665100097656, -0.040326595306396484, -0.03883552551269531, -0.03734445571899414, -0.03585338592529297, -0.0343623161315918, -0.032871246337890625, -0.03138017654418945, -0.02988910675048828, -0.02839803695678711, -0.026906967163085938, -0.025415897369384766, -0.023924827575683594, -0.022433757781982422, -0.02094268798828125, -0.019451618194580078, -0.017960548400878906, -0.016469478607177734, -0.014978408813476562, -0.01348733901977539, -0.011996269226074219, -0.010505199432373047, -0.009014129638671875, -0.007523059844970703, -0.006031990051269531, -0.004540920257568359, -0.0030498504638671875, -0.0015587806701660156, -6.771087646484375e-05, 0.0014233589172363281, 0.0029144287109375, 0.004405498504638672, 0.005896568298339844, 0.007387638092041016, 0.008878707885742188, 0.01036977767944336, 0.011860847473144531, 0.013351917266845703, 0.014842987060546875, 0.016334056854248047, 0.01782512664794922, 0.01931619644165039, 0.020807266235351562, 0.022298336029052734, 0.023789405822753906, 0.025280475616455078, 0.02677154541015625, 0.028262615203857422, 0.029753684997558594, 0.031244754791259766, 0.03273582458496094, 0.03422689437866211, 0.03571796417236328, 0.03720903396606445, 0.038700103759765625, 0.0401911735534668, 0.04168224334716797, 0.04317331314086914, 0.04466438293457031, 0.046155452728271484, 0.047646522521972656, 0.04913759231567383, 0.050628662109375]}, "gradients/encoder.encoder.layers.11.final_layer_norm.weight": {"_type": "histogram", "values": [4.0, 3.0, 5.0, 10.0, 27.0, 119.0, 414.0, 311.0, 91.0, 27.0, 7.0, 2.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.08491823822259903, -0.07251914590597153, -0.060120053589344025, -0.04772096499800682, -0.03532187268137932, -0.022922780364751816, -0.010523691773414612, 0.0018754005432128906, 0.014274492859840393, 0.026673585176467896, 0.0390726774930954, 0.0514717660844326, 0.0638708621263504, 0.0762699544429779, 0.08866903930902481, 0.10106813162565231, 0.11346722394227982, 0.12586630880832672, 0.13826540112495422, 0.15066449344158173, 0.16306358575820923, 0.17546267807483673, 0.18786177039146423, 0.20026086270809174, 0.21265995502471924, 0.22505904734134674, 0.23745813965797424, 0.24985723197460175, 0.26225632429122925, 0.27465540170669556, 0.28705450892448425, 0.29945358633995056, 0.31185266375541687, 0.3242517411708832, 0.3366508483886719, 0.3490499258041382, 0.3614490330219269, 0.3738481104373932, 0.3862472176551819, 0.3986462950706482, 0.4110454022884369, 0.4234444797039032, 0.4358435869216919, 0.4482426643371582, 0.4606417715549469, 0.4730408489704132, 0.4854399561882019, 0.4978390336036682, 0.5102381110191345, 0.5226371884346008, 0.5350362658500671, 0.5474354028701782, 0.5598344802856445, 0.5722335577011108, 0.5846326351165771, 0.5970317721366882, 0.6094308495521545, 0.6218299269676208, 0.6342290043830872, 0.6466281414031982, 0.6590272188186646, 0.6714262962341309, 0.6838253736495972, 0.6962245106697083, 0.7086235880851746]}, "gradients/encoder.encoder.layers.11.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 4.0, 1.0, 8.0, 4.0, 10.0, 12.0, 29.0, 14.0, 29.0, 29.0, 34.0, 46.0, 40.0, 50.0, 39.0, 50.0, 63.0, 70.0, 54.0, 66.0, 56.0, 46.0, 52.0, 30.0, 36.0, 26.0, 22.0, 19.0, 20.0, 11.0, 12.0, 4.0, 5.0, 7.0, 4.0, 5.0, 3.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.06170976161956787, -0.05943150818347931, -0.05715325102210045, -0.05487499386072159, -0.052596740424633026, -0.050318486988544464, -0.048040229827165604, -0.04576197266578674, -0.04348371922969818, -0.04120546579360962, -0.03892720863223076, -0.0366489514708519, -0.034370698034763336, -0.032092444598674774, -0.029814187437295914, -0.027535932138562202, -0.02525767683982849, -0.02297942154109478, -0.02070116624236107, -0.018422910943627357, -0.016144655644893646, -0.013866400346159935, -0.011588145047426224, -0.009309889748692513, -0.007031634449958801, -0.00475337915122509, -0.002475123852491379, -0.00019686855375766754, 0.0020813867449760437, 0.004359642043709755, 0.006637897342443466, 0.008916152641177177, 0.011194407939910889, 0.0134726632386446, 0.01575091853737831, 0.018029173836112022, 0.020307429134845734, 0.022585684433579445, 0.024863939732313156, 0.027142195031046867, 0.02942045032978058, 0.03169870376586914, 0.033976960927248, 0.03625521808862686, 0.038533471524715424, 0.040811724960803986, 0.043089982122182846, 0.04536823928356171, 0.04764649271965027, 0.04992474615573883, 0.05220300331711769, 0.05448126047849655, 0.056759513914585114, 0.059037767350673676, 0.061316024512052536, 0.0635942816734314, 0.06587253510951996, 0.06815078854560852, 0.07042904198169708, 0.07270730286836624, 0.0749855563044548, 0.07726380974054337, 0.07954207062721252, 0.08182032406330109, 0.08409857749938965]}, "gradients/encoder.encoder.layers.11.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 4.0, 8.0, 4.0, 5.0, 12.0, 20.0, 19.0, 37.0, 53.0, 63.0, 127.0, 171.0, 276.0, 509.0, 1165.0, 2945.0, 12195.0, 81837.0, 639029.0, 270624.0, 30210.0, 5761.0, 1750.0, 763.0, 331.0, 208.0, 120.0, 91.0, 55.0, 51.0, 31.0, 17.0, 22.0, 14.0, 10.0, 8.0, 3.0, 3.0, 4.0, 3.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1600341796875, -0.15465545654296875, -0.1492767333984375, -0.14389801025390625, -0.138519287109375, -0.13314056396484375, -0.1277618408203125, -0.12238311767578125, -0.11700439453125, -0.11162567138671875, -0.1062469482421875, -0.10086822509765625, -0.095489501953125, -0.09011077880859375, -0.0847320556640625, -0.07935333251953125, -0.073974609375, -0.06859588623046875, -0.0632171630859375, -0.05783843994140625, -0.052459716796875, -0.04708099365234375, -0.0417022705078125, -0.03632354736328125, -0.03094482421875, -0.02556610107421875, -0.0201873779296875, -0.01480865478515625, -0.009429931640625, -0.00405120849609375, 0.0013275146484375, 0.00670623779296875, 0.0120849609375, 0.01746368408203125, 0.0228424072265625, 0.02822113037109375, 0.033599853515625, 0.03897857666015625, 0.0443572998046875, 0.04973602294921875, 0.05511474609375, 0.06049346923828125, 0.0658721923828125, 0.07125091552734375, 0.076629638671875, 0.08200836181640625, 0.0873870849609375, 0.09276580810546875, 0.09814453125, 0.10352325439453125, 0.1089019775390625, 0.11428070068359375, 0.119659423828125, 0.12503814697265625, 0.1304168701171875, 0.13579559326171875, 0.14117431640625, 0.14655303955078125, 0.1519317626953125, 0.15731048583984375, 0.162689208984375, 0.16806793212890625, 0.1734466552734375, 0.17882537841796875, 0.1842041015625]}, "gradients/encoder.encoder.layers.11.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 3.0, 4.0, 7.0, 12.0, 19.0, 40.0, 45.0, 63.0, 87.0, 79.0, 107.0, 114.0, 95.0, 92.0, 74.0, 53.0, 38.0, 22.0, 24.0, 11.0, 12.0, 6.0, 2.0, 0.0, 2.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 2.0], "bins": [-0.1131591796875, -0.1105494499206543, -0.1079397201538086, -0.10532999038696289, -0.10272026062011719, -0.10011053085327148, -0.09750080108642578, -0.09489107131958008, -0.09228134155273438, -0.08967161178588867, -0.08706188201904297, -0.08445215225219727, -0.08184242248535156, -0.07923269271850586, -0.07662296295166016, -0.07401323318481445, -0.07140350341796875, -0.06879377365112305, -0.06618404388427734, -0.06357431411743164, -0.06096458435058594, -0.058354854583740234, -0.05574512481689453, -0.05313539505004883, -0.050525665283203125, -0.04791593551635742, -0.04530620574951172, -0.042696475982666016, -0.04008674621582031, -0.03747701644897461, -0.034867286682128906, -0.0322575569152832, -0.0296478271484375, -0.027038097381591797, -0.024428367614746094, -0.02181863784790039, -0.019208908081054688, -0.016599178314208984, -0.013989448547363281, -0.011379718780517578, -0.008769989013671875, -0.006160259246826172, -0.0035505294799804688, -0.0009407997131347656, 0.0016689300537109375, 0.004278659820556641, 0.006888389587402344, 0.009498119354248047, 0.01210784912109375, 0.014717578887939453, 0.017327308654785156, 0.01993703842163086, 0.022546768188476562, 0.025156497955322266, 0.02776622772216797, 0.030375957489013672, 0.032985687255859375, 0.03559541702270508, 0.03820514678955078, 0.040814876556396484, 0.04342460632324219, 0.04603433609008789, 0.048644065856933594, 0.0512537956237793, 0.053863525390625]}, "gradients/encoder.encoder.layers.11.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 4.0, 10.0, 10.0, 16.0, 15.0, 25.0, 36.0, 45.0, 80.0, 113.0, 173.0, 347.0, 746.0, 1762.0, 5259.0, 22296.0, 190926.0, 722597.0, 84627.0, 13382.0, 3593.0, 1283.0, 540.0, 252.0, 142.0, 93.0, 65.0, 34.0, 18.0, 15.0, 15.0, 11.0, 8.0, 10.0, 4.0, 1.0, 4.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.149169921875, -0.14469528198242188, -0.14022064208984375, -0.13574600219726562, -0.1312713623046875, -0.12679672241210938, -0.12232208251953125, -0.11784744262695312, -0.113372802734375, -0.10889816284179688, -0.10442352294921875, -0.09994888305664062, -0.0954742431640625, -0.09099960327148438, -0.08652496337890625, -0.08205032348632812, -0.07757568359375, -0.07310104370117188, -0.06862640380859375, -0.06415176391601562, -0.0596771240234375, -0.055202484130859375, -0.05072784423828125, -0.046253204345703125, -0.041778564453125, -0.037303924560546875, -0.03282928466796875, -0.028354644775390625, -0.0238800048828125, -0.019405364990234375, -0.01493072509765625, -0.010456085205078125, -0.0059814453125, -0.001506805419921875, 0.00296783447265625, 0.007442474365234375, 0.0119171142578125, 0.016391754150390625, 0.02086639404296875, 0.025341033935546875, 0.029815673828125, 0.034290313720703125, 0.03876495361328125, 0.043239593505859375, 0.0477142333984375, 0.052188873291015625, 0.05666351318359375, 0.061138153076171875, 0.06561279296875, 0.07008743286132812, 0.07456207275390625, 0.07903671264648438, 0.0835113525390625, 0.08798599243164062, 0.09246063232421875, 0.09693527221679688, 0.101409912109375, 0.10588455200195312, 0.11035919189453125, 0.11483383178710938, 0.1193084716796875, 0.12378311157226562, 0.12825775146484375, 0.13273239135742188, 0.13720703125]}, "gradients/encoder.encoder.layers.11.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 3.0, 4.0, 6.0, 0.0, 0.0, 6.0, 10.0, 14.0, 10.0, 11.0, 28.0, 22.0, 20.0, 24.0, 38.0, 41.0, 29.0, 46.0, 48.0, 36.0, 53.0, 56.0, 46.0, 47.0, 46.0, 47.0, 47.0, 42.0, 37.0, 28.0, 23.0, 28.0, 22.0, 20.0, 16.0, 19.0, 7.0, 10.0, 5.0, 5.0, 4.0, 4.0, 2.0, 1.0, 0.0, 0.0, 3.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.1502685546875, -0.1456775665283203, -0.14108657836914062, -0.13649559020996094, -0.13190460205078125, -0.12731361389160156, -0.12272262573242188, -0.11813163757324219, -0.1135406494140625, -0.10894966125488281, -0.10435867309570312, -0.09976768493652344, -0.09517669677734375, -0.09058570861816406, -0.08599472045898438, -0.08140373229980469, -0.076812744140625, -0.07222175598144531, -0.06763076782226562, -0.06303977966308594, -0.05844879150390625, -0.05385780334472656, -0.049266815185546875, -0.04467582702636719, -0.0400848388671875, -0.03549385070800781, -0.030902862548828125, -0.026311874389648438, -0.02172088623046875, -0.017129898071289062, -0.012538909912109375, -0.007947921752929688, -0.00335693359375, 0.0012340545654296875, 0.005825042724609375, 0.010416030883789062, 0.01500701904296875, 0.019598007202148438, 0.024188995361328125, 0.028779983520507812, 0.0333709716796875, 0.03796195983886719, 0.042552947998046875, 0.04714393615722656, 0.05173492431640625, 0.05632591247558594, 0.060916900634765625, 0.06550788879394531, 0.070098876953125, 0.07468986511230469, 0.07928085327148438, 0.08387184143066406, 0.08846282958984375, 0.09305381774902344, 0.09764480590820312, 0.10223579406738281, 0.1068267822265625, 0.11141777038574219, 0.11600875854492188, 0.12059974670410156, 0.12519073486328125, 0.12978172302246094, 0.13437271118164062, 0.1389636993408203, 0.1435546875]}, "gradients/encoder.encoder.layers.11.attention.k_proj.weight": {"_type": "histogram", "values": [3.0, 4.0, 2.0, 5.0, 6.0, 7.0, 9.0, 6.0, 4.0, 13.0, 19.0, 23.0, 47.0, 66.0, 93.0, 153.0, 292.0, 580.0, 1235.0, 3421.0, 10750.0, 41114.0, 181069.0, 530903.0, 212253.0, 47675.0, 12152.0, 3865.0, 1398.0, 611.0, 318.0, 175.0, 96.0, 57.0, 38.0, 24.0, 19.0, 17.0, 7.0, 13.0, 10.0, 4.0, 5.0, 2.0, 7.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0195465087890625, -0.01871657371520996, -0.017886638641357422, -0.017056703567504883, -0.016226768493652344, -0.015396833419799805, -0.014566898345947266, -0.013736963272094727, -0.012907028198242188, -0.012077093124389648, -0.01124715805053711, -0.01041722297668457, -0.009587287902832031, -0.008757352828979492, -0.007927417755126953, -0.007097482681274414, -0.006267547607421875, -0.005437612533569336, -0.004607677459716797, -0.003777742385864258, -0.0029478073120117188, -0.0021178722381591797, -0.0012879371643066406, -0.00045800209045410156, 0.0003719329833984375, 0.0012018680572509766, 0.0020318031311035156, 0.0028617382049560547, 0.0036916732788085938, 0.004521608352661133, 0.005351543426513672, 0.006181478500366211, 0.00701141357421875, 0.007841348648071289, 0.008671283721923828, 0.009501218795776367, 0.010331153869628906, 0.011161088943481445, 0.011991024017333984, 0.012820959091186523, 0.013650894165039062, 0.014480829238891602, 0.01531076431274414, 0.01614069938659668, 0.01697063446044922, 0.017800569534301758, 0.018630504608154297, 0.019460439682006836, 0.020290374755859375, 0.021120309829711914, 0.021950244903564453, 0.022780179977416992, 0.02361011505126953, 0.02444005012512207, 0.02526998519897461, 0.02609992027282715, 0.026929855346679688, 0.027759790420532227, 0.028589725494384766, 0.029419660568237305, 0.030249595642089844, 0.031079530715942383, 0.03190946578979492, 0.03273940086364746, 0.0335693359375]}, "gradients/encoder.encoder.layers.11.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 3.0, 2.0, 3.0, 6.0, 6.0, 4.0, 16.0, 15.0, 15.0, 21.0, 23.0, 20.0, 42.0, 45.0, 41.0, 40.0, 57.0, 61.0, 64.0, 69.0, 64.0, 65.0, 36.0, 57.0, 54.0, 42.0, 26.0, 25.0, 23.0, 14.0, 12.0, 16.0, 4.0, 4.0, 3.0, 3.0, 2.0, 3.0, 3.0, 1.0, 1.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-7.450580596923828e-06, -7.220543920993805e-06, -6.990507245063782e-06, -6.7604705691337585e-06, -6.530433893203735e-06, -6.300397217273712e-06, -6.070360541343689e-06, -5.840323865413666e-06, -5.610287189483643e-06, -5.380250513553619e-06, -5.150213837623596e-06, -4.920177161693573e-06, -4.69014048576355e-06, -4.460103809833527e-06, -4.230067133903503e-06, -4.00003045797348e-06, -3.769993782043457e-06, -3.539957106113434e-06, -3.3099204301834106e-06, -3.0798837542533875e-06, -2.8498470783233643e-06, -2.619810402393341e-06, -2.389773726463318e-06, -2.1597370505332947e-06, -1.9297003746032715e-06, -1.6996636986732483e-06, -1.469627022743225e-06, -1.239590346813202e-06, -1.0095536708831787e-06, -7.795169949531555e-07, -5.494803190231323e-07, -3.1944364309310913e-07, -8.940696716308594e-08, 1.4062970876693726e-07, 3.7066638469696045e-07, 6.007030606269836e-07, 8.307397365570068e-07, 1.06077641248703e-06, 1.2908130884170532e-06, 1.5208497643470764e-06, 1.7508864402770996e-06, 1.980923116207123e-06, 2.210959792137146e-06, 2.440996468067169e-06, 2.6710331439971924e-06, 2.9010698199272156e-06, 3.1311064958572388e-06, 3.361143171787262e-06, 3.591179847717285e-06, 3.821216523647308e-06, 4.0512531995773315e-06, 4.281289875507355e-06, 4.511326551437378e-06, 4.741363227367401e-06, 4.971399903297424e-06, 5.2014365792274475e-06, 5.431473255157471e-06, 5.661509931087494e-06, 5.891546607017517e-06, 6.12158328294754e-06, 6.3516199588775635e-06, 6.581656634807587e-06, 6.81169331073761e-06, 7.041729986667633e-06, 7.271766662597656e-06]}, "gradients/encoder.encoder.layers.11.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 3.0, 1.0, 2.0, 7.0, 6.0, 4.0, 7.0, 11.0, 14.0, 17.0, 26.0, 42.0, 63.0, 69.0, 121.0, 209.0, 388.0, 724.0, 1430.0, 3258.0, 8585.0, 27070.0, 105453.0, 393850.0, 369518.0, 98123.0, 25580.0, 7872.0, 2996.0, 1448.0, 673.0, 374.0, 225.0, 129.0, 93.0, 49.0, 38.0, 32.0, 9.0, 11.0, 8.0, 5.0, 5.0, 4.0, 3.0, 5.0, 2.0, 3.0, 1.0, 2.0, 0.0, 1.0, 3.0], "bins": [-0.02996826171875, -0.029111385345458984, -0.02825450897216797, -0.027397632598876953, -0.026540756225585938, -0.025683879852294922, -0.024827003479003906, -0.02397012710571289, -0.023113250732421875, -0.02225637435913086, -0.021399497985839844, -0.020542621612548828, -0.019685745239257812, -0.018828868865966797, -0.01797199249267578, -0.017115116119384766, -0.01625823974609375, -0.015401363372802734, -0.014544486999511719, -0.013687610626220703, -0.012830734252929688, -0.011973857879638672, -0.011116981506347656, -0.01026010513305664, -0.009403228759765625, -0.00854635238647461, -0.007689476013183594, -0.006832599639892578, -0.0059757232666015625, -0.005118846893310547, -0.004261970520019531, -0.0034050941467285156, -0.0025482177734375, -0.0016913414001464844, -0.0008344650268554688, 2.2411346435546875e-05, 0.0008792877197265625, 0.0017361640930175781, 0.0025930404663085938, 0.0034499168395996094, 0.004306793212890625, 0.005163669586181641, 0.006020545959472656, 0.006877422332763672, 0.0077342987060546875, 0.008591175079345703, 0.009448051452636719, 0.010304927825927734, 0.01116180419921875, 0.012018680572509766, 0.012875556945800781, 0.013732433319091797, 0.014589309692382812, 0.015446186065673828, 0.016303062438964844, 0.01715993881225586, 0.018016815185546875, 0.01887369155883789, 0.019730567932128906, 0.020587444305419922, 0.021444320678710938, 0.022301197052001953, 0.02315807342529297, 0.024014949798583984, 0.024871826171875]}, "gradients/encoder.encoder.layers.11.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 2.0, 0.0, 1.0, 4.0, 3.0, 4.0, 5.0, 8.0, 11.0, 15.0, 17.0, 23.0, 30.0, 24.0, 38.0, 51.0, 57.0, 70.0, 58.0, 75.0, 70.0, 73.0, 74.0, 58.0, 48.0, 45.0, 35.0, 27.0, 19.0, 29.0, 11.0, 10.0, 5.0, 4.0, 1.0, 3.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.03564453125, -0.034726619720458984, -0.03380870819091797, -0.03289079666137695, -0.03197288513183594, -0.031054973602294922, -0.030137062072753906, -0.02921915054321289, -0.028301239013671875, -0.02738332748413086, -0.026465415954589844, -0.025547504425048828, -0.024629592895507812, -0.023711681365966797, -0.02279376983642578, -0.021875858306884766, -0.02095794677734375, -0.020040035247802734, -0.01912212371826172, -0.018204212188720703, -0.017286300659179688, -0.016368389129638672, -0.015450477600097656, -0.01453256607055664, -0.013614654541015625, -0.01269674301147461, -0.011778831481933594, -0.010860919952392578, -0.009943008422851562, -0.009025096893310547, -0.008107185363769531, -0.007189273834228516, -0.0062713623046875, -0.005353450775146484, -0.004435539245605469, -0.003517627716064453, -0.0025997161865234375, -0.0016818046569824219, -0.0007638931274414062, 0.00015401840209960938, 0.001071929931640625, 0.0019898414611816406, 0.0029077529907226562, 0.003825664520263672, 0.0047435760498046875, 0.005661487579345703, 0.006579399108886719, 0.007497310638427734, 0.00841522216796875, 0.009333133697509766, 0.010251045227050781, 0.011168956756591797, 0.012086868286132812, 0.013004779815673828, 0.013922691345214844, 0.01484060287475586, 0.015758514404296875, 0.01667642593383789, 0.017594337463378906, 0.018512248992919922, 0.019430160522460938, 0.020348072052001953, 0.02126598358154297, 0.022183895111083984, 0.023101806640625]}, "gradients/encoder.encoder.layers.11.layer_norm.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 6.0, 3.0, 6.0, 6.0, 12.0, 19.0, 25.0, 31.0, 49.0, 69.0, 77.0, 122.0, 127.0, 124.0, 91.0, 71.0, 52.0, 41.0, 21.0, 20.0, 16.0, 7.0, 6.0, 6.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.35948291420936584, -0.34035542607307434, -0.3212279677391052, -0.3021004796028137, -0.2829730212688446, -0.2638455331325531, -0.2447180598974228, -0.22559058666229248, -0.20646311342716217, -0.18733564019203186, -0.16820816695690155, -0.14908069372177124, -0.12995320558547974, -0.11082573980093002, -0.09169825911521912, -0.0725707858800888, -0.053443312644958496, -0.034315839409828186, -0.015188362449407578, 0.003939114511013031, 0.02306658774614334, 0.04219406098127365, 0.06132154166698456, 0.08044901490211487, 0.09957648813724518, 0.11870396137237549, 0.1378314346075058, 0.1569589078426361, 0.1760863959789276, 0.19521385431289673, 0.21434134244918823, 0.23346881568431854, 0.25259625911712646, 0.27172374725341797, 0.2908512055873871, 0.3099786937236786, 0.3291061520576477, 0.3482336401939392, 0.3673611283302307, 0.38648858666419983, 0.40561604499816895, 0.42474353313446045, 0.44387099146842957, 0.46299847960472107, 0.4821259379386902, 0.5012534260749817, 0.5203809142112732, 0.5395083427429199, 0.5586358308792114, 0.5777633190155029, 0.5968908071517944, 0.6160182356834412, 0.6351457238197327, 0.6542732119560242, 0.6734007000923157, 0.6925281286239624, 0.7116556763648987, 0.7307831645011902, 0.7499106526374817, 0.7690380811691284, 0.7881655693054199, 0.8072930574417114, 0.8264205455780029, 0.8455480337142944, 0.8646754622459412]}, "gradients/encoder.encoder.layers.11.layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 4.0, 1.0, 5.0, 4.0, 4.0, 9.0, 5.0, 10.0, 9.0, 14.0, 17.0, 18.0, 21.0, 24.0, 31.0, 31.0, 26.0, 38.0, 29.0, 41.0, 59.0, 44.0, 47.0, 61.0, 43.0, 48.0, 43.0, 37.0, 32.0, 38.0, 37.0, 19.0, 25.0, 21.0, 18.0, 15.0, 17.0, 9.0, 7.0, 11.0, 8.0, 8.0, 4.0, 11.0, 1.0, 3.0, 3.0, 4.0, 0.0, 0.0, 2.0, 0.0, 1.0], "bins": [-0.7132899761199951, -0.6924432516098022, -0.6715965867042542, -0.6507498621940613, -0.6299031972885132, -0.6090564727783203, -0.5882097482681274, -0.5673630237579346, -0.5465163588523865, -0.5256696343421936, -0.5048229694366455, -0.48397624492645264, -0.46312955021858215, -0.44228285551071167, -0.4214361310005188, -0.4005894362926483, -0.37974274158477783, -0.35889604687690735, -0.33804935216903687, -0.317202627658844, -0.2963559329509735, -0.275509238243103, -0.25466251373291016, -0.23381581902503967, -0.2129691243171692, -0.1921224296092987, -0.17127572000026703, -0.15042901039123535, -0.12958231568336487, -0.10873561352491379, -0.08788891136646271, -0.06704220175743103, -0.04619550704956055, -0.025348804891109467, -0.004502102732658386, 0.016344599425792694, 0.037191301584243774, 0.058038003742694855, 0.07888470590114594, 0.09973141551017761, 0.1205781102180481, 0.14142480492591858, 0.16227151453495026, 0.18311822414398193, 0.20396491885185242, 0.2248116135597229, 0.24565832316875458, 0.26650503277778625, 0.28735172748565674, 0.3081984221935272, 0.3290451169013977, 0.3498918414115906, 0.37073853611946106, 0.39158523082733154, 0.4124319553375244, 0.4332786500453949, 0.4541253447532654, 0.47497203946113586, 0.49581873416900635, 0.5166654586791992, 0.5375121831893921, 0.5583588480949402, 0.5792055726051331, 0.6000522375106812, 0.620898962020874]}, "gradients/encoder.encoder.layers.10.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 2.0, 0.0, 1.0, 2.0, 2.0, 2.0, 4.0, 1.0, 1.0, 1.0, 4.0, 1.0, 5.0, 4.0, 8.0, 13.0, 27.0, 38.0, 78.0, 136.0, 321.0, 730.0, 2346.0, 4166674.0, 21309.0, 1489.0, 550.0, 266.0, 103.0, 61.0, 38.0, 20.0, 15.0, 15.0, 8.0, 6.0, 3.0, 3.0, 2.0, 2.0, 4.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.72607421875, -0.7091255187988281, -0.6921768188476562, -0.6752281188964844, -0.6582794189453125, -0.6413307189941406, -0.6243820190429688, -0.6074333190917969, -0.590484619140625, -0.5735359191894531, -0.5565872192382812, -0.5396385192871094, -0.5226898193359375, -0.5057411193847656, -0.48879241943359375, -0.4718437194824219, -0.45489501953125, -0.4379463195800781, -0.42099761962890625, -0.4040489196777344, -0.3871002197265625, -0.3701515197753906, -0.35320281982421875, -0.3362541198730469, -0.319305419921875, -0.3023567199707031, -0.28540802001953125, -0.2684593200683594, -0.2515106201171875, -0.23456192016601562, -0.21761322021484375, -0.20066452026367188, -0.1837158203125, -0.16676712036132812, -0.14981842041015625, -0.13286972045898438, -0.1159210205078125, -0.09897232055664062, -0.08202362060546875, -0.06507492065429688, -0.048126220703125, -0.031177520751953125, -0.01422882080078125, 0.002719879150390625, 0.0196685791015625, 0.036617279052734375, 0.05356597900390625, 0.07051467895507812, 0.08746337890625, 0.10441207885742188, 0.12136077880859375, 0.13830947875976562, 0.1552581787109375, 0.17220687866210938, 0.18915557861328125, 0.20610427856445312, 0.223052978515625, 0.24000167846679688, 0.25695037841796875, 0.2738990783691406, 0.2908477783203125, 0.3077964782714844, 0.32474517822265625, 0.3416938781738281, 0.358642578125]}, "gradients/encoder.encoder.layers.10.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 3.0, 6.0, 9.0, 11.0, 21.0, 33.0, 42.0, 51.0, 90.0, 106.0, 98.0, 107.0, 109.0, 85.0, 77.0, 58.0, 34.0, 22.0, 16.0, 11.0, 8.0, 9.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.11248779296875, -0.10989141464233398, -0.10729503631591797, -0.10469865798950195, -0.10210227966308594, -0.09950590133666992, -0.0969095230102539, -0.09431314468383789, -0.09171676635742188, -0.08912038803100586, -0.08652400970458984, -0.08392763137817383, -0.08133125305175781, -0.0787348747253418, -0.07613849639892578, -0.07354211807250977, -0.07094573974609375, -0.06834936141967773, -0.06575298309326172, -0.0631566047668457, -0.06056022644042969, -0.05796384811401367, -0.055367469787597656, -0.05277109146118164, -0.050174713134765625, -0.04757833480834961, -0.044981956481933594, -0.04238557815551758, -0.03978919982910156, -0.03719282150268555, -0.03459644317626953, -0.032000064849853516, -0.0294036865234375, -0.026807308197021484, -0.02421092987060547, -0.021614551544189453, -0.019018173217773438, -0.016421794891357422, -0.013825416564941406, -0.01122903823852539, -0.008632659912109375, -0.006036281585693359, -0.0034399032592773438, -0.0008435249328613281, 0.0017528533935546875, 0.004349231719970703, 0.006945610046386719, 0.009541988372802734, 0.01213836669921875, 0.014734745025634766, 0.01733112335205078, 0.019927501678466797, 0.022523880004882812, 0.025120258331298828, 0.027716636657714844, 0.03031301498413086, 0.032909393310546875, 0.03550577163696289, 0.038102149963378906, 0.04069852828979492, 0.04329490661621094, 0.04589128494262695, 0.04848766326904297, 0.051084041595458984, 0.053680419921875]}, "gradients/encoder.encoder.layers.10.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 3.0, 5.0, 9.0, 2.0, 16.0, 19.0, 21.0, 24.0, 37.0, 63.0, 63.0, 111.0, 147.0, 179.0, 277.0, 339.0, 535.0, 732.0, 1195.0, 1899.0, 3454.0, 7707.0, 33345.0, 4052367.0, 70308.0, 10803.0, 4243.0, 2206.0, 1301.0, 811.0, 583.0, 418.0, 277.0, 217.0, 170.0, 91.0, 83.0, 64.0, 49.0, 19.0, 32.0, 16.0, 17.0, 11.0, 10.0, 2.0, 4.0, 5.0, 2.0, 5.0, 2.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.098876953125, -0.09584808349609375, -0.0928192138671875, -0.08979034423828125, -0.086761474609375, -0.08373260498046875, -0.0807037353515625, -0.07767486572265625, -0.07464599609375, -0.07161712646484375, -0.0685882568359375, -0.06555938720703125, -0.062530517578125, -0.05950164794921875, -0.0564727783203125, -0.05344390869140625, -0.0504150390625, -0.04738616943359375, -0.0443572998046875, -0.04132843017578125, -0.038299560546875, -0.03527069091796875, -0.0322418212890625, -0.02921295166015625, -0.02618408203125, -0.02315521240234375, -0.0201263427734375, -0.01709747314453125, -0.014068603515625, -0.01103973388671875, -0.0080108642578125, -0.00498199462890625, -0.001953125, 0.00107574462890625, 0.0041046142578125, 0.00713348388671875, 0.010162353515625, 0.01319122314453125, 0.0162200927734375, 0.01924896240234375, 0.02227783203125, 0.02530670166015625, 0.0283355712890625, 0.03136444091796875, 0.034393310546875, 0.03742218017578125, 0.0404510498046875, 0.04347991943359375, 0.0465087890625, 0.04953765869140625, 0.0525665283203125, 0.05559539794921875, 0.058624267578125, 0.06165313720703125, 0.0646820068359375, 0.06771087646484375, 0.07073974609375, 0.07376861572265625, 0.0767974853515625, 0.07982635498046875, 0.082855224609375, 0.08588409423828125, 0.0889129638671875, 0.09194183349609375, 0.094970703125]}, "gradients/encoder.encoder.layers.10.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 0.0, 0.0, 2.0, 0.0, 2.0, 0.0, 1.0, 2.0, 6.0, 6.0, 8.0, 13.0, 29.0, 47.0, 135.0, 3549.0, 182.0, 36.0, 22.0, 8.0, 7.0, 5.0, 2.0, 7.0, 2.0, 1.0, 4.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 0.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.0277099609375, -0.026972055435180664, -0.026234149932861328, -0.025496244430541992, -0.024758338928222656, -0.02402043342590332, -0.023282527923583984, -0.02254462242126465, -0.021806716918945312, -0.021068811416625977, -0.02033090591430664, -0.019593000411987305, -0.01885509490966797, -0.018117189407348633, -0.017379283905029297, -0.01664137840270996, -0.015903472900390625, -0.015165567398071289, -0.014427661895751953, -0.013689756393432617, -0.012951850891113281, -0.012213945388793945, -0.01147603988647461, -0.010738134384155273, -0.010000228881835938, -0.009262323379516602, -0.008524417877197266, -0.00778651237487793, -0.007048606872558594, -0.006310701370239258, -0.005572795867919922, -0.004834890365600586, -0.00409698486328125, -0.003359079360961914, -0.002621173858642578, -0.0018832683563232422, -0.0011453628540039062, -0.0004074573516845703, 0.0003304481506347656, 0.0010683536529541016, 0.0018062591552734375, 0.0025441646575927734, 0.0032820701599121094, 0.004019975662231445, 0.004757881164550781, 0.005495786666870117, 0.006233692169189453, 0.006971597671508789, 0.007709503173828125, 0.008447408676147461, 0.009185314178466797, 0.009923219680786133, 0.010661125183105469, 0.011399030685424805, 0.01213693618774414, 0.012874841690063477, 0.013612747192382812, 0.014350652694702148, 0.015088558197021484, 0.01582646369934082, 0.016564369201660156, 0.017302274703979492, 0.018040180206298828, 0.018778085708618164, 0.0195159912109375]}, "gradients/encoder.encoder.layers.10.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 3.0, 1.0, 2.0, 0.0, 0.0, 3.0, 5.0, 9.0, 6.0, 22.0, 21.0, 36.0, 57.0, 93.0, 132.0, 163.0, 158.0, 115.0, 81.0, 39.0, 30.0, 15.0, 11.0, 2.0, 4.0, 3.0, 1.0, 2.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.13695700466632843, -0.13332800567150116, -0.1296989917755127, -0.12606999278068542, -0.12244099378585815, -0.11881198734045029, -0.11518298089504242, -0.11155398190021515, -0.10792497545480728, -0.10429596900939941, -0.10066697001457214, -0.09703796356916428, -0.09340895712375641, -0.08977995812892914, -0.08615095168352127, -0.0825219452381134, -0.07889294624328613, -0.07526393979787827, -0.071634940803051, -0.06800593435764313, -0.06437693536281586, -0.06074792891740799, -0.05711892247200012, -0.05348991975188255, -0.049860917031764984, -0.046231914311647415, -0.042602911591529846, -0.03897390514612198, -0.03534490242600441, -0.03171589970588684, -0.028086895123124123, -0.024457890540361404, -0.020828895270824432, -0.017199892550706863, -0.013570887967944145, -0.009941884316504002, -0.006312880665063858, -0.002683877944946289, 0.0009451266378164291, 0.004574131220579147, 0.008203133940696716, 0.01183213759213686, 0.015461141243577003, 0.01909014582633972, 0.02271914854645729, 0.02634815126657486, 0.029977155849337578, 0.033606160432100296, 0.037235163152217865, 0.040864165872335434, 0.044493168592453, 0.04812217503786087, 0.05175117775797844, 0.05538018047809601, 0.059009186923503876, 0.06263819336891174, 0.06626719236373901, 0.06989619880914688, 0.07352519780397415, 0.07715420424938202, 0.08078320324420929, 0.08441220968961716, 0.08804121613502502, 0.0916702151298523, 0.09529922157526016]}, "gradients/encoder.encoder.layers.10.final_layer_norm.bias": {"_type": "histogram", "values": [2.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 6.0, 5.0, 6.0, 3.0, 4.0, 16.0, 14.0, 13.0, 14.0, 12.0, 26.0, 21.0, 29.0, 28.0, 37.0, 37.0, 33.0, 36.0, 39.0, 44.0, 50.0, 33.0, 33.0, 38.0, 45.0, 36.0, 45.0, 46.0, 36.0, 27.0, 23.0, 25.0, 20.0, 17.0, 16.0, 17.0, 21.0, 18.0, 7.0, 7.0, 6.0, 4.0, 6.0, 4.0, 2.0, 0.0, 3.0, 2.0, 1.0, 0.0, 2.0, 1.0, 0.0, 1.0], "bins": [-0.041365206241607666, -0.04009249806404114, -0.03881978988647461, -0.03754708543419838, -0.03627437725663185, -0.03500166907906532, -0.03372896462678909, -0.032456256449222565, -0.031183548271656036, -0.029910840094089508, -0.02863813377916813, -0.02736542746424675, -0.02609271928668022, -0.024820011109113693, -0.023547304794192314, -0.022274598479270935, -0.021001890301704407, -0.01972918212413788, -0.0184564758092165, -0.01718376949429512, -0.015911061316728592, -0.014638354070484638, -0.013365646824240685, -0.01209293957799673, -0.010820232331752777, -0.009547525085508823, -0.00827481783926487, -0.007002110593020916, -0.005729403346776962, -0.004456696100533009, -0.003183988854289055, -0.0019112816080451012, -0.0006385743618011475, 0.0006341328844428062, 0.00190684013068676, 0.0031795473769307137, 0.004452254623174667, 0.005724961869418621, 0.006997669115662575, 0.008270376361906528, 0.009543083608150482, 0.010815790854394436, 0.01208849810063839, 0.013361205346882343, 0.014633912593126297, 0.015906620770692825, 0.017179327085614204, 0.018452033400535583, 0.019724741578102112, 0.02099744975566864, 0.02227015607059002, 0.0235428623855114, 0.024815570563077927, 0.026088278740644455, 0.027360985055565834, 0.028633691370487213, 0.02990639954805374, 0.03117910772562027, 0.0324518159031868, 0.03372452035546303, 0.034997228533029556, 0.036269936710596085, 0.037542641162872314, 0.03881534934043884, 0.04008805751800537]}, "gradients/encoder.encoder.layers.10.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 3.0, 0.0, 3.0, 1.0, 7.0, 7.0, 9.0, 14.0, 14.0, 24.0, 35.0, 36.0, 76.0, 105.0, 209.0, 341.0, 610.0, 1477.0, 4516.0, 23630.0, 353950.0, 615954.0, 37919.0, 6165.0, 1860.0, 701.0, 336.0, 206.0, 107.0, 66.0, 52.0, 34.0, 19.0, 21.0, 15.0, 12.0, 10.0, 7.0, 5.0, 3.0, 1.0, 0.0, 3.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0], "bins": [-0.251708984375, -0.2445392608642578, -0.23736953735351562, -0.23019981384277344, -0.22303009033203125, -0.21586036682128906, -0.20869064331054688, -0.2015209197998047, -0.1943511962890625, -0.1871814727783203, -0.18001174926757812, -0.17284202575683594, -0.16567230224609375, -0.15850257873535156, -0.15133285522460938, -0.1441631317138672, -0.136993408203125, -0.1298236846923828, -0.12265396118164062, -0.11548423767089844, -0.10831451416015625, -0.10114479064941406, -0.09397506713867188, -0.08680534362792969, -0.0796356201171875, -0.07246589660644531, -0.06529617309570312, -0.05812644958496094, -0.05095672607421875, -0.04378700256347656, -0.036617279052734375, -0.029447555541992188, -0.02227783203125, -0.015108108520507812, -0.007938385009765625, -0.0007686614990234375, 0.00640106201171875, 0.013570785522460938, 0.020740509033203125, 0.027910232543945312, 0.0350799560546875, 0.04224967956542969, 0.049419403076171875, 0.05658912658691406, 0.06375885009765625, 0.07092857360839844, 0.07809829711914062, 0.08526802062988281, 0.092437744140625, 0.09960746765136719, 0.10677719116210938, 0.11394691467285156, 0.12111663818359375, 0.12828636169433594, 0.13545608520507812, 0.1426258087158203, 0.1497955322265625, 0.1569652557373047, 0.16413497924804688, 0.17130470275878906, 0.17847442626953125, 0.18564414978027344, 0.19281387329101562, 0.1999835968017578, 0.2071533203125]}, "gradients/encoder.encoder.layers.10.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 4.0, 9.0, 8.0, 12.0, 23.0, 35.0, 53.0, 61.0, 78.0, 98.0, 105.0, 112.0, 102.0, 88.0, 69.0, 53.0, 28.0, 30.0, 11.0, 13.0, 8.0, 5.0, 2.0, 2.0, 1.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.1134033203125, -0.11077499389648438, -0.10814666748046875, -0.10551834106445312, -0.1028900146484375, -0.10026168823242188, -0.09763336181640625, -0.09500503540039062, -0.092376708984375, -0.08974838256835938, -0.08712005615234375, -0.08449172973632812, -0.0818634033203125, -0.07923507690429688, -0.07660675048828125, -0.07397842407226562, -0.07135009765625, -0.06872177124023438, -0.06609344482421875, -0.06346511840820312, -0.0608367919921875, -0.058208465576171875, -0.05558013916015625, -0.052951812744140625, -0.050323486328125, -0.047695159912109375, -0.04506683349609375, -0.042438507080078125, -0.0398101806640625, -0.037181854248046875, -0.03455352783203125, -0.031925201416015625, -0.029296875, -0.026668548583984375, -0.02404022216796875, -0.021411895751953125, -0.0187835693359375, -0.016155242919921875, -0.01352691650390625, -0.010898590087890625, -0.008270263671875, -0.005641937255859375, -0.00301361083984375, -0.000385284423828125, 0.0022430419921875, 0.004871368408203125, 0.00749969482421875, 0.010128021240234375, 0.01275634765625, 0.015384674072265625, 0.01801300048828125, 0.020641326904296875, 0.0232696533203125, 0.025897979736328125, 0.02852630615234375, 0.031154632568359375, 0.033782958984375, 0.036411285400390625, 0.03903961181640625, 0.041667938232421875, 0.0442962646484375, 0.046924591064453125, 0.04955291748046875, 0.052181243896484375, 0.0548095703125]}, "gradients/encoder.encoder.layers.10.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 3.0, 8.0, 8.0, 12.0, 9.0, 11.0, 31.0, 29.0, 41.0, 55.0, 79.0, 126.0, 242.0, 446.0, 972.0, 2383.0, 8276.0, 46398.0, 621943.0, 329603.0, 28517.0, 5976.0, 1887.0, 730.0, 331.0, 153.0, 83.0, 64.0, 44.0, 24.0, 16.0, 16.0, 16.0, 8.0, 7.0, 8.0, 5.0, 2.0, 3.0, 2.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09393310546875, -0.0894327163696289, -0.08493232727050781, -0.08043193817138672, -0.07593154907226562, -0.07143115997314453, -0.06693077087402344, -0.062430381774902344, -0.05792999267578125, -0.053429603576660156, -0.04892921447753906, -0.04442882537841797, -0.039928436279296875, -0.03542804718017578, -0.030927658081054688, -0.026427268981933594, -0.0219268798828125, -0.017426490783691406, -0.012926101684570312, -0.008425712585449219, -0.003925323486328125, 0.0005750656127929688, 0.0050754547119140625, 0.009575843811035156, 0.01407623291015625, 0.018576622009277344, 0.023077011108398438, 0.02757740020751953, 0.032077789306640625, 0.03657817840576172, 0.04107856750488281, 0.045578956604003906, 0.050079345703125, 0.054579734802246094, 0.05908012390136719, 0.06358051300048828, 0.06808090209960938, 0.07258129119873047, 0.07708168029785156, 0.08158206939697266, 0.08608245849609375, 0.09058284759521484, 0.09508323669433594, 0.09958362579345703, 0.10408401489257812, 0.10858440399169922, 0.11308479309082031, 0.1175851821899414, 0.1220855712890625, 0.1265859603881836, 0.1310863494873047, 0.13558673858642578, 0.14008712768554688, 0.14458751678466797, 0.14908790588378906, 0.15358829498291016, 0.15808868408203125, 0.16258907318115234, 0.16708946228027344, 0.17158985137939453, 0.17609024047851562, 0.18059062957763672, 0.1850910186767578, 0.1895914077758789, 0.194091796875]}, "gradients/encoder.encoder.layers.10.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 4.0, 0.0, 5.0, 5.0, 11.0, 3.0, 9.0, 9.0, 16.0, 13.0, 23.0, 32.0, 27.0, 22.0, 25.0, 26.0, 28.0, 42.0, 30.0, 34.0, 46.0, 51.0, 60.0, 48.0, 41.0, 46.0, 48.0, 35.0, 45.0, 32.0, 23.0, 39.0, 22.0, 13.0, 20.0, 10.0, 10.0, 10.0, 5.0, 10.0, 6.0, 8.0, 3.0, 4.0, 3.0, 7.0, 0.0, 1.0, 1.0, 3.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.1292724609375, -0.12497901916503906, -0.12068557739257812, -0.11639213562011719, -0.11209869384765625, -0.10780525207519531, -0.10351181030273438, -0.09921836853027344, -0.0949249267578125, -0.09063148498535156, -0.08633804321289062, -0.08204460144042969, -0.07775115966796875, -0.07345771789550781, -0.06916427612304688, -0.06487083435058594, -0.060577392578125, -0.05628395080566406, -0.051990509033203125, -0.04769706726074219, -0.04340362548828125, -0.03911018371582031, -0.034816741943359375, -0.030523300170898438, -0.0262298583984375, -0.021936416625976562, -0.017642974853515625, -0.013349533081054688, -0.00905609130859375, -0.0047626495361328125, -0.000469207763671875, 0.0038242340087890625, 0.00811767578125, 0.012411117553710938, 0.016704559326171875, 0.020998001098632812, 0.02529144287109375, 0.029584884643554688, 0.033878326416015625, 0.03817176818847656, 0.0424652099609375, 0.04675865173339844, 0.051052093505859375, 0.05534553527832031, 0.05963897705078125, 0.06393241882324219, 0.06822586059570312, 0.07251930236816406, 0.076812744140625, 0.08110618591308594, 0.08539962768554688, 0.08969306945800781, 0.09398651123046875, 0.09827995300292969, 0.10257339477539062, 0.10686683654785156, 0.1111602783203125, 0.11545372009277344, 0.11974716186523438, 0.12404060363769531, 0.12833404541015625, 0.1326274871826172, 0.13692092895507812, 0.14121437072753906, 0.1455078125]}, "gradients/encoder.encoder.layers.10.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 2.0, 4.0, 3.0, 3.0, 3.0, 5.0, 8.0, 13.0, 15.0, 15.0, 31.0, 36.0, 45.0, 65.0, 120.0, 213.0, 421.0, 907.0, 2533.0, 9515.0, 57772.0, 561692.0, 366677.0, 37680.0, 7070.0, 2125.0, 765.0, 323.0, 178.0, 94.0, 58.0, 51.0, 33.0, 23.0, 19.0, 10.0, 9.0, 6.0, 8.0, 7.0, 2.0, 2.0, 1.0, 2.0, 1.0, 1.0, 4.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0374755859375, -0.0363006591796875, -0.035125732421875, -0.0339508056640625, -0.03277587890625, -0.0316009521484375, -0.030426025390625, -0.0292510986328125, -0.028076171875, -0.0269012451171875, -0.025726318359375, -0.0245513916015625, -0.02337646484375, -0.0222015380859375, -0.021026611328125, -0.0198516845703125, -0.0186767578125, -0.0175018310546875, -0.016326904296875, -0.0151519775390625, -0.01397705078125, -0.0128021240234375, -0.011627197265625, -0.0104522705078125, -0.00927734375, -0.0081024169921875, -0.006927490234375, -0.0057525634765625, -0.00457763671875, -0.0034027099609375, -0.002227783203125, -0.0010528564453125, 0.0001220703125, 0.0012969970703125, 0.002471923828125, 0.0036468505859375, 0.00482177734375, 0.0059967041015625, 0.007171630859375, 0.0083465576171875, 0.009521484375, 0.0106964111328125, 0.011871337890625, 0.0130462646484375, 0.01422119140625, 0.0153961181640625, 0.016571044921875, 0.0177459716796875, 0.0189208984375, 0.0200958251953125, 0.021270751953125, 0.0224456787109375, 0.02362060546875, 0.0247955322265625, 0.025970458984375, 0.0271453857421875, 0.0283203125, 0.0294952392578125, 0.030670166015625, 0.0318450927734375, 0.03302001953125, 0.0341949462890625, 0.035369873046875, 0.0365447998046875, 0.0377197265625]}, "gradients/encoder.encoder.layers.10.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 3.0, 2.0, 4.0, 0.0, 8.0, 9.0, 5.0, 10.0, 12.0, 35.0, 15.0, 17.0, 36.0, 24.0, 49.0, 53.0, 39.0, 60.0, 60.0, 67.0, 50.0, 49.0, 61.0, 35.0, 76.0, 29.0, 45.0, 26.0, 20.0, 26.0, 20.0, 16.0, 11.0, 10.0, 8.0, 2.0, 7.0, 4.0, 6.0, 1.0, 0.0, 0.0, 2.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-6.079673767089844e-06, -5.8766454458236694e-06, -5.673617124557495e-06, -5.470588803291321e-06, -5.2675604820251465e-06, -5.064532160758972e-06, -4.861503839492798e-06, -4.6584755182266235e-06, -4.455447196960449e-06, -4.252418875694275e-06, -4.049390554428101e-06, -3.846362233161926e-06, -3.643333911895752e-06, -3.4403055906295776e-06, -3.2372772693634033e-06, -3.034248948097229e-06, -2.8312206268310547e-06, -2.6281923055648804e-06, -2.425163984298706e-06, -2.2221356630325317e-06, -2.0191073417663574e-06, -1.816079020500183e-06, -1.6130506992340088e-06, -1.4100223779678345e-06, -1.2069940567016602e-06, -1.0039657354354858e-06, -8.009374141693115e-07, -5.979090929031372e-07, -3.948807716369629e-07, -1.9185245037078857e-07, 1.1175870895385742e-08, 2.1420419216156006e-07, 4.172325134277344e-07, 6.202608346939087e-07, 8.23289155960083e-07, 1.0263174772262573e-06, 1.2293457984924316e-06, 1.432374119758606e-06, 1.6354024410247803e-06, 1.8384307622909546e-06, 2.041459083557129e-06, 2.2444874048233032e-06, 2.4475157260894775e-06, 2.650544047355652e-06, 2.853572368621826e-06, 3.0566006898880005e-06, 3.259629011154175e-06, 3.462657332420349e-06, 3.6656856536865234e-06, 3.868713974952698e-06, 4.071742296218872e-06, 4.274770617485046e-06, 4.477798938751221e-06, 4.680827260017395e-06, 4.883855581283569e-06, 5.086883902549744e-06, 5.289912223815918e-06, 5.492940545082092e-06, 5.695968866348267e-06, 5.898997187614441e-06, 6.102025508880615e-06, 6.3050538301467896e-06, 6.508082151412964e-06, 6.711110472679138e-06, 6.9141387939453125e-06]}, "gradients/encoder.encoder.layers.10.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 3.0, 0.0, 2.0, 3.0, 6.0, 6.0, 10.0, 14.0, 16.0, 36.0, 45.0, 85.0, 161.0, 290.0, 705.0, 1770.0, 6356.0, 43938.0, 643636.0, 322133.0, 22691.0, 4346.0, 1274.0, 470.0, 254.0, 123.0, 74.0, 48.0, 22.0, 16.0, 12.0, 10.0, 4.0, 6.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.040496826171875, -0.03886747360229492, -0.037238121032714844, -0.035608768463134766, -0.03397941589355469, -0.03235006332397461, -0.03072071075439453, -0.029091358184814453, -0.027462005615234375, -0.025832653045654297, -0.02420330047607422, -0.02257394790649414, -0.020944595336914062, -0.019315242767333984, -0.017685890197753906, -0.016056537628173828, -0.01442718505859375, -0.012797832489013672, -0.011168479919433594, -0.009539127349853516, -0.007909774780273438, -0.006280422210693359, -0.004651069641113281, -0.003021717071533203, -0.001392364501953125, 0.00023698806762695312, 0.0018663406372070312, 0.0034956932067871094, 0.0051250457763671875, 0.006754398345947266, 0.008383750915527344, 0.010013103485107422, 0.0116424560546875, 0.013271808624267578, 0.014901161193847656, 0.016530513763427734, 0.018159866333007812, 0.01978921890258789, 0.02141857147216797, 0.023047924041748047, 0.024677276611328125, 0.026306629180908203, 0.02793598175048828, 0.02956533432006836, 0.031194686889648438, 0.032824039459228516, 0.034453392028808594, 0.03608274459838867, 0.03771209716796875, 0.03934144973754883, 0.040970802307128906, 0.042600154876708984, 0.04422950744628906, 0.04585886001586914, 0.04748821258544922, 0.0491175651550293, 0.050746917724609375, 0.05237627029418945, 0.05400562286376953, 0.05563497543334961, 0.05726432800292969, 0.058893680572509766, 0.060523033142089844, 0.06215238571166992, 0.06378173828125]}, "gradients/encoder.encoder.layers.10.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 4.0, 3.0, 7.0, 6.0, 9.0, 10.0, 6.0, 14.0, 25.0, 33.0, 44.0, 43.0, 61.0, 60.0, 100.0, 73.0, 91.0, 68.0, 63.0, 49.0, 44.0, 36.0, 33.0, 25.0, 22.0, 17.0, 13.0, 16.0, 7.0, 5.0, 2.0, 5.0, 1.0, 4.0, 1.0, 3.0, 1.0, 1.0, 2.0, 2.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0281829833984375, -0.02721858024597168, -0.02625417709350586, -0.02528977394104004, -0.02432537078857422, -0.0233609676361084, -0.022396564483642578, -0.021432161331176758, -0.020467758178710938, -0.019503355026245117, -0.018538951873779297, -0.017574548721313477, -0.016610145568847656, -0.015645742416381836, -0.014681339263916016, -0.013716936111450195, -0.012752532958984375, -0.011788129806518555, -0.010823726654052734, -0.009859323501586914, -0.008894920349121094, -0.007930517196655273, -0.006966114044189453, -0.006001710891723633, -0.0050373077392578125, -0.004072904586791992, -0.003108501434326172, -0.0021440982818603516, -0.0011796951293945312, -0.00021529197692871094, 0.0007491111755371094, 0.0017135143280029297, 0.00267791748046875, 0.0036423206329345703, 0.004606723785400391, 0.005571126937866211, 0.006535530090332031, 0.0074999332427978516, 0.008464336395263672, 0.009428739547729492, 0.010393142700195312, 0.011357545852661133, 0.012321949005126953, 0.013286352157592773, 0.014250755310058594, 0.015215158462524414, 0.016179561614990234, 0.017143964767456055, 0.018108367919921875, 0.019072771072387695, 0.020037174224853516, 0.021001577377319336, 0.021965980529785156, 0.022930383682250977, 0.023894786834716797, 0.024859189987182617, 0.025823593139648438, 0.026787996292114258, 0.027752399444580078, 0.0287168025970459, 0.02968120574951172, 0.03064560890197754, 0.03161001205444336, 0.03257441520690918, 0.033538818359375]}, "gradients/encoder.encoder.layers.10.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 2.0, 11.0, 22.0, 88.0, 294.0, 396.0, 144.0, 40.0, 13.0, 3.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.22335684299469, -1.1562827825546265, -1.0892088413238525, -1.022134780883789, -0.9550607204437256, -0.8879867196083069, -0.8209127187728882, -0.7538386583328247, -0.686764657497406, -0.6196906566619873, -0.5526165962219238, -0.4855425953865051, -0.41846856474876404, -0.35139453411102295, -0.28432053327560425, -0.21724650263786316, -0.15017247200012207, -0.08309844881296158, -0.016024425625801086, 0.05104959011077881, 0.1181236207485199, 0.185197651386261, 0.2522716522216797, 0.3193456828594208, 0.38641971349716187, 0.45349374413490295, 0.520567774772644, 0.5876417756080627, 0.6547157764434814, 0.7217898368835449, 0.7888638377189636, 0.8559378385543823, 0.9230120182037354, 0.990086019039154, 1.0571600198745728, 1.1242340803146362, 1.1913081407546997, 1.2583820819854736, 1.325456142425537, 1.3925302028656006, 1.459604263305664, 1.5266783237457275, 1.5937522649765015, 1.660826325416565, 1.7279003858566284, 1.7949743270874023, 1.8620483875274658, 1.9291224479675293, 1.9961963891983032, 2.063270330429077, 2.1303443908691406, 2.197418451309204, 2.2644925117492676, 2.331566572189331, 2.3986406326293945, 2.465714454650879, 2.5327885150909424, 2.599862575531006, 2.6669366359710693, 2.734010696411133, 2.801084518432617, 2.8681585788726807, 2.935232639312744, 3.0023066997528076, 3.069380760192871]}, "gradients/encoder.encoder.layers.10.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 3.0, 1.0, 4.0, 2.0, 3.0, 8.0, 6.0, 5.0, 6.0, 5.0, 12.0, 13.0, 13.0, 11.0, 26.0, 34.0, 27.0, 44.0, 37.0, 42.0, 35.0, 46.0, 56.0, 60.0, 42.0, 63.0, 67.0, 44.0, 50.0, 42.0, 33.0, 33.0, 27.0, 22.0, 19.0, 11.0, 12.0, 14.0, 12.0, 8.0, 3.0, 4.0, 4.0, 4.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0], "bins": [-0.7941553592681885, -0.7700571417808533, -0.7459589838981628, -0.7218607664108276, -0.6977625489234924, -0.673664391040802, -0.6495661735534668, -0.6254680156707764, -0.6013697981834412, -0.577271580696106, -0.5531734228134155, -0.5290752053260803, -0.5049769878387451, -0.4808788299560547, -0.4567806124687195, -0.43268242478370667, -0.40858420729637146, -0.38448601961135864, -0.36038780212402344, -0.3362896144390106, -0.3121914267539978, -0.2880932092666626, -0.2639950215816498, -0.23989683389663696, -0.21579863131046295, -0.19170042872428894, -0.16760224103927612, -0.1435040384531021, -0.1194058433175087, -0.09530764818191528, -0.07120944559574127, -0.047111257910728455, -0.023013055324554443, 0.0010851416736841202, 0.025183338671922684, 0.049281537532806396, 0.07337973266839981, 0.09747792780399323, 0.12157613039016724, 0.14567431807518005, 0.16977252066135406, 0.19387072324752808, 0.2179689109325409, 0.2420671135187149, 0.2661653161048889, 0.29026350378990173, 0.31436169147491455, 0.33845990896224976, 0.3625580966472626, 0.3866562843322754, 0.4107545018196106, 0.4348526895046234, 0.45895087718963623, 0.48304909467697144, 0.5071473121643066, 0.5312454700469971, 0.5553436875343323, 0.5794419050216675, 0.6035400629043579, 0.6276382803916931, 0.6517364978790283, 0.6758346557617188, 0.699932873249054, 0.7240310907363892, 0.7481292486190796]}, "gradients/encoder.encoder.layers.9.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 4.0, 1.0, 16.0, 14.0, 18.0, 44.0, 60.0, 119.0, 280.0, 3370.0, 4189478.0, 535.0, 148.0, 88.0, 49.0, 33.0, 11.0, 6.0, 4.0, 2.0, 5.0, 1.0, 1.0, 3.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 2.0], "bins": [-2.8125, -2.747406005859375, -2.68231201171875, -2.617218017578125, -2.5521240234375, -2.487030029296875, -2.42193603515625, -2.356842041015625, -2.291748046875, -2.226654052734375, -2.16156005859375, -2.096466064453125, -2.0313720703125, -1.966278076171875, -1.90118408203125, -1.836090087890625, -1.77099609375, -1.705902099609375, -1.64080810546875, -1.575714111328125, -1.5106201171875, -1.445526123046875, -1.38043212890625, -1.315338134765625, -1.250244140625, -1.185150146484375, -1.12005615234375, -1.054962158203125, -0.9898681640625, -0.924774169921875, -0.85968017578125, -0.794586181640625, -0.7294921875, -0.664398193359375, -0.59930419921875, -0.534210205078125, -0.4691162109375, -0.404022216796875, -0.33892822265625, -0.273834228515625, -0.208740234375, -0.143646240234375, -0.07855224609375, -0.013458251953125, 0.0516357421875, 0.116729736328125, 0.18182373046875, 0.246917724609375, 0.31201171875, 0.377105712890625, 0.44219970703125, 0.507293701171875, 0.5723876953125, 0.637481689453125, 0.70257568359375, 0.767669677734375, 0.832763671875, 0.897857666015625, 0.96295166015625, 1.028045654296875, 1.0931396484375, 1.158233642578125, 1.22332763671875, 1.288421630859375, 1.353515625]}, "gradients/encoder.encoder.layers.9.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 3.0, 6.0, 6.0, 22.0, 25.0, 33.0, 65.0, 68.0, 82.0, 83.0, 111.0, 119.0, 88.0, 81.0, 65.0, 51.0, 40.0, 24.0, 10.0, 11.0, 9.0, 6.0, 1.0, 2.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.1124267578125, -0.10981082916259766, -0.10719490051269531, -0.10457897186279297, -0.10196304321289062, -0.09934711456298828, -0.09673118591308594, -0.0941152572631836, -0.09149932861328125, -0.0888833999633789, -0.08626747131347656, -0.08365154266357422, -0.08103561401367188, -0.07841968536376953, -0.07580375671386719, -0.07318782806396484, -0.0705718994140625, -0.06795597076416016, -0.06534004211425781, -0.06272411346435547, -0.060108184814453125, -0.05749225616455078, -0.05487632751464844, -0.052260398864746094, -0.04964447021484375, -0.047028541564941406, -0.04441261291503906, -0.04179668426513672, -0.039180755615234375, -0.03656482696533203, -0.03394889831542969, -0.031332969665527344, -0.028717041015625, -0.026101112365722656, -0.023485183715820312, -0.02086925506591797, -0.018253326416015625, -0.01563739776611328, -0.013021469116210938, -0.010405540466308594, -0.00778961181640625, -0.005173683166503906, -0.0025577545166015625, 5.817413330078125e-05, 0.002674102783203125, 0.005290031433105469, 0.007905960083007812, 0.010521888732910156, 0.0131378173828125, 0.015753746032714844, 0.018369674682617188, 0.02098560333251953, 0.023601531982421875, 0.02621746063232422, 0.028833389282226562, 0.031449317932128906, 0.03406524658203125, 0.036681175231933594, 0.03929710388183594, 0.04191303253173828, 0.044528961181640625, 0.04714488983154297, 0.04976081848144531, 0.052376747131347656, 0.05499267578125]}, "gradients/encoder.encoder.layers.9.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0, 0.0, 0.0, 3.0, 6.0, 4.0, 8.0, 10.0, 16.0, 22.0, 28.0, 31.0, 53.0, 51.0, 96.0, 135.0, 188.0, 265.0, 382.0, 604.0, 1011.0, 2279.0, 7906.0, 4107507.0, 64341.0, 5058.0, 1714.0, 841.0, 504.0, 328.0, 226.0, 176.0, 137.0, 90.0, 81.0, 54.0, 31.0, 39.0, 13.0, 17.0, 10.0, 7.0, 7.0, 5.0, 4.0, 3.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.3017578125, -0.29378700256347656, -0.2858161926269531, -0.2778453826904297, -0.26987457275390625, -0.2619037628173828, -0.2539329528808594, -0.24596214294433594, -0.2379913330078125, -0.23002052307128906, -0.22204971313476562, -0.2140789031982422, -0.20610809326171875, -0.1981372833251953, -0.19016647338867188, -0.18219566345214844, -0.174224853515625, -0.16625404357910156, -0.15828323364257812, -0.1503124237060547, -0.14234161376953125, -0.1343708038330078, -0.12639999389648438, -0.11842918395996094, -0.1104583740234375, -0.10248756408691406, -0.09451675415039062, -0.08654594421386719, -0.07857513427734375, -0.07060432434082031, -0.06263351440429688, -0.05466270446777344, -0.04669189453125, -0.03872108459472656, -0.030750274658203125, -0.022779464721679688, -0.01480865478515625, -0.0068378448486328125, 0.001132965087890625, 0.009103775024414062, 0.0170745849609375, 0.025045394897460938, 0.033016204833984375, 0.04098701477050781, 0.04895782470703125, 0.05692863464355469, 0.06489944458007812, 0.07287025451660156, 0.080841064453125, 0.08881187438964844, 0.09678268432617188, 0.10475349426269531, 0.11272430419921875, 0.12069511413574219, 0.12866592407226562, 0.13663673400878906, 0.1446075439453125, 0.15257835388183594, 0.16054916381835938, 0.1685199737548828, 0.17649078369140625, 0.1844615936279297, 0.19243240356445312, 0.20040321350097656, 0.2083740234375]}, "gradients/encoder.encoder.layers.9.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [3.0, 1.0, 5.0, 1.0, 4.0, 2.0, 6.0, 22.0, 86.0, 3832.0, 70.0, 26.0, 8.0, 5.0, 3.0, 4.0, 3.0, 1.0, 1.0, 0.0, 1.0, 1.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0140838623046875, -0.012603044509887695, -0.01112222671508789, -0.009641408920288086, -0.008160591125488281, -0.0066797733306884766, -0.005198955535888672, -0.003718137741088867, -0.0022373199462890625, -0.0007565021514892578, 0.0007243156433105469, 0.0022051334381103516, 0.0036859512329101562, 0.005166769027709961, 0.006647586822509766, 0.00812840461730957, 0.009609222412109375, 0.01109004020690918, 0.012570858001708984, 0.014051675796508789, 0.015532493591308594, 0.0170133113861084, 0.018494129180908203, 0.019974946975708008, 0.021455764770507812, 0.022936582565307617, 0.024417400360107422, 0.025898218154907227, 0.02737903594970703, 0.028859853744506836, 0.03034067153930664, 0.031821489334106445, 0.03330230712890625, 0.034783124923706055, 0.03626394271850586, 0.037744760513305664, 0.03922557830810547, 0.04070639610290527, 0.04218721389770508, 0.04366803169250488, 0.04514884948730469, 0.04662966728210449, 0.0481104850769043, 0.0495913028717041, 0.051072120666503906, 0.05255293846130371, 0.054033756256103516, 0.05551457405090332, 0.056995391845703125, 0.05847620964050293, 0.059957027435302734, 0.06143784523010254, 0.06291866302490234, 0.06439948081970215, 0.06588029861450195, 0.06736111640930176, 0.06884193420410156, 0.07032275199890137, 0.07180356979370117, 0.07328438758850098, 0.07476520538330078, 0.07624602317810059, 0.07772684097290039, 0.0792076587677002, 0.0806884765625]}, "gradients/encoder.encoder.layers.9.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 2.0, 3.0, 4.0, 17.0, 81.0, 455.0, 301.0, 101.0, 30.0, 14.0, 4.0, 5.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7454192638397217, -0.7239395380020142, -0.7024598121643066, -0.6809800267219543, -0.6595003008842468, -0.6380205750465393, -0.6165408492088318, -0.5950610637664795, -0.573581337928772, -0.5521016120910645, -0.5306218862533569, -0.5091421008110046, -0.4876623749732971, -0.4661826491355896, -0.4447029232978821, -0.4232231676578522, -0.40174344182014465, -0.38026371598243713, -0.3587839603424072, -0.3373042345046997, -0.3158244788646698, -0.2943447530269623, -0.2728649973869324, -0.25138527154922485, -0.22990553081035614, -0.20842579007148743, -0.1869460493326187, -0.16546630859375, -0.14398658275604248, -0.12250683456659317, -0.10102710127830505, -0.07954736053943634, -0.05806761980056763, -0.036587879061698914, -0.015108142048120499, 0.006371594965457916, 0.02785133570432663, 0.04933107644319534, 0.07081080973148346, 0.09229055047035217, 0.11377029120922089, 0.1352500319480896, 0.1567297726869583, 0.17820951342582703, 0.19968923926353455, 0.22116899490356445, 0.24264872074127197, 0.2641284465789795, 0.2856082022190094, 0.3070879280567169, 0.3285676836967468, 0.35004740953445435, 0.37152716517448425, 0.3930068910121918, 0.4144866466522217, 0.4359663724899292, 0.4574460983276367, 0.47892582416534424, 0.5004055500030518, 0.521885335445404, 0.5433650612831116, 0.5648447871208191, 0.5863245129585266, 0.6078042984008789, 0.6292840242385864]}, "gradients/encoder.encoder.layers.9.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 0.0, 4.0, 2.0, 0.0, 7.0, 8.0, 7.0, 9.0, 10.0, 4.0, 17.0, 21.0, 23.0, 16.0, 21.0, 29.0, 30.0, 25.0, 33.0, 42.0, 32.0, 26.0, 41.0, 32.0, 41.0, 43.0, 42.0, 33.0, 24.0, 37.0, 40.0, 25.0, 38.0, 30.0, 21.0, 28.0, 19.0, 29.0, 14.0, 19.0, 22.0, 8.0, 11.0, 12.0, 7.0, 6.0, 12.0, 3.0, 3.0, 3.0, 4.0, 1.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.09581488370895386, -0.09286656975746155, -0.08991824835538864, -0.08696992695331573, -0.08402161300182343, -0.08107329905033112, -0.07812497764825821, -0.0751766562461853, -0.072228342294693, -0.06928002834320068, -0.06633170694112778, -0.06338338553905487, -0.06043507158756256, -0.05748675391077995, -0.054538436233997345, -0.05159011855721474, -0.04864180088043213, -0.04569348320364952, -0.04274516552686691, -0.039796847850084305, -0.0368485301733017, -0.03390021249651909, -0.03095189481973648, -0.028003577142953873, -0.025055259466171265, -0.022106941789388657, -0.01915862411260605, -0.01621030643582344, -0.013261988759040833, -0.010313671082258224, -0.0073653534054756165, -0.004417035728693008, -0.0014687180519104004, 0.0014795996248722076, 0.004427917301654816, 0.007376234978437424, 0.010324552655220032, 0.01327287033200264, 0.016221188008785248, 0.019169505685567856, 0.022117823362350464, 0.025066141039133072, 0.02801445871591568, 0.030962776392698288, 0.033911094069480896, 0.036859411746263504, 0.03980772942304611, 0.04275604709982872, 0.04570436477661133, 0.048652682453393936, 0.051601000130176544, 0.05454931780695915, 0.05749763548374176, 0.06044595316052437, 0.06339427083730698, 0.06634259223937988, 0.06929090619087219, 0.0722392201423645, 0.07518754154443741, 0.07813586294651031, 0.08108417689800262, 0.08403249084949493, 0.08698081225156784, 0.08992913365364075, 0.09287744760513306]}, "gradients/encoder.encoder.layers.9.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 5.0, 3.0, 1.0, 6.0, 7.0, 12.0, 10.0, 21.0, 25.0, 47.0, 67.0, 115.0, 274.0, 435.0, 1043.0, 2463.0, 7880.0, 80139.0, 893844.0, 51483.0, 6617.0, 2237.0, 909.0, 430.0, 219.0, 109.0, 62.0, 27.0, 10.0, 27.0, 10.0, 4.0, 4.0, 7.0, 4.0, 2.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0], "bins": [-0.25830078125, -0.25008392333984375, -0.2418670654296875, -0.23365020751953125, -0.225433349609375, -0.21721649169921875, -0.2089996337890625, -0.20078277587890625, -0.19256591796875, -0.18434906005859375, -0.1761322021484375, -0.16791534423828125, -0.159698486328125, -0.15148162841796875, -0.1432647705078125, -0.13504791259765625, -0.1268310546875, -0.11861419677734375, -0.1103973388671875, -0.10218048095703125, -0.093963623046875, -0.08574676513671875, -0.0775299072265625, -0.06931304931640625, -0.06109619140625, -0.05287933349609375, -0.0446624755859375, -0.03644561767578125, -0.028228759765625, -0.02001190185546875, -0.0117950439453125, -0.00357818603515625, 0.004638671875, 0.01285552978515625, 0.0210723876953125, 0.02928924560546875, 0.037506103515625, 0.04572296142578125, 0.0539398193359375, 0.06215667724609375, 0.07037353515625, 0.07859039306640625, 0.0868072509765625, 0.09502410888671875, 0.103240966796875, 0.11145782470703125, 0.1196746826171875, 0.12789154052734375, 0.1361083984375, 0.14432525634765625, 0.1525421142578125, 0.16075897216796875, 0.168975830078125, 0.17719268798828125, 0.1854095458984375, 0.19362640380859375, 0.20184326171875, 0.21006011962890625, 0.2182769775390625, 0.22649383544921875, 0.234710693359375, 0.24292755126953125, 0.2511444091796875, 0.25936126708984375, 0.267578125]}, "gradients/encoder.encoder.layers.9.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 3.0, 1.0, 13.0, 16.0, 21.0, 37.0, 41.0, 50.0, 53.0, 68.0, 86.0, 91.0, 101.0, 80.0, 85.0, 55.0, 64.0, 45.0, 34.0, 25.0, 11.0, 9.0, 8.0, 6.0, 0.0, 5.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0], "bins": [-0.1080322265625, -0.10552740097045898, -0.10302257537841797, -0.10051774978637695, -0.09801292419433594, -0.09550809860229492, -0.0930032730102539, -0.09049844741821289, -0.08799362182617188, -0.08548879623413086, -0.08298397064208984, -0.08047914505004883, -0.07797431945800781, -0.0754694938659668, -0.07296466827392578, -0.07045984268188477, -0.06795501708984375, -0.06545019149780273, -0.06294536590576172, -0.0604405403137207, -0.05793571472167969, -0.05543088912963867, -0.052926063537597656, -0.05042123794555664, -0.047916412353515625, -0.04541158676147461, -0.042906761169433594, -0.04040193557739258, -0.03789710998535156, -0.03539228439331055, -0.03288745880126953, -0.030382633209228516, -0.0278778076171875, -0.025372982025146484, -0.02286815643310547, -0.020363330841064453, -0.017858505249023438, -0.015353679656982422, -0.012848854064941406, -0.01034402847290039, -0.007839202880859375, -0.005334377288818359, -0.0028295516967773438, -0.0003247261047363281, 0.0021800994873046875, 0.004684925079345703, 0.007189750671386719, 0.009694576263427734, 0.01219940185546875, 0.014704227447509766, 0.01720905303955078, 0.019713878631591797, 0.022218704223632812, 0.024723529815673828, 0.027228355407714844, 0.02973318099975586, 0.032238006591796875, 0.03474283218383789, 0.037247657775878906, 0.03975248336791992, 0.04225730895996094, 0.04476213455200195, 0.04726696014404297, 0.049771785736083984, 0.052276611328125]}, "gradients/encoder.encoder.layers.9.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 2.0, 2.0, 3.0, 5.0, 8.0, 17.0, 27.0, 39.0, 49.0, 107.0, 238.0, 645.0, 2244.0, 24142.0, 962729.0, 53846.0, 3192.0, 704.0, 237.0, 141.0, 81.0, 45.0, 33.0, 9.0, 11.0, 4.0, 1.0, 2.0, 2.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.355224609375, -0.34668540954589844, -0.3381462097167969, -0.3296070098876953, -0.32106781005859375, -0.3125286102294922, -0.3039894104003906, -0.29545021057128906, -0.2869110107421875, -0.27837181091308594, -0.2698326110839844, -0.2612934112548828, -0.25275421142578125, -0.2442150115966797, -0.23567581176757812, -0.22713661193847656, -0.218597412109375, -0.21005821228027344, -0.20151901245117188, -0.1929798126220703, -0.18444061279296875, -0.1759014129638672, -0.16736221313476562, -0.15882301330566406, -0.1502838134765625, -0.14174461364746094, -0.13320541381835938, -0.12466621398925781, -0.11612701416015625, -0.10758781433105469, -0.09904861450195312, -0.09050941467285156, -0.08197021484375, -0.07343101501464844, -0.06489181518554688, -0.05635261535644531, -0.04781341552734375, -0.03927421569824219, -0.030735015869140625, -0.022195816040039062, -0.0136566162109375, -0.0051174163818359375, 0.003421783447265625, 0.011960983276367188, 0.02050018310546875, 0.029039382934570312, 0.037578582763671875, 0.04611778259277344, 0.054656982421875, 0.06319618225097656, 0.07173538208007812, 0.08027458190917969, 0.08881378173828125, 0.09735298156738281, 0.10589218139648438, 0.11443138122558594, 0.1229705810546875, 0.13150978088378906, 0.14004898071289062, 0.1485881805419922, 0.15712738037109375, 0.1656665802001953, 0.17420578002929688, 0.18274497985839844, 0.1912841796875]}, "gradients/encoder.encoder.layers.9.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 0.0, 4.0, 1.0, 4.0, 4.0, 4.0, 9.0, 12.0, 7.0, 32.0, 19.0, 30.0, 45.0, 42.0, 52.0, 48.0, 50.0, 57.0, 67.0, 78.0, 65.0, 76.0, 51.0, 48.0, 36.0, 38.0, 34.0, 31.0, 19.0, 14.0, 8.0, 11.0, 7.0, 3.0, 3.0, 4.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.2266845703125, -0.2198486328125, -0.2130126953125, -0.2061767578125, -0.1993408203125, -0.1925048828125, -0.1856689453125, -0.1788330078125, -0.1719970703125, -0.1651611328125, -0.1583251953125, -0.1514892578125, -0.1446533203125, -0.1378173828125, -0.1309814453125, -0.1241455078125, -0.1173095703125, -0.1104736328125, -0.1036376953125, -0.0968017578125, -0.0899658203125, -0.0831298828125, -0.0762939453125, -0.0694580078125, -0.0626220703125, -0.0557861328125, -0.0489501953125, -0.0421142578125, -0.0352783203125, -0.0284423828125, -0.0216064453125, -0.0147705078125, -0.0079345703125, -0.0010986328125, 0.0057373046875, 0.0125732421875, 0.0194091796875, 0.0262451171875, 0.0330810546875, 0.0399169921875, 0.0467529296875, 0.0535888671875, 0.0604248046875, 0.0672607421875, 0.0740966796875, 0.0809326171875, 0.0877685546875, 0.0946044921875, 0.1014404296875, 0.1082763671875, 0.1151123046875, 0.1219482421875, 0.1287841796875, 0.1356201171875, 0.1424560546875, 0.1492919921875, 0.1561279296875, 0.1629638671875, 0.1697998046875, 0.1766357421875, 0.1834716796875, 0.1903076171875, 0.1971435546875, 0.2039794921875, 0.2108154296875]}, "gradients/encoder.encoder.layers.9.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 3.0, 3.0, 4.0, 2.0, 5.0, 8.0, 10.0, 18.0, 17.0, 32.0, 41.0, 55.0, 96.0, 199.0, 362.0, 671.0, 1687.0, 4997.0, 22166.0, 232006.0, 716064.0, 56046.0, 9296.0, 2671.0, 977.0, 499.0, 260.0, 150.0, 80.0, 49.0, 33.0, 15.0, 16.0, 5.0, 8.0, 2.0, 4.0, 4.0, 2.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.049407958984375, -0.047921180725097656, -0.04643440246582031, -0.04494762420654297, -0.043460845947265625, -0.04197406768798828, -0.04048728942871094, -0.039000511169433594, -0.03751373291015625, -0.036026954650878906, -0.03454017639160156, -0.03305339813232422, -0.031566619873046875, -0.03007984161376953, -0.028593063354492188, -0.027106285095214844, -0.0256195068359375, -0.024132728576660156, -0.022645950317382812, -0.02115917205810547, -0.019672393798828125, -0.01818561553955078, -0.016698837280273438, -0.015212059020996094, -0.01372528076171875, -0.012238502502441406, -0.010751724243164062, -0.009264945983886719, -0.007778167724609375, -0.006291389465332031, -0.0048046112060546875, -0.0033178329467773438, -0.0018310546875, -0.00034427642822265625, 0.0011425018310546875, 0.0026292800903320312, 0.004116058349609375, 0.005602836608886719, 0.0070896148681640625, 0.008576393127441406, 0.01006317138671875, 0.011549949645996094, 0.013036727905273438, 0.014523506164550781, 0.016010284423828125, 0.01749706268310547, 0.018983840942382812, 0.020470619201660156, 0.0219573974609375, 0.023444175720214844, 0.024930953979492188, 0.02641773223876953, 0.027904510498046875, 0.02939128875732422, 0.030878067016601562, 0.032364845275878906, 0.03385162353515625, 0.035338401794433594, 0.03682518005371094, 0.03831195831298828, 0.039798736572265625, 0.04128551483154297, 0.04277229309082031, 0.044259071350097656, 0.045745849609375]}, "gradients/encoder.encoder.layers.9.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 0.0, 0.0, 3.0, 0.0, 0.0, 3.0, 4.0, 4.0, 0.0, 4.0, 6.0, 9.0, 11.0, 19.0, 25.0, 15.0, 32.0, 38.0, 47.0, 51.0, 64.0, 68.0, 83.0, 57.0, 82.0, 63.0, 70.0, 54.0, 39.0, 39.0, 23.0, 32.0, 18.0, 10.0, 6.0, 8.0, 8.0, 6.0, 2.0, 1.0, 2.0, 0.0, 1.0, 1.0, 2.0, 0.0, 2.0, 2.0], "bins": [-1.0848045349121094e-05, -1.0574236512184143e-05, -1.0300427675247192e-05, -1.0026618838310242e-05, -9.752810001373291e-06, -9.47900116443634e-06, -9.20519232749939e-06, -8.931383490562439e-06, -8.657574653625488e-06, -8.383765816688538e-06, -8.109956979751587e-06, -7.836148142814636e-06, -7.5623393058776855e-06, -7.288530468940735e-06, -7.014721632003784e-06, -6.7409127950668335e-06, -6.467103958129883e-06, -6.193295121192932e-06, -5.9194862842559814e-06, -5.645677447319031e-06, -5.37186861038208e-06, -5.098059773445129e-06, -4.824250936508179e-06, -4.550442099571228e-06, -4.276633262634277e-06, -4.002824425697327e-06, -3.729015588760376e-06, -3.4552067518234253e-06, -3.1813979148864746e-06, -2.907589077949524e-06, -2.6337802410125732e-06, -2.3599714040756226e-06, -2.086162567138672e-06, -1.8123537302017212e-06, -1.5385448932647705e-06, -1.2647360563278198e-06, -9.909272193908691e-07, -7.171183824539185e-07, -4.4330954551696777e-07, -1.695007085800171e-07, 1.043081283569336e-07, 3.781169652938843e-07, 6.51925802230835e-07, 9.257346391677856e-07, 1.1995434761047363e-06, 1.473352313041687e-06, 1.7471611499786377e-06, 2.0209699869155884e-06, 2.294778823852539e-06, 2.5685876607894897e-06, 2.8423964977264404e-06, 3.116205334663391e-06, 3.390014171600342e-06, 3.6638230085372925e-06, 3.937631845474243e-06, 4.211440682411194e-06, 4.4852495193481445e-06, 4.759058356285095e-06, 5.032867193222046e-06, 5.306676030158997e-06, 5.580484867095947e-06, 5.854293704032898e-06, 6.128102540969849e-06, 6.401911377906799e-06, 6.67572021484375e-06]}, "gradients/encoder.encoder.layers.9.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 3.0, 0.0, 0.0, 1.0, 2.0, 0.0, 3.0, 2.0, 2.0, 6.0, 3.0, 5.0, 4.0, 3.0, 9.0, 13.0, 12.0, 34.0, 72.0, 120.0, 356.0, 998.0, 4069.0, 28392.0, 835430.0, 166022.0, 9995.0, 2023.0, 579.0, 215.0, 86.0, 47.0, 20.0, 8.0, 6.0, 5.0, 4.0, 3.0, 1.0, 3.0, 2.0, 2.0, 2.0, 2.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.08612060546875, -0.08332157135009766, -0.08052253723144531, -0.07772350311279297, -0.07492446899414062, -0.07212543487548828, -0.06932640075683594, -0.0665273666381836, -0.06372833251953125, -0.060929298400878906, -0.05813026428222656, -0.05533123016357422, -0.052532196044921875, -0.04973316192626953, -0.04693412780761719, -0.044135093688964844, -0.0413360595703125, -0.038537025451660156, -0.03573799133300781, -0.03293895721435547, -0.030139923095703125, -0.02734088897705078, -0.024541854858398438, -0.021742820739746094, -0.01894378662109375, -0.016144752502441406, -0.013345718383789062, -0.010546684265136719, -0.007747650146484375, -0.004948616027832031, -0.0021495819091796875, 0.0006494522094726562, 0.003448486328125, 0.006247520446777344, 0.009046554565429688, 0.011845588684082031, 0.014644622802734375, 0.01744365692138672, 0.020242691040039062, 0.023041725158691406, 0.02584075927734375, 0.028639793395996094, 0.03143882751464844, 0.03423786163330078, 0.037036895751953125, 0.03983592987060547, 0.04263496398925781, 0.045433998107910156, 0.0482330322265625, 0.051032066345214844, 0.05383110046386719, 0.05663013458251953, 0.059429168701171875, 0.06222820281982422, 0.06502723693847656, 0.0678262710571289, 0.07062530517578125, 0.0734243392944336, 0.07622337341308594, 0.07902240753173828, 0.08182144165039062, 0.08462047576904297, 0.08741950988769531, 0.09021854400634766, 0.093017578125]}, "gradients/encoder.encoder.layers.9.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 3.0, 1.0, 0.0, 1.0, 0.0, 2.0, 2.0, 3.0, 1.0, 1.0, 1.0, 3.0, 6.0, 4.0, 5.0, 19.0, 28.0, 53.0, 123.0, 210.0, 232.0, 157.0, 71.0, 32.0, 18.0, 9.0, 7.0, 8.0, 4.0, 0.0, 4.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.12109375, -0.11774730682373047, -0.11440086364746094, -0.1110544204711914, -0.10770797729492188, -0.10436153411865234, -0.10101509094238281, -0.09766864776611328, -0.09432220458984375, -0.09097576141357422, -0.08762931823730469, -0.08428287506103516, -0.08093643188476562, -0.0775899887084961, -0.07424354553222656, -0.07089710235595703, -0.0675506591796875, -0.06420421600341797, -0.06085777282714844, -0.057511329650878906, -0.054164886474609375, -0.050818443298339844, -0.04747200012207031, -0.04412555694580078, -0.04077911376953125, -0.03743267059326172, -0.03408622741699219, -0.030739784240722656, -0.027393341064453125, -0.024046897888183594, -0.020700454711914062, -0.01735401153564453, -0.014007568359375, -0.010661125183105469, -0.0073146820068359375, -0.003968238830566406, -0.000621795654296875, 0.0027246475219726562, 0.0060710906982421875, 0.009417533874511719, 0.01276397705078125, 0.01611042022705078, 0.019456863403320312, 0.022803306579589844, 0.026149749755859375, 0.029496192932128906, 0.03284263610839844, 0.03618907928466797, 0.0395355224609375, 0.04288196563720703, 0.04622840881347656, 0.049574851989746094, 0.052921295166015625, 0.056267738342285156, 0.05961418151855469, 0.06296062469482422, 0.06630706787109375, 0.06965351104736328, 0.07299995422363281, 0.07634639739990234, 0.07969284057617188, 0.0830392837524414, 0.08638572692871094, 0.08973217010498047, 0.09307861328125]}, "gradients/encoder.encoder.layers.9.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 3.0, 3.0, 3.0, 13.0, 35.0, 50.0, 129.0, 245.0, 226.0, 173.0, 72.0, 31.0, 17.0, 7.0, 5.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.337580919265747, -1.2996152639389038, -1.2616496086120605, -1.2236839532852173, -1.185718297958374, -1.1477526426315308, -1.1097869873046875, -1.0718213319778442, -1.033855676651001, -0.9958900213241577, -0.9579243659973145, -0.9199587106704712, -0.8819930553436279, -0.8440274000167847, -0.8060617446899414, -0.7680960893630981, -0.7301303744316101, -0.6921647191047668, -0.6541990637779236, -0.6162334084510803, -0.5782677531242371, -0.5403020977973938, -0.5023363828659058, -0.4643707573413849, -0.4264051020145416, -0.38843944668769836, -0.3504737913608551, -0.31250810623168945, -0.2745424509048462, -0.23657681047916412, -0.19861114025115967, -0.1606454849243164, -0.12267982959747314, -0.08471417427062988, -0.046748511493206024, -0.008782848715782166, 0.029182806611061096, 0.06714846193790436, 0.10511413216590881, 0.14307978749275208, 0.18104544281959534, 0.2190110981464386, 0.25697675347328186, 0.2949424386024475, 0.33290809392929077, 0.37087374925613403, 0.4088394045829773, 0.44680505990982056, 0.4847707152366638, 0.5227363705635071, 0.5607020258903503, 0.5986676812171936, 0.6366333365440369, 0.6745989918708801, 0.7125647068023682, 0.7505303621292114, 0.7884960174560547, 0.826461672782898, 0.8644273281097412, 0.9023929834365845, 0.9403586387634277, 0.978324294090271, 1.0162899494171143, 1.0542556047439575, 1.0922212600708008]}, "gradients/encoder.encoder.layers.9.layer_norm.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 4.0, 4.0, 3.0, 9.0, 5.0, 13.0, 15.0, 22.0, 17.0, 25.0, 28.0, 21.0, 31.0, 36.0, 35.0, 48.0, 39.0, 56.0, 51.0, 51.0, 58.0, 45.0, 60.0, 46.0, 41.0, 25.0, 30.0, 37.0, 30.0, 24.0, 21.0, 14.0, 9.0, 7.0, 13.0, 14.0, 8.0, 4.0, 6.0, 4.0, 3.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6151859760284424, -0.5917969942092896, -0.5684080123901367, -0.5450190305709839, -0.521630048751831, -0.4982410669326782, -0.4748521149158478, -0.45146313309669495, -0.4280741512775421, -0.4046851694583893, -0.38129618763923645, -0.357907235622406, -0.3345182538032532, -0.31112927198410034, -0.2877402901649475, -0.2643513083457947, -0.24096232652664185, -0.217573344707489, -0.19418436288833618, -0.17079539597034454, -0.1474064141511917, -0.12401743233203888, -0.10062846541404724, -0.07723948359489441, -0.05385050177574158, -0.030461523681879044, -0.00707254558801651, 0.016316428780555725, 0.03970541059970856, 0.06309439241886139, 0.08648335933685303, 0.10987234115600586, 0.13326138257980347, 0.1566503643989563, 0.18003934621810913, 0.20342831313610077, 0.2268172949552536, 0.2502062916755676, 0.27359524369239807, 0.2969842255115509, 0.32037320733070374, 0.34376218914985657, 0.3671511709690094, 0.39054012298583984, 0.4139291048049927, 0.4373180866241455, 0.46070706844329834, 0.48409605026245117, 0.507485032081604, 0.5308740139007568, 0.5542629957199097, 0.5776519775390625, 0.6010409593582153, 0.6244299411773682, 0.647818922996521, 0.6712079048156738, 0.6945968866348267, 0.7179858684539795, 0.7413748502731323, 0.7647638320922852, 0.788152813911438, 0.8115417957305908, 0.8349307775497437, 0.8583197593688965, 0.8817086815834045]}, "gradients/encoder.encoder.layers.8.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 6.0, 4.0, 12.0, 14.0, 22.0, 29.0, 55.0, 88.0, 121.0, 490.0, 4189031.0, 3965.0, 183.0, 91.0, 67.0, 44.0, 25.0, 20.0, 7.0, 3.0, 5.0, 5.0, 1.0, 1.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0], "bins": [-1.6328125, -1.59368896484375, -1.5545654296875, -1.51544189453125, -1.476318359375, -1.43719482421875, -1.3980712890625, -1.35894775390625, -1.31982421875, -1.28070068359375, -1.2415771484375, -1.20245361328125, -1.163330078125, -1.12420654296875, -1.0850830078125, -1.04595947265625, -1.0068359375, -0.96771240234375, -0.9285888671875, -0.88946533203125, -0.850341796875, -0.81121826171875, -0.7720947265625, -0.73297119140625, -0.69384765625, -0.65472412109375, -0.6156005859375, -0.57647705078125, -0.537353515625, -0.49822998046875, -0.4591064453125, -0.41998291015625, -0.380859375, -0.34173583984375, -0.3026123046875, -0.26348876953125, -0.224365234375, -0.18524169921875, -0.1461181640625, -0.10699462890625, -0.06787109375, -0.02874755859375, 0.0103759765625, 0.04949951171875, 0.088623046875, 0.12774658203125, 0.1668701171875, 0.20599365234375, 0.2451171875, 0.28424072265625, 0.3233642578125, 0.36248779296875, 0.401611328125, 0.44073486328125, 0.4798583984375, 0.51898193359375, 0.55810546875, 0.59722900390625, 0.6363525390625, 0.67547607421875, 0.714599609375, 0.75372314453125, 0.7928466796875, 0.83197021484375, 0.87109375]}, "gradients/encoder.encoder.layers.8.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 3.0, 1.0, 4.0, 9.0, 3.0, 13.0, 20.0, 21.0, 31.0, 55.0, 56.0, 76.0, 59.0, 77.0, 117.0, 94.0, 83.0, 59.0, 54.0, 51.0, 39.0, 27.0, 13.0, 20.0, 9.0, 5.0, 5.0, 0.0, 4.0, 3.0, 1.0, 1.0, 0.0, 0.0, 1.0, 3.0], "bins": [-0.10418701171875, -0.10174083709716797, -0.09929466247558594, -0.0968484878540039, -0.09440231323242188, -0.09195613861083984, -0.08950996398925781, -0.08706378936767578, -0.08461761474609375, -0.08217144012451172, -0.07972526550292969, -0.07727909088134766, -0.07483291625976562, -0.0723867416381836, -0.06994056701660156, -0.06749439239501953, -0.0650482177734375, -0.06260204315185547, -0.06015586853027344, -0.057709693908691406, -0.055263519287109375, -0.052817344665527344, -0.05037117004394531, -0.04792499542236328, -0.04547882080078125, -0.04303264617919922, -0.04058647155761719, -0.038140296936035156, -0.035694122314453125, -0.033247947692871094, -0.030801773071289062, -0.02835559844970703, -0.025909423828125, -0.02346324920654297, -0.021017074584960938, -0.018570899963378906, -0.016124725341796875, -0.013678550720214844, -0.011232376098632812, -0.008786201477050781, -0.00634002685546875, -0.0038938522338867188, -0.0014476776123046875, 0.0009984970092773438, 0.003444671630859375, 0.005890846252441406, 0.008337020874023438, 0.010783195495605469, 0.0132293701171875, 0.01567554473876953, 0.018121719360351562, 0.020567893981933594, 0.023014068603515625, 0.025460243225097656, 0.027906417846679688, 0.03035259246826172, 0.03279876708984375, 0.03524494171142578, 0.03769111633300781, 0.040137290954589844, 0.042583465576171875, 0.045029640197753906, 0.04747581481933594, 0.04992198944091797, 0.0523681640625]}, "gradients/encoder.encoder.layers.8.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 2.0, 4.0, 3.0, 5.0, 3.0, 2.0, 12.0, 13.0, 32.0, 28.0, 44.0, 61.0, 106.0, 132.0, 239.0, 387.0, 700.0, 1389.0, 2869.0, 7671.0, 40895.0, 4060099.0, 63449.0, 9212.0, 3364.0, 1665.0, 789.0, 408.0, 233.0, 147.0, 124.0, 59.0, 49.0, 27.0, 24.0, 17.0, 14.0, 4.0, 6.0, 4.0, 3.0, 2.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1422119140625, -0.1377086639404297, -0.13320541381835938, -0.12870216369628906, -0.12419891357421875, -0.11969566345214844, -0.11519241333007812, -0.11068916320800781, -0.1061859130859375, -0.10168266296386719, -0.09717941284179688, -0.09267616271972656, -0.08817291259765625, -0.08366966247558594, -0.07916641235351562, -0.07466316223144531, -0.070159912109375, -0.06565666198730469, -0.061153411865234375, -0.05665016174316406, -0.05214691162109375, -0.04764366149902344, -0.043140411376953125, -0.03863716125488281, -0.0341339111328125, -0.029630661010742188, -0.025127410888671875, -0.020624160766601562, -0.01612091064453125, -0.011617660522460938, -0.007114410400390625, -0.0026111602783203125, 0.00189208984375, 0.0063953399658203125, 0.010898590087890625, 0.015401840209960938, 0.01990509033203125, 0.024408340454101562, 0.028911590576171875, 0.03341484069824219, 0.0379180908203125, 0.04242134094238281, 0.046924591064453125, 0.05142784118652344, 0.05593109130859375, 0.06043434143066406, 0.06493759155273438, 0.06944084167480469, 0.073944091796875, 0.07844734191894531, 0.08295059204101562, 0.08745384216308594, 0.09195709228515625, 0.09646034240722656, 0.10096359252929688, 0.10546684265136719, 0.1099700927734375, 0.11447334289550781, 0.11897659301757812, 0.12347984313964844, 0.12798309326171875, 0.13248634338378906, 0.13698959350585938, 0.1414928436279297, 0.14599609375]}, "gradients/encoder.encoder.layers.8.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 1.0, 3.0, 5.0, 5.0, 3.0, 14.0, 20.0, 24.0, 64.0, 111.0, 3376.0, 280.0, 95.0, 35.0, 12.0, 6.0, 4.0, 4.0, 5.0, 4.0, 1.0, 1.0, 1.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 2.0], "bins": [-0.08282470703125, -0.08097314834594727, -0.07912158966064453, -0.0772700309753418, -0.07541847229003906, -0.07356691360473633, -0.0717153549194336, -0.06986379623413086, -0.06801223754882812, -0.06616067886352539, -0.06430912017822266, -0.06245756149291992, -0.06060600280761719, -0.05875444412231445, -0.05690288543701172, -0.055051326751708984, -0.05319976806640625, -0.051348209381103516, -0.04949665069580078, -0.04764509201049805, -0.04579353332519531, -0.04394197463989258, -0.042090415954589844, -0.04023885726928711, -0.038387298583984375, -0.03653573989868164, -0.034684181213378906, -0.03283262252807617, -0.030981063842773438, -0.029129505157470703, -0.02727794647216797, -0.025426387786865234, -0.0235748291015625, -0.021723270416259766, -0.01987171173095703, -0.018020153045654297, -0.016168594360351562, -0.014317035675048828, -0.012465476989746094, -0.01061391830444336, -0.008762359619140625, -0.006910800933837891, -0.005059242248535156, -0.003207683563232422, -0.0013561248779296875, 0.0004954338073730469, 0.0023469924926757812, 0.004198551177978516, 0.00605010986328125, 0.007901668548583984, 0.009753227233886719, 0.011604785919189453, 0.013456344604492188, 0.015307903289794922, 0.017159461975097656, 0.01901102066040039, 0.020862579345703125, 0.02271413803100586, 0.024565696716308594, 0.026417255401611328, 0.028268814086914062, 0.030120372772216797, 0.03197193145751953, 0.033823490142822266, 0.035675048828125]}, "gradients/encoder.encoder.layers.8.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 3.0, 3.0, 9.0, 8.0, 25.0, 59.0, 169.0, 351.0, 285.0, 70.0, 24.0, 6.0, 4.0, 1.0, 0.0, 0.0, 1.0], "bins": [-1.0081409215927124, -0.9898035526275635, -0.9714662432670593, -0.9531288743019104, -0.9347915053367615, -0.9164541959762573, -0.8981168270111084, -0.8797794580459595, -0.8614421486854553, -0.8431047797203064, -0.8247674703598022, -0.8064301013946533, -0.7880927324295044, -0.7697554230690002, -0.7514180541038513, -0.7330807447433472, -0.7147433757781982, -0.6964060068130493, -0.6780686974525452, -0.6597313284873962, -0.6413939595222473, -0.6230566501617432, -0.6047192811965942, -0.5863819122314453, -0.5680445432662964, -0.5497071743011475, -0.5313698649406433, -0.5130324959754944, -0.49469515681266785, -0.4763578176498413, -0.4580204486846924, -0.43968310952186584, -0.4213457405567169, -0.4030084013938904, -0.38467103242874146, -0.3663336932659149, -0.3479963541030884, -0.32965898513793945, -0.3113216459751129, -0.2929843068122864, -0.27464693784713745, -0.2563095986843109, -0.23797224462032318, -0.21963489055633545, -0.2012975513935089, -0.18296019732952118, -0.16462284326553345, -0.1462855041027069, -0.12794816493988037, -0.10961081832647324, -0.0912734717130661, -0.07293611764907837, -0.054598771035671234, -0.0362614244222641, -0.017924070358276367, 0.0004132688045501709, 0.018750622868537903, 0.03708796948194504, 0.05542531982064247, 0.0737626701593399, 0.09210001677274704, 0.11043736338615417, 0.1287747174501419, 0.14711205661296844, 0.16544941067695618]}, "gradients/encoder.encoder.layers.8.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 3.0, 3.0, 4.0, 2.0, 8.0, 8.0, 13.0, 7.0, 15.0, 13.0, 16.0, 30.0, 32.0, 34.0, 35.0, 30.0, 43.0, 37.0, 54.0, 42.0, 60.0, 39.0, 51.0, 49.0, 43.0, 31.0, 33.0, 46.0, 37.0, 27.0, 26.0, 15.0, 20.0, 17.0, 23.0, 14.0, 11.0, 8.0, 10.0, 11.0, 1.0, 6.0, 1.0, 4.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.12366318702697754, -0.11954590678215027, -0.1154286190867424, -0.11131133884191513, -0.10719405114650726, -0.10307677090167999, -0.09895949065685272, -0.09484221041202545, -0.09072492271661758, -0.08660764247179031, -0.08249035477638245, -0.07837307453155518, -0.0742557942867279, -0.07013850659132004, -0.06602122634649277, -0.0619039423763752, -0.05778665840625763, -0.05366937443614006, -0.04955209046602249, -0.04543481022119522, -0.04131752625107765, -0.03720024228096008, -0.03308296203613281, -0.028965678066015244, -0.024848394095897675, -0.020731110125780106, -0.016613828018307686, -0.012496544979512691, -0.008379261940717697, -0.004261977970600128, -0.00014469586312770844, 0.003972586244344711, 0.00808987021446228, 0.012207153253257275, 0.01632443629205227, 0.02044171839952469, 0.024559002369642258, 0.028676286339759827, 0.0327935665845871, 0.036910850554704666, 0.041028134524822235, 0.045145418494939804, 0.04926270246505737, 0.053379982709884644, 0.05749726668000221, 0.06161455065011978, 0.06573183089494705, 0.06984911859035492, 0.07396639883518219, 0.07808367908000946, 0.08220096677541733, 0.0863182470202446, 0.09043553471565247, 0.09455281496047974, 0.098670095205307, 0.10278737545013428, 0.10690466314554214, 0.11102194339036942, 0.11513923108577728, 0.11925651133060455, 0.12337379157543182, 0.1274910867214203, 0.13160836696624756, 0.13572564721107483, 0.1398429274559021]}, "gradients/encoder.encoder.layers.8.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 4.0, 2.0, 7.0, 18.0, 21.0, 28.0, 47.0, 75.0, 119.0, 234.0, 427.0, 1150.0, 3737.0, 19696.0, 183345.0, 719048.0, 103149.0, 12950.0, 2782.0, 913.0, 355.0, 201.0, 95.0, 56.0, 28.0, 23.0, 14.0, 13.0, 8.0, 8.0, 2.0, 3.0, 5.0, 1.0, 0.0, 3.0, 1.0, 1.0, 1.0], "bins": [-0.176513671875, -0.17214298248291016, -0.1677722930908203, -0.16340160369873047, -0.15903091430664062, -0.15466022491455078, -0.15028953552246094, -0.1459188461303711, -0.14154815673828125, -0.1371774673461914, -0.13280677795410156, -0.12843608856201172, -0.12406539916992188, -0.11969470977783203, -0.11532402038574219, -0.11095333099365234, -0.1065826416015625, -0.10221195220947266, -0.09784126281738281, -0.09347057342529297, -0.08909988403320312, -0.08472919464111328, -0.08035850524902344, -0.0759878158569336, -0.07161712646484375, -0.0672464370727539, -0.06287574768066406, -0.05850505828857422, -0.054134368896484375, -0.04976367950439453, -0.04539299011230469, -0.041022300720214844, -0.036651611328125, -0.032280921936035156, -0.027910232543945312, -0.02353954315185547, -0.019168853759765625, -0.014798164367675781, -0.010427474975585938, -0.006056785583496094, -0.00168609619140625, 0.0026845932006835938, 0.0070552825927734375, 0.011425971984863281, 0.015796661376953125, 0.02016735076904297, 0.024538040161132812, 0.028908729553222656, 0.0332794189453125, 0.037650108337402344, 0.04202079772949219, 0.04639148712158203, 0.050762176513671875, 0.05513286590576172, 0.05950355529785156, 0.0638742446899414, 0.06824493408203125, 0.0726156234741211, 0.07698631286621094, 0.08135700225830078, 0.08572769165039062, 0.09009838104248047, 0.09446907043457031, 0.09883975982666016, 0.10321044921875]}, "gradients/encoder.encoder.layers.8.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 2.0, 2.0, 2.0, 1.0, 4.0, 8.0, 8.0, 19.0, 28.0, 28.0, 45.0, 55.0, 70.0, 75.0, 82.0, 93.0, 94.0, 78.0, 82.0, 48.0, 48.0, 37.0, 36.0, 21.0, 13.0, 13.0, 7.0, 5.0, 1.0, 4.0, 3.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0], "bins": [-0.10662841796875, -0.10413599014282227, -0.10164356231689453, -0.0991511344909668, -0.09665870666503906, -0.09416627883911133, -0.0916738510131836, -0.08918142318725586, -0.08668899536132812, -0.08419656753540039, -0.08170413970947266, -0.07921171188354492, -0.07671928405761719, -0.07422685623168945, -0.07173442840576172, -0.06924200057983398, -0.06674957275390625, -0.06425714492797852, -0.06176471710205078, -0.05927228927612305, -0.05677986145019531, -0.05428743362426758, -0.051795005798339844, -0.04930257797241211, -0.046810150146484375, -0.04431772232055664, -0.041825294494628906, -0.03933286666870117, -0.03684043884277344, -0.0343480110168457, -0.03185558319091797, -0.029363155364990234, -0.0268707275390625, -0.024378299713134766, -0.02188587188720703, -0.019393444061279297, -0.016901016235351562, -0.014408588409423828, -0.011916160583496094, -0.00942373275756836, -0.006931304931640625, -0.004438877105712891, -0.0019464492797851562, 0.0005459785461425781, 0.0030384063720703125, 0.005530834197998047, 0.008023262023925781, 0.010515689849853516, 0.01300811767578125, 0.015500545501708984, 0.01799297332763672, 0.020485401153564453, 0.022977828979492188, 0.025470256805419922, 0.027962684631347656, 0.03045511245727539, 0.032947540283203125, 0.03543996810913086, 0.037932395935058594, 0.04042482376098633, 0.04291725158691406, 0.0454096794128418, 0.04790210723876953, 0.050394535064697266, 0.052886962890625]}, "gradients/encoder.encoder.layers.8.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 2.0, 1.0, 8.0, 3.0, 1.0, 3.0, 7.0, 9.0, 5.0, 11.0, 12.0, 17.0, 23.0, 29.0, 40.0, 52.0, 88.0, 124.0, 236.0, 419.0, 769.0, 2102.0, 8743.0, 82487.0, 826356.0, 112402.0, 10270.0, 2376.0, 889.0, 386.0, 227.0, 136.0, 96.0, 62.0, 32.0, 34.0, 25.0, 23.0, 10.0, 8.0, 9.0, 10.0, 3.0, 3.0, 9.0, 1.0, 3.0, 1.0, 1.0, 1.0, 3.0, 1.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.147705078125, -0.14316749572753906, -0.13862991333007812, -0.1340923309326172, -0.12955474853515625, -0.1250171661376953, -0.12047958374023438, -0.11594200134277344, -0.1114044189453125, -0.10686683654785156, -0.10232925415039062, -0.09779167175292969, -0.09325408935546875, -0.08871650695800781, -0.08417892456054688, -0.07964134216308594, -0.075103759765625, -0.07056617736816406, -0.06602859497070312, -0.06149101257324219, -0.05695343017578125, -0.05241584777832031, -0.047878265380859375, -0.04334068298339844, -0.0388031005859375, -0.03426551818847656, -0.029727935791015625, -0.025190353393554688, -0.02065277099609375, -0.016115188598632812, -0.011577606201171875, -0.0070400238037109375, -0.00250244140625, 0.0020351409912109375, 0.006572723388671875, 0.011110305786132812, 0.01564788818359375, 0.020185470581054688, 0.024723052978515625, 0.029260635375976562, 0.0337982177734375, 0.03833580017089844, 0.042873382568359375, 0.04741096496582031, 0.05194854736328125, 0.05648612976074219, 0.061023712158203125, 0.06556129455566406, 0.070098876953125, 0.07463645935058594, 0.07917404174804688, 0.08371162414550781, 0.08824920654296875, 0.09278678894042969, 0.09732437133789062, 0.10186195373535156, 0.1063995361328125, 0.11093711853027344, 0.11547470092773438, 0.12001228332519531, 0.12454986572265625, 0.1290874481201172, 0.13362503051757812, 0.13816261291503906, 0.1427001953125]}, "gradients/encoder.encoder.layers.8.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 3.0, 1.0, 0.0, 2.0, 0.0, 1.0, 3.0, 7.0, 8.0, 6.0, 4.0, 9.0, 11.0, 11.0, 11.0, 14.0, 17.0, 17.0, 17.0, 31.0, 25.0, 39.0, 36.0, 37.0, 40.0, 41.0, 45.0, 44.0, 53.0, 45.0, 40.0, 41.0, 37.0, 44.0, 23.0, 28.0, 33.0, 33.0, 18.0, 26.0, 13.0, 13.0, 21.0, 9.0, 14.0, 7.0, 14.0, 5.0, 7.0, 2.0, 1.0, 6.0, 2.0, 1.0, 0.0, 3.0, 0.0, 1.0], "bins": [-0.1690673828125, -0.1641979217529297, -0.15932846069335938, -0.15445899963378906, -0.14958953857421875, -0.14472007751464844, -0.13985061645507812, -0.1349811553955078, -0.1301116943359375, -0.1252422332763672, -0.12037277221679688, -0.11550331115722656, -0.11063385009765625, -0.10576438903808594, -0.10089492797851562, -0.09602546691894531, -0.091156005859375, -0.08628654479980469, -0.08141708374023438, -0.07654762268066406, -0.07167816162109375, -0.06680870056152344, -0.061939239501953125, -0.05706977844238281, -0.0522003173828125, -0.04733085632324219, -0.042461395263671875, -0.03759193420410156, -0.03272247314453125, -0.027853012084960938, -0.022983551025390625, -0.018114089965820312, -0.01324462890625, -0.008375167846679688, -0.003505706787109375, 0.0013637542724609375, 0.00623321533203125, 0.011102676391601562, 0.015972137451171875, 0.020841598510742188, 0.0257110595703125, 0.030580520629882812, 0.035449981689453125, 0.04031944274902344, 0.04518890380859375, 0.05005836486816406, 0.054927825927734375, 0.05979728698730469, 0.064666748046875, 0.06953620910644531, 0.07440567016601562, 0.07927513122558594, 0.08414459228515625, 0.08901405334472656, 0.09388351440429688, 0.09875297546386719, 0.1036224365234375, 0.10849189758300781, 0.11336135864257812, 0.11823081970214844, 0.12310028076171875, 0.12796974182128906, 0.13283920288085938, 0.1377086639404297, 0.142578125]}, "gradients/encoder.encoder.layers.8.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 3.0, 5.0, 7.0, 10.0, 13.0, 11.0, 15.0, 22.0, 33.0, 51.0, 76.0, 139.0, 296.0, 640.0, 2006.0, 8955.0, 102162.0, 856275.0, 67996.0, 6962.0, 1674.0, 577.0, 272.0, 140.0, 75.0, 46.0, 23.0, 16.0, 16.0, 12.0, 8.0, 8.0, 3.0, 3.0, 1.0, 2.0, 7.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.048828125, -0.04745006561279297, -0.04607200622558594, -0.044693946838378906, -0.043315887451171875, -0.041937828063964844, -0.04055976867675781, -0.03918170928955078, -0.03780364990234375, -0.03642559051513672, -0.03504753112792969, -0.033669471740722656, -0.032291412353515625, -0.030913352966308594, -0.029535293579101562, -0.02815723419189453, -0.0267791748046875, -0.02540111541748047, -0.024023056030273438, -0.022644996643066406, -0.021266937255859375, -0.019888877868652344, -0.018510818481445312, -0.01713275909423828, -0.01575469970703125, -0.014376640319824219, -0.012998580932617188, -0.011620521545410156, -0.010242462158203125, -0.008864402770996094, -0.0074863433837890625, -0.006108283996582031, -0.004730224609375, -0.0033521652221679688, -0.0019741058349609375, -0.0005960464477539062, 0.000782012939453125, 0.0021600723266601562, 0.0035381317138671875, 0.004916191101074219, 0.00629425048828125, 0.007672309875488281, 0.009050369262695312, 0.010428428649902344, 0.011806488037109375, 0.013184547424316406, 0.014562606811523438, 0.01594066619873047, 0.0173187255859375, 0.01869678497314453, 0.020074844360351562, 0.021452903747558594, 0.022830963134765625, 0.024209022521972656, 0.025587081909179688, 0.02696514129638672, 0.02834320068359375, 0.02972126007080078, 0.031099319458007812, 0.032477378845214844, 0.033855438232421875, 0.035233497619628906, 0.03661155700683594, 0.03798961639404297, 0.03936767578125]}, "gradients/encoder.encoder.layers.8.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 2.0, 2.0, 2.0, 6.0, 1.0, 3.0, 8.0, 18.0, 6.0, 16.0, 20.0, 30.0, 47.0, 42.0, 64.0, 64.0, 73.0, 87.0, 76.0, 69.0, 91.0, 63.0, 49.0, 53.0, 22.0, 23.0, 24.0, 11.0, 12.0, 11.0, 3.0, 3.0, 6.0, 1.0, 3.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0], "bins": [-9.834766387939453e-06, -9.57585871219635e-06, -9.316951036453247e-06, -9.058043360710144e-06, -8.799135684967041e-06, -8.540228009223938e-06, -8.281320333480835e-06, -8.022412657737732e-06, -7.763504981994629e-06, -7.504597306251526e-06, -7.245689630508423e-06, -6.98678195476532e-06, -6.727874279022217e-06, -6.468966603279114e-06, -6.210058927536011e-06, -5.951151251792908e-06, -5.692243576049805e-06, -5.433335900306702e-06, -5.174428224563599e-06, -4.915520548820496e-06, -4.656612873077393e-06, -4.3977051973342896e-06, -4.1387975215911865e-06, -3.8798898458480835e-06, -3.6209821701049805e-06, -3.3620744943618774e-06, -3.1031668186187744e-06, -2.8442591428756714e-06, -2.5853514671325684e-06, -2.3264437913894653e-06, -2.0675361156463623e-06, -1.8086284399032593e-06, -1.5497207641601562e-06, -1.2908130884170532e-06, -1.0319054126739502e-06, -7.729977369308472e-07, -5.140900611877441e-07, -2.551823854446411e-07, 3.725290298461914e-09, 2.6263296604156494e-07, 5.21540641784668e-07, 7.80448317527771e-07, 1.039355993270874e-06, 1.298263669013977e-06, 1.55717134475708e-06, 1.816079020500183e-06, 2.074986696243286e-06, 2.333894371986389e-06, 2.592802047729492e-06, 2.8517097234725952e-06, 3.1106173992156982e-06, 3.3695250749588013e-06, 3.6284327507019043e-06, 3.887340426445007e-06, 4.14624810218811e-06, 4.405155777931213e-06, 4.664063453674316e-06, 4.9229711294174194e-06, 5.1818788051605225e-06, 5.4407864809036255e-06, 5.6996941566467285e-06, 5.9586018323898315e-06, 6.2175095081329346e-06, 6.476417183876038e-06, 6.735324859619141e-06]}, "gradients/encoder.encoder.layers.8.attention.q_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 2.0, 4.0, 4.0, 2.0, 5.0, 3.0, 5.0, 15.0, 27.0, 36.0, 53.0, 98.0, 175.0, 394.0, 827.0, 2495.0, 11321.0, 137940.0, 831947.0, 53454.0, 6775.0, 1724.0, 622.0, 284.0, 140.0, 79.0, 48.0, 21.0, 14.0, 12.0, 8.0, 3.0, 6.0, 5.0, 3.0, 2.0, 1.0, 1.0, 4.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.0526123046875, -0.050933837890625, -0.04925537109375, -0.047576904296875, -0.0458984375, -0.044219970703125, -0.04254150390625, -0.040863037109375, -0.0391845703125, -0.037506103515625, -0.03582763671875, -0.034149169921875, -0.032470703125, -0.030792236328125, -0.02911376953125, -0.027435302734375, -0.0257568359375, -0.024078369140625, -0.02239990234375, -0.020721435546875, -0.01904296875, -0.017364501953125, -0.01568603515625, -0.014007568359375, -0.0123291015625, -0.010650634765625, -0.00897216796875, -0.007293701171875, -0.005615234375, -0.003936767578125, -0.00225830078125, -0.000579833984375, 0.0010986328125, 0.002777099609375, 0.00445556640625, 0.006134033203125, 0.0078125, 0.009490966796875, 0.01116943359375, 0.012847900390625, 0.0145263671875, 0.016204833984375, 0.01788330078125, 0.019561767578125, 0.021240234375, 0.022918701171875, 0.02459716796875, 0.026275634765625, 0.0279541015625, 0.029632568359375, 0.03131103515625, 0.032989501953125, 0.03466796875, 0.036346435546875, 0.03802490234375, 0.039703369140625, 0.0413818359375, 0.043060302734375, 0.04473876953125, 0.046417236328125, 0.048095703125, 0.049774169921875, 0.05145263671875, 0.053131103515625, 0.0548095703125]}, "gradients/encoder.encoder.layers.8.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 0.0, 1.0, 3.0, 2.0, 6.0, 3.0, 5.0, 9.0, 9.0, 13.0, 25.0, 35.0, 40.0, 56.0, 77.0, 93.0, 130.0, 91.0, 97.0, 84.0, 50.0, 57.0, 38.0, 22.0, 17.0, 12.0, 15.0, 5.0, 4.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.044342041015625, -0.04303741455078125, -0.0417327880859375, -0.04042816162109375, -0.03912353515625, -0.03781890869140625, -0.0365142822265625, -0.03520965576171875, -0.033905029296875, -0.03260040283203125, -0.0312957763671875, -0.02999114990234375, -0.0286865234375, -0.02738189697265625, -0.0260772705078125, -0.02477264404296875, -0.023468017578125, -0.02216339111328125, -0.0208587646484375, -0.01955413818359375, -0.01824951171875, -0.01694488525390625, -0.0156402587890625, -0.01433563232421875, -0.013031005859375, -0.01172637939453125, -0.0104217529296875, -0.00911712646484375, -0.0078125, -0.00650787353515625, -0.0052032470703125, -0.00389862060546875, -0.002593994140625, -0.00128936767578125, 1.52587890625e-05, 0.00131988525390625, 0.00262451171875, 0.00392913818359375, 0.0052337646484375, 0.00653839111328125, 0.007843017578125, 0.00914764404296875, 0.0104522705078125, 0.01175689697265625, 0.0130615234375, 0.01436614990234375, 0.0156707763671875, 0.01697540283203125, 0.018280029296875, 0.01958465576171875, 0.0208892822265625, 0.02219390869140625, 0.02349853515625, 0.02480316162109375, 0.0261077880859375, 0.02741241455078125, 0.028717041015625, 0.03002166748046875, 0.0313262939453125, 0.03263092041015625, 0.033935546875, 0.03524017333984375, 0.0365447998046875, 0.03784942626953125, 0.039154052734375]}, "gradients/encoder.encoder.layers.8.layer_norm.weight": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 2.0, 2.0, 1.0, 9.0, 12.0, 29.0, 94.0, 228.0, 325.0, 214.0, 65.0, 22.0, 10.0, 4.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.5948129296302795, -0.5469017624855042, -0.49899062514305115, -0.45107948780059814, -0.40316832065582275, -0.35525715351104736, -0.30734601616859436, -0.25943487882614136, -0.21152371168136597, -0.16361255943775177, -0.11570140719413757, -0.06779025495052338, -0.01987910270690918, 0.028032049536705017, 0.07594320178031921, 0.12385433912277222, 0.1717655062675476, 0.2196766585111618, 0.267587810754776, 0.315498948097229, 0.3634101152420044, 0.4113212823867798, 0.4592324197292328, 0.5071435570716858, 0.5550547242164612, 0.6029658913612366, 0.6508769989013672, 0.6987881660461426, 0.746699333190918, 0.7946105003356934, 0.8425216674804688, 0.8904327750205994, 0.9383440017700195, 0.9862551689147949, 1.0341663360595703, 1.0820775032043457, 1.129988670349121, 1.177899718284607, 1.2258108854293823, 1.2737220525741577, 1.321633219718933, 1.3695443868637085, 1.4174555540084839, 1.4653667211532593, 1.5132777690887451, 1.5611889362335205, 1.609100103378296, 1.6570112705230713, 1.7049224376678467, 1.752833604812622, 1.8007447719573975, 1.8486559391021729, 1.8965671062469482, 1.944478154182434, 1.9923893213272095, 2.0403003692626953, 2.0882115364074707, 2.136122703552246, 2.1840338706970215, 2.231945037841797, 2.2798562049865723, 2.3277673721313477, 2.375678539276123, 2.4235897064208984, 2.471500873565674]}, "gradients/encoder.encoder.layers.8.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 2.0, 1.0, 0.0, 3.0, 12.0, 6.0, 10.0, 11.0, 10.0, 16.0, 10.0, 30.0, 20.0, 35.0, 38.0, 44.0, 46.0, 45.0, 60.0, 60.0, 60.0, 47.0, 67.0, 59.0, 43.0, 43.0, 39.0, 44.0, 22.0, 24.0, 22.0, 27.0, 13.0, 8.0, 8.0, 9.0, 5.0, 3.0, 6.0, 4.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7938513159751892, -0.7662767767906189, -0.7387021780014038, -0.7111276388168335, -0.6835530996322632, -0.6559785008430481, -0.6284039616584778, -0.6008293628692627, -0.5732548236846924, -0.5456802845001221, -0.518105685710907, -0.49053114652633667, -0.46295657753944397, -0.43538200855255127, -0.40780746936798096, -0.38023290038108826, -0.35265833139419556, -0.32508376240730286, -0.29750919342041016, -0.26993465423583984, -0.24236008524894714, -0.21478551626205444, -0.18721096217632294, -0.15963640809059143, -0.13206183910369873, -0.10448727756738663, -0.07691271603107452, -0.04933815449476242, -0.021763592958450317, 0.005810976028442383, 0.03338553011417389, 0.060960084199905396, 0.0885346531867981, 0.1161092147231102, 0.1436837762594223, 0.1712583303451538, 0.1988328993320465, 0.2264074683189392, 0.2539820075035095, 0.2815565764904022, 0.3091311454772949, 0.3367057144641876, 0.3642802834510803, 0.39185482263565063, 0.41942939162254333, 0.44700396060943604, 0.47457849979400635, 0.5021530389785767, 0.5297276377677917, 0.5573021769523621, 0.5848767757415771, 0.6124513149261475, 0.6400258541107178, 0.6676004528999329, 0.6951749920845032, 0.7227495908737183, 0.7503241300582886, 0.7778986692428589, 0.805473268032074, 0.8330478072166443, 0.8606224060058594, 0.8881969451904297, 0.915771484375, 0.9433460235595703, 0.9709206223487854]}, "gradients/encoder.encoder.layers.7.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 2.0, 3.0, 2.0, 4.0, 7.0, 6.0, 7.0, 6.0, 8.0, 9.0, 23.0, 23.0, 52.0, 50.0, 86.0, 157.0, 287.0, 602.0, 1683.0, 7791.0, 99799.0, 4025171.0, 49380.0, 6563.0, 1501.0, 513.0, 261.0, 118.0, 70.0, 33.0, 23.0, 20.0, 7.0, 7.0, 4.0, 3.0, 4.0, 2.0, 3.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.31494140625, -0.30733299255371094, -0.2997245788574219, -0.2921161651611328, -0.28450775146484375, -0.2768993377685547, -0.2692909240722656, -0.26168251037597656, -0.2540740966796875, -0.24646568298339844, -0.23885726928710938, -0.2312488555908203, -0.22364044189453125, -0.2160320281982422, -0.20842361450195312, -0.20081520080566406, -0.193206787109375, -0.18559837341308594, -0.17798995971679688, -0.1703815460205078, -0.16277313232421875, -0.1551647186279297, -0.14755630493164062, -0.13994789123535156, -0.1323394775390625, -0.12473106384277344, -0.11712265014648438, -0.10951423645019531, -0.10190582275390625, -0.09429740905761719, -0.08668899536132812, -0.07908058166503906, -0.07147216796875, -0.06386375427246094, -0.056255340576171875, -0.04864692687988281, -0.04103851318359375, -0.03343009948730469, -0.025821685791015625, -0.018213272094726562, -0.0106048583984375, -0.0029964447021484375, 0.004611968994140625, 0.012220382690429688, 0.01982879638671875, 0.027437210083007812, 0.035045623779296875, 0.04265403747558594, 0.050262451171875, 0.05787086486816406, 0.06547927856445312, 0.07308769226074219, 0.08069610595703125, 0.08830451965332031, 0.09591293334960938, 0.10352134704589844, 0.1111297607421875, 0.11873817443847656, 0.12634658813476562, 0.1339550018310547, 0.14156341552734375, 0.1491718292236328, 0.15678024291992188, 0.16438865661621094, 0.1719970703125]}, "gradients/encoder.encoder.layers.7.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 8.0, 9.0, 15.0, 37.0, 47.0, 52.0, 59.0, 82.0, 92.0, 90.0, 94.0, 96.0, 82.0, 82.0, 43.0, 43.0, 24.0, 18.0, 10.0, 9.0, 2.0, 4.0, 3.0, 2.0, 3.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09735107421875, -0.09479045867919922, -0.09222984313964844, -0.08966922760009766, -0.08710861206054688, -0.0845479965209961, -0.08198738098144531, -0.07942676544189453, -0.07686614990234375, -0.07430553436279297, -0.07174491882324219, -0.0691843032836914, -0.06662368774414062, -0.06406307220458984, -0.06150245666503906, -0.05894184112548828, -0.0563812255859375, -0.05382061004638672, -0.05125999450683594, -0.048699378967285156, -0.046138763427734375, -0.043578147888183594, -0.04101753234863281, -0.03845691680908203, -0.03589630126953125, -0.03333568572998047, -0.030775070190429688, -0.028214454650878906, -0.025653839111328125, -0.023093223571777344, -0.020532608032226562, -0.01797199249267578, -0.015411376953125, -0.012850761413574219, -0.010290145874023438, -0.007729530334472656, -0.005168914794921875, -0.0026082992553710938, -4.76837158203125e-05, 0.0025129318237304688, 0.00507354736328125, 0.007634162902832031, 0.010194778442382812, 0.012755393981933594, 0.015316009521484375, 0.017876625061035156, 0.020437240600585938, 0.02299785614013672, 0.0255584716796875, 0.02811908721923828, 0.030679702758789062, 0.033240318298339844, 0.035800933837890625, 0.038361549377441406, 0.04092216491699219, 0.04348278045654297, 0.04604339599609375, 0.04860401153564453, 0.05116462707519531, 0.053725242614746094, 0.056285858154296875, 0.058846473693847656, 0.06140708923339844, 0.06396770477294922, 0.0665283203125]}, "gradients/encoder.encoder.layers.7.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 3.0, 1.0, 1.0, 5.0, 6.0, 1.0, 1.0, 8.0, 14.0, 8.0, 9.0, 25.0, 29.0, 59.0, 94.0, 206.0, 608.0, 2174.0, 11619.0, 187496.0, 3945584.0, 39759.0, 4753.0, 1132.0, 344.0, 133.0, 77.0, 42.0, 26.0, 23.0, 14.0, 5.0, 6.0, 5.0, 8.0, 4.0, 2.0, 1.0, 2.0, 0.0, 0.0, 1.0, 3.0, 1.0, 1.0, 2.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.2471923828125, -0.2399768829345703, -0.23276138305664062, -0.22554588317871094, -0.21833038330078125, -0.21111488342285156, -0.20389938354492188, -0.1966838836669922, -0.1894683837890625, -0.1822528839111328, -0.17503738403320312, -0.16782188415527344, -0.16060638427734375, -0.15339088439941406, -0.14617538452148438, -0.1389598846435547, -0.131744384765625, -0.12452888488769531, -0.11731338500976562, -0.11009788513183594, -0.10288238525390625, -0.09566688537597656, -0.08845138549804688, -0.08123588562011719, -0.0740203857421875, -0.06680488586425781, -0.059589385986328125, -0.05237388610839844, -0.04515838623046875, -0.03794288635253906, -0.030727386474609375, -0.023511886596679688, -0.01629638671875, -0.009080886840820312, -0.001865386962890625, 0.0053501129150390625, 0.01256561279296875, 0.019781112670898438, 0.026996612548828125, 0.03421211242675781, 0.0414276123046875, 0.04864311218261719, 0.055858612060546875, 0.06307411193847656, 0.07028961181640625, 0.07750511169433594, 0.08472061157226562, 0.09193611145019531, 0.099151611328125, 0.10636711120605469, 0.11358261108398438, 0.12079811096191406, 0.12801361083984375, 0.13522911071777344, 0.14244461059570312, 0.1496601104736328, 0.1568756103515625, 0.1640911102294922, 0.17130661010742188, 0.17852210998535156, 0.18573760986328125, 0.19295310974121094, 0.20016860961914062, 0.2073841094970703, 0.214599609375]}, "gradients/encoder.encoder.layers.7.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 7.0, 12.0, 15.0, 14.0, 15.0, 15.0, 19.0, 22.0, 47.0, 79.0, 132.0, 502.0, 1622.0, 973.0, 277.0, 97.0, 51.0, 38.0, 34.0, 20.0, 17.0, 17.0, 10.0, 11.0, 8.0, 5.0, 5.0, 3.0, 3.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0], "bins": [-0.1353759765625, -0.13147544860839844, -0.12757492065429688, -0.12367439270019531, -0.11977386474609375, -0.11587333679199219, -0.11197280883789062, -0.10807228088378906, -0.1041717529296875, -0.10027122497558594, -0.09637069702148438, -0.09247016906738281, -0.08856964111328125, -0.08466911315917969, -0.08076858520507812, -0.07686805725097656, -0.072967529296875, -0.06906700134277344, -0.06516647338867188, -0.06126594543457031, -0.05736541748046875, -0.05346488952636719, -0.049564361572265625, -0.04566383361816406, -0.0417633056640625, -0.03786277770996094, -0.033962249755859375, -0.030061721801757812, -0.02616119384765625, -0.022260665893554688, -0.018360137939453125, -0.014459609985351562, -0.01055908203125, -0.0066585540771484375, -0.002758026123046875, 0.0011425018310546875, 0.00504302978515625, 0.008943557739257812, 0.012844085693359375, 0.016744613647460938, 0.0206451416015625, 0.024545669555664062, 0.028446197509765625, 0.03234672546386719, 0.03624725341796875, 0.04014778137207031, 0.044048309326171875, 0.04794883728027344, 0.051849365234375, 0.05574989318847656, 0.059650421142578125, 0.06355094909667969, 0.06745147705078125, 0.07135200500488281, 0.07525253295898438, 0.07915306091308594, 0.0830535888671875, 0.08695411682128906, 0.09085464477539062, 0.09475517272949219, 0.09865570068359375, 0.10255622863769531, 0.10645675659179688, 0.11035728454589844, 0.1142578125]}, "gradients/encoder.encoder.layers.7.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 4.0, 1.0, 2.0, 3.0, 7.0, 5.0, 23.0, 26.0, 39.0, 98.0, 166.0, 217.0, 173.0, 108.0, 60.0, 31.0, 13.0, 13.0, 11.0, 4.0, 3.0, 3.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.1522479057312012, -1.1167049407958984, -1.0811619758605957, -1.045619010925293, -1.0100760459899902, -0.9745330810546875, -0.9389901161193848, -0.903447151184082, -0.8679041862487793, -0.8323612213134766, -0.7968182563781738, -0.7612752914428711, -0.7257323265075684, -0.6901893615722656, -0.6546463966369629, -0.6191034317016602, -0.5835604071617126, -0.5480174422264099, -0.5124744772911072, -0.47693151235580444, -0.4413885474205017, -0.405845582485199, -0.37030258774757385, -0.3347596228122711, -0.2992166578769684, -0.26367369294166565, -0.22813072800636292, -0.192587748169899, -0.15704478323459625, -0.12150181829929352, -0.08595883846282959, -0.050415873527526855, -0.014872908592224121, 0.020670060068368912, 0.056213028728961945, 0.09175600111484528, 0.127298966050148, 0.16284193098545074, 0.19838491082191467, 0.2339278757572174, 0.26947084069252014, 0.3050138056278229, 0.3405567705631256, 0.37609976530075073, 0.41164273023605347, 0.4471856951713562, 0.48272866010665894, 0.5182716250419617, 0.5538145899772644, 0.5893575549125671, 0.6249005198478699, 0.6604434847831726, 0.6959864497184753, 0.7315294146537781, 0.7670724391937256, 0.8026154041290283, 0.838158369064331, 0.8737013339996338, 0.9092442989349365, 0.9447872638702393, 0.980330228805542, 1.0158731937408447, 1.0514161586761475, 1.0869591236114502, 1.122502088546753]}, "gradients/encoder.encoder.layers.7.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 2.0, 1.0, 3.0, 1.0, 5.0, 8.0, 11.0, 15.0, 13.0, 8.0, 12.0, 16.0, 24.0, 16.0, 25.0, 28.0, 30.0, 33.0, 31.0, 37.0, 39.0, 32.0, 43.0, 32.0, 41.0, 37.0, 45.0, 35.0, 42.0, 37.0, 41.0, 42.0, 41.0, 27.0, 20.0, 26.0, 19.0, 14.0, 11.0, 8.0, 12.0, 9.0, 11.0, 5.0, 6.0, 4.0, 4.0, 4.0, 4.0, 2.0, 1.0, 1.0, 2.0], "bins": [-0.48485374450683594, -0.47137588262557983, -0.45789802074432373, -0.4444201588630676, -0.4309422969818115, -0.4174644351005554, -0.4039865732192993, -0.3905087113380432, -0.3770308494567871, -0.363552987575531, -0.3500751256942749, -0.3365972638130188, -0.3231194019317627, -0.3096415400505066, -0.2961636781692505, -0.2826858162879944, -0.2692079544067383, -0.2557300925254822, -0.24225223064422607, -0.22877436876296997, -0.21529650688171387, -0.20181864500045776, -0.18834078311920166, -0.17486292123794556, -0.16138502955436707, -0.14790716767311096, -0.13442930579185486, -0.12095144391059875, -0.10747358202934265, -0.09399571269750595, -0.08051785081624985, -0.06703998893499374, -0.05356213450431824, -0.040084272623062134, -0.02660640887916088, -0.013128545135259628, 0.0003493167459964752, 0.013827182352542877, 0.02730504423379898, 0.040782906115055084, 0.05426076799631119, 0.06773862987756729, 0.0812164917588234, 0.0946943610906601, 0.1081722229719162, 0.1216500848531723, 0.1351279467344284, 0.1486058086156845, 0.1620836704969406, 0.17556153237819672, 0.18903939425945282, 0.20251725614070892, 0.21599511802196503, 0.22947299480438232, 0.24295085668563843, 0.25642871856689453, 0.26990658044815063, 0.28338444232940674, 0.29686230421066284, 0.31034016609191895, 0.32381802797317505, 0.33729588985443115, 0.35077375173568726, 0.36425161361694336, 0.37772947549819946]}, "gradients/encoder.encoder.layers.7.attention.out_proj.weight": {"_type": "histogram", "values": [3.0, 3.0, 5.0, 6.0, 7.0, 9.0, 12.0, 4.0, 16.0, 22.0, 43.0, 47.0, 49.0, 66.0, 110.0, 167.0, 285.0, 433.0, 957.0, 2108.0, 5913.0, 21581.0, 120949.0, 619955.0, 227550.0, 34245.0, 8518.0, 2881.0, 1188.0, 546.0, 290.0, 176.0, 105.0, 82.0, 62.0, 39.0, 32.0, 21.0, 21.0, 16.0, 7.0, 5.0, 7.0, 14.0, 3.0, 1.0, 3.0, 3.0, 2.0, 3.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.1029052734375, -0.09855461120605469, -0.09420394897460938, -0.08985328674316406, -0.08550262451171875, -0.08115196228027344, -0.07680130004882812, -0.07245063781738281, -0.0680999755859375, -0.06374931335449219, -0.059398651123046875, -0.05504798889160156, -0.05069732666015625, -0.04634666442871094, -0.041996002197265625, -0.03764533996582031, -0.033294677734375, -0.028944015502929688, -0.024593353271484375, -0.020242691040039062, -0.01589202880859375, -0.011541366577148438, -0.007190704345703125, -0.0028400421142578125, 0.0015106201171875, 0.0058612823486328125, 0.010211944580078125, 0.014562606811523438, 0.01891326904296875, 0.023263931274414062, 0.027614593505859375, 0.03196525573730469, 0.03631591796875, 0.04066658020019531, 0.045017242431640625, 0.04936790466308594, 0.05371856689453125, 0.05806922912597656, 0.062419891357421875, 0.06677055358886719, 0.0711212158203125, 0.07547187805175781, 0.07982254028320312, 0.08417320251464844, 0.08852386474609375, 0.09287452697753906, 0.09722518920898438, 0.10157585144042969, 0.105926513671875, 0.11027717590332031, 0.11462783813476562, 0.11897850036621094, 0.12332916259765625, 0.12767982482910156, 0.13203048706054688, 0.1363811492919922, 0.1407318115234375, 0.1450824737548828, 0.14943313598632812, 0.15378379821777344, 0.15813446044921875, 0.16248512268066406, 0.16683578491210938, 0.1711864471435547, 0.175537109375]}, "gradients/encoder.encoder.layers.7.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 3.0, 0.0, 1.0, 2.0, 7.0, 13.0, 14.0, 28.0, 39.0, 45.0, 56.0, 74.0, 67.0, 80.0, 88.0, 91.0, 81.0, 68.0, 61.0, 65.0, 37.0, 32.0, 17.0, 16.0, 13.0, 6.0, 2.0, 2.0, 2.0, 1.0, 1.0, 2.0, 1.0, 0.0, 2.0, 0.0, 2.0], "bins": [-0.09893798828125, -0.09657478332519531, -0.09421157836914062, -0.09184837341308594, -0.08948516845703125, -0.08712196350097656, -0.08475875854492188, -0.08239555358886719, -0.0800323486328125, -0.07766914367675781, -0.07530593872070312, -0.07294273376464844, -0.07057952880859375, -0.06821632385253906, -0.06585311889648438, -0.06348991394042969, -0.061126708984375, -0.05876350402832031, -0.056400299072265625, -0.05403709411621094, -0.05167388916015625, -0.04931068420410156, -0.046947479248046875, -0.04458427429199219, -0.0422210693359375, -0.03985786437988281, -0.037494659423828125, -0.03513145446777344, -0.03276824951171875, -0.030405044555664062, -0.028041839599609375, -0.025678634643554688, -0.0233154296875, -0.020952224731445312, -0.018589019775390625, -0.016225814819335938, -0.01386260986328125, -0.011499404907226562, -0.009136199951171875, -0.0067729949951171875, -0.0044097900390625, -0.0020465850830078125, 0.000316619873046875, 0.0026798248291015625, 0.00504302978515625, 0.0074062347412109375, 0.009769439697265625, 0.012132644653320312, 0.014495849609375, 0.016859054565429688, 0.019222259521484375, 0.021585464477539062, 0.02394866943359375, 0.026311874389648438, 0.028675079345703125, 0.031038284301757812, 0.0334014892578125, 0.03576469421386719, 0.038127899169921875, 0.04049110412597656, 0.04285430908203125, 0.04521751403808594, 0.047580718994140625, 0.04994392395019531, 0.05230712890625]}, "gradients/encoder.encoder.layers.7.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 4.0, 2.0, 4.0, 2.0, 5.0, 2.0, 3.0, 8.0, 5.0, 6.0, 13.0, 12.0, 16.0, 14.0, 31.0, 42.0, 49.0, 49.0, 100.0, 135.0, 206.0, 324.0, 546.0, 1191.0, 2749.0, 10181.0, 83121.0, 826145.0, 106569.0, 11173.0, 2976.0, 1166.0, 652.0, 364.0, 234.0, 133.0, 88.0, 57.0, 48.0, 37.0, 22.0, 27.0, 14.0, 10.0, 9.0, 7.0, 9.0, 5.0, 1.0, 0.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.151123046875, -0.1458282470703125, -0.140533447265625, -0.1352386474609375, -0.12994384765625, -0.1246490478515625, -0.119354248046875, -0.1140594482421875, -0.1087646484375, -0.1034698486328125, -0.098175048828125, -0.0928802490234375, -0.08758544921875, -0.0822906494140625, -0.076995849609375, -0.0717010498046875, -0.06640625, -0.0611114501953125, -0.055816650390625, -0.0505218505859375, -0.04522705078125, -0.0399322509765625, -0.034637451171875, -0.0293426513671875, -0.0240478515625, -0.0187530517578125, -0.013458251953125, -0.0081634521484375, -0.00286865234375, 0.0024261474609375, 0.007720947265625, 0.0130157470703125, 0.018310546875, 0.0236053466796875, 0.028900146484375, 0.0341949462890625, 0.03948974609375, 0.0447845458984375, 0.050079345703125, 0.0553741455078125, 0.0606689453125, 0.0659637451171875, 0.071258544921875, 0.0765533447265625, 0.08184814453125, 0.0871429443359375, 0.092437744140625, 0.0977325439453125, 0.10302734375, 0.1083221435546875, 0.113616943359375, 0.1189117431640625, 0.12420654296875, 0.1295013427734375, 0.134796142578125, 0.1400909423828125, 0.1453857421875, 0.1506805419921875, 0.155975341796875, 0.1612701416015625, 0.16656494140625, 0.1718597412109375, 0.177154541015625, 0.1824493408203125, 0.187744140625]}, "gradients/encoder.encoder.layers.7.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 0.0, 1.0, 1.0, 0.0, 1.0, 3.0, 4.0, 3.0, 7.0, 9.0, 8.0, 14.0, 22.0, 14.0, 24.0, 27.0, 28.0, 45.0, 47.0, 52.0, 52.0, 52.0, 47.0, 59.0, 58.0, 51.0, 42.0, 50.0, 47.0, 50.0, 43.0, 34.0, 25.0, 21.0, 20.0, 12.0, 15.0, 7.0, 4.0, 4.0, 3.0, 2.0, 4.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.2047119140625, -0.1985454559326172, -0.19237899780273438, -0.18621253967285156, -0.18004608154296875, -0.17387962341308594, -0.16771316528320312, -0.1615467071533203, -0.1553802490234375, -0.1492137908935547, -0.14304733276367188, -0.13688087463378906, -0.13071441650390625, -0.12454795837402344, -0.11838150024414062, -0.11221504211425781, -0.106048583984375, -0.09988212585449219, -0.09371566772460938, -0.08754920959472656, -0.08138275146484375, -0.07521629333496094, -0.06904983520507812, -0.06288337707519531, -0.0567169189453125, -0.05055046081542969, -0.044384002685546875, -0.03821754455566406, -0.03205108642578125, -0.025884628295898438, -0.019718170166015625, -0.013551712036132812, -0.00738525390625, -0.0012187957763671875, 0.004947662353515625, 0.011114120483398438, 0.01728057861328125, 0.023447036743164062, 0.029613494873046875, 0.03577995300292969, 0.0419464111328125, 0.04811286926269531, 0.054279327392578125, 0.06044578552246094, 0.06661224365234375, 0.07277870178222656, 0.07894515991210938, 0.08511161804199219, 0.091278076171875, 0.09744453430175781, 0.10361099243164062, 0.10977745056152344, 0.11594390869140625, 0.12211036682128906, 0.12827682495117188, 0.1344432830810547, 0.1406097412109375, 0.1467761993408203, 0.15294265747070312, 0.15910911560058594, 0.16527557373046875, 0.17144203186035156, 0.17760848999023438, 0.1837749481201172, 0.18994140625]}, "gradients/encoder.encoder.layers.7.attention.k_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 1.0, 2.0, 2.0, 2.0, 1.0, 6.0, 3.0, 6.0, 8.0, 7.0, 19.0, 22.0, 31.0, 37.0, 66.0, 96.0, 150.0, 227.0, 432.0, 779.0, 1667.0, 4273.0, 16599.0, 324409.0, 668460.0, 22295.0, 4950.0, 1919.0, 915.0, 416.0, 263.0, 167.0, 99.0, 70.0, 41.0, 36.0, 22.0, 20.0, 11.0, 4.0, 9.0, 6.0, 3.0, 1.0, 2.0, 4.0, 2.0, 2.0, 1.0, 0.0, 1.0, 2.0], "bins": [-0.080078125, -0.07785844802856445, -0.0756387710571289, -0.07341909408569336, -0.07119941711425781, -0.06897974014282227, -0.06676006317138672, -0.06454038619995117, -0.062320709228515625, -0.06010103225708008, -0.05788135528564453, -0.055661678314208984, -0.05344200134277344, -0.05122232437133789, -0.049002647399902344, -0.0467829704284668, -0.04456329345703125, -0.0423436164855957, -0.040123939514160156, -0.03790426254272461, -0.03568458557128906, -0.033464908599853516, -0.03124523162841797, -0.029025554656982422, -0.026805877685546875, -0.024586200714111328, -0.02236652374267578, -0.020146846771240234, -0.017927169799804688, -0.01570749282836914, -0.013487815856933594, -0.011268138885498047, -0.0090484619140625, -0.006828784942626953, -0.004609107971191406, -0.0023894309997558594, -0.0001697540283203125, 0.0020499229431152344, 0.004269599914550781, 0.006489276885986328, 0.008708953857421875, 0.010928630828857422, 0.013148307800292969, 0.015367984771728516, 0.017587661743164062, 0.01980733871459961, 0.022027015686035156, 0.024246692657470703, 0.02646636962890625, 0.028686046600341797, 0.030905723571777344, 0.03312540054321289, 0.03534507751464844, 0.037564754486083984, 0.03978443145751953, 0.04200410842895508, 0.044223785400390625, 0.04644346237182617, 0.04866313934326172, 0.050882816314697266, 0.05310249328613281, 0.05532217025756836, 0.057541847229003906, 0.05976152420043945, 0.061981201171875]}, "gradients/encoder.encoder.layers.7.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 2.0, 0.0, 2.0, 1.0, 0.0, 1.0, 2.0, 0.0, 2.0, 2.0, 1.0, 3.0, 4.0, 2.0, 7.0, 12.0, 15.0, 12.0, 29.0, 74.0, 108.0, 169.0, 213.0, 150.0, 94.0, 55.0, 26.0, 4.0, 5.0, 10.0, 2.0, 1.0, 2.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.086162567138672e-05, -2.0069070160388947e-05, -1.9276514649391174e-05, -1.8483959138393402e-05, -1.769140362739563e-05, -1.6898848116397858e-05, -1.6106292605400085e-05, -1.5313737094402313e-05, -1.4521181583404541e-05, -1.3728626072406769e-05, -1.2936070561408997e-05, -1.2143515050411224e-05, -1.1350959539413452e-05, -1.055840402841568e-05, -9.765848517417908e-06, -8.973293006420135e-06, -8.180737495422363e-06, -7.388181984424591e-06, -6.595626473426819e-06, -5.803070962429047e-06, -5.010515451431274e-06, -4.217959940433502e-06, -3.42540442943573e-06, -2.6328489184379578e-06, -1.8402934074401855e-06, -1.0477378964424133e-06, -2.551823854446411e-07, 5.373731255531311e-07, 1.3299286365509033e-06, 2.1224841475486755e-06, 2.9150396585464478e-06, 3.70759516954422e-06, 4.500150680541992e-06, 5.292706191539764e-06, 6.085261702537537e-06, 6.877817213535309e-06, 7.670372724533081e-06, 8.462928235530853e-06, 9.255483746528625e-06, 1.0048039257526398e-05, 1.084059476852417e-05, 1.1633150279521942e-05, 1.2425705790519714e-05, 1.3218261301517487e-05, 1.4010816812515259e-05, 1.4803372323513031e-05, 1.5595927834510803e-05, 1.6388483345508575e-05, 1.7181038856506348e-05, 1.797359436750412e-05, 1.8766149878501892e-05, 1.9558705389499664e-05, 2.0351260900497437e-05, 2.114381641149521e-05, 2.193637192249298e-05, 2.2728927433490753e-05, 2.3521482944488525e-05, 2.4314038455486298e-05, 2.510659396648407e-05, 2.5899149477481842e-05, 2.6691704988479614e-05, 2.7484260499477386e-05, 2.827681601047516e-05, 2.906937152147293e-05, 2.9861927032470703e-05]}, "gradients/encoder.encoder.layers.7.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 1.0, 2.0, 3.0, 2.0, 5.0, 5.0, 9.0, 17.0, 24.0, 40.0, 56.0, 127.0, 269.0, 582.0, 1671.0, 6745.0, 75535.0, 923178.0, 33572.0, 4508.0, 1251.0, 489.0, 210.0, 107.0, 60.0, 40.0, 16.0, 11.0, 5.0, 8.0, 3.0, 6.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.091796875, -0.0887746810913086, -0.08575248718261719, -0.08273029327392578, -0.07970809936523438, -0.07668590545654297, -0.07366371154785156, -0.07064151763916016, -0.06761932373046875, -0.06459712982177734, -0.06157493591308594, -0.05855274200439453, -0.055530548095703125, -0.05250835418701172, -0.04948616027832031, -0.046463966369628906, -0.0434417724609375, -0.040419578552246094, -0.03739738464355469, -0.03437519073486328, -0.031352996826171875, -0.02833080291748047, -0.025308609008789062, -0.022286415100097656, -0.01926422119140625, -0.016242027282714844, -0.013219833374023438, -0.010197639465332031, -0.007175445556640625, -0.004153251647949219, -0.0011310577392578125, 0.0018911361694335938, 0.004913330078125, 0.007935523986816406, 0.010957717895507812, 0.013979911804199219, 0.017002105712890625, 0.02002429962158203, 0.023046493530273438, 0.026068687438964844, 0.02909088134765625, 0.032113075256347656, 0.03513526916503906, 0.03815746307373047, 0.041179656982421875, 0.04420185089111328, 0.04722404479980469, 0.050246238708496094, 0.0532684326171875, 0.056290626525878906, 0.05931282043457031, 0.06233501434326172, 0.06535720825195312, 0.06837940216064453, 0.07140159606933594, 0.07442378997802734, 0.07744598388671875, 0.08046817779541016, 0.08349037170410156, 0.08651256561279297, 0.08953475952148438, 0.09255695343017578, 0.09557914733886719, 0.0986013412475586, 0.10162353515625]}, "gradients/encoder.encoder.layers.7.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 2.0, 1.0, 3.0, 5.0, 1.0, 13.0, 13.0, 10.0, 21.0, 32.0, 42.0, 68.0, 82.0, 123.0, 137.0, 134.0, 101.0, 64.0, 43.0, 33.0, 20.0, 18.0, 12.0, 10.0, 1.0, 3.0, 4.0, 4.0, 4.0, 4.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.06280517578125, -0.060944557189941406, -0.05908393859863281, -0.05722332000732422, -0.055362701416015625, -0.05350208282470703, -0.05164146423339844, -0.049780845642089844, -0.04792022705078125, -0.046059608459472656, -0.04419898986816406, -0.04233837127685547, -0.040477752685546875, -0.03861713409423828, -0.03675651550292969, -0.034895896911621094, -0.0330352783203125, -0.031174659729003906, -0.029314041137695312, -0.02745342254638672, -0.025592803955078125, -0.02373218536376953, -0.021871566772460938, -0.020010948181152344, -0.01815032958984375, -0.016289710998535156, -0.014429092407226562, -0.012568473815917969, -0.010707855224609375, -0.008847236633300781, -0.0069866180419921875, -0.005125999450683594, -0.003265380859375, -0.0014047622680664062, 0.0004558563232421875, 0.0023164749145507812, 0.004177093505859375, 0.006037712097167969, 0.007898330688476562, 0.009758949279785156, 0.01161956787109375, 0.013480186462402344, 0.015340805053710938, 0.01720142364501953, 0.019062042236328125, 0.02092266082763672, 0.022783279418945312, 0.024643898010253906, 0.0265045166015625, 0.028365135192871094, 0.030225753784179688, 0.03208637237548828, 0.033946990966796875, 0.03580760955810547, 0.03766822814941406, 0.039528846740722656, 0.04138946533203125, 0.043250083923339844, 0.04511070251464844, 0.04697132110595703, 0.048831939697265625, 0.05069255828857422, 0.05255317687988281, 0.054413795471191406, 0.0562744140625]}, "gradients/encoder.encoder.layers.7.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 4.0, 5.0, 14.0, 31.0, 57.0, 132.0, 214.0, 247.0, 155.0, 77.0, 41.0, 12.0, 6.0, 8.0, 1.0, 2.0, 5.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0], "bins": [-1.839813470840454, -1.7981863021850586, -1.7565590143203735, -1.714931845664978, -1.673304557800293, -1.6316773891448975, -1.590050220489502, -1.5484230518341064, -1.5067957639694214, -1.4651685953140259, -1.4235413074493408, -1.3819141387939453, -1.3402869701385498, -1.2986596822738647, -1.2570325136184692, -1.2154052257537842, -1.1737780570983887, -1.1321508884429932, -1.090523600578308, -1.0488964319229126, -1.0072691440582275, -0.965641975402832, -0.9240148067474365, -0.8823875784873962, -0.840760350227356, -0.7991331219673157, -0.7575058937072754, -0.7158787250518799, -0.6742514967918396, -0.6326242685317993, -0.5909970998764038, -0.5493698716163635, -0.5077426433563232, -0.46611541509628296, -0.42448821663856506, -0.38286101818084717, -0.3412337899208069, -0.2996065616607666, -0.2579793632030487, -0.2163521647453308, -0.17472493648529053, -0.13309772312641144, -0.09147050976753235, -0.04984329640865326, -0.00821608304977417, 0.03341113030910492, 0.07503834366798401, 0.1166655421257019, 0.1582927703857422, 0.19991998374462128, 0.24154719710350037, 0.28317439556121826, 0.32480162382125854, 0.36642885208129883, 0.4080560505390167, 0.4496832489967346, 0.4913104772567749, 0.5329377055168152, 0.5745649337768555, 0.616192102432251, 0.6578193306922913, 0.6994465589523315, 0.741073727607727, 0.7827009558677673, 0.8243281841278076]}, "gradients/encoder.encoder.layers.7.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 3.0, 3.0, 3.0, 3.0, 7.0, 6.0, 7.0, 11.0, 11.0, 18.0, 14.0, 17.0, 30.0, 35.0, 30.0, 34.0, 29.0, 53.0, 61.0, 60.0, 46.0, 87.0, 54.0, 51.0, 49.0, 39.0, 43.0, 32.0, 27.0, 33.0, 16.0, 16.0, 22.0, 18.0, 13.0, 7.0, 5.0, 9.0, 5.0, 1.0, 1.0, 2.0, 1.0, 1.0, 4.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7061905860900879, -0.6833276152610779, -0.6604645848274231, -0.6376016139984131, -0.6147386431694031, -0.5918756723403931, -0.5690126419067383, -0.5461496710777283, -0.5232867002487183, -0.5004237294197083, -0.47756072878837585, -0.45469772815704346, -0.43183475732803345, -0.40897175669670105, -0.38610875606536865, -0.36324578523635864, -0.34038278460502625, -0.31751978397369385, -0.29465681314468384, -0.27179381251335144, -0.24893084168434143, -0.22606784105300903, -0.20320485532283783, -0.18034186959266663, -0.15747888386249542, -0.13461589813232422, -0.11175291240215302, -0.08888991922140121, -0.06602693349123001, -0.04316394776105881, -0.020300954580307007, 0.0025620311498641968, 0.0254250168800354, 0.048288002610206604, 0.07115098834037781, 0.09401398152112961, 0.11687696725130081, 0.1397399604320526, 0.16260294616222382, 0.18546593189239502, 0.20832891762256622, 0.23119190335273743, 0.2540549039840698, 0.27691787481307983, 0.29978087544441223, 0.32264384627342224, 0.34550684690475464, 0.36836981773376465, 0.39123281836509705, 0.41409581899642944, 0.43695878982543945, 0.45982179045677185, 0.48268476128578186, 0.5055477619171143, 0.5284107327461243, 0.5512737035751343, 0.5741367340087891, 0.5969997048377991, 0.6198627352714539, 0.6427257061004639, 0.6655886769294739, 0.6884516477584839, 0.7113146781921387, 0.7341776490211487, 0.7570406198501587]}, "gradients/encoder.encoder.layers.6.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 1.0, 2.0, 2.0, 3.0, 4.0, 8.0, 16.0, 56.0, 147.0, 432.0, 4149467.0, 43540.0, 380.0, 147.0, 56.0, 17.0, 8.0, 7.0, 1.0, 1.0, 2.0], "bins": [-4.63671875, -4.550758361816406, -4.4647979736328125, -4.378837585449219, -4.292877197265625, -4.206916809082031, -4.1209564208984375, -4.034996032714844, -3.94903564453125, -3.8630752563476562, -3.7771148681640625, -3.6911544799804688, -3.605194091796875, -3.5192337036132812, -3.4332733154296875, -3.3473129272460938, -3.2613525390625, -3.1753921508789062, -3.0894317626953125, -3.0034713745117188, -2.917510986328125, -2.8315505981445312, -2.7455902099609375, -2.6596298217773438, -2.57366943359375, -2.4877090454101562, -2.4017486572265625, -2.3157882690429688, -2.229827880859375, -2.1438674926757812, -2.0579071044921875, -1.9719467163085938, -1.885986328125, -1.8000259399414062, -1.7140655517578125, -1.6281051635742188, -1.542144775390625, -1.4561843872070312, -1.3702239990234375, -1.2842636108398438, -1.19830322265625, -1.1123428344726562, -1.0263824462890625, -0.9404220581054688, -0.854461669921875, -0.7685012817382812, -0.6825408935546875, -0.5965805053710938, -0.5106201171875, -0.42465972900390625, -0.3386993408203125, -0.25273895263671875, -0.166778564453125, -0.08081817626953125, 0.0051422119140625, 0.09110260009765625, 0.17706298828125, 0.26302337646484375, 0.3489837646484375, 0.43494415283203125, 0.520904541015625, 0.6068649291992188, 0.6928253173828125, 0.7787857055664062, 0.86474609375]}, "gradients/encoder.encoder.layers.6.feed_forward.output_dense.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 3.0, 4.0, 9.0, 11.0, 21.0, 38.0, 37.0, 49.0, 62.0, 73.0, 92.0, 98.0, 81.0, 84.0, 94.0, 57.0, 49.0, 37.0, 26.0, 26.0, 18.0, 11.0, 9.0, 6.0, 2.0, 4.0, 3.0, 4.0, 2.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0], "bins": [-0.09259033203125, -0.09034395217895508, -0.08809757232666016, -0.08585119247436523, -0.08360481262207031, -0.08135843276977539, -0.07911205291748047, -0.07686567306518555, -0.07461929321289062, -0.0723729133605957, -0.07012653350830078, -0.06788015365600586, -0.06563377380371094, -0.06338739395141602, -0.061141014099121094, -0.05889463424682617, -0.05664825439453125, -0.05440187454223633, -0.052155494689941406, -0.049909114837646484, -0.04766273498535156, -0.04541635513305664, -0.04316997528076172, -0.0409235954284668, -0.038677215576171875, -0.03643083572387695, -0.03418445587158203, -0.03193807601928711, -0.029691696166992188, -0.027445316314697266, -0.025198936462402344, -0.022952556610107422, -0.0207061767578125, -0.018459796905517578, -0.016213417053222656, -0.013967037200927734, -0.011720657348632812, -0.00947427749633789, -0.007227897644042969, -0.004981517791748047, -0.002735137939453125, -0.0004887580871582031, 0.0017576217651367188, 0.004004001617431641, 0.0062503814697265625, 0.008496761322021484, 0.010743141174316406, 0.012989521026611328, 0.01523590087890625, 0.017482280731201172, 0.019728660583496094, 0.021975040435791016, 0.024221420288085938, 0.02646780014038086, 0.02871417999267578, 0.030960559844970703, 0.033206939697265625, 0.03545331954956055, 0.03769969940185547, 0.03994607925415039, 0.04219245910644531, 0.044438838958740234, 0.046685218811035156, 0.04893159866333008, 0.051177978515625]}, "gradients/encoder.encoder.layers.6.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 3.0, 1.0, 8.0, 7.0, 8.0, 9.0, 28.0, 42.0, 60.0, 83.0, 183.0, 458.0, 1308.0, 6338.0, 88045.0, 4042050.0, 49152.0, 4508.0, 1091.0, 421.0, 199.0, 93.0, 64.0, 42.0, 35.0, 11.0, 17.0, 10.0, 6.0, 7.0, 3.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.23583984375, -0.22723770141601562, -0.21863555908203125, -0.21003341674804688, -0.2014312744140625, -0.19282913208007812, -0.18422698974609375, -0.17562484741210938, -0.167022705078125, -0.15842056274414062, -0.14981842041015625, -0.14121627807617188, -0.1326141357421875, -0.12401199340820312, -0.11540985107421875, -0.10680770874023438, -0.09820556640625, -0.08960342407226562, -0.08100128173828125, -0.07239913940429688, -0.0637969970703125, -0.055194854736328125, -0.04659271240234375, -0.037990570068359375, -0.029388427734375, -0.020786285400390625, -0.01218414306640625, -0.003582000732421875, 0.0050201416015625, 0.013622283935546875, 0.02222442626953125, 0.030826568603515625, 0.0394287109375, 0.048030853271484375, 0.05663299560546875, 0.06523513793945312, 0.0738372802734375, 0.08243942260742188, 0.09104156494140625, 0.09964370727539062, 0.108245849609375, 0.11684799194335938, 0.12545013427734375, 0.13405227661132812, 0.1426544189453125, 0.15125656127929688, 0.15985870361328125, 0.16846084594726562, 0.17706298828125, 0.18566513061523438, 0.19426727294921875, 0.20286941528320312, 0.2114715576171875, 0.22007369995117188, 0.22867584228515625, 0.23727798461914062, 0.245880126953125, 0.2544822692871094, 0.26308441162109375, 0.2716865539550781, 0.2802886962890625, 0.2888908386230469, 0.29749298095703125, 0.3060951232910156, 0.314697265625]}, "gradients/encoder.encoder.layers.6.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 3.0, 2.0, 1.0, 6.0, 9.0, 15.0, 29.0, 29.0, 67.0, 160.0, 843.0, 2250.0, 430.0, 116.0, 43.0, 35.0, 19.0, 9.0, 8.0, 2.0, 2.0, 3.0, 0.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.14794921875, -0.14132308959960938, -0.13469696044921875, -0.12807083129882812, -0.1214447021484375, -0.11481857299804688, -0.10819244384765625, -0.10156631469726562, -0.094940185546875, -0.08831405639648438, -0.08168792724609375, -0.07506179809570312, -0.0684356689453125, -0.061809539794921875, -0.05518341064453125, -0.048557281494140625, -0.04193115234375, -0.035305023193359375, -0.02867889404296875, -0.022052764892578125, -0.0154266357421875, -0.008800506591796875, -0.00217437744140625, 0.004451751708984375, 0.011077880859375, 0.017704010009765625, 0.02433013916015625, 0.030956268310546875, 0.0375823974609375, 0.044208526611328125, 0.05083465576171875, 0.057460784912109375, 0.0640869140625, 0.07071304321289062, 0.07733917236328125, 0.08396530151367188, 0.0905914306640625, 0.09721755981445312, 0.10384368896484375, 0.11046981811523438, 0.117095947265625, 0.12372207641601562, 0.13034820556640625, 0.13697433471679688, 0.1436004638671875, 0.15022659301757812, 0.15685272216796875, 0.16347885131835938, 0.17010498046875, 0.17673110961914062, 0.18335723876953125, 0.18998336791992188, 0.1966094970703125, 0.20323562622070312, 0.20986175537109375, 0.21648788452148438, 0.223114013671875, 0.22974014282226562, 0.23636627197265625, 0.24299240112304688, 0.2496185302734375, 0.2562446594238281, 0.26287078857421875, 0.2694969177246094, 0.276123046875]}, "gradients/encoder.encoder.layers.6.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 3.0, 2.0, 1.0, 1.0, 1.0, 0.0, 2.0, 4.0, 11.0, 12.0, 23.0, 52.0, 98.0, 181.0, 216.0, 196.0, 95.0, 53.0, 27.0, 6.0, 7.0, 5.0, 6.0, 2.0, 4.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-1.1641261577606201, -1.1294547319412231, -1.0947834253311157, -1.0601119995117188, -1.0254405736923218, -0.9907692670822144, -0.9560978412628174, -0.9214264750480652, -0.886755108833313, -0.8520837426185608, -0.8174123167991638, -0.7827409505844116, -0.7480695843696594, -0.7133982181549072, -0.6787267923355103, -0.6440554261207581, -0.6093840003013611, -0.5747126340866089, -0.5400412082672119, -0.5053698420524597, -0.4706984758377075, -0.43602707982063293, -0.40135568380355835, -0.36668431758880615, -0.33201292157173157, -0.297341525554657, -0.2626701593399048, -0.2279987633228302, -0.1933273822069168, -0.15865600109100342, -0.12398460507392883, -0.08931322395801544, -0.05464184284210205, -0.01997045800089836, 0.014700926840305328, 0.049372315406799316, 0.08404369652271271, 0.1187150776386261, 0.15338647365570068, 0.18805785477161407, 0.22272923588752747, 0.25740063190460205, 0.29207199811935425, 0.32674339413642883, 0.3614147901535034, 0.3960861563682556, 0.4307575523853302, 0.4654289484024048, 0.500100314617157, 0.5347716808319092, 0.5694431066513062, 0.6041144728660583, 0.6387858390808105, 0.6734572649002075, 0.7081286311149597, 0.7427999973297119, 0.7774714231491089, 0.8121427893638611, 0.8468142151832581, 0.8814855813980103, 0.9161569476127625, 0.9508283138275146, 0.9854997396469116, 1.0201711654663086, 1.054842472076416]}, "gradients/encoder.encoder.layers.6.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 3.0, 4.0, 2.0, 12.0, 14.0, 13.0, 19.0, 28.0, 27.0, 40.0, 41.0, 42.0, 57.0, 60.0, 61.0, 71.0, 74.0, 69.0, 56.0, 53.0, 44.0, 47.0, 37.0, 41.0, 24.0, 20.0, 16.0, 12.0, 4.0, 5.0, 4.0, 4.0, 2.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6333287954330444, -0.6137583255767822, -0.59418785572052, -0.574617326259613, -0.5550468564033508, -0.5354763865470886, -0.5159059166908264, -0.4963354468345642, -0.4767649471759796, -0.4571944773197174, -0.4376239776611328, -0.4180535078048706, -0.3984830379486084, -0.3789125382900238, -0.3593420684337616, -0.339771568775177, -0.3202010989189148, -0.3006306290626526, -0.281060129404068, -0.2614896595478058, -0.24191917479038239, -0.22234869003295898, -0.20277822017669678, -0.18320773541927338, -0.16363725066184998, -0.14406676590442657, -0.12449628859758377, -0.10492581129074097, -0.08535532653331757, -0.06578484177589417, -0.04621436446905136, -0.026643887162208557, -0.007073342800140381, 0.012497138231992722, 0.032067619264125824, 0.051638100296258926, 0.07120858132839203, 0.09077906608581543, 0.11034954339265823, 0.12992002069950104, 0.14949050545692444, 0.16906099021434784, 0.18863147497177124, 0.20820194482803345, 0.22777242958545685, 0.24734291434288025, 0.26691338419914246, 0.28648388385772705, 0.30605435371398926, 0.32562482357025146, 0.34519532322883606, 0.36476579308509827, 0.38433629274368286, 0.40390676259994507, 0.4234772324562073, 0.4430477023124695, 0.4626182019710541, 0.4821886718273163, 0.5017591714859009, 0.5213296413421631, 0.5409001111984253, 0.5604705810546875, 0.5800411105155945, 0.5996115803718567, 0.6191820502281189]}, "gradients/encoder.encoder.layers.6.attention.out_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 3.0, 4.0, 3.0, 5.0, 5.0, 8.0, 12.0, 31.0, 42.0, 92.0, 182.0, 571.0, 2231.0, 19139.0, 715022.0, 299276.0, 9696.0, 1491.0, 431.0, 154.0, 61.0, 45.0, 20.0, 14.0, 8.0, 4.0, 2.0, 3.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 2.0, 1.0, 1.0], "bins": [-0.385986328125, -0.37579345703125, -0.3656005859375, -0.35540771484375, -0.34521484375, -0.33502197265625, -0.3248291015625, -0.31463623046875, -0.304443359375, -0.29425048828125, -0.2840576171875, -0.27386474609375, -0.263671875, -0.25347900390625, -0.2432861328125, -0.23309326171875, -0.222900390625, -0.21270751953125, -0.2025146484375, -0.19232177734375, -0.18212890625, -0.17193603515625, -0.1617431640625, -0.15155029296875, -0.141357421875, -0.13116455078125, -0.1209716796875, -0.11077880859375, -0.1005859375, -0.09039306640625, -0.0802001953125, -0.07000732421875, -0.059814453125, -0.04962158203125, -0.0394287109375, -0.02923583984375, -0.01904296875, -0.00885009765625, 0.0013427734375, 0.01153564453125, 0.021728515625, 0.03192138671875, 0.0421142578125, 0.05230712890625, 0.0625, 0.07269287109375, 0.0828857421875, 0.09307861328125, 0.103271484375, 0.11346435546875, 0.1236572265625, 0.13385009765625, 0.14404296875, 0.15423583984375, 0.1644287109375, 0.17462158203125, 0.184814453125, 0.19500732421875, 0.2052001953125, 0.21539306640625, 0.2255859375, 0.23577880859375, 0.2459716796875, 0.25616455078125, 0.266357421875]}, "gradients/encoder.encoder.layers.6.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 3.0, 1.0, 4.0, 4.0, 10.0, 14.0, 25.0, 24.0, 35.0, 48.0, 62.0, 80.0, 72.0, 95.0, 91.0, 97.0, 58.0, 75.0, 51.0, 47.0, 33.0, 25.0, 16.0, 16.0, 12.0, 1.0, 4.0, 3.0, 2.0, 1.0, 0.0, 2.0, 1.0, 2.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.1031494140625, -0.10059309005737305, -0.0980367660522461, -0.09548044204711914, -0.09292411804199219, -0.09036779403686523, -0.08781147003173828, -0.08525514602661133, -0.08269882202148438, -0.08014249801635742, -0.07758617401123047, -0.07502985000610352, -0.07247352600097656, -0.06991720199584961, -0.06736087799072266, -0.0648045539855957, -0.06224822998046875, -0.0596919059753418, -0.057135581970214844, -0.05457925796508789, -0.05202293395996094, -0.049466609954833984, -0.04691028594970703, -0.04435396194458008, -0.041797637939453125, -0.03924131393432617, -0.03668498992919922, -0.034128665924072266, -0.03157234191894531, -0.02901601791381836, -0.026459693908691406, -0.023903369903564453, -0.0213470458984375, -0.018790721893310547, -0.016234397888183594, -0.01367807388305664, -0.011121749877929688, -0.008565425872802734, -0.006009101867675781, -0.003452777862548828, -0.000896453857421875, 0.0016598701477050781, 0.004216194152832031, 0.006772518157958984, 0.009328842163085938, 0.01188516616821289, 0.014441490173339844, 0.016997814178466797, 0.01955413818359375, 0.022110462188720703, 0.024666786193847656, 0.02722311019897461, 0.029779434204101562, 0.032335758209228516, 0.03489208221435547, 0.03744840621948242, 0.040004730224609375, 0.04256105422973633, 0.04511737823486328, 0.047673702239990234, 0.05023002624511719, 0.05278635025024414, 0.055342674255371094, 0.05789899826049805, 0.060455322265625]}, "gradients/encoder.encoder.layers.6.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 4.0, 1.0, 1.0, 2.0, 3.0, 1.0, 5.0, 3.0, 8.0, 17.0, 15.0, 18.0, 26.0, 34.0, 49.0, 89.0, 129.0, 204.0, 343.0, 738.0, 1640.0, 5040.0, 23183.0, 204426.0, 708546.0, 85916.0, 12519.0, 3104.0, 1186.0, 483.0, 296.0, 159.0, 108.0, 71.0, 56.0, 39.0, 23.0, 23.0, 11.0, 19.0, 7.0, 6.0, 6.0, 4.0, 3.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.160400390625, -0.15558624267578125, -0.1507720947265625, -0.14595794677734375, -0.141143798828125, -0.13632965087890625, -0.1315155029296875, -0.12670135498046875, -0.12188720703125, -0.11707305908203125, -0.1122589111328125, -0.10744476318359375, -0.102630615234375, -0.09781646728515625, -0.0930023193359375, -0.08818817138671875, -0.0833740234375, -0.07855987548828125, -0.0737457275390625, -0.06893157958984375, -0.064117431640625, -0.05930328369140625, -0.0544891357421875, -0.04967498779296875, -0.04486083984375, -0.04004669189453125, -0.0352325439453125, -0.03041839599609375, -0.025604248046875, -0.02079010009765625, -0.0159759521484375, -0.01116180419921875, -0.00634765625, -0.00153350830078125, 0.0032806396484375, 0.00809478759765625, 0.012908935546875, 0.01772308349609375, 0.0225372314453125, 0.02735137939453125, 0.03216552734375, 0.03697967529296875, 0.0417938232421875, 0.04660797119140625, 0.051422119140625, 0.05623626708984375, 0.0610504150390625, 0.06586456298828125, 0.0706787109375, 0.07549285888671875, 0.0803070068359375, 0.08512115478515625, 0.089935302734375, 0.09474945068359375, 0.0995635986328125, 0.10437774658203125, 0.10919189453125, 0.11400604248046875, 0.1188201904296875, 0.12363433837890625, 0.128448486328125, 0.13326263427734375, 0.1380767822265625, 0.14289093017578125, 0.147705078125]}, "gradients/encoder.encoder.layers.6.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 1.0, 1.0, 1.0, 4.0, 8.0, 6.0, 8.0, 9.0, 3.0, 12.0, 7.0, 13.0, 21.0, 19.0, 14.0, 22.0, 31.0, 20.0, 33.0, 40.0, 36.0, 36.0, 36.0, 45.0, 38.0, 44.0, 49.0, 53.0, 44.0, 25.0, 39.0, 26.0, 43.0, 39.0, 20.0, 23.0, 21.0, 20.0, 8.0, 22.0, 15.0, 11.0, 8.0, 8.0, 7.0, 5.0, 4.0, 7.0, 3.0, 5.0, 2.0, 0.0, 2.0, 0.0, 0.0, 2.0], "bins": [-0.168701171875, -0.16376495361328125, -0.1588287353515625, -0.15389251708984375, -0.148956298828125, -0.14402008056640625, -0.1390838623046875, -0.13414764404296875, -0.12921142578125, -0.12427520751953125, -0.1193389892578125, -0.11440277099609375, -0.109466552734375, -0.10453033447265625, -0.0995941162109375, -0.09465789794921875, -0.0897216796875, -0.08478546142578125, -0.0798492431640625, -0.07491302490234375, -0.069976806640625, -0.06504058837890625, -0.0601043701171875, -0.05516815185546875, -0.05023193359375, -0.04529571533203125, -0.0403594970703125, -0.03542327880859375, -0.030487060546875, -0.02555084228515625, -0.0206146240234375, -0.01567840576171875, -0.0107421875, -0.00580596923828125, -0.0008697509765625, 0.00406646728515625, 0.009002685546875, 0.01393890380859375, 0.0188751220703125, 0.02381134033203125, 0.02874755859375, 0.03368377685546875, 0.0386199951171875, 0.04355621337890625, 0.048492431640625, 0.05342864990234375, 0.0583648681640625, 0.06330108642578125, 0.0682373046875, 0.07317352294921875, 0.0781097412109375, 0.08304595947265625, 0.087982177734375, 0.09291839599609375, 0.0978546142578125, 0.10279083251953125, 0.10772705078125, 0.11266326904296875, 0.1175994873046875, 0.12253570556640625, 0.127471923828125, 0.13240814208984375, 0.1373443603515625, 0.14228057861328125, 0.147216796875]}, "gradients/encoder.encoder.layers.6.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 2.0, 1.0, 4.0, 4.0, 1.0, 3.0, 4.0, 16.0, 11.0, 22.0, 34.0, 72.0, 137.0, 271.0, 830.0, 2563.0, 12040.0, 190344.0, 803566.0, 31832.0, 4688.0, 1267.0, 449.0, 183.0, 92.0, 52.0, 28.0, 18.0, 8.0, 5.0, 3.0, 5.0, 3.0, 2.0, 2.0, 2.0, 3.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.08135986328125, -0.07813358306884766, -0.07490730285644531, -0.07168102264404297, -0.06845474243164062, -0.06522846221923828, -0.06200218200683594, -0.058775901794433594, -0.05554962158203125, -0.052323341369628906, -0.04909706115722656, -0.04587078094482422, -0.042644500732421875, -0.03941822052001953, -0.03619194030761719, -0.032965660095214844, -0.0297393798828125, -0.026513099670410156, -0.023286819458007812, -0.02006053924560547, -0.016834259033203125, -0.013607978820800781, -0.010381698608398438, -0.007155418395996094, -0.00392913818359375, -0.0007028579711914062, 0.0025234222412109375, 0.005749702453613281, 0.008975982666015625, 0.012202262878417969, 0.015428543090820312, 0.018654823303222656, 0.021881103515625, 0.025107383728027344, 0.028333663940429688, 0.03155994415283203, 0.034786224365234375, 0.03801250457763672, 0.04123878479003906, 0.044465065002441406, 0.04769134521484375, 0.050917625427246094, 0.05414390563964844, 0.05737018585205078, 0.060596466064453125, 0.06382274627685547, 0.06704902648925781, 0.07027530670166016, 0.0735015869140625, 0.07672786712646484, 0.07995414733886719, 0.08318042755126953, 0.08640670776367188, 0.08963298797607422, 0.09285926818847656, 0.0960855484008789, 0.09931182861328125, 0.1025381088256836, 0.10576438903808594, 0.10899066925048828, 0.11221694946289062, 0.11544322967529297, 0.11866950988769531, 0.12189579010009766, 0.1251220703125]}, "gradients/encoder.encoder.layers.6.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 4.0, 3.0, 3.0, 3.0, 8.0, 8.0, 11.0, 19.0, 36.0, 60.0, 91.0, 163.0, 168.0, 139.0, 111.0, 57.0, 35.0, 31.0, 19.0, 8.0, 8.0, 8.0, 4.0, 5.0, 1.0, 5.0, 2.0, 2.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.187490463256836e-05, -2.10469588637352e-05, -2.021901309490204e-05, -1.9391067326068878e-05, -1.8563121557235718e-05, -1.7735175788402557e-05, -1.6907230019569397e-05, -1.6079284250736237e-05, -1.5251338481903076e-05, -1.4423392713069916e-05, -1.3595446944236755e-05, -1.2767501175403595e-05, -1.1939555406570435e-05, -1.1111609637737274e-05, -1.0283663868904114e-05, -9.455718100070953e-06, -8.627772331237793e-06, -7.799826562404633e-06, -6.971880793571472e-06, -6.143935024738312e-06, -5.315989255905151e-06, -4.488043487071991e-06, -3.6600977182388306e-06, -2.83215194940567e-06, -2.0042061805725098e-06, -1.1762604117393494e-06, -3.4831464290618896e-07, 4.796311259269714e-07, 1.3075768947601318e-06, 2.1355226635932922e-06, 2.9634684324264526e-06, 3.791414201259613e-06, 4.6193599700927734e-06, 5.447305738925934e-06, 6.275251507759094e-06, 7.103197276592255e-06, 7.931143045425415e-06, 8.759088814258575e-06, 9.587034583091736e-06, 1.0414980351924896e-05, 1.1242926120758057e-05, 1.2070871889591217e-05, 1.2898817658424377e-05, 1.3726763427257538e-05, 1.4554709196090698e-05, 1.538265496492386e-05, 1.621060073375702e-05, 1.703854650259018e-05, 1.786649227142334e-05, 1.86944380402565e-05, 1.952238380908966e-05, 2.035032957792282e-05, 2.117827534675598e-05, 2.2006221115589142e-05, 2.2834166884422302e-05, 2.3662112653255463e-05, 2.4490058422088623e-05, 2.5318004190921783e-05, 2.6145949959754944e-05, 2.6973895728588104e-05, 2.7801841497421265e-05, 2.8629787266254425e-05, 2.9457733035087585e-05, 3.0285678803920746e-05, 3.1113624572753906e-05]}, "gradients/encoder.encoder.layers.6.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 4.0, 4.0, 1.0, 7.0, 10.0, 11.0, 14.0, 15.0, 26.0, 39.0, 61.0, 91.0, 154.0, 325.0, 580.0, 1582.0, 4425.0, 17203.0, 110119.0, 734429.0, 149722.0, 21270.0, 5268.0, 1733.0, 651.0, 318.0, 204.0, 104.0, 57.0, 37.0, 31.0, 19.0, 13.0, 4.0, 9.0, 6.0, 9.0, 5.0, 0.0, 1.0, 1.0, 2.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.06170654296875, -0.059546470642089844, -0.05738639831542969, -0.05522632598876953, -0.053066253662109375, -0.05090618133544922, -0.04874610900878906, -0.046586036682128906, -0.04442596435546875, -0.042265892028808594, -0.04010581970214844, -0.03794574737548828, -0.035785675048828125, -0.03362560272216797, -0.03146553039550781, -0.029305458068847656, -0.0271453857421875, -0.024985313415527344, -0.022825241088867188, -0.02066516876220703, -0.018505096435546875, -0.01634502410888672, -0.014184951782226562, -0.012024879455566406, -0.00986480712890625, -0.007704734802246094, -0.0055446624755859375, -0.0033845901489257812, -0.001224517822265625, 0.0009355545043945312, 0.0030956268310546875, 0.005255699157714844, 0.007415771484375, 0.009575843811035156, 0.011735916137695312, 0.013895988464355469, 0.016056060791015625, 0.01821613311767578, 0.020376205444335938, 0.022536277770996094, 0.02469635009765625, 0.026856422424316406, 0.029016494750976562, 0.03117656707763672, 0.033336639404296875, 0.03549671173095703, 0.03765678405761719, 0.039816856384277344, 0.0419769287109375, 0.044137001037597656, 0.04629707336425781, 0.04845714569091797, 0.050617218017578125, 0.05277729034423828, 0.05493736267089844, 0.057097434997558594, 0.05925750732421875, 0.061417579650878906, 0.06357765197753906, 0.06573772430419922, 0.06789779663085938, 0.07005786895751953, 0.07221794128417969, 0.07437801361083984, 0.0765380859375]}, "gradients/encoder.encoder.layers.6.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 4.0, 4.0, 3.0, 5.0, 2.0, 8.0, 4.0, 15.0, 7.0, 10.0, 13.0, 17.0, 30.0, 32.0, 44.0, 30.0, 56.0, 66.0, 66.0, 81.0, 85.0, 78.0, 59.0, 55.0, 46.0, 40.0, 28.0, 25.0, 10.0, 14.0, 12.0, 12.0, 9.0, 9.0, 3.0, 9.0, 6.0, 2.0, 3.0, 2.0, 7.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.047637939453125, -0.04619932174682617, -0.044760704040527344, -0.043322086334228516, -0.04188346862792969, -0.04044485092163086, -0.03900623321533203, -0.0375676155090332, -0.036128997802734375, -0.03469038009643555, -0.03325176239013672, -0.03181314468383789, -0.030374526977539062, -0.028935909271240234, -0.027497291564941406, -0.026058673858642578, -0.02462005615234375, -0.023181438446044922, -0.021742820739746094, -0.020304203033447266, -0.018865585327148438, -0.01742696762084961, -0.01598834991455078, -0.014549732208251953, -0.013111114501953125, -0.011672496795654297, -0.010233879089355469, -0.00879526138305664, -0.0073566436767578125, -0.005918025970458984, -0.004479408264160156, -0.003040790557861328, -0.0016021728515625, -0.00016355514526367188, 0.0012750625610351562, 0.0027136802673339844, 0.0041522979736328125, 0.005590915679931641, 0.007029533386230469, 0.008468151092529297, 0.009906768798828125, 0.011345386505126953, 0.012784004211425781, 0.01422262191772461, 0.015661239624023438, 0.017099857330322266, 0.018538475036621094, 0.019977092742919922, 0.02141571044921875, 0.022854328155517578, 0.024292945861816406, 0.025731563568115234, 0.027170181274414062, 0.02860879898071289, 0.03004741668701172, 0.03148603439331055, 0.032924652099609375, 0.0343632698059082, 0.03580188751220703, 0.03724050521850586, 0.03867912292480469, 0.040117740631103516, 0.041556358337402344, 0.04299497604370117, 0.04443359375]}, "gradients/encoder.encoder.layers.6.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 3.0, 2.0, 12.0, 31.0, 58.0, 193.0, 276.0, 227.0, 101.0, 55.0, 22.0, 11.0, 4.0, 4.0, 4.0, 3.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.9872668981552124, -0.9363853931427002, -0.8855039477348328, -0.8346225023269653, -0.7837409973144531, -0.7328594923019409, -0.6819780468940735, -0.631096601486206, -0.5802150964736938, -0.5293335914611816, -0.4784521460533142, -0.4275706708431244, -0.37668919563293457, -0.32580772042274475, -0.27492624521255493, -0.2240447700023651, -0.1731632947921753, -0.12228181958198547, -0.07140034437179565, -0.020518869161605835, 0.030362606048583984, 0.0812440812587738, 0.13212555646896362, 0.18300703167915344, 0.23388850688934326, 0.2847699820995331, 0.3356514573097229, 0.3865329325199127, 0.43741440773010254, 0.48829588294029236, 0.5391773581504822, 0.5900588035583496, 0.6409404277801514, 0.6918219327926636, 0.742703378200531, 0.7935848236083984, 0.8444663286209106, 0.8953478336334229, 0.9462292790412903, 0.9971107244491577, 1.04799222946167, 1.0988737344741821, 1.1497552394866943, 1.200636625289917, 1.2515181303024292, 1.3023996353149414, 1.353281021118164, 1.4041625261306763, 1.4550440311431885, 1.5059255361557007, 1.556807041168213, 1.6076884269714355, 1.6585699319839478, 1.70945143699646, 1.7603328227996826, 1.8112143278121948, 1.862095832824707, 1.9129773378372192, 1.9638588428497314, 2.014740228652954, 2.065621852874756, 2.1165032386779785, 2.167384624481201, 2.218266248703003, 2.2691476345062256]}, "gradients/encoder.encoder.layers.6.layer_norm.bias": {"_type": "histogram", "values": [3.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 2.0, 3.0, 8.0, 5.0, 9.0, 9.0, 20.0, 19.0, 22.0, 18.0, 28.0, 32.0, 38.0, 29.0, 40.0, 56.0, 50.0, 55.0, 68.0, 67.0, 71.0, 58.0, 42.0, 35.0, 42.0, 35.0, 22.0, 29.0, 19.0, 14.0, 11.0, 17.0, 14.0, 6.0, 4.0, 2.0, 6.0, 1.0, 3.0, 1.0, 1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.738601565361023, -0.712329089641571, -0.6860566139221191, -0.6597841382026672, -0.6335116624832153, -0.6072391867637634, -0.5809667110443115, -0.5546941757202148, -0.5284217596054077, -0.5021492838859558, -0.4758768081665039, -0.449604332447052, -0.4233318567276001, -0.3970593810081482, -0.3707868754863739, -0.344514399766922, -0.3182418942451477, -0.2919694185256958, -0.2656969428062439, -0.2394244521856308, -0.2131519764661789, -0.186879500746727, -0.1606070101261139, -0.134334534406662, -0.10806205868721008, -0.08178958296775818, -0.05551709979772568, -0.029244616627693176, -0.002972140908241272, 0.023300334811210632, 0.04957282543182373, 0.07584530115127563, 0.10211777687072754, 0.12839025259017944, 0.15466272830963135, 0.18093521893024445, 0.20720769464969635, 0.23348017036914825, 0.25975266098976135, 0.28602513670921326, 0.31229761242866516, 0.33857008814811707, 0.36484256386756897, 0.39111506938934326, 0.41738754510879517, 0.44366002082824707, 0.469932496547699, 0.4962049722671509, 0.5224774479866028, 0.5487499237060547, 0.5750223994255066, 0.6012948751449585, 0.6275673508644104, 0.6538398265838623, 0.680112361907959, 0.7063847780227661, 0.7326573133468628, 0.7589297890663147, 0.7852022647857666, 0.8114747405052185, 0.8377472162246704, 0.8640196919441223, 0.8902921676635742, 0.9165647029876709, 0.942837119102478]}, "gradients/encoder.encoder.layers.5.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 3.0, 2.0, 1.0, 0.0, 3.0, 2.0, 1.0, 3.0, 6.0, 12.0, 31.0, 62.0, 170.0, 441.0, 2584.0, 898254.0, 3288101.0, 3985.0, 468.0, 103.0, 33.0, 21.0, 9.0, 1.0, 2.0, 0.0, 2.0, 0.0, 2.0], "bins": [-0.826171875, -0.8099880218505859, -0.7938041687011719, -0.7776203155517578, -0.7614364624023438, -0.7452526092529297, -0.7290687561035156, -0.7128849029541016, -0.6967010498046875, -0.6805171966552734, -0.6643333435058594, -0.6481494903564453, -0.6319656372070312, -0.6157817840576172, -0.5995979309082031, -0.5834140777587891, -0.567230224609375, -0.5510463714599609, -0.5348625183105469, -0.5186786651611328, -0.5024948120117188, -0.4863109588623047, -0.4701271057128906, -0.45394325256347656, -0.4377593994140625, -0.42157554626464844, -0.4053916931152344, -0.3892078399658203, -0.37302398681640625, -0.3568401336669922, -0.3406562805175781, -0.32447242736816406, -0.30828857421875, -0.29210472106933594, -0.2759208679199219, -0.2597370147705078, -0.24355316162109375, -0.2273693084716797, -0.21118545532226562, -0.19500160217285156, -0.1788177490234375, -0.16263389587402344, -0.14645004272460938, -0.1302661895751953, -0.11408233642578125, -0.09789848327636719, -0.08171463012695312, -0.06553077697753906, -0.049346923828125, -0.03316307067871094, -0.016979217529296875, -0.0007953643798828125, 0.01538848876953125, 0.03157234191894531, 0.047756195068359375, 0.06394004821777344, 0.0801239013671875, 0.09630775451660156, 0.11249160766601562, 0.1286754608154297, 0.14485931396484375, 0.1610431671142578, 0.17722702026367188, 0.19341087341308594, 0.2095947265625]}, "gradients/encoder.encoder.layers.5.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 0.0, 1.0, 8.0, 16.0, 10.0, 23.0, 40.0, 47.0, 57.0, 86.0, 104.0, 94.0, 106.0, 98.0, 93.0, 86.0, 45.0, 38.0, 24.0, 13.0, 5.0, 4.0, 7.0, 2.0, 2.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0], "bins": [-0.1142578125, -0.11147594451904297, -0.10869407653808594, -0.1059122085571289, -0.10313034057617188, -0.10034847259521484, -0.09756660461425781, -0.09478473663330078, -0.09200286865234375, -0.08922100067138672, -0.08643913269042969, -0.08365726470947266, -0.08087539672851562, -0.0780935287475586, -0.07531166076660156, -0.07252979278564453, -0.0697479248046875, -0.06696605682373047, -0.06418418884277344, -0.061402320861816406, -0.058620452880859375, -0.055838584899902344, -0.05305671691894531, -0.05027484893798828, -0.04749298095703125, -0.04471111297607422, -0.04192924499511719, -0.039147377014160156, -0.036365509033203125, -0.033583641052246094, -0.030801773071289062, -0.02801990509033203, -0.025238037109375, -0.02245616912841797, -0.019674301147460938, -0.016892433166503906, -0.014110565185546875, -0.011328697204589844, -0.008546829223632812, -0.005764961242675781, -0.00298309326171875, -0.00020122528076171875, 0.0025806427001953125, 0.005362510681152344, 0.008144378662109375, 0.010926246643066406, 0.013708114624023438, 0.01648998260498047, 0.0192718505859375, 0.02205371856689453, 0.024835586547851562, 0.027617454528808594, 0.030399322509765625, 0.033181190490722656, 0.03596305847167969, 0.03874492645263672, 0.04152679443359375, 0.04430866241455078, 0.04709053039550781, 0.049872398376464844, 0.052654266357421875, 0.055436134338378906, 0.05821800231933594, 0.06099987030029297, 0.06378173828125]}, "gradients/encoder.encoder.layers.5.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 2.0, 4.0, 5.0, 9.0, 14.0, 9.0, 25.0, 27.0, 33.0, 79.0, 142.0, 270.0, 679.0, 1563.0, 5091.0, 24143.0, 439530.0, 3642973.0, 65300.0, 9858.0, 2661.0, 963.0, 444.0, 208.0, 95.0, 52.0, 46.0, 21.0, 9.0, 15.0, 7.0, 5.0, 2.0, 4.0, 0.0, 0.0, 1.0, 2.0, 0.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.166259765625, -0.16014862060546875, -0.1540374755859375, -0.14792633056640625, -0.141815185546875, -0.13570404052734375, -0.1295928955078125, -0.12348175048828125, -0.11737060546875, -0.11125946044921875, -0.1051483154296875, -0.09903717041015625, -0.092926025390625, -0.08681488037109375, -0.0807037353515625, -0.07459259033203125, -0.0684814453125, -0.06237030029296875, -0.0562591552734375, -0.05014801025390625, -0.044036865234375, -0.03792572021484375, -0.0318145751953125, -0.02570343017578125, -0.01959228515625, -0.01348114013671875, -0.0073699951171875, -0.00125885009765625, 0.004852294921875, 0.01096343994140625, 0.0170745849609375, 0.02318572998046875, 0.029296875, 0.03540802001953125, 0.0415191650390625, 0.04763031005859375, 0.053741455078125, 0.05985260009765625, 0.0659637451171875, 0.07207489013671875, 0.07818603515625, 0.08429718017578125, 0.0904083251953125, 0.09651947021484375, 0.102630615234375, 0.10874176025390625, 0.1148529052734375, 0.12096405029296875, 0.1270751953125, 0.13318634033203125, 0.1392974853515625, 0.14540863037109375, 0.151519775390625, 0.15763092041015625, 0.1637420654296875, 0.16985321044921875, 0.17596435546875, 0.18207550048828125, 0.1881866455078125, 0.19429779052734375, 0.200408935546875, 0.20652008056640625, 0.2126312255859375, 0.21874237060546875, 0.224853515625]}, "gradients/encoder.encoder.layers.5.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 4.0, 0.0, 6.0, 2.0, 2.0, 2.0, 5.0, 13.0, 7.0, 13.0, 17.0, 35.0, 49.0, 78.0, 154.0, 420.0, 1200.0, 1142.0, 419.0, 194.0, 106.0, 65.0, 42.0, 31.0, 24.0, 19.0, 8.0, 6.0, 7.0, 5.0, 1.0, 4.0, 1.0, 3.0, 2.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.157470703125, -0.15254974365234375, -0.1476287841796875, -0.14270782470703125, -0.137786865234375, -0.13286590576171875, -0.1279449462890625, -0.12302398681640625, -0.11810302734375, -0.11318206787109375, -0.1082611083984375, -0.10334014892578125, -0.098419189453125, -0.09349822998046875, -0.0885772705078125, -0.08365631103515625, -0.0787353515625, -0.07381439208984375, -0.0688934326171875, -0.06397247314453125, -0.059051513671875, -0.05413055419921875, -0.0492095947265625, -0.04428863525390625, -0.03936767578125, -0.03444671630859375, -0.0295257568359375, -0.02460479736328125, -0.019683837890625, -0.01476287841796875, -0.0098419189453125, -0.00492095947265625, 0.0, 0.00492095947265625, 0.0098419189453125, 0.01476287841796875, 0.019683837890625, 0.02460479736328125, 0.0295257568359375, 0.03444671630859375, 0.03936767578125, 0.04428863525390625, 0.0492095947265625, 0.05413055419921875, 0.059051513671875, 0.06397247314453125, 0.0688934326171875, 0.07381439208984375, 0.0787353515625, 0.08365631103515625, 0.0885772705078125, 0.09349822998046875, 0.098419189453125, 0.10334014892578125, 0.1082611083984375, 0.11318206787109375, 0.11810302734375, 0.12302398681640625, 0.1279449462890625, 0.13286590576171875, 0.137786865234375, 0.14270782470703125, 0.1476287841796875, 0.15254974365234375, 0.157470703125]}, "gradients/encoder.encoder.layers.5.final_layer_norm.weight": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 3.0, 4.0, 4.0, 5.0, 18.0, 44.0, 79.0, 124.0, 221.0, 220.0, 140.0, 68.0, 34.0, 23.0, 5.0, 4.0, 5.0, 4.0, 4.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.9099547266960144, -0.8617578744888306, -0.8135610818862915, -0.7653642296791077, -0.7171673774719238, -0.6689705848693848, -0.6207737326622009, -0.5725768804550171, -0.524380087852478, -0.4761832654476166, -0.42798641324043274, -0.3797895908355713, -0.33159273862838745, -0.283395916223526, -0.23519909381866455, -0.1870022416114807, -0.13880538940429688, -0.09060855209827423, -0.042411722242832184, 0.005785107612609863, 0.05398194491863251, 0.10217878222465515, 0.1503756046295166, 0.19857245683670044, 0.2467692792415619, 0.29496610164642334, 0.3431629538536072, 0.39135977625846863, 0.4395565986633301, 0.4877534508705139, 0.5359503030776978, 0.5841470956802368, 0.6323438882827759, 0.6805407404899597, 0.7287375330924988, 0.7769343852996826, 0.8251312375068665, 0.8733280897140503, 0.9215248823165894, 0.9697217345237732, 1.017918586730957, 1.066115379333496, 1.1143122911453247, 1.1625090837478638, 1.2107058763504028, 1.2589027881622314, 1.3070995807647705, 1.3552963733673096, 1.4034931659698486, 1.4516899585723877, 1.4998868703842163, 1.5480836629867554, 1.5962804555892944, 1.644477367401123, 1.692674160003662, 1.7408709526062012, 1.7890678644180298, 1.8372646570205688, 1.8854615688323975, 1.9336583614349365, 1.9818551540374756, 2.0300519466400146, 2.078248977661133, 2.126445770263672, 2.174642562866211]}, "gradients/encoder.encoder.layers.5.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 3.0, 2.0, 3.0, 6.0, 6.0, 4.0, 12.0, 11.0, 16.0, 19.0, 22.0, 42.0, 37.0, 39.0, 44.0, 46.0, 58.0, 64.0, 67.0, 46.0, 74.0, 63.0, 60.0, 51.0, 35.0, 38.0, 30.0, 28.0, 19.0, 25.0, 10.0, 5.0, 8.0, 12.0, 5.0, 3.0, 1.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.9162216186523438, -0.8932319283485413, -0.8702422380447388, -0.8472525477409363, -0.8242628574371338, -0.8012731671333313, -0.7782834768295288, -0.7552937865257263, -0.7323040962219238, -0.7093144059181213, -0.6863247156143188, -0.6633350253105164, -0.6403453350067139, -0.6173556447029114, -0.5943659543991089, -0.5713762640953064, -0.5483865737915039, -0.5253968834877014, -0.5024071931838989, -0.47941750288009644, -0.45642781257629395, -0.43343812227249146, -0.41044843196868896, -0.3874587416648865, -0.364469051361084, -0.3414793610572815, -0.318489670753479, -0.2954999804496765, -0.272510290145874, -0.24952059984207153, -0.22653090953826904, -0.20354121923446655, -0.18055158853530884, -0.15756189823150635, -0.13457220792770386, -0.11158251762390137, -0.08859282732009888, -0.06560313701629639, -0.042613446712493896, -0.019623756408691406, 0.003365933895111084, 0.026355624198913574, 0.049345314502716064, 0.07233500480651855, 0.09532469511032104, 0.11831438541412354, 0.14130407571792603, 0.16429376602172852, 0.187283456325531, 0.2102731466293335, 0.233262836933136, 0.2562525272369385, 0.27924221754074097, 0.30223190784454346, 0.32522159814834595, 0.34821128845214844, 0.3712009787559509, 0.3941906690597534, 0.4171803593635559, 0.4401700496673584, 0.4631597399711609, 0.4861494302749634, 0.5091391205787659, 0.5321288108825684, 0.5551185011863708]}, "gradients/encoder.encoder.layers.5.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 2.0, 5.0, 2.0, 2.0, 6.0, 11.0, 11.0, 8.0, 19.0, 22.0, 31.0, 45.0, 74.0, 137.0, 192.0, 402.0, 746.0, 1866.0, 5783.0, 26878.0, 178893.0, 616400.0, 180569.0, 26965.0, 5844.0, 1895.0, 839.0, 356.0, 222.0, 113.0, 75.0, 53.0, 32.0, 26.0, 11.0, 7.0, 7.0, 6.0, 4.0, 4.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.12322998046875, -0.11839771270751953, -0.11356544494628906, -0.1087331771850586, -0.10390090942382812, -0.09906864166259766, -0.09423637390136719, -0.08940410614013672, -0.08457183837890625, -0.07973957061767578, -0.07490730285644531, -0.07007503509521484, -0.06524276733398438, -0.060410499572753906, -0.05557823181152344, -0.05074596405029297, -0.0459136962890625, -0.04108142852783203, -0.03624916076660156, -0.031416893005371094, -0.026584625244140625, -0.021752357482910156, -0.016920089721679688, -0.012087821960449219, -0.00725555419921875, -0.0024232864379882812, 0.0024089813232421875, 0.007241249084472656, 0.012073516845703125, 0.016905784606933594, 0.021738052368164062, 0.02657032012939453, 0.031402587890625, 0.03623485565185547, 0.04106712341308594, 0.045899391174316406, 0.050731658935546875, 0.055563926696777344, 0.06039619445800781, 0.06522846221923828, 0.07006072998046875, 0.07489299774169922, 0.07972526550292969, 0.08455753326416016, 0.08938980102539062, 0.0942220687866211, 0.09905433654785156, 0.10388660430908203, 0.1087188720703125, 0.11355113983154297, 0.11838340759277344, 0.1232156753540039, 0.12804794311523438, 0.13288021087646484, 0.1377124786376953, 0.14254474639892578, 0.14737701416015625, 0.15220928192138672, 0.1570415496826172, 0.16187381744384766, 0.16670608520507812, 0.1715383529663086, 0.17637062072753906, 0.18120288848876953, 0.18603515625]}, "gradients/encoder.encoder.layers.5.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 1.0, 3.0, 2.0, 18.0, 16.0, 30.0, 31.0, 36.0, 48.0, 70.0, 105.0, 95.0, 83.0, 101.0, 102.0, 67.0, 69.0, 44.0, 29.0, 26.0, 10.0, 12.0, 5.0, 1.0, 5.0, 1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.10968017578125, -0.10690593719482422, -0.10413169860839844, -0.10135746002197266, -0.09858322143554688, -0.0958089828491211, -0.09303474426269531, -0.09026050567626953, -0.08748626708984375, -0.08471202850341797, -0.08193778991699219, -0.0791635513305664, -0.07638931274414062, -0.07361507415771484, -0.07084083557128906, -0.06806659698486328, -0.0652923583984375, -0.06251811981201172, -0.05974388122558594, -0.056969642639160156, -0.054195404052734375, -0.051421165466308594, -0.04864692687988281, -0.04587268829345703, -0.04309844970703125, -0.04032421112060547, -0.03754997253417969, -0.034775733947753906, -0.032001495361328125, -0.029227256774902344, -0.026453018188476562, -0.02367877960205078, -0.020904541015625, -0.01813030242919922, -0.015356063842773438, -0.012581825256347656, -0.009807586669921875, -0.007033348083496094, -0.0042591094970703125, -0.0014848709106445312, 0.00128936767578125, 0.004063606262207031, 0.0068378448486328125, 0.009612083435058594, 0.012386322021484375, 0.015160560607910156, 0.017934799194335938, 0.02070903778076172, 0.0234832763671875, 0.02625751495361328, 0.029031753540039062, 0.031805992126464844, 0.034580230712890625, 0.037354469299316406, 0.04012870788574219, 0.04290294647216797, 0.04567718505859375, 0.04845142364501953, 0.05122566223144531, 0.053999900817871094, 0.056774139404296875, 0.059548377990722656, 0.06232261657714844, 0.06509685516357422, 0.06787109375]}, "gradients/encoder.encoder.layers.5.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 3.0, 1.0, 4.0, 3.0, 3.0, 4.0, 10.0, 8.0, 12.0, 11.0, 15.0, 25.0, 32.0, 43.0, 70.0, 99.0, 131.0, 194.0, 325.0, 508.0, 950.0, 2139.0, 6743.0, 41373.0, 660455.0, 302561.0, 24312.0, 4772.0, 1681.0, 852.0, 416.0, 275.0, 144.0, 111.0, 66.0, 58.0, 34.0, 33.0, 28.0, 21.0, 11.0, 14.0, 5.0, 8.0, 3.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.2105712890625, -0.20416641235351562, -0.19776153564453125, -0.19135665893554688, -0.1849517822265625, -0.17854690551757812, -0.17214202880859375, -0.16573715209960938, -0.159332275390625, -0.15292739868164062, -0.14652252197265625, -0.14011764526367188, -0.1337127685546875, -0.12730789184570312, -0.12090301513671875, -0.11449813842773438, -0.10809326171875, -0.10168838500976562, -0.09528350830078125, -0.08887863159179688, -0.0824737548828125, -0.07606887817382812, -0.06966400146484375, -0.06325912475585938, -0.056854248046875, -0.050449371337890625, -0.04404449462890625, -0.037639617919921875, -0.0312347412109375, -0.024829864501953125, -0.01842498779296875, -0.012020111083984375, -0.005615234375, 0.000789642333984375, 0.00719451904296875, 0.013599395751953125, 0.0200042724609375, 0.026409149169921875, 0.03281402587890625, 0.039218902587890625, 0.045623779296875, 0.052028656005859375, 0.05843353271484375, 0.06483840942382812, 0.0712432861328125, 0.07764816284179688, 0.08405303955078125, 0.09045791625976562, 0.09686279296875, 0.10326766967773438, 0.10967254638671875, 0.11607742309570312, 0.1224822998046875, 0.12888717651367188, 0.13529205322265625, 0.14169692993164062, 0.148101806640625, 0.15450668334960938, 0.16091156005859375, 0.16731643676757812, 0.1737213134765625, 0.18012619018554688, 0.18653106689453125, 0.19293594360351562, 0.1993408203125]}, "gradients/encoder.encoder.layers.5.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 3.0, 1.0, 0.0, 0.0, 2.0, 4.0, 5.0, 3.0, 1.0, 6.0, 7.0, 10.0, 14.0, 15.0, 21.0, 21.0, 28.0, 27.0, 29.0, 31.0, 27.0, 35.0, 50.0, 34.0, 38.0, 40.0, 49.0, 61.0, 55.0, 40.0, 40.0, 47.0, 36.0, 42.0, 38.0, 31.0, 25.0, 26.0, 13.0, 14.0, 13.0, 14.0, 8.0, 4.0, 3.0, 3.0, 2.0, 2.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.189453125, -0.18335914611816406, -0.17726516723632812, -0.1711711883544922, -0.16507720947265625, -0.1589832305908203, -0.15288925170898438, -0.14679527282714844, -0.1407012939453125, -0.13460731506347656, -0.12851333618164062, -0.12241935729980469, -0.11632537841796875, -0.11023139953613281, -0.10413742065429688, -0.09804344177246094, -0.091949462890625, -0.08585548400878906, -0.07976150512695312, -0.07366752624511719, -0.06757354736328125, -0.06147956848144531, -0.055385589599609375, -0.04929161071777344, -0.0431976318359375, -0.03710365295410156, -0.031009674072265625, -0.024915695190429688, -0.01882171630859375, -0.012727737426757812, -0.006633758544921875, -0.0005397796630859375, 0.00555419921875, 0.011648178100585938, 0.017742156982421875, 0.023836135864257812, 0.02993011474609375, 0.03602409362792969, 0.042118072509765625, 0.04821205139160156, 0.0543060302734375, 0.06040000915527344, 0.06649398803710938, 0.07258796691894531, 0.07868194580078125, 0.08477592468261719, 0.09086990356445312, 0.09696388244628906, 0.103057861328125, 0.10915184020996094, 0.11524581909179688, 0.12133979797363281, 0.12743377685546875, 0.1335277557373047, 0.13962173461914062, 0.14571571350097656, 0.1518096923828125, 0.15790367126464844, 0.16399765014648438, 0.1700916290283203, 0.17618560791015625, 0.1822795867919922, 0.18837356567382812, 0.19446754455566406, 0.2005615234375]}, "gradients/encoder.encoder.layers.5.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 4.0, 4.0, 6.0, 4.0, 6.0, 5.0, 11.0, 12.0, 23.0, 18.0, 35.0, 22.0, 60.0, 82.0, 126.0, 178.0, 248.0, 401.0, 669.0, 1121.0, 2089.0, 3986.0, 9055.0, 24137.0, 138473.0, 764166.0, 71512.0, 17413.0, 7104.0, 3261.0, 1733.0, 956.0, 550.0, 342.0, 224.0, 145.0, 108.0, 85.0, 42.0, 40.0, 22.0, 18.0, 14.0, 17.0, 8.0, 9.0, 3.0, 5.0, 5.0, 4.0, 3.0, 0.0, 1.0, 1.0, 0.0, 3.0, 2.0], "bins": [-0.07525634765625, -0.07293033599853516, -0.07060432434082031, -0.06827831268310547, -0.06595230102539062, -0.06362628936767578, -0.06130027770996094, -0.058974266052246094, -0.05664825439453125, -0.054322242736816406, -0.05199623107910156, -0.04967021942138672, -0.047344207763671875, -0.04501819610595703, -0.04269218444824219, -0.040366172790527344, -0.0380401611328125, -0.035714149475097656, -0.03338813781738281, -0.03106212615966797, -0.028736114501953125, -0.02641010284423828, -0.024084091186523438, -0.021758079528808594, -0.01943206787109375, -0.017106056213378906, -0.014780044555664062, -0.012454032897949219, -0.010128021240234375, -0.007802009582519531, -0.0054759979248046875, -0.0031499862670898438, -0.000823974609375, 0.0015020370483398438, 0.0038280487060546875, 0.006154060363769531, 0.008480072021484375, 0.010806083679199219, 0.013132095336914062, 0.015458106994628906, 0.01778411865234375, 0.020110130310058594, 0.022436141967773438, 0.02476215362548828, 0.027088165283203125, 0.02941417694091797, 0.03174018859863281, 0.034066200256347656, 0.0363922119140625, 0.038718223571777344, 0.04104423522949219, 0.04337024688720703, 0.045696258544921875, 0.04802227020263672, 0.05034828186035156, 0.052674293518066406, 0.05500030517578125, 0.057326316833496094, 0.05965232849121094, 0.06197834014892578, 0.06430435180664062, 0.06663036346435547, 0.06895637512207031, 0.07128238677978516, 0.0736083984375]}, "gradients/encoder.encoder.layers.5.attention.k_proj.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 3.0, 1.0, 6.0, 0.0, 2.0, 6.0, 6.0, 11.0, 18.0, 32.0, 93.0, 229.0, 316.0, 153.0, 62.0, 24.0, 7.0, 9.0, 6.0, 4.0, 10.0, 5.0, 1.0, 2.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-4.7326087951660156e-05, -4.546064883470535e-05, -4.359520971775055e-05, -4.1729770600795746e-05, -3.986433148384094e-05, -3.799889236688614e-05, -3.6133453249931335e-05, -3.426801413297653e-05, -3.240257501602173e-05, -3.0537135899066925e-05, -2.867169678211212e-05, -2.6806257665157318e-05, -2.4940818548202515e-05, -2.307537943124771e-05, -2.1209940314292908e-05, -1.9344501197338104e-05, -1.74790620803833e-05, -1.5613622963428497e-05, -1.3748183846473694e-05, -1.188274472951889e-05, -1.0017305612564087e-05, -8.151866495609283e-06, -6.28642737865448e-06, -4.4209882616996765e-06, -2.555549144744873e-06, -6.901100277900696e-07, 1.1753290891647339e-06, 3.0407682061195374e-06, 4.906207323074341e-06, 6.771646440029144e-06, 8.637085556983948e-06, 1.0502524673938751e-05, 1.2367963790893555e-05, 1.4233402907848358e-05, 1.609884202480316e-05, 1.7964281141757965e-05, 1.982972025871277e-05, 2.1695159375667572e-05, 2.3560598492622375e-05, 2.542603760957718e-05, 2.7291476726531982e-05, 2.9156915843486786e-05, 3.102235496044159e-05, 3.288779407739639e-05, 3.4753233194351196e-05, 3.6618672311306e-05, 3.84841114282608e-05, 4.034955054521561e-05, 4.221498966217041e-05, 4.4080428779125214e-05, 4.594586789608002e-05, 4.781130701303482e-05, 4.9676746129989624e-05, 5.154218524694443e-05, 5.340762436389923e-05, 5.5273063480854034e-05, 5.713850259780884e-05, 5.900394171476364e-05, 6.0869380831718445e-05, 6.273481994867325e-05, 6.460025906562805e-05, 6.646569818258286e-05, 6.833113729953766e-05, 7.019657641649246e-05, 7.206201553344727e-05]}, "gradients/encoder.encoder.layers.5.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 3.0, 0.0, 2.0, 3.0, 5.0, 13.0, 15.0, 27.0, 39.0, 105.0, 208.0, 515.0, 1830.0, 10297.0, 229421.0, 786421.0, 16019.0, 2535.0, 643.0, 239.0, 112.0, 50.0, 30.0, 14.0, 5.0, 10.0, 2.0, 4.0, 3.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1727294921875, -0.16734886169433594, -0.16196823120117188, -0.1565876007080078, -0.15120697021484375, -0.1458263397216797, -0.14044570922851562, -0.13506507873535156, -0.1296844482421875, -0.12430381774902344, -0.11892318725585938, -0.11354255676269531, -0.10816192626953125, -0.10278129577636719, -0.09740066528320312, -0.09202003479003906, -0.086639404296875, -0.08125877380371094, -0.07587814331054688, -0.07049751281738281, -0.06511688232421875, -0.05973625183105469, -0.054355621337890625, -0.04897499084472656, -0.0435943603515625, -0.03821372985839844, -0.032833099365234375, -0.027452468872070312, -0.02207183837890625, -0.016691207885742188, -0.011310577392578125, -0.0059299468994140625, -0.00054931640625, 0.0048313140869140625, 0.010211944580078125, 0.015592575073242188, 0.02097320556640625, 0.026353836059570312, 0.031734466552734375, 0.03711509704589844, 0.0424957275390625, 0.04787635803222656, 0.053256988525390625, 0.05863761901855469, 0.06401824951171875, 0.06939888000488281, 0.07477951049804688, 0.08016014099121094, 0.085540771484375, 0.09092140197753906, 0.09630203247070312, 0.10168266296386719, 0.10706329345703125, 0.11244392395019531, 0.11782455444335938, 0.12320518493652344, 0.1285858154296875, 0.13396644592285156, 0.13934707641601562, 0.1447277069091797, 0.15010833740234375, 0.1554889678955078, 0.16086959838867188, 0.16625022888183594, 0.171630859375]}, "gradients/encoder.encoder.layers.5.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 3.0, 4.0, 4.0, 2.0, 3.0, 10.0, 13.0, 7.0, 8.0, 15.0, 21.0, 44.0, 72.0, 93.0, 138.0, 169.0, 148.0, 92.0, 57.0, 24.0, 28.0, 7.0, 12.0, 8.0, 7.0, 8.0, 1.0, 8.0, 4.0, 2.0, 1.0, 0.0, 2.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.10028076171875, -0.09728240966796875, -0.0942840576171875, -0.09128570556640625, -0.088287353515625, -0.08528900146484375, -0.0822906494140625, -0.07929229736328125, -0.0762939453125, -0.07329559326171875, -0.0702972412109375, -0.06729888916015625, -0.064300537109375, -0.06130218505859375, -0.0583038330078125, -0.05530548095703125, -0.05230712890625, -0.04930877685546875, -0.0463104248046875, -0.04331207275390625, -0.040313720703125, -0.03731536865234375, -0.0343170166015625, -0.03131866455078125, -0.0283203125, -0.02532196044921875, -0.0223236083984375, -0.01932525634765625, -0.016326904296875, -0.01332855224609375, -0.0103302001953125, -0.00733184814453125, -0.00433349609375, -0.00133514404296875, 0.0016632080078125, 0.00466156005859375, 0.007659912109375, 0.01065826416015625, 0.0136566162109375, 0.01665496826171875, 0.0196533203125, 0.02265167236328125, 0.0256500244140625, 0.02864837646484375, 0.031646728515625, 0.03464508056640625, 0.0376434326171875, 0.04064178466796875, 0.04364013671875, 0.04663848876953125, 0.0496368408203125, 0.05263519287109375, 0.055633544921875, 0.05863189697265625, 0.0616302490234375, 0.06462860107421875, 0.067626953125, 0.07062530517578125, 0.0736236572265625, 0.07662200927734375, 0.079620361328125, 0.08261871337890625, 0.0856170654296875, 0.08861541748046875, 0.09161376953125]}, "gradients/encoder.encoder.layers.5.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 4.0, 5.0, 1.0, 11.0, 16.0, 40.0, 78.0, 154.0, 318.0, 173.0, 95.0, 62.0, 24.0, 13.0, 6.0, 0.0, 2.0, 2.0, 1.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.1227328777313232, -1.0651572942733765, -1.0075817108154297, -0.9500061273574829, -0.8924304842948914, -0.8348549008369446, -0.7772793173789978, -0.7197036743164062, -0.6621280908584595, -0.6045525074005127, -0.5469769239425659, -0.48940131068229675, -0.4318256974220276, -0.3742501139640808, -0.31667453050613403, -0.25909891724586487, -0.20152336359024048, -0.1439477652311325, -0.08637217432260513, -0.02879658341407776, 0.028779014945030212, 0.08635461330413818, 0.14393019676208496, 0.20150581002235413, 0.2590813934803009, 0.3166569769382477, 0.37423259019851685, 0.4318081736564636, 0.4893837571144104, 0.546959400177002, 0.6045349836349487, 0.6621105670928955, 0.7196861505508423, 0.7772617340087891, 0.8348373174667358, 0.8924129009246826, 0.9499885439872742, 1.0075640678405762, 1.0651397705078125, 1.1227153539657593, 1.180290937423706, 1.2378665208816528, 1.2954421043395996, 1.3530176877975464, 1.4105932712554932, 1.4681689739227295, 1.5257444381713867, 1.583320140838623, 1.6408956050872803, 1.698471188545227, 1.7560467720031738, 1.8136223554611206, 1.8711979389190674, 1.9287736415863037, 1.986349105834961, 2.0439248085021973, 2.1015005111694336, 2.15907621383667, 2.216651678085327, 2.2742273807525635, 2.3318028450012207, 2.389378547668457, 2.4469540119171143, 2.5045297145843506, 2.562105178833008]}, "gradients/encoder.encoder.layers.5.layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 1.0, 3.0, 0.0, 0.0, 4.0, 4.0, 5.0, 3.0, 7.0, 5.0, 9.0, 9.0, 8.0, 9.0, 11.0, 11.0, 19.0, 17.0, 28.0, 18.0, 24.0, 26.0, 44.0, 32.0, 23.0, 44.0, 49.0, 55.0, 64.0, 57.0, 47.0, 38.0, 27.0, 36.0, 35.0, 36.0, 21.0, 18.0, 17.0, 14.0, 18.0, 15.0, 20.0, 7.0, 15.0, 10.0, 11.0, 7.0, 6.0, 6.0, 8.0, 4.0, 3.0, 0.0, 7.0, 3.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6883071660995483, -0.6660804748535156, -0.6438537836074829, -0.6216270327568054, -0.5994003415107727, -0.57717365026474, -0.5549468994140625, -0.5327202081680298, -0.5104935169219971, -0.48826682567596436, -0.46604010462760925, -0.44381338357925415, -0.42158669233322144, -0.3993600010871887, -0.3771332800388336, -0.3549065589904785, -0.3326798677444458, -0.3104531764984131, -0.288226455450058, -0.2659997344017029, -0.24377304315567017, -0.22154633700847626, -0.19931963086128235, -0.17709292471408844, -0.15486621856689453, -0.13263951241970062, -0.11041280627250671, -0.0881861001253128, -0.0659593939781189, -0.04373268783092499, -0.02150598168373108, 0.0007207244634628296, 0.02294743061065674, 0.04517413675785065, 0.06740084290504456, 0.08962754905223846, 0.11185425519943237, 0.13408096134662628, 0.1563076674938202, 0.1785343736410141, 0.200761079788208, 0.22298778593540192, 0.24521449208259583, 0.2674412131309509, 0.28966790437698364, 0.31189459562301636, 0.33412131667137146, 0.35634803771972656, 0.3785747289657593, 0.400801420211792, 0.4230281412601471, 0.4452548623085022, 0.4674815535545349, 0.4897082448005676, 0.5119349956512451, 0.5341616868972778, 0.5563883781433105, 0.5786150693893433, 0.600841760635376, 0.6230685114860535, 0.6452952027320862, 0.6675218939781189, 0.6897486448287964, 0.7119753360748291, 0.7342020273208618]}, "gradients/encoder.encoder.layers.4.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 3.0, 0.0, 0.0, 1.0, 0.0, 1.0, 3.0, 4.0, 1.0, 2.0, 2.0, 3.0, 4.0, 4.0, 4.0, 4.0, 10.0, 16.0, 22.0, 40.0, 49.0, 81.0, 117.0, 166.0, 248.0, 462.0, 958.0, 2557.0, 10366.0, 175059.0, 3919319.0, 72065.0, 8804.0, 2173.0, 792.0, 436.0, 213.0, 111.0, 70.0, 34.0, 28.0, 23.0, 13.0, 4.0, 10.0, 4.0, 3.0, 3.0, 1.0, 3.0, 3.0], "bins": [-0.318115234375, -0.31078338623046875, -0.3034515380859375, -0.29611968994140625, -0.288787841796875, -0.28145599365234375, -0.2741241455078125, -0.26679229736328125, -0.25946044921875, -0.25212860107421875, -0.2447967529296875, -0.23746490478515625, -0.230133056640625, -0.22280120849609375, -0.2154693603515625, -0.20813751220703125, -0.2008056640625, -0.19347381591796875, -0.1861419677734375, -0.17881011962890625, -0.171478271484375, -0.16414642333984375, -0.1568145751953125, -0.14948272705078125, -0.14215087890625, -0.13481903076171875, -0.1274871826171875, -0.12015533447265625, -0.112823486328125, -0.10549163818359375, -0.0981597900390625, -0.09082794189453125, -0.08349609375, -0.07616424560546875, -0.0688323974609375, -0.06150054931640625, -0.054168701171875, -0.04683685302734375, -0.0395050048828125, -0.03217315673828125, -0.02484130859375, -0.01750946044921875, -0.0101776123046875, -0.00284576416015625, 0.004486083984375, 0.01181793212890625, 0.0191497802734375, 0.02648162841796875, 0.0338134765625, 0.04114532470703125, 0.0484771728515625, 0.05580902099609375, 0.063140869140625, 0.07047271728515625, 0.0778045654296875, 0.08513641357421875, 0.09246826171875, 0.09980010986328125, 0.1071319580078125, 0.11446380615234375, 0.121795654296875, 0.12912750244140625, 0.1364593505859375, 0.14379119873046875, 0.151123046875]}, "gradients/encoder.encoder.layers.4.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 1.0, 2.0, 7.0, 8.0, 12.0, 20.0, 26.0, 43.0, 49.0, 72.0, 77.0, 108.0, 111.0, 85.0, 79.0, 85.0, 73.0, 51.0, 35.0, 27.0, 13.0, 9.0, 3.0, 11.0, 4.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.115966796875, -0.11325883865356445, -0.1105508804321289, -0.10784292221069336, -0.10513496398925781, -0.10242700576782227, -0.09971904754638672, -0.09701108932495117, -0.09430313110351562, -0.09159517288208008, -0.08888721466064453, -0.08617925643920898, -0.08347129821777344, -0.08076333999633789, -0.07805538177490234, -0.0753474235534668, -0.07263946533203125, -0.0699315071105957, -0.06722354888916016, -0.06451559066772461, -0.06180763244628906, -0.059099674224853516, -0.05639171600341797, -0.05368375778198242, -0.050975799560546875, -0.04826784133911133, -0.04555988311767578, -0.042851924896240234, -0.04014396667480469, -0.03743600845336914, -0.034728050231933594, -0.03202009201049805, -0.0293121337890625, -0.026604175567626953, -0.023896217346191406, -0.02118825912475586, -0.018480300903320312, -0.015772342681884766, -0.013064384460449219, -0.010356426239013672, -0.007648468017578125, -0.004940509796142578, -0.0022325515747070312, 0.0004754066467285156, 0.0031833648681640625, 0.005891323089599609, 0.008599281311035156, 0.011307239532470703, 0.01401519775390625, 0.016723155975341797, 0.019431114196777344, 0.02213907241821289, 0.024847030639648438, 0.027554988861083984, 0.03026294708251953, 0.03297090530395508, 0.035678863525390625, 0.03838682174682617, 0.04109477996826172, 0.043802738189697266, 0.04651069641113281, 0.04921865463256836, 0.051926612854003906, 0.05463457107543945, 0.057342529296875]}, "gradients/encoder.encoder.layers.4.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 1.0, 2.0, 3.0, 1.0, 3.0, 4.0, 7.0, 10.0, 9.0, 18.0, 23.0, 26.0, 24.0, 23.0, 44.0, 66.0, 70.0, 102.0, 177.0, 246.0, 493.0, 1176.0, 4214.0, 27043.0, 1814477.0, 2306661.0, 31557.0, 4757.0, 1464.0, 670.0, 297.0, 199.0, 131.0, 85.0, 64.0, 38.0, 25.0, 18.0, 25.0, 19.0, 9.0, 6.0, 3.0, 0.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.2430419921875, -0.23545074462890625, -0.2278594970703125, -0.22026824951171875, -0.212677001953125, -0.20508575439453125, -0.1974945068359375, -0.18990325927734375, -0.18231201171875, -0.17472076416015625, -0.1671295166015625, -0.15953826904296875, -0.151947021484375, -0.14435577392578125, -0.1367645263671875, -0.12917327880859375, -0.12158203125, -0.11399078369140625, -0.1063995361328125, -0.09880828857421875, -0.091217041015625, -0.08362579345703125, -0.0760345458984375, -0.06844329833984375, -0.06085205078125, -0.05326080322265625, -0.0456695556640625, -0.03807830810546875, -0.030487060546875, -0.02289581298828125, -0.0153045654296875, -0.00771331787109375, -0.0001220703125, 0.00746917724609375, 0.0150604248046875, 0.02265167236328125, 0.030242919921875, 0.03783416748046875, 0.0454254150390625, 0.05301666259765625, 0.06060791015625, 0.06819915771484375, 0.0757904052734375, 0.08338165283203125, 0.090972900390625, 0.09856414794921875, 0.1061553955078125, 0.11374664306640625, 0.121337890625, 0.12892913818359375, 0.1365203857421875, 0.14411163330078125, 0.151702880859375, 0.15929412841796875, 0.1668853759765625, 0.17447662353515625, 0.18206787109375, 0.18965911865234375, 0.1972503662109375, 0.20484161376953125, 0.212432861328125, 0.22002410888671875, 0.2276153564453125, 0.23520660400390625, 0.2427978515625]}, "gradients/encoder.encoder.layers.4.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 2.0, 1.0, 2.0, 3.0, 3.0, 5.0, 7.0, 12.0, 17.0, 29.0, 53.0, 99.0, 227.0, 690.0, 1512.0, 842.0, 289.0, 111.0, 60.0, 42.0, 29.0, 17.0, 7.0, 11.0, 3.0, 4.0, 5.0, 1.0, 1.0, 1.0, 0.0, 3.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.213623046875, -0.20685958862304688, -0.20009613037109375, -0.19333267211914062, -0.1865692138671875, -0.17980575561523438, -0.17304229736328125, -0.16627883911132812, -0.159515380859375, -0.15275192260742188, -0.14598846435546875, -0.13922500610351562, -0.1324615478515625, -0.12569808959960938, -0.11893463134765625, -0.11217117309570312, -0.10540771484375, -0.09864425659179688, -0.09188079833984375, -0.08511734008789062, -0.0783538818359375, -0.07159042358398438, -0.06482696533203125, -0.058063507080078125, -0.051300048828125, -0.044536590576171875, -0.03777313232421875, -0.031009674072265625, -0.0242462158203125, -0.017482757568359375, -0.01071929931640625, -0.003955841064453125, 0.0028076171875, 0.009571075439453125, 0.01633453369140625, 0.023097991943359375, 0.0298614501953125, 0.036624908447265625, 0.04338836669921875, 0.050151824951171875, 0.056915283203125, 0.06367874145507812, 0.07044219970703125, 0.07720565795898438, 0.0839691162109375, 0.09073257446289062, 0.09749603271484375, 0.10425949096679688, 0.11102294921875, 0.11778640747070312, 0.12454986572265625, 0.13131332397460938, 0.1380767822265625, 0.14484024047851562, 0.15160369873046875, 0.15836715698242188, 0.165130615234375, 0.17189407348632812, 0.17865753173828125, 0.18542098999023438, 0.1921844482421875, 0.19894790649414062, 0.20571136474609375, 0.21247482299804688, 0.21923828125]}, "gradients/encoder.encoder.layers.4.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 5.0, 5.0, 2.0, 1.0, 6.0, 6.0, 7.0, 24.0, 25.0, 49.0, 99.0, 184.0, 184.0, 164.0, 96.0, 56.0, 34.0, 22.0, 4.0, 11.0, 10.0, 4.0, 1.0, 1.0, 3.0, 3.0, 3.0, 1.0, 1.0, 2.0, 1.0, 0.0, 2.0], "bins": [-2.1288230419158936, -2.0794289112091064, -2.0300347805023193, -1.9806406497955322, -1.9312465190887451, -1.881852388381958, -1.832458257675171, -1.7830641269683838, -1.7336699962615967, -1.6842758655548096, -1.6348817348480225, -1.5854876041412354, -1.5360934734344482, -1.4866993427276611, -1.437305212020874, -1.387911081314087, -1.3385170698165894, -1.2891229391098022, -1.2397288084030151, -1.190334677696228, -1.140940546989441, -1.0915464162826538, -1.0421524047851562, -0.9927582144737244, -0.9433640837669373, -0.8939699530601501, -0.844575822353363, -0.7951817512512207, -0.7457876205444336, -0.6963934898376465, -0.6469993591308594, -0.5976052284240723, -0.5482112169265747, -0.4988170862197876, -0.4494229555130005, -0.40002885460853577, -0.35063472390174866, -0.30124059319496155, -0.2518464922904968, -0.20245236158370972, -0.1530582308769226, -0.1036641076207161, -0.05426998436450958, -0.004875868558883667, 0.04451826214790344, 0.09391239285469055, 0.14330649375915527, 0.19270062446594238, 0.2420947551727295, 0.2914888858795166, 0.3408830165863037, 0.39027711749076843, 0.43967124819755554, 0.48906537890434265, 0.5384594798088074, 0.5878536105155945, 0.6372477412223816, 0.6866418719291687, 0.7360360026359558, 0.7854300737380981, 0.8348242044448853, 0.8842183351516724, 0.9336124658584595, 0.9830065965652466, 1.0324007272720337]}, "gradients/encoder.encoder.layers.4.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 2.0, 2.0, 0.0, 3.0, 3.0, 3.0, 9.0, 8.0, 13.0, 11.0, 10.0, 17.0, 16.0, 27.0, 24.0, 23.0, 33.0, 29.0, 32.0, 29.0, 47.0, 47.0, 48.0, 48.0, 59.0, 47.0, 52.0, 50.0, 50.0, 36.0, 35.0, 21.0, 31.0, 21.0, 26.0, 15.0, 21.0, 11.0, 7.0, 7.0, 7.0, 10.0, 6.0, 6.0, 2.0, 2.0, 3.0, 4.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0], "bins": [-0.5960007309913635, -0.5767236351966858, -0.5574465394020081, -0.5381694436073303, -0.5188924074172974, -0.49961528182029724, -0.4803382158279419, -0.46106112003326416, -0.4417840242385864, -0.4225069284439087, -0.40322983264923096, -0.3839527666568756, -0.3646756708621979, -0.34539857506752014, -0.3261215090751648, -0.30684441328048706, -0.2875673174858093, -0.2682902216911316, -0.24901314079761505, -0.2297360599040985, -0.21045896410942078, -0.19118186831474304, -0.1719047874212265, -0.15262770652770996, -0.13335061073303223, -0.11407352238893509, -0.09479643404483795, -0.07551934570074081, -0.05624225735664368, -0.03696516901254654, -0.017688080668449402, 0.0015890002250671387, 0.020866096019744873, 0.04014318436384201, 0.05942027270793915, 0.07869736105203629, 0.09797444939613342, 0.11725153774023056, 0.1365286260843277, 0.15580570697784424, 0.17508280277252197, 0.1943598985671997, 0.21363697946071625, 0.2329140603542328, 0.2521911561489105, 0.27146825194358826, 0.2907453179359436, 0.31002241373062134, 0.3292995095252991, 0.3485766053199768, 0.36785370111465454, 0.3871307671070099, 0.4064078629016876, 0.42568495869636536, 0.4449620246887207, 0.46423912048339844, 0.48351621627807617, 0.5027933120727539, 0.5220704078674316, 0.5413475036621094, 0.5606245994567871, 0.5799016356468201, 0.5991787314414978, 0.6184558272361755, 0.6377329230308533]}, "gradients/encoder.encoder.layers.4.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 2.0, 1.0, 0.0, 4.0, 1.0, 1.0, 2.0, 1.0, 4.0, 4.0, 8.0, 12.0, 8.0, 18.0, 21.0, 34.0, 46.0, 64.0, 130.0, 274.0, 503.0, 1056.0, 2597.0, 7297.0, 24141.0, 116223.0, 571132.0, 260319.0, 45852.0, 11850.0, 3925.0, 1584.0, 624.0, 356.0, 176.0, 103.0, 74.0, 36.0, 31.0, 19.0, 12.0, 3.0, 6.0, 7.0, 4.0, 3.0, 1.0, 0.0, 1.0, 1.0], "bins": [-0.229248046875, -0.2236175537109375, -0.217987060546875, -0.2123565673828125, -0.20672607421875, -0.2010955810546875, -0.195465087890625, -0.1898345947265625, -0.1842041015625, -0.1785736083984375, -0.172943115234375, -0.1673126220703125, -0.16168212890625, -0.1560516357421875, -0.150421142578125, -0.1447906494140625, -0.13916015625, -0.1335296630859375, -0.127899169921875, -0.1222686767578125, -0.11663818359375, -0.1110076904296875, -0.105377197265625, -0.0997467041015625, -0.0941162109375, -0.0884857177734375, -0.082855224609375, -0.0772247314453125, -0.07159423828125, -0.0659637451171875, -0.060333251953125, -0.0547027587890625, -0.049072265625, -0.0434417724609375, -0.037811279296875, -0.0321807861328125, -0.02655029296875, -0.0209197998046875, -0.015289306640625, -0.0096588134765625, -0.0040283203125, 0.0016021728515625, 0.007232666015625, 0.0128631591796875, 0.01849365234375, 0.0241241455078125, 0.029754638671875, 0.0353851318359375, 0.041015625, 0.0466461181640625, 0.052276611328125, 0.0579071044921875, 0.06353759765625, 0.0691680908203125, 0.074798583984375, 0.0804290771484375, 0.0860595703125, 0.0916900634765625, 0.097320556640625, 0.1029510498046875, 0.10858154296875, 0.1142120361328125, 0.119842529296875, 0.1254730224609375, 0.131103515625]}, "gradients/encoder.encoder.layers.4.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 3.0, 0.0, 3.0, 1.0, 0.0, 5.0, 11.0, 8.0, 25.0, 25.0, 41.0, 51.0, 59.0, 78.0, 69.0, 91.0, 88.0, 79.0, 95.0, 69.0, 54.0, 52.0, 36.0, 26.0, 15.0, 11.0, 9.0, 6.0, 3.0, 1.0, 0.0, 1.0, 1.0, 2.0, 1.0, 1.0], "bins": [-0.1280517578125, -0.12516450881958008, -0.12227725982666016, -0.11939001083374023, -0.11650276184082031, -0.11361551284790039, -0.11072826385498047, -0.10784101486206055, -0.10495376586914062, -0.1020665168762207, -0.09917926788330078, -0.09629201889038086, -0.09340476989746094, -0.09051752090454102, -0.0876302719116211, -0.08474302291870117, -0.08185577392578125, -0.07896852493286133, -0.0760812759399414, -0.07319402694702148, -0.07030677795410156, -0.06741952896118164, -0.06453227996826172, -0.0616450309753418, -0.058757781982421875, -0.05587053298950195, -0.05298328399658203, -0.05009603500366211, -0.04720878601074219, -0.044321537017822266, -0.041434288024902344, -0.03854703903198242, -0.0356597900390625, -0.03277254104614258, -0.029885292053222656, -0.026998043060302734, -0.024110794067382812, -0.02122354507446289, -0.01833629608154297, -0.015449047088623047, -0.012561798095703125, -0.009674549102783203, -0.006787300109863281, -0.0039000511169433594, -0.0010128021240234375, 0.0018744468688964844, 0.004761695861816406, 0.007648944854736328, 0.01053619384765625, 0.013423442840576172, 0.016310691833496094, 0.019197940826416016, 0.022085189819335938, 0.02497243881225586, 0.02785968780517578, 0.030746936798095703, 0.033634185791015625, 0.03652143478393555, 0.03940868377685547, 0.04229593276977539, 0.04518318176269531, 0.048070430755615234, 0.050957679748535156, 0.05384492874145508, 0.056732177734375]}, "gradients/encoder.encoder.layers.4.attention.v_proj.weight": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 4.0, 1.0, 2.0, 3.0, 1.0, 3.0, 4.0, 9.0, 9.0, 7.0, 10.0, 10.0, 25.0, 28.0, 55.0, 65.0, 86.0, 123.0, 163.0, 250.0, 353.0, 597.0, 1035.0, 2048.0, 4585.0, 14187.0, 70623.0, 679082.0, 229406.0, 30685.0, 8179.0, 3080.0, 1506.0, 834.0, 520.0, 287.0, 206.0, 136.0, 112.0, 59.0, 45.0, 30.0, 22.0, 34.0, 15.0, 14.0, 7.0, 3.0, 10.0, 2.0, 4.0, 1.0, 4.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.200927734375, -0.19439697265625, -0.1878662109375, -0.18133544921875, -0.1748046875, -0.16827392578125, -0.1617431640625, -0.15521240234375, -0.148681640625, -0.14215087890625, -0.1356201171875, -0.12908935546875, -0.12255859375, -0.11602783203125, -0.1094970703125, -0.10296630859375, -0.096435546875, -0.08990478515625, -0.0833740234375, -0.07684326171875, -0.0703125, -0.06378173828125, -0.0572509765625, -0.05072021484375, -0.044189453125, -0.03765869140625, -0.0311279296875, -0.02459716796875, -0.01806640625, -0.01153564453125, -0.0050048828125, 0.00152587890625, 0.008056640625, 0.01458740234375, 0.0211181640625, 0.02764892578125, 0.0341796875, 0.04071044921875, 0.0472412109375, 0.05377197265625, 0.060302734375, 0.06683349609375, 0.0733642578125, 0.07989501953125, 0.08642578125, 0.09295654296875, 0.0994873046875, 0.10601806640625, 0.112548828125, 0.11907958984375, 0.1256103515625, 0.13214111328125, 0.138671875, 0.14520263671875, 0.1517333984375, 0.15826416015625, 0.164794921875, 0.17132568359375, 0.1778564453125, 0.18438720703125, 0.19091796875, 0.19744873046875, 0.2039794921875, 0.21051025390625, 0.217041015625]}, "gradients/encoder.encoder.layers.4.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 3.0, 1.0, 0.0, 0.0, 1.0, 2.0, 2.0, 4.0, 3.0, 6.0, 2.0, 1.0, 7.0, 4.0, 5.0, 11.0, 15.0, 15.0, 15.0, 25.0, 28.0, 29.0, 30.0, 47.0, 52.0, 42.0, 43.0, 47.0, 49.0, 49.0, 39.0, 42.0, 44.0, 42.0, 37.0, 36.0, 19.0, 32.0, 24.0, 30.0, 24.0, 18.0, 21.0, 15.0, 13.0, 6.0, 4.0, 7.0, 9.0, 2.0, 2.0, 4.0, 0.0, 3.0, 2.0, 1.0, 0.0, 3.0, 2.0, 2.0], "bins": [-0.1929931640625, -0.1872406005859375, -0.181488037109375, -0.1757354736328125, -0.16998291015625, -0.1642303466796875, -0.158477783203125, -0.1527252197265625, -0.14697265625, -0.1412200927734375, -0.135467529296875, -0.1297149658203125, -0.12396240234375, -0.1182098388671875, -0.112457275390625, -0.1067047119140625, -0.1009521484375, -0.0951995849609375, -0.089447021484375, -0.0836944580078125, -0.07794189453125, -0.0721893310546875, -0.066436767578125, -0.0606842041015625, -0.054931640625, -0.0491790771484375, -0.043426513671875, -0.0376739501953125, -0.03192138671875, -0.0261688232421875, -0.020416259765625, -0.0146636962890625, -0.0089111328125, -0.0031585693359375, 0.002593994140625, 0.0083465576171875, 0.01409912109375, 0.0198516845703125, 0.025604248046875, 0.0313568115234375, 0.037109375, 0.0428619384765625, 0.048614501953125, 0.0543670654296875, 0.06011962890625, 0.0658721923828125, 0.071624755859375, 0.0773773193359375, 0.0831298828125, 0.0888824462890625, 0.094635009765625, 0.1003875732421875, 0.10614013671875, 0.1118927001953125, 0.117645263671875, 0.1233978271484375, 0.129150390625, 0.1349029541015625, 0.140655517578125, 0.1464080810546875, 0.15216064453125, 0.1579132080078125, 0.163665771484375, 0.1694183349609375, 0.1751708984375]}, "gradients/encoder.encoder.layers.4.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 2.0, 1.0, 3.0, 3.0, 3.0, 7.0, 8.0, 10.0, 19.0, 20.0, 48.0, 58.0, 81.0, 149.0, 201.0, 351.0, 578.0, 919.0, 1556.0, 2642.0, 4858.0, 9422.0, 22054.0, 68986.0, 546072.0, 296608.0, 55662.0, 19228.0, 8498.0, 4343.0, 2501.0, 1392.0, 914.0, 493.0, 347.0, 189.0, 124.0, 63.0, 59.0, 40.0, 24.0, 11.0, 7.0, 5.0, 4.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0611572265625, -0.058966636657714844, -0.05677604675292969, -0.05458545684814453, -0.052394866943359375, -0.05020427703857422, -0.04801368713378906, -0.045823097229003906, -0.04363250732421875, -0.041441917419433594, -0.03925132751464844, -0.03706073760986328, -0.034870147705078125, -0.03267955780029297, -0.030488967895507812, -0.028298377990722656, -0.0261077880859375, -0.023917198181152344, -0.021726608276367188, -0.01953601837158203, -0.017345428466796875, -0.015154838562011719, -0.012964248657226562, -0.010773658752441406, -0.00858306884765625, -0.006392478942871094, -0.0042018890380859375, -0.0020112991333007812, 0.000179290771484375, 0.0023698806762695312, 0.0045604705810546875, 0.006751060485839844, 0.008941650390625, 0.011132240295410156, 0.013322830200195312, 0.015513420104980469, 0.017704010009765625, 0.01989459991455078, 0.022085189819335938, 0.024275779724121094, 0.02646636962890625, 0.028656959533691406, 0.030847549438476562, 0.03303813934326172, 0.035228729248046875, 0.03741931915283203, 0.03960990905761719, 0.041800498962402344, 0.0439910888671875, 0.046181678771972656, 0.04837226867675781, 0.05056285858154297, 0.052753448486328125, 0.05494403839111328, 0.05713462829589844, 0.059325218200683594, 0.06151580810546875, 0.0637063980102539, 0.06589698791503906, 0.06808757781982422, 0.07027816772460938, 0.07246875762939453, 0.07465934753417969, 0.07684993743896484, 0.07904052734375]}, "gradients/encoder.encoder.layers.4.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 3.0, 2.0, 1.0, 2.0, 3.0, 4.0, 3.0, 3.0, 4.0, 5.0, 3.0, 7.0, 7.0, 20.0, 10.0, 15.0, 18.0, 39.0, 56.0, 86.0, 157.0, 174.0, 123.0, 81.0, 46.0, 30.0, 27.0, 19.0, 8.0, 2.0, 11.0, 8.0, 5.0, 5.0, 5.0, 1.0, 2.0, 4.0, 1.0, 2.0, 4.0, 3.0, 2.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0], "bins": [-3.6835670471191406e-05, -3.5732053220272064e-05, -3.462843596935272e-05, -3.352481871843338e-05, -3.242120146751404e-05, -3.1317584216594696e-05, -3.0213966965675354e-05, -2.9110349714756012e-05, -2.800673246383667e-05, -2.6903115212917328e-05, -2.5799497961997986e-05, -2.4695880711078644e-05, -2.3592263460159302e-05, -2.248864620923996e-05, -2.1385028958320618e-05, -2.0281411707401276e-05, -1.9177794456481934e-05, -1.807417720556259e-05, -1.697055995464325e-05, -1.5866942703723907e-05, -1.4763325452804565e-05, -1.3659708201885223e-05, -1.2556090950965881e-05, -1.145247370004654e-05, -1.0348856449127197e-05, -9.245239198207855e-06, -8.141621947288513e-06, -7.038004696369171e-06, -5.934387445449829e-06, -4.830770194530487e-06, -3.727152943611145e-06, -2.623535692691803e-06, -1.519918441772461e-06, -4.163011908531189e-07, 6.873160600662231e-07, 1.7909333109855652e-06, 2.8945505619049072e-06, 3.998167812824249e-06, 5.101785063743591e-06, 6.205402314662933e-06, 7.309019565582275e-06, 8.412636816501617e-06, 9.51625406742096e-06, 1.0619871318340302e-05, 1.1723488569259644e-05, 1.2827105820178986e-05, 1.3930723071098328e-05, 1.503434032201767e-05, 1.6137957572937012e-05, 1.7241574823856354e-05, 1.8345192074775696e-05, 1.9448809325695038e-05, 2.055242657661438e-05, 2.1656043827533722e-05, 2.2759661078453064e-05, 2.3863278329372406e-05, 2.4966895580291748e-05, 2.607051283121109e-05, 2.7174130082130432e-05, 2.8277747333049774e-05, 2.9381364583969116e-05, 3.0484981834888458e-05, 3.15885990858078e-05, 3.269221633672714e-05, 3.3795833587646484e-05]}, "gradients/encoder.encoder.layers.4.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 3.0, 0.0, 1.0, 3.0, 0.0, 1.0, 6.0, 3.0, 4.0, 12.0, 9.0, 12.0, 21.0, 29.0, 25.0, 55.0, 70.0, 107.0, 196.0, 344.0, 597.0, 1209.0, 2640.0, 6625.0, 20235.0, 94021.0, 728943.0, 151577.0, 27459.0, 8353.0, 3052.0, 1355.0, 635.0, 355.0, 214.0, 123.0, 77.0, 52.0, 40.0, 32.0, 18.0, 14.0, 10.0, 7.0, 1.0, 6.0, 4.0, 7.0, 6.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.09173583984375, -0.08873939514160156, -0.08574295043945312, -0.08274650573730469, -0.07975006103515625, -0.07675361633300781, -0.07375717163085938, -0.07076072692871094, -0.0677642822265625, -0.06476783752441406, -0.061771392822265625, -0.05877494812011719, -0.05577850341796875, -0.05278205871582031, -0.049785614013671875, -0.04678916931152344, -0.043792724609375, -0.04079627990722656, -0.037799835205078125, -0.03480339050292969, -0.03180694580078125, -0.028810501098632812, -0.025814056396484375, -0.022817611694335938, -0.0198211669921875, -0.016824722290039062, -0.013828277587890625, -0.010831832885742188, -0.00783538818359375, -0.0048389434814453125, -0.001842498779296875, 0.0011539459228515625, 0.004150390625, 0.0071468353271484375, 0.010143280029296875, 0.013139724731445312, 0.01613616943359375, 0.019132614135742188, 0.022129058837890625, 0.025125503540039062, 0.0281219482421875, 0.031118392944335938, 0.034114837646484375, 0.03711128234863281, 0.04010772705078125, 0.04310417175292969, 0.046100616455078125, 0.04909706115722656, 0.052093505859375, 0.05508995056152344, 0.058086395263671875, 0.06108283996582031, 0.06407928466796875, 0.06707572937011719, 0.07007217407226562, 0.07306861877441406, 0.0760650634765625, 0.07906150817871094, 0.08205795288085938, 0.08505439758300781, 0.08805084228515625, 0.09104728698730469, 0.09404373168945312, 0.09704017639160156, 0.10003662109375]}, "gradients/encoder.encoder.layers.4.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 4.0, 4.0, 2.0, 3.0, 5.0, 6.0, 3.0, 9.0, 7.0, 3.0, 8.0, 17.0, 18.0, 27.0, 39.0, 39.0, 56.0, 72.0, 91.0, 78.0, 95.0, 84.0, 78.0, 60.0, 44.0, 35.0, 25.0, 17.0, 18.0, 9.0, 13.0, 7.0, 9.0, 3.0, 7.0, 1.0, 3.0, 2.0, 3.0, 0.0, 0.0, 0.0, 1.0, 4.0, 0.0, 0.0, 0.0, 1.0, 1.0, 3.0, 2.0], "bins": [-0.07421875, -0.0720052719116211, -0.06979179382324219, -0.06757831573486328, -0.06536483764648438, -0.06315135955810547, -0.06093788146972656, -0.058724403381347656, -0.05651092529296875, -0.054297447204589844, -0.05208396911621094, -0.04987049102783203, -0.047657012939453125, -0.04544353485107422, -0.04323005676269531, -0.041016578674316406, -0.0388031005859375, -0.036589622497558594, -0.03437614440917969, -0.03216266632080078, -0.029949188232421875, -0.02773571014404297, -0.025522232055664062, -0.023308753967285156, -0.02109527587890625, -0.018881797790527344, -0.016668319702148438, -0.014454841613769531, -0.012241363525390625, -0.010027885437011719, -0.007814407348632812, -0.005600929260253906, -0.003387451171875, -0.0011739730834960938, 0.0010395050048828125, 0.0032529830932617188, 0.005466461181640625, 0.007679939270019531, 0.009893417358398438, 0.012106895446777344, 0.01432037353515625, 0.016533851623535156, 0.018747329711914062, 0.02096080780029297, 0.023174285888671875, 0.02538776397705078, 0.027601242065429688, 0.029814720153808594, 0.0320281982421875, 0.034241676330566406, 0.03645515441894531, 0.03866863250732422, 0.040882110595703125, 0.04309558868408203, 0.04530906677246094, 0.047522544860839844, 0.04973602294921875, 0.051949501037597656, 0.05416297912597656, 0.05637645721435547, 0.058589935302734375, 0.06080341339111328, 0.06301689147949219, 0.0652303695678711, 0.06744384765625]}, "gradients/encoder.encoder.layers.4.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 3.0, 1.0, 1.0, 0.0, 2.0, 5.0, 5.0, 21.0, 28.0, 69.0, 115.0, 225.0, 252.0, 122.0, 71.0, 38.0, 22.0, 14.0, 7.0, 2.0, 3.0, 3.0, 0.0, 3.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.1968374252319336, -2.130263328552246, -2.0636892318725586, -1.997115135192871, -1.9305410385131836, -1.863966941833496, -1.7973928451538086, -1.730818748474121, -1.6642446517944336, -1.597670555114746, -1.5310964584350586, -1.464522361755371, -1.3979482650756836, -1.331374168395996, -1.2648000717163086, -1.198225975036621, -1.1316518783569336, -1.065077781677246, -0.9985036849975586, -0.9319295883178711, -0.8653554916381836, -0.7987813949584961, -0.7322072982788086, -0.6656332015991211, -0.5990591049194336, -0.5324850082397461, -0.4659109115600586, -0.3993368148803711, -0.3327627182006836, -0.2661886215209961, -0.1996145248413086, -0.1330404281616211, -0.06646609306335449, 0.00010800361633300781, 0.06668210029602051, 0.133256196975708, 0.1998302936553955, 0.266404390335083, 0.3329784870147705, 0.399552583694458, 0.4661266803741455, 0.532700777053833, 0.5992748737335205, 0.665848970413208, 0.7324230670928955, 0.798997163772583, 0.8655712604522705, 0.932145357131958, 0.9987194538116455, 1.065293550491333, 1.1318676471710205, 1.198441743850708, 1.2650158405303955, 1.331589937210083, 1.3981640338897705, 1.464738130569458, 1.5313122272491455, 1.597886323928833, 1.6644604206085205, 1.731034517288208, 1.7976086139678955, 1.864182710647583, 1.9307568073272705, 1.997330904006958, 2.0639050006866455]}, "gradients/encoder.encoder.layers.4.layer_norm.bias": {"_type": "histogram", "values": [2.0, 1.0, 1.0, 0.0, 1.0, 2.0, 1.0, 3.0, 1.0, 9.0, 2.0, 9.0, 8.0, 12.0, 10.0, 7.0, 11.0, 12.0, 22.0, 16.0, 22.0, 13.0, 24.0, 34.0, 33.0, 29.0, 37.0, 36.0, 46.0, 64.0, 71.0, 69.0, 67.0, 39.0, 33.0, 42.0, 34.0, 15.0, 26.0, 19.0, 13.0, 20.0, 24.0, 16.0, 11.0, 11.0, 10.0, 6.0, 9.0, 5.0, 2.0, 2.0, 3.0, 3.0, 4.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7328797578811646, -0.7089084982872009, -0.6849372386932373, -0.6609660387039185, -0.6369947791099548, -0.6130235195159912, -0.5890522599220276, -0.565081000328064, -0.5411098003387451, -0.5171385407447815, -0.49316731095314026, -0.46919605135917664, -0.4452248215675354, -0.4212535619735718, -0.39728230237960815, -0.37331104278564453, -0.3493397831916809, -0.3253685235977173, -0.30139729380607605, -0.2774260342121124, -0.2534548044204712, -0.22948354482650757, -0.20551228523254395, -0.18154104053974152, -0.1575697958469391, -0.13359855115413666, -0.10962729901075363, -0.0856560468673706, -0.061684802174568176, -0.03771355748176575, -0.013742297887802124, 0.010228946805000305, 0.03420025110244751, 0.05817149952054024, 0.08214274793863297, 0.10611400008201599, 0.13008524477481842, 0.15405648946762085, 0.17802774906158447, 0.2019989937543869, 0.22597023844718933, 0.24994148313999176, 0.2739127278327942, 0.2978839874267578, 0.32185524702072144, 0.34582647681236267, 0.3697977364063263, 0.39376896619796753, 0.41774022579193115, 0.4417114853858948, 0.465682715177536, 0.48965397477149963, 0.5136252045631409, 0.5375964641571045, 0.5615677237510681, 0.5855389833450317, 0.6095101833343506, 0.6334814429283142, 0.6574527025222778, 0.6814239025115967, 0.7053951621055603, 0.7293664216995239, 0.7533376812934875, 0.7773089408874512, 0.8012802004814148]}, "gradients/encoder.encoder.layers.3.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0, 2.0, 0.0, 4.0, 2.0, 1.0, 4.0, 4.0, 11.0, 7.0, 7.0, 19.0, 31.0, 47.0, 62.0, 122.0, 199.0, 394.0, 1078.0, 3934.0, 30024.0, 3282362.0, 852971.0, 18537.0, 2996.0, 788.0, 313.0, 146.0, 97.0, 49.0, 29.0, 14.0, 12.0, 9.0, 4.0, 3.0, 6.0, 2.0, 1.0, 3.0, 1.0, 0.0, 1.0], "bins": [-0.3447265625, -0.3366851806640625, -0.328643798828125, -0.3206024169921875, -0.31256103515625, -0.3045196533203125, -0.296478271484375, -0.2884368896484375, -0.2803955078125, -0.2723541259765625, -0.264312744140625, -0.2562713623046875, -0.24822998046875, -0.2401885986328125, -0.232147216796875, -0.2241058349609375, -0.216064453125, -0.2080230712890625, -0.199981689453125, -0.1919403076171875, -0.18389892578125, -0.1758575439453125, -0.167816162109375, -0.1597747802734375, -0.1517333984375, -0.1436920166015625, -0.135650634765625, -0.1276092529296875, -0.11956787109375, -0.1115264892578125, -0.103485107421875, -0.0954437255859375, -0.08740234375, -0.0793609619140625, -0.071319580078125, -0.0632781982421875, -0.05523681640625, -0.0471954345703125, -0.039154052734375, -0.0311126708984375, -0.0230712890625, -0.0150299072265625, -0.006988525390625, 0.0010528564453125, 0.00909423828125, 0.0171356201171875, 0.025177001953125, 0.0332183837890625, 0.041259765625, 0.0493011474609375, 0.057342529296875, 0.0653839111328125, 0.07342529296875, 0.0814666748046875, 0.089508056640625, 0.0975494384765625, 0.1055908203125, 0.1136322021484375, 0.121673583984375, 0.1297149658203125, 0.13775634765625, 0.1457977294921875, 0.153839111328125, 0.1618804931640625, 0.169921875]}, "gradients/encoder.encoder.layers.3.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 0.0, 2.0, 2.0, 3.0, 5.0, 13.0, 22.0, 37.0, 44.0, 44.0, 63.0, 72.0, 80.0, 99.0, 85.0, 107.0, 78.0, 72.0, 51.0, 42.0, 33.0, 19.0, 14.0, 7.0, 9.0, 5.0, 4.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0], "bins": [-0.1253662109375, -0.12258625030517578, -0.11980628967285156, -0.11702632904052734, -0.11424636840820312, -0.1114664077758789, -0.10868644714355469, -0.10590648651123047, -0.10312652587890625, -0.10034656524658203, -0.09756660461425781, -0.0947866439819336, -0.09200668334960938, -0.08922672271728516, -0.08644676208496094, -0.08366680145263672, -0.0808868408203125, -0.07810688018798828, -0.07532691955566406, -0.07254695892333984, -0.06976699829101562, -0.0669870376586914, -0.06420707702636719, -0.06142711639404297, -0.05864715576171875, -0.05586719512939453, -0.05308723449707031, -0.050307273864746094, -0.047527313232421875, -0.044747352600097656, -0.04196739196777344, -0.03918743133544922, -0.036407470703125, -0.03362751007080078, -0.030847549438476562, -0.028067588806152344, -0.025287628173828125, -0.022507667541503906, -0.019727706909179688, -0.01694774627685547, -0.01416778564453125, -0.011387825012207031, -0.008607864379882812, -0.005827903747558594, -0.003047943115234375, -0.00026798248291015625, 0.0025119781494140625, 0.005291938781738281, 0.0080718994140625, 0.010851860046386719, 0.013631820678710938, 0.016411781311035156, 0.019191741943359375, 0.021971702575683594, 0.024751663208007812, 0.02753162384033203, 0.03031158447265625, 0.03309154510498047, 0.03587150573730469, 0.038651466369628906, 0.041431427001953125, 0.044211387634277344, 0.04699134826660156, 0.04977130889892578, 0.05255126953125]}, "gradients/encoder.encoder.layers.3.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 2.0, 1.0, 3.0, 3.0, 4.0, 7.0, 9.0, 18.0, 18.0, 32.0, 56.0, 59.0, 99.0, 164.0, 309.0, 733.0, 2857.0, 41751.0, 4110041.0, 34522.0, 2227.0, 579.0, 261.0, 147.0, 117.0, 82.0, 58.0, 33.0, 24.0, 23.0, 12.0, 11.0, 8.0, 6.0, 7.0, 4.0, 4.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.48583984375, -0.4693603515625, -0.452880859375, -0.4364013671875, -0.419921875, -0.4034423828125, -0.386962890625, -0.3704833984375, -0.35400390625, -0.3375244140625, -0.321044921875, -0.3045654296875, -0.2880859375, -0.2716064453125, -0.255126953125, -0.2386474609375, -0.22216796875, -0.2056884765625, -0.189208984375, -0.1727294921875, -0.15625, -0.1397705078125, -0.123291015625, -0.1068115234375, -0.09033203125, -0.0738525390625, -0.057373046875, -0.0408935546875, -0.0244140625, -0.0079345703125, 0.008544921875, 0.0250244140625, 0.04150390625, 0.0579833984375, 0.074462890625, 0.0909423828125, 0.107421875, 0.1239013671875, 0.140380859375, 0.1568603515625, 0.17333984375, 0.1898193359375, 0.206298828125, 0.2227783203125, 0.2392578125, 0.2557373046875, 0.272216796875, 0.2886962890625, 0.30517578125, 0.3216552734375, 0.338134765625, 0.3546142578125, 0.37109375, 0.3875732421875, 0.404052734375, 0.4205322265625, 0.43701171875, 0.4534912109375, 0.469970703125, 0.4864501953125, 0.5029296875, 0.5194091796875, 0.535888671875, 0.5523681640625, 0.56884765625]}, "gradients/encoder.encoder.layers.3.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 3.0, 1.0, 1.0, 4.0, 4.0, 4.0, 15.0, 25.0, 30.0, 67.0, 159.0, 754.0, 2038.0, 708.0, 157.0, 58.0, 26.0, 18.0, 7.0, 6.0, 2.0, 0.0, 0.0, 2.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1748046875, -0.1634979248046875, -0.152191162109375, -0.1408843994140625, -0.12957763671875, -0.1182708740234375, -0.106964111328125, -0.0956573486328125, -0.0843505859375, -0.0730438232421875, -0.061737060546875, -0.0504302978515625, -0.03912353515625, -0.0278167724609375, -0.016510009765625, -0.0052032470703125, 0.006103515625, 0.0174102783203125, 0.028717041015625, 0.0400238037109375, 0.05133056640625, 0.0626373291015625, 0.073944091796875, 0.0852508544921875, 0.0965576171875, 0.1078643798828125, 0.119171142578125, 0.1304779052734375, 0.14178466796875, 0.1530914306640625, 0.164398193359375, 0.1757049560546875, 0.18701171875, 0.1983184814453125, 0.209625244140625, 0.2209320068359375, 0.23223876953125, 0.2435455322265625, 0.254852294921875, 0.2661590576171875, 0.2774658203125, 0.2887725830078125, 0.300079345703125, 0.3113861083984375, 0.32269287109375, 0.3339996337890625, 0.345306396484375, 0.3566131591796875, 0.367919921875, 0.3792266845703125, 0.390533447265625, 0.4018402099609375, 0.41314697265625, 0.4244537353515625, 0.435760498046875, 0.4470672607421875, 0.4583740234375, 0.4696807861328125, 0.480987548828125, 0.4922943115234375, 0.50360107421875, 0.5149078369140625, 0.526214599609375, 0.5375213623046875, 0.548828125]}, "gradients/encoder.encoder.layers.3.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 2.0, 1.0, 2.0, 2.0, 11.0, 22.0, 36.0, 48.0, 88.0, 153.0, 176.0, 178.0, 127.0, 66.0, 35.0, 15.0, 10.0, 12.0, 6.0, 4.0, 1.0, 7.0, 5.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.3489787578582764, -2.2850213050842285, -2.2210640907287598, -2.157106637954712, -2.093149423599243, -2.0291919708251953, -1.965234637260437, -1.9012773036956787, -1.8373199701309204, -1.773362636566162, -1.7094053030014038, -1.6454479694366455, -1.5814905166625977, -1.517533302307129, -1.453575849533081, -1.3896185159683228, -1.3256611824035645, -1.2617038488388062, -1.1977465152740479, -1.1337891817092896, -1.0698318481445312, -1.0058743953704834, -0.9419170618057251, -0.8779597282409668, -0.8140023946762085, -0.7500450611114502, -0.6860877275466919, -0.6221303343772888, -0.5581730008125305, -0.4942156672477722, -0.43025830388069153, -0.36630094051361084, -0.302343487739563, -0.2383861392736435, -0.174428790807724, -0.1104714423418045, -0.04651409387588501, 0.01744323968887329, 0.08140060305595398, 0.14535796642303467, 0.20931529998779297, 0.27327263355255127, 0.33722999691963196, 0.40118736028671265, 0.46514469385147095, 0.5291020274162292, 0.5930594205856323, 0.6570167541503906, 0.7209740877151489, 0.7849314212799072, 0.8488887548446655, 0.9128461480140686, 0.9768034815788269, 1.0407607555389404, 1.1047182083129883, 1.1686755418777466, 1.2326328754425049, 1.2965902090072632, 1.3605475425720215, 1.4245048761367798, 1.488462209701538, 1.552419662475586, 1.6163769960403442, 1.6803343296051025, 1.7442916631698608]}, "gradients/encoder.encoder.layers.3.final_layer_norm.bias": {"_type": "histogram", "values": [3.0, 2.0, 1.0, 1.0, 2.0, 2.0, 3.0, 9.0, 5.0, 10.0, 13.0, 15.0, 16.0, 23.0, 31.0, 40.0, 36.0, 38.0, 53.0, 60.0, 52.0, 65.0, 58.0, 54.0, 53.0, 41.0, 39.0, 52.0, 42.0, 39.0, 32.0, 35.0, 29.0, 14.0, 15.0, 9.0, 11.0, 7.0, 6.0, 1.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.5868810415267944, -0.5615463852882385, -0.5362116694450378, -0.5108770132064819, -0.48554229736328125, -0.46020764112472534, -0.43487292528152466, -0.40953826904296875, -0.38420355319976807, -0.35886886715888977, -0.3335341811180115, -0.3081994950771332, -0.2828648090362549, -0.257530152797699, -0.23219545185565948, -0.2068607658147812, -0.1815260946750641, -0.1561914086341858, -0.1308567225933075, -0.1055220440030098, -0.0801873579621315, -0.0548526793718338, -0.029517993330955505, -0.0041833072900772095, 0.021151378750801086, 0.04648606479167938, 0.07182075083255768, 0.09715542942285538, 0.12249011546373367, 0.14782479405403137, 0.17315948009490967, 0.19849416613578796, 0.22382885217666626, 0.24916353821754456, 0.27449822425842285, 0.29983291029930115, 0.32516759634017944, 0.35050225257873535, 0.37583696842193604, 0.40117162466049194, 0.4265063405036926, 0.4518410265445709, 0.4771757125854492, 0.5025103688240051, 0.5278450846672058, 0.5531797409057617, 0.5785144567489624, 0.6038491129875183, 0.6291837692260742, 0.6545184254646301, 0.6798531413078308, 0.7051877975463867, 0.7305225133895874, 0.7558571696281433, 0.781191885471344, 0.8065265417098999, 0.8318612575531006, 0.8571959137916565, 0.8825306296348572, 0.9078652858734131, 0.9332000017166138, 0.9585346579551697, 0.9838693737983704, 1.0092040300369263, 1.034538745880127]}, "gradients/encoder.encoder.layers.3.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 4.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 3.0, 1.0, 1.0, 4.0, 9.0, 9.0, 7.0, 30.0, 26.0, 52.0, 75.0, 128.0, 230.0, 412.0, 809.0, 1532.0, 3455.0, 8049.0, 20745.0, 60022.0, 178617.0, 368388.0, 261071.0, 93115.0, 30932.0, 11662.0, 4765.0, 2111.0, 1100.0, 498.0, 290.0, 144.0, 102.0, 55.0, 39.0, 24.0, 12.0, 9.0, 14.0, 7.0, 4.0, 2.0, 0.0, 1.0, 0.0, 3.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.11566162109375, -0.1122293472290039, -0.10879707336425781, -0.10536479949951172, -0.10193252563476562, -0.09850025177001953, -0.09506797790527344, -0.09163570404052734, -0.08820343017578125, -0.08477115631103516, -0.08133888244628906, -0.07790660858154297, -0.07447433471679688, -0.07104206085205078, -0.06760978698730469, -0.0641775131225586, -0.0607452392578125, -0.057312965393066406, -0.05388069152832031, -0.05044841766357422, -0.047016143798828125, -0.04358386993408203, -0.04015159606933594, -0.036719322204589844, -0.03328704833984375, -0.029854774475097656, -0.026422500610351562, -0.02299022674560547, -0.019557952880859375, -0.01612567901611328, -0.012693405151367188, -0.009261131286621094, -0.005828857421875, -0.0023965835571289062, 0.0010356903076171875, 0.004467964172363281, 0.007900238037109375, 0.011332511901855469, 0.014764785766601562, 0.018197059631347656, 0.02162933349609375, 0.025061607360839844, 0.028493881225585938, 0.03192615509033203, 0.035358428955078125, 0.03879070281982422, 0.04222297668457031, 0.045655250549316406, 0.0490875244140625, 0.052519798278808594, 0.05595207214355469, 0.05938434600830078, 0.06281661987304688, 0.06624889373779297, 0.06968116760253906, 0.07311344146728516, 0.07654571533203125, 0.07997798919677734, 0.08341026306152344, 0.08684253692626953, 0.09027481079101562, 0.09370708465576172, 0.09713935852050781, 0.1005716323852539, 0.10400390625]}, "gradients/encoder.encoder.layers.3.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 2.0, 4.0, 3.0, 6.0, 6.0, 18.0, 10.0, 19.0, 25.0, 48.0, 57.0, 70.0, 76.0, 60.0, 67.0, 87.0, 65.0, 79.0, 68.0, 62.0, 41.0, 43.0, 34.0, 25.0, 12.0, 6.0, 12.0, 4.0, 3.0, 2.0, 2.0, 0.0, 2.0, 0.0, 2.0], "bins": [-0.1256103515625, -0.12277889251708984, -0.11994743347167969, -0.11711597442626953, -0.11428451538085938, -0.11145305633544922, -0.10862159729003906, -0.1057901382446289, -0.10295867919921875, -0.1001272201538086, -0.09729576110839844, -0.09446430206298828, -0.09163284301757812, -0.08880138397216797, -0.08596992492675781, -0.08313846588134766, -0.0803070068359375, -0.07747554779052734, -0.07464408874511719, -0.07181262969970703, -0.06898117065429688, -0.06614971160888672, -0.06331825256347656, -0.060486793518066406, -0.05765533447265625, -0.054823875427246094, -0.05199241638183594, -0.04916095733642578, -0.046329498291015625, -0.04349803924560547, -0.04066658020019531, -0.037835121154785156, -0.035003662109375, -0.032172203063964844, -0.029340744018554688, -0.02650928497314453, -0.023677825927734375, -0.02084636688232422, -0.018014907836914062, -0.015183448791503906, -0.01235198974609375, -0.009520530700683594, -0.0066890716552734375, -0.0038576126098632812, -0.001026153564453125, 0.0018053054809570312, 0.0046367645263671875, 0.007468223571777344, 0.0102996826171875, 0.013131141662597656, 0.015962600708007812, 0.01879405975341797, 0.021625518798828125, 0.02445697784423828, 0.027288436889648438, 0.030119895935058594, 0.03295135498046875, 0.035782814025878906, 0.03861427307128906, 0.04144573211669922, 0.044277191162109375, 0.04710865020751953, 0.04994010925292969, 0.052771568298339844, 0.05560302734375]}, "gradients/encoder.encoder.layers.3.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 2.0, 1.0, 1.0, 2.0, 3.0, 0.0, 6.0, 5.0, 7.0, 9.0, 9.0, 13.0, 23.0, 39.0, 30.0, 58.0, 97.0, 131.0, 192.0, 330.0, 546.0, 819.0, 1613.0, 3393.0, 8352.0, 29923.0, 144633.0, 620417.0, 183337.0, 36562.0, 9968.0, 3670.0, 1769.0, 987.0, 592.0, 371.0, 206.0, 130.0, 115.0, 53.0, 50.0, 37.0, 21.0, 12.0, 7.0, 8.0, 7.0, 5.0, 1.0, 5.0, 2.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.174560546875, -0.16903114318847656, -0.16350173950195312, -0.1579723358154297, -0.15244293212890625, -0.1469135284423828, -0.14138412475585938, -0.13585472106933594, -0.1303253173828125, -0.12479591369628906, -0.11926651000976562, -0.11373710632324219, -0.10820770263671875, -0.10267829895019531, -0.09714889526367188, -0.09161949157714844, -0.086090087890625, -0.08056068420410156, -0.07503128051757812, -0.06950187683105469, -0.06397247314453125, -0.05844306945800781, -0.052913665771484375, -0.04738426208496094, -0.0418548583984375, -0.03632545471191406, -0.030796051025390625, -0.025266647338867188, -0.01973724365234375, -0.014207839965820312, -0.008678436279296875, -0.0031490325927734375, 0.00238037109375, 0.007909774780273438, 0.013439178466796875, 0.018968582153320312, 0.02449798583984375, 0.030027389526367188, 0.035556793212890625, 0.04108619689941406, 0.0466156005859375, 0.05214500427246094, 0.057674407958984375, 0.06320381164550781, 0.06873321533203125, 0.07426261901855469, 0.07979202270507812, 0.08532142639160156, 0.090850830078125, 0.09638023376464844, 0.10190963745117188, 0.10743904113769531, 0.11296844482421875, 0.11849784851074219, 0.12402725219726562, 0.12955665588378906, 0.1350860595703125, 0.14061546325683594, 0.14614486694335938, 0.1516742706298828, 0.15720367431640625, 0.1627330780029297, 0.16826248168945312, 0.17379188537597656, 0.1793212890625]}, "gradients/encoder.encoder.layers.3.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 2.0, 0.0, 2.0, 2.0, 0.0, 2.0, 2.0, 7.0, 6.0, 5.0, 6.0, 6.0, 13.0, 21.0, 16.0, 23.0, 16.0, 22.0, 27.0, 43.0, 30.0, 31.0, 36.0, 44.0, 33.0, 54.0, 48.0, 46.0, 68.0, 57.0, 53.0, 35.0, 35.0, 26.0, 37.0, 27.0, 24.0, 13.0, 13.0, 20.0, 14.0, 5.0, 9.0, 8.0, 11.0, 6.0, 2.0, 2.0, 2.0, 2.0, 1.0, 1.0, 3.0, 0.0, 0.0, 1.0], "bins": [-0.21923828125, -0.2129840850830078, -0.20672988891601562, -0.20047569274902344, -0.19422149658203125, -0.18796730041503906, -0.18171310424804688, -0.1754589080810547, -0.1692047119140625, -0.1629505157470703, -0.15669631958007812, -0.15044212341308594, -0.14418792724609375, -0.13793373107910156, -0.13167953491210938, -0.1254253387451172, -0.119171142578125, -0.11291694641113281, -0.10666275024414062, -0.10040855407714844, -0.09415435791015625, -0.08790016174316406, -0.08164596557617188, -0.07539176940917969, -0.0691375732421875, -0.06288337707519531, -0.056629180908203125, -0.05037498474121094, -0.04412078857421875, -0.03786659240722656, -0.031612396240234375, -0.025358200073242188, -0.01910400390625, -0.012849807739257812, -0.006595611572265625, -0.0003414154052734375, 0.00591278076171875, 0.012166976928710938, 0.018421173095703125, 0.024675369262695312, 0.0309295654296875, 0.03718376159667969, 0.043437957763671875, 0.04969215393066406, 0.05594635009765625, 0.06220054626464844, 0.06845474243164062, 0.07470893859863281, 0.080963134765625, 0.08721733093261719, 0.09347152709960938, 0.09972572326660156, 0.10597991943359375, 0.11223411560058594, 0.11848831176757812, 0.12474250793457031, 0.1309967041015625, 0.1372509002685547, 0.14350509643554688, 0.14975929260253906, 0.15601348876953125, 0.16226768493652344, 0.16852188110351562, 0.1747760772705078, 0.1810302734375]}, "gradients/encoder.encoder.layers.3.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 2.0, 3.0, 1.0, 2.0, 4.0, 4.0, 6.0, 14.0, 10.0, 16.0, 20.0, 24.0, 44.0, 70.0, 79.0, 137.0, 177.0, 298.0, 546.0, 841.0, 1563.0, 3537.0, 10530.0, 60562.0, 799486.0, 143950.0, 17095.0, 4808.0, 1959.0, 1043.0, 595.0, 416.0, 243.0, 160.0, 80.0, 72.0, 46.0, 38.0, 23.0, 11.0, 19.0, 8.0, 8.0, 6.0, 2.0, 3.0, 1.0, 2.0, 0.0, 1.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.1802978515625, -0.1746044158935547, -0.16891098022460938, -0.16321754455566406, -0.15752410888671875, -0.15183067321777344, -0.14613723754882812, -0.1404438018798828, -0.1347503662109375, -0.1290569305419922, -0.12336349487304688, -0.11767005920410156, -0.11197662353515625, -0.10628318786621094, -0.10058975219726562, -0.09489631652832031, -0.089202880859375, -0.08350944519042969, -0.07781600952148438, -0.07212257385253906, -0.06642913818359375, -0.06073570251464844, -0.055042266845703125, -0.04934883117675781, -0.0436553955078125, -0.03796195983886719, -0.032268524169921875, -0.026575088500976562, -0.02088165283203125, -0.015188217163085938, -0.009494781494140625, -0.0038013458251953125, 0.00189208984375, 0.0075855255126953125, 0.013278961181640625, 0.018972396850585938, 0.02466583251953125, 0.030359268188476562, 0.036052703857421875, 0.04174613952636719, 0.0474395751953125, 0.05313301086425781, 0.058826446533203125, 0.06451988220214844, 0.07021331787109375, 0.07590675354003906, 0.08160018920898438, 0.08729362487792969, 0.092987060546875, 0.09868049621582031, 0.10437393188476562, 0.11006736755371094, 0.11576080322265625, 0.12145423889160156, 0.12714767456054688, 0.1328411102294922, 0.1385345458984375, 0.1442279815673828, 0.14992141723632812, 0.15561485290527344, 0.16130828857421875, 0.16700172424316406, 0.17269515991210938, 0.1783885955810547, 0.18408203125]}, "gradients/encoder.encoder.layers.3.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 0.0, 1.0, 4.0, 3.0, 3.0, 3.0, 2.0, 6.0, 11.0, 19.0, 19.0, 33.0, 85.0, 138.0, 225.0, 209.0, 92.0, 63.0, 27.0, 23.0, 13.0, 7.0, 9.0, 2.0, 2.0, 2.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 2.0, 2.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-9.274482727050781e-05, -9.016040712594986e-05, -8.75759869813919e-05, -8.499156683683395e-05, -8.2407146692276e-05, -7.982272654771805e-05, -7.72383064031601e-05, -7.465388625860214e-05, -7.206946611404419e-05, -6.948504596948624e-05, -6.690062582492828e-05, -6.431620568037033e-05, -6.173178553581238e-05, -5.9147365391254425e-05, -5.656294524669647e-05, -5.397852510213852e-05, -5.1394104957580566e-05, -4.8809684813022614e-05, -4.622526466846466e-05, -4.364084452390671e-05, -4.1056424379348755e-05, -3.84720042347908e-05, -3.588758409023285e-05, -3.3303163945674896e-05, -3.071874380111694e-05, -2.813432365655899e-05, -2.5549903512001038e-05, -2.2965483367443085e-05, -2.0381063222885132e-05, -1.779664307832718e-05, -1.5212222933769226e-05, -1.2627802789211273e-05, -1.004338264465332e-05, -7.4589625000953674e-06, -4.8745423555374146e-06, -2.2901222109794617e-06, 2.942979335784912e-07, 2.878718078136444e-06, 5.463138222694397e-06, 8.04755836725235e-06, 1.0631978511810303e-05, 1.3216398656368256e-05, 1.580081880092621e-05, 1.838523894548416e-05, 2.0969659090042114e-05, 2.3554079234600067e-05, 2.613849937915802e-05, 2.8722919523715973e-05, 3.1307339668273926e-05, 3.389175981283188e-05, 3.647617995738983e-05, 3.9060600101947784e-05, 4.164502024650574e-05, 4.422944039106369e-05, 4.681386053562164e-05, 4.9398280680179596e-05, 5.198270082473755e-05, 5.45671209692955e-05, 5.7151541113853455e-05, 5.973596125841141e-05, 6.232038140296936e-05, 6.490480154752731e-05, 6.748922169208527e-05, 7.007364183664322e-05, 7.265806198120117e-05]}, "gradients/encoder.encoder.layers.3.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 0.0, 2.0, 1.0, 2.0, 2.0, 3.0, 2.0, 8.0, 13.0, 18.0, 30.0, 55.0, 124.0, 208.0, 412.0, 864.0, 2173.0, 6984.0, 39076.0, 784199.0, 191206.0, 16459.0, 4036.0, 1441.0, 609.0, 277.0, 157.0, 76.0, 51.0, 30.0, 12.0, 11.0, 10.0, 4.0, 3.0, 5.0, 0.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.2017822265625, -0.19500732421875, -0.188232421875, -0.18145751953125, -0.1746826171875, -0.16790771484375, -0.1611328125, -0.15435791015625, -0.1475830078125, -0.14080810546875, -0.134033203125, -0.12725830078125, -0.1204833984375, -0.11370849609375, -0.10693359375, -0.10015869140625, -0.0933837890625, -0.08660888671875, -0.079833984375, -0.07305908203125, -0.0662841796875, -0.05950927734375, -0.052734375, -0.04595947265625, -0.0391845703125, -0.03240966796875, -0.025634765625, -0.01885986328125, -0.0120849609375, -0.00531005859375, 0.00146484375, 0.00823974609375, 0.0150146484375, 0.02178955078125, 0.028564453125, 0.03533935546875, 0.0421142578125, 0.04888916015625, 0.0556640625, 0.06243896484375, 0.0692138671875, 0.07598876953125, 0.082763671875, 0.08953857421875, 0.0963134765625, 0.10308837890625, 0.10986328125, 0.11663818359375, 0.1234130859375, 0.13018798828125, 0.136962890625, 0.14373779296875, 0.1505126953125, 0.15728759765625, 0.1640625, 0.17083740234375, 0.1776123046875, 0.18438720703125, 0.191162109375, 0.19793701171875, 0.2047119140625, 0.21148681640625, 0.21826171875, 0.22503662109375, 0.2318115234375]}, "gradients/encoder.encoder.layers.3.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 0.0, 0.0, 3.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 2.0, 7.0, 3.0, 17.0, 15.0, 24.0, 30.0, 54.0, 79.0, 127.0, 141.0, 172.0, 120.0, 60.0, 56.0, 37.0, 22.0, 14.0, 8.0, 5.0, 5.0, 2.0, 2.0, 0.0, 1.0, 2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1253662109375, -0.1205596923828125, -0.115753173828125, -0.1109466552734375, -0.10614013671875, -0.1013336181640625, -0.096527099609375, -0.0917205810546875, -0.0869140625, -0.0821075439453125, -0.077301025390625, -0.0724945068359375, -0.06768798828125, -0.0628814697265625, -0.058074951171875, -0.0532684326171875, -0.0484619140625, -0.0436553955078125, -0.038848876953125, -0.0340423583984375, -0.02923583984375, -0.0244293212890625, -0.019622802734375, -0.0148162841796875, -0.010009765625, -0.0052032470703125, -0.000396728515625, 0.0044097900390625, 0.00921630859375, 0.0140228271484375, 0.018829345703125, 0.0236358642578125, 0.0284423828125, 0.0332489013671875, 0.038055419921875, 0.0428619384765625, 0.04766845703125, 0.0524749755859375, 0.057281494140625, 0.0620880126953125, 0.06689453125, 0.0717010498046875, 0.076507568359375, 0.0813140869140625, 0.08612060546875, 0.0909271240234375, 0.095733642578125, 0.1005401611328125, 0.1053466796875, 0.1101531982421875, 0.114959716796875, 0.1197662353515625, 0.12457275390625, 0.1293792724609375, 0.134185791015625, 0.1389923095703125, 0.143798828125, 0.1486053466796875, 0.153411865234375, 0.1582183837890625, 0.16302490234375, 0.1678314208984375, 0.172637939453125, 0.1774444580078125, 0.1822509765625]}, "gradients/encoder.encoder.layers.3.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 2.0, 2.0, 6.0, 6.0, 5.0, 19.0, 28.0, 89.0, 225.0, 404.0, 143.0, 45.0, 27.0, 7.0, 2.0, 3.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.8303780555725098, -3.711782455444336, -3.593186855316162, -3.4745912551879883, -3.3559954166412354, -3.2373998165130615, -3.1188042163848877, -3.000208616256714, -2.88161301612854, -2.763017416000366, -2.6444218158721924, -2.5258259773254395, -2.4072303771972656, -2.288634777069092, -2.170039176940918, -2.051443576812744, -1.9328478574752808, -1.814252257347107, -1.6956565380096436, -1.5770609378814697, -1.458465337753296, -1.339869737625122, -1.2212740182876587, -1.1026784181594849, -0.9840827584266663, -0.8654870986938477, -0.7468914985656738, -0.6282958388328552, -0.5097001791000366, -0.3911045789718628, -0.2725089192390442, -0.15391331911087036, -0.03531765937805176, 0.08327797800302505, 0.20187361538410187, 0.3204692602157593, 0.4390648901462555, 0.5576605200767517, 0.6762561798095703, 0.7948517799377441, 0.9134474396705627, 1.0320430994033813, 1.1506386995315552, 1.2692344188690186, 1.3878300189971924, 1.5064256191253662, 1.62502121925354, 1.7436168193817139, 1.8622125387191772, 1.980808138847351, 2.0994038581848145, 2.2179994583129883, 2.336595058441162, 2.455190658569336, 2.5737862586975098, 2.6923818588256836, 2.8109776973724365, 2.9295732975006104, 3.048168897628784, 3.166764736175537, 3.285360336303711, 3.4039559364318848, 3.5225515365600586, 3.6411471366882324, 3.7597427368164062]}, "gradients/encoder.encoder.layers.3.layer_norm.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 2.0, 4.0, 2.0, 5.0, 3.0, 8.0, 4.0, 6.0, 6.0, 9.0, 11.0, 13.0, 15.0, 12.0, 21.0, 17.0, 18.0, 19.0, 22.0, 35.0, 28.0, 29.0, 39.0, 40.0, 52.0, 79.0, 70.0, 63.0, 41.0, 37.0, 31.0, 40.0, 29.0, 26.0, 27.0, 11.0, 24.0, 19.0, 12.0, 14.0, 10.0, 13.0, 11.0, 9.0, 7.0, 5.0, 5.0, 2.0, 5.0, 6.0, 0.0, 2.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.7192637920379639, -0.6943606734275818, -0.6694576144218445, -0.6445544958114624, -0.6196514368057251, -0.594748318195343, -0.5698451995849609, -0.5449421405792236, -0.5200390219688416, -0.49513593316078186, -0.47023284435272217, -0.4453297257423401, -0.4204266369342804, -0.3955235481262207, -0.3706204295158386, -0.34571734070777893, -0.32081425189971924, -0.29591116309165955, -0.27100807428359985, -0.24610495567321777, -0.22120186686515808, -0.1962987780570984, -0.1713956743478775, -0.14649257063865662, -0.12158948183059692, -0.09668638557195663, -0.07178328931331635, -0.046880193054676056, -0.021977096796035767, 0.0029259994626045227, 0.027829095721244812, 0.0527321994304657, 0.07763528823852539, 0.10253838449716568, 0.12744148075580597, 0.15234458446502686, 0.17724767327308655, 0.20215076208114624, 0.22705386579036713, 0.251956969499588, 0.2768600583076477, 0.3017631471157074, 0.3266662359237671, 0.35156935453414917, 0.37647244334220886, 0.40137553215026855, 0.42627865076065063, 0.4511817395687103, 0.47608482837677, 0.5009879469871521, 0.5258910059928894, 0.5507941246032715, 0.5756971836090088, 0.6006003022193909, 0.625503420829773, 0.6504064798355103, 0.6753095984458923, 0.7002127170562744, 0.7251157760620117, 0.7500188946723938, 0.7749220132827759, 0.7998250722885132, 0.8247281908988953, 0.8496313095092773, 0.8745343685150146]}, "gradients/encoder.encoder.layers.2.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 3.0, 2.0, 4.0, 0.0, 5.0, 6.0, 5.0, 7.0, 9.0, 12.0, 21.0, 21.0, 24.0, 43.0, 54.0, 79.0, 128.0, 221.0, 362.0, 639.0, 1235.0, 2638.0, 6355.0, 19562.0, 116290.0, 2639830.0, 1316007.0, 65679.0, 15027.0, 5439.0, 2164.0, 1045.0, 557.0, 312.0, 191.0, 109.0, 80.0, 50.0, 27.0, 14.0, 13.0, 7.0, 6.0, 4.0, 2.0, 3.0, 0.0, 3.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.1939697265625, -0.18884849548339844, -0.18372726440429688, -0.1786060333251953, -0.17348480224609375, -0.1683635711669922, -0.16324234008789062, -0.15812110900878906, -0.1529998779296875, -0.14787864685058594, -0.14275741577148438, -0.1376361846923828, -0.13251495361328125, -0.1273937225341797, -0.12227249145507812, -0.11715126037597656, -0.112030029296875, -0.10690879821777344, -0.10178756713867188, -0.09666633605957031, -0.09154510498046875, -0.08642387390136719, -0.08130264282226562, -0.07618141174316406, -0.0710601806640625, -0.06593894958496094, -0.060817718505859375, -0.05569648742675781, -0.05057525634765625, -0.04545402526855469, -0.040332794189453125, -0.03521156311035156, -0.03009033203125, -0.024969100952148438, -0.019847869873046875, -0.014726638793945312, -0.00960540771484375, -0.0044841766357421875, 0.000637054443359375, 0.0057582855224609375, 0.0108795166015625, 0.016000747680664062, 0.021121978759765625, 0.026243209838867188, 0.03136444091796875, 0.03648567199707031, 0.041606903076171875, 0.04672813415527344, 0.051849365234375, 0.05697059631347656, 0.062091827392578125, 0.06721305847167969, 0.07233428955078125, 0.07745552062988281, 0.08257675170898438, 0.08769798278808594, 0.0928192138671875, 0.09794044494628906, 0.10306167602539062, 0.10818290710449219, 0.11330413818359375, 0.11842536926269531, 0.12354660034179688, 0.12866783142089844, 0.1337890625]}, "gradients/encoder.encoder.layers.2.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 3.0, 3.0, 8.0, 11.0, 16.0, 18.0, 36.0, 37.0, 58.0, 59.0, 71.0, 80.0, 86.0, 92.0, 81.0, 80.0, 74.0, 52.0, 36.0, 38.0, 19.0, 15.0, 17.0, 11.0, 6.0, 3.0, 4.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.12152099609375, -0.1184988021850586, -0.11547660827636719, -0.11245441436767578, -0.10943222045898438, -0.10641002655029297, -0.10338783264160156, -0.10036563873291016, -0.09734344482421875, -0.09432125091552734, -0.09129905700683594, -0.08827686309814453, -0.08525466918945312, -0.08223247528076172, -0.07921028137207031, -0.0761880874633789, -0.0731658935546875, -0.0701436996459961, -0.06712150573730469, -0.06409931182861328, -0.061077117919921875, -0.05805492401123047, -0.05503273010253906, -0.052010536193847656, -0.04898834228515625, -0.045966148376464844, -0.04294395446777344, -0.03992176055908203, -0.036899566650390625, -0.03387737274169922, -0.030855178833007812, -0.027832984924316406, -0.024810791015625, -0.021788597106933594, -0.018766403198242188, -0.01574420928955078, -0.012722015380859375, -0.009699821472167969, -0.0066776275634765625, -0.0036554336547851562, -0.00063323974609375, 0.0023889541625976562, 0.0054111480712890625, 0.008433341979980469, 0.011455535888671875, 0.014477729797363281, 0.017499923706054688, 0.020522117614746094, 0.0235443115234375, 0.026566505432128906, 0.029588699340820312, 0.03261089324951172, 0.035633087158203125, 0.03865528106689453, 0.04167747497558594, 0.044699668884277344, 0.04772186279296875, 0.050744056701660156, 0.05376625061035156, 0.05678844451904297, 0.059810638427734375, 0.06283283233642578, 0.06585502624511719, 0.0688772201538086, 0.0718994140625]}, "gradients/encoder.encoder.layers.2.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 2.0, 1.0, 3.0, 0.0, 1.0, 3.0, 2.0, 10.0, 7.0, 5.0, 14.0, 23.0, 23.0, 54.0, 73.0, 164.0, 293.0, 740.0, 2249.0, 11637.0, 208856.0, 3915946.0, 46391.0, 5405.0, 1330.0, 539.0, 230.0, 115.0, 56.0, 43.0, 23.0, 19.0, 9.0, 5.0, 10.0, 2.0, 3.0, 5.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.470947265625, -0.4568023681640625, -0.442657470703125, -0.4285125732421875, -0.41436767578125, -0.4002227783203125, -0.386077880859375, -0.3719329833984375, -0.3577880859375, -0.3436431884765625, -0.329498291015625, -0.3153533935546875, -0.30120849609375, -0.2870635986328125, -0.272918701171875, -0.2587738037109375, -0.24462890625, -0.2304840087890625, -0.216339111328125, -0.2021942138671875, -0.18804931640625, -0.1739044189453125, -0.159759521484375, -0.1456146240234375, -0.1314697265625, -0.1173248291015625, -0.103179931640625, -0.0890350341796875, -0.07489013671875, -0.0607452392578125, -0.046600341796875, -0.0324554443359375, -0.018310546875, -0.0041656494140625, 0.009979248046875, 0.0241241455078125, 0.03826904296875, 0.0524139404296875, 0.066558837890625, 0.0807037353515625, 0.0948486328125, 0.1089935302734375, 0.123138427734375, 0.1372833251953125, 0.15142822265625, 0.1655731201171875, 0.179718017578125, 0.1938629150390625, 0.2080078125, 0.2221527099609375, 0.236297607421875, 0.2504425048828125, 0.26458740234375, 0.2787322998046875, 0.292877197265625, 0.3070220947265625, 0.3211669921875, 0.3353118896484375, 0.349456787109375, 0.3636016845703125, 0.37774658203125, 0.3918914794921875, 0.406036376953125, 0.4201812744140625, 0.434326171875]}, "gradients/encoder.encoder.layers.2.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 4.0, 2.0, 1.0, 2.0, 4.0, 1.0, 9.0, 7.0, 15.0, 9.0, 15.0, 21.0, 29.0, 40.0, 74.0, 124.0, 197.0, 460.0, 933.0, 1055.0, 511.0, 222.0, 122.0, 79.0, 37.0, 20.0, 18.0, 15.0, 13.0, 9.0, 10.0, 6.0, 7.0, 3.0, 1.0, 2.0, 2.0, 2.0, 0.0, 2.0, 3.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.2364501953125, -0.22951698303222656, -0.22258377075195312, -0.2156505584716797, -0.20871734619140625, -0.2017841339111328, -0.19485092163085938, -0.18791770935058594, -0.1809844970703125, -0.17405128479003906, -0.16711807250976562, -0.1601848602294922, -0.15325164794921875, -0.1463184356689453, -0.13938522338867188, -0.13245201110839844, -0.125518798828125, -0.11858558654785156, -0.11165237426757812, -0.10471916198730469, -0.09778594970703125, -0.09085273742675781, -0.08391952514648438, -0.07698631286621094, -0.0700531005859375, -0.06311988830566406, -0.056186676025390625, -0.04925346374511719, -0.04232025146484375, -0.03538703918457031, -0.028453826904296875, -0.021520614624023438, -0.01458740234375, -0.0076541900634765625, -0.000720977783203125, 0.0062122344970703125, 0.01314544677734375, 0.020078659057617188, 0.027011871337890625, 0.03394508361816406, 0.0408782958984375, 0.04781150817871094, 0.054744720458984375, 0.06167793273925781, 0.06861114501953125, 0.07554435729980469, 0.08247756958007812, 0.08941078186035156, 0.096343994140625, 0.10327720642089844, 0.11021041870117188, 0.11714363098144531, 0.12407684326171875, 0.1310100555419922, 0.13794326782226562, 0.14487648010253906, 0.1518096923828125, 0.15874290466308594, 0.16567611694335938, 0.1726093292236328, 0.17954254150390625, 0.1864757537841797, 0.19340896606445312, 0.20034217834472656, 0.207275390625]}, "gradients/encoder.encoder.layers.2.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 3.0, 2.0, 5.0, 7.0, 6.0, 7.0, 13.0, 29.0, 39.0, 50.0, 83.0, 130.0, 154.0, 146.0, 120.0, 83.0, 58.0, 34.0, 17.0, 10.0, 7.0, 6.0, 3.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-2.565342903137207, -2.510693311691284, -2.4560437202453613, -2.4013938903808594, -2.3467442989349365, -2.2920947074890137, -2.237445116043091, -2.182795286178589, -2.128145694732666, -2.073496103286743, -2.0188465118408203, -1.964196801185608, -1.9095470905303955, -1.8548974990844727, -1.8002477884292603, -1.7455981969833374, -1.690948486328125, -1.6362988948822021, -1.5816491842269897, -1.526999592781067, -1.4723498821258545, -1.4177002906799316, -1.3630505800247192, -1.3084009885787964, -1.2537513971328735, -1.1991018056869507, -1.1444520950317383, -1.0898025035858154, -1.035152792930603, -0.9805032014846802, -0.9258534908294678, -0.8712038993835449, -0.8165541887283325, -0.7619045376777649, -0.7072548866271973, -0.6526052355766296, -0.597955584526062, -0.5433059930801392, -0.48865631222724915, -0.4340066611766815, -0.3793570101261139, -0.32470735907554626, -0.27005770802497864, -0.2154080718755722, -0.16075842082500458, -0.10610878467559814, -0.05145913362503052, 0.0031905174255371094, 0.057840168476104736, 0.11248981952667236, 0.16713947057724, 0.22178910672664642, 0.27643877267837524, 0.3310883939266205, 0.3857380449771881, 0.44038769602775574, 0.49503734707832336, 0.5496869683265686, 0.6043366193771362, 0.6589862704277039, 0.7136359214782715, 0.7682855725288391, 0.8229352235794067, 0.8775848746299744, 0.932234525680542]}, "gradients/encoder.encoder.layers.2.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 4.0, 7.0, 1.0, 4.0, 6.0, 5.0, 15.0, 10.0, 20.0, 22.0, 21.0, 16.0, 21.0, 25.0, 21.0, 33.0, 41.0, 28.0, 43.0, 31.0, 47.0, 43.0, 44.0, 53.0, 45.0, 46.0, 52.0, 40.0, 38.0, 35.0, 30.0, 28.0, 14.0, 14.0, 25.0, 13.0, 12.0, 13.0, 9.0, 9.0, 8.0, 7.0, 5.0, 1.0, 0.0, 3.0, 3.0, 2.0, 1.0, 3.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.7177403569221497, -0.6949856877326965, -0.6722310185432434, -0.6494764089584351, -0.6267217397689819, -0.6039670705795288, -0.5812124013900757, -0.5584577322006226, -0.5357030630111694, -0.5129483938217163, -0.49019375443458557, -0.46743908524513245, -0.4446844458580017, -0.4219297766685486, -0.39917510747909546, -0.37642043828964233, -0.353665828704834, -0.33091115951538086, -0.3081565201282501, -0.285401850938797, -0.26264721155166626, -0.23989254236221313, -0.21713787317276, -0.19438321888446808, -0.17162856459617615, -0.14887391030788422, -0.12611925601959229, -0.10336458683013916, -0.08060993254184723, -0.0578552782535553, -0.03510060906410217, -0.012345954775810242, 0.01040869951248169, 0.03316335752606392, 0.05591801553964615, 0.07867267727851868, 0.10142733156681061, 0.12418198585510254, 0.14693665504455566, 0.1696913093328476, 0.19244596362113953, 0.21520061790943146, 0.2379552721977234, 0.2607099413871765, 0.28346461057662964, 0.3062192499637604, 0.3289739191532135, 0.35172855854034424, 0.37448322772979736, 0.3972378969192505, 0.4199925363063812, 0.44274720549583435, 0.4655018448829651, 0.4882565140724182, 0.5110111832618713, 0.5337658524513245, 0.5565204620361328, 0.5792751312255859, 0.6020298004150391, 0.6247844696044922, 0.6475390791893005, 0.6702937483787537, 0.6930484175682068, 0.7158030867576599, 0.738557755947113]}, "gradients/encoder.encoder.layers.2.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 2.0, 1.0, 3.0, 3.0, 3.0, 8.0, 7.0, 12.0, 7.0, 27.0, 37.0, 77.0, 103.0, 238.0, 463.0, 1003.0, 2075.0, 4787.0, 11943.0, 31492.0, 90088.0, 258398.0, 385227.0, 169128.0, 58237.0, 20789.0, 8010.0, 3435.0, 1553.0, 646.0, 339.0, 188.0, 106.0, 50.0, 36.0, 18.0, 14.0, 5.0, 1.0, 4.0, 2.0, 2.0, 0.0, 1.0, 0.0, 1.0], "bins": [-0.17333984375, -0.1690378189086914, -0.1647357940673828, -0.16043376922607422, -0.15613174438476562, -0.15182971954345703, -0.14752769470214844, -0.14322566986083984, -0.13892364501953125, -0.13462162017822266, -0.13031959533691406, -0.12601757049560547, -0.12171554565429688, -0.11741352081298828, -0.11311149597167969, -0.1088094711303711, -0.1045074462890625, -0.1002054214477539, -0.09590339660644531, -0.09160137176513672, -0.08729934692382812, -0.08299732208251953, -0.07869529724121094, -0.07439327239990234, -0.07009124755859375, -0.06578922271728516, -0.06148719787597656, -0.05718517303466797, -0.052883148193359375, -0.04858112335205078, -0.04427909851074219, -0.039977073669433594, -0.035675048828125, -0.031373023986816406, -0.027070999145507812, -0.02276897430419922, -0.018466949462890625, -0.014164924621582031, -0.009862899780273438, -0.005560874938964844, -0.00125885009765625, 0.0030431747436523438, 0.0073451995849609375, 0.011647224426269531, 0.015949249267578125, 0.02025127410888672, 0.024553298950195312, 0.028855323791503906, 0.0331573486328125, 0.037459373474121094, 0.04176139831542969, 0.04606342315673828, 0.050365447998046875, 0.05466747283935547, 0.05896949768066406, 0.06327152252197266, 0.06757354736328125, 0.07187557220458984, 0.07617759704589844, 0.08047962188720703, 0.08478164672851562, 0.08908367156982422, 0.09338569641113281, 0.0976877212524414, 0.10198974609375]}, "gradients/encoder.encoder.layers.2.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 0.0, 5.0, 1.0, 4.0, 7.0, 11.0, 9.0, 10.0, 19.0, 24.0, 29.0, 38.0, 42.0, 50.0, 47.0, 65.0, 66.0, 68.0, 84.0, 77.0, 65.0, 41.0, 48.0, 49.0, 34.0, 30.0, 29.0, 19.0, 7.0, 16.0, 8.0, 7.0, 2.0, 3.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1231689453125, -0.120086669921875, -0.11700439453125, -0.113922119140625, -0.11083984375, -0.107757568359375, -0.10467529296875, -0.101593017578125, -0.0985107421875, -0.095428466796875, -0.09234619140625, -0.089263916015625, -0.086181640625, -0.083099365234375, -0.08001708984375, -0.076934814453125, -0.0738525390625, -0.070770263671875, -0.06768798828125, -0.064605712890625, -0.0615234375, -0.058441162109375, -0.05535888671875, -0.052276611328125, -0.0491943359375, -0.046112060546875, -0.04302978515625, -0.039947509765625, -0.036865234375, -0.033782958984375, -0.03070068359375, -0.027618408203125, -0.0245361328125, -0.021453857421875, -0.01837158203125, -0.015289306640625, -0.01220703125, -0.009124755859375, -0.00604248046875, -0.002960205078125, 0.0001220703125, 0.003204345703125, 0.00628662109375, 0.009368896484375, 0.012451171875, 0.015533447265625, 0.01861572265625, 0.021697998046875, 0.0247802734375, 0.027862548828125, 0.03094482421875, 0.034027099609375, 0.037109375, 0.040191650390625, 0.04327392578125, 0.046356201171875, 0.0494384765625, 0.052520751953125, 0.05560302734375, 0.058685302734375, 0.061767578125, 0.064849853515625, 0.06793212890625, 0.071014404296875, 0.0740966796875]}, "gradients/encoder.encoder.layers.2.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 1.0, 2.0, 2.0, 6.0, 6.0, 7.0, 5.0, 9.0, 16.0, 24.0, 26.0, 22.0, 63.0, 89.0, 125.0, 170.0, 278.0, 468.0, 911.0, 2481.0, 8905.0, 49185.0, 602038.0, 332079.0, 39727.0, 7695.0, 2168.0, 901.0, 443.0, 236.0, 123.0, 103.0, 69.0, 51.0, 36.0, 24.0, 15.0, 14.0, 9.0, 10.0, 3.0, 4.0, 4.0, 3.0, 2.0, 4.0, 1.0, 2.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.283203125, -0.2740516662597656, -0.26490020751953125, -0.2557487487792969, -0.2465972900390625, -0.23744583129882812, -0.22829437255859375, -0.21914291381835938, -0.209991455078125, -0.20083999633789062, -0.19168853759765625, -0.18253707885742188, -0.1733856201171875, -0.16423416137695312, -0.15508270263671875, -0.14593124389648438, -0.13677978515625, -0.12762832641601562, -0.11847686767578125, -0.10932540893554688, -0.1001739501953125, -0.09102249145507812, -0.08187103271484375, -0.07271957397460938, -0.063568115234375, -0.054416656494140625, -0.04526519775390625, -0.036113739013671875, -0.0269622802734375, -0.017810821533203125, -0.00865936279296875, 0.000492095947265625, 0.0096435546875, 0.018795013427734375, 0.02794647216796875, 0.037097930908203125, 0.0462493896484375, 0.055400848388671875, 0.06455230712890625, 0.07370376586914062, 0.082855224609375, 0.09200668334960938, 0.10115814208984375, 0.11030960083007812, 0.1194610595703125, 0.12861251831054688, 0.13776397705078125, 0.14691543579101562, 0.15606689453125, 0.16521835327148438, 0.17436981201171875, 0.18352127075195312, 0.1926727294921875, 0.20182418823242188, 0.21097564697265625, 0.22012710571289062, 0.229278564453125, 0.23843002319335938, 0.24758148193359375, 0.2567329406738281, 0.2658843994140625, 0.2750358581542969, 0.28418731689453125, 0.2933387756347656, 0.302490234375]}, "gradients/encoder.encoder.layers.2.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 3.0, 3.0, 4.0, 3.0, 2.0, 5.0, 4.0, 9.0, 8.0, 17.0, 13.0, 10.0, 16.0, 17.0, 20.0, 17.0, 23.0, 37.0, 35.0, 46.0, 60.0, 52.0, 58.0, 49.0, 63.0, 48.0, 56.0, 58.0, 50.0, 29.0, 32.0, 25.0, 28.0, 24.0, 19.0, 17.0, 16.0, 9.0, 5.0, 5.0, 3.0, 9.0, 1.0, 1.0, 2.0, 2.0, 2.0, 3.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.271240234375, -0.2615699768066406, -0.25189971923828125, -0.24222946166992188, -0.2325592041015625, -0.22288894653320312, -0.21321868896484375, -0.20354843139648438, -0.193878173828125, -0.18420791625976562, -0.17453765869140625, -0.16486740112304688, -0.1551971435546875, -0.14552688598632812, -0.13585662841796875, -0.12618637084960938, -0.11651611328125, -0.10684585571289062, -0.09717559814453125, -0.08750534057617188, -0.0778350830078125, -0.06816482543945312, -0.05849456787109375, -0.048824310302734375, -0.039154052734375, -0.029483795166015625, -0.01981353759765625, -0.010143280029296875, -0.0004730224609375, 0.009197235107421875, 0.01886749267578125, 0.028537750244140625, 0.0382080078125, 0.047878265380859375, 0.05754852294921875, 0.06721878051757812, 0.0768890380859375, 0.08655929565429688, 0.09622955322265625, 0.10589981079101562, 0.115570068359375, 0.12524032592773438, 0.13491058349609375, 0.14458084106445312, 0.1542510986328125, 0.16392135620117188, 0.17359161376953125, 0.18326187133789062, 0.19293212890625, 0.20260238647460938, 0.21227264404296875, 0.22194290161132812, 0.2316131591796875, 0.24128341674804688, 0.25095367431640625, 0.2606239318847656, 0.270294189453125, 0.2799644470214844, 0.28963470458984375, 0.2993049621582031, 0.3089752197265625, 0.3186454772949219, 0.32831573486328125, 0.3379859924316406, 0.34765625]}, "gradients/encoder.encoder.layers.2.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 3.0, 1.0, 1.0, 1.0, 3.0, 8.0, 7.0, 9.0, 9.0, 14.0, 19.0, 25.0, 45.0, 55.0, 91.0, 154.0, 240.0, 455.0, 891.0, 1897.0, 4557.0, 13212.0, 53896.0, 669625.0, 249327.0, 36891.0, 10185.0, 3659.0, 1526.0, 761.0, 408.0, 218.0, 127.0, 67.0, 52.0, 32.0, 35.0, 11.0, 14.0, 9.0, 5.0, 6.0, 5.0, 5.0, 1.0, 2.0, 0.0, 2.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-0.1451416015625, -0.14058876037597656, -0.13603591918945312, -0.1314830780029297, -0.12693023681640625, -0.12237739562988281, -0.11782455444335938, -0.11327171325683594, -0.1087188720703125, -0.10416603088378906, -0.09961318969726562, -0.09506034851074219, -0.09050750732421875, -0.08595466613769531, -0.08140182495117188, -0.07684898376464844, -0.072296142578125, -0.06774330139160156, -0.06319046020507812, -0.05863761901855469, -0.05408477783203125, -0.04953193664550781, -0.044979095458984375, -0.04042625427246094, -0.0358734130859375, -0.03132057189941406, -0.026767730712890625, -0.022214889526367188, -0.01766204833984375, -0.013109207153320312, -0.008556365966796875, -0.0040035247802734375, 0.00054931640625, 0.0051021575927734375, 0.009654998779296875, 0.014207839965820312, 0.01876068115234375, 0.023313522338867188, 0.027866363525390625, 0.03241920471191406, 0.0369720458984375, 0.04152488708496094, 0.046077728271484375, 0.05063056945800781, 0.05518341064453125, 0.05973625183105469, 0.06428909301757812, 0.06884193420410156, 0.073394775390625, 0.07794761657714844, 0.08250045776367188, 0.08705329895019531, 0.09160614013671875, 0.09615898132324219, 0.10071182250976562, 0.10526466369628906, 0.1098175048828125, 0.11437034606933594, 0.11892318725585938, 0.12347602844238281, 0.12802886962890625, 0.1325817108154297, 0.13713455200195312, 0.14168739318847656, 0.146240234375]}, "gradients/encoder.encoder.layers.2.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 4.0, 4.0, 1.0, 7.0, 2.0, 4.0, 13.0, 4.0, 6.0, 12.0, 13.0, 20.0, 13.0, 21.0, 24.0, 38.0, 66.0, 79.0, 83.0, 122.0, 126.0, 77.0, 65.0, 49.0, 28.0, 19.0, 19.0, 13.0, 23.0, 6.0, 5.0, 10.0, 7.0, 4.0, 3.0, 6.0, 2.0, 1.0, 2.0, 8.0, 1.0, 0.0, 0.0, 1.0, 4.0, 0.0, 1.0], "bins": [-4.172325134277344e-05, -4.057958722114563e-05, -3.943592309951782e-05, -3.8292258977890015e-05, -3.714859485626221e-05, -3.60049307346344e-05, -3.486126661300659e-05, -3.3717602491378784e-05, -3.2573938369750977e-05, -3.143027424812317e-05, -3.028661012649536e-05, -2.9142946004867554e-05, -2.7999281883239746e-05, -2.685561776161194e-05, -2.571195363998413e-05, -2.4568289518356323e-05, -2.3424625396728516e-05, -2.2280961275100708e-05, -2.11372971534729e-05, -1.9993633031845093e-05, -1.8849968910217285e-05, -1.7706304788589478e-05, -1.656264066696167e-05, -1.5418976545333862e-05, -1.4275312423706055e-05, -1.3131648302078247e-05, -1.198798418045044e-05, -1.0844320058822632e-05, -9.700655937194824e-06, -8.556991815567017e-06, -7.413327693939209e-06, -6.269663572311401e-06, -5.125999450683594e-06, -3.982335329055786e-06, -2.8386712074279785e-06, -1.695007085800171e-06, -5.513429641723633e-07, 5.923211574554443e-07, 1.735985279083252e-06, 2.8796494007110596e-06, 4.023313522338867e-06, 5.166977643966675e-06, 6.310641765594482e-06, 7.45430588722229e-06, 8.597970008850098e-06, 9.741634130477905e-06, 1.0885298252105713e-05, 1.202896237373352e-05, 1.3172626495361328e-05, 1.4316290616989136e-05, 1.5459954738616943e-05, 1.660361886024475e-05, 1.774728298187256e-05, 1.8890947103500366e-05, 2.0034611225128174e-05, 2.117827534675598e-05, 2.232193946838379e-05, 2.3465603590011597e-05, 2.4609267711639404e-05, 2.5752931833267212e-05, 2.689659595489502e-05, 2.8040260076522827e-05, 2.9183924198150635e-05, 3.0327588319778442e-05, 3.147125244140625e-05]}, "gradients/encoder.encoder.layers.2.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 2.0, 1.0, 1.0, 1.0, 3.0, 1.0, 1.0, 2.0, 4.0, 6.0, 3.0, 14.0, 13.0, 18.0, 23.0, 46.0, 65.0, 103.0, 186.0, 332.0, 677.0, 1588.0, 4419.0, 17706.0, 109097.0, 803240.0, 88828.0, 15264.0, 4037.0, 1425.0, 673.0, 315.0, 208.0, 84.0, 70.0, 34.0, 24.0, 15.0, 8.0, 6.0, 6.0, 8.0, 1.0, 1.0, 5.0, 1.0, 1.0, 3.0, 2.0, 1.0, 1.0], "bins": [-0.2030029296875, -0.197723388671875, -0.19244384765625, -0.187164306640625, -0.181884765625, -0.176605224609375, -0.17132568359375, -0.166046142578125, -0.1607666015625, -0.155487060546875, -0.15020751953125, -0.144927978515625, -0.1396484375, -0.134368896484375, -0.12908935546875, -0.123809814453125, -0.1185302734375, -0.113250732421875, -0.10797119140625, -0.102691650390625, -0.097412109375, -0.092132568359375, -0.08685302734375, -0.081573486328125, -0.0762939453125, -0.071014404296875, -0.06573486328125, -0.060455322265625, -0.05517578125, -0.049896240234375, -0.04461669921875, -0.039337158203125, -0.0340576171875, -0.028778076171875, -0.02349853515625, -0.018218994140625, -0.012939453125, -0.007659912109375, -0.00238037109375, 0.002899169921875, 0.0081787109375, 0.013458251953125, 0.01873779296875, 0.024017333984375, 0.029296875, 0.034576416015625, 0.03985595703125, 0.045135498046875, 0.0504150390625, 0.055694580078125, 0.06097412109375, 0.066253662109375, 0.071533203125, 0.076812744140625, 0.08209228515625, 0.087371826171875, 0.0926513671875, 0.097930908203125, 0.10321044921875, 0.108489990234375, 0.11376953125, 0.119049072265625, 0.12432861328125, 0.129608154296875, 0.1348876953125]}, "gradients/encoder.encoder.layers.2.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 2.0, 0.0, 3.0, 1.0, 1.0, 1.0, 0.0, 2.0, 2.0, 1.0, 5.0, 6.0, 7.0, 7.0, 14.0, 20.0, 19.0, 27.0, 43.0, 47.0, 82.0, 83.0, 107.0, 116.0, 102.0, 77.0, 43.0, 50.0, 31.0, 26.0, 23.0, 20.0, 9.0, 6.0, 6.0, 7.0, 6.0, 2.0, 7.0, 3.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.112548828125, -0.1095266342163086, -0.10650444030761719, -0.10348224639892578, -0.10046005249023438, -0.09743785858154297, -0.09441566467285156, -0.09139347076416016, -0.08837127685546875, -0.08534908294677734, -0.08232688903808594, -0.07930469512939453, -0.07628250122070312, -0.07326030731201172, -0.07023811340332031, -0.0672159194946289, -0.0641937255859375, -0.061171531677246094, -0.05814933776855469, -0.05512714385986328, -0.052104949951171875, -0.04908275604248047, -0.04606056213378906, -0.043038368225097656, -0.04001617431640625, -0.036993980407714844, -0.03397178649902344, -0.03094959259033203, -0.027927398681640625, -0.02490520477294922, -0.021883010864257812, -0.018860816955566406, -0.015838623046875, -0.012816429138183594, -0.009794235229492188, -0.006772041320800781, -0.003749847412109375, -0.0007276535034179688, 0.0022945404052734375, 0.005316734313964844, 0.00833892822265625, 0.011361122131347656, 0.014383316040039062, 0.01740550994873047, 0.020427703857421875, 0.02344989776611328, 0.026472091674804688, 0.029494285583496094, 0.0325164794921875, 0.035538673400878906, 0.03856086730957031, 0.04158306121826172, 0.044605255126953125, 0.04762744903564453, 0.05064964294433594, 0.053671836853027344, 0.05669403076171875, 0.059716224670410156, 0.06273841857910156, 0.06576061248779297, 0.06878280639648438, 0.07180500030517578, 0.07482719421386719, 0.0778493881225586, 0.08087158203125]}, "gradients/encoder.encoder.layers.2.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 2.0, 0.0, 1.0, 1.0, 6.0, 6.0, 13.0, 20.0, 37.0, 79.0, 120.0, 160.0, 314.0, 115.0, 67.0, 29.0, 20.0, 10.0, 4.0, 4.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.135388135910034, -3.053584337234497, -2.971780300140381, -2.8899765014648438, -2.8081727027893066, -2.7263689041137695, -2.6445648670196533, -2.562761068344116, -2.48095703125, -2.399153232574463, -2.3173491954803467, -2.2355453968048096, -2.1537415981292725, -2.0719375610351562, -1.9901337623596191, -1.908329963684082, -1.826526165008545, -1.7447222471237183, -1.6629184484481812, -1.5811145305633545, -1.4993107318878174, -1.4175068140029907, -1.335702896118164, -1.253899097442627, -1.1720951795578003, -1.0902912616729736, -1.0084874629974365, -0.9266835451126099, -0.844879686832428, -0.7630758285522461, -0.6812719106674194, -0.5994680523872375, -0.5176639556884766, -0.4358600974082947, -0.3540562093257904, -0.27225232124328613, -0.19044846296310425, -0.10864460468292236, -0.02684071660041809, 0.05496317148208618, 0.13676702976226807, 0.21857090294361115, 0.3003747761249542, 0.3821786642074585, 0.4639825224876404, 0.5457863807678223, 0.6275902986526489, 0.7093941569328308, 0.7911980152130127, 0.8730018734931946, 0.9548057317733765, 1.0366096496582031, 1.1184134483337402, 1.200217366218567, 1.2820212841033936, 1.3638250827789307, 1.4456290006637573, 1.527432918548584, 1.609236717224121, 1.6910406351089478, 1.7728445529937744, 1.8546483516693115, 1.9364522695541382, 2.018256187438965, 2.100059986114502]}, "gradients/encoder.encoder.layers.2.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 1.0, 2.0, 0.0, 2.0, 1.0, 0.0, 0.0, 3.0, 9.0, 6.0, 5.0, 9.0, 8.0, 9.0, 18.0, 14.0, 18.0, 16.0, 27.0, 26.0, 17.0, 19.0, 25.0, 21.0, 44.0, 20.0, 60.0, 57.0, 93.0, 93.0, 62.0, 40.0, 29.0, 17.0, 28.0, 32.0, 22.0, 22.0, 18.0, 16.0, 22.0, 11.0, 17.0, 8.0, 12.0, 12.0, 6.0, 3.0, 2.0, 3.0, 1.0, 5.0, 1.0, 2.0, 2.0, 1.0, 0.0, 1.0, 2.0, 1.0], "bins": [-1.1448421478271484, -1.1100022792816162, -1.075162410736084, -1.0403225421905518, -1.0054826736450195, -0.9706428050994873, -0.9358029365539551, -0.9009630680084229, -0.8661231994628906, -0.8312833309173584, -0.7964434623718262, -0.761603593826294, -0.7267637252807617, -0.6919238567352295, -0.6570839881896973, -0.622244119644165, -0.5874043107032776, -0.5525644421577454, -0.5177245736122131, -0.4828847050666809, -0.4480448365211487, -0.41320496797561646, -0.3783651292324066, -0.3435252606868744, -0.30868539214134216, -0.27384552359580994, -0.2390056550502777, -0.20416580140590668, -0.16932593286037445, -0.13448606431484222, -0.09964621067047119, -0.06480634212493896, -0.02996647357940674, 0.00487339124083519, 0.03971325606107712, 0.07455311715602875, 0.10939298570156097, 0.1442328542470932, 0.17907270789146423, 0.21391257643699646, 0.2487524449825287, 0.2835923135280609, 0.31843218207359314, 0.353272020816803, 0.3881118893623352, 0.42295175790786743, 0.45779162645339966, 0.4926314949989319, 0.5274713635444641, 0.5623112320899963, 0.5971511006355286, 0.6319909691810608, 0.666830837726593, 0.7016707062721252, 0.7365105152130127, 0.7713503837585449, 0.8061902523040771, 0.8410301208496094, 0.8758699893951416, 0.9107098579406738, 0.945549726486206, 0.9803895950317383, 1.0152294635772705, 1.0500693321228027, 1.084909200668335]}, "gradients/encoder.encoder.layers.1.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0, 1.0, 3.0, 1.0, 4.0, 4.0, 6.0, 2.0, 9.0, 21.0, 15.0, 29.0, 26.0, 38.0, 61.0, 94.0, 149.0, 236.0, 437.0, 735.0, 1513.0, 3550.0, 10771.0, 59271.0, 2318776.0, 1729793.0, 52376.0, 10053.0, 3214.0, 1392.0, 683.0, 388.0, 238.0, 135.0, 88.0, 60.0, 42.0, 22.0, 22.0, 10.0, 11.0, 5.0, 3.0, 4.0, 3.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.258056640625, -0.250457763671875, -0.24285888671875, -0.235260009765625, -0.2276611328125, -0.220062255859375, -0.21246337890625, -0.204864501953125, -0.197265625, -0.189666748046875, -0.18206787109375, -0.174468994140625, -0.1668701171875, -0.159271240234375, -0.15167236328125, -0.144073486328125, -0.136474609375, -0.128875732421875, -0.12127685546875, -0.113677978515625, -0.1060791015625, -0.098480224609375, -0.09088134765625, -0.083282470703125, -0.07568359375, -0.068084716796875, -0.06048583984375, -0.052886962890625, -0.0452880859375, -0.037689208984375, -0.03009033203125, -0.022491455078125, -0.014892578125, -0.007293701171875, 0.00030517578125, 0.007904052734375, 0.0155029296875, 0.023101806640625, 0.03070068359375, 0.038299560546875, 0.0458984375, 0.053497314453125, 0.06109619140625, 0.068695068359375, 0.0762939453125, 0.083892822265625, 0.09149169921875, 0.099090576171875, 0.106689453125, 0.114288330078125, 0.12188720703125, 0.129486083984375, 0.1370849609375, 0.144683837890625, 0.15228271484375, 0.159881591796875, 0.16748046875, 0.175079345703125, 0.18267822265625, 0.190277099609375, 0.1978759765625, 0.205474853515625, 0.21307373046875, 0.220672607421875, 0.228271484375]}, "gradients/encoder.encoder.layers.1.feed_forward.output_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 6.0, 6.0, 5.0, 6.0, 9.0, 6.0, 20.0, 16.0, 18.0, 29.0, 31.0, 40.0, 40.0, 40.0, 44.0, 49.0, 59.0, 59.0, 57.0, 64.0, 57.0, 56.0, 45.0, 35.0, 37.0, 32.0, 28.0, 24.0, 25.0, 19.0, 14.0, 15.0, 6.0, 1.0, 3.0, 8.0, 2.0, 3.0, 0.0, 2.0, 0.0, 1.0], "bins": [-0.09503173828125, -0.09267139434814453, -0.09031105041503906, -0.0879507064819336, -0.08559036254882812, -0.08323001861572266, -0.08086967468261719, -0.07850933074951172, -0.07614898681640625, -0.07378864288330078, -0.07142829895019531, -0.06906795501708984, -0.06670761108398438, -0.0643472671508789, -0.06198692321777344, -0.05962657928466797, -0.0572662353515625, -0.05490589141845703, -0.05254554748535156, -0.050185203552246094, -0.047824859619140625, -0.045464515686035156, -0.04310417175292969, -0.04074382781982422, -0.03838348388671875, -0.03602313995361328, -0.03366279602050781, -0.031302452087402344, -0.028942108154296875, -0.026581764221191406, -0.024221420288085938, -0.02186107635498047, -0.019500732421875, -0.01714038848876953, -0.014780044555664062, -0.012419700622558594, -0.010059356689453125, -0.007699012756347656, -0.0053386688232421875, -0.0029783248901367188, -0.00061798095703125, 0.0017423629760742188, 0.0041027069091796875, 0.006463050842285156, 0.008823394775390625, 0.011183738708496094, 0.013544082641601562, 0.01590442657470703, 0.0182647705078125, 0.02062511444091797, 0.022985458374023438, 0.025345802307128906, 0.027706146240234375, 0.030066490173339844, 0.03242683410644531, 0.03478717803955078, 0.03714752197265625, 0.03950786590576172, 0.04186820983886719, 0.044228553771972656, 0.046588897705078125, 0.048949241638183594, 0.05130958557128906, 0.05366992950439453, 0.0560302734375]}, "gradients/encoder.encoder.layers.1.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 4.0, 7.0, 9.0, 13.0, 38.0, 77.0, 187.0, 465.0, 1770.0, 51271.0, 4125518.0, 13250.0, 1131.0, 345.0, 115.0, 41.0, 24.0, 11.0, 9.0, 4.0, 3.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-1.037109375, -1.0040283203125, -0.970947265625, -0.9378662109375, -0.90478515625, -0.8717041015625, -0.838623046875, -0.8055419921875, -0.7724609375, -0.7393798828125, -0.706298828125, -0.6732177734375, -0.64013671875, -0.6070556640625, -0.573974609375, -0.5408935546875, -0.5078125, -0.4747314453125, -0.441650390625, -0.4085693359375, -0.37548828125, -0.3424072265625, -0.309326171875, -0.2762451171875, -0.2431640625, -0.2100830078125, -0.177001953125, -0.1439208984375, -0.11083984375, -0.0777587890625, -0.044677734375, -0.0115966796875, 0.021484375, 0.0545654296875, 0.087646484375, 0.1207275390625, 0.15380859375, 0.1868896484375, 0.219970703125, 0.2530517578125, 0.2861328125, 0.3192138671875, 0.352294921875, 0.3853759765625, 0.41845703125, 0.4515380859375, 0.484619140625, 0.5177001953125, 0.55078125, 0.5838623046875, 0.616943359375, 0.6500244140625, 0.68310546875, 0.7161865234375, 0.749267578125, 0.7823486328125, 0.8154296875, 0.8485107421875, 0.881591796875, 0.9146728515625, 0.94775390625, 0.9808349609375, 1.013916015625, 1.0469970703125, 1.080078125]}, "gradients/encoder.encoder.layers.1.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 2.0, 5.0, 7.0, 7.0, 11.0, 25.0, 29.0, 86.0, 157.0, 556.0, 1798.0, 941.0, 239.0, 99.0, 55.0, 24.0, 15.0, 10.0, 7.0, 4.0, 4.0, 3.0, 2.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.68359375, -0.6675682067871094, -0.6515426635742188, -0.6355171203613281, -0.6194915771484375, -0.6034660339355469, -0.5874404907226562, -0.5714149475097656, -0.555389404296875, -0.5393638610839844, -0.5233383178710938, -0.5073127746582031, -0.4912872314453125, -0.4752616882324219, -0.45923614501953125, -0.4432106018066406, -0.42718505859375, -0.4111595153808594, -0.39513397216796875, -0.3791084289550781, -0.3630828857421875, -0.3470573425292969, -0.33103179931640625, -0.3150062561035156, -0.298980712890625, -0.2829551696777344, -0.26692962646484375, -0.2509040832519531, -0.2348785400390625, -0.21885299682617188, -0.20282745361328125, -0.18680191040039062, -0.1707763671875, -0.15475082397460938, -0.13872528076171875, -0.12269973754882812, -0.1066741943359375, -0.09064865112304688, -0.07462310791015625, -0.058597564697265625, -0.042572021484375, -0.026546478271484375, -0.01052093505859375, 0.005504608154296875, 0.0215301513671875, 0.037555694580078125, 0.05358123779296875, 0.06960678100585938, 0.08563232421875, 0.10165786743164062, 0.11768341064453125, 0.13370895385742188, 0.1497344970703125, 0.16576004028320312, 0.18178558349609375, 0.19781112670898438, 0.213836669921875, 0.22986221313476562, 0.24588775634765625, 0.2619132995605469, 0.2779388427734375, 0.2939643859863281, 0.30998992919921875, 0.3260154724121094, 0.342041015625]}, "gradients/encoder.encoder.layers.1.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 0.0, 1.0, 3.0, 6.0, 8.0, 12.0, 22.0, 28.0, 43.0, 97.0, 145.0, 165.0, 173.0, 115.0, 83.0, 58.0, 21.0, 15.0, 3.0, 7.0, 4.0, 1.0, 0.0, 3.0, 2.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.0948410034179688, -3.0188400745391846, -2.9428391456604004, -2.8668384552001953, -2.790837526321411, -2.714836597442627, -2.6388356685638428, -2.5628347396850586, -2.4868338108062744, -2.4108328819274902, -2.334831953048706, -2.258831024169922, -2.182830333709717, -2.1068294048309326, -2.0308284759521484, -1.9548275470733643, -1.8788267374038696, -1.8028258085250854, -1.7268249988555908, -1.6508240699768066, -1.5748231410980225, -1.4988222122192383, -1.4228214025497437, -1.3468204736709595, -1.2708196640014648, -1.1948187351226807, -1.118817925453186, -1.0428169965744019, -0.9668160676956177, -0.8908151984214783, -0.8148143291473389, -0.7388134002685547, -0.6628124713897705, -0.5868116021156311, -0.5108106732368469, -0.4348098039627075, -0.3588089048862457, -0.28280800580978394, -0.20680713653564453, -0.13080620765686035, -0.05480533838272095, 0.021195553243160248, 0.09719644486904144, 0.17319732904434204, 0.24919822812080383, 0.3251991271972656, 0.40119999647140503, 0.4772009253501892, 0.5532017946243286, 0.629202663898468, 0.7052035927772522, 0.7812044620513916, 0.8572053909301758, 0.9332062602043152, 1.0092071294784546, 1.0852080583572388, 1.1612088680267334, 1.2372097969055176, 1.3132106065750122, 1.3892115354537964, 1.4652124643325806, 1.5412132740020752, 1.6172142028808594, 1.6932151317596436, 1.7692160606384277]}, "gradients/encoder.encoder.layers.1.final_layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 0.0, 0.0, 2.0, 3.0, 1.0, 3.0, 2.0, 7.0, 4.0, 4.0, 4.0, 5.0, 16.0, 8.0, 11.0, 9.0, 15.0, 13.0, 21.0, 18.0, 26.0, 26.0, 28.0, 28.0, 41.0, 36.0, 53.0, 52.0, 66.0, 61.0, 49.0, 42.0, 49.0, 37.0, 34.0, 35.0, 35.0, 29.0, 18.0, 21.0, 14.0, 14.0, 18.0, 8.0, 9.0, 9.0, 5.0, 6.0, 4.0, 5.0, 4.0, 1.0, 5.0, 2.0, 1.0, 1.0, 0.0, 3.0, 0.0, 0.0, 1.0], "bins": [-1.1733834743499756, -1.1367570161819458, -1.1001306772232056, -1.0635042190551758, -1.026877760887146, -0.990251362323761, -0.953624963760376, -0.9169985055923462, -0.8803721070289612, -0.8437457084655762, -0.8071192502975464, -0.7704928517341614, -0.7338664531707764, -0.6972399950027466, -0.6606135964393616, -0.6239871978759766, -0.5873607397079468, -0.5507343411445618, -0.514107882976532, -0.477481484413147, -0.4408550560474396, -0.4042286276817322, -0.36760222911834717, -0.33097580075263977, -0.2943493723869324, -0.257722944021225, -0.22109653055667877, -0.18447011709213257, -0.14784368872642517, -0.11121726036071777, -0.07459084689617157, -0.037964433431625366, -0.0013381242752075195, 0.03528829663991928, 0.07191471755504608, 0.10854113847017288, 0.14516755938529968, 0.18179398775100708, 0.21842040121555328, 0.2550468146800995, 0.2916732430458069, 0.3282996714115143, 0.3649260997772217, 0.4015524983406067, 0.4381789267063141, 0.4748053550720215, 0.5114317536354065, 0.5480581521987915, 0.5846846103668213, 0.6213110089302063, 0.6579374670982361, 0.6945638656616211, 0.7311903238296509, 0.7678167223930359, 0.8044431209564209, 0.8410695791244507, 0.8776959776878357, 0.9143223762512207, 0.9509488344192505, 0.9875752329826355, 1.0242016315460205, 1.0608280897140503, 1.09745454788208, 1.1340808868408203, 1.17070734500885]}, "gradients/encoder.encoder.layers.1.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 5.0, 2.0, 3.0, 1.0, 5.0, 11.0, 12.0, 8.0, 21.0, 25.0, 34.0, 50.0, 71.0, 103.0, 163.0, 239.0, 330.0, 486.0, 723.0, 1115.0, 1638.0, 2706.0, 4423.0, 7191.0, 12871.0, 22976.0, 44493.0, 92860.0, 199845.0, 296333.0, 182513.0, 84834.0, 41029.0, 21182.0, 11895.0, 7030.0, 4057.0, 2494.0, 1659.0, 1039.0, 670.0, 441.0, 317.0, 213.0, 156.0, 94.0, 45.0, 49.0, 37.0, 19.0, 17.0, 12.0, 6.0, 12.0, 6.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.1177978515625, -0.11392784118652344, -0.11005783081054688, -0.10618782043457031, -0.10231781005859375, -0.09844779968261719, -0.09457778930664062, -0.09070777893066406, -0.0868377685546875, -0.08296775817871094, -0.07909774780273438, -0.07522773742675781, -0.07135772705078125, -0.06748771667480469, -0.06361770629882812, -0.05974769592285156, -0.055877685546875, -0.05200767517089844, -0.048137664794921875, -0.04426765441894531, -0.04039764404296875, -0.03652763366699219, -0.032657623291015625, -0.028787612915039062, -0.0249176025390625, -0.021047592163085938, -0.017177581787109375, -0.013307571411132812, -0.00943756103515625, -0.0055675506591796875, -0.001697540283203125, 0.0021724700927734375, 0.00604248046875, 0.009912490844726562, 0.013782501220703125, 0.017652511596679688, 0.02152252197265625, 0.025392532348632812, 0.029262542724609375, 0.03313255310058594, 0.0370025634765625, 0.04087257385253906, 0.044742584228515625, 0.04861259460449219, 0.05248260498046875, 0.05635261535644531, 0.060222625732421875, 0.06409263610839844, 0.067962646484375, 0.07183265686035156, 0.07570266723632812, 0.07957267761230469, 0.08344268798828125, 0.08731269836425781, 0.09118270874023438, 0.09505271911621094, 0.0989227294921875, 0.10279273986816406, 0.10666275024414062, 0.11053276062011719, 0.11440277099609375, 0.11827278137207031, 0.12214279174804688, 0.12601280212402344, 0.1298828125]}, "gradients/encoder.encoder.layers.1.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 3.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 2.0, 1.0, 4.0, 4.0, 6.0, 9.0, 6.0, 8.0, 13.0, 12.0, 19.0, 23.0, 20.0, 19.0, 29.0, 32.0, 42.0, 42.0, 56.0, 47.0, 41.0, 64.0, 55.0, 58.0, 48.0, 51.0, 43.0, 45.0, 38.0, 29.0, 28.0, 14.0, 36.0, 13.0, 15.0, 8.0, 8.0, 7.0, 3.0, 3.0, 2.0, 3.0, 3.0, 4.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.093994140625, -0.09105587005615234, -0.08811759948730469, -0.08517932891845703, -0.08224105834960938, -0.07930278778076172, -0.07636451721191406, -0.0734262466430664, -0.07048797607421875, -0.0675497055053711, -0.06461143493652344, -0.06167316436767578, -0.058734893798828125, -0.05579662322998047, -0.05285835266113281, -0.049920082092285156, -0.0469818115234375, -0.044043540954589844, -0.04110527038574219, -0.03816699981689453, -0.035228729248046875, -0.03229045867919922, -0.029352188110351562, -0.026413917541503906, -0.02347564697265625, -0.020537376403808594, -0.017599105834960938, -0.014660835266113281, -0.011722564697265625, -0.008784294128417969, -0.0058460235595703125, -0.0029077529907226562, 3.0517578125e-05, 0.0029687881469726562, 0.0059070587158203125, 0.008845329284667969, 0.011783599853515625, 0.014721870422363281, 0.017660140991210938, 0.020598411560058594, 0.02353668212890625, 0.026474952697753906, 0.029413223266601562, 0.03235149383544922, 0.035289764404296875, 0.03822803497314453, 0.04116630554199219, 0.044104576110839844, 0.0470428466796875, 0.049981117248535156, 0.05291938781738281, 0.05585765838623047, 0.058795928955078125, 0.06173419952392578, 0.06467247009277344, 0.0676107406616211, 0.07054901123046875, 0.0734872817993164, 0.07642555236816406, 0.07936382293701172, 0.08230209350585938, 0.08524036407470703, 0.08817863464355469, 0.09111690521240234, 0.09405517578125]}, "gradients/encoder.encoder.layers.1.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 2.0, 0.0, 0.0, 2.0, 4.0, 2.0, 3.0, 1.0, 4.0, 9.0, 10.0, 22.0, 20.0, 22.0, 26.0, 44.0, 53.0, 85.0, 133.0, 217.0, 320.0, 557.0, 1094.0, 2423.0, 7714.0, 41131.0, 743827.0, 220251.0, 21574.0, 4983.0, 1873.0, 855.0, 442.0, 254.0, 192.0, 121.0, 82.0, 49.0, 36.0, 27.0, 26.0, 23.0, 9.0, 10.0, 12.0, 6.0, 8.0, 5.0, 1.0, 1.0, 0.0, 1.0, 2.0, 3.0], "bins": [-0.48486328125, -0.4716987609863281, -0.45853424072265625, -0.4453697204589844, -0.4322052001953125, -0.4190406799316406, -0.40587615966796875, -0.3927116394042969, -0.379547119140625, -0.3663825988769531, -0.35321807861328125, -0.3400535583496094, -0.3268890380859375, -0.3137245178222656, -0.30055999755859375, -0.2873954772949219, -0.27423095703125, -0.2610664367675781, -0.24790191650390625, -0.23473739624023438, -0.2215728759765625, -0.20840835571289062, -0.19524383544921875, -0.18207931518554688, -0.168914794921875, -0.15575027465820312, -0.14258575439453125, -0.12942123413085938, -0.1162567138671875, -0.10309219360351562, -0.08992767333984375, -0.07676315307617188, -0.0635986328125, -0.050434112548828125, -0.03726959228515625, -0.024105072021484375, -0.0109405517578125, 0.002223968505859375, 0.01538848876953125, 0.028553009033203125, 0.041717529296875, 0.054882049560546875, 0.06804656982421875, 0.08121109008789062, 0.0943756103515625, 0.10754013061523438, 0.12070465087890625, 0.13386917114257812, 0.14703369140625, 0.16019821166992188, 0.17336273193359375, 0.18652725219726562, 0.1996917724609375, 0.21285629272460938, 0.22602081298828125, 0.23918533325195312, 0.252349853515625, 0.2655143737792969, 0.27867889404296875, 0.2918434143066406, 0.3050079345703125, 0.3181724548339844, 0.33133697509765625, 0.3445014953613281, 0.357666015625]}, "gradients/encoder.encoder.layers.1.attention.v_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 5.0, 3.0, 5.0, 7.0, 4.0, 4.0, 9.0, 13.0, 12.0, 15.0, 13.0, 23.0, 21.0, 31.0, 34.0, 43.0, 66.0, 63.0, 51.0, 57.0, 70.0, 68.0, 56.0, 41.0, 51.0, 46.0, 35.0, 26.0, 26.0, 17.0, 24.0, 16.0, 9.0, 12.0, 9.0, 4.0, 4.0, 4.0, 5.0, 3.0, 4.0, 4.0, 2.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 2.0], "bins": [-0.447998046875, -0.4349861145019531, -0.42197418212890625, -0.4089622497558594, -0.3959503173828125, -0.3829383850097656, -0.36992645263671875, -0.3569145202636719, -0.343902587890625, -0.3308906555175781, -0.31787872314453125, -0.3048667907714844, -0.2918548583984375, -0.2788429260253906, -0.26583099365234375, -0.2528190612792969, -0.23980712890625, -0.22679519653320312, -0.21378326416015625, -0.20077133178710938, -0.1877593994140625, -0.17474746704101562, -0.16173553466796875, -0.14872360229492188, -0.135711669921875, -0.12269973754882812, -0.10968780517578125, -0.09667587280273438, -0.0836639404296875, -0.07065200805664062, -0.05764007568359375, -0.044628143310546875, -0.0316162109375, -0.018604278564453125, -0.00559234619140625, 0.007419586181640625, 0.0204315185546875, 0.033443450927734375, 0.04645538330078125, 0.059467315673828125, 0.072479248046875, 0.08549118041992188, 0.09850311279296875, 0.11151504516601562, 0.1245269775390625, 0.13753890991210938, 0.15055084228515625, 0.16356277465820312, 0.17657470703125, 0.18958663940429688, 0.20259857177734375, 0.21561050415039062, 0.2286224365234375, 0.24163436889648438, 0.25464630126953125, 0.2676582336425781, 0.280670166015625, 0.2936820983886719, 0.30669403076171875, 0.3197059631347656, 0.3327178955078125, 0.3457298278808594, 0.35874176025390625, 0.3717536926269531, 0.384765625]}, "gradients/encoder.encoder.layers.1.attention.k_proj.weight": {"_type": "histogram", "values": [3.0, 1.0, 0.0, 2.0, 1.0, 0.0, 1.0, 3.0, 2.0, 2.0, 5.0, 4.0, 3.0, 9.0, 3.0, 11.0, 15.0, 14.0, 27.0, 43.0, 66.0, 95.0, 157.0, 272.0, 421.0, 785.0, 1435.0, 2886.0, 6892.0, 23719.0, 150425.0, 764356.0, 72381.0, 14653.0, 4829.0, 2324.0, 1098.0, 588.0, 353.0, 212.0, 164.0, 94.0, 54.0, 44.0, 40.0, 20.0, 15.0, 9.0, 8.0, 7.0, 4.0, 5.0, 5.0, 1.0, 0.0, 0.0, 0.0, 5.0, 1.0, 1.0, 0.0, 0.0, 1.0, 2.0], "bins": [-0.10601806640625, -0.10263347625732422, -0.09924888610839844, -0.09586429595947266, -0.09247970581054688, -0.0890951156616211, -0.08571052551269531, -0.08232593536376953, -0.07894134521484375, -0.07555675506591797, -0.07217216491699219, -0.0687875747680664, -0.06540298461914062, -0.062018394470214844, -0.05863380432128906, -0.05524921417236328, -0.0518646240234375, -0.04848003387451172, -0.04509544372558594, -0.041710853576660156, -0.038326263427734375, -0.034941673278808594, -0.03155708312988281, -0.02817249298095703, -0.02478790283203125, -0.02140331268310547, -0.018018722534179688, -0.014634132385253906, -0.011249542236328125, -0.007864952087402344, -0.0044803619384765625, -0.0010957717895507812, 0.002288818359375, 0.005673408508300781, 0.009057998657226562, 0.012442588806152344, 0.015827178955078125, 0.019211769104003906, 0.022596359252929688, 0.02598094940185547, 0.02936553955078125, 0.03275012969970703, 0.03613471984863281, 0.039519309997558594, 0.042903900146484375, 0.046288490295410156, 0.04967308044433594, 0.05305767059326172, 0.0564422607421875, 0.05982685089111328, 0.06321144104003906, 0.06659603118896484, 0.06998062133789062, 0.0733652114868164, 0.07674980163574219, 0.08013439178466797, 0.08351898193359375, 0.08690357208251953, 0.09028816223144531, 0.0936727523803711, 0.09705734252929688, 0.10044193267822266, 0.10382652282714844, 0.10721111297607422, 0.110595703125]}, "gradients/encoder.encoder.layers.1.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 2.0, 2.0, 1.0, 2.0, 2.0, 2.0, 6.0, 5.0, 4.0, 4.0, 3.0, 8.0, 12.0, 13.0, 17.0, 27.0, 53.0, 56.0, 82.0, 98.0, 125.0, 134.0, 88.0, 68.0, 45.0, 26.0, 36.0, 18.0, 16.0, 10.0, 8.0, 6.0, 9.0, 6.0, 6.0, 3.0, 3.0, 4.0, 3.0, 4.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.5703182220458984e-05, -3.426987677812576e-05, -3.283657133579254e-05, -3.140326589345932e-05, -2.99699604511261e-05, -2.8536655008792877e-05, -2.7103349566459656e-05, -2.5670044124126434e-05, -2.4236738681793213e-05, -2.280343323945999e-05, -2.137012779712677e-05, -1.993682235479355e-05, -1.8503516912460327e-05, -1.7070211470127106e-05, -1.5636906027793884e-05, -1.4203600585460663e-05, -1.2770295143127441e-05, -1.133698970079422e-05, -9.903684258460999e-06, -8.470378816127777e-06, -7.037073373794556e-06, -5.603767931461334e-06, -4.170462489128113e-06, -2.7371570467948914e-06, -1.30385160446167e-06, 1.2945383787155151e-07, 1.562759280204773e-06, 2.9960647225379944e-06, 4.429370164871216e-06, 5.862675607204437e-06, 7.295981049537659e-06, 8.72928649187088e-06, 1.0162591934204102e-05, 1.1595897376537323e-05, 1.3029202818870544e-05, 1.4462508261203766e-05, 1.5895813703536987e-05, 1.732911914587021e-05, 1.876242458820343e-05, 2.019573003053665e-05, 2.1629035472869873e-05, 2.3062340915203094e-05, 2.4495646357536316e-05, 2.5928951799869537e-05, 2.736225724220276e-05, 2.879556268453598e-05, 3.02288681268692e-05, 3.166217356920242e-05, 3.3095479011535645e-05, 3.4528784453868866e-05, 3.596208989620209e-05, 3.739539533853531e-05, 3.882870078086853e-05, 4.026200622320175e-05, 4.169531166553497e-05, 4.3128617107868195e-05, 4.4561922550201416e-05, 4.599522799253464e-05, 4.742853343486786e-05, 4.886183887720108e-05, 5.02951443195343e-05, 5.172844976186752e-05, 5.3161755204200745e-05, 5.4595060646533966e-05, 5.602836608886719e-05]}, "gradients/encoder.encoder.layers.1.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 1.0, 4.0, 3.0, 9.0, 8.0, 12.0, 19.0, 33.0, 51.0, 94.0, 167.0, 355.0, 977.0, 3650.0, 27238.0, 884677.0, 120621.0, 7872.0, 1681.0, 557.0, 234.0, 114.0, 73.0, 44.0, 29.0, 8.0, 10.0, 7.0, 5.0, 3.0, 1.0, 1.0, 0.0, 4.0, 2.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-0.187255859375, -0.18116188049316406, -0.17506790161132812, -0.1689739227294922, -0.16287994384765625, -0.1567859649658203, -0.15069198608398438, -0.14459800720214844, -0.1385040283203125, -0.13241004943847656, -0.12631607055664062, -0.12022209167480469, -0.11412811279296875, -0.10803413391113281, -0.10194015502929688, -0.09584617614746094, -0.089752197265625, -0.08365821838378906, -0.07756423950195312, -0.07147026062011719, -0.06537628173828125, -0.05928230285644531, -0.053188323974609375, -0.04709434509277344, -0.0410003662109375, -0.03490638732910156, -0.028812408447265625, -0.022718429565429688, -0.01662445068359375, -0.010530471801757812, -0.004436492919921875, 0.0016574859619140625, 0.00775146484375, 0.013845443725585938, 0.019939422607421875, 0.026033401489257812, 0.03212738037109375, 0.03822135925292969, 0.044315338134765625, 0.05040931701660156, 0.0565032958984375, 0.06259727478027344, 0.06869125366210938, 0.07478523254394531, 0.08087921142578125, 0.08697319030761719, 0.09306716918945312, 0.09916114807128906, 0.105255126953125, 0.11134910583496094, 0.11744308471679688, 0.12353706359863281, 0.12963104248046875, 0.1357250213623047, 0.14181900024414062, 0.14791297912597656, 0.1540069580078125, 0.16010093688964844, 0.16619491577148438, 0.1722888946533203, 0.17838287353515625, 0.1844768524169922, 0.19057083129882812, 0.19666481018066406, 0.2027587890625]}, "gradients/encoder.encoder.layers.1.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 4.0, 2.0, 3.0, 1.0, 2.0, 10.0, 11.0, 17.0, 26.0, 59.0, 74.0, 143.0, 185.0, 189.0, 114.0, 66.0, 43.0, 27.0, 18.0, 9.0, 6.0, 0.0, 3.0, 1.0, 2.0, 0.0, 1.0, 1.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1571044921875, -0.15201377868652344, -0.14692306518554688, -0.1418323516845703, -0.13674163818359375, -0.1316509246826172, -0.12656021118164062, -0.12146949768066406, -0.1163787841796875, -0.11128807067871094, -0.10619735717773438, -0.10110664367675781, -0.09601593017578125, -0.09092521667480469, -0.08583450317382812, -0.08074378967285156, -0.075653076171875, -0.07056236267089844, -0.06547164916992188, -0.06038093566894531, -0.05529022216796875, -0.05019950866699219, -0.045108795166015625, -0.04001808166503906, -0.0349273681640625, -0.029836654663085938, -0.024745941162109375, -0.019655227661132812, -0.01456451416015625, -0.009473800659179688, -0.004383087158203125, 0.0007076263427734375, 0.00579833984375, 0.010889053344726562, 0.015979766845703125, 0.021070480346679688, 0.02616119384765625, 0.03125190734863281, 0.036342620849609375, 0.04143333435058594, 0.0465240478515625, 0.05161476135253906, 0.056705474853515625, 0.06179618835449219, 0.06688690185546875, 0.07197761535644531, 0.07706832885742188, 0.08215904235839844, 0.087249755859375, 0.09234046936035156, 0.09743118286132812, 0.10252189636230469, 0.10761260986328125, 0.11270332336425781, 0.11779403686523438, 0.12288475036621094, 0.1279754638671875, 0.13306617736816406, 0.13815689086914062, 0.1432476043701172, 0.14833831787109375, 0.1534290313720703, 0.15851974487304688, 0.16361045837402344, 0.168701171875]}, "gradients/encoder.encoder.layers.1.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 2.0, 11.0, 3.0, 6.0, 16.0, 29.0, 45.0, 72.0, 135.0, 386.0, 145.0, 69.0, 39.0, 21.0, 12.0, 5.0, 10.0, 1.0, 1.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 1.0], "bins": [-3.1913084983825684, -3.0927889347076416, -2.994269371032715, -2.895749568939209, -2.7972300052642822, -2.6987104415893555, -2.6001908779144287, -2.501671314239502, -2.403151750564575, -2.3046321868896484, -2.2061126232147217, -2.107593059539795, -2.009073257446289, -1.9105536937713623, -1.8120341300964355, -1.7135145664215088, -1.6149948835372925, -1.5164753198623657, -1.4179556369781494, -1.3194360733032227, -1.220916509628296, -1.1223969459533691, -1.0238772630691528, -0.9253576993942261, -0.8268380761146545, -0.728318452835083, -0.6297988891601562, -0.5312792658805847, -0.43275967240333557, -0.3342400789260864, -0.2357204556465149, -0.13720089197158813, -0.0386812686920166, 0.05983833223581314, 0.15835793316364288, 0.2568775415420532, 0.35539713501930237, 0.4539167284965515, 0.552436351776123, 0.6509559154510498, 0.7494755387306213, 0.8479951620101929, 0.9465147256851196, 1.045034408569336, 1.1435539722442627, 1.2420735359191895, 1.3405930995941162, 1.439112663269043, 1.5376323461532593, 1.636151909828186, 1.7346715927124023, 1.833191156387329, 1.9317107200622559, 2.0302302837371826, 2.1287498474121094, 2.2272696495056152, 2.325789213180542, 2.4243087768554688, 2.5228283405303955, 2.6213479042053223, 2.719867706298828, 2.818387269973755, 2.9169068336486816, 3.0154263973236084, 3.113945960998535]}, "gradients/encoder.encoder.layers.1.layer_norm.bias": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 2.0, 3.0, 2.0, 4.0, 1.0, 5.0, 4.0, 4.0, 8.0, 12.0, 10.0, 10.0, 8.0, 18.0, 14.0, 14.0, 21.0, 23.0, 33.0, 21.0, 29.0, 31.0, 45.0, 49.0, 127.0, 173.0, 53.0, 40.0, 40.0, 26.0, 33.0, 22.0, 15.0, 12.0, 16.0, 18.0, 10.0, 17.0, 9.0, 9.0, 3.0, 5.0, 8.0, 3.0, 1.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0], "bins": [-1.8218750953674316, -1.7668397426605225, -1.7118045091629028, -1.6567691564559937, -1.6017338037490845, -1.5466985702514648, -1.4916632175445557, -1.4366278648376465, -1.3815926313400269, -1.3265572786331177, -1.271522045135498, -1.2164866924285889, -1.1614513397216797, -1.10641610622406, -1.0513807535171509, -0.9963454604148865, -0.9413101077079773, -0.8862748146057129, -0.8312394618988037, -0.7762041687965393, -0.7211688756942749, -0.6661335229873657, -0.6110982298851013, -0.5560629367828369, -0.5010275840759277, -0.44599226117134094, -0.39095696806907654, -0.33592164516448975, -0.28088635206222534, -0.22585102915763855, -0.17081570625305176, -0.11578041315078735, -0.06074512004852295, -0.005709808319807053, 0.049325503408908844, 0.10436081886291504, 0.15939612686634064, 0.21443143486976624, 0.269466757774353, 0.32450205087661743, 0.3795373737812042, 0.434572696685791, 0.4896079897880554, 0.5446432828903198, 0.599678635597229, 0.6547139286994934, 0.7097492218017578, 0.764784574508667, 0.8198198676109314, 0.8748551607131958, 0.929890513420105, 0.9849258065223694, 1.0399610996246338, 1.094996452331543, 1.1500318050384521, 1.2050670385360718, 1.260102391242981, 1.3151377439498901, 1.3701729774475098, 1.425208330154419, 1.4802436828613281, 1.5352789163589478, 1.590314269065857, 1.6453495025634766, 1.7003848552703857]}, "gradients/encoder.encoder.layers.0.feed_forward.output_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 4.0, 0.0, 1.0, 2.0, 5.0, 5.0, 2.0, 2.0, 6.0, 13.0, 18.0, 22.0, 42.0, 37.0, 72.0, 165.0, 270.0, 521.0, 1053.0, 2617.0, 8114.0, 37274.0, 400296.0, 3297437.0, 396007.0, 37522.0, 8183.0, 2533.0, 1026.0, 443.0, 266.0, 139.0, 85.0, 46.0, 25.0, 20.0, 8.0, 7.0, 5.0, 3.0, 1.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.348876953125, -0.33931732177734375, -0.3297576904296875, -0.32019805908203125, -0.310638427734375, -0.30107879638671875, -0.2915191650390625, -0.28195953369140625, -0.27239990234375, -0.26284027099609375, -0.2532806396484375, -0.24372100830078125, -0.234161376953125, -0.22460174560546875, -0.2150421142578125, -0.20548248291015625, -0.1959228515625, -0.18636322021484375, -0.1768035888671875, -0.16724395751953125, -0.157684326171875, -0.14812469482421875, -0.1385650634765625, -0.12900543212890625, -0.11944580078125, -0.10988616943359375, -0.1003265380859375, -0.09076690673828125, -0.081207275390625, -0.07164764404296875, -0.0620880126953125, -0.05252838134765625, -0.04296875, -0.03340911865234375, -0.0238494873046875, -0.01428985595703125, -0.004730224609375, 0.00482940673828125, 0.0143890380859375, 0.02394866943359375, 0.03350830078125, 0.04306793212890625, 0.0526275634765625, 0.06218719482421875, 0.071746826171875, 0.08130645751953125, 0.0908660888671875, 0.10042572021484375, 0.1099853515625, 0.11954498291015625, 0.1291046142578125, 0.13866424560546875, 0.148223876953125, 0.15778350830078125, 0.1673431396484375, 0.17690277099609375, 0.18646240234375, 0.19602203369140625, 0.2055816650390625, 0.21514129638671875, 0.224700927734375, 0.23426055908203125, 0.2438201904296875, 0.25337982177734375, 0.262939453125]}, "gradients/encoder.encoder.layers.0.feed_forward.output_dense.bias": {"_type": "histogram", "values": [2.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 2.0, 3.0, 3.0, 5.0, 5.0, 4.0, 3.0, 10.0, 16.0, 12.0, 13.0, 13.0, 22.0, 29.0, 25.0, 22.0, 25.0, 33.0, 39.0, 41.0, 47.0, 44.0, 53.0, 52.0, 42.0, 51.0, 63.0, 36.0, 37.0, 26.0, 34.0, 33.0, 24.0, 20.0, 25.0, 23.0, 20.0, 10.0, 11.0, 12.0, 7.0, 3.0, 4.0, 3.0, 4.0, 2.0, 4.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.07794189453125, -0.07540512084960938, -0.07286834716796875, -0.07033157348632812, -0.0677947998046875, -0.06525802612304688, -0.06272125244140625, -0.060184478759765625, -0.057647705078125, -0.055110931396484375, -0.05257415771484375, -0.050037384033203125, -0.0475006103515625, -0.044963836669921875, -0.04242706298828125, -0.039890289306640625, -0.037353515625, -0.034816741943359375, -0.03227996826171875, -0.029743194580078125, -0.0272064208984375, -0.024669647216796875, -0.02213287353515625, -0.019596099853515625, -0.017059326171875, -0.014522552490234375, -0.01198577880859375, -0.009449005126953125, -0.0069122314453125, -0.004375457763671875, -0.00183868408203125, 0.000698089599609375, 0.00323486328125, 0.005771636962890625, 0.00830841064453125, 0.010845184326171875, 0.0133819580078125, 0.015918731689453125, 0.01845550537109375, 0.020992279052734375, 0.023529052734375, 0.026065826416015625, 0.02860260009765625, 0.031139373779296875, 0.0336761474609375, 0.036212921142578125, 0.03874969482421875, 0.041286468505859375, 0.0438232421875, 0.046360015869140625, 0.04889678955078125, 0.051433563232421875, 0.0539703369140625, 0.056507110595703125, 0.05904388427734375, 0.061580657958984375, 0.064117431640625, 0.06665420532226562, 0.06919097900390625, 0.07172775268554688, 0.0742645263671875, 0.07680130004882812, 0.07933807373046875, 0.08187484741210938, 0.08441162109375]}, "gradients/encoder.encoder.layers.0.feed_forward.intermediate_dense.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 5.0, 5.0, 5.0, 3.0, 5.0, 16.0, 20.0, 30.0, 50.0, 71.0, 116.0, 204.0, 384.0, 695.0, 1711.0, 8171.0, 364765.0, 3794788.0, 18560.0, 2643.0, 882.0, 518.0, 231.0, 145.0, 96.0, 63.0, 39.0, 27.0, 15.0, 9.0, 4.0, 6.0, 4.0, 2.0, 2.0, 4.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 2.0], "bins": [-1.0478515625, -1.0188751220703125, -0.989898681640625, -0.9609222412109375, -0.93194580078125, -0.9029693603515625, -0.873992919921875, -0.8450164794921875, -0.8160400390625, -0.7870635986328125, -0.758087158203125, -0.7291107177734375, -0.70013427734375, -0.6711578369140625, -0.642181396484375, -0.6132049560546875, -0.584228515625, -0.5552520751953125, -0.526275634765625, -0.4972991943359375, -0.46832275390625, -0.4393463134765625, -0.410369873046875, -0.3813934326171875, -0.3524169921875, -0.3234405517578125, -0.294464111328125, -0.2654876708984375, -0.23651123046875, -0.2075347900390625, -0.178558349609375, -0.1495819091796875, -0.12060546875, -0.0916290283203125, -0.062652587890625, -0.0336761474609375, -0.00469970703125, 0.0242767333984375, 0.053253173828125, 0.0822296142578125, 0.1112060546875, 0.1401824951171875, 0.169158935546875, 0.1981353759765625, 0.22711181640625, 0.2560882568359375, 0.285064697265625, 0.3140411376953125, 0.343017578125, 0.3719940185546875, 0.400970458984375, 0.4299468994140625, 0.45892333984375, 0.4878997802734375, 0.516876220703125, 0.5458526611328125, 0.5748291015625, 0.6038055419921875, 0.632781982421875, 0.6617584228515625, 0.69073486328125, 0.7197113037109375, 0.748687744140625, 0.7776641845703125, 0.806640625]}, "gradients/encoder.encoder.layers.0.feed_forward.intermediate_dense.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 2.0, 6.0, 5.0, 2.0, 3.0, 6.0, 7.0, 12.0, 25.0, 35.0, 31.0, 43.0, 76.0, 69.0, 137.0, 202.0, 304.0, 626.0, 729.0, 610.0, 396.0, 240.0, 155.0, 118.0, 71.0, 45.0, 31.0, 21.0, 26.0, 15.0, 19.0, 8.0, 6.0, 0.0, 1.0, 2.0, 0.0, 1.0, 3.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.305419921875, -0.29390716552734375, -0.2823944091796875, -0.27088165283203125, -0.259368896484375, -0.24785614013671875, -0.2363433837890625, -0.22483062744140625, -0.21331787109375, -0.20180511474609375, -0.1902923583984375, -0.17877960205078125, -0.167266845703125, -0.15575408935546875, -0.1442413330078125, -0.13272857666015625, -0.1212158203125, -0.10970306396484375, -0.0981903076171875, -0.08667755126953125, -0.075164794921875, -0.06365203857421875, -0.0521392822265625, -0.04062652587890625, -0.02911376953125, -0.01760101318359375, -0.0060882568359375, 0.00542449951171875, 0.016937255859375, 0.02845001220703125, 0.0399627685546875, 0.05147552490234375, 0.06298828125, 0.07450103759765625, 0.0860137939453125, 0.09752655029296875, 0.109039306640625, 0.12055206298828125, 0.1320648193359375, 0.14357757568359375, 0.15509033203125, 0.16660308837890625, 0.1781158447265625, 0.18962860107421875, 0.201141357421875, 0.21265411376953125, 0.2241668701171875, 0.23567962646484375, 0.2471923828125, 0.25870513916015625, 0.2702178955078125, 0.28173065185546875, 0.293243408203125, 0.30475616455078125, 0.3162689208984375, 0.32778167724609375, 0.33929443359375, 0.35080718994140625, 0.3623199462890625, 0.37383270263671875, 0.385345458984375, 0.39685821533203125, 0.4083709716796875, 0.41988372802734375, 0.431396484375]}, "gradients/encoder.encoder.layers.0.final_layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 3.0, 2.0, 3.0, 5.0, 6.0, 18.0, 23.0, 46.0, 104.0, 214.0, 263.0, 191.0, 76.0, 28.0, 11.0, 6.0, 4.0, 3.0, 2.0, 1.0, 2.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-6.403827667236328, -6.197922706604004, -5.992018222808838, -5.786113262176514, -5.5802083015441895, -5.374303817749023, -5.168398857116699, -4.962493896484375, -4.756588935852051, -4.550683975219727, -4.3447794914245605, -4.138874530792236, -3.932969570159912, -3.727064847946167, -3.521160125732422, -3.3152551651000977, -3.1093506813049316, -2.9034459590911865, -2.6975409984588623, -2.491636276245117, -2.285731315612793, -2.079826593399048, -1.8739218711853027, -1.668017029762268, -1.4621121883392334, -1.2562073469161987, -1.050302505493164, -0.844397783279419, -0.6384929418563843, -0.4325881004333496, -0.2266833782196045, -0.020778536796569824, 0.18512582778930664, 0.3910306394100189, 0.5969354510307312, 0.8028402328491211, 1.0087450742721558, 1.2146499156951904, 1.4205546379089355, 1.6264594793319702, 1.8323643207550049, 2.03826904296875, 2.244174003601074, 2.4500787258148193, 2.6559834480285645, 2.8618884086608887, 3.067793130874634, 3.273697853088379, 3.479602813720703, 3.6855075359344482, 3.8914124965667725, 4.097317218780518, 4.303222179412842, 4.509126663208008, 4.715031623840332, 4.920936584472656, 5.1268415451049805, 5.332746505737305, 5.538650989532471, 5.744555950164795, 5.950460910797119, 6.156365394592285, 6.362270355224609, 6.568175315856934, 6.7740797996521]}, "gradients/encoder.encoder.layers.0.final_layer_norm.bias": {"_type": "histogram", "values": [3.0, 1.0, 1.0, 0.0, 4.0, 5.0, 2.0, 4.0, 3.0, 4.0, 12.0, 10.0, 13.0, 9.0, 11.0, 20.0, 21.0, 28.0, 19.0, 17.0, 19.0, 20.0, 29.0, 36.0, 38.0, 42.0, 49.0, 42.0, 44.0, 50.0, 49.0, 51.0, 41.0, 40.0, 32.0, 35.0, 25.0, 28.0, 21.0, 21.0, 21.0, 14.0, 22.0, 6.0, 17.0, 6.0, 9.0, 5.0, 5.0, 5.0, 3.0, 2.0, 3.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 1.0], "bins": [-1.5701229572296143, -1.5177509784698486, -1.465378999710083, -1.4130070209503174, -1.3606350421905518, -1.3082630634307861, -1.25589120388031, -1.2035192251205444, -1.1511472463607788, -1.0987752676010132, -1.0464032888412476, -0.9940313696861267, -0.9416593909263611, -0.8892874121665955, -0.8369154930114746, -0.784543514251709, -0.7321715354919434, -0.6797995567321777, -0.6274275779724121, -0.5750556588172913, -0.5226836800575256, -0.47031170129776, -0.4179397523403168, -0.36556780338287354, -0.3131958246231079, -0.2608238458633423, -0.20845189690589905, -0.15607993304729462, -0.10370796918869019, -0.05133599042892456, 0.0010359585285186768, 0.053407907485961914, 0.10578000545501709, 0.15815196931362152, 0.21052393317222595, 0.2628958821296692, 0.3152678608894348, 0.36763983964920044, 0.4200117886066437, 0.4723837375640869, 0.5247557163238525, 0.5771276950836182, 0.6294996738433838, 0.6818715929985046, 0.7342435717582703, 0.7866155505180359, 0.8389874696731567, 0.8913594484329224, 0.943731427192688, 0.9961034059524536, 1.0484753847122192, 1.1008473634719849, 1.153219223022461, 1.2055912017822266, 1.2579631805419922, 1.3103351593017578, 1.3627071380615234, 1.415079116821289, 1.4674510955810547, 1.5198230743408203, 1.572195053100586, 1.6245670318603516, 1.6769388914108276, 1.7293108701705933, 1.7816828489303589]}, "gradients/encoder.encoder.layers.0.attention.out_proj.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 2.0, 3.0, 3.0, 4.0, 5.0, 16.0, 16.0, 37.0, 49.0, 96.0, 171.0, 339.0, 641.0, 1732.0, 4597.0, 18917.0, 144942.0, 772434.0, 85129.0, 13278.0, 3615.0, 1327.0, 593.0, 279.0, 144.0, 72.0, 40.0, 30.0, 22.0, 9.0, 13.0, 6.0, 1.0, 2.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.427734375, -0.41452789306640625, -0.4013214111328125, -0.38811492919921875, -0.374908447265625, -0.36170196533203125, -0.3484954833984375, -0.33528900146484375, -0.32208251953125, -0.30887603759765625, -0.2956695556640625, -0.28246307373046875, -0.269256591796875, -0.25605010986328125, -0.2428436279296875, -0.22963714599609375, -0.2164306640625, -0.20322418212890625, -0.1900177001953125, -0.17681121826171875, -0.163604736328125, -0.15039825439453125, -0.1371917724609375, -0.12398529052734375, -0.11077880859375, -0.09757232666015625, -0.0843658447265625, -0.07115936279296875, -0.057952880859375, -0.04474639892578125, -0.0315399169921875, -0.01833343505859375, -0.005126953125, 0.00807952880859375, 0.0212860107421875, 0.03449249267578125, 0.047698974609375, 0.06090545654296875, 0.0741119384765625, 0.08731842041015625, 0.10052490234375, 0.11373138427734375, 0.1269378662109375, 0.14014434814453125, 0.153350830078125, 0.16655731201171875, 0.1797637939453125, 0.19297027587890625, 0.2061767578125, 0.21938323974609375, 0.2325897216796875, 0.24579620361328125, 0.259002685546875, 0.27220916748046875, 0.2854156494140625, 0.29862213134765625, 0.31182861328125, 0.32503509521484375, 0.3382415771484375, 0.35144805908203125, 0.364654541015625, 0.37786102294921875, 0.3910675048828125, 0.40427398681640625, 0.41748046875]}, "gradients/encoder.encoder.layers.0.attention.out_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 4.0, 3.0, 3.0, 5.0, 5.0, 8.0, 14.0, 22.0, 24.0, 40.0, 59.0, 63.0, 73.0, 96.0, 108.0, 93.0, 90.0, 74.0, 56.0, 59.0, 42.0, 28.0, 18.0, 16.0, 3.0, 4.0, 2.0, 2.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.2298583984375, -0.2221546173095703, -0.21445083618164062, -0.20674705505371094, -0.19904327392578125, -0.19133949279785156, -0.18363571166992188, -0.1759319305419922, -0.1682281494140625, -0.1605243682861328, -0.15282058715820312, -0.14511680603027344, -0.13741302490234375, -0.12970924377441406, -0.12200546264648438, -0.11430168151855469, -0.106597900390625, -0.09889411926269531, -0.09119033813476562, -0.08348655700683594, -0.07578277587890625, -0.06807899475097656, -0.060375213623046875, -0.05267143249511719, -0.0449676513671875, -0.03726387023925781, -0.029560089111328125, -0.021856307983398438, -0.01415252685546875, -0.0064487457275390625, 0.001255035400390625, 0.008958816528320312, 0.01666259765625, 0.024366378784179688, 0.032070159912109375, 0.03977394104003906, 0.04747772216796875, 0.05518150329589844, 0.06288528442382812, 0.07058906555175781, 0.0782928466796875, 0.08599662780761719, 0.09370040893554688, 0.10140419006347656, 0.10910797119140625, 0.11681175231933594, 0.12451553344726562, 0.1322193145751953, 0.139923095703125, 0.1476268768310547, 0.15533065795898438, 0.16303443908691406, 0.17073822021484375, 0.17844200134277344, 0.18614578247070312, 0.1938495635986328, 0.2015533447265625, 0.2092571258544922, 0.21696090698242188, 0.22466468811035156, 0.23236846923828125, 0.24007225036621094, 0.24777603149414062, 0.2554798126220703, 0.26318359375]}, "gradients/encoder.encoder.layers.0.attention.v_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 1.0, 4.0, 6.0, 5.0, 9.0, 11.0, 17.0, 14.0, 22.0, 22.0, 43.0, 68.0, 78.0, 150.0, 261.0, 472.0, 1011.0, 2798.0, 11693.0, 103988.0, 880191.0, 37973.0, 6212.0, 1811.0, 731.0, 348.0, 200.0, 129.0, 86.0, 66.0, 49.0, 32.0, 18.0, 14.0, 8.0, 5.0, 6.0, 8.0, 4.0, 2.0, 1.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.262939453125, -0.2511482238769531, -0.23935699462890625, -0.22756576538085938, -0.2157745361328125, -0.20398330688476562, -0.19219207763671875, -0.18040084838867188, -0.168609619140625, -0.15681838989257812, -0.14502716064453125, -0.13323593139648438, -0.1214447021484375, -0.10965347290039062, -0.09786224365234375, -0.08607101440429688, -0.07427978515625, -0.062488555908203125, -0.05069732666015625, -0.038906097412109375, -0.0271148681640625, -0.015323638916015625, -0.00353240966796875, 0.008258819580078125, 0.020050048828125, 0.031841278076171875, 0.04363250732421875, 0.055423736572265625, 0.0672149658203125, 0.07900619506835938, 0.09079742431640625, 0.10258865356445312, 0.1143798828125, 0.12617111206054688, 0.13796234130859375, 0.14975357055664062, 0.1615447998046875, 0.17333602905273438, 0.18512725830078125, 0.19691848754882812, 0.208709716796875, 0.22050094604492188, 0.23229217529296875, 0.24408340454101562, 0.2558746337890625, 0.2676658630371094, 0.27945709228515625, 0.2912483215332031, 0.30303955078125, 0.3148307800292969, 0.32662200927734375, 0.3384132385253906, 0.3502044677734375, 0.3619956970214844, 0.37378692626953125, 0.3855781555175781, 0.397369384765625, 0.4091606140136719, 0.42095184326171875, 0.4327430725097656, 0.4445343017578125, 0.4563255310058594, 0.46811676025390625, 0.4799079895019531, 0.49169921875]}, "gradients/encoder.encoder.layers.0.attention.v_proj.bias": {"_type": "histogram", "values": [2.0, 1.0, 0.0, 1.0, 1.0, 2.0, 2.0, 3.0, 2.0, 1.0, 4.0, 8.0, 6.0, 6.0, 10.0, 8.0, 11.0, 14.0, 21.0, 22.0, 26.0, 22.0, 33.0, 29.0, 57.0, 56.0, 70.0, 82.0, 85.0, 77.0, 61.0, 44.0, 44.0, 30.0, 26.0, 21.0, 17.0, 14.0, 15.0, 17.0, 13.0, 12.0, 2.0, 9.0, 9.0, 7.0, 3.0, 3.0, 7.0, 1.0, 2.0, 3.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.4423828125, -0.426666259765625, -0.41094970703125, -0.395233154296875, -0.3795166015625, -0.363800048828125, -0.34808349609375, -0.332366943359375, -0.316650390625, -0.300933837890625, -0.28521728515625, -0.269500732421875, -0.2537841796875, -0.238067626953125, -0.22235107421875, -0.206634521484375, -0.19091796875, -0.175201416015625, -0.15948486328125, -0.143768310546875, -0.1280517578125, -0.112335205078125, -0.09661865234375, -0.080902099609375, -0.065185546875, -0.049468994140625, -0.03375244140625, -0.018035888671875, -0.0023193359375, 0.013397216796875, 0.02911376953125, 0.044830322265625, 0.060546875, 0.076263427734375, 0.09197998046875, 0.107696533203125, 0.1234130859375, 0.139129638671875, 0.15484619140625, 0.170562744140625, 0.186279296875, 0.201995849609375, 0.21771240234375, 0.233428955078125, 0.2491455078125, 0.264862060546875, 0.28057861328125, 0.296295166015625, 0.31201171875, 0.327728271484375, 0.34344482421875, 0.359161376953125, 0.3748779296875, 0.390594482421875, 0.40631103515625, 0.422027587890625, 0.437744140625, 0.453460693359375, 0.46917724609375, 0.484893798828125, 0.5006103515625, 0.516326904296875, 0.53204345703125, 0.547760009765625, 0.5634765625]}, "gradients/encoder.encoder.layers.0.attention.k_proj.weight": {"_type": "histogram", "values": [1.0, 2.0, 0.0, 3.0, 1.0, 2.0, 0.0, 1.0, 1.0, 3.0, 1.0, 1.0, 2.0, 5.0, 7.0, 6.0, 12.0, 16.0, 18.0, 20.0, 34.0, 41.0, 60.0, 87.0, 162.0, 341.0, 793.0, 2629.0, 13572.0, 812797.0, 203778.0, 10515.0, 2196.0, 703.0, 297.0, 153.0, 84.0, 49.0, 40.0, 44.0, 26.0, 16.0, 8.0, 13.0, 5.0, 5.0, 4.0, 3.0, 2.0, 4.0, 1.0, 0.0, 4.0, 1.0, 3.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0], "bins": [-0.111572265625, -0.10784530639648438, -0.10411834716796875, -0.10039138793945312, -0.0966644287109375, -0.09293746948242188, -0.08921051025390625, -0.08548355102539062, -0.081756591796875, -0.07802963256835938, -0.07430267333984375, -0.07057571411132812, -0.0668487548828125, -0.06312179565429688, -0.05939483642578125, -0.055667877197265625, -0.05194091796875, -0.048213958740234375, -0.04448699951171875, -0.040760040283203125, -0.0370330810546875, -0.033306121826171875, -0.02957916259765625, -0.025852203369140625, -0.022125244140625, -0.018398284912109375, -0.01467132568359375, -0.010944366455078125, -0.0072174072265625, -0.003490447998046875, 0.00023651123046875, 0.003963470458984375, 0.0076904296875, 0.011417388916015625, 0.01514434814453125, 0.018871307373046875, 0.0225982666015625, 0.026325225830078125, 0.03005218505859375, 0.033779144287109375, 0.037506103515625, 0.041233062744140625, 0.04496002197265625, 0.048686981201171875, 0.0524139404296875, 0.056140899658203125, 0.05986785888671875, 0.06359481811523438, 0.06732177734375, 0.07104873657226562, 0.07477569580078125, 0.07850265502929688, 0.0822296142578125, 0.08595657348632812, 0.08968353271484375, 0.09341049194335938, 0.097137451171875, 0.10086441040039062, 0.10459136962890625, 0.10831832885742188, 0.1120452880859375, 0.11577224731445312, 0.11949920654296875, 0.12322616577148438, 0.126953125]}, "gradients/encoder.encoder.layers.0.attention.k_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 1.0, 1.0, 0.0, 2.0, 1.0, 0.0, 0.0, 2.0, 2.0, 0.0, 0.0, 0.0, 2.0, 2.0, 1.0, 5.0, 3.0, 6.0, 6.0, 13.0, 8.0, 14.0, 11.0, 19.0, 19.0, 27.0, 44.0, 53.0, 90.0, 124.0, 158.0, 128.0, 65.0, 48.0, 31.0, 30.0, 27.0, 17.0, 11.0, 5.0, 6.0, 6.0, 4.0, 3.0, 4.0, 3.0, 7.0, 0.0, 2.0, 1.0, 1.0, 2.0, 3.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.5822391510009766e-05, -3.4714117646217346e-05, -3.360584378242493e-05, -3.249756991863251e-05, -3.138929605484009e-05, -3.028102219104767e-05, -2.917274832725525e-05, -2.806447446346283e-05, -2.695620059967041e-05, -2.584792673587799e-05, -2.473965287208557e-05, -2.3631379008293152e-05, -2.2523105144500732e-05, -2.1414831280708313e-05, -2.0306557416915894e-05, -1.9198283553123474e-05, -1.8090009689331055e-05, -1.6981735825538635e-05, -1.5873461961746216e-05, -1.4765188097953796e-05, -1.3656914234161377e-05, -1.2548640370368958e-05, -1.1440366506576538e-05, -1.0332092642784119e-05, -9.2238187789917e-06, -8.11554491519928e-06, -7.00727105140686e-06, -5.898997187614441e-06, -4.7907233238220215e-06, -3.682449460029602e-06, -2.5741755962371826e-06, -1.4659017324447632e-06, -3.5762786865234375e-07, 7.506459951400757e-07, 1.8589198589324951e-06, 2.9671937227249146e-06, 4.075467586517334e-06, 5.183741450309753e-06, 6.292015314102173e-06, 7.400289177894592e-06, 8.508563041687012e-06, 9.616836905479431e-06, 1.072511076927185e-05, 1.183338463306427e-05, 1.294165849685669e-05, 1.4049932360649109e-05, 1.5158206224441528e-05, 1.6266480088233948e-05, 1.7374753952026367e-05, 1.8483027815818787e-05, 1.9591301679611206e-05, 2.0699575543403625e-05, 2.1807849407196045e-05, 2.2916123270988464e-05, 2.4024397134780884e-05, 2.5132670998573303e-05, 2.6240944862365723e-05, 2.7349218726158142e-05, 2.845749258995056e-05, 2.956576645374298e-05, 3.06740403175354e-05, 3.178231418132782e-05, 3.289058804512024e-05, 3.399886190891266e-05, 3.510713577270508e-05]}, "gradients/encoder.encoder.layers.0.attention.q_proj.weight": {"_type": "histogram", "values": [1.0, 1.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 1.0, 4.0, 4.0, 2.0, 2.0, 3.0, 4.0, 9.0, 9.0, 5.0, 16.0, 35.0, 53.0, 100.0, 262.0, 735.0, 3119.0, 22772.0, 942965.0, 70626.0, 5967.0, 1185.0, 386.0, 150.0, 51.0, 34.0, 24.0, 9.0, 9.0, 5.0, 5.0, 4.0, 2.0, 1.0, 3.0, 2.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.1309814453125, -0.1268482208251953, -0.12271499633789062, -0.11858177185058594, -0.11444854736328125, -0.11031532287597656, -0.10618209838867188, -0.10204887390136719, -0.0979156494140625, -0.09378242492675781, -0.08964920043945312, -0.08551597595214844, -0.08138275146484375, -0.07724952697753906, -0.07311630249023438, -0.06898307800292969, -0.064849853515625, -0.06071662902832031, -0.056583404541015625, -0.05245018005371094, -0.04831695556640625, -0.04418373107910156, -0.040050506591796875, -0.03591728210449219, -0.0317840576171875, -0.027650833129882812, -0.023517608642578125, -0.019384384155273438, -0.01525115966796875, -0.011117935180664062, -0.006984710693359375, -0.0028514862060546875, 0.00128173828125, 0.0054149627685546875, 0.009548187255859375, 0.013681411743164062, 0.01781463623046875, 0.021947860717773438, 0.026081085205078125, 0.030214309692382812, 0.0343475341796875, 0.03848075866699219, 0.042613983154296875, 0.04674720764160156, 0.05088043212890625, 0.05501365661621094, 0.059146881103515625, 0.06328010559082031, 0.067413330078125, 0.07154655456542969, 0.07567977905273438, 0.07981300354003906, 0.08394622802734375, 0.08807945251464844, 0.09221267700195312, 0.09634590148925781, 0.1004791259765625, 0.10461235046386719, 0.10874557495117188, 0.11287879943847656, 0.11701202392578125, 0.12114524841308594, 0.12527847290039062, 0.1294116973876953, 0.133544921875]}, "gradients/encoder.encoder.layers.0.attention.q_proj.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 1.0, 2.0, 3.0, 3.0, 4.0, 2.0, 5.0, 9.0, 5.0, 7.0, 4.0, 13.0, 23.0, 21.0, 32.0, 53.0, 75.0, 127.0, 141.0, 149.0, 108.0, 58.0, 45.0, 36.0, 23.0, 14.0, 17.0, 8.0, 4.0, 6.0, 5.0, 6.0, 3.0, 1.0, 1.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.0802001953125, -0.07769107818603516, -0.07518196105957031, -0.07267284393310547, -0.07016372680664062, -0.06765460968017578, -0.06514549255371094, -0.0626363754272461, -0.06012725830078125, -0.057618141174316406, -0.05510902404785156, -0.05259990692138672, -0.050090789794921875, -0.04758167266845703, -0.04507255554199219, -0.042563438415527344, -0.0400543212890625, -0.037545204162597656, -0.03503608703613281, -0.03252696990966797, -0.030017852783203125, -0.02750873565673828, -0.024999618530273438, -0.022490501403808594, -0.01998138427734375, -0.017472267150878906, -0.014963150024414062, -0.012454032897949219, -0.009944915771484375, -0.007435798645019531, -0.0049266815185546875, -0.0024175643920898438, 9.1552734375e-05, 0.0026006698608398438, 0.0051097869873046875, 0.007618904113769531, 0.010128021240234375, 0.012637138366699219, 0.015146255493164062, 0.017655372619628906, 0.02016448974609375, 0.022673606872558594, 0.025182723999023438, 0.02769184112548828, 0.030200958251953125, 0.03271007537841797, 0.03521919250488281, 0.037728309631347656, 0.0402374267578125, 0.042746543884277344, 0.04525566101074219, 0.04776477813720703, 0.050273895263671875, 0.05278301239013672, 0.05529212951660156, 0.057801246643066406, 0.06031036376953125, 0.0628194808959961, 0.06532859802246094, 0.06783771514892578, 0.07034683227539062, 0.07285594940185547, 0.07536506652832031, 0.07787418365478516, 0.08038330078125]}, "gradients/encoder.encoder.layers.0.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 3.0, 1.0, 5.0, 3.0, 4.0, 8.0, 10.0, 11.0, 20.0, 25.0, 46.0, 78.0, 244.0, 361.0, 71.0, 51.0, 22.0, 22.0, 5.0, 6.0, 4.0, 4.0, 4.0, 2.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-2.7144558429718018, -2.651451587677002, -2.588447093963623, -2.5254428386688232, -2.4624385833740234, -2.3994343280792236, -2.336430072784424, -2.273425579071045, -2.210421323776245, -2.1474170684814453, -2.0844125747680664, -2.0214083194732666, -1.9584040641784668, -1.895399808883667, -1.8323954343795776, -1.7693910598754883, -1.7063868045806885, -1.6433825492858887, -1.5803781747817993, -1.51737380027771, -1.4543695449829102, -1.3913652896881104, -1.328360915184021, -1.2653565406799316, -1.2023522853851318, -1.139348030090332, -1.0763436555862427, -1.0133392810821533, -0.9503350257873535, -0.8873307108879089, -0.8243263959884644, -0.7613220810890198, -0.6983180046081543, -0.6353136897087097, -0.5723093748092651, -0.5093050599098206, -0.446300745010376, -0.3832964301109314, -0.3202921152114868, -0.25728780031204224, -0.19428348541259766, -0.13127917051315308, -0.0682748556137085, -0.005270540714263916, 0.057733774185180664, 0.12073808908462524, 0.18374240398406982, 0.2467467188835144, 0.309751033782959, 0.37275534868240356, 0.43575966358184814, 0.4987639784812927, 0.5617682933807373, 0.6247726082801819, 0.6877769231796265, 0.750781238079071, 0.8137855529785156, 0.8767898678779602, 0.9397941827774048, 1.0027985572814941, 1.065802812576294, 1.1288070678710938, 1.191811442375183, 1.2548158168792725, 1.3178200721740723]}, "gradients/encoder.encoder.layers.0.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 4.0, 3.0, 2.0, 3.0, 4.0, 1.0, 8.0, 5.0, 5.0, 4.0, 3.0, 10.0, 5.0, 15.0, 16.0, 7.0, 18.0, 17.0, 26.0, 21.0, 23.0, 28.0, 36.0, 159.0, 279.0, 67.0, 23.0, 24.0, 27.0, 21.0, 22.0, 13.0, 21.0, 9.0, 13.0, 10.0, 6.0, 8.0, 6.0, 10.0, 4.0, 7.0, 7.0, 9.0, 0.0, 3.0, 3.0, 1.0, 0.0, 0.0, 3.0, 0.0, 1.0], "bins": [-1.4992005825042725, -1.456620693206787, -1.4140408039093018, -1.3714609146118164, -1.3288811445236206, -1.2863012552261353, -1.24372136592865, -1.2011414766311646, -1.1585615873336792, -1.1159816980361938, -1.0734018087387085, -1.0308220386505127, -0.9882420897483826, -0.945662260055542, -0.9030823707580566, -0.8605024814605713, -0.8179226517677307, -0.7753427624702454, -0.7327629327774048, -0.6901830434799194, -0.6476031541824341, -0.6050232648849487, -0.5624434351921082, -0.5198635458946228, -0.47728368639945984, -0.4347038269042969, -0.3921239376068115, -0.34954407811164856, -0.3069642186164856, -0.26438432931900024, -0.22180446982383728, -0.17922458052635193, -0.13664472103118896, -0.09406484663486481, -0.051484979689121246, -0.008905112743377686, 0.03367476165294647, 0.07625463604927063, 0.1188344955444336, 0.16141438484191895, 0.2039942443370819, 0.24657411873340607, 0.2891539931297302, 0.3317338526248932, 0.37431371212005615, 0.4168936014175415, 0.45947346091270447, 0.5020533800125122, 0.5446332097053528, 0.5872130990028381, 0.6297929286956787, 0.6723728179931641, 0.7149527072906494, 0.7575325965881348, 0.8001124262809753, 0.8426923155784607, 0.8852721452713013, 0.9278520345687866, 0.9704318642616272, 1.0130116939544678, 1.0555915832519531, 1.0981714725494385, 1.1407513618469238, 1.1833312511444092, 1.2259111404418945]}, "gradients/encoder.encoder.pos_conv_embed.conv.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 2.0, 2.0, 3.0, 4.0, 5.0, 9.0, 15.0, 15.0, 29.0, 41.0, 79.0, 88.0, 369.0, 137.0, 69.0, 57.0, 35.0, 28.0, 11.0, 9.0, 4.0, 2.0, 1.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.39501953125, -0.38425445556640625, -0.3734893798828125, -0.36272430419921875, -0.351959228515625, -0.34119415283203125, -0.3304290771484375, -0.31966400146484375, -0.30889892578125, -0.29813385009765625, -0.2873687744140625, -0.27660369873046875, -0.265838623046875, -0.25507354736328125, -0.2443084716796875, -0.23354339599609375, -0.2227783203125, -0.21201324462890625, -0.2012481689453125, -0.19048309326171875, -0.179718017578125, -0.16895294189453125, -0.1581878662109375, -0.14742279052734375, -0.13665771484375, -0.12589263916015625, -0.1151275634765625, -0.10436248779296875, -0.093597412109375, -0.08283233642578125, -0.0720672607421875, -0.06130218505859375, -0.050537109375, -0.03977203369140625, -0.0290069580078125, -0.01824188232421875, -0.007476806640625, 0.00328826904296875, 0.0140533447265625, 0.02481842041015625, 0.03558349609375, 0.04634857177734375, 0.0571136474609375, 0.06787872314453125, 0.078643798828125, 0.08940887451171875, 0.1001739501953125, 0.11093902587890625, 0.1217041015625, 0.13246917724609375, 0.1432342529296875, 0.15399932861328125, 0.164764404296875, 0.17552947998046875, 0.1862945556640625, 0.19705963134765625, 0.20782470703125, 0.21858978271484375, 0.2293548583984375, 0.24011993408203125, 0.250885009765625, 0.26165008544921875, 0.2724151611328125, 0.28318023681640625, 0.2939453125]}, "gradients/encoder.encoder.pos_conv_embed.conv.weight_v": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 2.0, 8.0, 5.0, 4.0, 14.0, 12.0, 10.0, 17.0, 10.0, 27.0, 33.0, 51.0, 108.0, 147.0, 291.0, 712.0, 2533.0, 17028.0, 8349241.0, 14556.0, 2401.0, 695.0, 290.0, 162.0, 66.0, 56.0, 36.0, 21.0, 12.0, 10.0, 8.0, 6.0, 4.0, 2.0, 3.0, 4.0, 1.0, 2.0, 4.0, 5.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.5725394487380981, -1.525553822517395, -1.478568196296692, -1.4315824508666992, -1.384596824645996, -1.337611198425293, -1.2906255722045898, -1.2436399459838867, -1.1966543197631836, -1.1496686935424805, -1.1026830673217773, -1.0556974411010742, -1.0087116956710815, -0.9617260694503784, -0.9147404432296753, -0.8677548170089722, -0.8207690715789795, -0.7737834453582764, -0.7267977595329285, -0.6798121333122253, -0.6328264474868774, -0.5858408212661743, -0.5388551950454712, -0.4918695390224457, -0.44488388299942017, -0.39789822697639465, -0.35091257095336914, -0.303926944732666, -0.2569412887096405, -0.209955632686615, -0.16297000646591187, -0.11598435044288635, -0.06899881362915039, -0.022013165056705475, 0.02497248351573944, 0.07195812463760376, 0.11894378066062927, 0.16592943668365479, 0.2129150629043579, 0.2599007189273834, 0.30688637495040894, 0.35387203097343445, 0.40085768699645996, 0.4478433132171631, 0.4948289692401886, 0.5418146252632141, 0.5888002514839172, 0.6357859373092651, 0.6827715635299683, 0.7297571897506714, 0.7767428755760193, 0.8237285017967224, 0.8707141876220703, 0.9176998138427734, 0.9646854400634766, 1.0116710662841797, 1.0586566925048828, 1.105642318725586, 1.152627944946289, 1.1996135711669922, 1.2465993165969849, 1.293584942817688, 1.3405705690383911, 1.3875561952590942, 1.434541940689087]}, "gradients/encoder.encoder.pos_conv_embed.conv.weight_g": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0, 0.0, 1.0, 1.0, 1.0, 1.0, 2.0, 5.0, 1.0, 1.0, 1.0, 0.0, 3.0, 2.0, 2.0, 5.0, 3.0, 5.0, 5.0, 3.0, 5.0, 3.0, 3.0, 10.0, 5.0, 3.0, 5.0, 3.0, 2.0, 3.0, 4.0, 4.0, 2.0, 3.0, 3.0, 3.0, 4.0, 3.0, 3.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0], "bins": [-0.8564257025718689, -0.8252968788146973, -0.7941679954528809, -0.7630391716957092, -0.7319103479385376, -0.7007814645767212, -0.6696526408195496, -0.6385238170623779, -0.6073949337005615, -0.5762661099433899, -0.5451372265815735, -0.5140084028244019, -0.48287954926490784, -0.4517506957054138, -0.4206218719482422, -0.38949301838874817, -0.35836416482925415, -0.32723531126976013, -0.2961064577102661, -0.2649776339530945, -0.23384878039360046, -0.20271992683410645, -0.17159108817577362, -0.1404622495174408, -0.10933339595794678, -0.07820454984903336, -0.047075703740119934, -0.015946857631206512, 0.01518198847770691, 0.04631084203720093, 0.07743968069553375, 0.10856851935386658, 0.13969731330871582, 0.17082616686820984, 0.20195500552654266, 0.2330838441848755, 0.2642126977443695, 0.2953415513038635, 0.32647037506103516, 0.3575992286205292, 0.3887280821800232, 0.4198569357395172, 0.45098578929901123, 0.48211461305618286, 0.5132434368133545, 0.5443723201751709, 0.5755011439323425, 0.6066299676895142, 0.6377588510513306, 0.6688876748085022, 0.7000165581703186, 0.7311453819274902, 0.7622742652893066, 0.7934030890464783, 0.8245319128036499, 0.8556607961654663, 0.8867896199226379, 0.9179184436798096, 0.949047327041626, 0.9801761507987976, 1.0113049745559692, 1.0424338579177856, 1.073562741279602, 1.104691505432129, 1.1358203887939453]}, "gradients/encoder.feature_projection.projection.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 2.0, 2.0, 4.0, 8.0, 10.0, 18.0, 39.0, 63.0, 136.0, 306.0, 909.0, 3582.0, 24361.0, 325045.0, 154536.0, 12048.0, 2067.0, 635.0, 240.0, 116.0, 48.0, 44.0, 24.0, 15.0, 8.0, 7.0, 6.0, 1.0, 2.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-3.03515625, -2.950469970703125, -2.86578369140625, -2.781097412109375, -2.6964111328125, -2.611724853515625, -2.52703857421875, -2.442352294921875, -2.357666015625, -2.272979736328125, -2.18829345703125, -2.103607177734375, -2.0189208984375, -1.934234619140625, -1.84954833984375, -1.764862060546875, -1.68017578125, -1.595489501953125, -1.51080322265625, -1.426116943359375, -1.3414306640625, -1.256744384765625, -1.17205810546875, -1.087371826171875, -1.002685546875, -0.917999267578125, -0.83331298828125, -0.748626708984375, -0.6639404296875, -0.579254150390625, -0.49456787109375, -0.409881591796875, -0.3251953125, -0.240509033203125, -0.15582275390625, -0.071136474609375, 0.0135498046875, 0.098236083984375, 0.18292236328125, 0.267608642578125, 0.352294921875, 0.436981201171875, 0.52166748046875, 0.606353759765625, 0.6910400390625, 0.775726318359375, 0.86041259765625, 0.945098876953125, 1.02978515625, 1.114471435546875, 1.19915771484375, 1.283843994140625, 1.3685302734375, 1.453216552734375, 1.53790283203125, 1.622589111328125, 1.707275390625, 1.791961669921875, 1.87664794921875, 1.961334228515625, 2.0460205078125, 2.130706787109375, 2.21539306640625, 2.300079345703125, 2.384765625]}, "gradients/encoder.feature_projection.projection.bias": {"_type": "histogram", "values": [1.0, 1.0, 1.0, 2.0, 0.0, 0.0, 4.0, 1.0, 1.0, 4.0, 6.0, 6.0, 12.0, 5.0, 8.0, 8.0, 15.0, 24.0, 27.0, 27.0, 38.0, 42.0, 55.0, 59.0, 80.0, 64.0, 60.0, 66.0, 63.0, 63.0, 47.0, 53.0, 40.0, 32.0, 30.0, 18.0, 11.0, 11.0, 8.0, 8.0, 5.0, 2.0, 2.0, 2.0, 1.0, 2.0, 1.0, 3.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0], "bins": [-0.11474609375, -0.11037445068359375, -0.1060028076171875, -0.10163116455078125, -0.097259521484375, -0.09288787841796875, -0.0885162353515625, -0.08414459228515625, -0.07977294921875, -0.07540130615234375, -0.0710296630859375, -0.06665802001953125, -0.062286376953125, -0.05791473388671875, -0.0535430908203125, -0.04917144775390625, -0.0447998046875, -0.04042816162109375, -0.0360565185546875, -0.03168487548828125, -0.027313232421875, -0.02294158935546875, -0.0185699462890625, -0.01419830322265625, -0.00982666015625, -0.00545501708984375, -0.0010833740234375, 0.00328826904296875, 0.007659912109375, 0.01203155517578125, 0.0164031982421875, 0.02077484130859375, 0.025146484375, 0.02951812744140625, 0.0338897705078125, 0.03826141357421875, 0.042633056640625, 0.04700469970703125, 0.0513763427734375, 0.05574798583984375, 0.06011962890625, 0.06449127197265625, 0.0688629150390625, 0.07323455810546875, 0.077606201171875, 0.08197784423828125, 0.0863494873046875, 0.09072113037109375, 0.0950927734375, 0.09946441650390625, 0.1038360595703125, 0.10820770263671875, 0.112579345703125, 0.11695098876953125, 0.1213226318359375, 0.12569427490234375, 0.13006591796875, 0.13443756103515625, 0.1388092041015625, 0.14318084716796875, 0.147552490234375, 0.15192413330078125, 0.1562957763671875, 0.16066741943359375, 0.1650390625]}, "gradients/encoder.feature_projection.layer_norm.weight": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 0.0, 2.0, 0.0, 3.0, 2.0, 4.0, 1.0, 2.0, 6.0, 3.0, 4.0, 6.0, 15.0, 25.0, 67.0, 109.0, 104.0, 62.0, 25.0, 12.0, 23.0, 12.0, 5.0, 2.0, 3.0, 6.0, 1.0, 1.0, 1.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-1.0841953754425049, -1.0307884216308594, -0.9773814678192139, -0.9239745140075684, -0.8705675601959229, -0.8171606063842773, -0.7637537121772766, -0.7103467583656311, -0.6569398045539856, -0.6035328507423401, -0.5501258969306946, -0.49671897292137146, -0.44331201910972595, -0.38990506529808044, -0.3364981412887573, -0.2830911874771118, -0.2296842336654663, -0.1762772798538208, -0.12287034094333649, -0.06946340203285217, -0.016056448221206665, 0.03735050559043884, 0.09075742959976196, 0.14416438341140747, 0.19757133722305298, 0.2509782910346985, 0.304385244846344, 0.3577921688556671, 0.4111991226673126, 0.46460607647895813, 0.5180130004882812, 0.5714199542999268, 0.6248269081115723, 0.6782338619232178, 0.7316408157348633, 0.7850477695465088, 0.8384547233581543, 0.8918616771697998, 0.9452685713768005, 0.998675525188446, 1.0520825386047363, 1.1054894924163818, 1.1588964462280273, 1.2123034000396729, 1.2657103538513184, 1.3191173076629639, 1.3725242614746094, 1.4259312152862549, 1.4793380498886108, 1.5327450037002563, 1.5861519575119019, 1.6395589113235474, 1.6929658651351929, 1.7463728189468384, 1.7997796535491943, 1.8531866073608398, 1.9065935611724854, 1.9600005149841309, 2.0134074687957764, 2.066814422607422, 2.1202213764190674, 2.173628330230713, 2.2270352840423584, 2.280442237854004, 2.3338491916656494]}, "gradients/encoder.feature_projection.layer_norm.bias": {"_type": "histogram", "values": [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0, 1.0, 2.0, 1.0, 0.0, 3.0, 2.0, 4.0, 4.0, 4.0, 1.0, 6.0, 3.0, 7.0, 2.0, 2.0, 1.0, 3.0, 4.0, 19.0, 44.0, 85.0, 122.0, 68.0, 44.0, 15.0, 7.0, 8.0, 8.0, 4.0, 4.0, 1.0, 4.0, 2.0, 2.0, 1.0, 4.0, 6.0, 3.0, 1.0, 1.0, 2.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0], "bins": [-0.6400466561317444, -0.6164583563804626, -0.5928699970245361, -0.5692816972732544, -0.5456933379173279, -0.5221050381660461, -0.49851667881011963, -0.4749283790588379, -0.4513400197029114, -0.42775169014930725, -0.4041633605957031, -0.380575031042099, -0.3569867014884949, -0.33339837193489075, -0.3098100423812866, -0.2862217426300049, -0.26263341307640076, -0.23904508352279663, -0.2154567539691925, -0.19186842441558838, -0.16828009486198425, -0.14469176530838013, -0.1211034506559372, -0.09751512110233307, -0.07392679154872894, -0.05033846199512482, -0.02675013616681099, -0.003161810338497162, 0.020426519215106964, 0.04401484876871109, 0.06760317087173462, 0.09119150042533875, 0.11477982997894287, 0.138368159532547, 0.16195648908615112, 0.18554481863975525, 0.20913314819335938, 0.2327214777469635, 0.2563098073005676, 0.27989810705184937, 0.3034864664077759, 0.32707479596138, 0.35066312551498413, 0.37425145506858826, 0.3978397846221924, 0.4214281141757965, 0.44501644372940063, 0.4686047434806824, 0.4921930730342865, 0.5157814025878906, 0.5393697023391724, 0.5629580616950989, 0.5865463614463806, 0.6101347208023071, 0.6337230205535889, 0.6573113799095154, 0.6808996796607971, 0.7044879794120789, 0.7280763387680054, 0.7516646385192871, 0.7752529978752136, 0.7988412976264954, 0.8224296569824219, 0.8460179567337036, 0.8696063160896301]}, "train/loss": 4.7913, "train/learning_rate": 6.92e-06, "train/epoch": 0.34, "train/global_step": 350, "_runtime": 1648, "_timestamp": 1646089211, "_step": 349}', '{"train/loss": 4.0929, "train/learning_rate": 6.9400000000000005e-06, "train/epoch": 0.34, "train/global_step": 351, "_runtime": 1654, "_timestamp": 1646089217, "_step": 350}', '{"train/loss": 4.1596, "train/learning_rate": 6.96e-06, "train/epoch": 0.35, "train/global_step": 352, "_runtime": 1660, "_timestamp": 1646089223, "_step": 351}', '{"train/loss": 4.1149, "train/learning_rate": 6.98e-06, "train/epoch": 0.35, "train/global_step": 353, "_runtime": 1665, "_timestamp": 1646089228, "_step": 352}', '{"train/loss": 4.2482, "train/learning_rate": 7e-06, "train/epoch": 0.35, "train/global_step": 354, "_runtime": 1671, "_timestamp": 1646089234, "_step": 353}', '{"train/loss": 4.1062, "train/learning_rate": 7.0200000000000006e-06, "train/epoch": 0.35, "train/global_step": 355, "_runtime": 1677, "_timestamp": 1646089240, "_step": 354}']}, 'wandb-events.jsonl': {'offset': 54, 'content': ['{"system.gpu.0.gpu": 39.93, "system.gpu.0.memory": 27.8, "system.gpu.0.memoryAllocated": 99.37, "system.gpu.0.temp": 45.8, "system.gpu.process.0.gpu": 39.93, "system.gpu.process.0.memory": 27.8, "system.gpu.process.0.memoryAllocated": 99.37, "system.gpu.process.0.temp": 45.8, "system.gpu.0.powerWatts": 141.2, "system.gpu.0.powerPercent": 47.07, "system.gpu.process.0.powerWatts": 141.2, "system.gpu.process.0.powerPercent": 47.07, "system.gpu.1.gpu": 47.13, "system.gpu.1.memory": 10.0, "system.gpu.1.memoryAllocated": 98.04, "system.gpu.1.temp": 42.4, "system.gpu.1.powerWatts": 129.89, "system.gpu.1.powerPercent": 43.3, "system.cpu": 16.2, "system.memory": 19.3, "system.disk": 44.8, "system.proc.memory.availableMB": 48640.75, "system.proc.memory.rssMB": 148.92, "system.proc.memory.percent": 0.25, "system.proc.cpu.threads": 12.0, "system.network.sent": 132275359, "system.network.recv": 4968315, "_wandb": true, "_timestamp": 1646089241, "_runtime": 1678}']}, 'output.log': {'offset': 1013, 'content': ["2022-02-28T23:00:10.906692 {'loss': 4.6835, 'learning_rate': 6.9e-06, 'epoch': 0.34}\n", 'ERROR 2022-02-28T23:00:12.913324 [WARNING|modeling_utils.py:388] 2022-02-28 23:00:10,950 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 23:00:02,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed\n', "2022-02-28T23:00:12.909271 {'loss': 4.7913, 'learning_rate': 6.92e-06, 'epoch': 0.34}\n", 'ERROR 2022-02-28T23:00:12.913324 [WARNING|modeling_utils.py:388] 2022-02-28 23:00:10,950 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 23:00:02,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed\n', 'ERROR 2022-02-28T23:00:19.164827 34%|███████████████████████████▌ | 351/1019 [27:32<37:01, 3.33s/it]g-point operations will not be computed-28 23:00:02,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed\n', "2022-02-28T23:00:18.985987 {'loss': 4.0929, 'learning_rate': 6.9400000000000005e-06, 'epoch': 0.34}\n", 'ERROR 2022-02-28T23:00:19.164827 34%|███████████████████████████▌ | 351/1019 [27:32<37:01, 3.33s/it]g-point operations will not be computed-28 23:00:02,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed\n', 'ERROR 2022-02-28T23:00:23.401806 35%|███████████████████████████▋ | 352/1019 [27:38<45:30, 4.09s/it]g-point operations will not be computed-28 23:00:02,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed\n', 'ERROR 2022-02-28T23:00:23.401806 35%|███████████████████████████▋ | 352/1019 [27:38<45:30, 4.09s/it]g-point operations will not be computed-28 23:00:02,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed\n', "2022-02-28T23:00:25.034114 {'loss': 4.1596, 'learning_rate': 6.96e-06, 'epoch': 0.35}\n", 'ERROR 2022-02-28T23:00:29.511868 35%|███████████████████████████▋ | 353/1019 [27:44<51:02, 4.60s/it]g-point operations will not be computed-28 23:00:02,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed\n', "2022-02-28T23:00:29.175610 {'loss': 4.1149, 'learning_rate': 6.98e-06, 'epoch': 0.35}\n", 'ERROR 2022-02-28T23:00:29.511868 35%|███████████████████████████▋ | 353/1019 [27:44<51:02, 4.60s/it]g-point operations will not be computed-28 23:00:02,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed\n', 'ERROR 2022-02-28T23:00:35.709429 35%|███████████████████████████▊ | 354/1019 [27:50<54:53, 4.95s/it]g-point operations will not be computed-28 23:00:02,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed\n', 'ERROR 2022-02-28T23:00:35.709429 35%|███████████████████████████▊ | 354/1019 [27:50<54:53, 4.95s/it]g-point operations will not be computed-28 23:00:02,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed\n', "2022-02-28T23:00:37.212075 {'loss': 4.2482, 'learning_rate': 7e-06, 'epoch': 0.35}\n", 'ERROR 2022-02-28T23:00:37.880877 35%|███████████████████████████▊ | 354/1019 [27:50<54:53, 4.95s/it]g-point operations will not be computed-28 23:00:02,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed\n']}}, 'dropped': 0}} 2022-02-28 23:00:41,897 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:43,898 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:46,091 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:46,144 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:46,230 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:46,348 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:00:46,349 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:00:46,899 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:47,899 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:48,899 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:50,900 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:51,843 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:51,897 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:51,983 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:52,981 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:52,982 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:53,982 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:56,983 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:57,488 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:00:57,536 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:00:57,620 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:00:57,983 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:00:58,984 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:00:59,984 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:01,558 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:01:01,559 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:01:02,985 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:03,135 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:01:03,192 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:01:03,281 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:01:03,986 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:03,986 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:01:04,986 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:06,987 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:08,790 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:01:08,845 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:01:08,932 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:01:08,987 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:01:09,988 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:10,988 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:11,515 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:01:12,989 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:14,294 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:01:14,348 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:01:14,434 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:01:14,989 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:01:15,990 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:16,643 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:01:16,644 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:01:16,990 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:18,991 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:19,762 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:01:19,815 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:01:19,899 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:01:19,991 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:01:20,991 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:21,992 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:22,992 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:25,169 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:01:25,222 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:01:25,305 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:01:25,993 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:25,993 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:01:27,994 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:29,994 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:30,516 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:01:30,569 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:01:30,652 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:01:30,995 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:01:31,745 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:01:31,746 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:01:31,995 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:32,995 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:35,813 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:01:35,866 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:01:35,953 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:01:35,996 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:35,997 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:01:37,997 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:38,997 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:39,998 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:41,222 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:01:41,275 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:01:41,361 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:01:41,999 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:41,999 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:01:42,099 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:01:42,999 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:46,000 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:46,568 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:01:46,624 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:01:46,710 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:01:46,811 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:01:46,812 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:01:47,000 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:01:48,001 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:49,001 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:50,002 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:51,909 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:01:51,958 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:01:52,045 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:01:53,043 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:53,044 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:01:57,045 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:01:57,272 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:01:57,323 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:01:57,408 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:01:58,045 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:01:59,046 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:01,046 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:02,210 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:02:02,212 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:02:02,566 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:02:02,651 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:02:02,736 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:02:03,047 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:02:05,048 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:07,049 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:07,670 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:02:07,719 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:02:07,806 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:02:08,049 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:02:09,049 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:11,050 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:12,594 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:02:12,702 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:02:12,750 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:02:12,838 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:02:13,051 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:13,051 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:02:17,052 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:17,259 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:02:17,260 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:02:17,800 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:02:17,853 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:02:17,936 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:02:18,053 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:02:19,053 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:22,054 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:22,910 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:02:22,962 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:02:23,052 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:02:23,054 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:02:24,055 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:26,055 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:27,936 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:02:27,987 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:02:28,069 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:28,071 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:02:29,069 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:02:30,069 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:32,070 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:32,326 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:02:32,327 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:02:32,909 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:02:32,962 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:02:33,046 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:02:33,070 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:02:34,070 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:36,071 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:37,874 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:02:37,928 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:02:38,012 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:02:38,072 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:02:40,073 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:42,074 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:42,754 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:02:42,818 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:02:42,901 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:02:43,074 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:02:43,254 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:02:44,074 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:46,075 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:47,383 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:02:47,385 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:02:47,649 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:02:47,704 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:02:47,791 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:02:48,076 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:48,076 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:02:50,076 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:52,077 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:52,516 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:02:52,567 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:02:52,651 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:02:53,077 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:02:54,078 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:56,078 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:02:57,285 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:02:57,338 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:02:57,423 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:02:58,079 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:02:59,079 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:00,079 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:01,080 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:02,040 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:02,095 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:02,182 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:02,448 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:03:02,449 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:03:03,086 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:03,086 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:04,086 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:06,777 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:06,829 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:06,916 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:07,087 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:07,087 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:08,087 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:10,088 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:11,303 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:11,357 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:11,441 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:12,089 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:12,089 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:13,698 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:03:14,089 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:15,795 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:15,850 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:15,936 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:16,090 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:16,090 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:17,723 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:03:17,724 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:03:18,091 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:20,092 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:20,182 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:20,236 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:20,320 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:21,092 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:22,092 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:24,093 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:24,470 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:24,518 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:24,632 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:25,093 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:26,094 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:28,094 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:28,640 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:28,697 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:28,822 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:29,095 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:30,095 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:32,096 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:32,717 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:32,770 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:32,854 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:03:32,857 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:32,858 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:03:33,096 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:34,096 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:36,097 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:36,554 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:36,608 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:36,693 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:37,097 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:38,098 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:40,090 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:40,129 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:40,143 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:40,231 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:41,125 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:42,125 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:43,518 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:43,580 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:43,664 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:44,126 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:44,126 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:44,327 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:03:46,126 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:46,688 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:46,744 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:46,829 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:47,127 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:47,999 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:03:48,001 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:03:48,127 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:49,692 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:49,746 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:49,831 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:50,127 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:50,128 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:52,128 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:52,289 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:52,365 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:52,450 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:53,128 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:54,129 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:54,604 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:54,660 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:54,747 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:55,129 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:56,129 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:56,677 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:56,726 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:56,809 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:57,130 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:03:58,130 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:03:58,514 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:03:58,569 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:03:58,653 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:03:59,130 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:04:00,120 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:04:00,173 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:00,177 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:04:00,267 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:04:01,163 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:04:02,079 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:04:02,213 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:02,257 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:04:02,344 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:04:03,116 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:04:03,117 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:04:03,208 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:04:04,208 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:06,209 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:08,136 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:04:08,189 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:04:08,280 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:04:09,278 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:04:10,278 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:12,279 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:14,007 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:04:14,082 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:04:14,168 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:04:14,279 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:04:14,952 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:04:15,280 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:18,304 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:04:18,305 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:04:19,281 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:19,775 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:04:19,829 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:04:19,917 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:04:20,282 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:04:21,282 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:23,283 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:25,411 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:04:25,466 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:04:25,552 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:04:26,284 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:04:27,284 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:29,285 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:31,135 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:04:31,193 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:04:31,280 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:04:31,286 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:04:33,286 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:33,349 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:04:33,350 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:04:35,287 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:36,858 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:04:36,913 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:04:36,994 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:04:37,288 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:37,288 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:04:39,288 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:41,289 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:42,478 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:04:42,555 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:04:42,639 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:04:43,289 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:04:44,290 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:45,290 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:45,545 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:04:48,092 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:04:48,144 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:04:48,229 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:04:48,291 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:48,292 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:04:48,488 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:04:48,490 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:04:50,292 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:51,292 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:52,293 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:53,753 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:04:53,806 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:04:53,896 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:04:54,293 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:04:55,294 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:56,294 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:58,295 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:04:59,337 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:04:59,380 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:04:59,466 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:05:00,295 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:00,296 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:05:01,296 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:02,296 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:03,731 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:05:03,733 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:05:04,297 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:04,755 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:05:04,826 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:05:04,909 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:05:05,297 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:05:06,297 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:07,298 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:08,298 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:10,198 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:05:10,251 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:05:10,335 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:05:11,333 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:11,334 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:05:12,334 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:14,334 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:15,667 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:05:15,730 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:05:15,818 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:05:16,055 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:05:16,335 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:16,335 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:05:18,336 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:18,819 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:05:18,820 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:05:20,336 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:21,097 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:05:21,154 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:05:21,243 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:05:21,337 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:05:22,337 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:24,338 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:25,338 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:26,445 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:05:26,499 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:05:26,584 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:05:27,339 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:05:28,339 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:29,339 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:31,340 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:31,845 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:05:31,901 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:05:31,987 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:05:32,340 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:05:33,341 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:34,215 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:05:34,216 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:05:34,341 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:37,142 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:05:37,197 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:05:37,285 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:05:37,343 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:37,343 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:05:38,343 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:39,343 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:41,344 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:42,504 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:05:42,561 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:05:42,674 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:05:43,345 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:43,345 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:05:44,345 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:46,535 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:05:47,346 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:47,757 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:05:47,812 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:05:47,899 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:05:48,347 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:05:49,297 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:05:49,298 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:05:49,347 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:50,347 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:51,348 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:52,980 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:05:53,036 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:05:53,121 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:05:53,348 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:05:54,349 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:58,252 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:05:58,306 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:05:58,393 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:05:58,395 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:05:59,393 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:06:00,393 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:02,394 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:03,437 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:06:03,489 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:06:03,577 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:06:04,395 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:04,395 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:06:04,509 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:06:04,511 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:06:06,396 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:08,396 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:08,520 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:06:08,577 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:06:08,690 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:06:09,397 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:06:10,397 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:12,398 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:13,580 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:06:13,632 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:06:13,718 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:06:14,399 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:14,399 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:06:17,080 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:06:18,400 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:18,624 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:06:18,677 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:06:18,764 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:06:19,401 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:06:19,642 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:06:19,644 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:06:20,401 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:21,401 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:22,402 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:23,592 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:06:23,644 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:06:23,729 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:06:24,403 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:24,403 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:06:25,403 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:28,404 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:28,575 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:06:28,629 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:06:28,716 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:06:29,404 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:29,405 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:06:30,405 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:32,405 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:33,531 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:06:33,586 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:06:33,673 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:06:34,406 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:34,406 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:06:34,723 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:06:34,725 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:06:35,407 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:37,407 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:38,366 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:06:38,421 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:06:38,511 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:06:39,427 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:06:41,427 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:43,159 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:06:43,217 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:06:43,349 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:06:43,428 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:43,428 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:06:45,429 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:47,429 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:47,532 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:06:47,956 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:06:48,011 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:06:48,098 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:06:48,430 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:06:49,430 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:49,775 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:06:49,777 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:06:51,431 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:52,610 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:06:52,664 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:06:52,751 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:06:53,432 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:53,432 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:06:57,227 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:06:57,280 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:06:57,366 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:06:57,433 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:06:57,433 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:06:59,434 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:01,434 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:01,724 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:01,778 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:01,863 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:02,434 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:03,435 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:04,841 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:07:04,843 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:07:05,435 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:06,142 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:06,198 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:06,292 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:06,436 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:07,436 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:09,437 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:10,483 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:10,536 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:10,620 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:11,437 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:11,438 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:14,438 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:14,794 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:14,845 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:14,932 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:15,439 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:16,439 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:18,137 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:07:18,440 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:18,971 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:19,025 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:19,113 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:19,440 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:19,932 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:07:19,934 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:07:20,440 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:22,441 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:23,007 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:23,061 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:23,150 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:23,442 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:24,442 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:26,443 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:26,895 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:26,949 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:27,040 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:27,443 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:28,443 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:30,444 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:30,637 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:30,691 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:30,781 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:31,444 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:32,445 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:34,204 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:34,289 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:34,381 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:34,445 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:34,446 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:35,001 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:07:35,003 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:07:36,446 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:37,482 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:37,535 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:37,618 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:38,447 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:38,447 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:40,447 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:40,566 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:40,623 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:40,714 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:41,448 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:42,448 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:43,344 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:43,397 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:43,481 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:43,487 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:44,482 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:45,824 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:45,879 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:45,973 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:46,482 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:46,483 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:48,014 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:48,078 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:48,170 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:48,483 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:48,483 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:48,902 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:07:49,896 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:49,949 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:50,038 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:50,088 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:07:50,089 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:07:50,484 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:50,484 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:51,536 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:51,591 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:51,682 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:52,485 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:52,485 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:53,594 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:53,769 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:53,854 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:07:54,485 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:54,486 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:07:59,487 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:07:59,782 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:07:59,837 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:07:59,926 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:08:00,487 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:00,488 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:08:01,488 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:05,170 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:08:05,172 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:08:05,489 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:05,780 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:08:05,835 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:08:05,952 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:08:06,489 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:08:07,490 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:11,491 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:11,699 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:08:11,754 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:08:11,840 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:08:12,491 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:08:13,492 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:17,493 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:17,574 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:08:17,629 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:08:17,748 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:08:18,493 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:08:19,494 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:19,496 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:08:20,253 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:08:20,255 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:08:21,495 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:23,343 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:08:23,416 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:08:23,498 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:08:23,504 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:08:25,499 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:27,500 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:29,059 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:08:29,111 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:08:29,198 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:08:29,500 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:08:31,501 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:33,502 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:34,804 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:08:34,855 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:08:34,943 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:08:35,386 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:08:35,387 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:08:35,503 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:08:36,503 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:37,503 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:38,504 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:40,500 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:08:40,553 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:08:40,643 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:08:41,537 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:41,537 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:08:42,537 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:44,538 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:46,121 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:08:46,173 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:08:46,268 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:08:46,539 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:08:47,539 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:48,539 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:50,026 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:08:50,540 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:50,551 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:08:50,552 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:08:51,656 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:08:51,711 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:08:51,833 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:08:52,541 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:08:53,541 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:54,541 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:56,542 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:57,282 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:08:57,337 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:08:57,430 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:08:57,542 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:08:58,543 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:08:59,543 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:02,544 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:02,784 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:09:02,837 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:09:02,927 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:09:03,545 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:03,545 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:09:04,545 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:05,709 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:09:05,711 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:09:06,545 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:08,268 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:09:08,325 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:09:08,422 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:09:08,546 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:09:10,547 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:12,548 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:13,781 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:09:13,834 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:09:13,920 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:09:14,548 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:14,549 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:09:16,549 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:17,550 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:19,208 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:09:19,266 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:09:19,351 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:09:19,550 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:09:20,551 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:20,628 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:09:20,868 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:09:20,869 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:09:21,551 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:23,552 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:24,611 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:09:24,664 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:09:24,772 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:09:25,553 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:25,553 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:09:26,553 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:27,553 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:29,554 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:30,048 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:09:30,117 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:09:30,211 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:09:30,554 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:09:31,555 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:32,555 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:33,555 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:35,437 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:09:35,491 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:09:35,578 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:09:36,085 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:09:36,086 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:09:36,572 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:36,572 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:09:37,572 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:39,573 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:40,725 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:09:40,779 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:09:40,872 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:09:41,574 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:41,574 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:09:42,574 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:44,575 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:45,986 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:09:46,039 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:09:46,127 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:09:46,575 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:46,576 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:09:48,576 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:50,577 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:51,164 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:09:51,165 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:09:51,235 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:09:51,265 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:09:51,322 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:09:51,473 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:09:51,577 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:09:52,577 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:56,505 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:09:56,560 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:09:56,594 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:09:56,657 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:09:57,594 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:09:58,595 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:00,595 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:01,721 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:10:01,777 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:10:01,862 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:10:02,596 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:02,596 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:10:06,228 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:10:06,229 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:10:06,597 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:06,960 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:10:07,015 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:10:07,105 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:10:07,597 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:10:08,598 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:10,598 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:12,080 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:10:12,135 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:10:12,224 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:10:12,599 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:10:13,599 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:17,170 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:10:17,225 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:10:17,309 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:10:17,601 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:17,601 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:10:19,601 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:21,272 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:10:21,274 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:10:21,602 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:21,796 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:10:22,202 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:10:22,256 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:10:22,347 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:10:22,602 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:10:23,603 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:25,603 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:27,185 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:10:27,237 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:10:27,330 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:10:27,604 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:10:29,605 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:31,605 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:32,168 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:10:32,220 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:10:32,318 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:10:32,606 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:10:33,606 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:35,607 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:36,319 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:10:36,320 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:10:37,192 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:10:37,248 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:10:37,341 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:10:37,607 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:37,608 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:10:39,608 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:41,609 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:42,063 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:10:42,117 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:10:42,252 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:10:42,609 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:10:43,609 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:45,610 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:46,913 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:10:46,967 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:10:47,060 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:10:47,611 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:47,611 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:10:48,612 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:51,365 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:10:51,366 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:10:51,726 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:10:51,778 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:10:51,877 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:10:52,445 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:10:52,613 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:52,613 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:10:53,613 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:54,614 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:56,460 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:10:56,513 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:10:56,604 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:10:56,614 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:56,614 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:10:57,615 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:10:58,615 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:00,616 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:01,021 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:01,096 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:01,184 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:01,616 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:01,616 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:02,617 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:04,617 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:05,543 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:05,595 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:05,683 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:06,486 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:11:06,488 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:11:06,678 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:06,678 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:07,678 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:08,678 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:09,952 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:10,006 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:10,100 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:10,679 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:10,679 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:11,679 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:12,680 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:14,325 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:14,378 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:14,466 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:14,680 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:15,681 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:17,681 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:18,509 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:18,561 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:18,675 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:18,682 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:19,682 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:20,683 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:21,581 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:11:21,583 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:11:21,683 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:22,562 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:22,623 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:22,719 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:23,057 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:11:23,712 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:23,713 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:25,713 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:26,498 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:26,548 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:26,642 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:26,713 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:27,714 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:29,714 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:29,999 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:30,074 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:30,162 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:30,715 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:31,715 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:32,715 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:33,348 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:33,391 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:33,480 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:33,716 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:33,716 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:34,716 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:35,716 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:36,444 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:36,498 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:36,587 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:36,698 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:11:36,700 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:11:36,717 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:37,717 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:38,717 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:39,283 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:39,337 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:39,430 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:39,718 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:39,718 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:40,718 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:41,719 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:41,860 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:41,913 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:42,003 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:42,719 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:43,719 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:44,193 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:44,247 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:44,336 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:44,720 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:44,720 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:45,720 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:46,227 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:46,279 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:46,368 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:46,720 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:46,721 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:47,721 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:47,942 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:47,996 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:48,084 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:48,721 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:48,721 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:49,721 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:49,968 DEBUG SenderThread:234672 [sender.py:send():235] send: history 2022-02-28 23:11:50,140 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:11:50,226 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:11:50,722 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:50,722 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:11:51,722 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:51,864 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:11:51,865 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:11:52,723 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:53,685 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:11:53,723 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:57,724 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:11:59,725 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:12:03,726 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:12:06,921 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:12:06,922 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:12:07,727 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:12:11,729 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:12:13,729 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:12:17,731 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:12:21,732 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:12:21,978 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: stop_status 2022-02-28 23:12:21,978 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: stop_status 2022-02-28 23:12:24,049 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:12:25,733 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:12:29,734 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:12:33,736 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:12:35,548 DEBUG SenderThread:234672 [sender.py:send():235] send: telemetry 2022-02-28 23:12:35,549 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: poll_exit 2022-02-28 23:12:35,549 DEBUG SenderThread:234672 [sender.py:send():235] send: exit 2022-02-28 23:12:35,549 INFO SenderThread:234672 [sender.py:send_exit():371] handling exit code: 1 2022-02-28 23:12:35,549 INFO SenderThread:234672 [sender.py:send_exit():373] handling runtime: 2391 2022-02-28 23:12:35,606 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:12:35,607 INFO SenderThread:234672 [sender.py:send_exit():379] send defer 2022-02-28 23:12:35,607 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: poll_exit 2022-02-28 23:12:35,607 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: defer 2022-02-28 23:12:35,608 INFO HandlerThread:234672 [handler.py:handle_request_defer():154] handle defer: 0 2022-02-28 23:12:35,608 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: defer 2022-02-28 23:12:35,608 INFO SenderThread:234672 [sender.py:send_request_defer():388] handle sender defer: 0 2022-02-28 23:12:35,608 INFO SenderThread:234672 [sender.py:transition_state():392] send defer: 1 2022-02-28 23:12:35,608 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: defer 2022-02-28 23:12:35,608 INFO HandlerThread:234672 [handler.py:handle_request_defer():154] handle defer: 1 2022-02-28 23:12:35,725 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: defer 2022-02-28 23:12:35,726 INFO SenderThread:234672 [sender.py:send_request_defer():388] handle sender defer: 1 2022-02-28 23:12:35,726 INFO SenderThread:234672 [sender.py:transition_state():392] send defer: 2 2022-02-28 23:12:35,726 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: poll_exit 2022-02-28 23:12:35,726 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: defer 2022-02-28 23:12:35,726 INFO HandlerThread:234672 [handler.py:handle_request_defer():154] handle defer: 2 2022-02-28 23:12:35,727 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: poll_exit 2022-02-28 23:12:35,727 DEBUG SenderThread:234672 [sender.py:send():235] send: stats 2022-02-28 23:12:35,727 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: defer 2022-02-28 23:12:35,727 INFO SenderThread:234672 [sender.py:send_request_defer():388] handle sender defer: 2 2022-02-28 23:12:35,727 INFO SenderThread:234672 [sender.py:transition_state():392] send defer: 3 2022-02-28 23:12:35,728 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: defer 2022-02-28 23:12:35,728 INFO HandlerThread:234672 [handler.py:handle_request_defer():154] handle defer: 3 2022-02-28 23:12:35,778 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:12:35,778 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:12:35,789 DEBUG SenderThread:234672 [sender.py:send():235] send: summary 2022-02-28 23:12:35,878 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: poll_exit 2022-02-28 23:12:35,881 INFO SenderThread:234672 [sender.py:_save_file():944] saving file wandb-summary.json with policy end 2022-02-28 23:12:35,882 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: defer 2022-02-28 23:12:35,882 INFO SenderThread:234672 [sender.py:send_request_defer():388] handle sender defer: 3 2022-02-28 23:12:35,882 INFO SenderThread:234672 [sender.py:transition_state():392] send defer: 4 2022-02-28 23:12:35,882 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: poll_exit 2022-02-28 23:12:35,883 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: defer 2022-02-28 23:12:35,883 INFO HandlerThread:234672 [handler.py:handle_request_defer():154] handle defer: 4 2022-02-28 23:12:35,883 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: defer 2022-02-28 23:12:35,883 INFO SenderThread:234672 [sender.py:send_request_defer():388] handle sender defer: 4 2022-02-28 23:12:35,984 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: poll_exit 2022-02-28 23:12:36,754 INFO SenderThread:234672 [sender.py:transition_state():392] send defer: 5 2022-02-28 23:12:36,754 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: poll_exit 2022-02-28 23:12:36,755 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: defer 2022-02-28 23:12:36,755 INFO HandlerThread:234672 [handler.py:handle_request_defer():154] handle defer: 5 2022-02-28 23:12:36,755 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: defer 2022-02-28 23:12:36,755 INFO SenderThread:234672 [sender.py:send_request_defer():388] handle sender defer: 5 2022-02-28 23:12:36,755 INFO SenderThread:234672 [dir_watcher.py:finish():283] shutting down directory watcher 2022-02-28 23:12:36,768 INFO Thread-8 :234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/config.yaml 2022-02-28 23:12:36,768 INFO SenderThread:234672 [dir_watcher.py:_on_file_modified():230] file/dir modified: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:12:36,769 INFO SenderThread:234672 [dir_watcher.py:finish():313] scan: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files 2022-02-28 23:12:36,769 INFO SenderThread:234672 [dir_watcher.py:finish():327] scan save: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-metadata.json wandb-metadata.json 2022-02-28 23:12:36,769 INFO SenderThread:234672 [dir_watcher.py:finish():327] scan save: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log output.log 2022-02-28 23:12:36,769 INFO SenderThread:234672 [dir_watcher.py:finish():327] scan save: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json wandb-summary.json 2022-02-28 23:12:36,770 INFO SenderThread:234672 [dir_watcher.py:finish():327] scan save: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/requirements.txt requirements.txt 2022-02-28 23:12:36,772 INFO SenderThread:234672 [dir_watcher.py:finish():327] scan save: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/config.yaml config.yaml 2022-02-28 23:12:36,772 INFO SenderThread:234672 [sender.py:transition_state():392] send defer: 6 2022-02-28 23:12:36,775 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: defer 2022-02-28 23:12:36,775 INFO HandlerThread:234672 [handler.py:handle_request_defer():154] handle defer: 6 2022-02-28 23:12:36,778 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: defer 2022-02-28 23:12:36,781 INFO SenderThread:234672 [sender.py:send_request_defer():388] handle sender defer: 6 2022-02-28 23:12:36,781 INFO SenderThread:234672 [file_pusher.py:finish():177] shutting down file pusher 2022-02-28 23:12:36,860 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: poll_exit 2022-02-28 23:12:36,862 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: poll_exit 2022-02-28 23:12:36,963 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: poll_exit 2022-02-28 23:12:36,964 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: poll_exit 2022-02-28 23:12:37,065 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: poll_exit 2022-02-28 23:12:37,065 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: poll_exit 2022-02-28 23:12:37,073 INFO Thread-14 :234672 [upload_job.py:push():137] Uploaded file /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/requirements.txt 2022-02-28 23:12:37,085 INFO Thread-15 :234672 [upload_job.py:push():137] Uploaded file /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/config.yaml 2022-02-28 23:12:37,115 INFO Thread-12 :234672 [upload_job.py:push():137] Uploaded file /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/output.log 2022-02-28 23:12:37,165 INFO Thread-13 :234672 [upload_job.py:push():137] Uploaded file /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/files/wandb-summary.json 2022-02-28 23:12:37,167 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: poll_exit 2022-02-28 23:12:37,167 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: poll_exit 2022-02-28 23:12:37,269 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: poll_exit 2022-02-28 23:12:37,269 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: poll_exit 2022-02-28 23:12:37,366 INFO Thread-7 :234672 [sender.py:transition_state():392] send defer: 7 2022-02-28 23:12:37,367 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: defer 2022-02-28 23:12:37,367 INFO HandlerThread:234672 [handler.py:handle_request_defer():154] handle defer: 7 2022-02-28 23:12:37,367 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: defer 2022-02-28 23:12:37,367 INFO SenderThread:234672 [sender.py:send_request_defer():388] handle sender defer: 7 2022-02-28 23:12:37,374 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: poll_exit 2022-02-28 23:12:38,780 INFO SenderThread:234672 [sender.py:transition_state():392] send defer: 8 2022-02-28 23:12:38,781 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: poll_exit 2022-02-28 23:12:38,781 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: defer 2022-02-28 23:12:38,781 INFO HandlerThread:234672 [handler.py:handle_request_defer():154] handle defer: 8 2022-02-28 23:12:38,781 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: defer 2022-02-28 23:12:38,782 INFO SenderThread:234672 [sender.py:send_request_defer():388] handle sender defer: 8 2022-02-28 23:12:38,782 INFO SenderThread:234672 [sender.py:transition_state():392] send defer: 9 2022-02-28 23:12:38,782 DEBUG SenderThread:234672 [sender.py:send():235] send: final 2022-02-28 23:12:38,782 DEBUG SenderThread:234672 [sender.py:send():235] send: footer 2022-02-28 23:12:38,782 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: defer 2022-02-28 23:12:38,783 INFO HandlerThread:234672 [handler.py:handle_request_defer():154] handle defer: 9 2022-02-28 23:12:38,784 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: defer 2022-02-28 23:12:38,784 INFO SenderThread:234672 [sender.py:send_request_defer():388] handle sender defer: 9 2022-02-28 23:12:38,882 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: poll_exit 2022-02-28 23:12:38,883 DEBUG SenderThread:234672 [sender.py:send_request():249] send_request: poll_exit 2022-02-28 23:12:38,883 INFO SenderThread:234672 [file_pusher.py:join():182] waiting for file pusher 2022-02-28 23:12:38,940 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: get_summary 2022-02-28 23:12:39,037 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: sampled_history 2022-02-28 23:12:39,041 DEBUG HandlerThread:234672 [handler.py:handle_request():131] handle_request: shutdown 2022-02-28 23:12:39,041 INFO HandlerThread:234672 [handler.py:finish():739] shutting down handler 2022-02-28 23:12:39,784 INFO WriterThread:234672 [datastore.py:close():281] close: /home/sanchit_huggingface_co/wav2vec2-gpt2-wandb-grid-search/wandb/run-20220228_223243-2ay2wvge/run-2ay2wvge.wandb 2022-02-28 23:12:39,939 INFO SenderThread:234672 [sender.py:finish():1075] shutting down sender 2022-02-28 23:12:39,939 INFO SenderThread:234672 [file_pusher.py:finish():177] shutting down file pusher 2022-02-28 23:12:39,939 INFO SenderThread:234672 [file_pusher.py:join():182] waiting for file pusher 2022-02-28 23:12:39,946 INFO MainThread:234672 [internal.py:handle_exit():79] Internal process exited